web-crawler-mcp-server

web-crawler-mcp-server

3.2

If you are the rightful owner of web-crawler-mcp-server and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

A Model Context Protocol (MCP) server that provides a web crawling and content extraction tool for AI assistants such as Claude Desktop, Cursor, and other MCP-compatible clients.

The Web Crawler MCP Server is designed to facilitate web content extraction for AI assistants by leveraging the Model Context Protocol. It utilizes Puppeteer with a stealth plugin to navigate and extract content from public web pages while bypassing anti-bot measures. The server processes the extracted data to produce clean, whitespace-normalized text, making it ideal for consumption by language models. This server is easily integrable with MCP-compatible clients like Claude Desktop, providing a seamless experience for users looking to enhance their AI's web interaction capabilities.

Features

  • Extracts and cleans main text content from any public web page.
  • Uses Puppeteer with stealth plugin to bypass anti-bot protections.
  • Returns readable, whitespace-normalized text for LLM consumption.
  • Easy integration with Claude Desktop and other MCP clients.

Tools

  1. web-crawler

    Extracts and returns the cleaned text content from a specified URL.