mcp-server-fetch
Fetch MCP Server is a Model Context Protocol server that provides web content fetching capabilities using browser automation, OCR, and multiple extraction methods.
Fetch MCP Server lets large language models (LLMs) retrieve and process content from web pages, including pages that require JavaScript rendering or that use techniques to block simple scraping. It combines browser automation, OCR, and several extraction methods, then applies a scoring system to choose the best result among them. Debug logging is available to trace the scoring decisions.
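The scoring step can be pictured with a minimal sketch. The heuristic below (word count weighted by the share of alphanumeric and whitespace characters) and the names `score_text` and `select_best` are assumptions made for illustration; they are not the server's actual scoring rules or API.

```python
import logging
import re
from typing import Dict, Tuple

logger = logging.getLogger("fetch_scoring_sketch")  # hypothetical logger name


def score_text(text: str) -> float:
    """Hypothetical heuristic: more words is better, noisy characters are worse."""
    words = re.findall(r"[A-Za-z0-9']+", text)
    if not words:
        return 0.0
    # Share of characters that are letters, digits, or whitespace.
    clean = sum(ch.isalnum() or ch.isspace() for ch in text)
    return len(words) * (clean / len(text))


def select_best(candidates: Dict[str, str]) -> Tuple[str, str]:
    """Score each method's output, log every decision, return the winner."""
    if not candidates:
        raise ValueError("no candidates to score")
    scores = {}
    for method, text in candidates.items():
        scores[method] = score_text(text)
        logger.debug("method=%s score=%.2f chars=%d", method, scores[method], len(text))
    best = max(scores, key=scores.get)
    logger.debug("selected method=%s", best)
    return best, candidates[best]


if __name__ == "__main__":
    logging.basicConfig(level=logging.DEBUG)  # show the scoring decisions
    method, text = select_best({
        "html": "",
        "ocr": "#### ~~ ||",
        "browser": "Hello world, this is the rendered article body.",
    })
    print(method)
```

With debug logging enabled, each candidate's score is printed before the selection, which mirrors the kind of trace the debug log is described as providing.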
Features
- Browser automation with undetected-chromedriver for dynamic content rendering.
- OCR using pytesseract with layout detection for text extraction from images.
- HTML extraction using requests and BeautifulSoup for static content.
- Document parsing capabilities for formats like PDF, DOCX, and PPTX.
- Scoring system that selects the highest-quality content among the results of the extraction methods.
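One way to picture how several extraction methods might be combined is to run each independently and keep every non-empty result, so that a failure in one method (for example, a browser driver that is not installed) does not block the others. The sketch below uses stand-in extractor functions; `run_extractors` and the `fake_*` helpers are hypothetical and do not reflect the server's real interfaces.

```python
from typing import Callable, Dict, Tuple


def run_extractors(
    url: str, extractors: Dict[str, Callable[[str], str]]
) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Run every extractor; return (non-empty results, error messages)."""
    results: Dict[str, str] = {}
    errors: Dict[str, str] = {}
    for name, extract in extractors.items():
        try:
            text = extract(url)
        except Exception as exc:  # one failing method must not stop the rest
            errors[name] = str(exc)
            continue
        if text and text.strip():
            results[name] = text.strip()
    return results, errors


# Stand-ins for real extractors (static HTML, browser automation, OCR).
def fake_html(url: str) -> str:
    return "Static page text"


def fake_browser(url: str) -> str:
    raise RuntimeError("driver not available")


def fake_ocr(url: str) -> str:
    return "   "  # whitespace only, so it is discarded


if __name__ == "__main__":
    results, errors = run_extractors(
        "https://example.com",
        {"html": fake_html, "browser": fake_browser, "ocr": fake_ocr},
    )
    print(results)
    print(errors)
```

The surviving candidates are what a scoring step would then compare.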