mcp-server-spider: A spider MCP server
Overview
A Model Context Protocol server for Spider crawler interaction and automation. This server provides tools to crawl and scrape web pages.
Please note that mcp-server-spider is currently in early development. It may contain bugs, and more features may be added in the future.
Tools
crawl
- Crawls the given url and returns the list of URLs that were found
- Input:
  - url: The URL to crawl
  - headers: Additional headers passed along with crawl requests
  - user_agent: User agent to use for the crawl requests
  - depth: The depth of link traversal
  - blacklist: A list of regular expressions to blacklist URLs from the crawling process
  - whitelist: A list of regular expressions to whitelist URLs from the crawling process
  - respect_robots_txt: Whether to respect the robots.txt file
  - accept_invalid_certs: Whether to accept invalid certificates
- Returns: List of URLs found
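As a rough illustration of how a client might invoke the crawl tool, the sketch below builds a JSON-RPC `tools/call` request of the kind the Model Context Protocol uses. The helper name `build_crawl_request` is hypothetical, and the value types (an integer depth, boolean flags, lists of pattern strings) and the idea that unset options can simply be omitted are assumptions, not something this README confirms.

```python
import json


def build_crawl_request(url, request_id=1, **options):
    """Build a JSON-RPC 2.0 `tools/call` request for the `crawl` tool.

    Hypothetical helper: the option names come from the input list above,
    but their value types are assumed.
    """
    arguments = {"url": url}
    # Leave out options that were not set (assumes they are optional).
    arguments.update({k: v for k, v in options.items() if v is not None})
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "crawl", "arguments": arguments},
    }


request = build_crawl_request(
    "https://example.com",
    depth=2,                        # assumed to be an integer
    blacklist=[r".*/private/.*"],   # assumed to be a list of regex strings
    respect_robots_txt=True,        # assumed to be a boolean
    user_agent=None,                # unset, so it is omitted
)
print(json.dumps(request, indent=2))
```

In practice an MCP client library would normally build and send this message for you; the sketch only shows the shape of the arguments.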
scrape
- Scrapes the given url and returns a list of JSON objects that contain the url, links and content of each page discovered
- Input: Same as crawl
- Returns: A list of JSON objects (as a string) that contain the url, links and content of each page discovered
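Because scrape returns its result as a string, a caller has to decode it before use. The following is a minimal sketch, assuming the string is a JSON array and that each object uses the keys `url`, `links` and `content` exactly as named in the description above; the helper `parse_scrape_result` and the sample data are made up for illustration.

```python
import json


def parse_scrape_result(raw):
    """Decode the string returned by `scrape` and index the pages by URL.

    Assumes `raw` is a JSON array of objects with `url`, `links`
    and `content` keys (key names are an assumption).
    """
    pages = json.loads(raw)
    return {page["url"]: page for page in pages}


# Made-up sample standing in for a real tool result.
sample = json.dumps([
    {
        "url": "https://example.com/",
        "links": ["https://example.com/about"],
        "content": "<html>...</html>",
    },
    {
        "url": "https://example.com/about",
        "links": [],
        "content": "<html>...</html>",
    },
])

pages = parse_scrape_result(sample)
for url, page in pages.items():
    print(url, len(page["links"]), "links")
```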
Installation
Using uv (recommended)
When using uv, no specific installation is needed. We will use uvx to directly run mcp-server-spider.
Using PIP
Alternatively you can install mcp-server-spider via pip:
pip install mcp-server-spider
After installation, you can run it as a script using:
python -m mcp_server_spider
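The two ways of running the server can be combined in a small launcher helper. This is only a sketch: `server_command` is a hypothetical function, and it assumes that `uvx mcp-server-spider` is the right uvx invocation, which follows from the uv section above but is not spelled out there.

```python
import shutil
import sys


def server_command(prefer_uvx=True, which=shutil.which):
    """Return an argv list for starting mcp-server-spider.

    Uses uvx when it is found on PATH, otherwise falls back to the
    pip-installed module run with the current Python interpreter.
    """
    if prefer_uvx and which("uvx"):
        return ["uvx", "mcp-server-spider"]
    return [sys.executable, "-m", "mcp_server_spider"]


print(server_command())
```

The returned list could be handed to `subprocess.Popen` or copied into an MCP client's server configuration.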