crawlbase-mcp

crawlbase/crawlbase-mcp

3.4

If you are the rightful owner of crawlbase-mcp and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

Crawlbase MCP is a Model Context Protocol server that enables AI agents to access real-time web data, enhancing their capabilities with fresh and structured content.

What is Crawlbase MCP?

Crawlbase MCP is a Model Context Protocol (MCP) server that bridges AI agents and the live web. Instead of relying on outdated training data, your LLMs can now fetch fresh, structured, real-time content β€” powered by Crawlbase’s proven crawling infrastructure trusted by 70,000+ developers worldwide.

It handles the complexity of scraping for you:

  • JavaScript rendering for modern web apps
  • Proxy rotation & anti-bot evasion
  • Structured outputs (HTML, Markdown, screenshots)

How It Works

  • Get Free Crawlbase Tokens β†’ Sign up at Crawlbase ↗️, get free Normal, and JavaScript tokens.
  • Set Up MCP Configuration β†’ Configure the MCP server in your preferred client (Claude, Cursor, or Windsurf) by updating the MCP Servers settings.
  • Start Crawling β†’ Use commands like crawl, crawl_markdown, or crawl_screenshot to bring live web data into your AI agent.

Setup & Integration

Claude Desktop

  1. Open Claude Desktop β†’ File β†’ Settings β†’ Developer β†’ Edit Config
  2. Add to claude_desktop_config.json:
  3. Replace your_token_here and your_js_token_here with the tokens from your dashboard.
{
  "mcpServers": {
    "crawlbase": {
      "type": "stdio",
      "command": "npx",
      "args": ["@crawlbase/mcp@latest"],
      "env": {
        "CRAWLBASE_TOKEN": "your_token_here",
        "CRAWLBASE_JS_TOKEN": "your_js_token_here"
      }
    }
  }
}

Claude Code

Add to your claude.json configuration:

{
  "mcpServers": {
    "crawlbase": {
      "type": "stdio",
      "command": "npx",
      "args": ["@crawlbase/mcp@latest"],
      "env": {
        "CRAWLBASE_TOKEN": "your_token_here",
        "CRAWLBASE_JS_TOKEN": "your_js_token_here"
      }
    }
  }
}

Cursor IDE

  1. Open Cursor IDE β†’ File β†’ Preferences β†’ Cursor Settings β†’ Tools and Integrations β†’ Add Custom MCP
  2. Add to mcp.json:
  3. Replace your_token_here and your_js_token_here with the tokens from your dashboard.
{
  "mcpServers": {
    "crawlbase": {
      "type": "stdio",
      "command": "npx",
      "args": ["@crawlbase/mcp@latest"],
      "env": {
        "CRAWLBASE_TOKEN": "your_token_here",
        "CRAWLBASE_JS_TOKEN": "your_js_token_here"
      }
    }
  }
}

Windsurf IDE

  1. Open WindSurf IDE β†’ File β†’ Preferences β†’ WindSurf Settings β†’ General β†’ MCP Servers β†’ Manage MCPs β†’ View raw config
  2. Add to mcp_config.json:
  3. Replace your_token_here and your_js_token_here with the tokens from your dashboard.
{
  "mcpServers": {
    "crawlbase": {
      "type": "stdio",
      "command": "npx",
      "args": ["@crawlbase/mcp@latest"],
      "env": {
        "CRAWLBASE_TOKEN": "your_token_here",
        "CRAWLBASE_JS_TOKEN": "your_js_token_here"
      }
    }
  }
}

πŸ”‘ Get your free tokens at Crawlbase ↗️.

Usage

Once configured, use these commands inside Claude, Cursor, or Windsurf:

  • crawl β†’ Fetch raw HTML
  • crawl_markdown β†’ Extract clean Markdown
  • crawl_screenshot β†’ Capture screenshots

Example prompts:

  • β€œCrawl Hacker News and return top stories in markdown.”
  • β€œTake a screenshot of TechCrunch homepage.”
  • β€œFetch Tesla investor relations page as HTML.”

Use Cases

  • Market research β†’ Pull live data from competitors, news, and reports
  • E-commerce monitoring β†’ Track products, reviews, and prices in real time
  • News & finance feeds β†’ Keep AI agents up-to-date with live events
  • Autonomous AI agents β†’ Give them vision to act on fresh web data

Resources & Next Steps

Looking to supercharge your AI agents with live web data? Get started here:


MSeeP.ai Security Assessment Badge

Copyright 2025 Crawlbase