mcp-smart-crawler

mcp-smart-crawler

3.3

If you are the rightful owner of mcp-smart-crawler and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

MCP Smart Crawler is a Model Context Protocol server that uses Playwright to crawl web content, extract metadata, and download resources such as videos and images.

MCP Smart Crawler is a specialized server designed to interact with web content using the Model Context Protocol (MCP). It leverages Playwright, a powerful browser automation tool, to navigate and extract data from web pages. This server is particularly adept at handling content from Xiaohongshu (小红书), a popular Chinese social media platform. By extracting metadata such as titles, descriptions, and images, and downloading videos and images from shared links, MCP Smart Crawler provides a comprehensive solution for content retrieval and analysis. Its integration with MCP clients is straightforward, requiring simple configuration adjustments to enable seamless operation.

Features

  • Extract metadata (title, description, images) from Xiaohongshu posts.
  • Download videos and images from Xiaohongshu share links.
  • Uses Playwright for browser automation.