playwright-fetch
A Model Context Protocol server that provides web content fetching capabilities using Playwright for browser automation.
The Playwright Fetch MCP Server lets language models retrieve and process JavaScript-rendered content from web pages. It uses Playwright to render each page with full JavaScript support, extracts the main content, and converts it to markdown so the result is easier for a language model to consume. It supports pagination for large content, respects robots.txt directives, and can route requests through a proxy server. A pre-built Docker image is available for deployment and integration.
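To make the pagination idea concrete, here is a minimal sketch of how large markdown output could be returned in slices. It does not reflect the server's actual implementation: the `paginate` function, the `Page` type, and the `start_index` and `max_length` parameters are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Page:
    text: str                  # the slice of markdown returned to the caller
    next_index: Optional[int]  # where to resume, or None when nothing is left


def paginate(markdown: str, start_index: int = 0, max_length: int = 5000) -> Page:
    """Return at most max_length characters of markdown, starting at start_index.

    Hypothetical helper: the names and defaults here are assumptions,
    not the server's documented parameters.
    """
    if start_index < 0 or max_length <= 0:
        raise ValueError("start_index must be >= 0 and max_length must be > 0")
    end = start_index + max_length
    chunk = markdown[start_index:end]
    # Only advertise a continuation point if content remains past this slice.
    next_index = end if end < len(markdown) else None
    return Page(text=chunk, next_index=next_index)
```

A caller would request the first slice, then keep calling with the returned `next_index` until it is `None`, concatenating the `text` values to rebuild the full document.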
Features
- Browser Automation: Uses Playwright to render web pages with full JavaScript support.
- Content Extraction: Automatically identifies and extracts main content areas from web pages.
- Markdown Conversion: Converts HTML to clean, readable markdown.
- Pagination Support: Handles large content through pagination.
- Robots.txt Compliance: Respects robots.txt directives for autonomous fetching.
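As an illustration of the robots.txt check, the sketch below uses Python's standard `urllib.robotparser` to decide whether a URL may be fetched. How the server performs this check internally is not described here, so treat this as one possible approach; the `is_allowed` helper, the `ExampleBot` user agent, and the sample rules are all assumptions made for the example.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Check a URL against already-downloaded robots.txt text.

    Hypothetical helper: parsing the text directly keeps the example
    offline; a real fetcher would first download /robots.txt from the site.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Sample rules, invented for this example.
EXAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

blocked = is_allowed(EXAMPLE_ROBOTS, "ExampleBot", "https://example.com/private/page")
permitted = is_allowed(EXAMPLE_ROBOTS, "ExampleBot", "https://example.com/blog/post")
```

With these sample rules, `blocked` is `False` and `permitted` is `True`, so an autonomous fetch would skip the first URL and proceed with the second.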
Tools
playwright-fetch
Fetches a URL using Playwright browser automation and extracts its contents as markdown.
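For orientation, an MCP client invokes a tool by sending a JSON-RPC `tools/call` request that names the tool and passes its arguments. The sketch below builds such a request for `playwright-fetch`. The `url` argument name is an assumption, since the tool's parameters are not listed above, and `build_fetch_request` is a hypothetical helper.

```python
import json


def build_fetch_request(url: str, request_id: int = 1) -> str:
    """Serialize a JSON-RPC 2.0 'tools/call' request for the playwright-fetch tool."""
    request = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "playwright-fetch",
            # Assumed argument name; check the tool's input schema for the real one.
            "arguments": {"url": url},
        },
    }
    return json.dumps(request)


payload = build_fetch_request("https://example.com/article")
```

In practice an MCP client library handles this framing, so the sketch is mainly useful for seeing what travels over the wire when the tool is called.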