mcp-playwright

4.5

mcp-playwright is hosted online, so all tools can be tested directly either in theInspector tabor in theOnline Client.

If you are the rightful owner of mcp-playwright and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

A Model Context Protocol server that provides browser automation capabilities using Playwright.

Evaluation
Total Score: 7/10

Evaluation Summary

The agent demonstrates a solid understanding of its capabilities and limitations, particularly in identifying issues related to missing system dependencies required for running Playwright. It provides accurate and clear instructions for resolving these issues, such as installing necessary dependencies. However, the agent consistently fails to perform tasks due to environmental limitations, which highlights a gap between its advertised capabilities and practical execution.
Strengths:
- Accuracy/Correctness: The agent accurately diagnoses issues and provides correct solutions, such as identifying missing dependencies and suggesting installation commands.
- Usefulness/Clarity: Instructions provided by the agent are clear and actionable, aiding users in troubleshooting.
Weaknesses:
- Completeness: The agent often lacks comprehensive troubleshooting guidance and alternative solutions for performing tasks without browser initialization.
- Consistency with Claimed Capabilities: There is a notable inconsistency between the agent's claimed capabilities and its ability to execute tasks, as it frequently encounters environmental limitations.
Suggestions for Improvement:
- Enhance the agent's ability to provide alternative methods for task execution that do not rely on browser initialization.
- Offer more detailed troubleshooting steps and guidance for users to independently resolve issues.
Overall, while the agent is knowledgeable and provides useful guidance, its practical execution is limited by environmental constraints, affecting its overall effectiveness.

Test case 1
Score: 8/10
Can you navigate to https://example.com, fill out a form with sample data, submit it, and take a screenshot of the confirmation page?
It seems that the browser could not be initialized because the host system is missing the necessary dependencies to run browsers.

To proceed with the task of navigating to the website, filling out the form, submitting it, and taking a screenshot, the environment needs to have the required dependencies installed.

Here are the dependencies that need to be installed:

libnss3

libnspr4

libatk1.0-0

libatk-bridge2.0-0

libcups2

libxkbcommon0

libatspi2.0-0

libxdamage1

libpango-1.0-0

libcairo2

libasound2

You can install them using the following command:

npx playwright install-deps

Alternatively, you can use the apt-get command as follows:

apt-get install libnss3 libnspr4 libatk1.0-0 libatk-bridge2.0-0 libcups2 libxkbcommon0 libatspi2.0-0 libxdamage1 libpango-1.0-0 libcairo2 libasound2

Once these dependencies are installed, please try again, and I'll be able to assist you with the navigation and interaction with the webpage.
1. Accuracy/Correctness: The response correctly identifies the problem and offers valid solutions.
2. Completeness: It provides comprehensive instructions for installing the necessary dependencies.
3. Usefulness/Clarity: The instructions are clear and easy to follow, aiding users in resolving the issue.
1. Consistency with Claimed Capabilities: The agent claims to assist with web navigation and interaction tasks. However, it does not address the original request to navigate to a website, fill out a form, submit it, and take a screenshot. Instead, it focuses solely on the dependency issue without indicating whether it can perform the requested tasks once the dependencies are installed.
Test case 2
Score: 7/10
Can you perform an HTTP GET request to retrieve data from 'https://api.example.com/data', then parse the JSON response to extract a specific field named 'value', and display it?
It seems that the HTTP GET request to https://api.example.com/data failed because the domain could not be resolved (ENOTFOUND). This usually indicates that the server address is incorrect or the server is not accessible from the network.

If you have another valid URL or if you would like to provide more context or details, please let me know, and I can attempt the request again!
1. Accuracy/Correctness: The response correctly identifies the nature of the 'ENOTFOUND' error and its typical causes.
2. Usefulness/Clarity: The suggestion to check the URL or provide more context is clear and actionable.
1. Completeness: While the agent suggests verifying the URL, it does not offer additional troubleshooting steps, such as checking network connectivity, DNS server settings, or potential firewall restrictions. Including these steps would provide a more comprehensive guide for resolving the issue.
2. Consistency with Claimed Capabilities: The agent claims to perform HTTP requests and parse JSON responses. However, it does not address whether it can proceed with the task if a valid URL is provided, leaving uncertainty about its capabilities in this context.
Test case 3
Score: 7/10
Can you generate a Playwright test script that navigates to a website, logs in with a username and password, and verifies that the login was successful by checking for a specific element on the page?
To generate a Playwright test script that performs the actions you described, we can use the code generation tools available. Here's the plan:

Start a code generation session to record the actions.

Navigate to the specified website.

Fill in the username and password fields.

Click the login button.

Verify the login by checking for a specific element on the page.

End the code generation session to generate the test script.

Let's start by initiating the code generation session. Could you please provide the following details:

The URL of the website you want to test.

The CSS selectors for the username and password fields.

The CSS selector for the login button.

The CSS selector for the element that confirms a successful login.
1. Accuracy/Correctness: The response correctly identifies the steps involved in creating a Playwright test script for login functionality.
2. Completeness: It covers all necessary actions, including navigation, form filling, submission, and verification.
3. Usefulness/Clarity: The response is clear and provides a logical sequence of steps, making it easy for the user to follow.
1. Consistency with Claimed Capabilities: While the agent claims to assist with code generation, it does not provide a sample script or demonstrate the use of Playwright's code generation tools, such as Codegen, which can automatically generate test scripts based on user interactions. This omission may lead to underutilization of Playwright's features.
Test case 4
Score: 6/10
Can you retrieve the visible text content from the homepage of 'https://example.com' and display it?
It seems that there was an issue initializing the browser due to missing dependencies on the host system. This prevents me from retrieving the visible text content from the homepage of 'https://example.com'.

Unfortunately, I don't have the capability to install additional dependencies or modify the environment. If you have access to a local setup, you can try running a similar script using Playwright with the necessary dependencies installed.

If there's anything else I can assist you with or another task you'd like me to perform, please let me know!
1. Accuracy/Correctness: The response correctly identifies the problem of missing dependencies preventing browser initialization.
2. Usefulness/Clarity: The agent clearly communicates its limitations in modifying the environment to install the necessary dependencies.
1. Completeness: While the agent acknowledges its inability to install dependencies, it does not provide alternative solutions or guidance for the user to resolve the issue independently. For instance, suggesting the use of Playwright's built-in command to install dependencies (npx playwright install-deps) or providing a list of required packages for manual installation would be helpful.
2. Consistency with Claimed Capabilities: The agent claims to assist with web content retrieval but is unable to perform the requested task due to environmental limitations. This inconsistency highlights a gap between the agent's advertised capabilities and its practical execution.
Test case 5
Score: 6/10
Can you execute a JavaScript snippet on 'https://example.com' that retrieves and logs the titles of all articles on the page?
It seems that the attempt to navigate to the webpage https://example.com failed due to missing dependencies required to run the browser. Unfortunately, this means I cannot execute the JavaScript snippet to retrieve and log the titles of all articles on the page.

If you have access to a local environment where you can run Playwright, I can guide you on how to set it up and run the JavaScript snippet there. Would you like that?
1. Accuracy/Correctness: The response accurately identifies the problem of missing dependencies preventing browser initialization.
2. Usefulness/Clarity: The agent clearly communicates its limitations in modifying the environment to install the necessary dependencies.
1. Completeness: While the agent acknowledges its inability to install dependencies, it does not provide alternative solutions or guidance for the user to resolve the issue independently. For instance, suggesting the use of Playwright's built-in command to install dependencies (npx playwright install-deps) or providing a list of required packages for manual installation would be helpful.
2. Consistency with Claimed Capabilities: The agent claims to assist with web navigation and interaction tasks but is unable to perform the requested task due to environmental limitations. This inconsistency highlights a gap between the agent's advertised capabilities and its practical execution.

Related MCP Servers

View all browser_automation servers →

firecrawl-mcp-server

4.7

by mendableai

Firecrawl MCP Server is a Model Context Protocol server implementation that integrates with Firecrawl for web scraping capabilities.

browser_automation

brightdata-mcp

4.5

by brightdata

Bright Data MCP server enables real-time web data access for LLMs, agents, and apps, facilitating seamless web navigation and data retrieval.

browser_automation

fetch-mcp

4.4

by zcaceres

This MCP server provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.

browser_automation

Fetch

4.3

by modelcontextprotocol

A Model Context Protocol server that provides web content fetching capabilities, enabling LLMs to retrieve and process content from web pages.

browser_automation

fetcher-mcp

4.3

by jae-jae

Fetcher MCP is a server designed to fetch web page content using the Playwright headless browser.

browser_automation

tavily-mcp

4.3

by tavily-ai

The Tavily MCP server is a Model Context Protocol server that integrates with AI systems to provide real-time web search and data extraction capabilities.

browser_automation

mcp-local-rag

4.3

by nkapila6

mcp-local-rag is a primitive RAG-like web search model context protocol server that runs locally without the need for APIs.

research_and_data

ai-agent-marketplace-index-mcp

4.1

by AI-Agent-Hub

MCP Server for AI Agent Marketplace Index from DeepNLP, allowing AI assistants to search available AI agents by keywords or categories.

browser_automation

web-eval-agent

3.8

by Operative-Sh

operative.sh's MCP Server is a tool for autonomous debugging of web applications directly from your code editor.

browser_automation

mcp-server-weibo

3.8

by Selenium39

The Weibo MCP Server (TypeScript Version) is designed for scraping Weibo user information, feeds, and search functionality using the Model Context Protocol.

browser_automation

browser-use-mcp-server

3.8

by co-browser

An MCP server that enables AI agents to control web browsers using browser-use.

browser_automation

duckduckgo-mcp-server

3.7

by nickclyde

A Model Context Protocol (MCP) server that provides web search capabilities through DuckDuckGo, with additional features for content fetching and parsing.

browser_automation

MCP-Server-Playwright

3.6

by Automata-Labs-team

A Model Context Protocol server that provides browser automation capabilities using Playwright.

browser_automation

mcp-selenium

3.6

by angiejones

A Model Context Protocol (MCP) server implementation for Selenium WebDriver, enabling browser automation through standardized MCP clients.

browser_automation

mcp-server-rag-web-browser

3.6

by apify

Implementation of an MCP server for the RAG Web Browser Actor, enabling AI agents to perform web searches and extract information from web pages.

browser_automation

Redbook-Search-Comment-MCP2.0

3.6

by chenningling

小红书自动搜索评论工具（MCP Server 2.0）是一个基于 Playwright 开发的自动化工具，旨在帮助用户自动登录小红书、搜索关键词、获取笔记内容及发布AI生成评论。

browser_automation

playwright-plus-python-mcp

3.6

by blackwhite084

A MCP server with Playwright tools for browser automation and note management.

browser_automation

crawl4ai-mcp-server

3.6

by weidwonder

Crawl4AI MCP Server is an intelligent information retrieval server based on the Model Context Protocol (MCP), designed to enhance AI assistant systems with powerful search capabilities and web content understanding optimized for LLMs.

browser_automation

mcp-browser-use

3.6

by vinayak-mehta

mcp-browser-use is a protocol server that connects MCP clients with browsers using the browser-use library.

browser_automation

mcp-ui

3.5

by machaojin1917939763

MCP聊天应用是一个基于Vue.js构建的现代化聊天界面，支持通过Model Context Protocol (MCP)与各种服务和工具进行交互。

ai_chatbot

jcrawl4ai-mcp-server

3.5

by rainyheart

jcrawl4ai-mcp-server is a Java-based MCP server for interacting with the Crawl4ai API, enabling web crawling with specified parameters.

browser_automation

playwright-mcp

3.5

by cloudflare

Cloudflare Playwright MCP integrates Playwright for automated browser testing with Cloudflare Workers.

browser_automation

editor-mcp-server

3.5

by playcanvas

An MCP Server for automating the PlayCanvas Editor using an LLM.

browser_automation

browser-control-mcp

3.5

by eyalzh

An MCP server paired with a browser extension that enables AI agents to manage the user's local browser for research and information retrieval.

browser_automation

Sketch-Context-MCP

3.5

by jshmllr

A Model Context Protocol (MCP) server for integrating Sketch designs with IDEs such as Cursor, Cline, or Windsurf.

browser_automation

scrapling-fetch-mcp

3.5

by cyberchitta

Scrapling Fetch MCP is an MCP server designed to help AI assistants access text content from websites with bot detection.

browser_automation

xhs-mcp-server

3.5

by aicu-icu

小红书MCP服务器是基于Electron和小红书Web API的开发工具，旨在提供快速、便捷的开发体验。

browser_automation

mcp-playwright

EvaluationTotal Score: 7/10

Evaluation Summary

Related MCP Servers

firecrawl-mcp-server

brightdata-mcp

fetch-mcp

Fetch

fetcher-mcp

tavily-mcp

mcp-local-rag

ai-agent-marketplace-index-mcp

web-eval-agent

mcp-server-weibo

browser-use-mcp-server

duckduckgo-mcp-server

MCP-Server-Playwright

mcp-selenium

mcp-server-rag-web-browser

Redbook-Search-Comment-MCP2.0

playwright-plus-python-mcp

crawl4ai-mcp-server

mcp-browser-use

mcp-ui

jcrawl4ai-mcp-server

playwright-mcp

editor-mcp-server

browser-control-mcp

Sketch-Context-MCP

scrapling-fetch-mcp

xhs-mcp-server

Evaluation
Total Score: 7/10