selenium-mcp-server-python

Jyothishkumarav/selenium-mcp-server-python

3.3

If you are the rightful owner of selenium-mcp-server-python and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

A Model Context Protocol (MCP) server implementation for Selenium WebDriver automation using Python, enabling programmatic control of web browsers through Claude's desktop application.

The Selenium MCP Server is a Python-based implementation designed to facilitate browser automation using Selenium WebDriver. It supports both Chrome and Firefox browsers, allowing users to perform a wide range of browser interactions such as navigation, element interaction, and screenshot capture. The server integrates seamlessly with the Claude desktop application, providing an easy-to-use interface for automating web tasks. With support for window and iframe handling, local storage management, and more, this server is a comprehensive solution for developers looking to automate browser tasks programmatically. The server is easy to set up and requires Python 3.7 or higher, along with the Claude desktop application.

Features

  • Browser automation with support for Chrome and Firefox
  • Comprehensive Selenium WebDriver operations
  • Seamless integration with Claude desktop application
  • Support for various browser interactions including navigation and element interaction
  • Advanced features like screenshot capture and local storage management

Usages

usage with claude desktop

{
  "mcpServers": {
    "selenium": {
      "command": "python",
      "args": ["server.py"]
    }
  }
}

Tools

  1. Browser Management

    Start and close browser sessions, switch between windows.

  2. Navigation

    Navigate to URLs, refresh pages, and wait for page loads.

  3. Element Interaction

    Find elements, click, send keys, clear fields, double click, and right click.

  4. Element Information

    Get element text, attributes, check presence, visibility, and selection.

  5. Page Operations

    Take screenshots, get page content, scroll pages, manage local storage.

  6. Frame Handling

    Switch to iframes and return to default content.