Windows-MCP
If you are the rightful owner of Windows-MCP and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
Windows MCP is a lightweight, open-source project that enables seamless integration between AI agents and the Windows operating system.
Windows MCP acts as a Model Context Protocol (MCP) server, bridging the gap between Language Learning Models (LLMs) and the Windows operating system. It allows AI agents to perform tasks such as file navigation, application control, UI interaction, and QA testing. The project is designed to be lightweight and open-source, with minimal dependencies and easy setup. It supports a wide range of Windows operating systems, from Windows 7 to Windows 11, and is compatible with any LLM, reducing complexity and setup time. Windows MCP provides a rich toolset for UI automation, including tools for basic keyboard and mouse operations, capturing window/UI state, and more. It is customizable and extendable, allowing users to adapt or extend tools to suit their unique automation or AI integration needs. The project is licensed under the MIT License, encouraging contributions and collaboration from the community.
Features
- Seamless Windows Integration: Interacts natively with Windows UI elements, opens apps, controls windows, simulates user input, and more.
- Use Any LLM (Vision Optional): Works with any LLMs, reducing complexity and setup time.
- Rich Toolset for UI Automation: Includes tools for basic keyboard, mouse operation and capturing window/UI state.
- Lightweight & Open-Source: Minimal dependencies and easy setup with full source code available under MIT license.
- Customizable & Extendable: Easily adapt or extend tools to suit your unique automation or AI integration needs.
Tools
Click-Tool
Click on the screen at the given coordinates.
Type-Tool
Type text on an element (optionally clears existing text).
Clipboard-Tool
Copy or paste using the system clipboard.
Scroll-Tool
Scroll vertically or horizontally on the window or specific regions.
Drag-Tool
Drag from one point to another.
Move-Tool
Move mouse pointer.
Shortcut-Tool
Press keyboard shortcuts (Ctrl+c, Alt+Tab, etc).
Key-Tool
Press a single key.
Wait-Tool
Pause for a defined duration.
State-Tool
Combined snapshot of active apps and interactive, textual and scrollable elements along with screenshot of the desktop.
Screenshot-Tool
Capture a screenshot of the desktop.
Launch-Tool
To launch an application from the start menu.
Shell-Tool
To execute PowerShell commands.
Scrape-Tool
To scrape the entire webpage for information.