scanzy/mcp-server-win-ui
The Windows UI MCP server is a tool designed to integrate Model Context Protocol with Windows UI applications, enabling AI models to interact with these applications through guided user interaction.
Windows UI MCP server
A Windows integration for Model Context Protocol that enables AI models to discover, map and interact with Windows UI applications through guided user interaction.
Workflow
- Initial Discovery
  - AI asks user about target application context, purpose and workflow
  - User provides context about main UI components and interactions
  - AI guides user through application exploration
- UI Exploration and Mapping
  - AI and user collaborate to identify key windows and controls
  - System captures reliable identifiers (handle, text, class, position)
  - Validation performed at each identification step
  - Validated control information saved to YAML files with multiple identifiers for reliability
- UI Automation
  - Load stored YAML context files
  - Use validated identifiers to locate UI elements
  - Perform automated interactions with the application
  - Handle validation and error recovery
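Storing several identifiers per control lets automation still find a control when one identifier changes, for example a window handle that differs on every launch. The following is a minimal sketch of that fallback lookup, not the project's implementation: the `ControlId` record, the scoring rule, and the `min_score` and `tolerance` parameters are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class ControlId:
    """Identifiers kept for one control (hypothetical record layout)."""
    handle: Optional[int]                 # window handle; usually changes between sessions
    text: Optional[str]                   # caption or label
    class_name: Optional[str]             # window class, e.g. "Button"
    position: Optional[Tuple[int, int]]   # (x, y) relative to the parent window


def match_score(stored: ControlId, live: ControlId, tolerance: int = 5) -> int:
    """Count how many stored identifiers agree with a live control."""
    score = 0
    if stored.handle is not None and stored.handle == live.handle:
        score += 1
    if stored.text is not None and stored.text == live.text:
        score += 1
    if stored.class_name is not None and stored.class_name == live.class_name:
        score += 1
    if stored.position is not None and live.position is not None:
        dx = abs(stored.position[0] - live.position[0])
        dy = abs(stored.position[1] - live.position[1])
        if dx <= tolerance and dy <= tolerance:
            score += 1
    return score


def locate(stored: ControlId, candidates: Sequence[ControlId],
           min_score: int = 2) -> Optional[ControlId]:
    """Return the best-matching live control, or None if the match is weak or ambiguous."""
    scored = sorted(((match_score(stored, c), c) for c in candidates),
                    key=lambda pair: pair[0], reverse=True)
    if not scored or scored[0][0] < min_score:
        return None  # nothing matches well enough: ask the user to re-validate
    if len(scored) > 1 and scored[1][0] == scored[0][0]:
        return None  # two controls tie: refuse to guess
    return scored[0][1]


# The stored handle is stale, but text, class and position still agree.
stored = ControlId(handle=0x1A2B, text="Save", class_name="Button", position=(100, 40))
live = [
    ControlId(handle=0x9F00, text="Save", class_name="Button", position=(102, 41)),
    ControlId(handle=0x9F04, text="Cancel", class_name="Button", position=(180, 41)),
]
found = locate(stored, live)
```

Returning `None` on a weak or tied match, rather than picking the first candidate, leaves room for the "validation and error recovery" step to fall back to the user.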
Information to collect
- Application name, context, purpose and general workflow
- Main concepts and terms
- Window types:
  - normal: for main operations (e.g. write, select, etc.)
  - dialogs: for simple actions (e.g. import, export, save, open, settings, etc.)
  - popup: for confirmation, error, etc. (e.g. "Do you want to save?")
- Control zones:
  - fixed: controls that are always present (e.g. menu, toolbar, status bar)
  - dynamic: controls that are present only in certain contexts (e.g. search, filter, etc.)
- Flows: steps to follow to complete a task (e.g. login at startup, wizards, etc.)
- Actions and commands: buttons, menus, and controls to click
- Recommendations and gotchas
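One way to picture the items above is as a small data model that could later be serialized to the YAML context files. This is only a sketch under assumptions: every class and field name here (`AppContext`, `WindowInfo`, `ControlZone`, `Flow`, and so on) is hypothetical and chosen to mirror the list, and the example application is invented.

```python
from dataclasses import asdict, dataclass, field
from enum import Enum
from typing import Dict, List


class WindowType(str, Enum):
    NORMAL = "normal"   # main operations
    DIALOG = "dialog"   # simple actions such as import, export, save
    POPUP = "popup"     # confirmations and errors


class ZoneKind(str, Enum):
    FIXED = "fixed"       # always present (menu, toolbar, status bar)
    DYNAMIC = "dynamic"   # present only in certain contexts


@dataclass
class ControlZone:
    name: str
    kind: ZoneKind
    controls: List[str] = field(default_factory=list)


@dataclass
class WindowInfo:
    title: str
    type: WindowType
    zones: List[ControlZone] = field(default_factory=list)


@dataclass
class Flow:
    name: str
    steps: List[str] = field(default_factory=list)


@dataclass
class AppContext:
    name: str
    purpose: str
    terms: Dict[str, str] = field(default_factory=dict)
    windows: List[WindowInfo] = field(default_factory=list)
    flows: List[Flow] = field(default_factory=list)
    gotchas: List[str] = field(default_factory=list)


# Invented example application, for illustration only.
ctx = AppContext(
    name="Example Editor",
    purpose="Edit and export text documents",
    terms={"document": "the file currently open in the main window"},
    windows=[
        WindowInfo(
            title="Example Editor",
            type=WindowType.NORMAL,
            zones=[ControlZone("menu bar", ZoneKind.FIXED, ["File", "Edit"])],
        ),
        WindowInfo(title="Save As", type=WindowType.DIALOG),
    ],
    flows=[Flow("export", ["open File menu", "click Export", "confirm dialog"])],
    gotchas=["The Save As dialog opens behind the main window on some setups"],
)

# asdict() yields plain dicts and lists, a convenient shape for a YAML dump.
as_plain = asdict(ctx)
```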
Installation
[JSON MCP config file here]
Project Structure
Generated by AI (maybe overkill)
win32-mcp/
├── src/
│   ├── core/                    # Core functionality
│   │   ├── window_manager.py    # Windows enumeration, filtering, and state monitoring
│   │   ├── control_manager.py   # UI control discovery and interaction
│   │   ├── context_store.py     # YAML context serialization and validation
│   │   └── validator.py         # Validation rules and utilities
│   ├── prompts/                 # AI conversation prompts in YAML format
│   │   ├── discovery.yaml       # App purpose and workflow discovery
│   │   ├── mapping.yaml         # UI element identification and mapping
│   │   └── validation.yaml      # Control validation and verification
│   ├── utils/                   # Helper utilities
│   │   ├── win32_utils.py       # Win32 API wrapper functions
│   │   ├── yaml_utils.py        # YAML processing helpers
│   │   └── logging.py           # Logging configuration
│   └── mcp/                     # MCP protocol implementation
│       ├── server.py            # Async server with JSON-RPC
│       ├── handlers.py          # Request/response and event handlers
│       └── tools.py             # Tool definitions for UI operations
├── mcp_config.json              # Server configuration
└── README.md                    # Project documentation
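To illustrate how the `mcp/` pieces in the tree might fit together, here is a rough sketch of a tool definition and a JSON-RPC dispatcher for the MCP `tools/list` and `tools/call` methods. It is an assumption-laden illustration, not the project's code: the `list_windows` tool, its `title_contains` argument, and the stubbed window list are hypothetical, and a real handler would query the Win32 API instead.

```python
import json

# Hypothetical tool definition in the shape MCP uses for tools/list results.
TOOLS = [
    {
        "name": "list_windows",
        "description": "List top-level windows, optionally filtered by title.",
        "inputSchema": {
            "type": "object",
            "properties": {"title_contains": {"type": "string"}},
        },
    },
]


def list_windows(title_contains: str = "") -> list:
    """Stub handler: a real version would enumerate windows via the Win32 API."""
    windows = ["Untitled - Notepad", "Calculator"]  # placeholder data
    return [w for w in windows if title_contains.lower() in w.lower()]


HANDLERS = {"list_windows": list_windows}


def error(req_id, code: int, message: str) -> dict:
    """Build a JSON-RPC 2.0 error response."""
    return {"jsonrpc": "2.0", "id": req_id,
            "error": {"code": code, "message": message}}


def handle_request(request: dict) -> dict:
    """Dispatch one JSON-RPC request to the matching MCP method."""
    req_id = request.get("id")
    method = request.get("method")
    if method == "tools/list":
        return {"jsonrpc": "2.0", "id": req_id, "result": {"tools": TOOLS}}
    if method == "tools/call":
        params = request.get("params", {})
        handler = HANDLERS.get(params.get("name"))
        if handler is None:
            return error(req_id, -32602, "Unknown tool")
        result = handler(**params.get("arguments", {}))
        return {
            "jsonrpc": "2.0",
            "id": req_id,
            "result": {
                "content": [{"type": "text", "text": json.dumps(result)}],
                "isError": False,
            },
        }
    return error(req_id, -32601, "Method not found")


response = handle_request({
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {"name": "list_windows", "arguments": {"title_contains": "note"}},
})
```

Keeping the tool table (`TOOLS`) separate from the handler map (`HANDLERS`) loosely mirrors the `tools.py` / `handlers.py` split shown in the tree.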
Development Status
🚧 Under Development
TODO:
- [o] Scouting with Win32 API and FlaUI
- Think about tools to develop
- Think about storage of context
- Think about project structure
- Prompts for discovery, mapping, validation