README - scenic_mcp_experimental by scenic-contrib

Scenic MCP - Input Control for Scenic Applications

A Model Context Protocol (MCP) server that enables external keyboard and mouse input injection into Scenic GUI applications.

Features

Keyboard Input: Send text and special keys to Scenic applications
Mouse Control: Move cursor and click at specific coordinates
Visual Feedback: Get descriptions of what's displayed on screen (v0.2.0)
MCP Integration: Works with any MCP-compatible client (Claude Desktop, etc.)
Real-time Communication: TCP-based connection for low-latency input
Scenic Compatible: Uses proper Scenic ViewPort input routing

Installation

Add to your Scenic application's mix.exs:

defp deps do
  [
    {:scenic_mcp, path: "../scenic_mcp"}
  ]
end

Configure your Scenic viewport with proper naming:

IMPORTANT: For scenic_mcp to work correctly, your Scenic application MUST name both the viewport and driver. Update your scenic_config() function:

def scenic_config() do
  [
    name: :main_viewport,  # Required for viewport lookup
    size: @default_resolution,
    default_scene: {YourApp.RootScene, []},
    drivers: [
      [
        name: :scenic_driver,  # Required for driver lookup
        module: Scenic.Driver.Local,
        window: [
          title: "Your App",
          resizeable: true
        ],
        on_close: :stop_system
      ]
    ]
  ]
end

Install Node.js dependencies and build:

cd scenic_mcp
npm install
npm run build

Configure for Claude Code:

# Add the MCP server to Claude Code
claude mcp add scenic-mcp /path/to/scenic_mcp/dist/index.js

# Verify it was added
claude mcp list

Note: Replace /path/to/scenic_mcp with the actual path to your scenic_mcp directory.

Usage

Using with Claude Code

Once configured, you can use the MCP tools directly within Claude Code conversations:

Start your Scenic application (with the ScenicMcp.Server running on port 9999)
Use MCP tools in Claude Code:
- connect_scenic - Test connection to your Scenic app
- get_scenic_status - Check connection status
- send_keys - Send keyboard input
- send_mouse_move - Move mouse cursor
- send_mouse_click - Click at coordinates
- inspect_viewport - Get visual description of current screen

Example conversation:

You: "Use the connect_scenic tool to test connection to my Flamelex app"
Claude: [Uses connect_scenic tool and shows connection status]

You: "Send the text 'hello world' using send_keys"
Claude: [Uses send_keys tool to type text into your app]

MCP Tools

The server provides these MCP tools:

`connect_scenic`

Test connection to the Scenic application.

`get_scenic_status`

Check server status and available commands.

`send_keys`

Send keyboard input to the Scenic application.

Parameters:

text (string): Text to type (each character sent as individual key press)
key (string): Special key name (enter, escape, tab, backspace, delete, up, down, left, right, home, end, page_up, page_down, f1-f12)
modifiers (array): Modifier keys (ctrl, shift, alt, cmd, meta)

`send_mouse_move`

Move mouse cursor to specific coordinates.

Parameters:

x (number): X coordinate
y (number): Y coordinate

`send_mouse_click`

Click mouse at specific coordinates.

Parameters:

x (number): X coordinate
y (number): Y coordinate
button (string): Mouse button (left, right, middle) - default: left

`get_scenic_graph` (NEW in v0.2.0)

Return the script table for a ViewPort, providing a visual description of the scene.

`take_screenshot`

Capture a screenshot of the current Scenic display.

Parameters:

filename (string, optional): Custom filename for the screenshot
format (string, optional): Output format - "path" (default) or "base64"

Examples

Send text:

{
  "action": "send_keys",
  "text": "hello world"
}

Send special key:

{
  "action": "send_keys", 
  "key": "enter"
}

Send key with modifiers:

{
  "action": "send_keys",
  "key": "c",
  "modifiers": ["ctrl"]
}

Move mouse:

{
  "action": "send_mouse_move",
  "x": 100,
  "y": 200
}

Click mouse:

{
  "action": "send_mouse_click",
  "x": 150,
  "y": 250,
  "button": "left"
}

Get visual feedback:

{
  "action": "get_scenic_graph"
}

Architecture

MCP Client (Claude Desktop)
    ↓
TypeScript MCP Server (scenic_mcp)
    ↓ (TCP port 9999)
Elixir GenServer Bridge (ScenicMcp.Server)
    ↓ (ScenicMcp.Probes)
Scenic.Driver.send_input/2
    ↓
Scenic Driver Process (:scenic_driver)
    ↓
Your Scenic Application

Key Components

ScenicMcp.Server: TCP server that receives commands from the TypeScript bridge
ScenicMcp.Probes: Direct interface to Scenic internals, sends input via Scenic.Driver.send_input/2
Process Names: Uses :main_viewport and :scenic_driver registered process names

Development

Start the Elixir server:

cd your_scenic_app
mix run --no-halt

Test the MCP server:

cd scenic_mcp
node src/index.ts

Testing

This project includes comprehensive testing to ensure reliability and guide improvements.

Test Suites

Elixir Unit Tests - Core server functionality

mix test
mix test --cover  # With coverage reporting

TypeScript Tests - MCP integration testing

npm test
npm run test:coverage  # With coverage
npm run test:watch     # Watch mode

LLM Tool Testing - Validates tool descriptions and usability
```
npm run test:llm-tools
```

Testing Innovation: LLM-Driven Tool Enhancement

One of the unique aspects of this project is how we use LLM testing to improve the tool descriptions:

Tool Usage Analysis: We run scenarios through LLMs to see which tools they discover and use
Description Enhancement: Based on usage patterns, we automatically enhance tool descriptions
Real-World Validation: The enhanced descriptions are tested against real development scenarios

This approach has led to significant improvements in tool discoverability and correct usage by AI assistants.

Running All Tests

npm run test:all  # Runs both Elixir and TypeScript tests

For detailed testing documentation, see .

Requirements

Elixir/OTP 24+
Node.js 18+
Scenic 0.11+

License

MIT License

scenic-contrib/scenic_mcp_experimental

Scenic MCP - Input Control for Scenic Applications

Features

Installation

Usage

Using with Claude Code

MCP Tools

connect_scenic

get_scenic_status

send_keys

send_mouse_move

send_mouse_click

get_scenic_graph (NEW in v0.2.0)

take_screenshot

Examples

Architecture

Key Components

Development

Testing

Test Suites

Testing Innovation: LLM-Driven Tool Enhancement

Running All Tests

Requirements

License

`connect_scenic`

`get_scenic_status`

`send_keys`

`send_mouse_move`

`send_mouse_click`

`get_scenic_graph` (NEW in v0.2.0)

`take_screenshot`