screen2claude by zetaphoenix888-byte - MCP Server

Headless Screen2Claude MCP

Version: 1.0.0 Platform: Windows 10 / 11 Runtime: Node.js 18+

A headless MCP (Model Context Protocol) server that captures the active window screenshot and sends it to a configured HTTP endpoint for AI-powered UI analysis.

🎯 Features

Headless Operation: No GUI, runs as a background process
MCP Integration: Works with Claude Code and other MCP clients
Flexible Prompts: Support for custom prompts and presets
HTTP Integration: Send screenshots to your own analysis endpoint
Non-Intrusive: Captures without stealing focus from active window

📋 Requirements

OS: Windows 10 or Windows 11
Node.js: Version 18.0.0 or higher
HTTP Endpoint: A server that can receive and analyze screenshots (see HTTP Endpoint Requirements)

🚀 Quick Start

1. Installation

# Clone or download this repository
cd screen

# Install dependencies
npm install

# Build the project
npm run build

2. Configuration

Copy the example configuration file and edit it:

cp config.example.json config.json

Edit config.json with your settings:

{
  "endpoint_url": "http://localhost:12345/claude/ui-review",
  "auth_token": "",
  "default_prompt": "Please review this UI and point out 3 improvements.",
  "presets": {
    "ui_review": "Please review this UI and point out 3 improvements.",
    "element_list": "List all visible UI elements with their role and label."
  }
}

3. Register with Claude Code

Add to your .claude.json:

{
  "mcpServers": {
    "screen2claude": {
      "command": "node",
      "args": ["C:/Users/YOUR_USERNAME/screen/dist/server.js"],
      "env": {
        "S2C_CONFIG": "C:/Users/YOUR_USERNAME/screen/config.json"
      }
    }
  }
}

4. Usage

From Claude Code, you can now use the capture_ui_screen tool:

Can you capture the current screen and analyze the UI?

Or with custom prompt:

Capture the screen with prompt: "List all accessibility issues"

Or with preset:

Capture the screen using preset: element_list

⚙️ Configuration

Configuration File Location

The server looks for configuration in this order:

Path specified in S2C_CONFIG environment variable
./config.json in the current directory

Configuration Fields

Field	Required	Description
`endpoint_url`	✅ Yes	HTTP endpoint to receive screenshot + prompt
`auth_token`	❌ No	Bearer token for authentication (if empty, no auth header sent)
`default_prompt`	❌ No	Default prompt when none specified
`presets`	❌ No	Named prompts for quick access

Example Presets

{
  "presets": {
    "ui_review": "Please review this UI and point out 3 improvements.",
    "element_list": "List all visible UI elements with their role and label.",
    "accessibility": "Analyze this UI for accessibility issues and suggest improvements.",
    "color_contrast": "Check the color contrast ratios and identify any WCAG violations."
  }
}

🌐 HTTP Endpoint Requirements

Your HTTP endpoint should:

Accept POST Requests

Endpoint: As configured in endpoint_url Method: POST Content-Type: application/json

Request Body:

{
  "prompt": "Please review this UI...",
  "image_base64": "iVBORw0KGgoAAAANSUhEUgAA...",
  "meta": {
    "captured_at": "2025-11-09T12:34:56.789Z",
    "source": "headless-mcp"
  }
}

Return Analysis Results

Recommended Response (JSON):

{
  "result": "This UI has the following improvements..."
}

Alternative Response (Plain Text):

This UI has the following improvements...

Both formats are supported. The server will extract the result field from JSON or use the entire response body as plain text.

🛠️ Development

Available Scripts

# Build TypeScript to JavaScript
npm run build

# Build and run the server
npm run dev

# Run the built server
npm run start

# Clean build artifacts
npm run clean

# Clean and rebuild
npm run rebuild

# Run test HTTP endpoint (for development)
npm run test:endpoint

# Run production HTTP endpoint (Claude API integration)
npm run prod:endpoint

Production Endpoint (Claude API Integration)

This project includes a production-ready HTTP endpoint that integrates with Claude API for real UI analysis.

Setup

Get Claude API Key: Sign up at https://console.anthropic.com/ and get your API key
Start the production endpoint:

# Using environment variable
ANTHROPIC_API_KEY=your-api-key-here npm run prod:endpoint

# Or using .env file
echo "ANTHROPIC_API_KEY=your-api-key-here" > .env.production
source .env.production  # On Windows: set /p < .env.production
npm run prod:endpoint

Update MCP config to use the production endpoint:

{
  "endpoint_url": "http://localhost:3001/claude/vision",
  "auth_token": "",
  "default_prompt": "Please review this UI and point out 3 improvements.",
  "presets": { ... }
}

Environment Variables

Variable	Required	Default	Description
`ANTHROPIC_API_KEY`	✅ Yes	-	Your Claude API key
`PORT`	❌ No	3001	Server port
`AUTH_TOKEN`	❌ No	-	Bearer token for authentication
`CLAUDE_MODEL`	❌ No	claude-3-5-sonnet-20241022	Claude model to use
`MAX_TOKENS`	❌ No	4096	Maximum tokens in response

Example with Authentication

# Start with authentication enabled
ANTHROPIC_API_KEY=your-api-key \
AUTH_TOKEN=your-secret-token \
npm run prod:endpoint

Then update your config:

{
  "endpoint_url": "http://localhost:3001/claude/vision",
  "auth_token": "your-secret-token"
}

Manual Testing

Start your HTTP endpoint (see Creating a Test Endpoint)
Run the MCP server manually:
```
npm run dev
```
The server communicates via stdio (JSON-RPC 2.0)
Send a tool execution request to test

Creating a Test Endpoint

Create a simple Express server to test:

// test-server.js
import express from 'express';

const app = express();
app.use(express.json({ limit: '50mb' }));

app.post('/claude/ui-review', (req, res) => {
  console.log('Received prompt:', req.body.prompt);
  console.log('Image size:', req.body.image_base64?.length);

  res.json({
    result: `Analysis: This is a test response. Prompt was: ${req.body.prompt}`
  });
});

app.listen(12345, () => {
  console.log('Test endpoint running on http://localhost:12345');
});

Run with:

node test-server.js

📡 MCP Tool Reference

Tool: `capture_ui_screen`

Captures the active window screenshot and sends it for analysis.

Input Parameters

{
  prompt?: string;   // Custom prompt (overrides preset and default)
  preset?: string;   // Preset name from config.json
}

Priority:

prompt (if provided)
preset (if provided and exists in config)
default_prompt (from config)
Fallback: "Please analyze this UI screenshot."

Output (Success)

{
  "status": "success",
  "analysis": "This UI has...",
  "endpoint": "http://localhost:12345/claude/ui-review"
}

Output (Error)

{
  "status": "error",
  "error_code": "capture_failed | http_failed | config_missing",
  "message": "Detailed error message"
}

🐛 Troubleshooting

Error: `endpoint_url is required in config.json`

Solution: Make sure config.json exists and contains the endpoint_url field.

Error: `Capture failed`

Possible causes:

No active window
Screenshot library error

Solution: Check the error logs for details. Make sure you have an active window open.

Error: `HTTP failed`