# unstuck-ai

MCP server enabling AI agents to instantly pay humans sats (Bitcoin) to solve visual roadblocks (captchas, web navigation, computer use) via a Nostr marketplace. Includes the MCP server and web app for humans to bid on tasks, complete them, and get paid.
## 🎥 Video Demonstrations
- Project Presentation & Q&A - Complete overview of Unstuck AI, architecture, and live Q&A session
- Goose Agent in Docker Sandbox - Live demo of Goose running in sandbox, paying Bitcoin invoice for human help
- Original Proof of Concept - First working version showing the core concept
## Quick Start with Goose

Want to use this with Goose? Follow the steps below for complete setup instructions.
### ⚠️ IMPORTANT: Goose Version Distinction

There are two different AI assistants named "Goose":

- Block's Goose (https://github.com/block/goose) - Version 1.0.x+, has MCP support via `--with-remote-extension`
- goose-ai (PyPI package) - Version 0.9.x, a different project with NO MCP support

This project requires Block's Goose for MCP integration. Install it from: https://block.github.io/goose/docs/getting-started/installation
1. Install dependencies: `cd mcp_server && pip install -r requirements.txt`
2. Configure environment variables in `mcp_server/.env`
3. Add the extension to your Goose config (`~/.config/goose/config.yaml`); a sample entry is shown below
4. Run `goose session` and ask for visual help!
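For step 3, the extension entry might look something like this. This is a hypothetical sketch based on Goose's SSE extension config; the `unstuck` name and local URI are assumptions matching the run instructions later in this README, so verify the exact schema against the Goose docs:

```yaml
# Hypothetical entry in ~/.config/goose/config.yaml - verify against the Goose docs.
extensions:
  unstuck:
    enabled: true
    name: unstuck
    type: sse
    uri: http://127.0.0.1:8000/sse
    timeout: 300
```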
## Development Plan

Components:
- MCP Server
  - sends a kind 5109 event requesting help with a visual computer interaction task (example events are sketched under Misc below)
  - listens for feedback responses carrying prices and corresponding invoices
  - selects an offer by paying its invoice
  - receives the result and returns it as the result of the MCP tool call
  - (testing) check that the invoice was paid in the payment simulator before sending the kind 6109 result
  - get image upload working on DigitalOcean
  - get Goose or Claude to take a screenshot and call the MCP tool correctly
  - provide a Docker container to run an MCP-based agent (like the Claude computer use demo here: https://github.com/anthropics/anthropic-quickstarts/tree/main/computer-use-demo)
    - you should be able to navigate to a webpage on your computer that lets you interact with Goose running in the VM
    - it should work as if you were running it locally
    - we expect it will be easier to get automation tools working in this VM, because you can safely give Goose full permissions
  - get Goose to execute the action that the human gives (from the result of the MCP tool call); see the automation sketch after this list
    - translate coordinates to the correct coordinates on screen
    - get an automation library working that performs clicking, double-clicking, and dragging
  - change hosting of screenshots from DigitalOcean Spaces to a Blossom server
  - have the MCP server post a kind 1 note linking to the job, to advertise it on social media and increase the odds of a human being notified quickly
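As a sketch of the action-execution items above, the snippet below uses pyautogui as one possible automation library (not necessarily the one this project will settle on) to replay a click, double-click, or drag from the human's response. The `action` dict shape is a made-up placeholder, and the coordinate translation assumes the screenshot and the screen differ only by a uniform scale factor:

```python
import pyautogui  # pip install pyautogui

def execute_action(action: dict, screenshot_size: tuple, screen_size: tuple) -> None:
    """Replay a mouse action described by the human helper.

    `action` is a hypothetical shape, e.g.
      {"type": "click", "x": 512, "y": 384}
      {"type": "drag", "x": 100, "y": 100, "to_x": 300, "to_y": 300}
    Coordinates arrive in screenshot pixels and are scaled to the live screen
    (important on Retina displays, where the screenshot is 2x the screen size).
    """
    sx = screen_size[0] / screenshot_size[0]
    sy = screen_size[1] / screenshot_size[1]
    x, y = action["x"] * sx, action["y"] * sy

    if action["type"] == "click":
        pyautogui.click(x, y)
    elif action["type"] == "double_click":
        pyautogui.doubleClick(x, y)
    elif action["type"] == "drag":
        pyautogui.moveTo(x, y)
        pyautogui.dragTo(action["to_x"] * sx, action["to_y"] * sy, duration=0.5)
```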
- Unstuck Frontend (Job Board and Workspace)
  - user login via Nostr
  - pull job offers from Nostr relays
  - allow the user to set a default job amount in sats
  - display job offers
  - allow the user to select a job
    - generate a Lightning invoice for the amount
    - broadcast a kind 7000 event with the invoice and price (see the sketch after this list)
    - show a notification when the invoice is paid
    - one suggestion is to use Bitcoin Connect instead of WebLN + a browser extension (the current demo uses the Alby extension)
  - show a workspace-like page for the user to do the work
  - send the final job result event when the user is done
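For the broadcast step above, a kind 7000 offer could look roughly like this. This is a sketch following NIP-90 feedback conventions (status, amount, and invoice tags); every value below is a placeholder rather than the project's finalized format:

```python
import json

# Hypothetical kind 7000 feedback event offering to do the job for a price.
# Follows NIP-90 conventions: amounts are in millisats, paired with a bolt11
# Lightning invoice. All values are placeholders.
offer = {
    "kind": 7000,
    "content": "",
    "tags": [
        ["status", "payment-required"],
        ["amount", "21000", "<bolt11-invoice>"],  # msats + Lightning invoice
        ["e", "<job-request-event-id>"],
        ["p", "<agent-pubkey>"],
    ],
}

print(json.dumps(offer, indent=2))
```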
- Misc
  - write a wiki page on kind 5109 so it shows up nicely in https://stats.dvmdash.live/kind-stats
    - example article: https://njump.me/naddr1qvzqqqrcvgpzpkscaxrqqs8nhaynsahuz6c6jy4wtfhkl2x4zkwrmc4cyvaqmxz3qqykk6twvsar2vesxq02d424
    - instructions for writing these wiki pages: https://habla.news/u/dustind@dtdannen.github.io/1743798227950
    - use a 'd' tag with the value 'kind:5109' so it shows on DVMDash
    - use a 'title' tag like 'Nostr DVM Kind 5109/6109 - Visual Computer Task Help'
    - the page only needs these sections:
      - an introduction that describes the inputs (text + screenshot) and outputs (mouse movement commands)
      - an example input event (see the sketch below)
      - an example output event (see the sketch below)
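As a starting point for those example sections, here is a hypothetical input/output pair modeled on NIP-90 DVM job request/result conventions; the tag layout and the shape of the mouse-command payload are assumptions, not a finalized spec:

```python
import json

# Hypothetical kind 5109 input event: text instructions plus a screenshot URL.
# Tag layout follows NIP-90 conventions; the project's final spec may differ.
job_request = {
    "kind": 5109,
    "content": "",
    "tags": [
        ["i", "https://example.com/screenshot_20250101_120000.png", "url"],
        ["i", "Where do I click to open Safari?", "text"],
        ["output", "text/plain"],
        ["bid", "21000"],  # NIP-90 bids are denominated in millisats
    ],
}

# Hypothetical kind 6109 output event: mouse movement commands as the result.
job_result = {
    "kind": 6109,
    "content": json.dumps([{"type": "double_click", "x": 512, "y": 740}]),
    "tags": [
        ["e", "<job-request-event-id>"],  # placeholder for the request's id
        ["p", "<agent-pubkey>"],          # placeholder for the agent's pubkey
    ],
}

print(json.dumps(job_request, indent=2))
print(json.dumps(job_result, indent=2))
```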
## How to use

Currently, the demo requires DigitalOcean Spaces credentials for hosting the uploaded screenshots (any boto3-compatible provider, like AWS S3, should work) and a Nostr Wallet Connect string (this is how the MCP server pays the human's invoice). These go into the .env file:

```bash
cp .env.example .env
```

Then fill in the required credentials. The NOSTR_PRIVATE_KEY is for the AI agent; you should generate a fresh one rather than reusing your personal nsec.
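A fresh key is just 32 random bytes; a minimal way to generate one is shown below. This assumes the server accepts a hex-encoded key - if it expects a bech32 nsec string instead, convert it with a Nostr library:

```python
import secrets

# Generate a new 32-byte Nostr private key, hex-encoded (64 hex characters).
# Assumption: NOSTR_PRIVATE_KEY accepts hex; convert to nsec with a Nostr
# library if the server expects bech32 instead.
print(secrets.token_hex(32))
```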
Set up the Python environment:

```bash
cd mcp_server/
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
```
Then start the MCP server from within the `mcp_server` folder:

```bash
fastmcp run unstuck_ai/server.py:mcp --transport sse
```
Then, in another terminal, run Goose:

```bash
goose session --with-remote-extension http://127.0.0.1:8000/sse
```
Then you can try a prompt to Goose like:
( O)> can you use the unstuck ai tool to get help so I can open safari on my machine? First take a screenshot of my screen, save it and print the file path, and then give that file path when you call the tool. There are lots of screenshots, so make sure you save the screenshot with a timestamp and record that timestamp so you use the right screenshot
A robust way to take a screenshot:

```bash
screencapture -x /Users/dustin/screenshot_$(date +"%Y%m%d_%H%M%S").png
```
A robust way to find the most recent screenshot:

```bash
ls -la /Users/dustin/screenshot_*.png | tail -1
```

Make sure the file exists before calling the Unstuck AI tool.