
# Repo-to-Text MCP Server with AI Task Generation (Shotgun Mode)

**The Ultimate Bridge Between Your Codebase, LLMs, and Implementation**

A powerful MCP (Model Context Protocol) server that converts entire repositories into LLM-friendly text format with AI-powered analysis, intelligent filtering, and Gemini-powered implementation task generation. Like Shotgun, but better: it generates complete implementation directives for Cursor, Windsurf, and Claude Desktop.

## 🚀 Why This is Better Than Everything Else

| Feature | Shotgun | repo-to-text | This MCP Server |
|---|---|---|---|
| Integration | Manual copy-paste | Command line | ✅ Direct Claude integration |
| Format | Messy delimiters | Basic XML | ✅ Clean, optimized XML |
| Intelligence | None | None | ✅ AI-powered analysis |
| Multi-format | Shotgun only | Limited | ✅ XML, Shotgun, Markdown |
| Token estimation | None | None | ✅ Multi-provider support |
| Task generation | Manual | None | ✅ Gemini-powered directives |
| Implementation plan | None | None | ✅ Complete workflow automation |
| IDE integration | Basic | None | ✅ Cursor/Windsurf/Claude optimized |

## ✨ Key Features

- 🧠 **AI-Powered Analysis** - Claude analyzes your project and suggests optimal exclusions
- 🎯 **Multiple Output Formats** - XML (best for LLMs), Shotgun (Cursor-compatible), Markdown
- 📊 **Token Estimation** - Know exactly how much context you're using across different LLMs
- 🔄 **Smart Chunking** - Automatically split large repos for different model limits
- 🛠 **Patch Application** - Apply LLM responses back to your codebase with dry-run safety
- 🎨 **Project-Type Aware** - Detects languages/frameworks and optimizes accordingly
- ⚡ **Blazing Fast** - Efficient file processing with intelligent filtering

## 🎯 Perfect for Gemini 2.5 Pro

This MCP server is optimized for Gemini 2.5 Pro's 2-million-token context:

- Clean XML format that Gemini handles well
- Smart preprocessing to maximize context utilization
- Token estimation so you know what fraction of the available context you're using
- Handles massive codebases that other tools can't
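Token estimates are necessarily approximate. As an illustration of the idea only (not the server's actual `estimate_tokens` implementation), a common heuristic is roughly 4 characters per token, combined with each model's published context limit; the per-provider ratios below are assumptions:

```python
# Rough multi-provider token estimation (illustrative heuristic only;
# a real tool may use each provider's tokenizer instead).
# The chars-per-token ratios are assumptions.
CHARS_PER_TOKEN = {
    "gemini-2.5-pro": 4.0,
    "claude-3.5-sonnet": 3.8,
    "gpt-4o": 4.0,
}

CONTEXT_LIMITS = {
    "gemini-2.5-pro": 2_000_000,
    "claude-3.5-sonnet": 200_000,
    "gpt-4o": 128_000,
}

def estimate_tokens(text: str, provider: str = "gemini-2.5-pro") -> dict:
    """Estimate token count and the fraction of the context window used."""
    tokens = int(len(text) / CHARS_PER_TOKEN[provider])
    limit = CONTEXT_LIMITS[provider]
    return {
        "tokens": tokens,
        "limit": limit,
        "percent_of_context": round(100 * tokens / limit, 2),
    }

print(estimate_tokens("x" * 400_000))  # ~100k tokens, 5.0% of Gemini's window
```

A 400 KB context therefore sits around 5% of a 2M-token window, which is why even large repos fit comfortably.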

## 📦 Installation

### Prerequisites

- Python 3.7+
- Claude Desktop (for MCP integration)
- Gemini 2.5 Pro access (for task generation features)

### Quick Install

```bash
# Clone the repository
git clone https://github.com/YOUR_USERNAME/repo-to-text-mcp-server.git
cd repo-to-text-mcp-server

# Install dependencies
pip install -r requirements.txt

# Run installation script
chmod +x install.sh
./install.sh
```

### Manual Installation

1. Install Python dependencies:

   ```bash
   pip install pyyaml pathspec mcp
   ```

2. Configure Claude Desktop: add the server to your Claude Desktop config file (`claude_desktop_config.json`; on macOS this lives in `~/Library/Application Support/Claude/`):

   ```json
   {
     "mcpServers": {
       "repo-to-text": {
         "command": "python",
         "args": ["/path/to/repo_to_text_mcp_server.py"]
       }
     }
   }
   ```

3. Restart Claude Desktop

## 🚀 Quick Start

### 1. Analyze Your Project

```
Analyze my project at /path/to/my/code
```

Get comprehensive insights about your codebase with AI-powered suggestions.

### 2. Generate LLM-Ready Context

```
Generate repo context for /path/to/my/code in XML format
```

Creates clean, structured text perfect for any LLM.

### 3. Generate Implementation Plan (NEW!)

```
Generate implementation plan for /path/to/my/code with requirements: "Add user authentication"
```

Get a complete Gemini prompt for generating implementation directives.

### 4. Parse AI Response

```
Parse this Gemini response for Cursor: [paste response]
```

Convert AI output into IDE-specific implementation steps.

## 🛠 Available Tools

| Tool | Description | Perfect For |
|---|---|---|
| `analyze_project` | Comprehensive project analysis | Understanding your codebase |
| `generate_repo_context` | Main conversion with smart filtering | Creating LLM prompts |
| `estimate_tokens` | Multi-provider token counting | Context planning |
| `preview_selection` | See which files will be included | Large-project optimization |
| `suggest_exclusions` | AI-powered exclusion recommendations | Noise reduction |
| `chunk_for_llm` | Smart chunking for model limits | Handling huge repos |
| `apply_patch` | Apply LLM responses to the codebase | Completing the workflow |
| `generate_implementation_tasks` | Generate Gemini prompts for implementation | Creating implementation plans |
| `parse_gemini_response` | Parse Gemini's response into IDE directives | Converting AI output to tasks |
| `generate_implementation_plan` | Complete Shotgun-style workflow | End-to-end implementation |

## 🎯 NEW: Shotgun-Style Implementation Generation

This MCP server now includes powerful task generation features inspired by Shotgun, but better:

### The Complete Workflow

1. **Describe your requirements**

   ```
   "I need to add user authentication with JWT tokens to my Express app"
   ```

2. **Generate an implementation plan**

   ```
   Use generate_implementation_plan for /path/to/project with requirements: "Add JWT authentication..."
   ```

3. **Get a Gemini-optimized prompt**

   - Automatically analyzes your entire codebase
   - Creates a comprehensive prompt for Gemini 2.5 Pro
   - Includes project context, structure, and requirements

4. **Parse Gemini's response**

   ```
   Parse this Gemini response for Cursor IDE...
   ```

5. **Get IDE-specific directives**

   - **Cursor/Windsurf**: Step-by-step implementation guide
   - **Claude Desktop**: Task breakdown with priorities
   - **Generic**: Structured JSON for any tool
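The parsing step can be pictured roughly like this. This is an illustrative sketch, not the server's actual `parse_gemini_response` implementation; the `### Step N:` heading pattern (taken from the example output later in this README) and the output shape are assumptions:

```python
import re

def parse_steps(markdown: str) -> list:
    """Split a Gemini-style markdown response into ordered steps.

    Assumes steps are announced with '### Step N: Title' headings;
    real responses may vary.
    """
    pattern = re.compile(r"^### Step (\d+): (.+)$", re.MULTILINE)
    matches = list(pattern.finditer(markdown))
    steps = []
    for i, m in enumerate(matches):
        # A step's body runs from the end of its heading to the next heading.
        start = m.end()
        end = matches[i + 1].start() if i + 1 < len(matches) else len(markdown)
        steps.append({
            "number": int(m.group(1)),
            "title": m.group(2).strip(),
            "body": markdown[start:end].strip(),
        })
    return steps

response = """### Step 1: Install Dependencies
npm install jsonwebtoken bcrypt

### Step 2: Create User Model
Create models/User.js
"""
for step in parse_steps(response):
    print(step["number"], step["title"])
```

From a structure like this, emitting Cursor/Windsurf checklists or a generic JSON task list is a straightforward serialization step.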

### Example: Adding Authentication to an Express App

**Important note**: this tool generates prompts *for* Gemini 2.5 Pro. You need to:

1. Copy the generated prompt from this tool
2. Paste it into Gemini 2.5 Pro (via web interface or API)
3. Copy Gemini's response back
4. Use the `parse_gemini_response` tool to convert it to IDE directives

```
# Step 1: Analyze the project
"Analyze my Express project at /Users/me/myapp"

# Step 2: Generate implementation tasks
"Generate implementation tasks for /Users/me/myapp with requirements:
Add JWT-based authentication with:
- User registration and login endpoints
- Password hashing with bcrypt
- JWT token generation and validation
- Protected route middleware
- User profile endpoint"

# Step 3: Copy prompt to Gemini 2.5 Pro
# (The tool provides a complete prompt)

# Step 4: Parse Gemini's response
"Parse this Gemini response for Cursor:
[Paste Gemini's response here]"

# Result: Complete implementation directives!
```

### Output Example for Cursor

````markdown
# 🎯 CURSOR IMPLEMENTATION DIRECTIVES

## 📋 TASK OVERVIEW
Implement JWT authentication system for Express application...

## 🏗️ ARCHITECTURE CHANGES
- Add auth middleware directory
- Create user model with Mongoose
- Add authentication routes
- Implement JWT utility functions

## 📝 IMPLEMENTATION STEPS

### Step 1: Install Dependencies
```bash
npm install jsonwebtoken bcrypt express-validator
```

### Step 2: Create User Model

Create `models/User.js`:

```javascript
const mongoose = require('mongoose');
const bcrypt = require('bcrypt');

const userSchema = new mongoose.Schema({
  email: { type: String, required: true, unique: true },
  password: { type: String, required: true },
  // ... complete implementation
});
```

### Step 3: Create Auth Middleware

[... detailed implementation steps ...]
````

## šŸ“ Output Formats

### XML Format (Recommended)
Clean, structured format that works perfectly with all modern LLMs:
```xml
<repo-to-text>
Directory: myproject
<directory_structure>
ā”œā”€ā”€ src/
│   └── main.py
└── README.md
</directory_structure>
<content full_path="src/main.py">
def hello():
    print("Hello World")
</content>
</repo-to-text>

### Shotgun Format (Cursor Compatible)

Compatible with Cursor, Windsurf, and copy-paste workflows:

```
*#*#*src/main.py*#*#*begin*#*#*
def hello():
    print("Hello World")
*#*#*end*#*#*
```
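Going the other way, delimited output like this is easy to parse back into a `{path: content}` map. The sketch below infers the delimiter layout from the sample above; the real format may differ in edge cases:

```python
import re

def parse_shotgun(text: str) -> dict:
    """Parse Shotgun-delimited output back into {path: content}.

    Delimiter layout inferred from the sample above:
    *#*#*<path>*#*#*begin*#*#* ... *#*#*end*#*#*
    """
    pattern = re.compile(
        r"\*#\*#\*(.+?)\*#\*#\*begin\*#\*#\*\n(.*?)\n\*#\*#\*end\*#\*#\*",
        re.DOTALL,
    )
    return {path: body for path, body in pattern.findall(text)}

sample = """*#*#*src/main.py*#*#*begin*#*#*
def hello():
    print("Hello World")
*#*#*end*#*#*"""
files = parse_shotgun(sample)
print(list(files))  # ['src/main.py']
```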

### Markdown Format

Perfect for documentation and human reading:

````markdown
# Repository: myproject
## File Contents
### src/main.py
```python
def hello():
    print("Hello World")
```
````

## 🎯 Usage Examples

### Basic Workflow

1. **Analyze first**

   ```
   "Analyze my React project at /Users/me/myapp"
   ```

2. **Generate context**

   ```
   "Generate repo context excluding node_modules and test files"
   ```

3. **Use with your favorite LLM**

   - Paste into Gemini 2.5 Pro, get a massive patch
   - Apply it back to the codebase

### Advanced Usage

**Custom exclusions**

```
"Generate repo context for /path/to/code excluding ['*.test.js', 'docs/', 'examples/']"
```

**Preview before generating**

```
"Preview what files would be included from /path/to/my/large/project"
```

**Token optimization**

```
"Chunk this repo context for Claude 3.5 Sonnet"
```

**Apply LLM patches**

```
"Apply this patch to /path/to/code with dry run first"
```

## āš™ļø Configuration

Create `.repo-to-text-settings.yaml` in your project root for custom settings:

```yaml
gitignore-import-and-ignore: true

ignore-tree-and-content:
  - "node_modules/"
  - "__pycache__/"
  - "*.log"
  - "dist/"
  - "build/"

ignore-content-only:
  - "package-lock.json"
  - "yarn.lock"
  - "README.md"

force-include:
  - "Dockerfile"
  - "package.json"
  - "requirements.txt"
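How these three lists interact can be sketched as follows. This is illustrative only: the server depends on `pathspec` for real gitignore-style matching, and the precedence shown here (force-include wins over the ignore rules) is an assumption, not documented behavior:

```python
from fnmatch import fnmatch

# Stand-in for the parsed YAML above (yaml.safe_load would produce this dict).
settings = {
    "ignore-tree-and-content": ["node_modules/", "*.log", "dist/"],
    "ignore-content-only": ["package-lock.json"],
    "force-include": ["Dockerfile", "package.json"],
}

def matches(path: str, patterns: list) -> bool:
    """True if path matches any pattern; directory patterns end with '/'."""
    for pat in patterns:
        if pat.endswith("/") and (path.startswith(pat) or f"/{pat}" in path):
            return True
        if fnmatch(path.rsplit("/", 1)[-1], pat):  # match against basename
            return True
    return False

def classify(path: str) -> str:
    """Decide how a file is treated: 'include', 'tree-only', or 'skip'."""
    if matches(path, settings["force-include"]):
        return "include"      # assumed to override the ignore rules
    if matches(path, settings["ignore-tree-and-content"]):
        return "skip"         # hidden from both the tree and the content
    if matches(path, settings["ignore-content-only"]):
        return "tree-only"    # listed in the tree, content omitted
    return "include"

print(classify("node_modules/react/index.js"))  # skip
print(classify("package-lock.json"))            # tree-only
print(classify("package.json"))                 # include
```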

## 🎛 Smart Defaults

The server automatically excludes common noise:

- **Dependencies**: `node_modules/`, `venv/`, `__pycache__/`
- **Build artifacts**: `dist/`, `build/`, `target/`
- **Version control**: `.git/`, `.svn/`
- **IDE files**: `.vscode/`, `.idea/`
- **Media files**: `*.png`, `*.jpg`, `*.mp4`
- **Archives**: `*.zip`, `*.tar.gz`

And includes important files:

- **Config files**: `package.json`, `requirements.txt`, `Dockerfile`
- **Documentation**: `README.md`, `LICENSE`

## 🚀 Perfect Workflows

### Gemini 2.5 Pro Workflow (Recommended)

1. **Analyze** → Claude understands your project
2. **Generate XML** → Perfect format for Gemini's 2M context
3. **Paste & prompt** → Get massive, coherent patches
4. **Apply safely** → Dry-run first, then apply

### Cursor/Windsurf Workflow

1. **Generate Shotgun format** → Compatible delimiters
2. **Paste into IDE** → Direct integration
3. **Get diff responses** → Apply through IDE tools

### Multi-LLM Workflow

1. **Estimate tokens** → Know what fits where
2. **Chunk intelligently** → Split for smaller models
3. **Use the best model** → Route to the optimal LLM
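The chunking idea can be sketched like this (illustrative; `chunk_for_llm`'s actual packing strategy is not documented here). Files are greedily packed into chunks that stay under a per-model budget, using the same rough 4-chars-per-token heuristic and leaving headroom for the prompt and the model's reply:

```python
# Greedy file packing under a token budget. The context limits are
# public figures; the 20% headroom and the packing strategy are
# assumptions for the sake of the sketch.
MODEL_LIMITS = {
    "gemini-2.5-pro": 2_000_000,
    "claude-3.5-sonnet": 200_000,
    "gpt-4o": 128_000,
}

def chunk_files(files: dict, model: str) -> list:
    """Split {path: content} into chunks fitting the model's context."""
    # Token budget -> character budget via the ~4 chars/token heuristic.
    budget = int(MODEL_LIMITS[model] * 0.8) * 4
    chunks, current, used = [], {}, 0
    for path, content in files.items():
        size = len(content)
        if current and used + size > budget:
            chunks.append(current)      # current chunk is full; start a new one
            current, used = {}, 0
        current[path] = content
        used += size
    if current:
        chunks.append(current)
    return chunks

files = {f"src/file{i}.py": "x" * 300_000 for i in range(3)}
print(len(chunk_files(files, "claude-3.5-sonnet")))  # 2
```

The same repo that needs two chunks for a 200k-token model fits in one chunk for Gemini's 2M window, which is the basis of the routing step above.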

## 🔧 Troubleshooting

| Issue | Solution |
|---|---|
| `ModuleNotFoundError: mcp` | Install with `pip install mcp` |
| Server not appearing in Claude | Check the config path and restart Claude Desktop |
| Permission errors | Ensure the script has execute permissions |
| Large output truncated | Use exclusions or chunking |
| Binary files included | They're auto-excluded; check your patterns |

šŸ¤ Contributing

Contributions welcome! Please read our contributing guidelines:

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Add tests if applicable
  5. Submit a pull request

## 📊 Performance

- **Blazing fast**: processes 1000+ files in milliseconds
- **Memory efficient**: streams large files instead of loading everything into memory
- **Smart caching**: reuses analysis results
- **Handles huge repos**: Git repositories with 10k+ files


## 📄 License

MIT License - see the LICENSE file for details.

šŸ™ Acknowledgments

šŸ”— Related Projects


**Transform your entire codebase into LLM-ready context with AI-powered intelligence. Perfect for Gemini 2.5 Pro's massive context window, but works brilliantly with any LLM.**

⭐ **Star this repo if it helps you build better software faster!**