PiGrieco/mcp-memory-server
The MCP Memory Server v2.0 - Auto-Trigger Edition is designed to enhance AI capabilities by providing automatic persistent memory, transforming any AI into a super-intelligent assistant.
SAM - Smart Access Memory
Intelligent AI Memory Management with ML Auto-Triggers
Table of Contents
- What is SAM?
- Architecture Overview
- Installation
- Server Modes & Operation
- How SAM Works
- Auto-Trigger System
- Configuration Example
- Model Information
- Technical Documentation
- License
What is SAM?
SAM (Smart Access Memory) is an intelligent memory system for AI platforms that automatically decides when to save and retrieve information. Using a machine learning model trained specifically for this task (99.56% accuracy on its validation set), SAM analyzes conversations in real time and manages memory without user intervention.
Key Benefits:
- Automatic Memory Management: No manual commands - SAM decides when to save and search
- Context-Aware: Understands conversation flow and retrieves relevant information
- Universal: Works with major AI platforms (Cursor, Claude, Windsurf)
- One-Command Install: Simple prompt-based installation for any platform
- NEXT: Lovable and Replit versions!
Architecture Overview
```mermaid
graph TB
    subgraph "AI Platforms"
        A[Cursor IDE] --> MCP[MCP Protocol]
        B[Claude Desktop] --> MCP
        C[GPT/OpenAI] --> MCP
        D[Windsurf IDE] --> MCP
        E[Lovable] --> MCP
        F[Replit] --> MCP
    end
    subgraph "MCP Memory Server"
        MCP --> G[Auto-Trigger System]
        G --> H[ML Model 99.56%]
        G --> I[Deterministic Rules]
        G --> J[Hybrid Engine]
        J --> K[Memory Service]
        K --> L[Semantic Search]
        K --> M[Embedding Service]
        K --> N[Database Service]
    end
    subgraph "Storage"
        N --> O[MongoDB Atlas]
        M --> P[Vector Embeddings]
        L --> Q[Similarity Search]
    end
    style H fill:#ff9999
    style J fill:#99ff99
    style L fill:#9999ff
```
Installation
Prompt-Based Installation (Recommended)
Simply tell your AI assistant:
"Install this: https://github.com/PiGrieco/mcp-memory-server on [PLATFORM]"
Examples:
- "Install this: https://github.com/PiGrieco/mcp-memory-server on Cursor"
- "Install this: https://github.com/PiGrieco/mcp-memory-server on Claude"
Installation Process Flow
```mermaid
graph TD
    A["User starts installation"] --> B["Choose installation method"]
    B --> C1["Manual Script<br/>./scripts/main.sh install all"]
    B --> C2["Python Installer<br/>./scripts/install/install.py"]
    B --> C3["Platform Specific<br/>./scripts/main.sh platform cursor"]
    C1 --> D["Check System Requirements"]
    C2 --> D
    C3 --> D
    D --> E1["Python 3.8+ available"]
    D --> E2["MongoDB installed"]
    D --> E3["Git available"]
    D --> E4["Missing dependencies"]
    E4 --> F["Auto-install dependencies<br/>homebrew, python packages"]
    E1 --> G
    E2 --> G
    E3 --> G
    F --> G["Create virtual environment"]
    G --> H["Install Python packages<br/>requirements.txt"]
    H --> I["Setup MongoDB connection"]
    I --> J["Download ML models<br/>sentence-transformers"]
    J --> K["Generate configuration files"]
    K --> L1["MCP Server config<br/>main.py ready"]
    K --> L2["HTTP Proxy config<br/>proxy_server.py ready"]
    K --> L3["Watchdog config<br/>watchdog_service.py ready"]
    L1 --> M["Platform Integration"]
    L2 --> M
    L3 --> M
    M --> N1["Cursor IDE<br/>Update settings.json"]
    M --> N2["Claude Desktop<br/>Update config.json"]
    M --> N3["Other platforms<br/>Manual configuration"]
    N1 --> O["Installation Complete"]
    N2 --> O
    N3 --> O
    O --> P["Ready to start servers"]
    style A fill:#e1f5fe
    style B fill:#f3e5f5
    style D fill:#fff3e0
    style O fill:#e8f5e8
    style P fill:#e8f5e8
```
What Happens During Installation:
When you give the prompt, your AI assistant will:
- Download the repository to ~/mcp-memory-server
- Set up a Python virtual environment with all dependencies
- Download the ML auto-trigger model from HuggingFace (~63MB)
- Configure your specific platform with dynamic paths (no hardcoded usernames)
- Test all components, including ML model functionality
- Be ready to use in 2-3 minutes
Platform-Specific Commands
If the prompt method doesn't work, use direct commands:
Platform | Installation Command |
---|---|
Cursor IDE | `curl -sSL https://raw.githubusercontent.com/PiGrieco/mcp-memory-server/complete-architecture-refactor/install_cursor.sh \| bash` |
Claude Desktop | `curl -sSL https://raw.githubusercontent.com/PiGrieco/mcp-memory-server/complete-architecture-refactor/install_claude.sh \| bash` |
Windsurf IDE | `curl -sSL https://raw.githubusercontent.com/PiGrieco/mcp-memory-server/complete-architecture-refactor/install_windsurf.sh \| bash` |
Server Modes & Operation
Server Operation Flow
SAM offers multiple server modes to accommodate different use cases and deployment scenarios:
```mermaid
graph TD
    A["User chooses server mode"] --> B["Available modes"]
    B --> C1["MCP Only<br/>./scripts/main.sh server mcp"]
    B --> C2["HTTP Only<br/>./scripts/main.sh server http"]
    B --> C3["Proxy Only<br/>./scripts/main.sh server proxy"]
    B --> C4["Universal<br/>./scripts/main.sh server both"]
    B --> C5["Watchdog<br/>./scripts/main.sh server watchdog"]
    C1 --> D1["MCP Server startup<br/>main.py"]
    C2 --> D2["HTTP Server startup<br/>servers/http_server.py"]
    C3 --> D3["Proxy Server startup<br/>servers/proxy_server.py"]
    C4 --> D4["Both MCP + Proxy<br/>Universal mode"]
    C5 --> D5["Watchdog Service<br/>Auto-restart capability"]
    D1 --> E1["stdio MCP protocol"]
    D2 --> E2["HTTP REST API<br/>localhost:8000"]
    D3 --> E3["HTTP Proxy<br/>localhost:8080"]
    D4 --> E4["stdio + HTTP<br/>Full features"]
    D5 --> E5["Keyword monitoring<br/>Auto-restart triggers"]
    E1 --> F["IDE Integration"]
    E2 --> G["Web/API clients"]
    E3 --> H["AI Assistant integration"]
    E4 --> I["Maximum compatibility"]
    E5 --> J["Always available"]
    F --> K["Memory operations"]
    G --> K
    H --> K
    I --> K
    J --> K
    K --> L1["Deterministic triggers<br/>Keywords: ricorda, save, etc."]
    K --> L2["ML triggers<br/>Semantic analysis"]
    K --> L3["Hybrid triggers<br/>Combined approach"]
    L1 --> M["Auto-execute actions"]
    L2 --> M
    L3 --> M
    M --> N1["save_memory<br/>Store important info"]
    M --> N2["search_memories<br/>Find relevant context"]
    M --> N3["analyze_message<br/>Context enhancement"]
    N1 --> O["MongoDB storage"]
    N2 --> O
    N3 --> O
    O --> P["Memory system active"]
    style A fill:#e1f5fe
    style B fill:#f3e5f5
    style K fill:#fff3e0
    style M fill:#e8f5e8
    style P fill:#e8f5e8
```
Server Mode Comparison
Mode | Protocol | Port | Use Case | Auto-Restart | Best For |
---|---|---|---|---|---|
MCP Only | stdio | - | IDE Integration | No | Cursor, Claude, Windsurf |
HTTP Only | REST API | 8000 | Development/Testing | No | API clients, web apps |
Proxy Only | HTTP Proxy | 8080 | AI Interception | No | Enhanced AI features |
Universal | stdio + HTTP | 8080 | Production | No | Maximum compatibility |
Watchdog | stdio + HTTP | 8080 | Always-On | Yes | Keyword auto-restart |
Watchdog Service (Auto-Restart)
The watchdog service ensures SAM is always available when you need it. It monitors for deterministic keywords and automatically restarts the server:
```mermaid
graph TD
    A["Watchdog Service Active"] --> B["Monitoring input sources"]
    B --> C1["stdin monitoring<br/>Terminal input"]
    B --> C2["File monitoring<br/>logs/restart_triggers.txt"]
    B --> C3["Hybrid monitoring<br/>Both sources"]
    C1 --> D["Keyword detection"]
    C2 --> D
    C3 --> D
    D --> E1["Italian keywords<br/>ricorda, importante, nota"]
    D --> E2["English keywords<br/>remember, save, important"]
    D --> E3["Urgent commands<br/>emergency restart, force restart"]
    D --> E4["Direct commands<br/>mcp start, server start"]
    E1 --> F["Trigger analysis"]
    E2 --> F
    E3 --> F
    E4 --> F
    F --> G{"Rate limiting check"}
    G -->|"Within limits"| H["Stop current server<br/>SIGTERM graceful shutdown"]
    G -->|"Rate limited"| I["Cooldown period<br/>Log and ignore"]
    H --> J["Restart delay<br/>2.0s normal, 0.5s urgent"]
    J --> K["Start new server<br/>python main.py"]
    K --> L{"Server started?"}
    L -->|"Success"| M["Log success<br/>Server restart completed"]
    L -->|"Failed"| N["Log error<br/>Server restart failed"]
    M --> O["Continue monitoring"]
    N --> O
    I --> O
    O --> B
    P["Server process dies"] --> Q["Status monitoring<br/>Check every 5s"]
    Q --> R{"Process alive?"}
    R -->|"No"| S["Log status change<br/>Server is not running"]
    R -->|"Yes"| T["Log status change<br/>Server is running"]
    S --> O
    T --> O
    style A fill:#e1f5fe
    style D fill:#f3e5f5
    style F fill:#fff3e0
    style H fill:#ffebee
    style K fill:#e8f5e8
    style M fill:#e8f5e8
```
Watchdog Keywords:
- Italian: `ricorda`, `importante`, `nota`, `salva`, `memorizza`, `riavvia`
- English: `remember`, `save`, `important`, `store`, `restart`, `wake up`
- Commands: `mcp start`, `server start`, `restart server`
- Urgent: `emergency restart`, `force restart` (0.5s restart vs 2.0s)
Rate Limiting (see the sketch after this list):
- Max 10 restarts per hour
- 30-second cooldown between restarts
- Comprehensive logging to `logs/watchdog.log`
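For illustration, the restart policy above can be expressed in a few lines of Python. This is a hedged sketch, not the actual `watchdog_service.py`; the keyword subsets and the numeric limits are taken from the lists above.
```python
import time
from typing import List, Tuple

# Subsets of the documented keyword lists
NORMAL_KEYWORDS = ("ricorda", "importante", "remember", "save", "mcp start")
URGENT_KEYWORDS = ("emergency restart", "force restart")

MAX_RESTARTS_PER_HOUR = 10   # documented rate limit
COOLDOWN_SECONDS = 30        # documented cooldown
restart_times: List[float] = []

def classify_trigger(line: str) -> Tuple[bool, float]:
    """Return (should_restart, delay_seconds) for a monitored input line."""
    lowered = line.lower()
    if any(k in lowered for k in URGENT_KEYWORDS):
        return True, 0.5   # urgent restart delay
    if any(k in lowered for k in NORMAL_KEYWORDS):
        return True, 2.0   # normal restart delay
    return False, 0.0

def rate_limit_ok(now: float) -> bool:
    """Allow at most 10 restarts per hour, at least 30 s apart."""
    recent = [t for t in restart_times if now - t < 3600]
    return len(recent) < MAX_RESTARTS_PER_HOUR and (
        not recent or now - recent[-1] >= COOLDOWN_SECONDS
    )

# Usage: for each monitored line
should, delay = classify_trigger("please remember this fix")
if should and rate_limit_ok(time.time()):
    restart_times.append(time.time())
    time.sleep(delay)  # then stop (SIGTERM) and relaunch `python main.py`
```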
Quick Start Commands
```bash
# Start in different modes
./scripts/main.sh server mcp        # MCP only (IDE integration)
./scripts/main.sh server http       # HTTP only (development)
./scripts/main.sh server proxy      # Proxy only (AI interception)
./scripts/main.sh server both       # Universal (recommended)
./scripts/main.sh server watchdog   # Auto-restart on keywords

# Installation commands
./scripts/main.sh install all       # Complete installation
./scripts/main.sh platform cursor   # Configure Cursor IDE
./scripts/main.sh platform claude   # Configure Claude Desktop
```
How SAM Works
Technical Overview
SAM uses the Model Context Protocol (MCP) to integrate seamlessly with AI platforms. When you chat with your AI, SAM does the following (see the sketch after this list):
- Analyzes every message in real time using the ML model
- Decides automatically whether to save information, search memory, or do nothing
- Executes memory operations transparently, without interrupting the conversation
- Provides relevant context to enhance AI responses
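The sketch below illustrates that loop in Python. It is a simplified illustration only: the tool names (`save_memory`, `search_memories`, `analyze_message`) come from this project, but the `memory` client object and its signatures are hypothetical.
```python
from typing import Optional

def handle_message(message: str, memory) -> Optional[str]:
    """One pass of the analyze-decide-execute loop (illustrative only)."""
    decision = memory.analyze_message(message)          # ML + rule analysis
    if decision.action == "SAVE_MEMORY":
        memory.save_memory(content=message)             # store silently
        return None
    if decision.action == "SEARCH_MEMORY":
        hits = memory.search_memories(query=message)    # retrieve context
        return "\n".join(hit.content for hit in hits)   # injected into the AI prompt
    return None                                          # NO_ACTION: stay out of the way
```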
User Benefits
- Zero Effort: No manual commands or memory management
- Intelligent Context: AI gets relevant information automatically
- Persistent Knowledge: Important information is never lost
- Cross-Session Memory: Information persists across different conversations
- Semantic Understanding: Finds relevant info even with different wording
Use Cases
- Project Notes: Automatically saves and recalls project decisions, requirements, and insights
- Technical Solutions: Remembers code solutions, debugging steps, and best practices
- Learning: Saves explanations and concepts, and connects related information
- Ideas: Captures creative insights and connects them to relevant context
- Conversations: Maintains context of important discussions and decisions
Auto-Trigger System
How the ML Model Works
SAM uses a hybrid approach combining machine learning with deterministic rules:
ML Model Details
- Model: Custom-trained transformer based on the BERT architecture
- Accuracy: 99.56% on the validation set
- Size: ~63MB (automatically downloaded during installation)
- Languages: English and Italian
- Inference Time: <30ms after initial load
Training Dataset
The model was trained on a dataset of 50,000+ annotated conversations:
- Sources: Real AI conversations, technical discussions, project communications
- Labels: `SAVE_MEMORY`, `SEARCH_MEMORY`, `NO_ACTION`
- Balance: 33% save, 33% search, 34% no action
- Languages: 70% English, 30% Italian
- Validation: 80/20 train/test split with stratified sampling
Training Results
Metric | Score |
---|---|
Overall Accuracy | 99.56% |
Precision (SAVE) | 99.2% |
Precision (SEARCH) | 99.8% |
Precision (NO_ACTION) | 99.7% |
Recall (SAVE) | 99.4% |
Recall (SEARCH) | 99.9% |
Recall (NO_ACTION) | 99.3% |
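If the published checkpoint is a standard Transformers text-classification model (the README describes it as a BERT-based classifier), it can be queried directly, as in the hedged sketch below; the exact label strings the checkpoint returns are an assumption here.
```python
from transformers import pipeline

# Checkpoint named in this README; loading it this way assumes it ships
# with a standard text-classification head and config.
classifier = pipeline(
    "text-classification",
    model="PiGrieco/mcp-memory-auto-trigger-model",
)

result = classifier("Remember that we deploy to staging every Friday.")[0]
print(result["label"], round(result["score"], 4))  # e.g. SAVE_MEMORY 0.99 (labels assumed)
```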
Hybrid System
- Deterministic Rules: Handle obvious patterns (questions, explicit save requests)
- ML Model: Analyzes complex conversational context
- Confidence Thresholds: The system only acts when confidence exceeds 95%
- Fallback Logic: Uses rules when the ML model is uncertain
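A minimal sketch of that hybrid decision, assuming the classifier returns a label and a confidence score; the rules shown stand in for the project's deterministic layer and are illustrative only.
```python
CONFIDENCE_THRESHOLD = 0.95          # "only acts when confidence > 95%"
EXPLICIT_SAVE = ("remember this", "save this", "ricorda")

def decide(message: str, ml_label: str, ml_confidence: float) -> str:
    """Hybrid decision: deterministic rules first, ML only when confident."""
    lowered = message.lower()
    if any(phrase in lowered for phrase in EXPLICIT_SAVE):
        return "SAVE_MEMORY"                 # obvious explicit save request
    if lowered.rstrip().endswith("?"):
        return "SEARCH_MEMORY"               # obvious question pattern
    if ml_confidence > CONFIDENCE_THRESHOLD:
        return ml_label                      # trust the ML model
    return "NO_ACTION"                       # fallback when ML is uncertain
```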
What the System Detects
Auto-Save Triggers:
- Important decisions and conclusions
- Technical solutions and workarounds
- Project requirements and specifications
- Learning insights and explanations
- Error solutions and debugging steps
Auto-Search Triggers:
- Questions about past topics
- Requests for similar information
- References to previous discussions
- Need for context or examples
- Problem-solving requests
No Action:
- General conversation and greetings
- Simple acknowledgments
- Clarifying questions
- Off-topic discussions
Configuration Example
Here's a complete MCP configuration file for Cursor IDE showing all ML parameters:
~/.cursor/mcp_settings.json
```json
{
  "mcpServers": {
    "mcp-memory-sam": {
      "command": "/path/to/mcp-memory-server/venv/bin/python",
      "args": ["/path/to/mcp-memory-server/main.py"],
      "env": {
        "ML_MODEL_TYPE": "huggingface",
        "HUGGINGFACE_MODEL_NAME": "PiGrieco/mcp-memory-auto-trigger-model",
        "AUTO_TRIGGER_ENABLED": "true",
        "PRELOAD_ML_MODEL": "true",
        "CURSOR_MODE": "true",
        "LOG_LEVEL": "INFO",
        "ENVIRONMENT": "development",
        "SERVER_MODE": "universal",
        "ML_CONFIDENCE_THRESHOLD": "0.7",
        "TRIGGER_THRESHOLD": "0.15",
        "SIMILARITY_THRESHOLD": "0.3",
        "MEMORY_THRESHOLD": "0.7",
        "SEMANTIC_THRESHOLD": "0.8",
        "ML_TRIGGER_MODE": "hybrid",
        "ML_TRAINING_ENABLED": "true",
        "ML_RETRAIN_INTERVAL": "50",
        "FEATURE_EXTRACTION_TIMEOUT": "5.0",
        "MAX_CONVERSATION_HISTORY": "10",
        "USER_BEHAVIOR_TRACKING": "true",
        "BEHAVIOR_HISTORY_LIMIT": "1000",
        "EMBEDDING_PROVIDER": "sentence_transformers",
        "EMBEDDING_MODEL": "all-MiniLM-L6-v2",
        "MONGODB_URI": "mongodb://localhost:27017",
        "MONGODB_DATABASE": "mcp_memory_dev"
      }
    }
  }
}
```
Parameter Explanation
Core Configuration
- `ML_MODEL_TYPE`: Type of ML model (`huggingface` for transformer models)
- `HUGGINGFACE_MODEL_NAME`: The specific SAM model with 99.56% accuracy
- `AUTO_TRIGGER_ENABLED`: Enables automatic memory operations without user commands
- `PRELOAD_ML_MODEL`: Loads the ML model at startup for faster response times
- `CURSOR_MODE`: Platform-specific optimizations for Cursor IDE
- `SERVER_MODE`: Architecture mode (`universal` for the modern unified server)
ML Thresholds (Critical for 99.56% Accuracy)
- `ML_CONFIDENCE_THRESHOLD: "0.7"`: Main ML model confidence (70% threshold)
- `TRIGGER_THRESHOLD: "0.15"`: General trigger activation sensitivity (15%)
- `SIMILARITY_THRESHOLD: "0.3"`: Semantic search matching threshold (30%)
- `MEMORY_THRESHOLD: "0.7"`: Memory importance filtering (70%)
- `SEMANTIC_THRESHOLD: "0.8"`: Context similarity matching (80%)
- `ML_TRIGGER_MODE: "hybrid"`: Combines the ML model with deterministic rules
Continuous Learning
- `ML_TRAINING_ENABLED: "true"`: Enables model improvement over time
- `ML_RETRAIN_INTERVAL: "50"`: Retrain the model after 50 new samples
- `FEATURE_EXTRACTION_TIMEOUT: "5.0"`: ML processing timeout (5 seconds)
- `MAX_CONVERSATION_HISTORY: "10"`: Context window for analysis
- `USER_BEHAVIOR_TRACKING: "true"`: Learn from user patterns
- `BEHAVIOR_HISTORY_LIMIT: "1000"`: Maximum behavior samples to store
Embedding Configuration
- `EMBEDDING_PROVIDER: "sentence_transformers"`: Vector embedding engine
- `EMBEDDING_MODEL: "all-MiniLM-L6-v2"`: Lightweight, fast embedding model
- `MONGODB_URI`: Database connection for persistent memory storage
- `MONGODB_DATABASE`: Database name for memory collections
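These embedding settings map onto a standard sentence-transformers workflow. The sketch below shows how `all-MiniLM-L6-v2` and `SIMILARITY_THRESHOLD` fit together; the in-memory list stands in for the MongoDB-backed store.
```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")   # EMBEDDING_MODEL

memories = [
    "We decided to use MongoDB Atlas for persistent storage.",
    "The staging deploy runs every Friday.",
]
memory_vecs = model.encode(memories, convert_to_tensor=True)

query_vec = model.encode("Which database did we pick?", convert_to_tensor=True)
scores = util.cos_sim(query_vec, memory_vecs)[0]   # cosine similarity per memory

SIMILARITY_THRESHOLD = 0.3                         # value from the configuration above
for memory, score in zip(memories, scores):
    if float(score) >= SIMILARITY_THRESHOLD:
        print(f"{float(score):.2f}  {memory}")
```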
System Settings
- `LOG_LEVEL: "INFO"`: Logging verbosity level
- `ENVIRONMENT: "development"`: Current environment mode
Note: These parameters are configured automatically during installation. Advanced users can fine-tune thresholds for their specific use cases.
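Since every setting is an environment variable, overriding one is just a matter of exporting it before launch. Below is a sketch of how such values are typically read in Python, using the documented defaults; the project's actual parsing lives in its config layer.
```python
import os

def env_float(name: str, default: str) -> float:
    """Read a float setting from the environment, falling back to the documented default."""
    return float(os.getenv(name, default))

ML_CONFIDENCE_THRESHOLD = env_float("ML_CONFIDENCE_THRESHOLD", "0.7")
TRIGGER_THRESHOLD = env_float("TRIGGER_THRESHOLD", "0.15")
SIMILARITY_THRESHOLD = env_float("SIMILARITY_THRESHOLD", "0.3")
MEMORY_THRESHOLD = env_float("MEMORY_THRESHOLD", "0.7")
SEMANTIC_THRESHOLD = env_float("SEMANTIC_THRESHOLD", "0.8")

AUTO_TRIGGER_ENABLED = os.getenv("AUTO_TRIGGER_ENABLED", "true").lower() == "true"
```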
Model Information
- Repository: PiGrieco/mcp-memory-auto-trigger-model
- License: MIT
- Framework: Transformers (PyTorch)
- Model Type: BERT-based classifier
- Last Updated: 2024
Technical Documentation
Project Structure
```
mcp-memory-server/
├── main.py                          # Main MCP server entry point
├── src/                             # Core source code
│   ├── config/                      # Configuration management
│   ├── core/                        # Core server implementations
│   │   ├── server.py                # Main MCP server
│   │   ├── auto_trigger_system.py   # Auto-trigger logic
│   │   ├── ml_trigger_system.py     # ML-based triggers
│   │   └── hybrid_trigger_system.py # Hybrid ML + deterministic
│   ├── services/                    # Business logic services
│   │   ├── memory_service.py        # Memory management
│   │   ├── database_service.py      # MongoDB operations
│   │   ├── embedding_service.py     # Vector embeddings
│   │   └── watchdog_service.py      # Auto-restart service
│   └── models/                      # Data models
├── servers/                         # Alternative server implementations
│   ├── http_server.py               # HTTP REST API server
│   └── proxy_server.py              # HTTP proxy with auto-intercept
├── scripts/                         # Installation and management scripts
│   ├── main.sh                      # Unified script manager
│   ├── install/                     # Installation scripts
│   └── servers/                     # Server startup scripts
├── config/                          # Configuration templates
├── tests/                           # Test suite
└── docs/                            # Documentation
```
Development Commands
```bash
# Development workflow
./scripts/main.sh server http      # Start HTTP server for testing
./scripts/main.sh server test      # Run test suite
python -m pytest tests/            # Run specific tests

# Environment management
./scripts/main.sh utils env list                 # List available environments
./scripts/main.sh utils env switch development   # Switch environment

# Installation variants
./scripts/main.sh install core     # Core dependencies only
./scripts/main.sh install ml       # ML dependencies
./scripts/main.sh install dev      # Development dependencies
```
Troubleshooting
Common Issues & Solutions
Issue | Symptoms | Solution |
---|---|---|
MongoDB Connection | Connection refused 27017 | brew services start mongodb-community |
ML Model Download | Model not found | Check internet connection, restart installation |
Python Path Issues | ModuleNotFoundError: src | Verify virtual environment activation |
Port Already in Use | Address already in use: 8080 | Kill existing process or use different port |
Permission Denied | Installation fails | Run with proper permissions, check directory access |
Debug Mode
```bash
# Enable debug logging
export LOG_LEVEL=DEBUG
./scripts/main.sh server both

# Check logs
tail -f logs/mcp_server.log
tail -f logs/watchdog.log
```
Health Checks
```bash
# Test MongoDB connection
python3 -c "import pymongo; print(pymongo.MongoClient().admin.command('ping'))"

# Test ML model
python3 -c "from src.core.ml_trigger_system import MLTriggerSystem; print('ML model OK')"

# Test server endpoints
curl http://localhost:8080/health   # Proxy server health
curl http://localhost:8000/health   # HTTP server health
```
Testing
```bash
# Run all tests
pytest tests/ -v

# Run specific test categories
pytest tests/unit/ -v          # Unit tests
pytest tests/integration/ -v   # Integration tests

# Test with coverage
pytest tests/ --cov=src --cov-report=html
```
Advanced Configuration
Environment Variables
```bash
# Core settings
export MCP_ENVIRONMENT=production
export LOG_LEVEL=INFO
export MONGODB_URI=mongodb://localhost:27017

# ML model settings
export ML_MODEL_TYPE=huggingface
export HUGGINGFACE_MODEL_NAME=PiGrieco/mcp-memory-auto-trigger-model
export ML_CONFIDENCE_THRESHOLD=0.7

# Trigger thresholds
export TRIGGER_THRESHOLD=0.15
export SIMILARITY_THRESHOLD=0.3
export MEMORY_THRESHOLD=0.7
```
Custom Configurations
```bash
# Create custom environment
cp config/environments/development.yaml config/environments/custom.yaml

# Edit custom.yaml with your settings
./scripts/main.sh utils env switch custom
```
Performance Tuning
ML Model Optimization
```
# Preload model for faster inference
"PRELOAD_ML_MODEL": "true"

# Adjust confidence thresholds for accuracy vs speed
"ML_CONFIDENCE_THRESHOLD": "0.7"     # Higher = more accurate, slower
"TRIGGER_THRESHOLD": "0.15"          # Lower = more sensitive

# Timeout settings
"FEATURE_EXTRACTION_TIMEOUT": "5.0"  # ML processing timeout
```
Database Optimization
```javascript
// MongoDB indexes for faster queries
db.memories.createIndex({"embedding": "2dsphere"})
db.memories.createIndex({"timestamp": -1})
db.memories.createIndex({"importance": -1})
```
Security Considerations
- Database: MongoDB should be secured with authentication in production
- Network: Restrict access to ports 8000/8080 in production environments
- Logs: Sensitive information is automatically filtered from logs
- Model: The ML model is loaded locally; no external API calls are made for inference
Production Deployment
Docker Deployment
```bash
# Build and run with Docker Compose
docker-compose up -d

# Scale services
docker-compose scale mcp-server=2 proxy-server=2
```
System Service (Linux/macOS)
```bash
# Create systemd service (Linux)
sudo cp deployment/mcp-memory-server.service /etc/systemd/system/
sudo systemctl enable mcp-memory-server
sudo systemctl start mcp-memory-server

# Create launchd service (macOS)
cp deployment/com.mcp.memory-server.plist ~/Library/LaunchAgents/
launchctl load ~/Library/LaunchAgents/com.mcp.memory-server.plist
```
License
This project is licensed under the MIT License - see the LICENSE file for details.