mcp-search-server by devopsexpertlearning - MCP Server

🌐 MCP Browser Search Server

A comprehensive Model Context Protocol (MCP) server providing advanced web search and content extraction capabilities. Built for AI applications, Claude Desktop, and any MCP-compatible client.

✨ Key Features

🎯 Three Server Versions

🚀 Enhanced Server - Advanced features with caching, bulk operations, and domain analysis
🔧 Full Browser Server - Complete Playwright automation for JavaScript-heavy sites
⚡ Fallback Server - Lightweight curl-based version for maximum compatibility

🔍 Search Capabilities

Multiple Search Engines: Google, Bing, DuckDuckGo, Searx, StartPage
News Search: Recent articles with source attribution
Bulk Search: Parallel searches across multiple queries
Smart Caching: Configurable result caching for performance

📄 Content Extraction

Intelligent Parsing: Article content, metadata, links, and images
Multiple Formats: Full HTML, clean text, or article-only extraction
Domain Analysis: Endpoint discovery and basic site information
Link Discovery: Extract and analyze page relationships

🚀 Quick Start

Installation

# Clone and set up locally
git clone https://github.com/devopsexpertlearning/mcp-search-server/tree/main
cd mcp-browser-search

# Install dependencies
npm install

# Build the project
npm run build

# Start the server
npm start

Basic Usage

# Start Enhanced Server (recommended)
npm run start:enhanced

# Start Fallback Server (maximum compatibility)
npm run start:fallback

# Start Full Browser Server (default)
npm start

# Start Ollama-powered Server
npm run start:ollama

🛠️ Development

Prerequisites

Node.js >= 18
npm or yarn
Git

Build from Source

# Clone the repository
git clone https://github.com/your-repo/mcp-browser-search.git
cd mcp-browser-search

# Install dependencies
npm install

# Build TypeScript code
npm run build

# Test the build
npm start

Development Mode

# Watch mode for development
npm run dev

# Build specific server
npm run build:enhanced
npm run build:fallback
npm run build:ollama

🐳 Deployment

Docker Deployment

Basic Docker Setup

# Build Docker image
docker build -t mcp-browser-search .

# Run with Enhanced Server
docker run -p 3000:3000 mcp-browser-search

# Run with environment variables
docker run -e MCP_SERVER_TYPE=enhanced -e CACHE_ENABLED=true mcp-browser-search

Docker Compose

version: '3.8'
services:
  mcp-browser-search:
    build: .
    environment:
      - MCP_SERVER_TYPE=enhanced
      - CACHE_ENABLED=true
      - REDIS_URL=redis://redis:6379
    ports:
      - "3000:3000"
    depends_on:
      - redis
  
  redis:
    image: redis:alpine
    ports:
      - "6379:6379"

Kubernetes Deployment

apiVersion: apps/v1
kind: Deployment
metadata:
  name: mcp-browser-search
spec:
  replicas: 3
  selector:
    matchLabels:
      app: mcp-browser-search
  template:
    metadata:
      labels:
        app: mcp-browser-search
    spec:
      containers:
      - name: mcp-browser-search
        image: mcp-browser-search:latest
        ports:
        - containerPort: 3000
        env:
        - name: MCP_SERVER_TYPE
          value: "enhanced"

🔧 Integration

Claude Desktop

Add to your Claude Desktop configuration:

{
  "mcpServers": {
    "browser-search": {
      "command": "node",
      "args": ["/path/to/mcp-browser-search/dist/index.js"],
      "env": {
        "MCP_SERVER_TYPE": "enhanced"
      }
    }
  }
}

Generic MCP Clients

The server implements the standard MCP protocol and can be used with any MCP-compatible client.

🛠️ Available Tools

🔍 `search_web`

Search the web using various search engines.

Parameters:

query (string): Search query
engine (string, optional): Search engine (google, bing, duckduckgo)
max_results (number, optional): Maximum results to return

📄 `visit_page`

Extract content from a specific webpage.

Parameters:

url (string): URL to visit
extraction_type (string, optional): Type of extraction (article, full, clean)

📰 `search_news`

Search for recent news articles.

Parameters:

query (string): News search query
max_results (number, optional): Maximum results to return

🔍 `bulk_search`

Perform multiple searches in parallel.

Parameters:

queries (array): Array of search queries
engine (string, optional): Search engine to use

🌐 `analyze_domain`

Analyze a domain for endpoints and basic information.

Parameters:

domain (string): Domain to analyze

🔗 `extract_links`

Extract links from a webpage.

Parameters:

url (string): URL to extract links from

⚙️ Configuration

Environment Variables

# Server type
MCP_SERVER_TYPE=enhanced|browser|fallback|ollama

# Cache settings
CACHE_ENABLED=true
CACHE_TTL=3600
REDIS_URL=redis://localhost:6379

# Search settings
DEFAULT_SEARCH_ENGINE=google
MAX_SEARCH_RESULTS=10

# Browser settings (for browser server)
HEADLESS=true
TIMEOUT=30000

# Ollama settings
OLLAMA_BASE_URL=http://localhost:11434
OLLAMA_MODEL=llama2

Configuration File

Create config.json:

{
  "server": {
    "type": "enhanced",
    "port": 3000
  },
  "cache": {
    "enabled": true,
    "ttl": 3600,
    "redisUrl": "redis://localhost:6379"
  },
  "search": {
    "defaultEngine": "google",
    "maxResults": 10,
    "engines": {
      "google": {
        "enabled": true
      },
      "bing": {
        "enabled": true
      }
    }
  }
}

✅ Testing & Validation

Quick Test Commands

After building the project, you can test the MCP server functionality using these methods:

Method 1: Test Tools List

# List all available tools
echo '{"jsonrpc": "2.0", "id": 1, "method": "tools/list", "params": {}}' | node dist/index.js

Method 2: Test Web Search (Fallback Server - Recommended)

# Start fallback server (works without browser dependencies)
npm run start:fallback &

# Test search functionality
echo '{"jsonrpc": "2.0", "id": 2, "method": "tools/call", "params": {"name": "search_web_fallback", "arguments": {"query": "hello world", "engine": "duckduckgo", "max_results": 3}}}' | node dist/servers/FallbackServer.js

Method 3: Test Full Browser Server

# Install browser dependencies first
npx playwright install

# Test with full browser server
echo '{"jsonrpc": "2.0", "id": 3, "method": "tools/call", "params": {"name": "search_web", "arguments": {"query": "test search", "engine": "duckduckgo", "max_results": 3}}}' | node dist/index.js

Method 4: Test Page Visit

# Test content extraction
echo '{"jsonrpc": "2.0", "id": 4, "method": "tools/call", "params": {"name": "visit_page_fallback", "arguments": {"url": "https://example.com"}}}' | node dist/servers/FallbackServer.js

Expected Successful Response

{
  "result": {
    "content": [
      {
        "type": "text",
        "text": "[\n  {\n    \"title\": \"Example Result\",\n    \"url\": \"https://example.com\",\n    \"snippet\": \"Description of the result\",\n    \"domain\": \"example.com\"\n  }\n]"
      }
    ]
  }
}

Testing Different Server Types

# Test Enhanced Server
npm run start:enhanced &
echo '{"jsonrpc": "2.0", "id": 1, "method": "tools/list", "params": {}}' | node dist/servers/EnhancedServer.js

# Test Fallback Server (No browser required)
npm run start:fallback &
echo '{"jsonrpc": "2.0", "id": 1, "method": "tools/list", "params": {}}' | node dist/servers/FallbackServer.js

# Test Ollama Server
npm run start:ollama &
echo '{"jsonrpc": "2.0", "id": 1, "method": "tools/list", "params": {}}' | node dist/servers/OllamaServer.js

Interactive Testing Script

Create test_mcp.js:

const { spawn } = require('child_process');

const mcpServer = spawn('node', ['dist/servers/FallbackServer.js'], {
  stdio: ['pipe', 'pipe', 'pipe']
});

const testMessages = [
  { jsonrpc: "2.0", id: 1, method: "tools/list", params: {} },
  { 
    jsonrpc: "2.0", 
    id: 2, 
    method: "tools/call", 
    params: { 
      name: "search_web_fallback", 
      arguments: { query: "test", max_results: 2 } 
    } 
  }
];

mcpServer.stdout.on('data', (data) => {
  console.log('MCP Response:', data.toString());
});

mcpServer.stderr.on('data', (data) => {
  console.error('MCP Error:', data.toString());
});

testMessages.forEach((msg, i) => {
  setTimeout(() => {
    console.log(`Sending: ${JSON.stringify(msg)}`);
    mcpServer.stdin.write(JSON.stringify(msg) + '\n');
  }, i * 1000);
});

setTimeout(() => {
  mcpServer.kill();
}, 5000);

Run with: node test_mcp.js

🧪 Testing

Run Tests

# Run all tests
npm test

# Run specific test suite
npm run test:unit
npm run test:integration

# Watch mode
npm run test:watch

Manual Testing

# Start server in debug mode
DEBUG=* npm start

# Test with curl
curl -X POST http://localhost:3000/mcp \
  -H "Content-Type: application/json" \
  -d '{"method": "search_web", "params": {"query": "test"}}'

🔒 Security Considerations

Best Practices

Use environment variables for sensitive configuration
Enable rate limiting in production
Use HTTPS in production deployments
Regularly update dependencies
Monitor for suspicious activity

Security Configuration

# Enable security features
RATE_LIMIT_ENABLED=true
RATE_LIMIT_MAX=100
RATE_LIMIT_WINDOW=900000

# CORS settings
CORS_ENABLED=true
CORS_ORIGIN=https://your-domain.com

🐛 Troubleshooting

Common Issues

Installation Issues:

# Clear npm cache
npm cache clean --force

# Reinstall dependencies
rm -rf node_modules package-lock.json
npm install

Browser Server Issues:

# Install Playwright browsers
npx playwright install

# Check browser dependencies
npx playwright install-deps

# If Playwright installation fails, use fallback server
npm run start:fallback

MCP Testing Issues:

"No response from MCP server"

# Check if server is running
ps aux | grep node

# Test basic connectivity
echo '{"jsonrpc": "2.0", "id": 1, "method": "ping"}' | node dist/index.js

"Tool not found" errors

# List available tools first
echo '{"jsonrpc": "2.0", "id": 1, "method": "tools/list", "params": {}}' | node dist/servers/FallbackServer.js

# Use correct tool names:
# - search_web_fallback (for fallback server)
# - search_web (for full browser server)  
# - visit_page_fallback (for fallback server)

"browserType.launch: Executable doesn't exist" error

# This means Playwright browsers aren't installed
# Solution 1: Install browsers
npx playwright install

# Solution 2: Use fallback server instead
npm run start:fallback

"Search returns no results"

# Try different search engines
echo '{"jsonrpc": "2.0", "id": 2, "method": "tools/call", "params": {"name": "search_web_fallback", "arguments": {"query": "test", "engine": "bing"}}}' | node dist/servers/FallbackServer.js

# Check network connectivity
curl -I https://duckduckgo.com

Memory Issues:

# Increase Node.js memory limit
NODE_OPTIONS="--max-old-space-size=4096" npm start

Port Conflicts:

# Check what's using the port
lsof -i :3000

# Kill conflicting processes
pkill -f "node.*mcp"

Debug Mode

# Enable debug logging
DEBUG=mcp:* npm start

# Verbose logging
LOG_LEVEL=debug npm start

🤝 Contributing

Development Setup

Fork the repository
Create a feature branch
Make your changes
Add tests
Submit a pull request

Code Standards

Follow TypeScript best practices
Use ESLint and Prettier
Write comprehensive tests
Document new features

📄 License

MIT License - see LICENSE file for details.

🙏 Acknowledgments

Model Context Protocol (MCP) team
Playwright for browser automation
The open-source community

Made with ❤️ for the MCP community

devopsexpertlearning/mcp-search-server

🌐 MCP Browser Search Server

✨ Key Features

🎯 Three Server Versions

🔍 Search Capabilities

📄 Content Extraction

🚀 Quick Start

Installation

Basic Usage

🛠️ Development

Prerequisites

Build from Source

Development Mode

🐳 Deployment

Docker Deployment

Basic Docker Setup

Docker Compose

Kubernetes Deployment

🔧 Integration

Claude Desktop

Generic MCP Clients

🛠️ Available Tools

🔍 search_web

📄 visit_page

📰 search_news

🔍 bulk_search

🌐 analyze_domain

🔗 extract_links

⚙️ Configuration

Environment Variables

Configuration File

✅ Testing & Validation

Quick Test Commands

Method 1: Test Tools List

Method 2: Test Web Search (Fallback Server - Recommended)

Method 3: Test Full Browser Server

Method 4: Test Page Visit

Expected Successful Response

Testing Different Server Types

Interactive Testing Script

🧪 Testing

Run Tests

Manual Testing

🔒 Security Considerations

Best Practices

Security Configuration

🐛 Troubleshooting

Common Issues

Debug Mode

🤝 Contributing

Development Setup

Code Standards

📄 License

🙏 Acknowledgments

🔍 `search_web`

📄 `visit_page`

📰 `search_news`

🔍 `bulk_search`

🌐 `analyze_domain`

🔗 `extract_links`