paper-search-mcp-nodejs

Dianel555/paper-search-mcp-nodejs



Paper Search MCP (Node.js)


A Node.js Model Context Protocol (MCP) server for searching and downloading academic papers from 13 academic platforms, including arXiv, Web of Science, PubMed, Google Scholar, Sci-Hub, ScienceDirect, Springer Nature, Wiley, and Scopus.


✨ Key Features

  • šŸŒ 13 Academic Platforms: arXiv, Web of Science, PubMed, Google Scholar, bioRxiv, medRxiv, Semantic Scholar, IACR ePrint, Sci-Hub, ScienceDirect, Springer Nature, Wiley, Scopus
  • šŸ”— MCP Protocol Integration: Seamless integration with Claude Desktop and other AI assistants
  • šŸ“Š Unified Data Model: Standardized paper format across all platforms
  • ⚔ High-Performance Search: Concurrent search with intelligent rate limiting
  • šŸ›”ļø Type Safety: Complete TypeScript support
  • šŸŽÆ Academic Papers First: Smart filtering prioritizing academic papers over books
  • šŸ”„ Smart Error Handling: Platform fallback and auto-retry mechanisms

📚 Supported Platforms

| Platform | Search | Download | Full Text | Citations | API Key | Special Features |
|---|---|---|---|---|---|---|
| arXiv | ✅ | ✅ | ✅ | ❌ | ❌ | Physics/CS preprints |
| Web of Science | ✅ | ❌ | ❌ | ✅ | ✅ Required | High-quality journal index |
| PubMed | ✅ | ❌ | ❌ | ❌ | 🟡 Optional | Biomedical literature |
| Google Scholar | ✅ | ❌ | ❌ | ✅ | ❌ | Comprehensive academic search |
| bioRxiv | ✅ | ✅ | ✅ | ❌ | ❌ | Biology preprints |
| medRxiv | ✅ | ✅ | ✅ | ❌ | ❌ | Medical preprints |
| Semantic Scholar | ✅ | ✅ | ❌ | ✅ | 🟡 Optional | AI semantic search |
| IACR ePrint | ✅ | ✅ | ✅ | ❌ | ❌ | Cryptography papers |
| Sci-Hub | ✅ | ✅ | ❌ | ❌ | ❌ | Universal paper access via DOI |
| ScienceDirect | ✅ | ❌ | ❌ | ✅ | ✅ Required | Elsevier's full-text database |
| Springer Nature | ✅ | ✅* | ❌ | ❌ | ✅ Required | Dual API: Meta v2 & OpenAccess |
| Wiley | ✅ | ✅ | ❌ | ❌ | ✅ Required | Text and Data Mining API |
| Scopus | ✅ | ❌ | ❌ | ✅ | ✅ Required | Largest citation database |

✅ Supported | ❌ Not supported | 🟡 Optional | ✅* Open Access only

🚀 Quick Start

System Requirements

  • Node.js >= 18.0.0
  • npm or yarn

Installation

# Clone repository
git clone https://github.com/your-username/paper-search-mcp-nodejs.git
cd paper-search-mcp-nodejs

# Install dependencies
npm install

# Copy environment template
cp .env.example .env

Configuration

  1. Get Web of Science API Key

  2. Get PubMed API Key (Optional)

    • Without API key: Free usage, 3 requests/second limit
    • With API key: 10 requests/second, more stable service
    • Get key: See NCBI API Keys
  3. Configure Environment Variables

    # Edit .env file
    WOS_API_KEY=your_actual_api_key_here
    WOS_API_VERSION=v1
    
    # PubMed API key (optional, recommended for better performance)
    PUBMED_API_KEY=your_ncbi_api_key_here
    
    # Semantic Scholar API key (optional, increases rate limits)
    SEMANTIC_SCHOLAR_API_KEY=your_semantic_scholar_api_key
    
    # Elsevier API key (required for ScienceDirect and Scopus)
    ELSEVIER_API_KEY=your_elsevier_api_key
    
    # Springer Nature API keys (required for Springer)
    SPRINGER_API_KEY=your_springer_api_key  # For Metadata API v2
    # Optional: Separate key for OpenAccess API (if different from main key)
    SPRINGER_OPENACCESS_API_KEY=your_openaccess_api_key
    
    # Wiley TDM token (required for Wiley)
    WILEY_TDM_TOKEN=your_wiley_tdm_token
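
To see at a glance which platforms the variables above enable, a small check along the following lines can help. This is a minimal sketch, not the server's own code: `checkKeys`, `REQUIRED_KEYS`, and `OPTIONAL_KEYS` are hypothetical names, and the mapping is taken only from the comments in the `.env` template and the API Key column of the platform table.

```typescript
// Hypothetical helper (not part of the project): reports which key-gated
// platforms are usable given a set of environment variables.
type Env = Record<string, string | undefined>;
type KeyStatus = "configured" | "missing-required" | "missing-optional";

// Platforms whose API key is marked "Required" in the platform table.
const REQUIRED_KEYS: Record<string, string> = {
  webofscience: "WOS_API_KEY",
  sciencedirect: "ELSEVIER_API_KEY",
  scopus: "ELSEVIER_API_KEY",
  springer: "SPRINGER_API_KEY",
  wiley: "WILEY_TDM_TOKEN",
};

// Platforms that work without a key but get higher rate limits with one.
const OPTIONAL_KEYS: Record<string, string> = {
  pubmed: "PUBMED_API_KEY",
  semantic: "SEMANTIC_SCHOLAR_API_KEY",
};

function checkKeys(env: Env): Record<string, KeyStatus> {
  // Treat unset, empty, and whitespace-only values as missing.
  const isSet = (name: string): boolean => (env[name] ?? "").trim() !== "";
  const status: Record<string, KeyStatus> = {};
  for (const [platform, name] of Object.entries(REQUIRED_KEYS)) {
    status[platform] = isSet(name) ? "configured" : "missing-required";
  }
  for (const [platform, name] of Object.entries(OPTIONAL_KEYS)) {
    status[platform] = isSet(name) ? "configured" : "missing-optional";
  }
  return status;
}
```

In a Node.js process the function could be called as `checkKeys(process.env)`; the environment is passed in as a parameter so that the check stays easy to test.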
    

Build and Run

Method 1: NPX (Recommended for MCP)

# Direct run with npx (most common MCP deployment)
npx -y paper-search-mcp-nodejs

# Or install globally
npm install -g paper-search-mcp-nodejs
paper-search-mcp

Method 2: Local Development

# Build TypeScript code
npm run build

# Start server
npm start

# Or run in development mode
npm run dev

MCP Server Configuration

Add the following configuration to your Claude Desktop config file:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

NPX Configuration (Recommended)

{
  "mcpServers": {
    "paper-search-nodejs": {
      "command": "npx",
      "args": ["-y", "paper-search-mcp-nodejs"],
      "env": {
        "WOS_API_KEY": "your_web_of_science_api_key"
      }
    }
  }
}

Local Installation Configuration

{
  "mcpServers": {
    "paper_search_nodejs": {
      "command": "node",
      "args": ["/path/to/paper-search-mcp-nodejs/dist/server.js"],
      "env": {
        "WOS_API_KEY": "your_web_of_science_api_key"
      }
    }
  }
}

šŸ› ļø MCP Tools

search_papers

Search academic papers across multiple platforms

// Random platform selection (default behavior)
search_papers({
  query: "machine learning",
  platform: "all",      // Randomly selects one platform for efficiency
  maxResults: 10,
  year: "2023",
  sortBy: "date"
})

// Search specific platform
search_papers({
  query: "quantum computing",
  platform: "webofscience",  // Target specific platform
  maxResults: 5
})

Platform Selection Behavior:

  • platform: "all" - Randomly selects one platform for efficient, focused results
  • Specific platform - Searches only that platform
  • Available platforms: arxiv, webofscience/wos, pubmed, biorxiv, medrxiv, semantic, iacr, googlescholar/scholar, scihub, sciencedirect, springer, wiley, scopus
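
The selection rules above can be sketched as follows. This is an illustration under assumptions, not the server's actual code: `resolvePlatform`, `PLATFORMS`, and `ALIASES` are hypothetical names, and the uniform random pick for `"all"` is one plausible reading of "randomly selects one platform".

```typescript
// Canonical platform names, as listed under "Available platforms".
const PLATFORMS = [
  "arxiv", "webofscience", "pubmed", "biorxiv", "medrxiv", "semantic", "iacr",
  "googlescholar", "scihub", "sciencedirect", "springer", "wiley", "scopus",
] as const;
type Platform = (typeof PLATFORMS)[number];

// Short aliases accepted in place of the canonical names.
const ALIASES: Record<string, Platform | undefined> = {
  wos: "webofscience",
  scholar: "googlescholar",
};

// Hypothetical resolver: "all" picks one platform at random, anything else
// must be a known name or alias. The random source is injectable for tests.
function resolvePlatform(input: string, random: () => number = Math.random): Platform {
  const name = input.trim().toLowerCase();
  if (name === "all") {
    return PLATFORMS[Math.floor(random() * PLATFORMS.length)];
  }
  const canonical: string = ALIASES[name] ?? name;
  const match = PLATFORMS.find((p) => p === canonical);
  if (match === undefined) {
    throw new Error(`Unknown platform: ${input}`);
  }
  return match;
}
```

Because `"all"` resolves to a single platform, two identical calls may return results from different sources; pass a specific platform name when reproducible results matter.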

search_arxiv

Search arXiv preprints specifically

search_arxiv({
  query: "transformer neural networks",
  maxResults: 10,
  category: "cs.AI",
  author: "Vaswani"
})

search_webofscience

Search Web of Science database specifically

search_webofscience({
  query: "CRISPR gene editing",
  maxResults: 15,
  year: "2022",
  journal: "Nature"
})

search_pubmed

Search PubMed/MEDLINE biomedical literature database

search_pubmed({
  query: "COVID-19 vaccine efficacy",
  maxResults: 20,
  year: "2023",
  author: "Smith",
  journal: "New England Journal of Medicine",
  publicationType: ["Journal Article", "Clinical Trial"]
})

search_google_scholar

Search Google Scholar academic database

search_google_scholar({
  query: "machine learning",
  maxResults: 10,
  yearLow: 2020,
  yearHigh: 2023,
  author: "Bengio"
})

search_biorxiv / search_medrxiv

Search biology and medical preprints

search_biorxiv({
  query: "CRISPR",
  maxResults: 15,
  days: 30
})

search_semantic_scholar

Search Semantic Scholar AI semantic database

search_semantic_scholar({
  query: "deep learning",
  maxResults: 10,
  fieldsOfStudy: ["Computer Science"],
  year: "2023"
})

search_iacr

Search IACR ePrint cryptography archive

search_iacr({
  query: "zero knowledge proof",
  maxResults: 5,
  fetchDetails: true
})

search_scihub

Search and download papers from Sci-Hub using DOI or paper URL

search_scihub({
  doiOrUrl: "10.1038/nature12373",
  downloadPdf: true,
  savePath: "./downloads"
})

check_scihub_mirrors

Check health status of Sci-Hub mirror sites

check_scihub_mirrors({
  forceCheck: true  // Force fresh health check
})

download_paper

Download paper PDF files

download_paper({
  paperId: "2106.12345",  // or DOI for Sci-Hub
  platform: "arxiv",      // or "scihub" for Sci-Hub downloads
  savePath: "./downloads"
})

get_paper_by_doi

Get paper information by DOI

get_paper_by_doi({
  doi: "10.1038/s41586-023-12345-6",
  platform: "all"
})

get_platform_status

Check platform status and API keys

get_platform_status({})

📊 Data Model

All platform paper data is converted to a unified format:

interface Paper {
  paperId: string;           // Unique identifier
  title: string;            // Paper title
  authors: string[];        // Author list
  abstract: string;         // Abstract
  doi: string;             // DOI
  publishedDate: Date;     // Publication date
  pdfUrl: string;          // PDF link
  url: string;             // Paper page URL
  source: string;          // Source platform
  citationCount?: number;   // Citation count
  journal?: string;         // Journal name
  year?: number;           // Publication year
  categories?: string[];    // Subject categories
  keywords?: string[];      // Keywords
  // ... more fields
}
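
As an illustration of what conversion to the unified format involves, the sketch below maps a made-up raw record onto a subset of the `Paper` fields. `RawRecord` and `toPaper` are hypothetical; real platform responses have different shapes, and each searcher in the project does its own mapping.

```typescript
// Subset of the unified Paper interface shown above.
interface Paper {
  paperId: string;
  title: string;
  authors: string[];
  abstract: string;
  doi: string;
  publishedDate: Date;
  pdfUrl: string;
  url: string;
  source: string;
  year?: number;
}

// Hypothetical raw record; invented for this example only.
interface RawRecord {
  id: string;
  title: string;
  authorList: string; // e.g. "A. Author; B. Writer"
  summary?: string;
  doi?: string;
  published: string; // ISO 8601 date string
  link: string;
  pdf?: string;
}

function toPaper(raw: RawRecord, source: string): Paper {
  const publishedDate = new Date(raw.published);
  return {
    paperId: raw.id,
    // Collapse line breaks and repeated spaces that feeds often contain.
    title: raw.title.replace(/\s+/g, " ").trim(),
    authors: raw.authorList
      .split(";")
      .map((a) => a.trim())
      .filter((a) => a.length > 0),
    // Missing optional inputs become empty strings so every field is present.
    abstract: raw.summary ?? "",
    doi: raw.doi ?? "",
    publishedDate,
    pdfUrl: raw.pdf ?? "",
    url: raw.link,
    source,
    // Leave year unset when the date string could not be parsed.
    year: Number.isNaN(publishedDate.getTime()) ? undefined : publishedDate.getUTCFullYear(),
  };
}
```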

🔧 Development

Project Structure

src/
├── models/
│   └── Paper.ts                   # Paper data model
├── platforms/
│   ├── PaperSource.ts             # Abstract base class
│   ├── ArxivSearcher.ts           # arXiv searcher
│   ├── WebOfScienceSearcher.ts    # Web of Science searcher
│   ├── PubMedSearcher.ts          # PubMed searcher
│   ├── GoogleScholarSearcher.ts   # Google Scholar searcher
│   ├── BioRxivSearcher.ts         # bioRxiv/medRxiv searcher
│   ├── SemanticScholarSearcher.ts # Semantic Scholar searcher
│   ├── IACRSearcher.ts            # IACR ePrint searcher
│   ├── SciHubSearcher.ts          # Sci-Hub searcher with mirror management
│   ├── ScienceDirectSearcher.ts   # ScienceDirect (Elsevier) searcher
│   ├── SpringerSearcher.ts        # Springer Nature searcher (Meta v2 & OpenAccess APIs)
│   ├── WileySearcher.ts           # Wiley TDM API searcher
│   └── ScopusSearcher.ts          # Scopus citation database searcher
├── utils/
│   └── RateLimiter.ts             # Token bucket rate limiter
└── server.ts                      # MCP server main file
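
The tree lists `RateLimiter.ts` as a token bucket rate limiter. For readers unfamiliar with the technique, here is a minimal sketch of a token bucket; it is not the project's implementation, and the `TokenBucket` class, its constructor parameters, and the injectable clock are assumptions made for this example.

```typescript
// Minimal token bucket: the bucket holds up to `capacity` tokens and gains
// `refillPerSecond` tokens per second. Each request consumes one token.
class TokenBucket {
  private readonly capacity: number;
  private readonly refillPerSecond: number;
  private readonly now: () => number; // clock in milliseconds, injectable for tests
  private tokens: number;
  private lastRefill: number;

  constructor(capacity: number, refillPerSecond: number, now: () => number = Date.now) {
    this.capacity = capacity;
    this.refillPerSecond = refillPerSecond;
    this.now = now;
    this.tokens = capacity; // start full so an initial burst is allowed
    this.lastRefill = now();
  }

  // Add the tokens earned since the last call, capped at capacity.
  private refill(): void {
    const current = this.now();
    const elapsedSeconds = (current - this.lastRefill) / 1000;
    this.tokens = Math.min(this.capacity, this.tokens + elapsedSeconds * this.refillPerSecond);
    this.lastRefill = current;
  }

  // Returns true and consumes a token if one is available, else false.
  tryAcquire(): boolean {
    this.refill();
    if (this.tokens >= 1) {
      this.tokens -= 1;
      return true;
    }
    return false;
  }
}
```

For example, `new TokenBucket(3, 3)` would match the PubMed limit of 3 requests per second without an API key, and `new TokenBucket(10, 10)` the limit with a key.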

Adding New Platforms

  1. Create new searcher class extending PaperSource
  2. Implement required abstract methods
  3. Register new searcher in server.ts
  4. Add corresponding MCP tool
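
The first three steps might look roughly like the sketch below. The shape of `PaperSource` here (a platform name plus one abstract `search` method) is an assumption for illustration; the real abstract class in `src/platforms/PaperSource.ts` may declare different or additional methods. `InMemorySearcher`, `registerSearcher`, and the `searchers` map are likewise hypothetical.

```typescript
interface Paper {
  paperId: string;
  title: string;
  authors: string[];
  source: string;
}

interface SearchOptions {
  maxResults?: number;
}

// Assumed shape of the base class; check PaperSource.ts for the real one.
abstract class PaperSource {
  readonly platformName: string;
  constructor(platformName: string) {
    this.platformName = platformName;
  }
  abstract search(query: string, options?: SearchOptions): Promise<Paper[]>;
}

// Steps 1 and 2: extend the base class and implement its abstract methods.
// This toy searcher filters a fixed list instead of calling a remote API.
class InMemorySearcher extends PaperSource {
  private readonly papers: Paper[];
  constructor(papers: Paper[]) {
    super("inmemory");
    this.papers = papers;
  }
  async search(query: string, options: SearchOptions = {}): Promise<Paper[]> {
    const needle = query.toLowerCase();
    return this.papers
      .filter((p) => p.title.toLowerCase().includes(needle))
      .slice(0, options.maxResults ?? 10);
  }
}

// Step 3: a registry keyed by platform name, standing in for server.ts.
const searchers = new Map<string, PaperSource>();
function registerSearcher(searcher: PaperSource): void {
  searchers.set(searcher.platformName, searcher);
}
```

Step 4, exposing the searcher as an MCP tool, depends on how `server.ts` defines its tools and is not shown here.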

Testing

# Run tests
npm test

# Run linting
npm run lint

# Code formatting
npm run format

🌟 Platform-Specific Features

Springer Nature Dual API System

Springer Nature provides two APIs:

  1. Metadata API v2 (Main API)

    • Endpoint: https://api.springernature.com/meta/v2/json
    • Searches all Springer content (subscription + open access)
    • Requires API key from https://dev.springernature.com/
  2. OpenAccess API (Optional)

    • Endpoint: https://api.springernature.com/openaccess/json
    • Only searches open access content
    • May require separate API key or special permissions
    • Better for finding downloadable PDFs

// Search all Springer content
search_springer({
  query: "machine learning",
  maxResults: 10
})

// Search only open access papers
search_springer({
  query: "COVID-19",
  openAccess: true,  // Uses OpenAccess API if available
  maxResults: 5
})
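
How the `openAccess` flag could translate into a choice between the two endpoints is sketched below. The endpoint URLs come from the list above; everything else is an assumption for illustration, including the function name `buildSpringerUrl`, the fallback to the main key when no separate OpenAccess key is set, and the query parameter names `q`, `p`, and `api_key`, which should be checked against the Springer Nature API documentation.

```typescript
const META_ENDPOINT = "https://api.springernature.com/meta/v2/json";
const OPENACCESS_ENDPOINT = "https://api.springernature.com/openaccess/json";

interface SpringerKeys {
  apiKey: string;            // SPRINGER_API_KEY
  openAccessApiKey?: string; // SPRINGER_OPENACCESS_API_KEY, if configured
}

// Hypothetical URL builder; parameter names are assumptions (see lead-in).
function buildSpringerUrl(
  query: string,
  maxResults: number,
  openAccess: boolean,
  keys: SpringerKeys,
): string {
  const url = new URL(openAccess ? OPENACCESS_ENDPOINT : META_ENDPOINT);
  // Use the dedicated OpenAccess key when present, else fall back to the main key.
  const key = openAccess ? (keys.openAccessApiKey ?? keys.apiKey) : keys.apiKey;
  url.searchParams.set("q", query);
  url.searchParams.set("p", String(maxResults));
  url.searchParams.set("api_key", key);
  return url.toString();
}
```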

Web of Science Advanced Search

// Use Web of Science query syntax
search_webofscience({
  query: 'TS="machine learning" AND PY=2023',
  maxResults: 20
})

// Author search
search_webofscience({
  query: 'AU="Smith, J*"',
  maxResults: 10
})

// Journal search
search_webofscience({
  query: 'SO="Nature" AND PY=2022-2023',
  maxResults: 15
})

Supported Fields:

  • TS: Topic search
  • AU: Author
  • SO: Source journal
  • PY: Publication year
  • DO: DOI
  • TI: Title

Google Scholar Features

  • Academic Paper Priority: Automatically filters out books, prioritizes peer-reviewed papers
  • Citation Data: Provides citation counts and academic metrics
  • Anti-Detection: Smart request patterns to avoid blocking
  • Comprehensive Coverage: Searches across all academic publishers

Semantic Scholar Features

  • AI-Powered Search: Semantic understanding of queries
  • Citation Networks: Paper relationships and influence metrics
  • Open Access PDFs: Direct links to freely available papers
  • Research Fields: Filter by specific academic disciplines

Sci-Hub Features

  • Universal Access: Access papers using DOI or direct URLs
  • Mirror Network: Automatic detection and use of fastest available mirror (11+ mirrors)
  • Health Monitoring: Continuous monitoring of mirror site availability
  • Automatic Failover: Seamless switching between mirrors when one fails
  • Smart Retry: Automatic retry with different mirrors on failure
  • Response Time Optimization: Mirrors sorted by response time for best performance
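
The combination of response-time ordering and failover described above can be sketched generically. This is not the code in `SciHubSearcher.ts`: `MirrorHealth`, `rankMirrors`, and `fetchWithFailover` are hypothetical names, and the actual request is left to a caller-supplied `attempt` function.

```typescript
// Result of a health check for one mirror (hypothetical shape).
interface MirrorHealth {
  url: string;
  healthy: boolean;
  responseTimeMs: number;
}

// Keep only healthy mirrors and order them fastest first.
// filter() returns a new array, so the input is not mutated by sort().
function rankMirrors(mirrors: MirrorHealth[]): string[] {
  return mirrors
    .filter((m) => m.healthy)
    .sort((a, b) => a.responseTimeMs - b.responseTimeMs)
    .map((m) => m.url);
}

// Try each mirror in order and return the first successful result.
// If every mirror fails, throw one error that lists all failures.
async function fetchWithFailover<T>(
  mirrors: string[],
  attempt: (mirrorUrl: string) => Promise<T>,
): Promise<T> {
  const errors: string[] = [];
  for (const mirror of mirrors) {
    try {
      return await attempt(mirror);
    } catch (err) {
      errors.push(`${mirror}: ${err instanceof Error ? err.message : String(err)}`);
    }
  }
  throw new Error(`All mirrors failed: ${errors.join("; ")}`);
}
```

Keeping the ranking separate from the request logic means the ordering can be recomputed after each health check without touching the download path.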

šŸ“ License

MIT License - see file for details.

šŸ¤ Contributing

Contributions welcome! See for guidelines.

  1. Fork the project
  2. Create feature branch (git checkout -b feature/amazing-feature)
  3. Commit changes (git commit -m 'Add amazing feature')
  4. Push to branch (git push origin feature/amazing-feature)
  5. Open Pull Request

šŸ› Issue Reporting

If you encounter issues, please report them at GitHub Issues.

šŸ™ Acknowledgments

  • Original paper-search-mcp for the foundation
  • MCP community for the protocol standards

⭐ If this project helps you, please give it a star!