pophive-mcp-server

pophive-mcp-server

3.2

If you are the rightful owner of pophive-mcp-server and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

The PopHIVE MCP Server provides access to comprehensive public health data from Yale School of Public Health, including immunizations, respiratory diseases, and chronic diseases, through standardized MCP tools and resources.

PopHIVE MCP Server

A Model Context Protocol (MCP) server that provides access to PopHIVE (Population Health Information Visual Explorer) public health data from Yale School of Public Health. This server exposes comprehensive health surveillance data including immunizations, respiratory diseases, and chronic diseases through standardized MCP tools, resources, and prompts.

šŸŽÆ Production-Ready: All critical bugs fixed, enhanced error handling, and comprehensive dataset metadata included.

šŸ“¦ Desktop Extension Ready: Fully compliant with Anthropic's Desktop Extension (DXT) specification for one-click installation in Claude Desktop and other MCP-enabled applications.

What is PopHIVE?

PopHIVE (Population Health Information Visual Explorer) is Yale's comprehensive platform that aggregates near real-time public health data from authoritative sources including CDC surveillance systems, Epic Cosmos EHR networks, and Google Health Trends. It's an invaluable resource for epidemiologists, researchers, and public health professionals.

šŸ‘‰ Explore PopHIVE: https://www.pophive.org/

Recent Improvements

āœ… Critical Bug Fixes (v1.0.0)

  • Fixed Geography Filter Bug: The "national" geography filter now works correctly and returns proper results
  • Enhanced Search Algorithm: Search now includes both field names and values for comprehensive matching
  • Improved Error Handling: Added descriptive error messages with specific suggestions when searches return no results
  • Enhanced Dataset Metadata: Added geographic granularity, date ranges, key metrics, and data quality indicators

šŸ”§ Technical Improvements

  • Better Geography Handling: Supports "national", "US", and state-level filtering with proper normalization
  • Comprehensive Error Messages: Context-aware suggestions based on search terms and geography filters
  • Dataset Capability Documentation: Clear indication of which datasets support state vs national analysis
  • Production-Ready Error Handling: Graceful handling of edge cases with helpful user guidance

Overview

PopHIVE aggregates near real-time health data from multiple authoritative sources:

  • CDC National Immunization Survey (NIS): Gold-standard vaccination coverage data
  • Epic Cosmos EHR Network: Real-world clinical data from electronic health records
  • CDC Laboratory Surveillance (NREVSS): Respiratory virus test positivity rates
  • CDC Wastewater Surveillance (NWWS): Environmental viral monitoring
  • Google Health Trends: Population behavior and symptom search patterns

Features

šŸ”§ MCP Tools

  • filter_data: Filter datasets by state, date range, demographics, and conditions
  • compare_states: Compare health metrics across multiple states with statistical analysis
  • time_series_analysis: Analyze trends over time with aggregation options
  • get_available_datasets: Comprehensive catalog of all available datasets
  • search_health_data: Search across datasets for specific conditions or keywords

šŸ“Š MCP Resources

  • dataset://immunizations_nis: CDC National Immunization Survey data
  • dataset://immunizations_epic: Epic Cosmos immunization data by demographics
  • dataset://respiratory_ed: Emergency department visits for respiratory viruses
  • dataset://respiratory_lab: Laboratory test positivity rates
  • dataset://respiratory_wastewater: Wastewater viral surveillance data
  • dataset://respiratory_trends: Google search trends for respiratory symptoms
  • dataset://chronic_obesity: Obesity prevalence by state and age group
  • dataset://chronic_diabetes: Diabetes prevalence and glycemic control data

šŸ’” MCP Prompts

  • immunization_gaps: Analyze vaccination coverage gaps by demographics
  • respiratory_surge_detection: Detect and analyze respiratory disease surges
  • chronic_disease_trends: Analyze chronic disease prevalence trends
  • multi_source_analysis: Comprehensive analysis integrating multiple data sources

Installation

Option 1: Desktop Extension (Recommended)

For Claude Desktop users:

  1. Download the .dxt file from the releases page
  2. Double-click the file to open with Claude Desktop
  3. Click "Install" in the installation dialog
  4. Configure any required settings (update frequency, cache size)
  5. The extension will be automatically available in Claude Desktop

For other MCP-enabled applications:

  • Use the same .dxt file with any application supporting Desktop Extensions
  • Follow your application's extension installation process

Option 2: Manual Installation

Prerequisites:

  • Node.js 18+
  • npm or yarn

Setup:

  1. Clone and install dependencies:
git clone <repository-url>
cd pophive-mcp-server
npm install
  1. Configure environment (optional):
# Create .env file for custom configuration
echo "DATA_CACHE_DIR=./data" > .env
echo "UPDATE_FREQUENCY=daily" >> .env
  1. Test the server:
npm test
  1. Start the server:
npm start

Option 3: Build Your Own Extension

Create a Desktop Extension from source:

  1. Install DXT CLI tools:
npm install -g @anthropic-ai/dxt
  1. Clone and prepare:
git clone <repository-url>
cd pophive-mcp-server
npm install
  1. Package as extension:
dxt pack
  1. Install the generated .dxt file in Claude Desktop or other MCP applications

Configuration

Environment Variables

VariableDefaultDescription
DATA_CACHE_DIR./dataDirectory for cached data files
UPDATE_FREQUENCYdailyData refresh frequency (hourly, daily, weekly)
NODE_ENVdevelopmentEnvironment mode

MCP Client Configuration

Add to your MCP client configuration (e.g., Claude Desktop):

{
  "mcpServers": {
    "pophive": {
      "command": "node",
      "args": ["server/index.js"],
      "cwd": "/path/to/pophive-mcp-server"
    }
  }
}

Dataset Selection Guide

Choose the right dataset for your analysis:

DatasetGeographic LevelBest Use CasesDate RangeUpdate FrequencyKey Limitations
immunizations_nisNational + StateNational vaccination trends, state comparisons2019-2024AnnualSurvey data, limited demographics
immunizations_epicNational + StateReal-world vaccination patterns, insurance analysis2020-2024MonthlyEHR network bias
respiratory_edNational + StateEmergency department surveillance, outbreak detection2020-2024WeeklyHealthcare utilization only
respiratory_labNational onlyClinical test positivity, laboratory surveillance2020-2024WeeklyNational aggregates only
respiratory_wastewaterRegionalEnvironmental surveillance, early warning2022-2024WeeklyLimited geographic coverage
respiratory_trendsNational + StatePopulation behavior, symptom searches2020-2024WeeklyBehavioral proxy, not clinical
chronic_obesityNational + StateObesity prevalence, chronic disease tracking2020-2024QuarterlyClinical populations only
chronic_diabetesNational + StateDiabetes management, glycemic control2020-2024QuarterlyClinical populations only

Quick Dataset Selection

For national trends: Use immunizations_nis, respiratory_lab, or any dataset with geography="national"

For state comparisons: Use respiratory_ed, chronic_obesity, chronic_diabetes, or immunizations_nis

For real-time surveillance: Use respiratory_ed, respiratory_wastewater, or respiratory_trends

For clinical outcomes: Use immunizations_epic, chronic_obesity, or chronic_diabetes

Usage Examples

Basic Data Filtering

// āœ… WORKING: Filter immunization data for California
{
  "tool": "filter_data",
  "arguments": {
    "dataset": "immunizations_nis",
    "state": "CA"
  }
}

// āœ… WORKING: Filter national immunization data
{
  "tool": "filter_data",
  "arguments": {
    "dataset": "immunizations_nis",
    "state": "US"
  }
}

// āŒ AVOID: This will return 0 results
{
  "tool": "filter_data",
  "arguments": {
    "dataset": "respiratory_lab",
    "state": "CA"  // respiratory_lab only has national data
  }
}

State Comparison

// āœ… WORKING: Compare obesity rates across states
{
  "tool": "compare_states",
  "arguments": {
    "dataset": "chronic_obesity",
    "states": ["CA", "TX", "FL", "NY"],
    "metric": "prevalence_rate",
    "time_period": "latest"
  }
}

// āœ… WORKING: Compare vaccination coverage
{
  "tool": "compare_states",
  "arguments": {
    "dataset": "immunizations_nis",
    "states": ["California", "Texas", "New York"],  // Full names work too
    "metric": "coverage_rate"
  }
}

Time Series Analysis

// āœ… WORKING: Analyze national respiratory trends
{
  "tool": "time_series_analysis",
  "arguments": {
    "dataset": "respiratory_ed",
    "metric": "ed_visits_per_100k",
    "geography": "national",  // Use "national" for US-level data
    "aggregation": "weekly"
  }
}

// āœ… WORKING: Analyze state-level trends
{
  "tool": "time_series_analysis",
  "arguments": {
    "dataset": "respiratory_ed",
    "metric": "ed_visits_per_100k",
    "geography": "CA",
    "start_date": "2024-01-01",
    "end_date": "2024-12-01"
  }
}

Search Health Data

// āœ… WORKING: Search with national geography
{
  "tool": "search_health_data",
  "arguments": {
    "query": "RSV",
    "geography": "national"  // Fixed: Use "national" instead of "US"
  }
}

// āœ… WORKING: Search specific datasets
{
  "tool": "search_health_data",
  "arguments": {
    "query": "vaccination coverage",
    "datasets": ["immunizations_nis", "immunizations_epic"]
  }
}

Using Prompts

// āœ… WORKING: Generate immunization gap analysis
{
  "prompt": "immunization_gaps",
  "arguments": {
    "state": "Texas",
    "demographic_focus": "insurance"
  }
}

// āœ… WORKING: Detect respiratory surges
{
  "prompt": "respiratory_surge_detection",
  "arguments": {
    "region": "California",
    "virus_type": "RSV",
    "time_period": "last_4_weeks"
  }
}

Common Issues & Solutions

Issue: "No data found" or 0 results

Cause: Geographic mismatch or dataset limitations

Solutions:

  1. Check dataset capabilities: Use get_available_datasets to see supported geographies
  2. Use correct geography values:
    • For national data: "geography": "national" (not "US")
    • For states: Use state codes ("CA") or full names ("California")
  3. Try alternative datasets: Some datasets only support national-level analysis
// āŒ Problem: Wrong geography for national data
{
  "tool": "search_health_data",
  "arguments": {
    "query": "influenza",
    "geography": "US"  // Should be "national"
  }
}

// āœ… Solution: Use correct geography
{
  "tool": "search_health_data",
  "arguments": {
    "query": "influenza",
    "geography": "national"
  }
}

Issue: Empty results for state-level queries

Cause: Dataset only contains national-level data

Solutions:

  1. Check dataset metadata first using get_available_datasets
  2. Use state-capable datasets: respiratory_ed, chronic_obesity, chronic_diabetes, immunizations_nis
  3. Switch to national analysis for datasets like respiratory_lab

Issue: Metric not found

Cause: Incorrect metric name or dataset mismatch

Solutions:

  1. Use dataset-appropriate metrics:
    • Immunizations: coverage_rate, sample_size
    • Respiratory: ed_visits_per_100k, positivity_rate
    • Chronic: prevalence_rate, patient_count
  2. Check sample data using get_available_datasets with include_sample: true

Working Parameter Combinations

Immunization Analysis

// National vaccination trends
{
  "tool": "time_series_analysis",
  "arguments": {
    "dataset": "immunizations_nis",
    "metric": "coverage_rate",
    "geography": "national"
  }
}

// State vaccination comparison
{
  "tool": "compare_states",
  "arguments": {
    "dataset": "immunizations_nis",
    "states": ["CA", "TX", "NY", "FL"],
    "metric": "coverage_rate"
  }
}

Respiratory Surveillance

// Emergency department trends
{
  "tool": "filter_data",
  "arguments": {
    "dataset": "respiratory_ed",
    "state": "CA",
    "condition": "RSV"
  }
}

// National lab surveillance
{
  "tool": "time_series_analysis",
  "arguments": {
    "dataset": "respiratory_lab",
    "metric": "positivity_rate",
    "geography": "national"
  }
}

Chronic Disease Analysis

// Obesity prevalence by state
{
  "tool": "filter_data",
  "arguments": {
    "dataset": "chronic_obesity",
    "state": "TX",
    "age_group": "18-64"
  }
}

// Diabetes trends
{
  "tool": "time_series_analysis",
  "arguments": {
    "dataset": "chronic_diabetes",
    "metric": "prevalence_rate",
    "geography": "CA"
  }
}

Data Sources & Quality

Immunization Data

  • NIS Data: Household survey, gold standard for coverage rates
  • Epic Cosmos: EHR data with demographic breakdowns
  • Update Frequency: Annual (NIS), Monthly (Epic)
  • Geographic Level: State
  • Quality: High confidence, large sample sizes

Respiratory Disease Surveillance

  • ED Visits: Near real-time healthcare utilization
  • Lab Data: Clinical test positivity rates
  • Wastewater: Environmental viral monitoring (early indicator)
  • Search Trends: Population behavior signals
  • Update Frequency: Weekly
  • Quality: High for clinical data, moderate for environmental/behavioral

Chronic Disease Data

  • Source: Epic Cosmos EHR network
  • Metrics: Clinical measurements (BMI, HbA1c)
  • Update Frequency: Quarterly
  • Geographic Level: State with age stratification
  • Quality: High - real-world clinical data

API Reference

Tools

filter_data

Filter datasets by various criteria.

Parameters:

  • dataset (required): Dataset identifier
  • state (optional): State code or name
  • start_date (optional): Start date (YYYY-MM-DD)
  • end_date (optional): End date (YYYY-MM-DD)
  • age_group (optional): Age group filter
  • condition (optional): Condition/metric filter
compare_states

Compare health metrics across multiple states.

Parameters:

  • dataset (required): Dataset identifier
  • states (required): Array of state codes/names
  • metric (required): Metric to compare
  • time_period (optional): Time period for comparison
time_series_analysis

Analyze trends over time.

Parameters:

  • dataset (required): Dataset identifier
  • metric (required): Metric to analyze
  • geography (optional): Geographic focus
  • start_date (optional): Analysis start date
  • end_date (optional): Analysis end date
  • aggregation (optional): Time aggregation (weekly, monthly, quarterly, yearly)

Resources

All resources return JSON data with standardized schemas:

// Example immunization record
{
  "geography": "CA",
  "year": 2024,
  "vaccine": "MMR",
  "age_group": "19-35 months",
  "coverage_rate": 96.1,
  "sample_size": 1876,
  "source": "CDC NIS"
}

// Example respiratory surveillance record
{
  "geography": "US",
  "date": "2024-12-01",
  "week": "2024-48",
  "virus": "RSV",
  "ed_visits_per_100k": 3.8,
  "percent_change": 15.2,
  "source": "Epic Cosmos"
}

Development

Project Structure

pophive-mcp-server/
ā”œā”€ā”€ server/
│   ā”œā”€ā”€ index.js                 # Main MCP server
│   ā”œā”€ā”€ utils/
│   │   └── data-loader.js       # Data loading and caching
│   ā”œā”€ā”€ tools/
│   │   └── analysis-tools.js    # MCP tool implementations
│   ā”œā”€ā”€ prompts/
│   │   └── prompt-templates.js  # MCP prompt templates
│   └── scrapers/
│       ā”œā”€ā”€ immunizations.js     # Immunization data scraper
│       ā”œā”€ā”€ respiratory.js       # Respiratory data scraper
│       └── chronic-diseases.js  # Chronic disease data scraper
ā”œā”€ā”€ data/                        # Cached data files
ā”œā”€ā”€ package.json
ā”œā”€ā”€ manifest.json               # MCP server manifest
└── README.md

Adding New Data Sources

  1. Create a scraper in server/scrapers/
  2. Update data loader to include new datasets
  3. Add resource mappings in the main server
  4. Update tool logic to handle new data types
  5. Create prompts for new analysis types

Testing

# Run all tests
npm test

# Test specific components
npm run test:tools
npm run test:scrapers
npm run test:integration

Data Refresh

The server automatically refreshes data based on the UPDATE_FREQUENCY setting. Manual refresh:

npm run refresh-data

Troubleshooting

Common Issues

Server won't start:

  • Check Node.js version (18+ required)
  • Verify all dependencies installed: npm install
  • Check for port conflicts

No data returned:

  • Data may be initializing on first run
  • Check data directory permissions
  • Verify network connectivity for scraping

MCP client connection issues:

  • Verify server path in client configuration
  • Check server logs for errors
  • Ensure MCP client supports stdio transport

Logging

Server logs are written to stderr and include:

  • Data scraping activities
  • Tool execution results
  • Error messages and stack traces

Enable verbose logging:

DEBUG=pophive:* npm start

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make changes with tests
  4. Submit a pull request

Code Style

  • Use ESLint configuration
  • Follow existing patterns
  • Add JSDoc comments for public APIs
  • Include error handling

License

MIT License - see LICENSE file for details.

Support

  • Issues: GitHub Issues
  • Documentation: This README and inline code comments
  • Data Questions: Refer to original PopHIVE sources

Acknowledgments

  • Yale School of Public Health for PopHIVE initiative
  • CDC for surveillance data systems
  • Epic Systems for Cosmos EHR network data
  • Model Context Protocol community for standards