meds-mcp by VISTA-Stanford - MCP Server

MEDS MCP

MEDS Model Context Protocol (MCP) Server and Client

Development Roadmap

Add support for general (non-Stanford) secure LLM client wrapper, e.g., aisuite
Implement retriever backend supporting faceted search (e.g., elasticsearch, meilisearch) for creating a single index over the STARR patient population.
Bake-off vector store retrievers for event/document embeddings vs. BM25 baseline
Support native OMOP queries via BigQuery

I. Launch MCP Server

Launch server

python src/meds_mcp/server/main.py \
--config configs/local.py

Test client

python scripts/test_mcp_client_sdk.py

Server Configuration YAML

Lightweight configuration for launching the MCP server

# Server settings
server:
  host: "0.0.0.0"
  port: 8000

# Data directories
data:
  # Ontology data directory
  ontology_dir: "data/athena_omop_ontologies"
  # Corpus/collections directory
  corpus_dir: "data/collections/dev-corpus"
  # Use lazy loading for ontology (true/false)
  use_lazy_ontology: false

# Logging settings
logging:
  level: "INFO"
  format: "%(asctime)s - %(name)s - %(levelname)s - %(message)s"

II. MCP Chat Demo

1. Apply for Access to MedAlign

[!CAUTION] The Stanford Dataset DUA prohibts sharing data with third parties including LLM API providers. We follow the guidelines for responsible use as originally outlined by PhysioNet:

If you are interested in using the GPT family of models, we suggest using one of the following services:

Azure OpenAI service. You'll need to opt out of human review of the data via this form. Reasons for opting out are: 1) you are processing sensitive data where the likelihood of harmful outputs and/or misuse is low, and 2) you do not have the right to permit Microsoft to process the data for abuse detection due to the data use agreement you have signed.

Amazon Bedrock. Bedrock provides options for fine-tuning foundation models using private labeled data. After creating a copy of a base foundation model for exclusive use, data is not shared back to the base model for training.

Google's Gemini via Vertex AI on Google Cloud Platform. Gemini doesn't use your prompts or its responses as data to train its models. If making use of additional features offered through the Gemini for Google Cloud Trusted Tester Program, you should obtain the appropriate opt-outs for data sharing, or otherwise not perform tasks that require the sharing of data.

Anthropic Claude. Claude does not use your prompts or its responses as data to train its models by default, and routine human review of data is not performed.

export REDIVIS_ACCESS_TOKEN="your_redivis_api_key_here"
python scripts/download_data.py medalign --files

[!TIP] Tumor Board Patients in MedAlign

These patients' records make mention of "tumor board".

125718675, 126035422, 126061094, 126394715, 126467596, 127672063, 127807353, 127850729, 127969918, 127980943, 128126942

2. Initalize and Launch the MCP Server

python src/meds_mcp/server/main.py \
--config configs/medalign.yaml

3. Launch Gradio Chat Demo

LLM performance for evidence citation varies wildly. You should use the latest frontier LLM available to you. Suggested LLMs available at Stanford:

apim:claude-3.7
apim:o3-mini
nero:gemini-2.5-pro (requires GCP/Nero Vertex API)

export VAULT_SECRET_KEY="your_apim_key_here"
python examples/mcp_chat_demo/evidence_review_demo.py \
--model apim:o3-mini \
--mcp_url "http://localhost:8000/mcp" \
--patient_id 127672063