sbl-sdsc/mcp-genelab
If you are the rightful owner of mcp-genelab and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
The MCP GeneLab Server is a Model Context Protocol server that translates natural language queries into Cypher queries for execution against Neo4j endpoints, facilitating AI-assisted analysis of NASA's space biology experiments.
MCP GeneLab Server
A Model Context Protocol (MCP) server that converts natural language queries into Cypher queries and executes them against the configured Neo4j endpoints. Customized tools provide seamless access to the NASA GeneLab Knowledge Graph, enabling AI-assisted analysis of spaceflight experiments and their biological effects. This server allows researchers to query differential gene expression and DNA methylation data from NASA's space biology experiments through natural language interactions with AI assistants like Claude.
The GeneLab Knowledge Graph with data from NASA's GeneLab Data Repository, part of the NASA Open Science Data Repository (OSDR), can be integrated with biomedical knowledge from the SPOKE (Scalable Precision Medicine Open Knowledge Engine) knowledge graph. This integration connects spaceflight experimental results with a comprehensive biological context, including genes, proteins, anatomical structures, pathways, and diseases.
This server is part of the NSF-funded Proto-OKN Project (Prototype Open Knowledge Network). It's an extension of the Neo4j Cypher MCP server.
Features
- Natural Language Querying: Ask questions in plain English - no need to write complex graph queries
- NASA GeneLab Queries: Ask questions about spaceflight experiments in the NASA GeneLab knowledge graph
- Differential Gene Expression Analysis: Find genes that are upregulated or downregulated in spaceflight conditions compared to ground controls
- DNA Methylation Data: Access epigenetic changes observed in spaceflight experiments
- Multi-Organism Support: Query data across multiple model organisms including mice, rats, and other species used in space research
- Tissue-Specific Analysis: Filter results by specific organs, tissues, or cell types
- Biomedical Context Integration: Connect spaceflight gene expression changes to pathways, diseases, and other biological knowledge from SPOKE
- Federated Queries: Combine data from GeneLab with other Neo4j knowledge graphs for comprehensive biomedical analysis
- Multiple Access Methods: Use through Claude Desktop, VS Code with GitHub Copilot, or programmatically via the MCP protocol
- Pre-configured Endpoints: Ready-to-use access to both local and remote Neo4j databases containing the GeneLab Knowledge Graph
Prerequisites
Before installing the MCP server, ensure you have:
- Operating System: macOS, Linux, or Windows
- Client Application: One of the following:
- Claude Desktop with Pro or Max subscription
- VS Code Insiders with GitHub Copilot subscription
- Neo4j Knowledge Graphs:
Installation
Prerequisites
The MCP GeneLab server requires installing the uv
package manager on your system. If you don't have it installed, run:
# macOS/Linux
curl -LsSf https://astral.sh/uv/install.sh | sh
Check if uv
is in the PATH
which uv
If uv
is not found, add the following line to your ~/.zshrc
file for zsh or /.bash_profile
for bash.
export PATH="$HOME/.local/bin:$PATH"
Then reload the shell.
source ~/.zshrc # or source ~/.bash_profile
If you are using macOS 12
, you also need to install realpath
.
To check if you have realpath
installed, run:
which realpath
Download the Homebrew installer and click on the downloaded package to install Homebrew.
Then run the following command to install coreutils and check if realpath
is available.
brew install coreutils
which realpath
Windows
powershell -c "irm https://astral.sh/uv/install.ps1 | iex"
Note: Once
uv
is installed, theuvx
command in the configuration below will automatically download and run the latest version of the MCP server from PyPI when needed.
Claude Desktop Setup
Recommended for most users
-
Download and Install Claude Desktop
Visit https://claude.ai/download and install Claude Desktop for your operating system.
Requirements: Claude Pro or Max subscription is required for MCP server functionality.
-
Configure MCP Server
Option A: Download Pre-configured File (Recommended)
Download the pre-configured
claude_desktop_config.json
file with Neo4j endpoints from the repository and copy it to the appropriate location:macOS:
# Download the config file curl -o /tmp/claude_desktop_config.json https://raw.githubusercontent.com/sbl-sdsc/mcp-genelab/main/config/claude_desktop_config.json # Copy to Claude Desktop configuration directory cp /tmp/claude_desktop_config.json "$HOME/Library/Application Support/Claude/"
Windows PowerShell:
# Download the config file Invoke-WebRequest -Uri "https://raw.githubusercontent.com/sbl-sdsc/mcp-genelab/main/config/claude_desktop_config.json" -OutFile "$env:TEMP\claude_desktop_config.json" # Copy to Claude Desktop configuration directory Copy-Item "$env:TEMP\claude_desktop_config.json" "$env:APPDATA\Claude\"
Option B: Manual Configuration
Alternatively, you can manually edit the configuration file in Claude Desktop. Navigate to
Claude->Settings->Developer->Edit Config
to edit it.Below is an example of how to configure local and remote Neo4j endpoints. For remotely hosted Neo4j servers, update the url, username, and password.
{ "mcpServers": { "genelab-local-cypher": { "command": "uvx", "args": ["mcp-genelab"], "env": { "NEO4J_URI": "bolt://localhost:7687", "NEO4J_USERNAME": "neo4j", "NEO4J_PASSWORD": "neo4jdemo", "NEO4J_DATABASE": "spoke-genelab-v0.0.4", "INSTRUCTIONS": "Query the GeneLab Knowledge Graph to identify NASA spaceflight experiments containing omics datasets, specifically differential gene expression (transcriptomics) and DNA methylation (epigenomics) data." } }, "genelab-remote-cypher": { "command": "uvx", "args": ["mcp-genelab"], "env": { "NEO4J_URI": "bolt://remote_url:7687", "NEO4J_USERNAME": "username", "NEO4J_PASSWORD": "password", "NEO4J_DATABASE": "spoke-genelab-v0.0.4", "INSTRUCTIONS": "Query the GeneLab Knowledge Graph to identify NASA spaceflight experiments containing omics datasets, specifically differential gene expression (transcriptomics) and DNA methylation (epigenomics) data." } } }
Important: If you have existing MCP server configurations, do not use Option A as it will overwrite your existing configuration. Instead, use Option B and manually merge the Neo4j endpoints with your existing
mcpServers
configuration. -
Restart Claude Desktop
After saving the configuration file, quit Claude Desktop completely and restart it. The application needs to restart to load the new configuration and start the MCP servers.
-
Verify Installation
- Launch Claude Desktop
- Navigate to
Claude->Settings->Connectors
- Verify that the configured Neo4j endpoints appear in the connector list
- You can configure each service to always ask for permission or to run it unsupervised (recommended)
VS Code Setup
For advanced users and developers
-
Install VS Code Insiders
Download and install VS Code Insiders from https://code.visualstudio.com/insiders/
Note: VS Code Insiders is required as it includes the latest MCP (Model Context Protocol) features.
-
Install GitHub Copilot Extension
- Open VS Code Insiders
- Sign in with your GitHub account
- Install the GitHub Copilot extension
Requirements: GitHub Copilot subscription is required for MCP integration.
-
Configure MCP Server
Option A: Download Pre-configured File (Recommended)
Download the pre-configured
mcp.json
file from the repository and copy it to the appropriate location.macOS:
# Download the config file curl -o /tmp/mcp.json https://raw.githubusercontent.com/sbl-sdsc/mcp-genelab/main/config/mcp.json # Copy to VS Code Insiders configuration directory cp /tmp/mcp.json "$HOME/Library/Application Support/Code - Insiders/User/mcp.json"
Note: VS Code Insiders mcp.json file is identical to the claude_desktop_config.json file, except "mcpServer" is replaced by "server".
-
Use the MCP Server
- Open a new chat window in VS Code
- Select Agent mode
- Choose Claude Sonnet 4.5 or later model for optimal performance
- The MCP servers will automatically connect and provide knowledge graph access
Quick Start
Once configured, you can immediately start querying knowledge graphs through natural language prompts in Claude Desktop or VS Code chat interface. The AI assistant will automatically convert your natural language queries into appropriate Cypher queries, execute them against the configured endpoints, and return structured, interpretable results.
Example Queries
- Querying Specific MCP Servers
To direct a query to a specific MCP server, use the @ operator followed by the server name. For example:
@genelab-remote-cypher
See .
- Federated Queries Across Multiple Servers
You can also perform federated queries across multiple MCP servers.
@genelab-remote-cypher @spokeokn-remote-cypher
This will execute the query across both servers and combine the results as applicable.
See .
- Node Metadata
Describe the Assay node and its properties in @genelab-remote-cypher, and include an example for a ground control vs. space flight comparison.
See .
- Relationship Metadata
Describe the MEASURED_DIFFERENTIAL_EXPRESSION_ASmMG relationship and its properties in @genelab-remote-cypher, and include an example for a ground control vs. space flight comparison.
See .
- High-level overview of the GeneLab KG
Give a breakdown of missions, studies, and the type of technologies used in the studies.
See .
- Find upregulated genes in space flight:
Find the top 5 upregulated genes in mouse liver tissue during Space Flight.
See .
Development
Installing from Source
If you want to run a development version:
# Clone the repository
git clone https://github.com/sbl-sdsc/mcp-genelab.git
cd mcp-genelab
# Install dependencies
uv sync
Building and Publishing (maintainers only)
# Increment version number (major|minor|patch)
uv version --bump minor
# Build the package
uv build
# Publish to TestPyPI first (recommended)
uv publish --publish-url https://test.pypi.org/legacy/ --token pypi-YOUR_TEST_PYPI_TOKEN_HERE
# Test the deployment
For testing, add the following parameters to the `args` option.
"args": [
"--index-url",
"https://test.pypi.org/simple/",
"--extra-index-url",
"https://pypi.org/simple/",
"mcp-genelab"
]
# Publish to PyPI
uv publish --token pypi-YOUR_PYPI_TOKEN_HERE
API Reference
Available Tools
get_neo4j_schema
Lists all nodes, their attributes, and their relationships to other nodes in the Neo4j database.
Parameters:
- None
Returns:
- JSON array containing node labels, their attributes (with data types), and relationships to other nodes
Note: If this fails with a message that includes "Neo.ClientError.Procedure.ProcedureNotFound", the APOC plugin needs to be installed and enabled on the Neo4j database.
read_neo4j_cypher
Executes a read-only Cypher query on the Neo4j database.
Parameters:
query
(string, required): The Cypher query to executeparams
(object, optional): Parameters to pass to the Cypher query for parameterized queries
Returns:
- JSON object containing query results
Example:
MATCH (s:Study)-[:PERFORMED_SpAS]->(a:Assay)
WHERE s.organism = $organism
RETURN s.name, a.name
LIMIT 10
get_node_metadata
Retrieves metadata descriptions for all node types from MetaNode nodes in the knowledge graph.
Parameters:
- None
Returns:
- JSON array containing detailed descriptions of each node type's properties, including data types and semantic meanings
get_relationship_metadata
Retrieves descriptions of properties for all relationship types in the knowledge graph.
Parameters:
- None
Returns:
- JSON array containing descriptions of each relationship type and their properties
find_upregulated_genes
Finds genes that are upregulated in spaceflight conditions compared to ground control for a specific organism and tissue/organ/cell type.
Parameters:
organism
(string, required): The organism to search for (e.g., 'Mus musculus', 'Rattus norvegicus')material
(string, required): The organ, tissue, or cell type to search for (e.g., 'liver', 'hippocampus', 'skeletal muscle')factor_space_1
(string, optional): First experimental grouping, default: 'Ground Control'factor_space_2
(string, optional): Second experimental grouping, default: 'Space Flight'top_n
(integer, optional): Number of top upregulated genes to return, default: 100
Returns:
- JSON array of upregulated genes with symbols, log2 fold changes, adjusted p-values, and assay information
Note: Only compares experiments with matching experimental factors to ensure valid comparisons.
find_downregulated_genes
Finds genes that are downregulated in spaceflight conditions compared to ground control for a specific organism and tissue/organ/cell type.
Parameters:
organism
(string, required): The organism to search for (e.g., 'Mus musculus', 'Rattus norvegicus')material
(string, required): The organ, tissue, or cell type to search for (e.g., 'liver', 'hippocampus', 'skeletal muscle')factor_space_1
(string, optional): First experimental grouping, default: 'Ground Control'factor_space_2
(string, optional): Second experimental grouping, default: 'Space Flight'top_n
(integer, optional): Number of top downregulated genes to return, default: 100
Returns:
- JSON array of downregulated genes with symbols, log2 fold changes, adjusted p-values, and assay information
Note: Only compares experiments with matching experimental factors to ensure valid comparisons.
Command Line Interface
The MCP GeneLab server is configured through environment variables. When using uvx
, the configuration is specified in the MCP configuration file (e.g., claude_desktop_config.json
).
Required Environment Variables:
NEO4J_URI
: Neo4j database connection URI (e.g.,bolt://localhost:7687
)NEO4J_USERNAME
: Neo4j database usernameNEO4J_PASSWORD
: Neo4j database passwordNEO4J_DATABASE
: Neo4j database name (e.g.,spoke-genelab-v0.0.4
)INSTRUCTIONS
: Custom instructions for the MCP server describing its purpose
Troubleshooting
Common Issues
MCP server not appearing in Claude Desktop:
- Ensure you've completely quit and restarted Claude Desktop (not just closed the window)
- Check that your JSON configuration is valid (use a JSON validator)
- Verify that
uvx
is installed and accessible in your PATH
Connection errors:
- Check your internet connection
- Verify the SPARQL endpoint URL is correct and accessible
- Some endpoints may have rate limits or temporary downtime
Performance issues:
- Complex SPARQL queries may take time to execute
- Consider breaking down complex queries into smaller parts
- Check the endpoint's documentation for query best practices
License
This project is licensed under the BSD 3-Clause License. See the file for details.
Citation
If you use MCP GeneLab in your research, please cite the following works:
@software{rose2025mcp-genelab,
title={MCP GeneLab Server},
author={Rose, P.W. and Saravia-Butler, A.M. and Nelson, C.A. and Shi, Y. and Baranzini, S.E.},
year={2025},
url={https://github.com/sbl-sdsc/mcp-proto-okn}
}
@software{rose2025spoke-genelab,
title={NASA SPOKE-GeneLab Knowledge Graph},
author={Rose, P.W. and Nelson, C.A. and Saravia-Butler, A.M. and Gebre, S.G. and Soman, K. and Grigorev, K.A. and Sanders, L.M. and Costes, S.V. and Baranzini, S.E.},
year={2025},
url={https://github.com/BaranziniLab/spoke_genelab}
}
Related Publications
-
Nelson, C.A., Rose, P.W., Soman, K., Sanders, L.M., Gebre, S.G., Costes, S.V., Baranzini, S.E. (2025). "Nasa Genelab-Knowledge Graph Fabric Enables Deep Biomedical Analysis of Multi-Omics Datasets." NASA Technical Reports, 20250000723. Link
-
Sanders, L., Costes, S., Soman, K., Rose, P., Nelson, C., Sawyer, A., Gebre, S., Baranzini, S. (2024). "Biomedical Knowledge Graph Capability for Space Biology Knowledge Gain." 45th COSPAR Scientific Assembly, July 13-21, 2024. Link
Acknowledgments
Funding
This work is supported by:
- National Science Foundation Award #2333819: "Proto-OKN Theme 1: Connecting Biomedical information on Earth and in Space via the SPOKE knowledge graph"
Related Projects
- Proto-OKN Project - Prototype Open Knowledge Network initiative
- NASA Open Science Data Repository (OSDR) - Repository of multi-modal space life science data
- NASA GeneLab Data Repository - GeneLab data repository used to create the GeneLab KG
- NASA GeneLab KG - Git Repository for creating the GeneLab KG
- Model Context Protocol - AI assistant integration standard
- Original Neo4j Cypher MCP server - Base implementation reference
For questions, issues, or contributions, please visit our GitHub repository.