adrinjalali/sklearn-mcp
If you are the rightful owner of sklearn-mcp and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
A Modular Context Provider (MCP) server for data science projects, offering guidance, rules, and code examples for ML and data science workflows.
Data Science MCP Server
A Modular Context Provider (MCP) server for data science projects. This server exposes endpoints to provide guidance, rules, and code examples for ML and data science workflows, as well as general Python project best practices for agent-based systems.
Overview
This MCP server provides specialized knowledge and best practices for data science and machine learning projects, including:
- General Python project guidelines for agent development
- Best practices on using scikit-learn
- Guidelines on using additional libraries such as
skops
,skore
, andskrub
- Serialization and deployment
Endpoints return curated Markdown documents with actionable guidance for AI agents and developers.
Project Structure
ds_mcp/server.py
– Main FastAPI MCP server, exposes endpoints as MCP tools.ds_mcp/routers/workflow_guidance.md
– Markdown with best practices for DS/ML workflows.ds_mcp/routers/python_general.md
– Markdown with general Python project guidelines.ds_mcp/core/
– Configuration and utilities.tests/
– Test suite.
All endpoints that return static guidance use a shared utility to read Markdown documents from the routers/
directory.
Getting Started
Prerequisites
- pixi for environment management
Installation
-
Clone the repository
-
Set up the environment:
pixi install
-
Run the server:
pixi run mcp-server
Integration with AI Code Editors
Example: Windsurf
Add this to your Windsurf MCP config file:
{
"mcpServers": {
"ds-mcp": {
"command": "pixi",
"args": [
"run",
"--manifest-path",
"/path/to/ds-agent/ds-mcp/pixi.toml",
"mcp-server"
]
}
}
}
Replace /path/to/ds-agent/ds-mcp
with the actual path to this project on your system.
API Endpoints
-
/get_workflow_guidance_tool
Returns workflow guidance for data science/ML tasks as Markdown. Arguments:task_description
(str): Description of the DS/ML taskdata_type
(str, default: "tabular"): Type of datacontext
(str, optional): Additional context
-
/get_python_general_guidelines_tool
Returns general Python project guidelines as Markdown.
All endpoints are available via the MCP protocol and are documented for interactive exploration at /docs
when the server is running.
Development
- Use
ruff
for linting and formatting (88 char line length). - Pre-commit hooks are configured.
- Tests use
pytest
.
To contribute, please follow the guidelines in CONTRIBUTING.md
.