vduquette/toronto-open-data-mcp-server
If you are the rightful owner of toronto-open-data-mcp-server and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
The Toronto Open Data MCP Server provides direct access to Toronto's Open Data through the CKAN API, enabling efficient discovery, exploration, and querying of over 500 public datasets.
toronto_start_here
Essential first call that explains the workflow.
toronto_search_datasets
Find relevant datasets by keywords.
toronto_smart_data_helper
Intelligent data retrieval for specific datasets.
toronto_popular_datasets
Quick access to commonly used datasets.
Toronto Open Data MCP Server
An MCP (Model Context Protocol) server that provides direct access to Toronto's Open Data through the CKAN API. This server allows LLM agents to efficiently discover, explore, and query Toronto's 500+ public datasets.
Features
- 🔍 Smart Dataset Discovery: Search through 500+ Toronto datasets with intelligent suggestions
- 🧠 Intelligent Data Helper: Automatically handles both API and CSV data sources
- 📊 Flexible Querying: Support for filtering, sorting, and field selection on API datasets
- 📁 CSV Support: Automatic fetching and preview of downloadable CSV datasets
- 🤝 LLM-Friendly: Designed for collaborative use with web search when additional context is needed
- ✅ Robust Error Handling: Clear error messages with actionable suggestions
Installation
Option 1: Using uvx (Recommended for MCP)
The easiest way to use this server with any MCP client:
brew install uv
# No installation needed! uvx will handle everything
# Just use in your MCP client configuration:
uvx toronto-open-data-mcp-server
Option 2: Development Installation
-
Clone the repository:
git clone https://github.com/yourusername/toronto-open-data-mcp-server.git cd toronto-open-data-mcp-server
-
Install dependencies:
# Install main dependencies pip install -e . # Install test dependencies (optional) pip install -e ".[test]"
-
Run the server:
python main.py
MCP Client Configuration
Using uvx (Easiest Method)
Add this to your MCP client configuration:
{
"mcpServers": {
"Toronto Open Data Server": {
"command": "uvx",
"args": ["toronto-open-data-mcp-server"]
}
}
}
Configuration File Locations
- Claude Desktop:
~/Library/Application Support/Claude/claude_desktop_config.json
- Cursor: Check Cursor's MCP documentation for config location
- Other MCP clients: Refer to your client's documentation
Usage
Quick Start
The server provides several tools designed for LLM agents:
toronto_start_here()
- Essential first call that explains the workflowtoronto_search_datasets(query)
- Find relevant datasets by keywordstoronto_smart_data_helper(dataset_id, user_question)
- Intelligent data retrievaltoronto_popular_datasets()
- Quick access to commonly used datasets
Example Workflow
# 1. Start with guidance
toronto_start_here()
# 2. Search for relevant data
toronto_search_datasets("restaurant inspection")
# 3. Get data intelligently
toronto_smart_data_helper("dinesafe", "recent restaurant inspection failures")
# 4. Advanced filtering (if needed)
toronto_query_dataset_data("dinesafe",
filters={"establishment_status": "Conditional Pass"},
sort="inspection_date desc",
limit=10)
Popular Datasets
dinesafe
- Restaurant inspections and health scorestraffic-signals
- Traffic light locations and timingparks-facilities
- Parks, pools, and recreation facilitiesbusiness-licences
- Licensed businesses in Torontobuilding-permits
- Construction and renovation permits
Testing
This project includes comprehensive tests covering unit tests, integration tests, and workflow tests.
Prerequisites
Install test dependencies:
pip install -e ".[test]"
Running Tests
Quick Test Commands
# Run unit tests only (recommended for development)
python run_tests.py
# Run with verbose output
python run_tests.py --verbose
# Run with coverage report
python run_tests.py --coverage
# Run integration tests (hits real Toronto API)
python run_tests.py --integration
# Run all tests (unit + integration)
python run_tests.py --all
API Reference
Core Tools
toronto_start_here() -> str
Essential first call that provides workflow guidance and server capabilities.
toronto_search_datasets(query: str, limit: int = 10) -> str
Search Toronto datasets by keywords.
toronto_smart_data_helper(dataset_id: str, user_question: str, limit: int = 10) -> str
Intelligent helper that automatically handles both API and CSV data sources.
toronto_query_dataset_data(dataset_id: str, filters: Dict = None, fields: List = None, limit: int = 10, sort: str = None) -> str
Advanced querying with filtering and sorting for API datasets.
Utility Tools
toronto_popular_datasets() -> str
Quick access to commonly used datasets.
toronto_get_dataset_schema(dataset_id: str) -> str
Get field names and types for API datasets.
toronto_fetch_csv_data(csv_url: str, max_lines: int = 50) -> str
Fetch and preview CSV file content.
Architecture
- FastMCP Framework: Built on the FastMCP framework for easy tool definition
- CKAN API: Direct integration with Toronto's CKAN-based Open Data portal
- Collaborative Design: Works alongside web search rather than replacing it
- Error Recovery: Intelligent error handling with actionable suggestions
Contributing
- Fork the repository
- Create a feature branch
- Add tests for new functionality
- Run the test suite:
python run_tests.py --all
- Submit a pull request