Toronto MCP Server

An MCP server that provides tools for intelligently querying, analyzing, and retrieving datasets from Toronto's CKAN-powered open data portal. It enables AI assistants to perform natural language searches, inspect data structures, and track dataset update frequencies across the city's open data catalog.

Toronto MCP Server: Toronto Open Data Tools

This project implements a Model Context Protocol (MCP) server for Toronto Open Data, deployable on Cloudflare Workers. It exposes a comprehensive set of tools for intelligently querying, analyzing, and retrieving datasets from Toronto's CKAN-powered open data portal, making them accessible to MCP-compatible clients like Claude Desktop, Cursor, and other AI assistants.

🚀 Live Server

Deployed at: https://toronto-mcp.s-a62.workers.dev

  • SSE Endpoint: https://toronto-mcp.s-a62.workers.dev/sse (for Claude Desktop)
  • MCP Endpoint: https://toronto-mcp.s-a62.workers.dev/mcp (for other clients)

What does it do?

  • Provides a remote MCP server exposing tools for Toronto's Open Data via the CKAN API
  • Intelligently discovers relevant datasets using advanced relevance scoring
  • Analyzes data freshness patterns with comprehensive update frequency tracking
  • Provides deep data structure insights including field analysis and schema information
  • Enables natural language querying of Toronto's 500+ open datasets
  • Supports comprehensive data analysis combining multiple analytical dimensions

🛠️ Features

Basic CKAN Tools

  • list_datasets: List all available datasets
  • search_datasets: Search datasets by keyword
  • get_package: Retrieve complete metadata for a dataset
  • get_first_datastore_resource_records: Get records from the first active resource
  • get_resource_records: Get records from a specific resource by ID
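
As a rough illustration of how these tools could map onto the CKAN Action API, the sketch below builds request URLs and unwraps CKAN's `{ success, result }` envelope. The base URL, the `ckanUrl` and `callCkan` helpers, and the tool-to-action mapping are assumptions made for this example; they are not taken from `src/ckanTools.ts`.

```typescript
// Assumption: base URL of Toronto's CKAN instance; confirm against the portal docs.
const CKAN_BASE = "https://ckan0.cf.opendata.inter.prod-toronto.ca/api/3/action";

type CkanParams = Record<string, string | number>;

// Build the request URL for a CKAN action such as package_search.
function ckanUrl(action: string, params: CkanParams = {}): string {
  const url = new URL(`${CKAN_BASE}/${action}`);
  for (const [key, value] of Object.entries(params)) {
    url.searchParams.set(key, String(value));
  }
  return url.toString();
}

// Hypothetical mapping from the tools listed above to standard CKAN actions.
const toolToAction: Record<string, string> = {
  list_datasets: "package_list",
  search_datasets: "package_search",
  get_package: "package_show",
  get_resource_records: "datastore_search",
};

// Call an action and unwrap the { success, result } envelope CKAN returns.
async function callCkan<T>(action: string, params: CkanParams = {}): Promise<T> {
  const response = await fetch(ckanUrl(action, params));
  if (!response.ok) {
    throw new Error(`CKAN request failed with HTTP ${response.status}`);
  }
  const body = (await response.json()) as { success: boolean; result: T };
  if (!body.success) {
    throw new Error(`CKAN action ${action} reported failure`);
  }
  return body.result;
}
```

With these helpers, a keyword search would look like `await callCkan(toolToAction.search_datasets, { q: "parking", rows: 5 })`.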

🧠 Advanced Analysis Tools

  • find_relevant_datasets: Intelligently find and rank datasets using relevance scoring (title, description, tags, organization)
  • analyze_dataset_updates: Analyze update frequencies with categorization (daily, weekly, monthly, quarterly, annually, irregular)
  • analyze_dataset_structure: Deep-dive into dataset structure with field definitions, data types, record counts, and optional data previews
  • get_data_categories: Explore all available organizations and topic groups
  • get_dataset_insights: Comprehensive analysis combining relevance ranking, update frequency, and data structure insights

💡 Use Cases

For AI Assistants & Researchers

  • "What traffic data is available in Toronto?" → Ranked datasets with relevance scores and update frequencies
  • "How current is Toronto's environmental data?" → Update frequency analysis across environmental datasets
  • "What fields are in the building permits dataset?" → Complete schema analysis with data types and sample records
  • "Give me insights about Toronto's budget data" → Comprehensive analysis with relevance, freshness, and structure
  • "Which datasets update daily?" → Frequency-based filtering and categorization

For Data Scientists & Analysts

  • Discover datasets relevant to specific research questions
  • Assess data quality and reliability through update patterns
  • Understand data structure before detailed analysis
  • Find related datasets across different city departments
  • Evaluate data completeness and field availability

🏗️ Tech Stack

  • Cloudflare Workers: Serverless deployment platform
  • Model Context Protocol (MCP): Standard for AI tool integrations
  • TypeScript: Type safety and modern development
  • Zod: Runtime parameter validation
  • CKAN API: Direct integration with Toronto Open Data

📁 Project Structure

toronto-mcp/
├── src/
│   ├── index.ts                # MCP server setup and routing
│   └── ckanTools.ts            # Toronto Open Data tools implementation
├── test-runner.ts              # Automated testing framework
├── test-deployment.ts          # Deployment validation script
├── claude-mcp-config.json      # Claude Desktop configuration
├── evaluation-guide.md         # Comprehensive testing strategies
├── example-usage.md            # Usage examples and patterns
├── testing-guide.md            # Automated testing documentation
└── README.md                   # This file

🚀 Quick Start

1. Deploy Your Own Instance

# Clone and deploy
git clone <your-repo>
cd toronto-mcp
npm install
wrangler deploy

2. Test Deployment

# Install testing dependencies
npm install tsx

# Test your deployment
npx tsx test-deployment.ts https://your-worker.workers.dev

3. Connect to Claude Desktop

Create or edit ~/Library/Application Support/Claude/claude_desktop_config.json (the macOS location; on Windows the file is %APPDATA%\Claude\claude_desktop_config.json):

{
  "mcpServers": {
    "toronto-mcp": {
      "command": "npx",
      "args": ["mcp-remote", "https://toronto-mcp.s-a62.workers.dev/sse"]
    }
  }
}

Restart Claude Desktop and start asking questions about Toronto's open data!

🧪 Testing & Validation

Quick Connectivity Test

npx tsx test-deployment.ts https://toronto-mcp.s-a62.workers.dev

Automated Testing Framework

# Run mock tests (validates framework)
npm test

# Test specific deployment
npm run test:deployment https://your-url.workers.dev

Manual Testing in Claude Desktop

Try these test queries to verify functionality:

  1. Basic Search: "Find datasets about parking in Toronto"
  2. Update Analysis: "How often does Toronto update traffic data?"
  3. Data Structure: "What fields are in Toronto's building permits data?"
  4. Comprehensive: "Give me insights about Toronto's environmental data"
  5. Categories: "What departments provide open data in Toronto?"

📊 Success Metrics

Your MCP server is working correctly when:

  • ✅ Claude consistently selects appropriate tools for queries
  • ✅ Results include relevance scores and rankings
  • ✅ Update frequency information is categorized correctly
  • ✅ Data structure analysis shows complete field information
  • ✅ Response times are under 10 seconds for complex queries
  • ✅ Error handling provides helpful messages

📚 Documentation

📘 Example Usage Guide

Concrete examples of how to use each MCP tool, including JSON parameters and expected responses. Essential for understanding tool capabilities and integration patterns.

📊 Evaluation & Testing Guide

Comprehensive testing strategies, quality metrics, and evaluation criteria. Includes manual test queries, performance benchmarks, and success metrics for validating MCP server functionality.

🧪 Automated Testing Framework

TypeScript test framework for programmatic validation, performance monitoring, and automated quality assurance. Includes executable test cases and CI/CD integration patterns.

⚙️ Claude Desktop Configuration

Ready-to-use MCP server configuration for Claude Desktop integration.

🎯 Example Tool Usage

Natural Language Queries (via AI Assistant)

"What traffic data is available in Toronto and how current is it?"
"Find housing development datasets with field information"
"Which Toronto datasets update daily?"
"Give me insights about budget and financial data"

Direct Tool Calls (for developers)

// Intelligent dataset discovery
await find_relevant_datasets({
  query: "traffic accidents",
  maxResults: 5,
  includeRelevanceScore: true,
});

// Update frequency analysis
await analyze_dataset_updates({
  query: "transportation",
  groupByFrequency: true,
});

// Complete data structure analysis
await analyze_dataset_structure({
  packageId: "building-permits",
  includeDataPreview: true,
  previewLimit: 10,
});

// Comprehensive insights
await get_dataset_insights({
  query: "housing development",
  maxDatasets: 3,
  includeUpdateFrequency: true,
  includeDataStructure: true,
});
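
The calls above are shorthand. On the wire, an MCP client invokes a tool by sending a JSON-RPC 2.0 request with the method `tools/call`. The sketch below shows what such a message might look like for `find_relevant_datasets`; the `buildToolCall` helper and the incrementing `id` scheme are illustrative assumptions, and in practice an MCP client library handles this framing.

```typescript
// Shape of an MCP tool invocation as a JSON-RPC 2.0 request.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Hypothetical request counter; real clients manage ids internally.
let nextId = 1;

// Wrap a tool name and its arguments in a tools/call request.
function buildToolCall(name: string, args: Record<string, unknown>): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id: nextId++,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

const request = buildToolCall("find_relevant_datasets", {
  query: "traffic accidents",
  maxResults: 5,
  includeRelevanceScore: true,
});

// Print the message that would be sent to the server.
console.log(JSON.stringify(request, null, 2));
```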

🔧 Available Scripts

npm run dev           # Start development server
npm run deploy        # Deploy to Cloudflare Workers
npm run test          # Run automated tests
npm run test:deployment  # Test specific deployment
npm run lint:fix      # Fix linting issues
npm run format        # Format code

🌟 Key Features

Intelligent Relevance Scoring

  • Weighted algorithm: Title (10pts) > Description (5pts) > Tags (3pts) > Organization (2pts)
  • Context-aware ranking: Matches user intent with appropriate datasets
  • Multi-keyword support: Handles complex queries effectively
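
To make the weighting concrete, here is a minimal sketch of how such a score could be computed. It assumes simple case-insensitive substring matching per keyword; the `DatasetSummary` shape and the `relevanceScore` and `rankDatasets` names are hypothetical, and the server's actual matching logic may differ.

```typescript
// Hypothetical minimal view of a dataset's searchable metadata.
interface DatasetSummary {
  title: string;
  description: string;
  tags: string[];
  organization: string;
}

// Point values as listed above.
const WEIGHTS = { title: 10, description: 5, tags: 3, organization: 2 };

// Sum the weights of every field that contains each query keyword.
function relevanceScore(query: string, dataset: DatasetSummary): number {
  const keywords = query.toLowerCase().split(/\s+/).filter(Boolean);
  const tagText = dataset.tags.join(" ").toLowerCase();
  let score = 0;
  for (const keyword of keywords) {
    if (dataset.title.toLowerCase().includes(keyword)) score += WEIGHTS.title;
    if (dataset.description.toLowerCase().includes(keyword)) score += WEIGHTS.description;
    if (tagText.includes(keyword)) score += WEIGHTS.tags;
    if (dataset.organization.toLowerCase().includes(keyword)) score += WEIGHTS.organization;
  }
  return score;
}

// Rank datasets by descending score, dropping those with no match.
function rankDatasets(query: string, datasets: DatasetSummary[], maxResults = 5): DatasetSummary[] {
  return datasets
    .map((dataset) => ({ dataset, score: relevanceScore(query, dataset) }))
    .filter((entry) => entry.score > 0)
    .sort((a, b) => b.score - a.score)
    .slice(0, maxResults)
    .map((entry) => entry.dataset);
}
```

Under this scheme a keyword that appears in the title always outranks one that appears only in the description or tags, which is the ordering the weights are meant to express.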

Comprehensive Update Analysis

  • Frequency categorization: Daily, weekly, monthly, quarterly, annually, irregular
  • Metadata inference: Analyzes patterns when explicit schedules aren't available
  • Quality assessment: Identifies stale vs. actively maintained datasets
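
One way to infer a category when no explicit schedule is published is to look at the typical gap between past update timestamps. The sketch below takes that approach; the day thresholds and the `categorizeByInterval` and `inferFrequency` names are assumptions chosen for illustration, not the server's actual rules.

```typescript
type UpdateFrequency = "daily" | "weekly" | "monthly" | "quarterly" | "annually" | "irregular";

const MS_PER_DAY = 24 * 60 * 60 * 1000;

// Assumed upper bounds (in days) for each category; real thresholds may differ.
const THRESHOLDS: Array<[number, UpdateFrequency]> = [
  [1.5, "daily"],
  [10, "weekly"],
  [45, "monthly"],
  [120, "quarterly"],
  [400, "annually"],
];

// Map a typical gap between updates to a frequency category.
function categorizeByInterval(days: number): UpdateFrequency {
  for (const [maxDays, label] of THRESHOLDS) {
    if (days <= maxDays) return label;
  }
  return "irregular";
}

// Infer a category from ISO timestamps using the median gap between updates.
function inferFrequency(timestamps: string[]): UpdateFrequency {
  const times = timestamps
    .map((value) => Date.parse(value))
    .filter((value) => !Number.isNaN(value))
    .sort((a, b) => a - b);
  if (times.length < 2) return "irregular"; // not enough history to judge
  const gaps = times
    .slice(1)
    .map((time, index) => (time - times[index]) / MS_PER_DAY)
    .sort((a, b) => a - b);
  return categorizeByInterval(gaps[Math.floor(gaps.length / 2)]);
}
```

The median is used rather than the mean so that a single long pause between updates does not push an otherwise regular dataset into a slower category.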

Deep Data Structure Insights

  • Complete schema analysis: Field names, types, constraints
  • Record statistics: Counts, completeness, data quality indicators
  • Sample data: Optional previews for quick assessment
  • Multi-resource support: Handles datasets with multiple files/formats
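
As an illustration of the record statistics described above, the sketch below computes a per-field completeness ratio from a datastore response. CKAN's `datastore_search` action returns a `fields` array of `{ id, type }` entries alongside `records`; the `summarizeFields` helper and the definition of "filled" used here are assumptions for the example.

```typescript
// Field descriptor as returned in a CKAN datastore_search result.
interface FieldInfo {
  id: string;
  type: string;
}

type DataRecord = Record<string, unknown>;

interface FieldSummary {
  name: string;
  type: string;
  completeness: number; // share of sampled records with a non-empty value, 0..1
}

// Treat null, undefined, and empty strings as missing values (an assumption).
function isFilled(value: unknown): boolean {
  return value !== null && value !== undefined && value !== "";
}

// Summarize each field's declared type and how often it is populated.
function summarizeFields(fields: FieldInfo[], records: DataRecord[]): FieldSummary[] {
  return fields.map((field) => {
    const filled = records.filter((record) => isFilled(record[field.id])).length;
    return {
      name: field.id,
      type: field.type,
      completeness: records.length === 0 ? 0 : filled / records.length,
    };
  });
}
```

Because the ratio is computed over whatever records are passed in, a small preview sample gives only an estimate of completeness for the full resource.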

🚀 Extending

To add more tools or data sources:

  1. Edit src/ckanTools.ts to add new tool functions
  2. Register new tools in src/index.ts
  3. Update type definitions and validation schemas
  4. Add corresponding tests in the testing framework

Example:

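// Assumes `server` (the MCP server instance) and `z` (Zod) are in scope, as in src/index.ts
server.tool("new_analysis_tool", { param: z.string() }, async ({ param }) => {
  // Placeholder: replace with the tool's real logic
  const result = `Analysis for ${param}`;
  return { content: [{ type: "text", text: result }] };
});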

🤝 Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Add tests for new functionality
  4. Ensure all tests pass: npm test
  5. Submit a pull request

📄 License

MIT License - see LICENSE file for details.


Built for intelligent open data discovery • Powered by Toronto Open Data & CKAN API • Enhanced for AI assistant integration
