Tilastokeskus StatFin MCP Server

Tilastokeskus StatFin MCP Server

Production-grade MCP server for Statistics Finland's StatFin database. Enables AI assistants like Claude to browse, search, and query Finnish statistical data.

Category
Visit Server

README

Tilastokeskus StatFin MCP Server

Production-grade Model Context Protocol (MCP) server for Statistics Finland's StatFin database. Enables AI assistants like Claude to browse, search, and query Finnish statistical data.

Purpose

This project makes Finland's official statistics accessible to AI assistants through the Model Context Protocol (MCP). It bridges the gap between natural language queries and the structured PxWeb API.

Why This Exists

The Problem: Statistics Finland maintains one of the world's most comprehensive national statistics databases with 4,500+ tables covering population, employment, housing, economy, and more. However, the PxWeb API requires:

  • Knowledge of table IDs, variable codes, and value codes
  • Understanding of the hierarchical data structure
  • Correct query formatting with specific filter types
  • Awareness of API rate limits and response sizes

This makes it difficult for users to answer simple questions like "What is Helsinki's population?" without significant technical knowledge.

The Solution: This MCP server provides 7 tools that allow AI assistants to:

  1. Discover relevant tables through natural language search
  2. Explore the database structure and available variables
  3. Query specific data with proper filtering and pagination
  4. Handle rate limiting, caching, and error recovery automatically

Use Cases

  • Journalists asking "How has unemployment changed since COVID?"
  • Researchers comparing regional population trends
  • Analysts tracking housing price divergence between Helsinki and other cities
  • Citizens curious about birth rates, migration, or energy consumption
  • Developers building applications that need Finnish statistical data

Design Philosophy

  • LLM-first: Tool descriptions, parameter hints, and output schemas are optimized for AI consumption
  • Guided workflow: Each tool guides the LLM to the next logical step
  • Fail-safe: Query size estimation prevents expensive API calls; rate limiting protects against quota exhaustion
  • Cache-smart: Historical data is cached until StatFin updates it, minimizing redundant API calls

Features

  • 7 MCP Tools for comprehensive data access
  • 149 subject areas covering population, employment, education, economy, environment, and more
  • ~4,500+ statistical tables with decades of historical data
  • Smart caching with timestamp validation - historical data stays cached until updated
  • Rate limiting (8 req/min per instance) to respect API limits
  • Multi-language support (Finnish, English, Swedish)

Documentation

  • ARCHITECTURE.md - Technical architecture and design decisions
  • BLOGS.md - Blog post ideas and data stories (English)
  • BLOGIT.md - Blog post ideas and data stories (Finnish)

Data Source & License

Statistics Finland StatFin Database

  • Official statistics of Finland
  • Data updated regularly (varies by table)
  • Free to use, no API key required

Data License: Statistics Finland data is licensed under CC BY 4.0. When using the data, provide attribution: "Source: Statistics Finland"

Installation

Local Installation

Clone and build the server to run on your machine.

git clone https://github.com/your-org/statfin-mcp.git
cd statfin-mcp
npm install
npm run build

<details> <summary>Claude Desktop</summary>

Add to your config file:

  • macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
  • Windows: %APPDATA%\Claude\claude_desktop_config.json
{
  "mcpServers": {
    "statfin": {
      "command": "node",
      "args": ["/absolute/path/to/statfin-mcp/dist/server.js"]
    }
  }
}

</details>

<details> <summary>Claude Code</summary>

claude mcp add statfin node /absolute/path/to/statfin-mcp/dist/server.js

Verify with:

claude mcp list

</details>

<details> <summary>OpenAI Codex / ChatGPT</summary>

Add to ~/.codex/config.toml:

[mcp.statfin]
command = "node"
args = ["/absolute/path/to/statfin-mcp/dist/server.js"]
transport = "stdio"

</details>

<details> <summary>Cursor</summary>

In Cursor settings, add MCP server with command:

  • Name: statfin
  • Command: node
  • Args: /absolute/path/to/statfin-mcp/dist/server.js

</details>


Remote Server

Connect to a hosted instance without local installation.

<details> <summary>Claude Code</summary>

claude mcp add --transport http statfin https://your-server.example.com/mcp

Verify with:

claude mcp list

</details>

<details> <summary>Claude Desktop</summary>

Add to your config file:

  • macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
  • Windows: %APPDATA%\Claude\claude_desktop_config.json
{
  "mcpServers": {
    "statfin": {
      "type": "http",
      "url": "https://your-server.example.com/mcp"
    }
  }
}

</details>

<details> <summary>OpenAI Codex / ChatGPT</summary>

Add to ~/.codex/config.toml:

[mcp.statfin]
url = "https://your-server.example.com/mcp"
transport = "http"

</details>

<details> <summary>Cursor</summary>

In Cursor settings, add MCP server:

  • Name: statfin
  • URL: https://your-server.example.com/mcp
  • Transport: HTTP

</details>


Running the Server

# Development mode (stdio transport, watch mode)
npm run dev

# Production HTTP server
npm start

# With custom port
PORT=8080 npm start

# Docker
docker-compose up --build

MCP Tools

Tool Description
search_statistics Search for tables by keyword (primary discovery)
list_subject_areas Browse all 149 topic areas
list_tables List tables in a subject area
get_table_metadata Get table structure and variables
get_variable_values Get all codes for a variable (regions, years)
query_table Execute data queries
get_api_status Check server health and rate limits

Example Workflows

Basic: Helsinki Population Trend

query_table({
  tableId: "statfin_vaerak_pxt_11re.px",
  selections: [
    { variable: "Alue", filter: "item", values: ["KU091"] },  // Helsinki
    { variable: "Ikä", filter: "item", values: ["SSS"] },      // All ages
    { variable: "Sukupuoli", filter: "item", values: ["SSS"] }, // Total
    { variable: "Vuosi", filter: "top", top: 10 }              // Last 10 years
  ]
})
// Returns: Helsinki population 2015-2024 (628K → 684K)

Advanced Example Queries

Education: University Student Employment by Field

Which fields of study have the highest employment rates for students?

// Table: Student employment by education level and field
query_table({
  tableId: "statfin_tyokay_pxt_13g2.px",
  selections: [
    { variable: "Koulutusaste", filter: "item", values: ["7"] },  // University level
    { variable: "Sukupuoli", filter: "item", values: ["SSS"] },   // All genders
    { variable: "Maakunta", filter: "item", values: ["SSS"] },    // Whole country
    { variable: "Vuosi", filter: "top", top: 3 }                  // Last 3 years
  ]
})
// Analyze: Compare IT vs humanities employment rates

Housing: Rent Price Trends by Postal Code

Track rental market changes in Helsinki city center vs suburbs

// Table: Free-market rental prices by postal code, quarterly
query_table({
  tableId: "statfin_asvu_pxt_13eb.px",
  selections: [
    { variable: "Postinumero", filter: "item", values: [
      "00100",  // Helsinki center (Kruununhaka)
      "00500",  // Sörnäinen
      "02100",  // Espoo Tapiola
      "01300"   // Vantaa Tikkurila
    ]},
    { variable: "Huoneluku", filter: "item", values: ["02"] },  // 2-room apartments
    { variable: "Vuosineljännes", filter: "top", top: 20 }      // 5 years quarterly
  ]
})
// Analyze: Which areas are gentrifying fastest?

Migration: International Migration Flows

Analyze emigration vs immigration patterns over decades

// Table: Migration by month and type
query_table({
  tableId: "statfin_muutl_pxt_119z.px",
  selections: [
    { variable: "Sukupuoli", filter: "item", values: ["SSS"] },
    { variable: "Tapahtumakuukausi", filter: "item", values: ["SSS"] },  // Annual totals
    { variable: "Tiedot", filter: "item", values: [
      "vm41",  // Immigration
      "vm42",  // Emigration
      "vm43"   // Net migration
    ]},
    { variable: "Vuosi", filter: "top", top: 30 }  // 30-year trend
  ]
})
// Analyze: How has Finland's migration balance changed since 1990?

Crime: Monthly Crime Statistics Trends

Track reported crimes by category with seasonal patterns

// Table: Reported crimes by month (preliminary data)
query_table({
  tableId: "statfin_rpk_pxt_13jt.px",
  selections: [
    { variable: "Rikosryhmä ja teonkuvauksen tarkenne", filter: "item", values: [
      "101T603",     // All crimes total
      "101T504X406", // Violent crimes
      "101T161"      // Property crimes
    ]},
    { variable: "Tiedot", filter: "item", values: ["rikokset_lkm"] },
    { variable: "Kuukausi", filter: "top", top: 60 }  // 5 years monthly
  ]
})
// Analyze: Seasonal crime patterns, COVID impact on crime rates

Electricity: Power Generation Mix Evolution

How has Finland's electricity production evolved toward renewables?

// Table: Electricity supply and production by source
query_table({
  tableId: "statfin_salatuo_pxt_11sr.px",
  selections: [
    { variable: "Tiedot", filter: "item", values: [
      "sahkon_tuot",       // Total production
      "vesivoima",         // Hydropower
      "tuulivoima",        // Wind power
      "ydinvoima",         // Nuclear
      "fossiiliset"        // Fossil fuels
    ]},
    { variable: "Vuosi", filter: "top", top: 25 }  // 25-year transition
  ]
})
// Analyze: Nuclear vs wind growth, fossil phase-out trajectory

Traffic: Road Accident Hotspots by Municipality

Which municipalities have the highest traffic accident rates?

// Table: Traffic accidents with injuries by area and road type
query_table({
  tableId: "statfin_ton_pxt_12qh.px",
  selections: [
    { variable: "Alue", filter: "item", values: [
      "SSS",    // Whole country (for comparison)
      "KU091",  // Helsinki
      "KU092",  // Vantaa
      "KU049",  // Espoo
      "KU837",  // Tampere
      "KU853"   // Turku
    ]},
    { variable: "Tielaji", filter: "item", values: ["SSS"] },
    { variable: "Osallinen", filter: "item", values: ["SSS"] },
    { variable: "Tiedot", filter: "item", values: [
      "konn",     // Total accidents
      "kuolonn",  // Fatal accidents
      "loukonn"   // Injury accidents
    ]},
    { variable: "Vuosi", filter: "top", top: 5 }
  ]
})
// Analyze: Per-capita accident rates, pedestrian vs vehicle involvement

Housing: Household Size and Building Type Changes

How are Finnish living patterns changing over time?

// Table: Households by size and building type
query_table({
  tableId: "statfin_asas_pxt_116a.px",
  selections: [
    { variable: "Talotyyppi", filter: "item", values: [
      "1",  // Detached houses
      "2",  // Attached houses
      "3",  // Apartment buildings
    ]},
    { variable: "Asuntokunnan koko", filter: "item", values: [
      "1",  // 1-person households
      "2",  // 2-person
      "3",  // 3-person
      "4"   // 4+ person
    ]},
    { variable: "Vuosi", filter: "top", top: 40 }  // Since 1985
  ]
})
// Analyze: Rise of single-person households, apartment living trends

Electric Vehicles: Adoption Rate by Vehicle Type

Track the EV transition in Finland's vehicle fleet

// Table: New vehicle registrations by fuel type
query_table({
  tableId: "statfin_merek_pxt_11ck.px",
  selections: [
    { variable: "Ajoneuvoluokka", filter: "item", values: ["01"] },  // Passenger cars
    { variable: "Käyttövoima", filter: "item", values: [
      "00",  // Total
      "01",  // Petrol
      "02",  // Diesel
      "04",  // Electric
      "05",  // Plug-in hybrid
    ]},
    { variable: "Vuosi", filter: "top", top: 15 }
  ]
})
// Analyze: EV market share growth, diesel decline post-2015

Multi-Step Analysis Patterns

Cross-Domain Analysis: Education → Employment → Income

1. Find education completion rates by field
2. Get employment statistics for recent graduates
3. Query income data by education level
4. Compare: Which fields offer best ROI?

Time-Series with Regional Breakdown

1. Get national trend (Alue: "SSS")
2. Compare major cities (KU091, KU837, KU853)
3. Identify regional divergence patterns
4. Correlate with local economic indicators

Demographic Shift Analysis

1. Query population by age groups (1990-2024)
2. Get migration data for same period
3. Query birth/death rates
4. Model: Aging population impact on workforce

Common Region Codes

Code Description
SSS Whole country (KOKO MAA)
MK01-MK19 Regions (maakunta)
KU091 Helsinki
KU092 Vantaa
KU049 Espoo

Development

# Type checking
npm run typecheck

# Run unit tests
npm run test:run

# Run tests with coverage
npm run test:coverage

# Run integration tests (requires API access)
npm run test:integration

# Fetch fresh test fixtures
npm run test:fixtures:fetch

Environment Variables

Variable Default Description
PORT 8080 HTTP server port
MCP_TRANSPORT stdio Set to "http" for HTTP transport
API_TOKEN - Optional authentication token
LOG_LEVEL info Logging level (debug, info, warn, error)

API Rate Limits

  • StatFin API: 30 requests/minute total
  • This server: 8 requests/minute per instance
  • Designed for up to 3 concurrent Cloud Run instances

License

This project (code) is licensed under MIT. The statistical data accessed through this server is provided by Statistics Finland under CC BY 4.0.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured