DuckDB Eurostat MCP Server
Enables querying Eurostat data using natural language, powered by DuckDB for optimized performance.
README
DuckDB Eurostat MCP Server
A Model Context Protocol (MCP) server that enables querying Eurostat data using natural language. This server leverages the DuckDB Eurostat extension to translate human queries into SQL and execute them efficiently against the Eurostat database.
Features
- š£ļø Natural Language Queries - Ask questions in plain English, get SQL and data back
- š¤ Flexible LLM Support - Choose from Anthropic, OpenAI, Ollama (local), or Azure OpenAI
- š DuckDB Powered - Fast query execution with filter pushdown optimization
- š Rich Dataset Access - Access all Eurostat datasets (GDP, unemployment, population, etc.)
- š Dataset Discovery - Browse and search available datasets
- š Schema Inspection - Understand dataset structure before querying
- šÆ Direct SQL Support - Execute raw SQL for advanced use cases
- š Privacy Options - Use local LLMs with Ollama for complete data privacy
Key Differentiator
Unlike other Eurostat MCP implementations that use direct SDMX API calls, this server:
- Uses the DuckDB Eurostat extension for optimized data access
- Provides SQL-based querying with automatic filter pushdown
- Supports natural language to SQL translation via Claude API
- Offers better performance through DuckDB's query optimization
Installation
Prerequisites
- Python 3.10 or higher
- LLM Provider (choose one):
- Anthropic API key (default, recommended)
- OpenAI API key
- Ollama (local, free, no API key needed)
- Azure OpenAI (enterprise)
Install with pip
# Clone the repository
git clone https://github.com/dar4datascience/duckdb-eurostat-mcp.git
cd duckdb-eurostat-mcp
# Create virtual environment
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# Install with your preferred LLM provider
pip install -e ".[anthropic]" # Anthropic Claude (default)
# OR
pip install -e ".[openai]" # OpenAI GPT
# OR
pip install -e ".[ollama]" # Ollama (local)
# OR
pip install -e ".[azure]" # Azure OpenAI
# OR
pip install -e ".[all-providers]" # All providers
# For development (includes all providers)
pip install -e ".[dev]"
Configuration
LLM Provider Setup
The server supports multiple LLM providers. Choose the one that fits your needs:
Option 1: Anthropic Claude (Default, Recommended)
export LLM_PROVIDER=anthropic
export ANTHROPIC_API_KEY=your-api-key-here
Get API Key: https://console.anthropic.com/
Option 2: OpenAI GPT
export LLM_PROVIDER=openai
export OPENAI_API_KEY=your-api-key-here
Get API Key: https://platform.openai.com/api-keys
Option 3: Ollama (Local, Free, Private)
export LLM_PROVIDER=ollama
# No API key needed!
Setup:
- Install Ollama: https://ollama.ai/
- Pull a model:
ollama pull llama3.1 - Start Ollama:
ollama serve
Benefits: No API costs, complete privacy, works offline
Option 4: Azure OpenAI
export LLM_PROVIDER=azure
export AZURE_OPENAI_API_KEY=your-api-key-here
export AZURE_OPENAI_ENDPOINT=https://your-resource.openai.azure.com/
export AZURE_OPENAI_DEPLOYMENT=your-deployment-name
Claude Desktop Configuration
Add to your Claude Desktop config file (~/Library/Application Support/Claude/claude_desktop_config.json on macOS):
With Anthropic (default):
{
"mcpServers": {
"duckdb-eurostat": {
"command": "python",
"args": ["-m", "duckdb_eurostat_mcp.server"],
"env": {
"LLM_PROVIDER": "anthropic",
"ANTHROPIC_API_KEY": "your-api-key-here"
}
}
}
}
With OpenAI:
{
"mcpServers": {
"duckdb-eurostat": {
"command": "python",
"args": ["-m", "duckdb_eurostat_mcp.server"],
"env": {
"LLM_PROVIDER": "openai",
"OPENAI_API_KEY": "your-api-key-here"
}
}
}
}
With Ollama (local):
{
"mcpServers": {
"duckdb-eurostat": {
"command": "python",
"args": ["-m", "duckdb_eurostat_mcp.server"],
"env": {
"LLM_PROVIDER": "ollama"
}
}
}
}
See docs/LLM_PROVIDERS.md for detailed configuration guide.
Available Tools
1. query_eurostat
Query Eurostat data using natural language.
Example:
Query: "Get population data for Germany in 2020"
2. list_dataflows
List available Eurostat datasets with optional filtering.
Parameters:
provider(optional): Filter by provider (e.g., 'ESTAT')search(optional): Search term to filter by labellimit(optional): Maximum results (default: 50)
3. get_dataflow_structure
Get the structure (dimensions and concepts) of a specific dataset.
Parameters:
provider_id: Provider ID (e.g., 'ESTAT')dataflow_id: Dataset ID (e.g., 'DEMO_R_D2JAN')
4. execute_sql
Execute raw SQL queries against the DuckDB Eurostat database.
Parameters:
sql: SQL query to executelimit(optional): Maximum rows to return (default: 100)
5. list_providers
List all available Eurostat API endpoints/providers.
Usage Examples
Natural Language Queries
"Show unemployment rates for EU countries in 2023"
"What was the GDP of France from 2015 to 2020?"
"List population data for Germany by age group"
Direct SQL Queries
-- Get population data for Germany
SELECT * FROM EUROSTAT_Read('ESTAT', 'DEMO_R_D2JAN')
WHERE geo = 'DE' AND time_period = '2020'
LIMIT 10;
-- List available datasets about GDP
SELECT dataflow_id, label
FROM EUROSTAT_Dataflows(language := 'en')
WHERE label ILIKE '%GDP%'
LIMIT 20;
-- Get dataset structure
SELECT dimension, concept
FROM EUROSTAT_DataStructure('ESTAT', 'DEMO_R_D2JAN', language := 'en');
Popular Datasets
- DEMO_R_D2JAN - Population by age, sex, and NUTS-2 region
- UNE_RT_A - Unemployment rates
- NAMA_10_GDP - GDP and main components
- PRC_HICP_MIDX - HICP - Monthly index (inflation)
- COMEXT - International trade data
Development
Setup Development Environment
# Create virtual environment
python -m venv venv
source venv/bin/activate
# Install with dev dependencies
pip install -e ".[dev]"
Run Tests
# Run all tests
pytest
# Run with coverage
pytest --cov=duckdb_eurostat_mcp --cov-report=html
# Run specific test file
pytest tests/test_duckdb_manager.py -v
Code Quality
# Format code
black src/ tests/
# Lint code
ruff check src/ tests/
# Type check
mypy src/
Testing with MCP Inspector
# Install MCP Inspector
npx @modelcontextprotocol/inspector
# Run your server
python -m duckdb_eurostat_mcp.server
Project Structure
duckdb-eurostat-mcp/
āāā src/
ā āāā duckdb_eurostat_mcp/
ā āāā __init__.py
ā āāā server.py # Main MCP server
ā āāā duckdb_manager.py # DuckDB connection management
ā āāā query_translator.py # Natural language to SQL
āāā tests/
ā āāā test_server.py
ā āāā test_duckdb_manager.py
ā āāā test_query_translator.py
āāā .windsurf/
ā āāā workflows/ # Development workflows
ā āāā memories/ # Project documentation
āāā pyproject.toml
āāā README.md
Workflows
This project includes helpful workflows in .windsurf/workflows/:
- setup-and-test.md - Setup development environment and run tests
- deploy-to-github.md - Deploy to GitHub with CI/CD
- add-new-feature.md - Add new features to the server
Use them with: /setup-and-test, /deploy-to-github, /add-new-feature
Troubleshooting
DuckDB Extension Not Loading
Ensure you have internet connection on first run to download the extension.
API Key Errors
Verify ANTHROPIC_API_KEY is set correctly in your environment.
Query Performance
Use filters to reduce data volume:
WHERE geo = 'DE' AND time_period >= '2020'
See .windsurf/memories/common-issues.md for more solutions.
Contributing
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Make your changes
- Run tests (
pytest) - Format code (
black src/ tests/) - Commit changes (
git commit -m 'Add amazing feature') - Push to branch (
git push origin feature/amazing-feature) - Open a Pull Request
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgments
- DuckDB Eurostat Extension by ahuarte47
- Model Context Protocol by Anthropic
- Eurostat for providing the data
Related Projects
- eurostat-mcp - Alternative MCP implementation using direct SDMX API
- DuckDB - Fast in-process analytical database
Support
For issues and questions:
- Open an issue on GitHub
- Check
.windsurf/memories/common-issues.mdfor common problems - Review the workflows in
.windsurf/workflows/
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.