esg-mcp-servers
Open-source MCP servers for ESG data extraction, analysis, and regulation management, providing 31 tools across 6 servers for tasks like metrics extraction, PDF processing, vector storage, and web scraping.
README
ESG MCP Servers
<!-- mcp-name: io.github.freminder/esg-mcp-servers -->
Open-source Model Context Protocol servers for ESG (Environmental, Social, and Governance) data extraction, analysis, and regulation management.
31 tools across 6 servers — install once, run only what you need.
Author: Ioannis Michos (johnmichos.tf@gmail.com)
Quick Start
pip install esg-mcp-servers
Prerequisites
| Service | Required for |
|---|---|
| PostgreSQL 16 + pgvector | Vector storage, metrics, regulations |
| MongoDB 7 | PDF binary storage (GridFS) |
| Anthropic API key | RAG queries, metric extraction |
Spin up local databases with Docker:
docker compose up -d
Run the database migration:
esg-mcp-migrate
Environment Variables
Copy .env.example and fill in your values:
cp .env.example .env
Key variables:
| Variable | Default | Description |
|---|---|---|
POSTGRES_DSN |
postgresql://esg:esg@localhost/esg_platform |
PostgreSQL connection string |
MONGODB_URI |
mongodb://localhost:27017 |
MongoDB connection string |
ANTHROPIC_API_KEY |
— | Required for RAG and LLM extraction |
EMBEDDING_MODEL |
Snowflake/snowflake-arctic-embed-l-v2.0 |
Sentence-transformer model |
EMBEDDING_DIMENSIONS |
1024 |
Embedding vector size |
Servers & Tools
esg-metrics-extractor (11 tools)
ESRS-aligned KPI extraction for emissions, energy, water, waste, social, and governance domains.
| Tool | Description |
|---|---|
extract_emissions_data |
GHG Scope 1, 2, 3 extraction |
extract_energy_data |
Energy consumption & renewables (ESRS E2) |
extract_water_data |
Water withdrawal, discharge, consumption (ESRS E3) |
extract_waste_data |
Waste generation, recycling, landfill (ESRS E5) |
extract_social_data |
Workforce, diversity, H&S (ESRS S1) |
extract_governance_data |
Board composition & governance (ESRS G1) |
answer_esg_query |
Free-text RAG Q&A over documents |
keyword_similarity_search |
Semantic keyword search in documents |
detect_query_domain |
Classify query into ESG domain |
detect_emissions_query_type |
Detect emissions scope/year from query |
batch_extract_metrics |
Extract all KPIs and persist to DB |
esg-pdf-processor (5 tools)
PDF validation, text/table extraction, and embedding generation.
| Tool | Description |
|---|---|
verify_esg_report |
RandomForest ESG report classifier |
extract_text_chunks |
Extract and chunk PDF text |
extract_tables |
Table extraction with OCR fallback |
generate_embeddings |
Batch embedding generation |
process_pdf_full_pipeline |
End-to-end: extract, embed, store |
esg-vector-store (5 tools)
pgvector CRUD operations for document chunks and query cache.
| Tool | Description |
|---|---|
upsert_document_chunks |
Insert/update chunks with embeddings |
similarity_search |
Cosine similarity search |
get_cached_query_response |
Retrieve cached LLM responses |
cache_query_response |
Store LLM response in cache |
list_documents |
List indexed documents |
esg-regulations (5 tools)
EU ESG regulation download, ingestion, and semantic search.
| Tool | Description |
|---|---|
download_regulation |
Download regulation PDF from EUR-Lex |
download_all_regulations |
Batch download all configured regulations |
ingest_regulation |
Extract, parse articles, embed, store |
search_regulation_text |
Semantic search across regulation articles |
list_regulations |
List ingested regulations |
esg-scraper (5 tools)
ESG report discovery and download from the web.
| Tool | Description |
|---|---|
search_esg_reports |
Multi-engine search for ESG PDFs |
crawl_company_website |
Deep-crawl website for PDF links |
download_pdf |
Download PDF and store in GridFS |
get_scraping_status |
Check scrape job status |
cancel_scraping_job |
Cancel running scrape job |
esg-mcp-all (31 tools)
All tools from all servers in a single process.
Claude Desktop Configuration
Add to your claude_desktop_config.json:
{
"mcpServers": {
"esg-metrics": {
"command": "esg-metrics-extractor"
},
"esg-pdf": {
"command": "esg-pdf-processor"
},
"esg-vectors": {
"command": "esg-vector-store"
},
"esg-regulations": {
"command": "esg-regulations"
},
"esg-scraper": {
"command": "esg-scraper"
}
}
}
Or use the combined server for all tools:
{
"mcpServers": {
"esg": {
"command": "esg-mcp-all"
}
}
}
Optional: Scraper Dependencies
The scraper server requires Selenium for JavaScript-heavy sites:
pip install esg-mcp-servers[scraper]
Development
git clone https://github.com/freminder/esg-mcp-servers.git
cd esg-mcp-servers
pip install -e ".[scraper,dev]"
docker compose up -d
esg-mcp-migrate
License
MIT — see LICENSE.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.