geo-mcp
Enables searching and downloading Gene Expression Omnibus (GEO) data from NCBI, including datasets, series, samples, platforms, and profiles, through natural language queries.
README
Gene Expression Omnibus (GEO) MCP
<div align="center"> <img src="https://www.ncbi.nlm.nih.gov/geo/img/geo_main.gif" alt="GEO Logo" width="200"/> <br/> <em>Gene Expression Omnibus (GEO) - A public functional genomics data repository</em> </div>
Status: Beta. The MCP stdio and HTTP servers are both functional and in day-to-day use. The public tool surface is stable for the 0.1.x line; internal APIs and config defaults may still evolve. Please open an issue if you hit a regression.
A Model Context Protocol (MCP) server for accessing GEO (Gene Expression Omnibus) data through NCBI E-Utils API. The tool will enable you to search for GEO datasets, series, samples, platforms, and profiles for your LLM. Tested with Claude Desktop, chatGPT has no out of the box support for this tool yet. Claude will automatically use the tools if it fits the context.
Quick Install (pip)
install from pip
pip install geo-mcp
install from source
git clone https://github.com/MCPmed/GEOmcp
cd GEOmcp
pip install -e .
Configuration
Run init to create a config file
geo-mcp --init
By default this writes to $XDG_CONFIG_HOME/geomcp/config.json
(typically ~/.config/geomcp/config.json). If you already have a legacy
~/.geo-mcp/config.json from an older install, that location is still
honored and --init will update it in place so nothing gets orphaned.
The file contains:
{
"base_url": "https://eutils.ncbi.nlm.nih.gov/entrez/eutils",
"email": "your_email@example.com",
"api_key": "YOUR_API_KEY",
"download_dir": "~/.local/share/geomcp/downloads"
}
emailis required by NCBI.api_keyis optional but recommended for higher rate limits (get one here).
You can also skip the config file entirely and pass values via env vars
(GEOMCP_EMAIL, GEOMCP_API_KEY, GEOMCP_BASE_URL, GEOMCP_DOWNLOAD_DIR)
or CLI flags (--email, --api-key, --download-dir, ...). Precedence,
highest wins: CLI flags → env vars → config.json → built-in defaults.
Running the Server
- MCP stdio mode:
geo-mcp - HTTP mode:
geo-mcp --http --port 8001
Claude Desktop Integration
Common Issue: spawn geo-mcp ENOENT
This error means Claude Desktop cannot find the geo-mcp command. This is usually a PATH issue.
Solution
-
Find the full path to the executable:
which geo-mcpExample output:
/Users/youruser/miniforge3/bin/geo-mcp -
Update your Claude config: Instead of just
"geo-bio-mcp", use the full path:{ "mcpServers": { "geo-mcp": { "command": "/Users/youruser/miniforge3/bin/geo-mcp", "env": { "CONFIG_PATH": "/Users/youruser/.geo-mcp/config.json" } } } } -
(Optional) Use a Conda Environment:
- Activate your conda env and run
which geo-mcpto get the correct path. - Use that path in your Claude config as above.
- Activate your conda env and run
-
Restart Claude Desktop after updating the config.
Troubleshooting
- If you see
command not found: geo-mcp, make sure you installed into the Python environment whosebindirectory is on yourPATH, or invoke the server with its full path. - If the config file is missing,
geo-mcp --initwill create one. You can also run without a config by settingGEOMCP_EMAILin the environment.
HTTP API
With the server running via geo-mcp --http (default port 8001), the
following endpoints are exposed:
GET /— server statusGET /health— health checkGET /tools— list all registered MCP toolsPOST /tools/call— execute a tool with argumentsGET /events— server-sent events stream of tool callsGET /docs— interactive Swagger UI
Examples
# List available tools
curl http://localhost:8001/tools
# Search GEO Series
curl -X POST http://localhost:8001/tools/call \
-H "Content-Type: application/json" \
-d '{
"name": "search_geo_series",
"arguments": {"term": "breast cancer", "retmax": 5}
}'
Available Tools
This MCP server provides access to all major GEO databases through the following tools:
Search Tools
-
search_geo- Universal GEO search across GSE/GSM/GPL/GDS records, with results bucketed by accession type. Passrecord_typesto filter.- Example searches: "breast cancer", "GSE12345", "RNA-seq time course"
-
search_geo_profiles- Search gene expression profiles across different biological contexts- Example searches: "cancer", "breast cancer", "p53", "apoptosis"
-
search_geo_datasets- Search curated gene expression datasets- Example searches: "diabetes", "Alzheimer's disease", "drug response", "tissue specific"
-
search_geo_series- Search original submitter-supplied gene expression series- Example searches: "GSE12345", "microarray", "RNA-seq", "time course"
-
search_geo_samples- Search individual gene expression samples- Example searches: "GSM123456", "human", "mouse", "tumor", "normal"
-
search_geo_platforms- Search microarray platform definitions- Example searches: "Affymetrix", "Illumina", "Agilent", "GPL96"
Search Parameters
Each tool accepts:
term(required): Search term or query stringretmax(optional, default: 20): Maximum number of results to return
Example usage
For now just chat with claude desktop and it will use the tools automatically if it fits the context.
User: find a small brain dataset
Assistant: I'll help you find a small brain dataset from the GEO database. Let me search for brain-related datasets with a maximum of one result.
Assistant (system): search_geo_datasets
Assistant: I found a brain-related dataset for you:
- Dataset: GSE290848
- Title: Maternal Stress Triggers Early-Life Eczema via Fetal Mast Cell Reprogramming [10x Genomics Multiome ATAC + Gene Expression]
- Species: Mouse (Mus musculus)
- Samples: 8
- Data type: ATAC-seq & gene expression (10x Genomics Multiome)
- Publication date: 21 Jun 2025
This small dataset explores how prenatal stress affects neuro-immune circuits and mast-cell development.
Would you like me to download this dataset or search for others?
User: download GSE290848
Assistant: I'll download the GSE290848 dataset for you.
Assistant (system): download_geo_data
Assistant: Download complete.
Downloaded files
GSE290848_family.soft.gz— series, platform & sample infoGSE290848_metadata.xml— detailed study metadata
Location: ~/.local/share/geomcp/downloads/gse/GSE290848
Total size: 0.01 MB
The dataset is ready for analysis. Let me know if you’d like to explore it further.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.