Unofficial BioThings.io MCP Server

Unofficial BioThings.io MCP Server

Provides access to BioThings.io APIs for comprehensive gene and variant annotations, enabling seamless integration of biological data into workflows.

Category
Visit Server

README

Logo

Unofficial BioThings.io MCP Server

A Model Context Protocol (MCP) server that provides access to the BioThings.io ecosystem of APIs, including MyGene.info and MyVariant.info. This server enables seamless integration of comprehensive gene and variant annotation data into your workflows.

Developed by Augmented Nature

Overview

BioThings.io provides high-performance APIs for biological data:

  • MyGene.info: Gene annotation service with 11M+ requests/month, covering 22M+ genes across 22K+ species
  • MyVariant.info: Variant annotation service with 5M+ requests/month, covering 400M+ human variants

Features

Core Gene Tools

  • get_gene_annotation: Retrieve detailed gene annotation by Entrez or Ensembl ID
  • query_genes: Search genes using flexible syntax (symbols, names, genomic intervals, etc.)
  • batch_gene_query: Process up to 1000 genes efficiently in a single request

Core Variant Tools

  • get_variant_annotation: Retrieve variant annotation by HGVS ID
  • query_variants: Search variants using genomic ranges, rsIDs, gene names, and filters
  • batch_variant_query: Process up to 1000 variants efficiently in a single request

Advanced Gene Tools

  • search_genes_by_pathway: Search genes by pathway (KEGG, Reactome, BioCarta, etc.)
  • search_genes_by_go_term: Search genes by Gene Ontology terms (biological process, molecular function, cellular component)
  • get_gene_orthologs: Find orthologous genes across species using HomoloGene
  • search_drug_target_genes: Search for genes that are drug targets using PharmGKB annotations
  • get_genomic_interval_genes: Get all genes within a specific genomic interval

Advanced Variant Tools

  • search_variants_by_gene: Find all variants in or near a specific gene
  • search_pathogenic_variants: Search for pathogenic or likely pathogenic variants with clinical annotations
  • search_variants_by_population_frequency: Search variants by population frequency thresholds

Utility Tools

  • get_gene_metadata: Retrieve MyGene.info API metadata and available fields
  • get_variant_metadata: Retrieve MyVariant.info API metadata and available fields
  • get_gene_fields: Get all available fields for gene annotation with descriptions
  • get_variant_fields: Get all available fields for variant annotation with descriptions

Installation

  1. The server has been built and configured in your MCP settings
  2. No API keys required - BioThings APIs are freely accessible
  3. The server will automatically start when you use MCP tools

Usage

With Claude Desktop

Add to your claude_desktop_config.json:

{
  "mcpServers": {
    "biothings": {
      "command": "node",
      "args": ["/path/to/biothings-server/build/index.js"]
    }
  }
}

Example Queries

  1. Get gene annotation for CDK2:

    Get detailed information about gene CDK2 using get_gene_annotation
    
  2. Search for insulin-related genes:

    Search for genes related to insulin using query_genes
    
  3. Find genes in cell cycle pathway:

    Find genes involved in cell cycle using search_genes_by_pathway
    
  4. Search genes by GO term:

    Find genes involved in apoptosis using search_genes_by_go_term
    
  5. Get variant annotation:

    Get annotation for variant chr7:g.55241707G>T using get_variant_annotation
    
  6. Search variants by rsID:

    Find variant information for rs58991260 using query_variants
    
  7. Batch process multiple genes:

    Get annotations for genes CDK2, TP53, and BRCA1 using batch_gene_query
    
  8. Find pathogenic variants:

    Search for pathogenic variants in BRCA1 and BRCA2 genes using search_pathogenic_variants
    
  9. Get gene orthologs:

    Find mouse and rat orthologs for human TP53 gene using get_gene_orthologs
    
  10. Search drug target genes:

    Find genes that are targets for aspirin using search_drug_target_genes
    

Query Syntax

Gene Queries

  • Simple: CDK2, insulin receptor
  • Fielded: symbol:TP53, entrezgene:1017, summary:diabetes
  • Genomic: chr1:1000000-2000000
  • Boolean: (CDK2 OR CDK4) AND human
  • Wildcards: CDK*, IL?R

Variant Queries

  • rsID: rs58991260
  • Genomic range: chr1:69000-70000
  • Gene-based: dbnsfp.genename:BRCA1
  • Functional: cadd.phred:>20
  • Clinical: clinvar.clinical_significance:pathogenic

Available Fields

Gene Fields (Examples)

  • symbol, name, summary - Basic gene information
  • go.BP, go.MF, go.CC - Gene Ontology terms
  • pathway.kegg, pathway.reactome - Pathway information
  • refseq.rna, refseq.protein - RefSeq identifiers
  • ensembl.gene, ensembl.transcript - Ensembl identifiers

Variant Fields (Examples)

  • cadd.phred, cadd.raw - CADD scores
  • dbnsfp.genename, dbnsfp.aa.alt - dbNSFP annotations
  • clinvar.clinical_significance - ClinVar classifications
  • dbsnp.rsid, dbsnp.vartype - dbSNP information
  • exac.af, gnomad.af - Population frequencies

Species Support

Common Species Names

  • human, mouse, rat, fruitfly, nematode, zebrafish, thale-cress, frog, pig

Taxonomy IDs

  • Human: 9606
  • Mouse: 10090
  • Rat: 10116

Error Handling

The server provides comprehensive error handling:

  • Invalid parameters: Clear validation messages
  • API errors: Detailed HTTP status and error information
  • Not found: Graceful handling of missing genes/variants
  • Rate limiting: Automatic retry logic for temporary failures

Data Sources

MyGene.info (32+ sources)

  • NCBI Gene, Ensembl, UniProt, GO, KEGG, Reactome, PharmGKB, and more

MyVariant.info (19+ sources)

  • dbSNP, ClinVar, CADD, dbNSFP, ExAC, gnomAD, COSMIC, and more

Performance

  • High throughput: APIs handle millions of requests monthly
  • Batch processing: Up to 1000 items per request
  • Caching: Responses are cached for optimal performance
  • Pagination: Support for large result sets

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured