bib-enrich-mcp
Automatically enriches BibTeX citations with missing metadata (DOI, venue, etc.) from arXiv, DBLP, and CrossRef via an MCP server for AI assistants.
README
bib-enrich-mcp
Writing a paper and your .bib file is a mess? This tool lets AI automatically complete your citations — fill in missing metadata, find publication venues, add DOIs, and even discover if a preprint has been formally published.
Features
- Automatic metadata scraping from multiple sources:
- arXiv API
- DBLP API
- CrossRef API
- BibTeX parsing and writing with full field support
- Batch processing of entire .bib files
- MCP integration for use with AI assistants
Installation
# Or install with uv (Recommended)
uv tool install bib-enrich-mcp
# Install from PyPI
pip install bib-enrich-mcp
# Or clone and install locally
git clone https://github.com/haoxiangsnr/bib-enrich-mcp.git
cd bib-enrich-mcp
uv sync
Quick Start
Step 1: Configure MCP Client
Add the server to your MCP client (e.g., Cherry Studio, Claude Desktop, Cursor):
{
"mcpServers": {
"bib-enrich": {
"command": "bib-enrich-mcp"
}
}
}
Step 2: Enable the MCP Server
In your MCP client, enable the bib-enrich server. Look for a tools icon (usually a wrench) in the chat interface.
Step 3: Start Using
Now you can ask the AI to help with your bibliography. Example prompts:
Help me find the complete citation for: Attention Is All You Need
Enrich this BibTeX entry with arXiv ID 2401.12345
Process my references.bib file and fill in missing metadata
The AI will automatically call the appropriate tools to fetch metadata from arXiv, DBLP, and CrossRef.
Usage
As an MCP Server
Add to your MCP client configuration:
{
"mcpServers": {
"bib-enrich": {
"command": "bib-enrich-mcp"
}
}
}
Running the Server
bib-enrich-mcp
API Documentation
MCP Tools
mcp_enrich_bib_entry
Enrich a single bibliography entry by scraping metadata from academic sources.
Parameters:
cite_key(required): The citation key for the entrytitle(optional): Paper title to search forarxiv_id(optional): arXiv ID (e.g., "2401.12345")doi(optional): DOI of the paper
Returns: BibTeX string with enriched metadata
Example:
result = await mcp_enrich_bib_entry(
cite_key="vaswani2017attention",
title="Attention Is All You Need"
)
mcp_enrich_bib_file
Enrich all entries in a BibTeX file.
Parameters:
file_path(required): Path to the .bib file
Returns: Summary of enriched entries
Example:
result = await mcp_enrich_bib_file("/path/to/references.bib")
# Returns: "Enriched 5/10 entries in /path/to/references.bib"
Python API
You can also use the library directly in Python:
from bib_enrich_mcp.bib_parser import parse_bib_file, write_bib_file
from bib_enrich_mcp.scrapers import scrape_metadata
# Parse a bib file
entries = parse_bib_file("references.bib")
# Scrape metadata for a paper
results = await scrape_metadata(
title="Attention Is All You Need",
arxiv_id="1706.03762"
)
Supported Metadata Sources
| Source | Search by Title | Search by ID | Notes |
|---|---|---|---|
| arXiv | ✅ | ✅ (arXiv ID) | Best for preprints |
| DBLP | ✅ | ❌ | Best for CS conferences |
| CrossRef | ✅ | ✅ (DOI) | Best for journals |
Development
Running Tests
uv run pytest tests/ -v
Project Structure
bib-enrich-mcp/
├── src/bib_enrich_mcp/
│ ├── __init__.py
│ ├── bib_parser.py # BibTeX parsing/writing
│ ├── scrapers.py # Metadata scrapers
│ └── server.py # MCP server
├── tests/
│ ├── test_bib_parser.py
│ ├── test_scrapers.py
│ └── test_server.py
├── pyproject.toml
└── README.md
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.