Research Citations MCP Server
An MCP server for searching and citing research papers using RAG, enabling semantic search, citation finding, and question answering over a collection of PDF papers.
README
Research Citations MCP Server
A Model Context Protocol (MCP) server for searching and citing research papers using Retrieval-Augmented Generation (RAG). This server helps you find relevant citations, search for specific passages, and answer research questions based on your collection of PDF research papers.
Features
- π Semantic Search: Find relevant passages across all your research papers
- π Citation Finder: Get properly formatted citations with source references
- π‘ Question Answering: Ask questions and get answers backed by your papers
- π Paper Summarization: Generate summaries with custom prompts or pre-defined focus areas
- π¬ Methodology Extraction: Extract structured methodology details from papers
- π Bibliography Extraction: Get APA-formatted citations from paper reference sections
- π·οΈ Automatic Metadata Extraction: During index rebuild, automatically extracts authors, year, title, and APA citation from first 1-2 pages using GPT-4o
- π PDF Processing: Automatic extraction and chunking of text from PDFs
- π SSE Transport: Remote access via Server-Sent Events
- π― Vector Search: ChromaDB-powered semantic similarity search
- π€ LangChain Integration: Built on battle-tested RAG frameworks
Architecture
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β MCP Client (Claude, etc.) β
βββββββββββββββββββββ¬ββββββββββββββββββββββββββββββ
β SSE Transport
βββββββββββββββββββββΌββββββββββββββββββββββββββββββ
β FastAPI + Starlette β
β MCP Server β
βββββββββββββββββββββββββββββββββββββββββββββββββββ€
β RAG Engine β
β βββββββββββββββ ββββββββββββββββ β
β β LangChain βββββββββΊβ ChromaDB β β
β β Retrieval β β Vector Store β β
β βββββββββββββββ ββββββββββββββββ β
β β β² β
β βΌ β β
β βββββββββββββββ ββββββββββββββββ β
β β OpenAI LLM β β PDF Processorβ β
β β & Embeddingsβ β (PyPDFLoader)β β
β βββββββββββββββ ββββββββ¬ββββββββ β
ββββββββββββββββββββββββββββββββββΌββββββββββββββββ
β
ββββββββββΌβββββββββ
β Research Papers β
β (PDF Files) β
ββββββββββββββββββββ
Installation
Prerequisites
- Python 3.10 or higher
- OpenAI API key
- UV package manager (recommended) or pip
Setup
-
Clone the repository
git clone <your-repo-url> cd citations-mcp -
Install dependencies
# Using uv (recommended) uv sync # Or using pip pip install -e . -
Configure environment
cp .env.example .envEdit
.envand set:OPENAI_API_KEY: Your OpenAI API keyPAPERS_DIRECTORY: Path to your folder containing PDF research papers- Other optional settings (see
.env.example)
Usage
Starting the Server
# Using uv
uv run uvicorn src.main:app --host 127.0.0.1 --port 8000 --reload
# Or using uvicorn directly
uvicorn src.main:app --host 127.0.0.1 --port 8000 --reload
The server will:
- Start on
http://127.0.0.1:8000 - Automatically process all PDFs in your papers directory
- Build a vector store index (cached for future runs)
- Expose MCP tools via SSE at
http://127.0.0.1:8000/mcp/sse
Available MCP Tools
1. search_papers
Search for relevant passages in your research papers.
{
"query": "machine learning for natural language processing",
"num_results": 5
}
2. search_in_paper
Search for relevant passages within a specific paper only.
{
"query": "transformer architecture",
"filename": "attention_is_all_you_need.pdf",
"num_results": 5
}
3. find_citation
Find relevant citations for a specific topic, grouped by source paper.
{
"topic": "transformer architecture",
"num_citations": 3
}
4. answer_question
Ask a research question and get an answer with sources.
{
"question": "What are the main challenges in few-shot learning?"
}
5. list_papers
List all indexed research papers.
{}
6. get_paper_info
Get information about a specific paper.
{
"filename": "attention_is_all_you_need.pdf"
}
7. rebuild_index
Rebuild the vector store index (use after adding new papers). During rebuild, the server automatically:
- Extracts metadata from the first 1-2 pages of each paper using GPT-4o
- Captures authors, publication year, title, journal/conference, DOI, and generates APA citation
- Attaches this metadata to all chunks for easy citation reference
{
"force": true
}
Note: Metadata extraction uses GPT-4o API calls (one per paper), so rebuilding with many papers may incur API costs.
8. extract_methodology
Extract structured methodology details from a research paper.
{
"filename": "attention_is_all_you_need.pdf"
}
Returns: Research approach, datasets, models, evaluation metrics, experimental setup, baselines, and implementation details.
9. summarize_paper
Generate a summary of a research paper with custom prompts or pre-defined focus.
{
"filename": "attention_is_all_you_need.pdf",
"focus": "key_findings" // Options: "general", "key_findings", "methodology", "limitations", "contributions"
}
Or with a custom prompt:
{
"filename": "attention_is_all_you_need.pdf",
"custom_prompt": "Summarize the experimental results and their statistical significance"
}
10. extract_bibliography
Extract the bibliography/references from a paper in APA format.
{
"filename": "attention_is_all_you_need.pdf"
}
Returns: APA-formatted list of all citations used in the paper.
Connecting from Claude Desktop
Add this to your Claude Desktop MCP configuration:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
{
"mcpServers": {
"research-citations": {
"url": "http://127.0.0.1:8000/mcp/sse"
}
}
}
Restart Claude Desktop and you'll see the research tools available.
Public Access via Ngrok (for ChatGPT and Remote Access)
To expose your server publicly via ngrok:
# Install ngrok (if not already installed)
brew install ngrok
# Configure your ngrok auth token
export NGROK_AUTHTOKEN="your_ngrok_token"
# Start the public server
./start_public.sh
This will give you a public URL like https://abc123.ngrok-free.app/mcp/sse that you can use with ChatGPT or other remote MCP clients.
See NGROK_SETUP.md for detailed instructions.
Configuration
Environment Variables
| Variable | Description | Default |
|---|---|---|
OPENAI_API_KEY |
OpenAI API key (required) | - |
PAPERS_DIRECTORY |
Path to research papers folder (required) | - |
VECTOR_DB_PATH |
Path to store vector database | ./vector_db |
COLLECTION_NAME |
ChromaDB collection name | research_papers |
CHUNK_SIZE |
Text chunk size for processing | 1000 |
CHUNK_OVERLAP |
Overlap between chunks | 200 |
EMBEDDING_MODEL |
OpenAI embedding model | text-embedding-3-small |
LLM_MODEL |
OpenAI LLM model | gpt-4o-mini |
HOST |
Server host | 127.0.0.1 |
PORT |
Server port | 8000 |
Adding Papers
- Place PDF files in your
PAPERS_DIRECTORY - Either:
- Restart the server (auto-detects new papers)
- Call the
rebuild_indextool withforce: true
Development
Project Structure
citations-mcp/
βββ src/
β βββ __init__.py
β βββ config.py # Configuration management
β βββ pdf_processor.py # PDF loading and chunking
β βββ rag_engine.py # RAG pipeline and search
β βββ mcp_server.py # MCP tool definitions
β βββ sse_transport.py # SSE transport layer
β βββ main.py # FastAPI application
βββ pyproject.toml # Dependencies
βββ .env.example # Environment template
βββ README.md # This file
Running Tests
# Using uv
uv run pytest
# Or using pytest directly
pytest
Adding New Tools
Edit src/mcp_server.py and add a new function decorated with @mcp.tool():
@mcp.tool()
async def my_new_tool(param: str) -> Dict[str, Any]:
"""Tool description."""
# Implementation
return {"result": "value"}
Troubleshooting
Vector store not initializing
- Check that
PAPERS_DIRECTORYexists and contains PDF files - Ensure
OPENAI_API_KEYis valid - Check logs for specific error messages
PDFs not being processed
- Verify PDFs are valid and readable
- Check file permissions
- Look for processing errors in server logs
Poor search results
- Adjust
CHUNK_SIZEandCHUNK_OVERLAPin.env - Try different
EMBEDDING_MODELoptions - Rebuild index with
force: true
Performance Tips
- First Run: Initial indexing takes time proportional to number of papers
- Caching: Vector store is persisted and reused on subsequent runs
- Embeddings:
text-embedding-3-smallis fast and cost-effective - Chunks: Smaller chunks (500-1000) work better for precise citations
License
MIT License - See LICENSE file for details
Contributing
Contributions welcome! Please open an issue or PR.
Acknowledgments
- Built with LangChain
- Uses Model Context Protocol
- Powered by OpenAI embeddings and LLMs
- PDF processing via pypdf
- Vector storage with ChromaDB
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.