mcp-arxiv
An MCP server that enables AI assistants to search arXiv papers, retrieve metadata, and access PDFs.
README
MCP ArXiv Server
An MCP Server for accessing the arXiv API. Enables AI assistants to search papers, retrieve metadata, and access PDFs from arXiv.
Features
- 📚 Search arXiv papers by query
- 📄 Retrieve full paper metadata by ID
- 🔗 Direct links to abstract and PDF URLs
- 🚀 Fast XML parsing for Atom feed responses
- ✅ TypeScript with strict typing
Installation
From npm (Recommended)
# Install via npm
npm install -g @alexgenovese/mcp-arxiv
# Or via pnpm
pnpm add -g @alexgenovese/mcp-arxiv
# Or via yarn
yarn add -g @alexgenovese/mcp-arxiv
Build from Source
git clone https://github.com/alexgenovese/mcp-arxiv.git
cd mcp-arxiv
npm install
npm run build
Installation via Smithery
# Install via smithery
npx smithery install @alexgenovese/mcp-arxiv
# Or stream via smithery
npx smithery stream @alexgenovese/mcp-arxiv
# Run locally with smithery
npx @smithery/cli r @alexgenovese/mcp-arxiv
Installation via Glama
# Glama provides MCP discovery and installation
npx glama install @alexgenovese/mcp-arxiv
Usage
Configure your MCP Client
Add the MCP server configuration to your MCP client (Cursor, VS Code, Claude Desktop, etc.):
Compiled version (recommended):
{
"mcpServers": {
"arxiv": {
"command": "node",
"args": ["./dist/index.js"]
}
}
}
Development version (with source rebuild):
{
"mcpServers": {
"arxiv": {
"command": "npx",
"args": ["tsx", "src/index.ts"],
"cwd": "/path/to/mcp-arxiv"
}
}
}
Available Tools
The MCP server provides the following tools:
🔍 arxiv_search
Search for papers on arXiv by query.
{
"name": "arxiv_search",
"arguments": {
"query": "machine learning neural networks",
"start": 0,
"max_results": 10,
"sort_by": "relevance",
"sort_order": "descending"
}
}
Parameters:
query: Search query string (required)start: Starting index for pagination (default: 1)max_results: Maximum number of results (default: 10, max: 20)sort_by:"submittedDate"(publication date),"lastUpdatedDate","relevance", or"timestamp"(default:"submittedDate")sort_order:"ascending"or"descending"(default:"descending")
Returns: Array of papers with id, title, summary, abs_url, pdf_url, html_url, published, authors, categories, comment, doi, and journal_ref.
📄 arxiv_get_paper
Retrieves full metadata for a specific paper by ID.
{
"name": "arxiv_get_paper",
"arguments": {
"paper_id": "2401.00000"
}
}
Parameters:
paper_id: arXiv paper ID (e.g., "2401.00000")
Returns: Full paper metadata including ID, title, authors, abstract, categories, URLs, and comments.
📥 arxiv_get_feed
Retrieves papers from an arXiv category feed (RSS/Atom).
{
"name": "arxiv_get_feed",
"arguments": {
"format": "atom",
"category": "cs.AI",
"limit": 20,
"publishedInMonths": 3
}
}
Parameters:
format: "atom" or "rss" (default: "atom")category: arXiv category (e.g., "cs.AI", "physics", "quant-ph")limit: Maximum number of papers (default: 20)publishedInMonths: Papers from last N months (default: 12, max: 24)
Returns: Array of papers from the specified category.
🔗 arxiv_get_pdf_url
Retrieves canonical absolute and PDF URLs for a paper ID.
{
"name": "arxiv_get_pdf_url",
"arguments": {
"paper_id": "2401.00000"
}
}
Parameters:
paper_id: arXiv paper ID
Returns: Absolute, PDF, and HTML URLs for the specified paper.
Development
# Install dependencies
npm install
# Run development server with type checking
npm run dev
# Build production version
npm run build
# Run tests
npm test
License
MIT License - see LICENSE file for details.
Contributing
Contributions are welcome! Please read our code of conduct before contributing.
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature) - Commit your changes (
git commit -m 'Add some AmazingFeature') - Push to the branch (
git push origin feature/AmazingFeature) - Open a Pull Request
Acknowledgments
- Built with Model Context Protocol
- Data from arXiv API
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.