OpenAlex MCP Server
Provides access to OpenAlex's catalog of 240M+ scholarly works, enabling search and retrieval of research papers, authors, institutions, journals, concepts, and funders with advanced filtering and classification capabilities.
README
OpenAlex MCP Server
A Model Context Protocol (MCP) server that provides access to the OpenAlex API - a fully open catalog of the global research system covering over 240 million scholarly works.
Features
This MCP server provides tools to search and retrieve:
- Works - Scholarly articles, preprints, datasets, books (240M+ items)
- Authors - Researchers and creators with ORCID integration
- Sources - Journals, conferences, repositories (~250K venues)
- Institutions - Universities, hospitals, labs with ROR matching
- Concepts - Hierarchical research topics (levels 0-5)
- Publishers - Publishing organizations
- Funders - Grant-making bodies
- Autocomplete - Type-ahead search across all entity types
- Text Classification - Concept prediction for arbitrary text
Installation
From npm (Recommended)
npm install -g openalex-mcp
From Source
git clone https://github.com/reetp14/openalex-mcp.git
cd openalex-mcp
npm install
npm run build
Usage
As an MCP Server
Add to your MCP client configuration:
{
"mcpServers": {
"openalex": {
"command": "npx",
"args": ["openalex-mcp"]
}
}
}
json
Or if installed locally:
{
"mcpServers": {
"openalex": {
"command": "node",
"args": ["./node_modules/openalex-mcp/build/index.js"]
}
}
}
Available Tools
Entity Search Tools
All search tools support the full OpenAlex query grammar:
search_works- Search scholarly workssearch_authors- Search researchers and creatorssearch_sources- Search journals, conferences, repositoriessearch_institutions- Search universities, hospitals, labssearch_concepts- Search research topicssearch_publishers- Search publishing organizationssearch_funders- Search grant-making bodies
Common Parameters:
search- Full-text search queryfilter- Boolean filters (e.g.,concept.id:C12345,from_publication_date:2022-01-01)sort- Sort field with optional:desc(e.g.,cited_by_count:desc)page/per_page- Standard pagination (max 10,000 results total)cursor- Deep pagination (use*for first call)group_by- Faceting/aggregation by fieldselect- Comma-separated fields to returnsample- Random sample size with optionalseedmailto- Your email for higher rate limits
Single Entity Retrieval
get_entity- Get a single entity by OpenAlex IDentity_type- One of: works, authors, sources, institutions, concepts, publishers, fundersopenalex_id- OpenAlex ID (e.g., W2741809807, A1969205038)
Utility Tools
-
autocomplete- Type-ahead search across entity typessearch- Search query (required)type- Entity type to search within (optional)per_page- Number of suggestions (max 50)
-
classify_text- Predict research concepts from texttitle- Title text to classifyabstract- Abstract text to classify
Examples
Search for AI papers from 2023
{
"tool": "search_works",
"arguments": {
"search": "artificial intelligence",
"filter": "from_publication_date:2023-01-01,to_publication_date:2023-12-31",
"sort": "cited_by_count:desc",
"per_page": 10,
"mailto": "researcher@university.edu"
}
}
Find authors by institution
{
"tool": "search_authors",
"arguments": {
"filter": "last_known_institution.id:I27837315",
"sort": "works_count:desc",
"select": "id,display_name,works_count,cited_by_count"
}
}
Get publication trends by year
{
"tool": "search_works",
"arguments": {
"filter": "concepts.id:C154945302",
"group_by": "publication_year"
}
}
Autocomplete journal names
{
"tool": "autocomplete",
"arguments": {
"search": "nature",
"type": "sources",
"per_page": 5
}
}
Classify research text
{
"tool": "classify_text",
"arguments": {
"title": "Deep Learning for Medical Image Analysis",
"abstract": "We present a novel approach using convolutional neural networks..."
}
}
Query Grammar Quick Reference
Filters
- Chain with
,for AND:concept.id:C12345,publication_year:2023 - Chain with
|for OR:type:journal|type:repository - Negate with
!:authors.id!A12345(exclude author) - Date ranges:
from_publication_date:2020-01-01,to_publication_date:2023-12-31
Sorting
- Ascending:
sort=publication_year - Descending:
sort=cited_by_count:desc - Multiple:
sort=publication_year:desc,cited_by_count:desc
Pagination
- Standard:
page=2&per_page=100(max 10,000 results) - Deep:
cursor=*(first call), then use returnednext_cursor
Rate Limits
- Anonymous: 10 requests/second, 100,000/day
- With
mailto: 100 requests/second, 1,000,000/day
API Response Format
All tools return the standard OpenAlex JSON envelope:
{
"meta": {
"count": 249256387,
"db_response_time_ms": 12,
"page": 1,
"per_page": 25,
"next_cursor": "ZjEwMD..."
},
"results": [
{
/* entity object */
}
]
}
Development
# Watch mode during development
npm run watch
# Test with MCP inspector
npm run inspector
# Run basic functionality test
node test-simple.js
Environment Configuration
The server supports environment variables for configuration. Copy .env.example to .env and configure:
cp .env.example .env
# Edit .env with your settings
Environment Variables
OPENALEX_BEARER_TOKEN: Bearer token for authenticated API access (optional)OPENALEX_DEFAULT_EMAIL: Default email for rate limiting when nomailtoparameter provided
API Access Notes
- Free Access: OpenAlex API is free and open
- Rate Limits: 10 req/sec (anonymous) or 100 req/sec (with Bearer token or
mailto) - Authentication: Bearer token automatically loaded from environment
- Response Size: Use
selectparameter to limit response size for large datasets
Example with optimized response:
{
"tool": "search_works",
"arguments": {
"search": "machine learning",
"select": "id,display_name,publication_year,cited_by_count",
"per_page": 10
}
}
About OpenAlex
OpenAlex is a fully open catalog of the global research system, named after the ancient Library of Alexandria and created by the nonprofit OurResearch. It provides free, comprehensive metadata about scholarly works, authors, institutions, and more.
- Website: https://openalex.org/
- API Documentation: https://docs.openalex.org/
- Data sources: Crossref, ORCID, ROR, Microsoft Academic Graph, and more
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
E2B
Using MCP to run code via e2b.