OpenAlex MCP Server

OpenAlex MCP Server

Provides access to OpenAlex's catalog of 240M+ scholarly works, enabling search and retrieval of research papers, authors, institutions, journals, concepts, and funders with advanced filtering and classification capabilities.

Category
Visit Server

README

OpenAlex MCP Server

A Model Context Protocol (MCP) server that provides access to the OpenAlex API - a fully open catalog of the global research system covering over 240 million scholarly works.

Features

This MCP server provides tools to search and retrieve:

  • Works - Scholarly articles, preprints, datasets, books (240M+ items)
  • Authors - Researchers and creators with ORCID integration
  • Sources - Journals, conferences, repositories (~250K venues)
  • Institutions - Universities, hospitals, labs with ROR matching
  • Concepts - Hierarchical research topics (levels 0-5)
  • Publishers - Publishing organizations
  • Funders - Grant-making bodies
  • Autocomplete - Type-ahead search across all entity types
  • Text Classification - Concept prediction for arbitrary text

Installation

From npm (Recommended)

npm install -g openalex-mcp

From Source

git clone https://github.com/reetp14/openalex-mcp.git
cd openalex-mcp
npm install
npm run build

Usage

As an MCP Server

Add to your MCP client configuration:

{
  "mcpServers": {
    "openalex": {
      "command": "npx",
      "args": ["openalex-mcp"]
    }
  }
}

json



Or if installed locally:

{
  "mcpServers": {
    "openalex": {
      "command": "node",
      "args": ["./node_modules/openalex-mcp/build/index.js"]
    }
  }
}

Available Tools

Entity Search Tools

All search tools support the full OpenAlex query grammar:

  • search_works - Search scholarly works
  • search_authors - Search researchers and creators
  • search_sources - Search journals, conferences, repositories
  • search_institutions - Search universities, hospitals, labs
  • search_concepts - Search research topics
  • search_publishers - Search publishing organizations
  • search_funders - Search grant-making bodies

Common Parameters:

  • search - Full-text search query
  • filter - Boolean filters (e.g., concept.id:C12345,from_publication_date:2022-01-01)
  • sort - Sort field with optional :desc (e.g., cited_by_count:desc)
  • page/per_page - Standard pagination (max 10,000 results total)
  • cursor - Deep pagination (use * for first call)
  • group_by - Faceting/aggregation by field
  • select - Comma-separated fields to return
  • sample - Random sample size with optional seed
  • mailto - Your email for higher rate limits

Single Entity Retrieval

  • get_entity - Get a single entity by OpenAlex ID
    • entity_type - One of: works, authors, sources, institutions, concepts, publishers, funders
    • openalex_id - OpenAlex ID (e.g., W2741809807, A1969205038)

Utility Tools

  • autocomplete - Type-ahead search across entity types

    • search - Search query (required)
    • type - Entity type to search within (optional)
    • per_page - Number of suggestions (max 50)
  • classify_text - Predict research concepts from text

    • title - Title text to classify
    • abstract - Abstract text to classify

Examples

Search for AI papers from 2023

{
  "tool": "search_works",
  "arguments": {
    "search": "artificial intelligence",
    "filter": "from_publication_date:2023-01-01,to_publication_date:2023-12-31",
    "sort": "cited_by_count:desc",
    "per_page": 10,
    "mailto": "researcher@university.edu"
  }
}

Find authors by institution

{
  "tool": "search_authors",
  "arguments": {
    "filter": "last_known_institution.id:I27837315",
    "sort": "works_count:desc",
    "select": "id,display_name,works_count,cited_by_count"
  }
}

Get publication trends by year

{
  "tool": "search_works",
  "arguments": {
    "filter": "concepts.id:C154945302",
    "group_by": "publication_year"
  }
}

Autocomplete journal names

{
  "tool": "autocomplete",
  "arguments": {
    "search": "nature",
    "type": "sources",
    "per_page": 5
  }
}

Classify research text

{
  "tool": "classify_text",
  "arguments": {
    "title": "Deep Learning for Medical Image Analysis",
    "abstract": "We present a novel approach using convolutional neural networks..."
  }
}

Query Grammar Quick Reference

Filters

  • Chain with , for AND: concept.id:C12345,publication_year:2023
  • Chain with | for OR: type:journal|type:repository
  • Negate with !: authors.id!A12345 (exclude author)
  • Date ranges: from_publication_date:2020-01-01,to_publication_date:2023-12-31

Sorting

  • Ascending: sort=publication_year
  • Descending: sort=cited_by_count:desc
  • Multiple: sort=publication_year:desc,cited_by_count:desc

Pagination

  • Standard: page=2&per_page=100 (max 10,000 results)
  • Deep: cursor=* (first call), then use returned next_cursor

Rate Limits

  • Anonymous: 10 requests/second, 100,000/day
  • With mailto: 100 requests/second, 1,000,000/day

API Response Format

All tools return the standard OpenAlex JSON envelope:

{
  "meta": {
    "count": 249256387,
    "db_response_time_ms": 12,
    "page": 1,
    "per_page": 25,
    "next_cursor": "ZjEwMD..."
  },
  "results": [
    {
      /* entity object */
    }
  ]
}

Development

# Watch mode during development
npm run watch

# Test with MCP inspector
npm run inspector

# Run basic functionality test
node test-simple.js

Environment Configuration

The server supports environment variables for configuration. Copy .env.example to .env and configure:

cp .env.example .env
# Edit .env with your settings

Environment Variables

  • OPENALEX_BEARER_TOKEN: Bearer token for authenticated API access (optional)
  • OPENALEX_DEFAULT_EMAIL: Default email for rate limiting when no mailto parameter provided

API Access Notes

  • Free Access: OpenAlex API is free and open
  • Rate Limits: 10 req/sec (anonymous) or 100 req/sec (with Bearer token or mailto)
  • Authentication: Bearer token automatically loaded from environment
  • Response Size: Use select parameter to limit response size for large datasets

Example with optimized response:

{
  "tool": "search_works",
  "arguments": {
    "search": "machine learning",
    "select": "id,display_name,publication_year,cited_by_count",
    "per_page": 10
  }
}

About OpenAlex

OpenAlex is a fully open catalog of the global research system, named after the ancient Library of Alexandria and created by the nonprofit OurResearch. It provides free, comprehensive metadata about scholarly works, authors, institutions, and more.

  • Website: https://openalex.org/
  • API Documentation: https://docs.openalex.org/
  • Data sources: Crossref, ORCID, ROR, Microsoft Academic Graph, and more

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
E2B

E2B

Using MCP to run code via e2b.

Official
Featured