OpenAlex MCP Server

OpenAlex MCP Server

Provides academic research tools via the OpenAlex API, enabling searches for papers, authors, concepts, institutions, and citation analysis.

Category
Visit Server

README

OpenAlex MCP Server

A production-ready Model Context Protocol (MCP) server that provides academic research tools using the OpenAlex API. Built with FastAPI and designed for deployment on Google Cloud Run.

Features

šŸ”¬ Academic Research Tools

  • search_works - Search for papers, articles, and academic publications
  • search_authors - Find researchers and their profiles with h-index, citations
  • get_work_details - Get detailed metadata for specific papers
  • search_concepts - Explore research topics and their relationships
  • search_institutions - Find universities and research organizations
  • get_citations - Analyze citation networks (citing/cited works)
  • advanced_filter - Complex multi-criteria searches with OpenAlex filter syntax

šŸš€ MCP Protocol Features

  • Streamable-HTTP transport - Modern HTTP/SSE-based communication
  • JSON-RPC 2.0 - Standard message protocol
  • Server-Sent Events - Real-time streaming for large responses
  • Cloud-ready - Optimized for Google Cloud Run deployment

šŸ“Š OpenAlex Polite Access

  • Automatic rate limiting (10 requests/second)
  • Mailto parameter for polite pool access
  • Proper User-Agent headers
  • Exponential backoff for retries

Quick Start

Local Development

  1. Clone and setup:
cd openalex-mcp-server
cp .env.example .env
# Edit .env and set your MAILTO_EMAIL
  1. Install dependencies:
pip install -r requirements.txt
  1. Run the server:
python server.py
# Or with uvicorn:
uvicorn server:app --reload --port 8080
  1. Test the endpoint:
curl http://localhost:8080/health

Docker Local Testing

docker build -t openalex-mcp-server .
docker run -p 8080:8080 -e MAILTO_EMAIL=your-email@williamscollege.edu openalex-mcp-server

Google Cloud Run Deployment

Prerequisites

  • Google Cloud SDK installed
  • GCP project with Cloud Run API enabled
  • Docker installed locally

Deployment Steps

  1. Authenticate with Google Cloud:
gcloud auth login
gcloud config set project YOUR_PROJECT_ID
  1. Build and push container:
# Configure Docker for Google Container Registry
gcloud auth configure-docker

# Build the image
docker build -t gcr.io/YOUR_PROJECT_ID/openalex-mcp-server .

# Push to Container Registry
docker push gcr.io/YOUR_PROJECT_ID/openalex-mcp-server
  1. Deploy to Cloud Run:
gcloud run deploy openalex-mcp-server \
  --image gcr.io/YOUR_PROJECT_ID/openalex-mcp-server \
  --platform managed \
  --region us-central1 \
  --allow-unauthenticated \
  --set-env-vars MAILTO_EMAIL=your-email@williamscollege.edu \
  --memory 512Mi \
  --cpu 1 \
  --max-instances 10
  1. Get your service URL:
gcloud run services describe openalex-mcp-server --platform managed --region us-central1 --format 'value(status.url)'

Alternative: One-Command Deployment

gcloud run deploy openalex-mcp-server \
  --source . \
  --region us-central1 \
  --allow-unauthenticated \
  --set-env-vars MAILTO_EMAIL=your-email@williamscollege.edu

MCP Client Configuration

Claude Desktop

Add to your claude_desktop_config.json:

{
  "mcpServers": {
    "openalex": {
      "transport": "streamable-http",
      "url": "https://YOUR-SERVICE-URL.run.app/mcp",
      "description": "OpenAlex academic research API"
    }
  }
}

Config file locations:

  • macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
  • Windows: %APPDATA%\Claude\claude_desktop_config.json
  • Linux: ~/.config/Claude/claude_desktop_config.json

Other MCP Clients

The server implements standard MCP over HTTP/SSE, so it works with any compatible client:

import httpx
import json

# Initialize request
init_message = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "initialize",
    "params": {
        "protocolVersion": "2025-03-26",
        "clientInfo": {
            "name": "my-client",
            "version": "1.0.0"
        }
    }
}

response = httpx.post(
    "https://YOUR-SERVICE-URL.run.app/mcp",
    json=init_message,
    headers={"Content-Type": "application/json"}
)

print(response.json())

Usage Examples

Search for Papers

{
  "jsonrpc": "2.0",
  "id": 2,
  "method": "tools/call",
  "params": {
    "name": "search_works",
    "arguments": {
      "query": "machine learning climate change",
      "publication_year": "2020-2024",
      "open_access": true,
      "limit": 10
    }
  }
}

Find Researchers

{
  "jsonrpc": "2.0",
  "id": 3,
  "method": "tools/call",
  "params": {
    "name": "search_authors",
    "arguments": {
      "name": "Andrew Ng",
      "institution": "Stanford",
      "limit": 5
    }
  }
}

Get Citation Network

{
  "jsonrpc": "2.0",
  "id": 4,
  "method": "tools/call",
  "params": {
    "name": "get_citations",
    "arguments": {
      "work_id": "W2741809807",
      "direction": "citing",
      "limit": 50
    }
  }
}

Advanced Filtering

{
  "jsonrpc": "2.0",
  "id": 5,
  "method": "tools/call",
  "params": {
    "name": "advanced_filter",
    "arguments": {
      "entity_type": "works",
      "filters": {
        "publication_year": ">2020",
        "cited_by_count": ">100",
        "is_oa": true
      },
      "search": "artificial intelligence",
      "sort": "cited_by_count:desc",
      "limit": 25
    }
  }
}

API Endpoints

POST /mcp

Main MCP endpoint for JSON-RPC messages. Supports both JSON and SSE responses.

Headers:

  • Content-Type: application/json (required)
  • Accept: application/json (JSON response) or Accept: text/event-stream (SSE)

GET /mcp

Optional SSE endpoint for server-initiated messages (keepalive, notifications).

Headers:

  • Accept: text/event-stream (required)

GET /health

Health check endpoint for monitoring.

Response: {"status": "healthy", "service": "openalex-mcp-server", "version": "1.0.0"}

GET /

Service information and endpoint documentation.

Environment Variables

Variable Default Description
PORT 8080 Server port (Cloud Run sets automatically)
MAILTO_EMAIL researcher@williamscollege.edu Required for polite pool access
OPENALEX_BASE_URL https://api.openalex.org OpenAlex API endpoint
MAX_REQUESTS_PER_SECOND 10 Rate limit for API calls
LOG_LEVEL INFO Logging verbosity
ALLOWED_ORIGINS None CORS allowed origins (comma-separated)

Architecture

openalex-mcp-server/
ā”œā”€ā”€ server.py              # FastAPI app with HTTP/SSE transport
ā”œā”€ā”€ mcp_handler.py         # MCP protocol & JSON-RPC handling
ā”œā”€ā”€ config.py              # Environment configuration
ā”œā”€ā”€ tools/
│   ā”œā”€ā”€ __init__.py
│   ā”œā”€ā”€ search.py          # OpenAlex API tool implementations
│   ā”œā”€ā”€ filters.py         # Filter utilities
│   └── utils.py           # Formatting helpers
ā”œā”€ā”€ Dockerfile             # Cloud Run deployment
ā”œā”€ā”€ requirements.txt       # Python dependencies
ā”œā”€ā”€ .env.example           # Environment template
ā”œā”€ā”€ claude_desktop_config.json  # Client config example
└── README.md

Development

Adding New Tools

  1. Implement the tool function in tools/search.py:
async def my_new_tool(param1: str, param2: int = 10) -> str:
    """Tool description."""
    # Implementation
    return json.dumps(result)
  1. Add to MCPHandler in mcp_handler.py:
self.tools = {
    # ... existing tools
    "my_new_tool": my_new_tool,
}
  1. Add schema in _get_tool_schema():
schemas = {
    # ... existing schemas
    "my_new_tool": {
        "description": "Tool description",
        "inputSchema": {
            "type": "object",
            "properties": {
                "param1": {"type": "string", "description": "..."},
                "param2": {"type": "integer", "default": 10}
            },
            "required": ["param1"]
        }
    }
}

Running Tests

# Install test dependencies
pip install pytest pytest-asyncio httpx

# Run tests
pytest

Monitoring Cloud Run

# View logs
gcloud run services logs read openalex-mcp-server --region us-central1

# Check service status
gcloud run services describe openalex-mcp-server --region us-central1

Security Considerations

Current Configuration (Development)

  • No authentication required
  • CORS allows all origins
  • Suitable for testing and internal use

Production Hardening

  1. Enable authentication:
gcloud run deploy openalex-mcp-server \
  --no-allow-unauthenticated
  1. Set allowed origins in .env:
ALLOWED_ORIGINS=https://yourdomain.com,https://anotherdomain.com
  1. Use Cloud Run IAM for access control
  2. Enable Cloud Armor for DDoS protection
  3. Set up VPC for private networking

Rate Limiting & Polite Access

The server implements OpenAlex polite pool best practices:

  • āœ… 10 requests/second rate limit (vs 6 req/s for non-polite)
  • āœ… Mailto parameter in all requests
  • āœ… User-Agent header with contact info
  • āœ… Exponential backoff for errors
  • āœ… Response caching (where appropriate)

Always set MAILTO_EMAIL to get better rate limits!

Troubleshooting

Issue: "Origin not allowed"

Solution: Set ALLOWED_ORIGINS environment variable or update CORS middleware in server.py

Issue: Rate limiting errors

Solution: Verify MAILTO_EMAIL is set correctly for polite pool access

Issue: Cloud Run timeout

Solution: Increase timeout in deployment:

gcloud run deploy openalex-mcp-server --timeout 300

Issue: Memory errors

Solution: Increase memory allocation:

gcloud run deploy openalex-mcp-server --memory 1Gi

Resources

Contributing

Contributions welcome! Please:

  1. Fork the repository
  2. Create a feature branch
  3. Add tests for new functionality
  4. Submit a pull request

License

MIT License - See LICENSE file for details

Support

For issues and questions:

  • OpenAlex API: support@openalex.org
  • Williams College: Contact your research computing support

Built for Williams College undergraduate researchers šŸŽ“

Happy researching! šŸ”¬

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured