google-scholar-mcp
Enables AI assistants to search and analyze Google Scholar publications, authors, citations, and download papers.
README
Google Scholar MCP Server
<div align="center">
The most comprehensive Google Scholar MCP server for AI assistants
Installation ⢠Quick Start ⢠Tools ⢠API Reference ⢠Contributing
</div>
A powerful Model Context Protocol (MCP) server that connects AI assistants like Claude to Google Scholar. Search academic publications, find researchers, analyze citations, download papers, and perform advanced research workflows.
š Features
Tools (11 Total)
| Tool | Description |
|---|---|
search_publications |
Search for academic papers by topic, author, or date range |
search_author |
Find researchers by name, field, or institution |
get_author_profile |
Get detailed author info including h-index, publications, and coauthors |
get_citations |
Find papers that cite a specific publication |
generate_bibtex |
Generate BibTeX entries for citations |
get_related_articles |
Find related articles for a publication |
get_all_versions |
Get all available versions of a paper |
download_paper |
Download and store papers locally for offline access |
list_papers |
List all locally stored papers |
read_paper |
Read content of a stored paper |
advanced_search |
Advanced search with language, patent, and date filters |
Prompts (6 Total)
| Prompt | Description |
|---|---|
literature_review |
Structured approach to conducting literature reviews |
find_expert |
Find leading researchers in a specific field |
citation_analysis |
Analyze the citation impact of a publication |
deep_paper_analysis |
Comprehensive multi-step paper analysis workflow |
research_synthesis |
Synthesize research across multiple papers |
methodology_comparison |
Compare methodologies across research papers |
Key Advantages
- 11 Powerful Tools: Search, citations, related articles, versions, downloads & more
- Advanced Search: Filter by language, patents, review articles, and more
- Local Paper Storage: Download and cache papers for offline access
- Intelligent Caching: Reduce redundant requests with TTL-based caching
- Rich Metadata: Abstracts, citation counts, h-index, coauthors, and more
- Rate Limiting: Built-in protection against Google Scholar blocking
- Error Handling: Graceful handling of rate limits and CAPTCHAs
- BibTeX Support: Generate proper academic citations
- Advanced Prompts: Pre-built workflows for deep paper analysis and research synthesis
š¦ Installation
Via Smithery (Recommended)
npx -y @smithery/cli install @dihannahdi/google-scholar-mcp --client claude
Via npm
npm install -g @dihannahdi/google-scholar-mcp
From Source
git clone https://github.com/dihannahdi/google-scholar-mcp.git
cd google-scholar-mcp
npm install
npm run build
š Quick Start
Running the Server
# If installed globally
google-scholar-mcp
# From source
npm start
# Development mode
npm run dev
Configuration for Claude Desktop
Add to your claude_desktop_config.json:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
{
"mcpServers": {
"google-scholar": {
"command": "node",
"args": ["/path/to/google-scholar-mcp/dist/index.js"],
"env": {
"SCHOLAR_STORAGE_PATH": "~/.google-scholar-mcp/papers",
"SCHOLAR_RATE_LIMIT_MS": "3000",
"SCHOLAR_CACHE_ENABLED": "true",
"SCHOLAR_CACHE_TTL_MS": "3600000",
"SERPAPI_KEY": "your_serpapi_key_here",
"SCHOLAR_USE_SERPAPI_FALLBACK": "true"
}
}
}
}
Or using npx:
{
"mcpServers": {
"google-scholar": {
"command": "npx",
"args": ["-y", "@dihannahdi/google-scholar-mcp"]
}
}
}
Environment Variables
| Variable | Description | Default |
|---|---|---|
SCHOLAR_STORAGE_PATH |
Path to store downloaded papers | ~/.google-scholar-mcp/papers |
SCHOLAR_RATE_LIMIT_MS |
Rate limit between requests (ms) | 3000 |
SCHOLAR_CACHE_ENABLED |
Enable response caching | true |
SCHOLAR_CACHE_TTL_MS |
Cache time-to-live (ms) | 3600000 (1 hour) |
SCHOLAR_PROXY_URL |
Optional proxy URL | - |
SERPAPI_KEY |
SerpAPI key for reliable fallback (get free key) | - |
SCHOLAR_USE_SERPAPI_FALLBACK |
Enable automatic SerpAPI fallback when blocked | false |
SCHOLAR_ROTATE_USER_AGENT |
Rotate user agents to avoid detection | true |
SCHOLAR_ENABLE_JITTER |
Add random delays between requests | true |
SCHOLAR_JITTER_MAX_MS |
Maximum jitter delay in milliseconds | 2000 |
š SerpAPI Fallback (Recommended)
Google Scholar aggressively blocks automated access. To ensure 100% uptime, we recommend using SerpAPI as a fallback:
- Get a free API key at serpapi.com (100 free searches/month)
- Set environment variables:
export SERPAPI_KEY="your_serpapi_key" export SCHOLAR_USE_SERPAPI_FALLBACK="true"
How it works:
- The server first attempts direct scraping (free, but may be blocked)
- If blocked, it automatically falls back to SerpAPI (reliable, paid)
- Results are normalized to the same format regardless of source
Benefits of SerpAPI:
- ā 100% uptime guarantee
- ā No CAPTCHA or blocking issues
- ā Handles all anti-bot measures
- ā Rich structured data
- ā Author profiles, citations, related articles
š Usage Examples
Search for Publications
Search for recent machine learning papers about transformers
Tool: search_publications
Arguments:
query: "transformer neural network"
yearStart: 2020
numResults: 10
sortBy: "relevance"
Find an Author
Find researchers working on quantum computing at MIT
Tool: search_author
Arguments:
query: "quantum computing"
organization: "MIT"
numResults: 5
Get Author Profile
Get detailed profile for Geoffrey Hinton
Tool: get_author_profile
Arguments:
scholarId: "JicYPdAAAAAJ"
Get Citations
Find papers citing "Attention Is All You Need"
Tool: get_citations
Arguments:
clusterId: "4054916225996727837"
numResults: 10
sortBy: "date"
Generate BibTeX
Create BibTeX for a paper
Tool: generate_bibtex
Arguments:
title: "Attention Is All You Need"
authors: ["Ashish Vaswani", "Noam Shazeer", "Niki Parmar"]
year: 2017
venue: "Advances in Neural Information Processing Systems"
š§ API Reference
search_publications
Search Google Scholar for academic publications.
Parameters:
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| query | string | Yes | - | Search query |
| author | string | No | - | Filter by author name |
| yearStart | number | No | - | Start year for filter |
| yearEnd | number | No | - | End year for filter |
| numResults | number | No | 10 | Number of results (1-20) |
| sortBy | string | No | "relevance" | "relevance" or "date" |
Returns:
- List of publications with title, authors, abstract, venue, year, citation count, URLs
search_author
Search for academic researchers.
Parameters:
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| query | string | Yes | - | Author name or keywords |
| organization | string | No | - | Filter by institution |
| numResults | number | No | 10 | Number of results (1-20) |
Returns:
- List of author profiles with name, affiliation, interests, citation count
get_author_profile
Get detailed author information.
Parameters:
| Parameter | Type | Required | Description |
|---|---|---|---|
| scholarId | string | Yes | Google Scholar author ID |
Returns:
- Complete profile with publications, h-index, i10-index, coauthors, citation history
get_citations
Get papers that cite a specific publication.
Parameters:
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| clusterId | string | Yes | - | Google Scholar cluster ID |
| numResults | number | No | 10 | Number of results (1-20) |
| sortBy | string | No | "relevance" | "relevance" or "date" |
Returns:
- List of citing publications with details
generate_bibtex
Generate a BibTeX citation entry.
Parameters:
| Parameter | Type | Required | Description |
|---|---|---|---|
| title | string | Yes | Publication title |
| authors | string[] | Yes | List of author names |
| year | number | No | Publication year |
| venue | string | No | Journal/conference name |
| url | string | No | Publication URL |
Returns:
- Formatted BibTeX entry
get_related_articles
Find related articles for a given publication.
Parameters:
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| clusterId | string | Yes | - | Google Scholar cluster ID |
| numResults | number | No | 10 | Number of results (1-20) |
Returns:
- List of related publications with metadata
get_all_versions
Get all available versions of a paper.
Parameters:
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| clusterId | string | Yes | - | Google Scholar cluster ID |
| numResults | number | No | 10 | Number of results (1-20) |
Returns:
- List of all versions (preprint, published, etc.)
download_paper
Download and store a paper locally.
Parameters:
| Parameter | Type | Required | Description |
|---|---|---|---|
| url | string | Yes | URL to the paper (PDF, arXiv, etc.) |
| filename | string | Yes | Name for the saved file |
| metadata | object | No | Optional metadata (title, authors, year) |
Returns:
- Path to stored file and confirmation
list_papers
List all locally stored papers.
Parameters:
| Parameter | Type | Required | Description |
|---|---|---|---|
| pattern | string | No | Optional filter pattern |
Returns:
- List of stored papers with metadata
read_paper
Read the content of a stored paper.
Parameters:
| Parameter | Type | Required | Description |
|---|---|---|---|
| filename | string | Yes | Name of the stored file |
Returns:
- Paper content (text format)
advanced_search
Advanced search with additional filters.
Parameters:
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| query | string | Yes | - | Search query |
| exactPhrase | string | No | - | Exact phrase to match |
| withoutWords | string | No | - | Words to exclude |
| author | string | No | - | Filter by author |
| source | string | No | - | Filter by source/journal |
| yearStart | number | No | - | Start year filter |
| yearEnd | number | No | - | End year filter |
| language | string | No | "en" | Language code |
| includePatents | boolean | No | true | Include patents in results |
| includeCitations | boolean | No | true | Include citations |
| numResults | number | No | 10 | Number of results |
Returns:
- List of publications matching criteria
š ļø Development
Project Structure
google-scholar-mcp/
āāā src/
ā āāā index.ts # MCP server entry point
ā āāā config/
ā ā āāā index.ts # Environment configuration
ā āāā types/
ā ā āāā index.ts # TypeScript type definitions
ā āāā tools/
ā ā āāā definitions.ts # Tool schemas (11 tools)
ā ā āāā handlers.ts # Tool implementation
ā ā āāā index.ts
ā āāā scraper/
ā ā āāā scholar.ts # Google Scholar scraper
ā ā āāā index.ts
ā āāā utils/
ā āāā helpers.ts # URL builders, rate limiting
ā āāā cache.ts # In-memory caching
ā āāā storage.ts # Local paper storage
ā āāā index.ts
āāā dist/ # Compiled JavaScript
āāā package.json
āāā tsconfig.json
āāā README.md
Building
npm run build
Testing
# Run MCP Inspector for debugging
npx @modelcontextprotocol/inspector node dist/index.js
ā ļø Limitations
-
Rate Limiting: Google Scholar may rate-limit or block requests. The server includes delays and retries, but heavy usage may trigger blocks.
-
CAPTCHA: Excessive requests may trigger CAPTCHA challenges. If this happens, wait a few minutes before retrying.
-
No Official API: This server scrapes Google Scholar's web interface, which may break if Google changes their HTML structure.
-
Results Limited: Maximum 20 results per request to stay within reasonable limits.
š¤ Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature) - Commit your changes (
git commit -m 'Add some AmazingFeature') - Push to the branch (
git push origin feature/AmazingFeature) - Open a Pull Request
š License
This project is licensed under the MIT License - see the LICENSE file for details.
š Acknowledgments
- Model Context Protocol for the MCP SDK
- Scholarly for inspiration
- The academic community for making research accessible
š¬ Support
For issues and questions:
- Open an issue on GitHub
- Check the MCP documentation
Made with ā¤ļø for the research community
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.