semantic-scholar-mcp
Enables searching academic papers on Semantic Scholar and downloading their PDFs, with support for multiple publishers.
README
Semantic Scholar MCP Server
一个基于FastMCP框架的Semantic Scholar学术搜索引擎工具,可以搜索学术论文并下载PDF文件。支持通过MCP协议与各种AI助手集成。
GitHub仓库: https://github.com/cktbarking/semanticMCP.git
功能特性
- 🔍 学术论文搜索 - 搜索Semantic Scholar数据库
- 📄 PDF下载 - 自动识别出版社并下载论文PDF
- 🔗 DOI解析 - 支持通过DOI获取论文信息
- 🏢 多出版社支持 - 支持IEEE、ACM、Elsevier、Springer、arXiv等
- 🛠️ MCP协议 - 兼容Model Context Protocol
快速开始
使用uvx运行(推荐)
# 从GitHub仓库安装并运行MCP服务器
uvx --from git+https://github.com/cktbarking/semanticMCP.git semantic-scholar-mcp --transport stdio
# 或者使用简写方式
uvx cktbarking/semanticMCP --transport stdio
# 或者指定版本
uvx semantic-scholar-mcp@latest --transport stdio
本地开发
# 克隆项目
git clone https://github.com/cktbarking/semanticMCP.git
cd semanticMCP
# 安装依赖
uv sync
# 运行服务器
uv run semantic-scholar-mcp --transport stdio
传统方式
# 安装依赖
pip install -r requirements.txt
# 运行服务
python main.py
MCP工具
1. search_papers - 搜索论文
搜索Semantic Scholar数据库中的学术论文。
参数:
query(str, 必需): 搜索查询字符串limit(int, 可选): 返回结果数量,默认5,最大100
返回: 包含论文标题、作者、摘要、年份、引用次数、DOI、PDF链接等信息的列表。
2. download_paper - 下载论文PDF
根据DOI下载论文PDF文件,自动识别出版社。
参数:
doi(str, 必需): 论文的DOI标识符output_dir(str, 可选): PDF输出目录,默认"./pdfs"
支持的出版社:
- IEEE (10.1109/*)
- ACM (10.1145/*)
- Elsevier (10.1016/*)
- Springer (10.1007/*)
- arXiv (10.48550/arXiv.*)
返回: 下载结果信息,包括成功状态、文件路径、出版社等。
响应格式
搜索响应示例
{
"success": true,
"query": "machine learning",
"count": 5,
"results": [
{
"title": "Deep Learning for Computer Vision",
"authors": "John Doe, Jane Smith",
"abstract": "This paper presents...",
"year": 2023,
"citation_count": 150,
"article_url": "https://...",
"pdf_url": "https://...",
"doi": "10.1109/...",
"doi_link": "https://doi.org/10.1109/..."
}
]
}
下载响应示例
{
"success": true,
"message": "PDF下载成功",
"file_path": "./pdfs/10.1109_example.pdf",
"doi": "10.1109/example",
"publisher": "ieee"
}
配置MCP客户端
Claude Desktop配置
在Claude Desktop的配置文件中添加:
{
"mcpServers": {
"semantic-scholar": {
"command": "uvx",
"args": ["--from", "git+https://github.com/cktbarking/semanticMCP.git", "semantic-scholar-mcp", "--transport", "stdio"]
}
}
}
其他MCP客户端
支持任何兼容MCP协议的客户端,只需配置命令为:
uvx semantic-scholar-mcp --transport stdio
开发说明
项目结构
semantic-scholar-mcp/
├── main.py # MCP服务器主文件
├── publishers/ # 出版社下载器
│ ├── ieee_downloader.py
│ ├── acm_downloader.py
│ ├── elsevier_downloader.py
│ ├── springer_downloader.py
│ ├── arxiv_downloader.py
│ └── generic_downloader.py
├── pyproject.toml # 项目配置
├── requirements.txt # 依赖列表
└── README.md # 说明文档
添加新的出版社支持
- 在
publishers/目录下创建新的下载器文件 - 实现下载函数,接受
doi,doi_link,output_dir参数 - 在
publishers_dict.py的PUBLISHER_DICT字典中添加DOI前缀到出版社的映射 - 在
download_paper_by_doi()函数中添加对应的处理分支
注意事项
- API限制 - Semantic Scholar API有调用频率限制,请合理使用
- 版权合规 - 下载的PDF文件仅供个人学术研究使用
- 网络要求 - 某些出版社可能需要机构访问权限,请合理使用。
- Python版本 - 需要Python 3.10或更高版本
许可证
GNU General Public License v3.0 - 详见LICENSE文件
贡献
欢迎提交Issue和Pull Request来改进这个项目!
致谢
- Semantic Scholar 提供优秀的学术搜索API
- FastMCP 简化MCP服务器开发
- 各大学术出版社提供的研究资源
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.