MinerU MCP Server

MinerU MCP Server

Exposes the MinerU document-to-markdown API as MCP tools for converting PDF, Word, PPT, and images into Markdown. It supports both local and remote file processing with integrated OCR capabilities for multiple languages.

Category
Visit Server

README

MinerU MCP Server

基于 FastMCP 3.x 的 MCP 服务器,将 MinerU 的文档转 Markdown API 暴露为 MCP 工具。

支持 PDF、Word、PPT 及图片格式(JPG、PNG)的解析。

工具

工具 说明
parse_documents 将文件转换为 Markdown(支持本地路径和 URL,可批量处理)
get_ocr_languages 获取 OCR 支持的语言列表

安装

pip install mineru-mcp-server

从源码安装:

git clone https://github.com/Tongzhao9417/mineru_mcp.git
cd mineru_mcp
pip install -e .

核心依赖仅 3 个:fastmcp>=3.0.0python-dotenv>=1.0.0httpx>=0.24.0

环境变量

在项目根目录创建 .env 文件(参考 .env.example),或直接设置环境变量:

变量 说明 默认值
MINERU_API_KEY MinerU API 密钥(官网申请 必填
MINERU_API_BASE 远程 API 基础 URL https://mineru.net
OUTPUT_DIR 转换结果保存路径 ./downloads
USE_LOCAL_API 是否使用本地 API false
LOCAL_MINERU_API_BASE 本地 API 地址(USE_LOCAL_API=true 时生效) http://localhost:8080

MCP 客户端配置

Claude Code / Claude Desktop

{
  "mcpServers": {
    "mineru-mcp": {
      "command": "mineru-mcp-server",
      "env": {
        "MINERU_API_KEY": "your-api-key"
      }
    }
  }
}

从源码运行

{
  "mcpServers": {
    "mineru-mcp": {
      "command": "uv",
      "args": ["--directory", "/path/to/mineru_mcp", "run", "mineru-mcp-server"],
      "env": {
        "MINERU_API_KEY": "your-api-key"
      }
    }
  }
}

直接运行

# stdio 模式(默认,MCP 客户端自动管理)
mineru-mcp-server

# HTTP 模式(独立服务,多客户端可连接)
mineru-mcp-server --transport streamable-http --port 8001

推荐使用 streamable-http 传输模式(MCP 规范推荐),SSE 已被标记为 deprecated。

parse_documents 参数

参数 类型 说明 默认值
file_sources string 文件路径或 URL,多个用逗号分隔 必填
enable_ocr bool 启用 OCR false
language string 文档语言(ch/en 等) ch
page_ranges string 页码范围,如 "2,4-6"(仅远程 API) None

常见问题

API 返回 401:检查 MINERU_API_KEY 是否正确设置。

找不到文件:请使用绝对路径。

调用超时:大文档处理耗时较长,建议分批处理或使用本地 API 模式。

License

MIT

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured