pdf-mcp
Enables PDF document processing including text, image, and table extraction, as well as intelligent classification and similarity analysis across multiple languages.
README
PDF-MCP 
📄 PDF-MCP 服务
高性能 PDF 文档处理服务,支持文本、图片、表格提取及高级分析。
✨ 主要特性
- 📜 文本提取:多语言支持,保留格式。
- 🖼️ 图片处理:提取与优化。
- 📊 表格识别:结构化数据输出。
- 🧠 智能分类:基于深度学习。
- 🔍 相似度分析:跨语言比较。
- 🌐 多语言支持:100+ 种语言。
💻 系统要求
- 🖥️ 硬件:2 核 CPU,4GB 内存。
- ⚙️ 软件:Python 3.10+,可选 CUDA 支持。
🚀 快速开始
- 🗂️ 克隆仓库并进入目录:
git clone https://github.com/saury1120/pdf-mcp.git cd pdf-mcp - 🛠️ 创建虚拟环境并安装依赖:
uv venv source .venv/bin/activate uv pip install -r requirements.txt - ▶️ 启动服务:
uv run pdf_reader
Claude Desktop 配置
- 找到配置文件:
- macOS:
~/Library/Application Support/Claude/claude_desktop_config.json - Windows:
%AppData%/Claude/claude_desktop_config.json
- macOS:
- 添加以下配置:
{
"mcpServers": {
"pdf_reader": {
"command": "uv",
"args": [
"--directory",
"/path/to/pdf-mcp", # 替换为实际路径
"run",
"pdf_reader"
]
}
}
}
PDF-MCP Service
A high-performance PDF document processing service supporting text, image, table extraction, and advanced analysis.
✨ Key Features
- 📜 Text Extraction: Multilingual support, retains formatting.
- 🖼️ Image Processing: Extraction and optimization.
- 📊 Table Recognition: Structured data output.
- 🧠 Intelligent Classification: Based on deep learning.
- 🔍 Similarity Analysis: Cross-language comparison.
- 🌐 Multilingual Support: 100+ languages.
💻 System Requirements
- 🖥️ Hardware: 2-core CPU, 4GB RAM.
- ⚙️ Software: Python 3.10+, optional CUDA support.
🚀 Quick Start
- 🗂️ Clone the repository and enter the directory:
git clone https://github.com/saury1120/pdf-mcp.git cd pdf-mcp - 🛠️ Create a virtual environment and install dependencies:
uv venv source .venv/bin/activate uv pip install -r requirements.txt - ▶️ Start the service:
uv run pdf_reader
Claude Desktop
{
"mcpServers": {
"pdf_reader": {
"command": "uv",
"args": [
"--directory",
"/path/to/pdf-mcp", # 替换为实际路径
"run",
"pdf_reader"
]
}
}
}
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.