pdf-agent-mcp

pdf-agent-mcp

A local MCP server that extracts text-layer content from PDF files, enabling AI agents to inspect, extract text, outlines, and page content.

Category
Visit Server

README

pdf-agent-mcp

<p align="right"> <a href="#readme-zh">中文</a> | <a href="#readme-en">English</a> </p>

<a id="readme-zh"></a>中文

pdf-agent-mcp 是一个本地 MCP 服务,给 AI agent 提供 PDF 文本层读取能力。

工具列表

  • inspect_pdf:检查 PDF 基本信息、页数、是否可能有文本层
  • extract_pdf_text:按 raw / lines / blocks 抽取文本
  • extract_pdf_outline:提取 PDF 目录(书签)
  • extract_pdf_page:提取单页文本项和坐标

使用说明

环境要求:Node.js 22+

npm install
npm run dev
npm run lint
npm test
npm run build

推荐直接用 npx 启动:

npx -y github:sanhua1/pdf-agent-mcp

Agent 自然语言交互示例

在 Claude/Codex 里可直接说:

  1. 先帮我 inspect 这个 PDF:/path/to/doc.pdf
  2. 把 1-5 页按 lines 模式提取出来
  3. 第 10 页排版乱,改用 blocks 模式再提取一次
  4. 先读取目录,再按章节整理成 Markdown 摘要

Claude Code 配置方法

{
  "mcpServers": {
    "pdf-agent-mcp": {
      "command": "npx",
      "args": ["-y", "github:sanhua1/pdf-agent-mcp"]
    }
  }
}

Codex 配置方法

[mcp_servers.pdf-agent-mcp]
command = "npx"
args = ["-y", "github:sanhua1/pdf-agent-mcp"]

<a id="readme-en"></a>English

pdf-agent-mcp is a local MCP server for extracting text-layer content from PDF files.

Tools

  • inspect_pdf: inspect metadata, page count, and text-layer hints
  • extract_pdf_text: extract text in raw / lines / blocks modes
  • extract_pdf_outline: read PDF bookmarks/outlines
  • extract_pdf_page: extract text items with coordinates from a single page

Quick Start

Requirement: Node.js 22+

npm install
npm run dev

Run with npx:

npx -y github:sanhua1/pdf-agent-mcp

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured