mcp-pdf-utils
An MCP server for local PDF manipulation including merging, splitting, rotating, watermarking, and text extraction. It works with various MCP-compatible clients and processes PDFs entirely on-device without cloud services.
README
<div align="center">
mcp-pdf-utils
MCP server for PDF manipulation. Merge, split, rotate, watermark, extract text, and more.
Works with Claude Desktop, Cursor, VS Code, ChatGPT, and any MCP-compatible client.
</div>
Why
PDF manipulation is one of the most common developer tasks, yet there's no good MCP server for it. The existing options have 1 star. This one actually works.
Everything runs locally. No API keys. No cloud services. Your PDFs never leave your machine.
Tools
| Tool | Description |
|---|---|
pdf_info |
Get metadata, page count, file size |
pdf_extract_text |
Extract text content from pages |
pdf_merge |
Combine multiple PDFs into one |
pdf_split |
Split PDF into individual page files |
pdf_extract_pages |
Extract a page range into a new PDF |
pdf_rotate |
Rotate pages by 90°, 180°, or 270° |
pdf_add_watermark |
Add text watermark (e.g. DRAFT, CONFIDENTIAL) |
pdf_delete_pages |
Remove specific pages |
pdf_reorder |
Rearrange page order |
pdf_set_metadata |
Set title, author, subject, keywords |
pdf_create |
Create a new PDF from plain text |
Quick Start
Claude Desktop
Add to your claude_desktop_config.json:
{
"mcpServers": {
"pdf-tools": {
"command": "npx",
"args": ["-y", "mcp-pdf-utils"]
}
}
}
That's it. Restart Claude Desktop and start asking it to work with PDFs.
Claude Code
claude mcp add pdf-tools -- npx -y mcp-pdf-utils
Cursor
Add to your Cursor MCP settings:
{
"mcpServers": {
"pdf-tools": {
"command": "npx",
"args": ["-y", "mcp-pdf-utils"]
}
}
}
VS Code
Add to your VS Code MCP config:
{
"mcp": {
"servers": {
"pdf-tools": {
"command": "npx",
"args": ["-y", "mcp-pdf-utils"]
}
}
}
}
Usage Examples
Once connected, just talk to your AI naturally:
- "Merge invoice_jan.pdf and invoice_feb.pdf into combined.pdf"
- "Extract pages 3-7 from report.pdf"
- "Add a CONFIDENTIAL watermark to contract.pdf"
- "How many pages does presentation.pdf have?"
- "Rotate all pages in scan.pdf by 90 degrees"
- "Remove page 5 from my document"
- "Extract all the text from this PDF"
- "Reorder the pages: put page 3 first, then 1, then 2"
Development
git clone https://github.com/zekovdev/mcp-pdf-utils.git
cd mcp-pdf-utils
npm install
npm run dev
Test with MCP Inspector
npm run inspect
This opens the official MCP Inspector where you can test every tool visually.
How It Works
The server uses pdf-lib for PDF manipulation and pdf-parse for text extraction. Both run locally with zero external dependencies — no API keys, no network calls, no cloud services.
Communication uses the MCP stdio transport, meaning the server runs as a local process that your AI client spawns and talks to via stdin/stdout.
Roadmap
- [ ] PDF compression (reduce file size)
- [ ] Image extraction from PDFs
- [ ] PDF to images (page thumbnails)
- [ ] Add page numbers
- [ ] PDF encryption / password protection
- [ ] Fill PDF form fields
- [ ] OCR for scanned PDFs (via Tesseract)
- [ ] Image to PDF conversion
Contributing
PRs welcome. Run npm run inspect to test your changes before submitting.
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.