pdf-navigator-mcp
Provides PDF reading, text search, navigation, and form filling capabilities through MCP, with cross-platform PDF viewer integration for seamless document interaction.
README
PDF Navigator MCP
A comprehensive Model Context Protocol (MCP) server for PDF reading, navigation, and text search with cross-platform PDF viewer integration. Eliminates PyMuPDF dependency issues by providing PDF functionality through MCP.
Features
- PDF text extraction - Read full PDFs or specific pages/ranges
- PDF structure analysis - Extract table of contents and page summaries
- Text search with location - Find text and jump to results
- Direct PDF navigation - Open PDFs to specific pages
- PDF form filling - Extract form fields to markdown, edit, and fill PDFs
- Cross-platform PDF viewers - Supports Skim, Zathura, Evince, and more
- MCP integration - Works with Claude Code and other MCP clients
- No dependency issues - PyMuPDF isolated in MCP server environment
Installation
# Install with pipx (recommended)
pipx install git+https://github.com/matsengrp/pdf-navigator-mcp.git
# Or install in current environment
pip install git+https://github.com/matsengrp/pdf-navigator-mcp.git
Claude Code Integration
Add to your ~/.claude.json:
{
"mcpServers": {
"pdf-navigator": {
"type": "stdio",
"command": "pdf-navigator-mcp"
}
}
}
Usage
In Claude Code, you can:
- "Read the abstract from paper.pdf" → Extracts and shows text content
- "What's the table of contents for paper.pdf?" → Shows PDF structure
- "Read pages 5-10 of paper.pdf" → Extracts specific page range
- "Search for 'parameter efficiency' in paper.pdf" → Finds text and locations
- "Open paper.pdf to page 5" → Opens PDF viewer to specific page
- "Extract form fields from application.pdf" → Creates markdown file with form fields
- "Fill the PDF form with my data" → Fills PDF using edited markdown data
MCP Tools
Reading Tools
read_pdf_text(file_path, start_page, end_page)- Extract text from page rangeread_pdf_page(file_path, page_number)- Extract text from single pageget_pdf_structure(file_path)- Get table of contents and page summariesget_pdf_info(file_path)- Get document metadata
Navigation Tools
search_pdf_text(file_path, query)- Search text and return locationsopen_pdf_page(file_path, page_number)- Open PDF viewer to specific pagesearch_and_open(file_path, query, result_index)- Search and open to result
Form Filling Tools
extract_form_to_markdown(file_path, output_md_path)- Extract form fields to markdown with multi-line detectionfill_form_from_markdown(pdf_path, markdown_path, output_pdf_path, distribute_text=True, max_chars_per_field=50, respect_line_breaks=True)- Fill PDF from markdown with intelligent text distribution
PDF Form Filling Workflow
The PDF form filling feature uses a markdown-based workflow:
- Extract form fields - Analyze the PDF and create a markdown file with all detected fields
- Edit the markdown - Fill in values using any text editor
- Fill the PDF - Apply the markdown data back to create a filled PDF
Example Workflow
# Step 1: Extract form fields to markdown
# Creates a markdown file with placeholders for each field
extract_form_to_markdown("application.pdf", "application_form.md")
# Step 2: Edit application_form.md in your editor
# Fill in values after each arrow (→)
# Step 3: Fill the PDF with your data
fill_form_from_markdown("application.pdf", "application_form.md", "application_filled.pdf")
Markdown Format
The extracted markdown looks like:
# PDF Form: application.pdf
Type: Interactive Form
Generated: 2025-08-03
## Form Fields
### Page 1
- Full Name → John Smith
- Email → john@example.com
- Phone → 555-0123
- [ ] Subscribe to newsletter → true
Form Types Supported
- Interactive Forms - PDFs with actual form fields (fillable PDFs)
- Static Forms - PDFs with underlines/boxes (creates moveable text annotations)
Enhanced Multi-line Form Detection
The PDF Navigator now includes advanced multi-line form detection and intelligent text distribution:
Features
- Multi-line Section Detection - Automatically detects when multiple consecutive blank lines follow a section header (e.g., "I love..." followed by several underscores)
- Smart Text Distribution - Distributes long text across multiple related fields using natural break points
- Natural Break Points - Respects sentences, commas, conjunctions, and explicit line breaks
- Configurable Parameters - Control text distribution behavior
Text Distribution Strategies
- Sentence splitting - "I love reading. Playing games is fun." → separate fields
- Comma/semicolon splitting - "Reading books, playing games, going to parks" → separate fields
- Conjunction splitting - "Reading and playing and going" → separate fields
- Word boundary splitting - Intelligent length-based splitting while preserving whole words
Configuration Options
distribute_text: bool- Enable/disable multi-line text distribution (default: True)max_chars_per_field: int- Target character limit per field (default: 50)respect_line_breaks: bool- Honor newlines in input text (default: True)
Example
Instead of cramming "Reading books with my parents, doing puzzles and addition, going on trips, anything with my big sister" into one tiny field, it automatically distributes as:
- Field 1: "Reading books with my parents"
- Field 2: "doing puzzles and addition"
- Field 3: "going on trips"
- Field 4: "anything with my big sister"
Form Filling Best Practices
For optimal text distribution in multi-line fields:
- personal_interests_love_1 (I love...) → Reading books with my parents
Doing puzzles and addition
Going on trips
Anything with my big sister
The newlines enable intelligent distribution across multiple PDF fields, preventing cramped text. Use the extract_and_fill_form and format_multiline_form_data MCP prompts for guided workflows.
Supported PDF Readers
- Skim (macOS) -
skim://URL scheme - Zathura (Linux) -
--pageargument - Evince (Linux) -
--page-indexargument - SumatraPDF (Windows) -
-pageargument - Adobe Acrobat (Cross-platform) -
/A page=Nargument
Configuration
Configure your PDF reader in ~/.pdf-navigator-config.json:
{
"pdf_reader": "skim",
"reader_path": "/Applications/Skim.app"
}
Development
git clone https://github.com/matsengrp/pdf-navigator-mcp.git
cd pdf-navigator-mcp
pip install -e ".[dev]"
License
MIT License
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.