MCP PDF Reader
An MCP server that enables AI assistants to read, search, and analyze PDF files from local paths or URLs. It provides tools for extracting specific page ranges, searching for terms, and retrieving document metadata.
README
MCP PDF Reader Server
A Model Context Protocol (MCP) server that enables AI assistants like Claude, Windsurf, and other MCP-compatible tools to read and analyze PDF files.
Features
- 📄 Read PDF files from local paths or URLs
- 🔍 Search for specific terms within PDFs
- 📊 Get PDF metadata (page count, author, title, etc.)
- 📖 Read specific pages or page ranges
- 🌐 Download and analyze PDFs from the web
- 🤖 Seamless integration with AI assistants via MCP
Installation
Using npm
npm install -g @dev.saqibaziz/mcp-pdf-reader
Using npx (no installation required)
npx @dev.saqibaziz/mcp-pdf-reader
Configuration
For Claude Desktop
Add this to your Claude Desktop configuration file:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
{
"mcpServers": {
"pdf-reader": {
"command": "npx",
"args": ["-y", "@dev.saqibaziz/mcp-pdf-reader"]
}
}
}
For Windsurf
Option 1: Via Plugin Store (Coming Soon)
- Open Windsurf
- Click the
Pluginsicon in Cascade panel - Search for "PDF Reader"
- Click
Install
Option 2: Manual Configuration
Edit your MCP configuration file:
macOS: ~/Library/Application Support/Windsurf/User/globalStorage/codeium.codeium/mcp_config.json
Windows: %APPDATA%\Windsurf\User\globalStorage\codeium.codeium\mcp_config.json
Linux: ~/.config/Windsurf/User/globalStorage/codeium.codeium/mcp_config.json
{
"mcpServers": {
"pdf-reader": {
"command": "npx",
"args": ["-y", "@dev.saqibaziz/mcp-pdf-reader"]
}
}
}
After adding, click the refresh button in the Plugins panel.
For Cline
Add this to your MCP settings file:
{
"mcpServers": {
"pdf-reader": {
"command": "npx",
"args": ["-y", "@dev.saqibaziz/mcp-pdf-reader"]
}
}
}
Available Tools
The server provides the following tools to AI assistants:
read_pdf_pages
Read specific pages from a PDF file.
Parameters:
pdf_path(string): Path to the PDF filepages(array): List of page numbers to read
read_pdf_page_range
Read a range of pages from a PDF file.
Parameters:
pdf_path(string): Path to the PDF filestart_page(number): First page to read (inclusive)end_page(number): Last page to read (inclusive)
search_pdf
Search for terms in a PDF file and return pages containing them.
Parameters:
pdf_path(string): Path to the PDF fileterms(string or array): Search term(s)
get_pdf_metadata
Get metadata from a PDF file (page count, author, title, etc.).
Parameters:
pdf_path(string): Path to the PDF file
Usage Examples
Once configured, you can ask your AI assistant:
Local Files:
- "Read pages 1-5 from document.pdf"
- "Search for 'machine learning' in research.pdf"
- "What's the metadata of report.pdf?"
- "Read page 10 from presentation.pdf"
URLs:
- "Read this PDF: https://example.com/research-paper.pdf"
- "Search for 'climate change' in https://example.com/report.pdf"
- "Get metadata from https://arxiv.org/pdf/2301.00001.pdf"
- "Read pages 1-3 from https://example.com/whitepaper.pdf"
Development
Prerequisites
- Node.js >= 18.0.0
- npm or yarn
Setup
# Clone the repository
git clone https://github.com/Saqib-Aziz007/mcp-pdf-reader.git
cd mcp-pdf-reader
# Install dependencies
npm install
# Build the project
npm run build
# Run in development mode
npm run dev
Scripts
npm run build- Compile TypeScript to JavaScriptnpm run dev- Watch mode for developmentnpm start- Run the compiled servernpm run prepare- Pre-install build hook
Technical Details
This MCP server is built with:
- @modelcontextprotocol/sdk - MCP SDK for TypeScript
- pdf-parse - PDF parsing library
- TypeScript for type safety
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
License
MIT License - see LICENSE file for details
Author
Muhammad Saqib Aziz
- Email: saqib.aziz1000@gmail.com
- GitHub: @Saqib-Aziz007
- npm: @saqibaziz
Support
If you encounter any issues or have questions:
- Open an issue on GitHub
- Check the MCP documentation
Acknowledgments
Built with the Model Context Protocol by Anthropic.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.