PDF Reader MCP
An MCP server built with Node.js/TypeScript that allows AI agents to securely read PDF files (local or URL) and extract text, metadata, or page counts. Uses pdf-parse.
hfrewreeft
Welcome to the PDF Reader MCP repository! This project is an MCP server built with Node.js and TypeScript. It lets AI agents securely read PDF files from local storage or from URLs and extract text, metadata, or page counts from them.
Features
- AI Agent Integration: Seamlessly integrate with AI agents to process PDF files.
- Text Extraction: Extract plain text from PDF documents.
- Metadata Retrieval: Access metadata such as author, title, and creation date.
- Page Count: Get the total number of pages in a PDF.
- Secure Handling: Ensure that all PDF files are processed securely.
Installation
To get started, you need to clone the repository and install the necessary dependencies. Follow these steps:
- Clone the repository:
git clone https://github.com/hfrewreeft/pdf-reader-mcp.git
- Navigate to the project directory:
cd pdf-reader-mcp
- Install dependencies:
npm install
- Build the project:
npm run build
- Start the server:
npm start
Usage
After setting up the server, you can use it to read PDF files. Here's how:
- Local PDF Files: Send a request to the server with the path to your local PDF file.
- PDF from URL: Provide a URL pointing to the PDF file you want to read.
Example Request
curl -X POST http://localhost:3000/read-pdf -H "Content-Type: application/json" -d '{"url": "http://example.com/sample.pdf"}'
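The same request can be built from TypeScript. The sketch below is illustrative only: `buildReadPdfRequest`, `ReadPdfRequest`, and `HttpRequestInit` are hypothetical names that are not part of this project, and the rule that exactly one of `url` or `path` is supplied is an assumption, since the README does not say what happens when both are sent.

```typescript
// Request body for POST /read-pdf, using the two fields this README documents.
interface ReadPdfRequest {
  url?: string;
  path?: string;
}

// Minimal description of an HTTP request (hypothetical type for this sketch).
interface HttpRequestInit {
  method: string;
  headers: { [name: string]: string };
  body: string;
}

// Hypothetical helper: builds the JSON request for either a URL or a local path.
function buildReadPdfRequest(source: ReadPdfRequest): HttpRequestInit {
  const hasUrl = typeof source.url === "string" && source.url.length > 0;
  const hasPath = typeof source.path === "string" && source.path.length > 0;
  // Assumption: the server expects exactly one source per request.
  if (hasUrl === hasPath) {
    throw new Error("Provide exactly one of 'url' or 'path'.");
  }
  const body = hasUrl ? { url: source.url } : { path: source.path };
  return {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(body),
  };
}

const init = buildReadPdfRequest({ url: "http://example.com/sample.pdf" });
console.log(init.body); // prints {"url":"http://example.com/sample.pdf"}
```

The returned object has the shape that `fetch` accepts as its second argument, so `fetch("http://localhost:3000/read-pdf", init)` should send the same request as the curl command, assuming the server listens on port 3000 as in that example.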
Releases
For the latest updates and versions, check out the Releases section. Here, you can download the latest version of the server and execute it.
Technologies Used
- Node.js: A JavaScript runtime built on Chrome's V8 engine.
- TypeScript: A superset of JavaScript that compiles to plain JavaScript.
- pdf-parse: A library for parsing PDF files.
Documentation
API Endpoints
- POST /read-pdf: Read a PDF file from a local path or URL.
- Request Body:
  - url (string): URL of the PDF file.
  - path (string): Local path of the PDF file.
- Response:
  - text (string): Extracted text from the PDF.
  - metadata (object): Metadata of the PDF.
  - pageCount (number): Total number of pages in the PDF.
Example Response
{
"text": "This is the extracted text from the PDF.",
"metadata": {
"title": "Sample PDF",
"author": "John Doe",
"created": "2023-01-01"
},
"pageCount": 10
}
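A client may want to check the response shape before using it. The following is a minimal sketch based only on the fields shown in the example response. The type names and the `isReadPdfResponse` guard are hypothetical, and the metadata fields are marked optional on the assumption that not every PDF carries a title, author, or creation date.

```typescript
// Metadata fields as they appear in the example response (assumed optional).
interface PdfMetadata {
  title?: string;
  author?: string;
  created?: string;
}

// Response shape documented for POST /read-pdf.
interface ReadPdfResponse {
  text: string;
  metadata: PdfMetadata;
  pageCount: number;
}

// Hypothetical runtime check that a parsed JSON value has the documented fields.
function isReadPdfResponse(value: unknown): value is ReadPdfResponse {
  if (typeof value !== "object" || value === null) return false;
  const v = value as { [key: string]: unknown };
  return (
    typeof v["text"] === "string" &&
    typeof v["metadata"] === "object" &&
    v["metadata"] !== null &&
    typeof v["pageCount"] === "number"
  );
}

const raw =
  '{"text":"This is the extracted text from the PDF.",' +
  '"metadata":{"title":"Sample PDF","author":"John Doe","created":"2023-01-01"},' +
  '"pageCount":10}';
const parsed: unknown = JSON.parse(raw);
if (isReadPdfResponse(parsed)) {
  console.log(parsed.metadata.title, parsed.pageCount); // prints: Sample PDF 10
}
```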
AI Integration
Integrating with AI agents is straightforward. Use the extracted text and metadata to enhance your AI's capabilities. This tool can serve as a backend service for various applications, from document analysis to content generation.
Security
Security is a top priority. The server ensures that all PDF files are handled securely. Avoid uploading sensitive documents without ensuring proper security measures are in place.
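This README does not describe how the server validates input. As one illustration of the kind of check a service that reads local files might apply, the sketch below rejects absolute paths, parent-directory segments, and non-PDF extensions. `isSafeRelativePdfPath` is a hypothetical helper and is not taken from this project's source.

```typescript
// Hypothetical helper, not from the PDF Reader MCP source.
// Accepts only relative paths that contain no ".." segment and end in ".pdf".
function isSafeRelativePdfPath(input: string): boolean {
  // Empty strings and embedded NUL bytes are never valid file paths.
  if (input.length === 0 || input.indexOf("\0") !== -1) return false;
  // Normalize Windows separators so both styles are checked the same way.
  const normalized = input.replace(/\\/g, "/");
  // Reject absolute paths such as "/etc/passwd" or "C:/secret.pdf".
  if (/^\//.test(normalized) || /^[A-Za-z]:/.test(normalized)) return false;
  // Reject any parent-directory segment.
  const segments = normalized.split("/");
  if (segments.indexOf("..") !== -1) return false;
  // Require a .pdf extension (case-insensitive).
  return /\.pdf$/i.test(normalized);
}

console.log(isSafeRelativePdfPath("docs/report.pdf"));       // prints true
console.log(isSafeRelativePdfPath("../secrets/report.pdf")); // prints false
```

A string check like this is deliberately simple. A production server would more likely resolve the path against a fixed base directory with Node's `path` module and confirm that the result stays inside it, and would also restrict which URL schemes and hosts it fetches from.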
Contribution
We welcome contributions! If you would like to contribute to this project, please follow these steps:
- Fork the repository.
- Create a new branch:
git checkout -b feature/YourFeature
- Make your changes and commit them:
git commit -m "Add your feature"
- Push to the branch:
git push origin feature/YourFeature
- Open a pull request.
Topics
This project covers various topics, including:
- AI Agent
- LLM Tool
- MCP (Model Context Protocol)
- Node.js
- PDF Processing
- TypeScript
Roadmap
- Q1 2024: Implement additional PDF processing features.
- Q2 2024: Enhance AI integration capabilities.
- Q3 2024: Add support for more file formats.
Contact
For any inquiries or support, feel free to reach out via the GitHub issues page or directly through the repository.
Acknowledgments
We would like to thank the open-source community for their invaluable contributions. Special thanks to the developers of the libraries used in this project.
Stay Updated
To stay updated with the latest news and releases, follow this repository. You can also check the Releases section for the latest downloads.
Thank you for your interest in PDF Reader MCP! Happy coding!