PageIndex MCP
Enables LLMs to chat with long PDFs using a reasoning-based, tree-structured document index that navigates content like a human would, without requiring vector databases or hitting context limits.
README
<div align="center"> <a href="https://pageindex.ai/mcp"> <img src="https://docs.pageindex.ai/images/general/mcp_banner.jpg"> </a> </div>
PageIndex MCP
If you find this repo useful, please also star our main PageIndex repo ⭐
📘 PageIndex is a vectorless, reasoning-based RAG system that represents documents as hierarchical tree structures. It enables LLMs to navigate and retrieve information through structure and reasoning, not vector similarity — much like a human would retrieve information using a book's index.
🔌 PageIndex MCP exposes this LLM-native, in-context tree index directly to LLMs via MCP, allowing platforms like Claude, Cursor, and other MCP-compatible agents or LLMs to reason over document structure and retrieve the right information — without vector databases.
Want to chat with long PDFs but hit context limit reached errors? Add your file to PageIndex to seamlessly chat with long PDFs on any agent/LLM platforms.
✨ Chat to long PDFs the human-like, reasoning-based way ✨
- Support local and online PDFs
- Free 1000 pages
- Unlimited conversations
For more information, visit the PageIndex MCP page.
💡 Looking for a fully hosted experience? Try PageIndex Chat 🤖: a human-like document analyst that lets you chat with long PDFs using the same agentic, reasoning-based workflow as PageIndex MCP.
<p align="center"> <a href="https://pageindex.ai/mcp"> <img src="https://github.com/user-attachments/assets/d807d506-131d-4c7b-837c-96ab1adb2271"> </a> </p>
What is PageIndex?
<div align="center"> <a href="https://pageindex.ai/mcp"> <img src="https://docs.pageindex.ai/images/cookbook/vectorless-rag.png" width="70%"> </a> </div>
PageIndex is a vectorless, reasoning-based RAG system that generates hierarchical tree structures of documents and uses multi-step reasoning and tree search to retrieve information like a human expert would. It has the following key properties:
- Higher Accuracy: Relevance beyond similarity
- Better Transparency: Clear reasoning trajectory with traceable search paths
- Like A Human: Retrieve information like a human expert navigates documents
- No Vector DB: No extra infrastructure overhead
- No Chunking: Preserve full document context and structure
- No Top-K: Retrieve all relevant passages automatically
PageIndex MCP Setup
See PageIndex MCP for full video guidances.
1. For Claude Desktop (Recommended)
One-Click Installation with Desktop Extension (MCPB):
- Download the latest
.mcpbfile from Releases - Double-click the
.mcpbfile to install automatically in Claude Desktop - The OAuth authentication will be handled automatically when you first use the extension
Note: Claude Desktop Extensions now use the
.mcpb(MCP Bundle) file extension. Existing.dxtextensions will continue to work, but we recommend using.mcpbfor new installations.
This is the easiest way to get started with PageIndex's reasoning-based RAG capabilities.
2. For Other MCP-Compatible Clients
Option 1: Local MCP Server (with local PDF upload)
Requirements: Node.js ≥18.0.0
Add to your MCP configuration:
{
"mcpServers": {
"pageindex": {
"command": "npx",
"args": ["-y", "pageindex-mcp"]
}
}
}
Note: This local server provides full PDF upload capabilities and handles all authentication automatically.
Option 2: Direct Connection to PageIndex
Connect directly to the PageIndex OAuth-enabled MCP server:
{
"mcpServers": {
"pageindex": {
"type": "http",
"url": "https://chat.pageindex.ai/mcp"
}
}
}
For clients that don't support HTTP MCP servers:
If your MCP client doesn't support HTTP servers directly, you can use mcp-remote as a bridge:
{
"mcpServers": {
"pageindex": {
"command": "npx",
"args": ["-y", "mcp-remote", "https://chat.pageindex.ai/mcp"]
}
}
}
Note: Option 1 provides local PDF upload capabilities, while Option 2 only supports PDF processing via URLs (no local file uploads).
Related Links
License
This project is licensed under the terms of the MIT open source license. Please refer to MIT for the full terms.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.