PDF Search for Zed
An MCP server extension for Zed that retrieves relevant pieces from a PDF file
freespirit
README
PDF Search for Zed
A document search extension for Zed that lets you semantically search through a PDF document and use the results in Zed's AI Assistant.
Prerequisites
This extension currently requires:
- An
OpenAIAPI key (to generate embeddings) uvinstalled on your system
Note: While the current setup requires an OpenAI API key for generating embeddings, we plan to implement a self-contained alternative in future versions. Community feedback will help prioritize these improvements.
Quick Start
- Clone the repository
git clone https://github.com/freespirit/pdfsearch-zed.git
- Set up the Python environment for the MCP server:
cd pdfsearch-zed/pdf_rag
uv venv
uv sync
-
Install Dev Extension in Zed
-
Build the search db
cd /path/to/pdfsearch-zed/pdf_rag
echo "OPENAI_API_KEY=sk-..." > src/pdf_rag/.env
# This may take a couple of minutes, depending on the documents' size
# You can provide multiple files and directories as arguments.
# - files would be chunked.
# - a directory would be considered as if its files contains chunks.
# E.g. they won't be further split.
uv run src/pdf_rag/rag.py build "file1.pdf" "dir1" "file2.md" ...
- Configure Zed
"context_servers": {
"pdfsearch-context-server": {
"settings": {
"extension_path": "/path/to/pdfsearch-zed"
}
}
}
Usage
- Open Zed's AI Assistant panel
- Type
/pdfsearchfollowed by your search query - The extension will search the PDF and add relevant sections to the AI Assistant's context
Future Improvements
- [x] Self-contained vector store
- [ ] Self-contained embeddings
- [ ] Automated index building on first run
- [ ] Configurable result size
- [x] Support for multiple PDFs
- [x] Optional: Additional file formats beyond PDF
Project Structure
pdf_rag/: Python-based MCP server implementationsrc/: Zed extension codeextension.tomlandCargo.toml: Zed extension configuration files
Known Limitations
- Manual index building is required before first use
- Requires external services (OpenAI)
Recommended Servers
mixpanel
Connect to your Mixpanel data. Query events, retention, and funnel data from Mixpanel analytics.
Sequential Thinking MCP Server
This server facilitates structured problem-solving by breaking down complex issues into sequential steps, supporting revisions, and enabling multiple solution paths through full MCP integration.
MCP PubMed Search
Server to search PubMed (PubMed is a free, online database that allows users to search for biomedical and life sciences literature). I have created on a day MCP came out but was on vacation, I saw someone post similar server in your DB, but figured to post mine.
dbt Semantic Layer MCP Server
A server that enables querying the dbt Semantic Layer through natural language conversations with Claude Desktop and other AI assistants, allowing users to discover metrics, create queries, analyze data, and visualize results.
Crypto Price & Market Analysis MCP Server
A Model Context Protocol (MCP) server that provides comprehensive cryptocurrency analysis using the CoinCap API. This server offers real-time price data, market analysis, and historical trends through an easy-to-use interface.
Nefino MCP Server
Provides large language models with access to news and information about renewable energy projects in Germany, allowing filtering by location, topic (solar, wind, hydrogen), and date range.
Vectorize
Vectorize MCP server for advanced retrieval, Private Deep Research, Anything-to-Markdown file extraction and text chunking.
MATLAB MCP Server
Integrates MATLAB with AI to execute code, generate scripts from natural language, and access MATLAB documentation seamlessly.
Macrostrat MCP Server
Enables Claude to query comprehensive geologic data from the Macrostrat API, including geologic units, columns, minerals, and timescales through natural language.
MCP Word Counter
A Model Context Protocol server that provides tools for analyzing text documents, including counting words and characters. This server helps LLMs perform text analysis tasks by exposing simple document statistics functionality.