mcp-local-rag | local RAG-like web search
A "primitive" RAG-like web search Model Context Protocol (MCP) server that runs locally. ✨ No APIs ✨
nkapila6
<img src='images/rag.jpeg' width='200' height='200'>
<img src='images/flowchart.png' width='1000' height='500'>
Installation instructions
- You need to install uv: https://docs.astral.sh/uv/
If you do not want to clone the repository (Step 2 below), paste the following directly into your Claude config. You can find the configuration paths here: https://modelcontextprotocol.io/quickstart/user
{
  "mcpServers": {
    "mcp-local-rag": {
      "command": "uvx",
      "args": [
        "--python=3.10",
        "--from",
        "git+https://github.com/nkapila6/mcp-local-rag",
        "mcp-local-rag"
      ]
    }
  }
}
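If you would rather script this step than edit the file by hand, the merge can be sketched in Python. This is a minimal sketch and not part of the project: the helper names `add_mcp_server` and `update_config_file` are hypothetical, and the config file location for your platform should be taken from the quickstart page linked above.

```python
import json
from pathlib import Path

# The uvx-based entry from the config above.
UVX_ENTRY = {
    "command": "uvx",
    "args": [
        "--python=3.10",
        "--from",
        "git+https://github.com/nkapila6/mcp-local-rag",
        "mcp-local-rag",
    ],
}


def add_mcp_server(config: dict, name: str, entry: dict) -> dict:
    """Return a copy of `config` with `entry` registered under mcpServers[name].

    Hypothetical helper: servers already present in the config are preserved.
    """
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = entry
    merged["mcpServers"] = servers
    return merged


def update_config_file(path: Path) -> None:
    """Read the Claude config at `path`, add mcp-local-rag, and write it back."""
    config = json.loads(path.read_text()) if path.exists() else {}
    updated = add_mcp_server(config, "mcp-local-rag", UVX_ENTRY)
    path.write_text(json.dumps(updated, indent=2))
```

Calling `update_config_file` with the config path for your platform leaves any other configured servers in place. You will likely need to restart Claude afterwards so that it picks up the change.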
Otherwise:
- Clone this GitHub repository (optional; can be skipped if you use the config above)
git clone https://github.com/nkapila6/mcp-local-rag
- Add the following to your Claude config. You can find the configuration paths here: https://modelcontextprotocol.io/quickstart/user
{
  "mcpServers": {
    "mcp-local-rag": {
      "command": "uv",
      "args": [
        "--directory",
        "<path where this folder is located>/mcp-local-rag/",
        "run",
        "src/mcp_local_rag/main.py"
      ]
    }
  }
}
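The placeholder path is the part most likely to go wrong here. As a hedged sketch (the helper names below are hypothetical and not part of the project), the entry can be built from the clone location and checked for the script that the config points at:

```python
from pathlib import Path


def local_clone_entry(repo_dir: str) -> dict:
    """Build the uv-based config entry for a local clone at `repo_dir`.

    Hypothetical helper; mirrors the JSON above with the placeholder filled in.
    """
    repo = Path(repo_dir).expanduser()
    return {
        "command": "uv",
        "args": ["--directory", str(repo), "run", "src/mcp_local_rag/main.py"],
    }


def missing_entry_point(repo_dir: str) -> bool:
    """Return True if the script named in the config is not found under `repo_dir`."""
    script = Path(repo_dir).expanduser() / "src" / "mcp_local_rag" / "main.py"
    return not script.is_file()
```

If `missing_entry_point` returns True for the directory you cloned into, the path in the config probably does not point at the repository root.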
Example use
On prompt
When asked to fetch/look up/search the web, the model prompts you to use the MCP server for the chat.
In this example, I asked it about Google's latest Gemma models, released the day before. This is new information that Claude is not aware of.
<img src='images/mcp_prompted.png'>
Result
The result from the local rag_search helps the model answer with new info.
<img src='images/mcp_result.png'>
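To make the "RAG-like" idea concrete: a search tool of this kind generally gathers candidate text from the web, scores it against the query, and hands only the most relevant passages back to the model as context. The following is an illustrative sketch of the scoring step alone, using bag-of-words cosine similarity from the standard library. It is a toy built on assumptions, not this project's actual implementation; the function names and the similarity measure are hypothetical stand-ins.

```python
import math
import re
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two token-count vectors (0.0 if either is empty)."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_passages(query: str, passages: list[str], k: int = 2) -> list[str]:
    """Return the k passages most similar to the query, best first."""
    query_vec = bag_of_words(query)
    ranked = sorted(
        passages,
        key=lambda p: cosine_similarity(query_vec, bag_of_words(p)),
        reverse=True,
    )
    return ranked[:k]
```

A real pipeline would more likely use learned embeddings than raw word counts, but the shape is the same: rank the retrieved text by relevance and pass the top few passages to the model.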