ElasticMind-MCP
An MCP server that indexes PDF documentation and text into Elasticsearch for semantic search and retrieval. It enables users to query knowledge bases, ingest new files, and dynamically update content through MCP-compatible clients like Claude Desktop and Cursor.
README
SME Knowledge Base ā MCP Server
This repository provides an MCP (Model Context Protocol) server that indexes documentation into Elasticsearch and exposes tools to query it from MCP-compatible clients such as Claude Desktop, Cursor, and GitHub Copilot.
š Features
- Smart Indexing: Uses deterministic IDs to prevent duplicate entries in Elasticsearch.
- Semantic Search: Query the knowledge base using Elasticsearch's matching capabilities.
- Dynamic Updates: Add new text content directly via MCP tools.
- Robustness: Gracefully handles database connection failures.
Prerequisites
Before running the server, ensure the following are installed:
- Python 3.11+
- Docker (for running Elasticsearch)
- uv (Python package and project manager)
š¦ Setup
1. Start Elasticsearch
The server requires a running Elasticsearch instance. You can start one easily using Docker:
docker run -d --name elasticsearch \
-p 9200:9200 -p 9300:9300 \
-e "discovery.type=single-node" \
-e "xpack.security.enabled=false" \
-e ES_JAVA_OPTS="-Xms1g -Xmx1g" \
docker.elastic.co/elasticsearch/elasticsearch:9.1.5
(Note: Ensure the version tag matches your requirements. Version 8.11.0 is used here as a stable default.)
2. Install Dependencies
Navigate to the project directory and install the required Python packages:
uv sync
# OR
pip install -e .
3. Ingest Data
Place your PDF documents in the input/ folder and run the extraction script to generate the data/docs.json index file:
uv run extraction.py
š§© Configuration
To use this server with Claude Desktop, Cursor, or GitHub Copilot, you need to configure the MCP settings.
1. Locate Paths
You will need the absolute paths for both the uv executable and your cloned repository.
- Find
uvpath:which uv - Find Repository path:
pwd
2. Edit Configuration File
A template configuration file is provided in mcp.json. You can copy its content, but remember to update the paths to be absolute.
- Open Claude Desktop.
- Go to Settings > Developer > Edit Config.
- Add the following configuration to the
mcpServersobject in the JSON file:
{
"mcpServers": {
"sme-knowledge-base": {
"command": "/absolute/path/to/uv",
"args": [
"run",
"--directory",
"/absolute/path/to/my_server_sme",
"elastic_server.py"
],
"env": {
"ES_HOST": "http://localhost:9200"
}
}
}
}
Replace /absolute/path/to/uv and /absolute/path/to/my_server_sme with the actual paths identified in Step 1.
š§ Available Tools
The server exposes the following tools to the LLM:
| Tool Name | Description |
|---|---|
ingest_pdfs |
Scans the input/ directory for new PDFs, extracts text, updates docs.json, and indexes everything into Elasticsearch. Call this after adding new files. |
index_documents |
Manually triggers the indexing process from data/docs.json to Elasticsearch. Useful if you've modified the JSON file directly. |
add_text_to_index |
Adds a new text document to the knowledge base. Features: <br>⢠Updates both persistent storage (docs.json) and Elasticsearch.<br>⢠Automatically chunks content > 1000 words.<br>⢠Generates unique IDs. |
query_knowledge_base |
Accepts a search query string and returns the top 2 most relevant document sections (Heading + Content). |
š Example Workflow
- Start Elasticsearch: Ensure your Docker container is running.
docker start elasticsearch - Add Documents: Drop any PDF files you want to index into the
input/folder. - Start Server: When you open Claude Desktop or Cursor, the server starts automatically.
- It will scan
input/, extract text from new PDFs, and index them into Elasticsearch.
- It will scan
- Interact:
- "What does the document say about [topic]?" (Uses
query_knowledge_base) - "Add this meeting note to the knowledge base: [content]" (Uses
add_text_to_index)
- "What does the document say about [topic]?" (Uses
Troubleshooting
- Connection Refused: Ensure the Docker container is running (
docker ps) and port 9200 is accessible. - Path Errors: Double-check that the paths in your config JSON are absolute (start with
/) and point to the correct locations.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.