neo4j-prefiltering-mcp
Automatically discovers vector indexes in a Neo4j database and exposes each as a semantic search tool with metadata pre-filtering, enabling natural language queries with dynamic filter support.
README
Neo4j Prefiltering Vector Search MCP Server
An MCP (Model Context Protocol) server that automatically discovers vector indexes in a Neo4j database and exposes each one as a semantic search tool. Built with FastMCP and LangChain embeddings, so it works with any embedding provider out of the box.
How It Works
On startup the server:
- Connects to Neo4j and runs
SHOW INDEXESto find everyVECTORindex. - Samples one node per indexed property to detect its type (string, numeric, date, bool, or vector).
- Identifies the embedding property and the remaining filterable metadata properties.
- Registers an MCP tool
search_<index_name>for each discovered index, complete with a dynamically generated description listing the available filters.
If no vector indexes are found, the server exits with an error.
Prerequisites
- Python 3.10+
- A running Neo4j instance (5.x+ with vector index support)
- At least one vector index already created in the database
- An API key or credentials for your chosen embedding provider
Installation
First, clone the repository:
git clone https://github.com/tomasonjo/neo4j-prefiltering-mcp.git
cd neo4j-prefiltering-mcp
Using uvx (recommended)
No installation needed — just run it directly from the local folder:
uvx --from /path/to/neo4j-prefiltering-mcp neo4j-prefiltering-mcp
Using pip
pip install /path/to/neo4j-prefiltering-mcp
Then run:
neo4j-prefiltering-mcp
Embedding providers
The base package does not include an embedding provider. Install the one you need as an extra:
# OpenAI
pip install "/path/to/neo4j-prefiltering-mcp[openai]"
# Cohere
pip install "/path/to/neo4j-prefiltering-mcp[cohere]"
# HuggingFace
pip install "/path/to/neo4j-prefiltering-mcp[huggingface]"
Or with uvx:
uvx --from /path/to/neo4j-prefiltering-mcp --with langchain-openai neo4j-prefiltering-mcp
Configuration
All configuration is done through environment variables.
| Variable | Default | Description |
|---|---|---|
NEO4J_URI |
bolt://localhost:7687 |
Neo4j connection URI |
NEO4J_USER |
neo4j |
Neo4j username |
NEO4J_PASSWORD |
password |
Neo4j password |
NEO4J_DATABASE |
neo4j |
Neo4j database name |
EMBEDDING_MODEL |
openai:text-embedding-3-small |
LangChain embedding model spec |
The EMBEDDING_MODEL value is passed directly to langchain.embeddings.init_embeddings(). Any provider string it supports will work:
# OpenAI
export EMBEDDING_MODEL="openai:text-embedding-3-small"
# Cohere
export EMBEDDING_MODEL="cohere:embed-english-v3.0"
# HuggingFace
export EMBEDDING_MODEL="huggingface:BAAI/bge-small-en-v1.5"
Make sure the corresponding provider SDK and API key env var are set (e.g. OPENAI_API_KEY, COHERE_API_KEY).
Usage
Claude Desktop
Add the server to your claude_desktop_config.json:
{
"mcpServers": {
"neo4j-vector": {
"command": "uvx",
"args": ["--from", "/path/to/neo4j-prefiltering-mcp", "--with", "langchain-openai", "neo4j-prefiltering-mcp"],
"env": {
"NEO4J_URI": "bolt://localhost:7687",
"NEO4J_USER": "neo4j",
"NEO4J_PASSWORD": "your-password",
"NEO4J_DATABASE": "neo4j",
"EMBEDDING_MODEL": "openai:text-embedding-3-small",
"OPENAI_API_KEY": "sk-..."
}
}
}
}
Claude Code
claude mcp add neo4j-vector -- uvx --from /path/to/neo4j-prefiltering-mcp --with langchain-openai neo4j-prefiltering-mcp
Standalone
neo4j-prefiltering-mcp
The server communicates over stdio by default, which is the standard transport for local MCP tool servers.
Cursor / Continue / Other MCP Clients
Point the client at the server as a stdio server. The exact config format varies by client — consult its docs and use the command + args pattern shown above.
Tool Interface
Each discovered index is exposed as a tool with the following parameters:
| Parameter | Type | Required | Description |
|---|---|---|---|
query |
str |
Yes | Natural-language search text (embedded at call time) |
top_k |
int |
No | Number of results to return (default 10) |
filters |
dict |
No | Metadata filters (keys and accepted types are index-specific) |
Filter Types
The server infers filter types by sampling a node from each index:
| Detected Type | Filter Format | Example |
|---|---|---|
float / int |
{"min": ..., "max": ...} |
{"min": 0.5, "max": 1.0} |
date |
{"min": "...", "max": "..."} |
{"min": "2024-01-01", "max": "2024-12-31"} |
bool |
true / false |
true |
string |
"exact value" |
"en" |
Both min and max are optional within a range filter — you can supply either or both.
Example Tool Call
Given an index called news_articles on :Article nodes with metadata properties language (string) and sentiment (float):
{
"name": "search_news_articles",
"arguments": {
"query": "recent breakthroughs in fusion energy",
"top_k": 5,
"filters": {
"language": "en",
"sentiment": { "min": 0.6 }
}
}
}
Response Format
The tool returns a JSON array of results, each containing the matched node's properties (minus the raw embedding vector) and a similarity score:
[
{
"doc": {
"title": "Fusion Milestone Reached at NIF",
"language": "en",
"sentiment": 0.92,
"published": "2025-01-15"
},
"score": 0.941
}
]
Project Structure
.
├── pyproject.toml
├── src/
│ └── neo4j_prefiltering_mcp/
│ ├── __init__.py
│ └── server.py
└── README.md
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases