MCP Servers

ProjectMind MCP

Gives AI coding assistants persistent project memory and semantic code search, running fully locally with no API keys required.

README

ProjectMind MCP

Give your AI coding assistant a brain. Persistent memory, semantic code search, and project intelligence — all running locally with no API keys required.

ProjectMind is an open-source MCP (Model Context Protocol) server that supercharges AI assistants like Claude, Zencoder, and Cursor with long-term project memory and intelligent codebase search.

🤖 This project was built with AI — designed, coded, debugged, and documented using AI-assisted development from day one.

Why ProjectMind?

Every time you start a new AI session, your assistant forgets everything about your project. ProjectMind solves this:

No more re-explaining your architecture every session
Semantic code search that understands what code does, not just what it's named
Dependency graph analysis to understand how modules connect
Works 100% locally — your code never leaves your machine

Features

🧠 Persistent Project Memory

Save architectural decisions, tech stack notes, and context that survives across sessions. The AI reads this at the start of every conversation.

Hierarchical memory access avoids dumping everything at once:

read_memory_index() — section headings only (cheap)
read_memory_section(name) — expand only the section you need
search_memory(query) — relevance-ranked retrieval of the memory blocks most relevant to a task (keyword-scored), not just the head of the file

🪜 3-Tier Hierarchical Search

Queries escalate through tiers only when the previous one is insufficient — large repositories never trigger a cold load just to answer a path lookup.

Tier	Engine	When loaded	Typical latency
L0 Manifest	`.ai/manifest.json` (paths + whole-file symbols)	always	< 50 ms
L1 BM25	`rank-bm25` lexical index	only when L0 weak	~ 100 ms
L2 Vector	ChromaDB + sentence-transformers	only on `intent='semantic'/'deep'`	first call ~ 30 s, then cached

Use the unified query(text, intent, n_results) tool with one of:

overview — L0 only (paths + top-level symbols)
lookup — L0 + L1 (keyword/lexical)
semantic — L0 + L1 + L2 (escalates to embeddings if signal is weak)
deep — L0 + L1 + L2 with relaxed thresholds

Hits from every tier are fused with scale-invariant Reciprocal Rank Fusion (normalized to 0..1), so no single tier can dominate the ranking by raw score magnitude — results corroborated across tiers float to the top.

🔍 Semantic Code Search

Search your codebase by meaning, not just text. Powered by a local sentence-transformers model — no OpenAI key needed.

"find authentication middleware"  →  finds auth code even if it's named differently

🌳 AST-Aware Chunking

Unlike naive text splitters that cut code in the middle of a function, ProjectMind uses tree-sitter to parse source files into exact syntax units:

Functions and methods are indexed as individual, self-contained chunks
Class methods get a # Class: ClassName context prefix for better search relevance
Rich metadata per chunk: symbol_type, symbol_name, class_name, line_start, line_end
Supports: Python, JavaScript, TypeScript, TSX, Java, Go, Rust, Ruby
Graceful fallback to text splitting for unsupported file types

🕸 Dependency Graph Intelligence

Traverse import relationships up to 5 levels deep
Find related files via shared dependency clustering
Discover the shortest path between any two modules
Identify entry points and orphaned modules
Monorepo-aware JS/TS resolution — follows tsconfig/jsconfig path aliases (@/...) and workspace/package imports, not just relative paths
Python src/-layout & relative-import resolution — absolute imports resolve through src//lib/ roots and relative imports (./..) walk the correct package levels
Cached import graph (120s TTL) — repeated calls return instantly instead of re-scanning the filesystem

⚡ Instant Project Exploration (no indexing needed)

get_project_overview() — manifest-first; tech stack, git info, file stats in < 1 second
explore_directory(path) — browse project tree level by level
get_file_summary(path) — imports, classes, functions, git history

⚡ Hybrid Search (BM25 + Vector)

Two search engines combined via Reciprocal Rank Fusion (RRF):

BM25 catches exact keyword matches — finds getUserById when you type exactly that
Vector search catches semantic matches — finds auth code even if named differently
RRF merges both ranked lists for best-of-both-worlds results
Automatic fallback to pure vector search when BM25 index is not ready

🔄 Incremental Indexing

Only re-indexes changed files — 10-100x faster than full re-indexing.

🩺 Self-Healing Maintenance Daemon

A background thread keeps the index lean without user intervention. State persists in .ai/maintenance_state.json.

Task	Trigger	Action
`manifest_refresh`	every 5 min	rebuild L0 manifest if files changed
`stale_gc`	hourly	delete embeddings for files removed from disk
`db_compaction`	daily	`VACUUM` ChromaDB SQLite when > 200 MB
`log_truncate`	every 6 h	truncate `projectmind.log` when > 8 MB
`model_unload`	every 5 min	release `sentence-transformers` after 15 min idle
`cache_pressure`	every minute	drop file/query caches when RSS > 500 MB

Inspect with maintenance_status(); force a sync run with maintenance_run(); aggressively clean the index with prune_index(force=True).

📊 Code Quality Metrics

Cyclomatic complexity, pylint scores, test coverage tracking — all queryable via MCP tools. Both analyze_code_complexity and analyze_code_quality accept mode='quick' (default, fast) or mode='deep' (wider scan).

⚡ Lazy `session_init` (no more 30s timeouts)

session_init no longer loads the embedding model or runs an incremental reindex; it returns the project root + manifest + memory index in well under a second even on multi-GB repositories. The vector store is loaded only when an intent='semantic' or 'deep' query actually needs it.

Quick Start

1. Clone and install

git clone https://github.com/Nik0lay1/project-mind-mcp.git
cd project-mind-mcp
python -m venv .venv

# Windows
.venv\Scripts\pip install -e .

# macOS/Linux
.venv/bin/pip install -e .

2. Add to your MCP client

Zencoder / Claude Desktop — add to mcp.json:

{
  "mcpServers": {
    "Memory": {
      "command": "/path/to/ProjectMindMCP/.venv/bin/python",
      "args": ["/path/to/ProjectMindMCP/mcp_server.py"]
    }
  }
}

Windows example:

{
  "mcpServers": {
    "Memory": {
      "command": "F:\\Projects\\ProjectMindMCP\\.venv\\Scripts\\python.exe",
      "args": ["F:\\Projects\\ProjectMindMCP\\mcp_server.py"]
    }
  }
}

3. Index your project

In your target project, ask the AI:

Memory__index_codebase

Or run directly for large projects:

# Windows
.venv\Scripts\python.exe run_index.py

# macOS/Linux
.venv/bin/python run_index.py

Available Tools (45+)

Category	Tools
Session	`session_init`, `health`, `set_project_root`
Memory	`read_memory`, `read_memory_index`, `read_memory_section`, `search_memory`, `update_memory`, `clear_memory`, `save_memory_version`
Search	`query` (tier-aware), `search_codebase`, `search_for_feature`, `search_architecture`, `search_for_errors`
Exploration	`get_project_overview`, `explore_directory`, `get_file_summary`
Dependencies	`get_file_relations`, `get_dependencies_with_depth`, `get_module_cluster`, `find_dependency_path`
Indexing	`index_codebase`, `index_changed_files`, `get_index_stats`, `prune_index`
Git	`ingest_git_history`, `get_recent_changes_summary`, `auto_update_memory_from_commits`
Quality	`analyze_code_complexity`, `analyze_code_quality`, `get_test_coverage_info`
Maintenance	`maintenance_status`, `maintenance_run`
Project	`detect_project_conventions`, `generate_project_summary`

Full reference: docs/api/tools-reference.md

How It Works

Your Project
     │
     ▼
ProjectMind MCP Server
     │
     ├── .ai/memory.md                ← persistent notes & decisions
     ├── .ai/manifest.json            ← L0: paths, symbols, modules (≤200 KB)
     ├── .ai/bm25_index/              ← L1: lexical index
     ├── .ai/vector_store/            ← L2: ChromaDB embeddings (local)
     ├── .ai/index_metadata.json      ← tracks changed files
     ├── .ai/maintenance_state.json   ← self-healing daemon schedule
     └── .ai/.indexignore             ← per-project ignore patterns
     │
     ▼
AI Assistant (Claude / Zencoder / Cursor)

Embedding model: flax-sentence-embeddings/st-codesearch-distilroberta-base

Trained specifically on code (CodeSearchNet dataset)
~130MB, runs fully locally on CPU
No API keys, no data sent anywhere

Search pipeline: BM25 (keyword) + ChromaDB (semantic) → Reciprocal Rank Fusion → top-N results

Requirements

Python 3.10 – 3.12
~500MB disk (model + dependencies)
Works on Windows, macOS, Linux

Configuration

All settings in config.py:

Setting	Default	Description
`MODEL_NAME`	`flax-sentence-embeddings/st-codesearch-distilroberta-base`	Embedding model
`CHUNK_SIZE`	`1500`	Characters per chunk
`MAX_FILE_SIZE_MB`	`10`	Skip files larger than this
`MAX_MEMORY_MB`	`100`	Memory limit for indexing batch
`IMPORT_GRAPH_MAX_FILES`	`8000`	Max files scanned when building the import graph
`TOOL_SOFT_BUDGET_SECONDS`	`20`	Wall-clock budget for analysis tools (`analyze_code_quality`, `analyze_code_complexity`, import-graph build) — they return partial results instead of timing out

Override via environment variables:

PROJECTMIND_MAX_FILE_SIZE_MB=5
PROJECTMIND_MAX_MEMORY_MB=200
PROJECTMIND_IMPORT_GRAPH_MAX_FILES=20000
PROJECTMIND_TOOL_BUDGET_SECONDS=45

Custom ignore patterns: create .ai/.indexignore (same syntax as .gitignore).

Project Structure

mcp_server.py           ← all MCP tool definitions
config.py               ← configuration
manifest.py             ← L0 lightweight project manifest
query_router.py         ← tier-aware query() router (L0 → L1 → L2)
maintenance.py          ← self-healing background daemon
vector_store_manager.py ← ChromaDB wrapper + hybrid search (L2)
bm25_index.py           ← BM25 keyword index + RRF fusion (L1)
codebase_indexer.py     ← file scanning & AST-aware chunking
ast_splitter.py         ← tree-sitter parser (9 languages)
code_intelligence.py    ← import graph, complexity analysis, cached graph
memory_manager.py       ← persistent memory read/write
incremental_indexing.py ← change tracking
context.py              ← dependency injection
run_index.py            ← helper script for manual re-indexing

Contributing

Issues and PRs are welcome. This is an open project — built in the open, improved in the open.

pip install -e ".[dev]"
pytest tests/
ruff check .

License

MIT

Built with AI assistance — was used throughout development for coding, debugging, refactoring, and documentation.

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured

ProjectMind MCP

README

ProjectMind MCP

Why ProjectMind?

Features

🧠 Persistent Project Memory

🪜 3-Tier Hierarchical Search

🔍 Semantic Code Search

🌳 AST-Aware Chunking

🕸 Dependency Graph Intelligence

⚡ Instant Project Exploration (no indexing needed)

⚡ Hybrid Search (BM25 + Vector)

🔄 Incremental Indexing

🩺 Self-Healing Maintenance Daemon

📊 Code Quality Metrics

⚡ Lazy session_init (no more 30s timeouts)

Quick Start

1. Clone and install

2. Add to your MCP client

3. Index your project

Available Tools (45+)

How It Works

Requirements

Configuration

Project Structure

Contributing

License

Recommended Servers

⚡ Lazy `session_init` (no more 30s timeouts)