Claude Memory MCP
Persistent, searchable per-project memory for Claude Code using DuckDB and vector search, with rule enforcement and a management UI.
README
Claude Memory MCP
Persistent, searchable, per-project memory for Claude Code.
Claude forgets everything between sessions. You re-explain the same decisions, rules get missed, and context is lost when the window fills up. Claude Memory MCP gives each of your projects its own brain — decisions, rules, architecture notes, and sprint goals stored locally in a vector database, retrieved by meaning, and automatically loaded every time you start a session.

What you get
- Per-project memory — each project has an isolated DuckDB database; memory never leaks between projects.
- Semantic search — ask "what database did we pick?" and it finds the Postgres decision even if you never typed "Postgres".
- Rule enforcement — mandatory/forbidden rules are re-injected into Claude's context every turn (via hooks) so they survive context compaction and stop being forgotten.
- A management UI — a React app to browse, search, and edit every project's
memories, rules, sessions, and history, with a
Cmd+Kcommand palette. - One shared daemon — a single background process serves the MCP endpoint and the UI; the embedding model loads once, and there are no database lock conflicts between clients.
- Templates — define a set of default rules once, then seed every new project from it (pick exactly which rules with checkboxes) instead of re-typing them. New projects can also import selected rules from any existing project.
- CLAUDE.md import — convert an existing
CLAUDE.mdinto structured memory. - Portable & team-shareable — move a project's database into the repo,
commit it, and teammates get the same memory after
git pull.
How it works
flowchart LR
subgraph clients[Claude Code]
CLI[Terminal CLI]
APP[Desktop app]
end
UI[Management UI<br/>React + command palette]
subgraph daemon[memory-mcp daemon · port 8765]
MCP[/MCP endpoint /mcp/]
API[/JSON API /api/]
EMB[Embedding model<br/>loaded once]
end
DB[(Per-project<br/>DuckDB + vector index)]
CLI -->|HTTP| MCP
APP -->|HTTP| MCP
UI -->|HTTP| API
MCP --> DB
API --> DB
MCP --- EMB
Both Claude Code clients and the UI connect to the same daemon, which is the
sole owner of the DuckDB files. A UserPromptSubmit hook asks the daemon for
the current project's rules and injects them into context on every turn.
Quick start — Docker (recommended)
docker run -d --name memory-mcp \
-p 8765:8765 \
-v memory-mcp-data:/data \
kianfar/claude-memory-mcp:latest
Or with Compose:
docker compose up -d
Then:
-
Management UI — open http://localhost:8765/
-
Connect Claude Code — register the MCP server:
claude mcp add --transport http memory http://localhost:8765/mcp
Quick start — Homebrew
brew tap navid-kianfar/tap
brew install claude-memory-mcp
brew services start claude-memory-mcp # runs the daemon in the background
Then claude mcp add --transport http memory http://localhost:8765/mcp. See
packaging/homebrew/ for tap setup details.
Quick start — from source
Requires uv and (for the UI) Node 20+.
git clone https://github.com/navid-kianfar/claude-memory-mcp.git
cd claude-memory-mcp
./install.sh
install.sh installs dependencies, builds the UI, downloads the embedding
model, installs a launchd agent so the daemon auto-starts, points Claude Code at
the daemon, and installs the rule-enforcement hooks. It prints a one-time
sudo command to add a claude-memory-mcp entry to /etc/hosts so the UI URL
resolves — after that the UI is at http://claude-memory-mcp:8765/.
Screenshots
Once the daemon is running, the management UI is at http://localhost:8765/ — browse, search, and edit every project's memories, rules, and sessions.
Templates — define a baseline rule set once, then reuse it for every new project:

Seed a new project — on creation, import exactly the rules you want (with checkboxes) from a template or from another existing project:

Using it
Inside Claude Code:
memory_init_project("my-app", "My App") # create a project
memory_session_start("my-app") # loads rules + context
From then on Claude stores decisions, rules, and sprint notes automatically and recalls them with semantic search. At the start of each session it loads the project's rules, last summary, and recent decisions.
Rule enforcement
Rules you set (mandatory_rules / forbidden_rules) are enforced three ways:
- Hook injection — a
UserPromptSubmithook injects the actual rule text into context every turn, so rules survive context compaction. - Server instructions — the MCP server tells Claude to load and honor rules.
- Tool responses — search/store responses carry a compact rules reminder.
Hooks are silent in directories that are not registered memory projects, so they can be installed globally without noise.
Importing an existing CLAUDE.md
memory_import_claude_md("/path/to/project") # import into memory
memory_import_claude_md("/path/to/project", stub_rewrite=True) # + slim the file
Headings are mapped to categories (rules, architecture, decisions, devops,
docs); rule sections are split per bullet. With stub_rewrite, CLAUDE.md is
replaced by a short pointer at memory MCP (the original is backed up).
The management UI
A React single-page app served by the daemon at /:
- Browse, search, create, edit, and archive memories in every category
- Manage mandatory/forbidden rules
- Inspect sessions and per-memory provenance/history
- Switch and set the active project
Cmd+Kcommand palette for fast navigation and actions
MCP tools
35 tools, including:
| Area | Tools |
|---|---|
| Projects | memory_init_project, memory_load_from_folder, memory_link_folder, memory_list_projects, memory_project_info, memory_use |
| Memories | memory_store, memory_search, memory_recall, memory_update, memory_delete, memory_list |
| Rules | memory_get_rules, memory_add_rule, memory_update_rule, memory_delete_rule |
| Templates | memory_list_templates, memory_create_template, memory_add_template_rule, memory_apply_template, memory_import_rules |
| Sessions | memory_session_start, memory_session_end |
| Portability | memory_attach_project, memory_make_portable, memory_sync |
| Import/Export | memory_export, memory_import, memory_import_claude_md |
| Model | memory_model_info, memory_set_model, memory_reembed |
| Misc | memory_provenance, memory_version, memory_check_update |
Configuration
Environment variables (prefix MEMORY_MCP_):
| Variable | Default | Purpose |
|---|---|---|
MEMORY_MCP_DATA_DIR |
~/.claude-memory-mcp |
Where databases are stored |
MEMORY_MCP_DAEMON_HOST |
127.0.0.1 |
Daemon bind address (0.0.0.0 in Docker) |
MEMORY_MCP_DAEMON_PORT |
8765 |
Daemon port |
MEMORY_MCP_DAEMON_HOSTNAME |
claude-memory-mcp |
Hostname used in the UI URL |
Team / multi-device memory (git sync)
Bind a project to its source folder and its memory travels with the code through git — across your devices and teammates:
memory_link_folder("/path/to/project") # bind an existing project to its folder
You can also set the folder when creating a project — the New Project dialog
has a Project folder field, and memory_load_from_folder binds it
automatically.
Once bound, the project's rules and decisions are mirrored to a committable
.claude-memory/ snapshot in the project folder — one JSON file per
category, diff- and merge-friendly (no binary database, no embeddings). A
git push carries the latest memory; a teammate's git pull plus their next
session imports it back. The export runs at the end of each turn and the
import at session start (both via hooks), and the central database stays the
daemon's fast working copy.
Import is safe by design: it only adds new entries and applies edits that are strictly newer — it never deletes, and never reverts a more recent local change. Removing a rule is always explicit. Each project's memory is separate — sharing one never exposes the others.
Architecture
- Python + FastMCP — the MCP server and HTTP daemon (Starlette + uvicorn)
- DuckDB + VSS — per-project memory storage with an HNSW cosine vector index
- SQLite — the local registry (project list + app settings); stdlib, no extra dependency
- sentence-transformers — local embeddings (
all-MiniLM-L6-v2, 384-dim; a 50+ language multilingual preset is also available) - Layered design — repositories → services → container → tool/HTTP layer
- React + Vite + Tailwind — the management UI, with hand-built shadcn-style components
Existing databases are migrated automatically on open, so older project databases keep working after upgrades.
Development
uv sync --all-extras
uv run pytest -v # backend tests
cd frontend
npm install
npm run dev # UI dev server (proxies the API to the daemon)
npm run build # production build into frontend/dist
Run the daemon directly:
uv run memory-mcp serve
Releasing
The Docker image is published only for tagged releases — never on ordinary commits. Cut a release with the helper script:
./scripts/release.sh # patch bump (0.6.0 -> 0.6.1)
./scripts/release.sh minor # 0.6.0 -> 0.7.0
./scripts/release.sh 1.2.3 # explicit version
It runs the tests, bumps the version in pyproject.toml and the package,
commits, creates a vX.Y.Z tag, and pushes. The tag push triggers the workflow
that builds and publishes the multi-arch image to Docker Hub.
License
MIT — see LICENSE.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.