membank
LLM memory management system that stores corrections, preferences, decisions, and learnings in a local SQLite database with semantic search, exposed as an MCP server for automatic memory read/write by any MCP-compatible LLM harness.
README
Membank
LLM memory management system. Stores your corrections, preferences, decisions, and learnings in a local SQLite database, queryable via semantic search. Exposed as an MCP server so any MCP-compatible LLM harness (Claude Code, Cursor, etc.) can read and write memories automatically.
How it works
- Memories are typed (
correction>preference>decision>learning>fact) and scoped (global or per-project) - Embeddings run locally via
bge-small-en-v1.5— no data leaves your machine - Dedup via cosine similarity: >0.92 = auto-overwrite, 0.75–0.92 = flagged for review
- Project scope derived from
git remote get-url originhash, fallback to cwd hash - Session injection: stats + all pinned global memories + all pinned project memories prepended to every context window
- Optional background synthesis engine compresses memories into a rolling summary, replacing pinned memory injection when enabled
Requirements
- Node.js ≥ 24
- pnpm ≥ 10
Quick start
npx @membank/cli setup
setup auto-detects your installed LLM harness and writes the MCP server config and injection hooks. Use --harness <name> to target a specific harness, --yes to skip prompts, --json for machine-readable output.
Supported harnesses: claude-code, codex, opencode.
After setup, restart your harness. Membank will start injecting memories into every session.
CLI
membank query "preferred test framework" # semantic search
membank add --type <type> <content> # save a new memory
membank list # list all memories
membank pin <id> # pin (always injected)
membank unpin <id>
membank delete <id>
membank review # list memories flagged for dedup review
membank review --resolve <id> # dismiss review events for a memory
membank stats # storage and usage stats
membank export # export to JSON file (--output <path> to specify)
membank import <file> # import from JSON export file
membank migrate list # list available data migrations
membank migrate run <name> # run a named migration
membank activity # list activity events for the current project
membank dashboard # (deprecated) use: npx @membank/dashboard
membank config get <key> # print a config value
membank config set <key> <value> # set a config value
membank config show # print the full config
membank synthesize run # trigger a synthesis run for a scope
membank synthesize show # display current synthesis for a scope
membank synthesize status # show all scopes with synthesis status
membank setup # configure MCP server + injection hooks
membank setup upgrade # migrate harness configs to standalone membank-mcp
membank inject # output session context (called by hooks)
membank inject --harness claude-code # format output for a specific harness
membank inject --event user-prompt-submit # inject on prompt submit event
membank inject --event session-stop # inject on session stop event
Injection hooks
setup configures MCP servers and injection hooks for all supported harnesses. Hooks fire at session start, on each user prompt, and at session end to keep pinned memories (or synthesis) in context throughout the session.
Supported harnesses and their hook mechanisms:
| Harness | MCP config location | Session hooks |
|---|---|---|
claude-code |
Managed by claude mcp |
SessionStart, SessionEnd |
copilot |
~/.copilot/mcp-config.json |
MCP server only (hooks not supported) |
codex |
Managed by codex mcp |
SessionStart, UserPromptSubmit |
opencode |
~/.config/opencode/opencode.json |
experimental.chat.system.transform plugin |
Web dashboard
Access memories via a local web UI with browsing, searching, filtering, and editing:
npx @membank/dashboard
# Opens http://localhost:3847
Features:
- Browse all memories with filtering by type, scope, and pin status
- Full-text search across memory content
- Edit memory type, tags, and metadata
- Review memories flagged for deduplication
- View memory statistics and storage overview
- Dark mode support
See packages/dashboard/README.md for API docs and architecture details.
MCP tools
When running as an MCP server, the following tools are exposed to the LLM:
| Tool | Description |
|---|---|
query_memory |
Semantic search across stored memories. Supports global flag to query across all projects. |
save_memory |
Store a new memory with type, tags, and scope. Supports global flag to save outside project scope. |
update_memory |
Update content, type (reclassification), or tags on an existing memory |
delete_memory |
Remove a memory by ID |
list_memory_types |
List available memory types and their priority order |
pin_memory |
Pin a memory so it is always injected into session context |
unpin_memory |
Unpin a memory to remove it from guaranteed session injection |
get_memory_summary |
Aggregate stats: total memories, counts by type, pinned count, review queue size |
list_flagged_memories |
List memories with unresolved dedup review events (similarity 0.75–0.92) |
resolve_review |
Dismiss all open review events for a memory after reviewing it |
migrate |
List or run named data migrations (mode: "list" or mode: "run") |
Memory synthesis (optional)
The synthesis engine runs in the background and compresses memories into a rolling summary per scope. When a synthesis is available, it replaces pinned memory injection in the session context.
Synthesis calls Claude Haiku via your locally installed claude CLI. Any of the following auth methods work:
claude auth login— uses your existing Claude Code session (Pro/Max subscribers)CLAUDE_CODE_OAUTH_TOKEN— runclaude setup-tokento generate oneANTHROPIC_API_KEY— direct API billing via Anthropic Console
Enable by creating ~/.membank/config.json:
{
"synthesis": {
"enabled": true
}
}
View synthesis state via:
membank synthesize run # trigger synthesis for a scope
membank synthesize show # current synthesis for global scope
membank synthesize show --scope <s> # synthesis for a specific project scope
membank synthesize status # all scopes and their synthesis state
Storage
All data lives at ~/.membank/:
~/.membank/
memory.db # SQLite database (memories + vectors + synthesis)
models/ # cached embedding model (~30 MB, downloaded on first run)
config.json # optional config (synthesis settings, etc.)
Packages
| Package | Description |
|---|---|
@membank/core |
DB, embeddings, query engine, dedup logic |
@membank/mcp |
stdio MCP server |
@membank/cli |
CLI and npx entrypoint |
@membank/dashboard |
Web UI — browse, search, and manage memories |
Development
pnpm install # install workspace deps
pnpm build # build all packages in dependency order
pnpm dev # watch mode across all packages (including dashboard dev server)
pnpm typecheck # tsc --noEmit across all packages
pnpm lint # biome check
pnpm lint:fix # biome check --write
pnpm clean # remove all dist/ and *.tsbuildinfo
Scoped to one package:
pnpm --filter @membank/core build
pnpm --filter @membank/cli dev
pnpm --filter @membank/dashboard dev # dashboard only on http://localhost:3847
License
MIT — see LICENSE.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.