mcp-memory-server

mcp-memory-server

A knowledge-graph-based persistent memory server for the Model Context Protocol, storing entities, observations, and relations in SQLite with semantic vector search and temporal versioning.

Category
Visit Server

README

MCP Memory Server

A knowledge-graph-based persistent memory server for the Model Context Protocol (MCP). Stores entities, observations, and relations in SQLite with semantic vector search, temporal versioning, and cursor-based pagination.

Attribution

This project is derived from @modelcontextprotocol/server-memory by Anthropic, PBC, originally published under the MIT License as part of the MCP Servers monorepo. The original LICENSE file is preserved in this repository.

Features

  • SQLite storage with WAL mode, foreign key constraints, and database-level deduplication
  • Semantic vector search via sqlite-vec + all-MiniLM-L6-v2 ONNX embeddings (automatic, no config needed)
  • Temporal versioning — observations and relations track superseded_at timestamps; old data is preserved for history but hidden from active queries
  • Point-in-time queriesas_of parameter on read/search operations recovers the graph state at any past timestamp
  • Entity soft-deletedelete_entities marks entities as superseded (preserving history) rather than hard-deleting
  • Entity name normalization — surface variants like Dustin-Space, dustin/space, and DUSTIN SPACE all resolve to the same identity key
  • Observation metadataimportance (1-5 scale), context_layer (L0/L1/null), and memory_type classification on observations
  • Context layersget_context_layers returns L0 (always-loaded rules) and L1 (session-start context) observations with token budgets
  • Session-start briefingget_summary returns top observations by importance, recently updated entities, and aggregate stats
  • Entity timeline — view full change history (active + superseded + tombstoned observations and relations)
  • Similarity detectionadd_observations warns when new observations are semantically similar to existing ones
  • Four-tier eviction — Active → Superseded → Tombstoned → Hard-deleted degradation chain with 6-month LRU shield and configurable size cap (default 1 GB)
  • Cursor-based pagination — sorted by most-recently-updated, stable under concurrent use
  • Project scoping — optional projectId parameter isolates entities by project
  • Auto-migration — upgrades from JSONL to SQLite automatically on first run

Installation

npm install
npm run build

Note: better-sqlite3 is a native C addon. You'll need C build tools (gcc, make) installed. On Fedora: sudo dnf install gcc make. On Ubuntu/Debian: sudo apt install build-essential.

Full harness install (recommended for Claude Code)

The server alone provides the storage layer. To replicate the full session-lifecycle integration — auto-loaded L0 context, freshness scanning, pre/post-compact agents, noise gating, /audit-memory and /checkpoint skills, custom QA agents — see harness/README.md:

cd harness
./install.sh

The installer is idempotent, backs up existing files, and prints the manual integration steps (settings.json merge, MCP registration, CLAUDE.md fragments) at the end.

Usage

With Claude Code

Add to your MCP server configuration:

{
  "mcpServers": {
    "memory": {
      "command": "node",
      "args": ["/path/to/mcp-memory-server/dist/index.js"],
      "env": {
        "MEMORY_FILE_PATH": "/path/to/your/memory.db"
      }
    }
  }
}

Environment Variables

Variable Default Description
MEMORY_FILE_PATH memory.db (alongside script) Path to database file. Extension determines backend: .db/.sqlite → SQLite, .jsonl → JSONL
MEMORY_VECTOR_SEARCH on Set to off to disable vector search entirely
MEMORY_SIZE_CAP_BYTES 1000000000 (1 GB) Database size cap for the eviction system. Eviction triggers at 90% of cap

Tools

Tool Description Key Parameters
create_entities Create new entities in the knowledge graph entities[], projectId?
create_relations Create directed relations between entities relations[] (from, to, relationType)
add_observations Add observations to existing entities. Returns similarExisting when new observations are semantically close to existing ones observations[] (entityName, contents[], importances?[], contextLayers?[], memoryTypes?[])
delete_entities Soft-delete entities (preserves history for as_of and entity_timeline) entityNames[]
delete_observations Delete specific observations from entities deletions[] (entityName, contents[])
delete_relations Delete relations between entities relations[] (from, to, relationType)
supersede_observations Atomically retire old observations and insert replacements. Preserves history supersessions[] (entityName, oldContent, newContent)
invalidate_relations Mark relations as no longer active. Preserves history. Idempotent relations[] (from, to, relationType)
set_observation_metadata Update importance, context layer, and/or memory type on existing observations in-place updates[] (entityName, content, importance?, contextLayer?, memoryType?)
read_graph Read the knowledge graph (paginated, sorted by most recently updated) projectId?, asOf?, cursor?, limit?
search_nodes Search entities by name, type, or observation content. LIKE + semantic vector search query, projectId?, asOf?, cursor?, limit?
open_nodes Retrieve specific entities by name with full relations names[], projectId?, asOf?
list_projects List all project names in the knowledge graph
entity_timeline Full change history for an entity (active + superseded + tombstoned items) entityName, projectId?
get_context_layers Returns L0/L1 observations sorted by importance with token budgets projectId?, layers?
get_summary Session-start briefing: top observations, recent entities, aggregate stats projectId?, excludeContextLayers?, limit?

Data Model

  • Entities — named nodes with a type, a project scope, and timestamped observations. Entity names are normalized (case-insensitive, separator-insensitive) for identity matching while preserving the original display form.
  • Relations — directed edges between entities with temporal tracking (createdAt, supersededAt)
  • Observations{ content, createdAt, importance, contextLayer, memoryType } objects attached to an entity (the atomic unit of knowledge)

Temporal Versioning

Observations, relations, and entities support a four-tier lifecycle:

  • Active items appear in read_graph, search_nodes, and open_nodes
  • Superseded items are hidden from active queries but preserved for history and as_of recovery
  • Tombstoned items have their content stripped by the eviction system but preserve the existence skeleton
  • Hard-deleted items are permanently removed when under extreme size pressure
  • Use entity_timeline to see the full history of any entity (all tiers)
  • Use as_of parameter on read_graph/search_nodes/open_nodes to recover the graph at any past timestamp

Vector Search

Vector search is automatic when using the SQLite backend:

  • Uses sqlite-vec (brute-force KNN) with all-MiniLM-L6-v2 ONNX embeddings (384 dimensions)
  • Model downloads ~23MB on first startup (cached thereafter)
  • LIKE substring search always runs; vector search adds supplementary semantic matches
  • Set MEMORY_VECTOR_SEARCH=off to disable
  • If the model fails to load, the system degrades gracefully to LIKE-only search

Storage Backends

SQLite (default, recommended)

SQLite is the default and recommended backend. Features: WAL mode, FK constraints, database-level deduplication, vector search, temporal versioning, schema migrations.

JSONL (deprecated)

The JSONL backend is maintained for environments without C build tools but does not support vector search, as_of queries, entity soft-delete, entity_timeline, set_observation_metadata, invalidate_relations, or eviction. Set MEMORY_FILE_PATH to a .jsonl path to use it.

Migration from JSONL

  1. Change MEMORY_FILE_PATH to a .db path (or remove it to use the default)
  2. On first run, the server auto-migrates data from the .jsonl file
  3. The original JSONL file is renamed to .jsonl.bak

Known Limitations

  • Entity names are globally unique across all projects (relations use names as FK)
  • Project filtering is advisory, not a security boundary
  • Paginated relation coverage is incomplete — use open_nodes for full relations on specific entities
  • Vector search is best-effort — degrades to LIKE-only if model/extension unavailable
  • vec0 is brute-force KNN — fine at current scale, consider ANN at ~50,000+ observations
  • Context layers and memory types are caller-managed — the server stores them but does not auto-classify

See CLAUDE.md for detailed architecture documentation and the full limitations list.

Development

npm test           # run tests (520 tests across 9 test files)
npm run test:watch # run tests in watch mode
npm run build      # compile TypeScript
npm run watch      # compile in watch mode

Set SKIP_VECTOR_INTEGRATION=1 to skip the vector integration tests (which download the embedding model) for faster CI runs.

License

See LICENSE for the full text and original copyright notices. The upstream project is transitioning from MIT to Apache-2.0; documentation is licensed under CC-BY-4.0.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured