mnemos
MCP server providing persistent memory, semantic search, versioned storage, webhook fanout, and unified LLM routing for AI agents via FastAPI runtime with multiple backend options.
README
<p align="center"> <img src="docs/images/logo.png" alt="MNEMOS" width="220" /> </p>
MNEMOS + GRAEAE
MNEMOS v6.0.0rc1 is the — release/v6.0-rc branch, v6.0-rc tag) is the memory operating system for
serious agentic work: a packaged FastAPI runtime, four-backend persistence
layer (PostgreSQL + pgvector, Oracle Database 26ai HNSW INMEMORY NEIGHBOR GRAPH,
IBM Db2 12.1.5 (EAP) DiskANN vector, SQLite + sqlite-vec), GRAEAE reasoning bus,
operator-audited compression stack, divergent dream-state pipeline (REPLAY ->
CLUSTER -> CONSOLIDATE -> SYNTHESISE -> EXTRACT), GDPR right-to-be-forgotten
worker, PERSEPHONE archival subsystem, PANTHEON unified LLM facade, KRONOS
recall observability, and CLI-first deployment surface.
MNEMOS is not just a place to put bytes. It is a runtime of named subsystems that manage the full lifecycle of agent memory across providers, agents, and time horizons: write, embed, search, compress, version, reason-over, audit, federate, export, import, and operate.
Release candidate. This README documents the
v6.0-rctag — the canonical release candidate. It adds Oracle Database 26ai and IBM Db2 12.1.5 (Early Access Program) as first-class persistence backends alongside PostgreSQL and SQLite. The most recent published latest published PyPI release is5.0.1(legacy);6.0.0rc1is the source-install candidate from thev6.0-rctag (Postgres + SQLite only). Enterprise backends ship from source against the tag until pip extras are published:git clone -b v6.0-rc https://github.com/ncz-os/mnemos. See docs/INSTALL.md for driver, DSN, and migration steps. Development history continues on thefeat/oracle-portbranch.
Quick Start
Memory and reasoning runtime for AI agents: persistent search, versioned storage, webhook fanout, and a unified LLM routing bus - all behind a single MCP interface.
1. Agent-driven install
Paste into Claude Code, Cursor, or Codex. The agent runs the install; you confirm.
Install MNEMOS on this machine.
Steps:
1. pip install 'mnemos-os[server]==6.0.0rc1' # source-install from v6.0-rc tag until PyPI publish
2. mnemos init # scaffold config + token
3. mnemos serve # start API on :5002
4. mnemos doctor # verify subsystems
5. Set MNEMOS_BASE=http://localhost:5002 and MNEMOS_API_KEY=<token from step 2>
in shell env and any agent config that needs to reach it.
Edge device (SQLite, no Postgres): pip install 'mnemos-os[edge]==6.0.0rc1' instead.
Full install with all subsystems: pip install 'mnemos-os[full]==6.0.0rc1'
Enterprise backends (Oracle Database 26ai, IBM Db2 12.1.5 EAP).
Until the next PyPI release is cut, install from source against the
v6.0-rc tag (or the feat/oracle-port development branch) and add
the matching extras. See
docs/INSTALL.md
for full driver, DSN, and migration steps.
git clone -b feat/oracle-port https://github.com/ncz-os/mnemos
cd mnemos
python -m pip install -e '.[server,oracle]' # or '.[server,db2]' or '.[server,enterprise]'
export MNEMOS_DATABASE_DSN='oracle://user:pass@host:1521/service_name'
# or: MNEMOS_DATABASE_DSN='db2://user:pass@host:50000/dbname'
mnemos install --profile server
mnemos serve --profile server
2. Connect an agent via MCP
Add to ~/.claude/mcp_servers.json (Claude Code) or equivalent:
{
"mcpServers": {
"mnemos": {
"command": "mnemos",
"args": ["serve", "mcp-stdio"],
"env": {
"MNEMOS_BASE": "http://<host>:5002",
"MNEMOS_API_KEY": "<token>"
}
}
}
}
For HTTP/SSE transport (ChatGPT, remote agents): mnemos serve mcp-http on :5004.
Key MCP tools the agent gets:
| Tool | What it does |
|---|---|
search_memories |
Semantic + filtered search across the memory store |
create_memory |
Write a new memory with category, tags, and content |
get_memory |
Fetch a memory by ID |
kg_search |
Query the knowledge-graph triple store |
kronos_anomalies |
Surface recall anomalies and memory health signals |
list_deletions |
List soft-deleted memories pending hard deletion |
3. Webhooks + integrations
| Integration | What connects | How |
|---|---|---|
| Claude Code | Hooks fire on session-start, prompt-submit, stop - auto-log to MNEMOS | integrations/claude-code/ - copy hooks + set MNEMOS_BASE |
| ZeroClaw | Zeroclaw agent reads/writes memories via MCP | integrations/zeroclaw/ + mnemos serve mcp-stdio in zeroclaw config |
| OpenClaw | OpenClaw gateway routes memory ops through MCP | integrations/openclaw/ + MCP server entry in openclaw.json |
| Hermes | Optional memory skill mounts MNEMOS as a tool provider | integrations/hermes/optional-skills/memory/mnemos/ |
| Webhooks (any) | Push memory.created, memory.updated, memory.deleted, consultation.completed events to any HTTPS endpoint |
POST /api/webhooks/register with {"url": "...", "events": [...]} |
| Cursor / Cline / Continue.dev / Zed / Aider | Any MCP-capable IDE connects via stdio or HTTP transport | See docs/connectors/ |
Full documentation: docs/
Architecture
MNEMOS is a packaged FastAPI service with a single mnemos CLI for installation, serving, MCP transport, and operational checks. Agents connect through MCP stdio, MCP HTTP/SSE, REST, or OpenAI-compatible SDKs, while the runtime routes memory, reasoning, session, webhook, federation, portability, and observability work through the mnemos/ package. Persistence is selected by profile and DSN: SQLite + sqlite-vec for edge and development installs, PostgreSQL + pgvector for server deployments, Oracle Database 26ai (23.26.1-ee, HNSW INMEMORY NEIGHBOR GRAPH, JSON Duality, TDE) for enterprise installs, and IBM Db2 12.1.5 (native VECTOR(768, FLOAT32) + DiskANN vector index; runs through Db2 Oracle Compatibility Mode with cursor-level Oracle→Db2 token translation — a native Db2 dialect port is on the v6.x roadmap, see docs/v6.1-roadmap.md. Db2MemoryRepository.semantic_search emits native Db2 SQL — VECTOR_DISTANCE(..., EUCLIDEAN) + FETCH APPROX FIRST — engaging the DiskANN index on the user-facing query path) for enterprise installs. All four backends implement the same PersistenceBackend ABC (mnemos/persistence/base.py) and share tests/test_persistence_parity.py. GRAEAE handles multi-provider reasoning and model routing; MOIRAI handles operator-audited compression through APOLLO and ARTEMIS.
Documentation
| Topic | File |
|---|---|
| Installation | docs/INSTALL.md |
| Specification | docs/SPECIFICATION.md |
| System requirements | docs/SYSTEM_REQUIREMENTS.md |
| Memory architecture | docs/MEMORY_ARCHITECTURE.md |
| Compression | docs/COMPRESSION.md |
| GRAEAE reasoning | docs/GRAEAE_FEATURES.md |
| PANTHEON provider facade | docs/PANTHEON.md |
| KRONOS observability | docs/KRONOS.md |
| Portability format | docs/MEMORY_EXPORT_FORMAT.md |
| Scaling | docs/SCALING.md |
| Single-binary builds | docs/SINGLE_BINARY.md |
| Operations | docs/OPERATIONS.md |
| Benchmark harness | scripts/bench_v4.py — cross-backend vector-search harness (PG / Oracle / Db2 / SQLite). Results published post-GA. |
License
MNEMOS is licensed under the Apache License, Version 2.0. See LICENSE for the full text.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.