mnemos

mnemos

MCP server providing persistent memory, semantic search, versioned storage, webhook fanout, and unified LLM routing for AI agents via FastAPI runtime with multiple backend options.

Category
Visit Server

README

<p align="center"> <img src="docs/images/logo.png" alt="MNEMOS" width="220" /> </p>

MNEMOS + GRAEAE

MNEMOS v6.0.0rc1 is the — release/v6.0-rc branch, v6.0-rc tag) is the memory operating system for serious agentic work: a packaged FastAPI runtime, four-backend persistence layer (PostgreSQL + pgvector, Oracle Database 26ai HNSW INMEMORY NEIGHBOR GRAPH, IBM Db2 12.1.5 (EAP) DiskANN vector, SQLite + sqlite-vec), GRAEAE reasoning bus, operator-audited compression stack, divergent dream-state pipeline (REPLAY -> CLUSTER -> CONSOLIDATE -> SYNTHESISE -> EXTRACT), GDPR right-to-be-forgotten worker, PERSEPHONE archival subsystem, PANTHEON unified LLM facade, KRONOS recall observability, and CLI-first deployment surface.

MNEMOS is not just a place to put bytes. It is a runtime of named subsystems that manage the full lifecycle of agent memory across providers, agents, and time horizons: write, embed, search, compress, version, reason-over, audit, federate, export, import, and operate.

Release candidate. This README documents the v6.0-rc tag — the canonical release candidate. It adds Oracle Database 26ai and IBM Db2 12.1.5 (Early Access Program) as first-class persistence backends alongside PostgreSQL and SQLite. The most recent published latest published PyPI release is 5.0.1 (legacy); 6.0.0rc1 is the source-install candidate from the v6.0-rc tag (Postgres + SQLite only). Enterprise backends ship from source against the tag until pip extras are published: git clone -b v6.0-rc https://github.com/ncz-os/mnemos. See docs/INSTALL.md for driver, DSN, and migration steps. Development history continues on the feat/oracle-port branch.

Quick Start

Memory and reasoning runtime for AI agents: persistent search, versioned storage, webhook fanout, and a unified LLM routing bus - all behind a single MCP interface.


1. Agent-driven install

Paste into Claude Code, Cursor, or Codex. The agent runs the install; you confirm.

Install MNEMOS on this machine.

Steps:
1. pip install 'mnemos-os[server]==6.0.0rc1'  # source-install from v6.0-rc tag until PyPI publish
2. mnemos init                         # scaffold config + token
3. mnemos serve                        # start API on :5002
4. mnemos doctor                       # verify subsystems
5. Set MNEMOS_BASE=http://localhost:5002 and MNEMOS_API_KEY=<token from step 2>
   in shell env and any agent config that needs to reach it.

Edge device (SQLite, no Postgres): pip install 'mnemos-os[edge]==6.0.0rc1' instead.
Full install with all subsystems: pip install 'mnemos-os[full]==6.0.0rc1'

Enterprise backends (Oracle Database 26ai, IBM Db2 12.1.5 EAP). Until the next PyPI release is cut, install from source against the v6.0-rc tag (or the feat/oracle-port development branch) and add the matching extras. See docs/INSTALL.md for full driver, DSN, and migration steps.

git clone -b feat/oracle-port https://github.com/ncz-os/mnemos
cd mnemos
python -m pip install -e '.[server,oracle]'   # or '.[server,db2]' or '.[server,enterprise]'
export MNEMOS_DATABASE_DSN='oracle://user:pass@host:1521/service_name'
# or:  MNEMOS_DATABASE_DSN='db2://user:pass@host:50000/dbname'
mnemos install --profile server
mnemos serve --profile server

2. Connect an agent via MCP

Add to ~/.claude/mcp_servers.json (Claude Code) or equivalent:

{
  "mcpServers": {
    "mnemos": {
      "command": "mnemos",
      "args": ["serve", "mcp-stdio"],
      "env": {
        "MNEMOS_BASE": "http://<host>:5002",
        "MNEMOS_API_KEY": "<token>"
      }
    }
  }
}

For HTTP/SSE transport (ChatGPT, remote agents): mnemos serve mcp-http on :5004.

Key MCP tools the agent gets:

Tool What it does
search_memories Semantic + filtered search across the memory store
create_memory Write a new memory with category, tags, and content
get_memory Fetch a memory by ID
kg_search Query the knowledge-graph triple store
kronos_anomalies Surface recall anomalies and memory health signals
list_deletions List soft-deleted memories pending hard deletion

3. Webhooks + integrations

Integration What connects How
Claude Code Hooks fire on session-start, prompt-submit, stop - auto-log to MNEMOS integrations/claude-code/ - copy hooks + set MNEMOS_BASE
ZeroClaw Zeroclaw agent reads/writes memories via MCP integrations/zeroclaw/ + mnemos serve mcp-stdio in zeroclaw config
OpenClaw OpenClaw gateway routes memory ops through MCP integrations/openclaw/ + MCP server entry in openclaw.json
Hermes Optional memory skill mounts MNEMOS as a tool provider integrations/hermes/optional-skills/memory/mnemos/
Webhooks (any) Push memory.created, memory.updated, memory.deleted, consultation.completed events to any HTTPS endpoint POST /api/webhooks/register with {"url": "...", "events": [...]}
Cursor / Cline / Continue.dev / Zed / Aider Any MCP-capable IDE connects via stdio or HTTP transport See docs/connectors/

Full documentation: docs/

Architecture

MNEMOS is a packaged FastAPI service with a single mnemos CLI for installation, serving, MCP transport, and operational checks. Agents connect through MCP stdio, MCP HTTP/SSE, REST, or OpenAI-compatible SDKs, while the runtime routes memory, reasoning, session, webhook, federation, portability, and observability work through the mnemos/ package. Persistence is selected by profile and DSN: SQLite + sqlite-vec for edge and development installs, PostgreSQL + pgvector for server deployments, Oracle Database 26ai (23.26.1-ee, HNSW INMEMORY NEIGHBOR GRAPH, JSON Duality, TDE) for enterprise installs, and IBM Db2 12.1.5 (native VECTOR(768, FLOAT32) + DiskANN vector index; runs through Db2 Oracle Compatibility Mode with cursor-level Oracle→Db2 token translation — a native Db2 dialect port is on the v6.x roadmap, see docs/v6.1-roadmap.md. Db2MemoryRepository.semantic_search emits native Db2 SQL — VECTOR_DISTANCE(..., EUCLIDEAN) + FETCH APPROX FIRST — engaging the DiskANN index on the user-facing query path) for enterprise installs. All four backends implement the same PersistenceBackend ABC (mnemos/persistence/base.py) and share tests/test_persistence_parity.py. GRAEAE handles multi-provider reasoning and model routing; MOIRAI handles operator-audited compression through APOLLO and ARTEMIS.

Documentation

Topic File
Installation docs/INSTALL.md
Specification docs/SPECIFICATION.md
System requirements docs/SYSTEM_REQUIREMENTS.md
Memory architecture docs/MEMORY_ARCHITECTURE.md
Compression docs/COMPRESSION.md
GRAEAE reasoning docs/GRAEAE_FEATURES.md
PANTHEON provider facade docs/PANTHEON.md
KRONOS observability docs/KRONOS.md
Portability format docs/MEMORY_EXPORT_FORMAT.md
Scaling docs/SCALING.md
Single-binary builds docs/SINGLE_BINARY.md
Operations docs/OPERATIONS.md
Benchmark harness scripts/bench_v4.py — cross-backend vector-search harness (PG / Oracle / Db2 / SQLite). Results published post-GA.

License

MNEMOS is licensed under the Apache License, Version 2.0. See LICENSE for the full text.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured