MemoryThreads
Persistent, searchable conversation memory shared across Claude Code and Codex, enabling cross-session recall and thread continuity via hybrid BM25 and vector search.
README
MemoryThreads
Persistent, searchable conversation memory shared across Claude Code and Codex.
MemoryThreads is a local MCP server that auto-captures every conversation turn from both Claude Code and OpenAI Codex into one SQLite database, then makes that history instantly searchable from any future session - hybrid BM25 (FTS5) + vector (sqlite-vec) recall, with cross-platform thread continuity. Start work in Claude Code, continue it in Codex, and either side can recall the full shared history.
No cloud, no account - everything lives in ~/.claude/memory-server/ on your machine.
Why
LLM coding sessions are stateless - close the terminal and the context is gone. MemoryThreads fixes that:
- Never re-explain.
recall_context("that auth bug from last week")pulls the actual prior turns. - Cross-tool. Claude Code and Codex write into and read from the same memory, so switching tools never loses the thread.
- Automatic. Hooks capture turns on every session; you don't manage it.
- Fast + private. Local SQLite, FTS5 + sqlite-vec. Your conversations never leave your machine.
How it works
Claude Code ─┐ ┌─ recall_context / search_docs
├─ hooks ─► jobs queue ─► worker.js ─► SQLite ◄─┤ (MCP tools)
Codex ─┘ (capture) (parse + embed) └─ mt launch (resume)
- Capture. Session hooks (Stop, PreCompact, UserPromptSubmit) queue each session's transcript for ingestion. A launchd file-watcher (
incremental-sync.js) also ingests Claude Code turns continuously. - Parse + embed.
worker.jsparses transcripts (dual-format: Claude Code JSONL and Codex rollout JSONL viatranscript-parser.js), stores turns + threads, and embeds each turn (OpenAItext-embedding-3-small, 1536-dim) into a sqlite-vec table. - Recall. The MCP server (
server.js) exposesrecall_context, which runs hybrid BM25 + cosine search over turns/threads and returns the matches to the model. - Continuity. A
canonical_thread_idlinks a Claude Code stream and a Codex stream into one logical MemoryThread, so continuation works across both tools without sharing native session files.
MCP tools
| Tool | Purpose |
|---|---|
recall_context(query, resolution=0, include_threads) |
Hybrid BM25 + vector search. resolution 0 = raw turns (default), 1 = full threads, 2 = thread key-exchanges. |
search_docs(query) |
FTS5 search over ingested reference docs. |
ingest_doc(source, tags?, title?) |
Add a reference doc (URL, llms.txt, or local file). |
list_docs / delete_doc |
Manage ingested docs. |
save_thread(name, ...) |
Bookmark the current session as a named MemoryThread. |
list_threads / activate_thread / delete_thread |
Manage and resume bookmarks. |
Slash commands & CLI
/mt-save <name>,/mt-list,/mt-delete <name>,/mt-doc-ingest <source>mt launch(terminal) - interactive picker to resume a saved thread.
Data model
One SQLite DB (data/memory.db). Core tables: threads, turns, turns_fts (FTS5), turn_embeddings (sqlite-vec), plus saved_threads, active_memory_threads, docs, tool_uses, summaries, recovery_buffer, and the worker jobs queue. Full DDL in SCHEMA.md.
Recall operates directly over conversation turns and threads - there is no extracted-knowledge layer.
Setup
See SETUP.md for the full guide. In short:
npm install- Put
OPENAI_API_KEY=...in.env(gitignored). - Register the MCP server:
claude mcp add --scope user memory node ~/.claude/memory-server/server.js(and the[mcp_servers.memory]block in~/.codex/config.tomlfor Codex). - Add the hooks block to
~/.claude/settings.jsonand~/.codex/hooks.json. - Start the worker via the launchd watchdog.
Hooks
| Event | Script | Role |
|---|---|---|
SessionStart |
session-start-cold.sh / session-start-compact.sh |
Status line; compaction recovery |
UserPromptSubmit |
user-prompt-submit.cjs |
Inject relevant prior turns + active-thread context |
PreCompact |
pre-compact.sh |
Snapshot recent turns to recovery_buffer before compaction |
Stop |
stop.cjs |
Queue the session transcript for ingestion |
The same hook scripts serve both Claude Code (settings.json) and Codex (hooks.json).
Tech stack
Node.js (ES modules) · better-sqlite3 · sqlite-vec · OpenAI embeddings · @modelcontextprotocol/sdk · SQLite FTS5.
Privacy
All data is local. .env (your API key) and data/ (the DB) are gitignored and never committed.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.