MCP Servers

MemoryThreads

Persistent, searchable conversation memory shared across Claude Code and Codex, enabling cross-session recall and thread continuity via hybrid BM25 and vector search.

README

MemoryThreads

Persistent, searchable conversation memory shared across Claude Code and Codex.

MemoryThreads is a local MCP server that auto-captures every conversation turn from both Claude Code and OpenAI Codex into one SQLite database, then makes that history instantly searchable from any future session - hybrid BM25 (FTS5) + vector (sqlite-vec) recall, with cross-platform thread continuity. Start work in Claude Code, continue it in Codex, and either side can recall the full shared history.

No cloud, no account - everything lives in ~/.claude/memory-server/ on your machine.

Why

LLM coding sessions are stateless - close the terminal and the context is gone. MemoryThreads fixes that:

Never re-explain. recall_context("that auth bug from last week") pulls the actual prior turns.
Cross-tool. Claude Code and Codex write into and read from the same memory, so switching tools never loses the thread.
Automatic. Hooks capture turns on every session; you don't manage it.
Fast + private. Local SQLite, FTS5 + sqlite-vec. Your conversations never leave your machine.

How it works

  Claude Code  ─┐                            ┌─ recall_context / search_docs
                ├─ hooks ─► jobs queue ─► worker.js ─► SQLite ◄─┤  (MCP tools)
  Codex        ─┘   (capture)             (parse + embed)        └─ mt launch (resume)

Capture. Session hooks (Stop, PreCompact, UserPromptSubmit) queue each session's transcript for ingestion. A launchd file-watcher (incremental-sync.js) also ingests Claude Code turns continuously.
Parse + embed. worker.js parses transcripts (dual-format: Claude Code JSONL and Codex rollout JSONL via transcript-parser.js), stores turns + threads, and embeds each turn (OpenAI text-embedding-3-small, 1536-dim) into a sqlite-vec table.
Recall. The MCP server (server.js) exposes recall_context, which runs hybrid BM25 + cosine search over turns/threads and returns the matches to the model.
Continuity. A canonical_thread_id links a Claude Code stream and a Codex stream into one logical MemoryThread, so continuation works across both tools without sharing native session files.

MCP tools

Tool	Purpose
`recall_context(query, resolution=0, include_threads)`	Hybrid BM25 + vector search. `resolution` 0 = raw turns (default), 1 = full threads, 2 = thread key-exchanges.
`search_docs(query)`	FTS5 search over ingested reference docs.
`ingest_doc(source, tags?, title?)`	Add a reference doc (URL, llms.txt, or local file).
`list_docs` / `delete_doc`	Manage ingested docs.
`save_thread(name, ...)`	Bookmark the current session as a named MemoryThread.
`list_threads` / `activate_thread` / `delete_thread`	Manage and resume bookmarks.

Slash commands & CLI

/mt-save <name>, /mt-list, /mt-delete <name>, /mt-doc-ingest <source>
mt launch (terminal) - interactive picker to resume a saved thread.

Data model

One SQLite DB (data/memory.db). Core tables: threads, turns, turns_fts (FTS5), turn_embeddings (sqlite-vec), plus saved_threads, active_memory_threads, docs, tool_uses, summaries, recovery_buffer, and the worker jobs queue. Full DDL in SCHEMA.md.

Recall operates directly over conversation turns and threads - there is no extracted-knowledge layer.

Setup

See SETUP.md for the full guide. In short:

npm install
Put OPENAI_API_KEY=... in .env (gitignored).
Register the MCP server: claude mcp add --scope user memory node ~/.claude/memory-server/server.js (and the [mcp_servers.memory] block in ~/.codex/config.toml for Codex).
Add the hooks block to ~/.claude/settings.json and ~/.codex/hooks.json.
Start the worker via the launchd watchdog.

Hooks

Event	Script	Role
`SessionStart`	`session-start-cold.sh` / `session-start-compact.sh`	Status line; compaction recovery
`UserPromptSubmit`	`user-prompt-submit.cjs`	Inject relevant prior turns + active-thread context
`PreCompact`	`pre-compact.sh`	Snapshot recent turns to `recovery_buffer` before compaction
`Stop`	`stop.cjs`	Queue the session transcript for ingestion

The same hook scripts serve both Claude Code (settings.json) and Codex (hooks.json).

Tech stack

Node.js (ES modules) · better-sqlite3 · sqlite-vec · OpenAI embeddings · @modelcontextprotocol/sdk · SQLite FTS5.

Privacy

All data is local. .env (your API key) and data/ (the DB) are gitignored and never committed.

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured