Cortex

Cortex

Private local memory for AI tools that provides persistent, private memory via HTTP and MCP protocols, enabling tools to store and recall context without external services.

Category
Visit Server

README

<p align="center"> <img src="assets/cortex-header.gif" alt="Cortex" width="100%"> </p>

<h1 align="center">Cortex</h1> <p align="center"><b>Private local memory for your AI tools.</b><br> Install once. Your tools stop starting from scratch.</p>

<p align="center"> <a href="https://ko-fi.com/adityavg13"> <img src="https://img.shields.io/badge/☕_Ko--fi-Donations_help_support_Cortex_development-FF5E5B?style=for-the-badge&logo=ko-fi&logoColor=white" alt="Support Cortex on Ko-fi"> </a> </p>

<p align="center"> <a href="CHANGELOG.md#060---2026-06-06"><img src="https://img.shields.io/badge/release-0.6.0-blue?style=flat-square" alt="release 0.6.0"></a>  <a href="LICENSE"><img src="https://img.shields.io/github/license/AdityaVG13/cortex?style=flat-square" alt="MIT License"></a>  <img src="https://img.shields.io/badge/platforms-Windows_|macOS|Linux-333?style=flat-square" alt="Windows | macOS | Linux">  <img src="https://img.shields.io/badge/Rust+React-daemon+_desktop-orange?style=flat-square" alt="Rust + React"> </p>

<p align="center"> <a href="https://github.com/AdityaVG13/cortex/releases/latest">Download</a>  ·   <a href="Info/connecting.md">Connect your tools</a>  ·   <a href="CHANGELOG.md">What's new</a>  ·   <a href="Info/roadmap.md">Roadmap</a> </p>

<p align="center"> <img src="https://img.shields.io/badge/works_with-Claude_Code-6B4FBB?style=flat-square&logo=anthropic&logoColor=white">   <img src="https://img.shields.io/badge/works_with-Codex-412991?style=flat-square&logo=openai&logoColor=white">   <img src="https://img.shields.io/badge/works_with-Cursor-000?style=flat-square&logo=cursor&logoColor=white">   <img src="https://img.shields.io/badge/works_with-Factory_Droid-F97316?style=flat-square">   <img src="https://img.shields.io/badge/works_with-MCP_·_HTTP-58a6ff?style=flat-square"> </p>


<p align="center"> 🔒 <b>Private by default</b>: localhost only, data never leaves your machine<br> 🔗 <b>One memory, every tool</b>: HTTP and MCP, same brain, no per-tool silos<br> 📊 <b>Prove it works</b>: token savings, recall quality, and Monte Carlo projections </p>


Quick Start

Get to the first memory moment before learning daemon internals.

1. Install or build Cortex

Use the latest desktop installer, or build the 0.6.0 source CLI:

git clone https://github.com/AdityaVG13/cortex.git
cd cortex/daemon-rs
cargo build --release

2. Start local memory

Open Cortex Control Center and start Cortex from the app. CLI-only users can run:

cortex serve

3. Check readiness

cortex status --json

Success is "status": "ready". If status is needs_action or error, follow the returned nextAction / repair before continuing.

4. Connect one AI tool

Claude Code:

claude plugin marketplace add AdityaVG13/cortex
claude plugin install cortex@cortex-marketplace

Codex:

codex mcp add cortex -- cortex.exe mcp --agent codex

Restart the AI tool after changing MCP config.

5. Store and recall one memory

From a connected MCP client, call cortex_store, then cortex_recall. From the repo, run the matching smoke script:

Windows:

powershell -ExecutionPolicy Bypass -File scripts\first-run-smoke.ps1

macOS / Linux:

bash scripts/first-run-smoke.sh

That smoke checks status, stores one disposable local memory, and recalls it. Normal use does not require benchmark adapters, provider keys, or LongMemEval.

More tool-specific setup: Info/connecting.md.


<table> <tr> <td align="center" valign="top"> <img width="400" height="1" src="https://raw.githubusercontent.com/AdityaVG13/cortex/master/assets/spacer.png"><br> <h3>❌ Without Cortex</h3> <p> Session 1  →  explain preferences<br> Session 2  →  explain them again<br> Session 3  →  and again, new tool<br> Session 14  →  still explaining<br><br> <b>~15,000 tokens wasted</b> </p> <br> </td> <td align="center" valign="top"> <img width="400" height="1" src="https://raw.githubusercontent.com/AdityaVG13/cortex/master/assets/spacer.png"><br> <h3>✅ With Cortex</h3> <p> Session 1  →  store once<br> Session 2  →  boot, already knows<br> Session 3  →  boot, already knows<br> Session 14  →  boot, still knows<br><br> <b>~300 tokens per boot (97% less)</b> </p> <br> </td> </tr> </table>


<p align="center"> <img src="https://img.shields.io/badge/1-STORE-6B4FBB?style=for-the-badge" alt="Store">   <img src="https://img.shields.io/badge/→-grey?style=for-the-badge" alt="→">   <img src="https://img.shields.io/badge/2-RECALL-4a2d8a?style=for-the-badge" alt="Recall">   <img src="https://img.shields.io/badge/→-grey?style=for-the-badge" alt="→">   <img src="https://img.shields.io/badge/3-BOOT-8B5CF6?style=for-the-badge" alt="Boot"> </p>

<table align="center"> <tr> <td align="center" width="33%">

POST /store

Save decisions, lessons, preferences. Conflict detection is automatic.

</td> <td align="center" width="33%">

GET /recall

Hybrid keyword + semantic search. In-process ONNX embeddings, no external service.

</td> <td align="center" width="33%">

GET /boot

Compiled identity + delta capsule. ~300 tokens served instead of ~15,000 raw.

</td> </tr> </table>


<p align="center">Memory tools are easy to pitch and hard to trust. Cortex starts to matter when the savings stop looking theoretical.</p>

<table> <tr> <td width="50%">

<p align="center"><b>📊 Analytics</b></p> <img src="assets/grid-control-center-analytics.png" width="100%"> <p align="center"><sub>Savings, compression, and activity heatmaps</sub></p>

</td> <td width="50%">

<p align="center"><b>📈 Monte Carlo</b></p> <img src="assets/grid-cc-monte-carlo.png" width="100%"> <p align="center"><sub>30-day projection with confidence bands</sub></p>

</td> </tr> <tr> <td width="50%">

<p align="center"><b>🤖 Agents</b></p> <img src="assets/grid-cc-agents.png" width="100%"> <p align="center"><sub>Live sessions, inbox, deduped by identity</sub></p>

</td> <td width="50%">

<p align="center"><b>🎛️ Overview</b></p> <img src="assets/grid-cc-overview.png" width="100%"> <p align="center"><sub>Memory counts, health, and navigation</sub></p>

</td> </tr> </table>


<p align="center">Benchmark note: <code>cortex-http-pure</code> is a benchmark adapter only; it is not required for normal Cortex operation. Scored LongMemEval-S validation is deferred until project budget allows, so v0.6.x does not claim a LongMemEval quality lift.</p>

<p align="center">Historical v0.5.0 numbers below were measured against a 20-query ground-truth dataset via the <code>cortex-http-base</code> adapter. New recall-quality claims use the helper-free <code>cortex-http-pure</code> adapter as the canonical core baseline after funded validation.</p>

<table align="center"> <tr> <th></th> <th align="center">v0.4.1</th> <th align="center">v0.5.0</th> <th align="center">Δ</th> </tr> <tr> <td align="center"><b>Precision</b></td> <td align="center">55.2%</td> <td align="center"><b>87.5%</b></td> <td align="center">📈 +32.3%</td> </tr> <tr> <td align="center"><b>MRR</b></td> <td align="center">69.2%</td> <td align="center"><b>95.0%</b></td> <td align="center">📈 +25.8%</td> </tr> <tr> <td align="center"><b>Top-1 hit</b></td> <td align="center">90.0%</td> <td align="center"><b>90.0%</b></td> <td align="center">—</td> </tr> <tr> <td align="center"><b>Avg query tokens</b></td> <td align="center">n/a</td> <td align="center"><b>48.4</b></td> <td align="center">—</td> </tr> </table>

<p align="center"> <sub><a href="benchmarking/results/raw-recall-no-helper-dev-20260421-224217.json">Raw v0.5.0 JSON</a></sub><br> <sub>Note: <code>cortex-http-base</code> ("raw") adapter retains partial adapter-layer helpers and is deprecated for new quality claims. The helper-free <code>cortex-http-pure</code> adapter is the v0.6.0+ canonical measurement floor, enforced by 5 CI purity gates. See <a href="benchmarking/README.md">benchmarking/README.md</a>. Reranking ships default-off behind off/shadow/primary modes; public promotion remains gated on LongMemEval/API-backed validation. Query expansion (HyDE) is targeted for v0.7.0.</sub> </p>


<p align="center">v0.6.0 makes settings, governance, boot audits, and recall-quality measurement first-class. Full details in <a href="CHANGELOG.md">CHANGELOG.md</a>.</p>

Accessibility and settings

  • Settings panel: Accessibility, Appearance & Motion, Connection, Budgets, and Keyboard & Navigation
  • Runtime preferences: high contrast, reduced motion, keyboard hints, and compact navigation
  • Accessibility gates: stronger focus states, ARIA/live regions, contrast checks, and 375px reflow checks

Governance

  • Retention classes across store, MCP, OpenAPI, export, and import
  • Local endpoint budgets with stable HTTP 429 / JSON-RPC denial metadata
  • Budget UI in Control Center, backed by the local budgets.toml
  • Boot audits plus GET /boot/audit and the read-only cortex_boot_audit MCP tool
  • Admin rollback with dry-run/apply workflow and audit events

Recall quality

  • cortex-http-pure adapter as the canonical helper-free measurement floor
  • Purity gates, CAS-100, and triangle judge tooling for safer quality claims
  • bge-base-en-v1.5 default embeddings with MiniLM profiles and qwen3-embedding-0.6b opt-in
  • Cross-encoder reranking behind off/shadow/primary modes; default remains off

Reliability

  • Claude plugin MCP is attach-only and no longer starts a second daemon from plugin MCP paths
  • Control Center supervises the app-managed daemon and honors intentional stops
  • Handler panics return JSON 500 responses, with local panic breadcrumbs
  • Storage hygiene compacts FTS, prunes stale embeddings, and migrates canonical vectors to PQ8 int8 blobs

<p align="center">Cortex tracks active agent sessions when clients identify themselves through <code>cortex_boot</code> or <code>GET /boot?agent=NAME</code>.</p>

<table> <tr> <td width="55%">

Connected agents in Control Center

</td> <td width="45%" valign="top">

Multi-agent, one brain

  • Each boot call registers a session. Control Center shows active sessions, deduplicated by agent identity.
  • Read-path tools (recall, peek, unfold) reattach to existing sessions. No duplicates.
  • Session descriptions preserved across reconnects and daemon restarts.
  • What one agent stores, every other agent can recall.

Claude Code, Codex, Cursor, and custom scripts can all be connected simultaneously. Each tracks its own session while sharing the same memory.

</td> </tr> </table>


<div align="center">

Tool Connection Setup
Claude Code MCP (plugin) or desktop app Plugin: claude plugin install cortex@cortex-marketplace
Codex MCP codex mcp add cortex -- cortex.exe mcp --agent codex
Cursor MCP Point MCP server at cortex mcp --agent cursor
Factory Droid MCP cortex mcp --agent droid
Aider CLI / HTTP cortex boot --agent aider
Custom tools HTTP Three endpoints: /boot, /recall, /store
Local LLMs HTTP / MCP Same protocol, any runtime

</div>

<p align="center">Full setup guide: <a href="Info/connecting.md"><b>Info/connecting.md</b></a></p>


<p align="center"><b>Desktop app (Control Center)</b><br> Download from the <a href="https://github.com/AdityaVG13/cortex/releases/latest">latest tagged release page</a>. The Control Center manages daemon lifecycle for you.</p>

<div align="center">

Platform Desktop installer Daemon archive
Windows .exe (NSIS installer) .zip
macOS .dmg .tar.gz
Linux .AppImage / .deb .tar.gz

</div>

<p align="center"><sub>Current release: <code>v0.6.0</code>.</sub></p>

<p align="center"><b>From source</b></p>

git clone https://github.com/AdityaVG13/cortex.git
cd cortex/daemon-rs
cargo build --release

<p align="center"><b>Claude Code plugin</b></p>

claude plugin marketplace add AdityaVG13/cortex
claude plugin install cortex@cortex-marketplace

<p align="center">The plugin attaches to a running Cortex runtime. If Cortex is not ready, it reports <code>APP_INIT_REQUIRED</code>; open Control Center or start the local runtime, then retry.</p>


<p align="center">Cortex enforces a <b>single-daemon invariant</b>: only one daemon process runs at a time.</p>

<div align="center">

Mode How it works
Desktop app Control Center owns the daemon. Restart and monitor from the app.
CLI cortex serve starts the daemon. Exits cleanly if one is already running.
Plugin Attach-only MCP bridge. It connects to the running app/service daemon and does not silently spawn a second daemon.

</div>

<p align="center">Default bind: <code>127.0.0.1:7437</code>. Non-loopback binds require TLS. Auth token at <code>~/.cortex/cortex.token</code>.<br> If using the Control Center, manage the daemon from there. Do not run a second <code>cortex serve</code> alongside it.</p>


<p align="center">After installing, verify the product path:</p>

cortex status --json

Windows:

powershell -ExecutionPolicy Bypass -File scripts\first-run-smoke.ps1

macOS / Linux:

bash scripts/first-run-smoke.sh

<details> <summary>Development build verification</summary>

# Daemon unit tests
cargo test --manifest-path daemon-rs/Cargo.toml

# Desktop test suite
npm --prefix desktop/cortex-control-center test

# Lifecycle smoke test
npm --prefix desktop/cortex-control-center run verify:lifecycle:dev

# Security audit
npm audit --omit=dev --audit-level=high
cargo audit

</details>


<div align="center">

Document Covers
Connecting Setup, MCP, HTTP, auth, troubleshooting
Architecture Codebase map, entry points, data flow, config, tests
MCP Tools All 29 MCP tool definitions and parameters
Research Papers, inspirations, adaptation notes
Roadmap What shipped, what's planned, and why
Security Threat model, auth rules, vulnerability reporting
Team mode Shared-server setup for engineering teams
Contributing Development setup and PR guidelines

</div>

<details> <summary>CLI reference</summary>

Command Description
cortex serve Start the daemon
cortex --help Full command reference
cortex doctor Run diagnostics
cortex paths --json Show file and port paths
cortex plugin ensure-daemon Ensure daemon health (plugin mode)
cortex plugin mcp MCP stdio bridge to HTTP API
cortex setup --team Initialize team mode and generate API keys
cortex export Export data (json or sql)
cortex import Import from a previous export
cortex admin rollback --session-id <id> Soft-delete a session's memory writes (dry-run default; --apply to persist)

</details>


<p align="center">Cortex defaults to localhost-only access with bearer-token auth.<br> Full threat model, auth rules, and vulnerability reporting: <a href="Info/security-rules.md"><b>Info/security-rules.md</b></a></p>


<details> <summary>💾 <b>How much disk space does Cortex use?</b></summary> <br> The daemon binary is ~30 MB. The SQLite database grows with usage. A real install with 286 memories and 493 decisions uses ~386 MB after compaction. The ONNX embedding model (~50 MB) downloads on first run. </details>

<details> <summary>🤖 <b>Can multiple agents write to Cortex at the same time?</b></summary> <br> Yes. SQLite WAL mode handles concurrent reads and serialized writes. Each agent maintains its own session while sharing the same memory. Conflict detection handles contradictions automatically. </details>

<details> <summary>🔒 <b>Does Cortex send any data externally?</b></summary> <br> No. In solo mode, Cortex runs entirely on localhost. No telemetry, no phone-home, no cloud sync. Team mode sends data only to the configured team server over your network. </details>

<details> <summary>🔄 <b>What happens if the daemon crashes mid-session?</b></summary> <br> The MCP proxy detects daemon death and restarts automatically (bounded to 3 attempts with backoff). SQLite WAL mode ensures no data corruption. Sessions survive transient crashes. </details>

<details> <summary>🧹 <b>How do I reset Cortex to a clean state?</b></summary> <br> Delete <code>~/.cortex/cortex.db</code> and restart the daemon. A new empty database and auth token are generated. Settings and model files are preserved. </details>


<p align="center"><b>Built by</b></p>

<p align="center"> <a href="https://github.com/AdityaVG13/cortex/graphs/contributors"> <img src="https://contrib.rocks/image?repo=AdityaVG13/cortex&max=20&columns=10" /> </a> </p>


<p align="center"> <a href="https://ko-fi.com/adityavg13"><b>☕ Support Cortex</b></a>  ·   <a href="Info/research.md">Research</a>  ·   <a href="Info/connecting.md">Connecting</a>  ·   <a href="Info/security-rules.md">Security</a>  ·   <a href="CONTRIBUTING.md">Contributing</a>  ·   <a href="CODE_OF_CONDUCT.md">Code of Conduct</a>  ·   <a href="CHANGELOG.md">Changelog</a>  ·   <a href="LICENSE">MIT License</a> </p>

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured