Cortex

Cortex

Local-first AI memory layer with hybrid retrieval and brain-inspired namespaces. Enables agents to save, search, and manage memories directly via MCP tools.

Category
Visit Server

README

Cortex 🧠

Local-first AI memory layer with hybrid retrieval and brain-inspired namespaces. No API keys needed.

npm version License: MIT

Give your AI agent persistent memory that runs entirely on your machine. Cortex combines vector similarity, full-text search, recency, and importance scoring into a single embedded database β€” no cloud services, no API keys, no monthly bills.

v0.4.0: L0/L1/L2 tiered context loading, hierarchical namespace subtrees, auto-session extraction (cortex extract). v0.3.0: Namespaced collections (brain regions), memory decay, consolidation, audit & health commands.

Quick Start (2 minutes)

# Install globally
npm install -g cortex-memory

# Save memories to brain regions
cortex save "I prefer TypeScript over JavaScript" --namespace learnings
cortex save "Ran 5k in 24:30" --namespace health
cortex save "Sarah prefers async communication" --namespace people

# Search within a namespace
cortex search "programming languages" --namespace learnings

# Search everything
cortex search "programming languages"

# Ingest a folder of notes
cortex ingest ~/notes --recursive --namespace daily

# Check brain health
cortex health
cortex status

That's it. Memories are stored locally at ~/.cortex/lance_db/.

Why Cortex?

Feature Cortex Plain vector DB Cloud memory APIs
100% local βœ… βœ… ❌
Hybrid retrieval (vector + BM25 + recency + importance) βœ… ❌ Sometimes
Zero config βœ… ❌ ❌
MCP server built-in βœ… ❌ Sometimes
No API keys βœ… βœ… ❌
Free forever βœ… βœ… ❌

Install

npm install -g cortex-memory

Requires Node.js 18+. First run downloads the embedding model (~30MB) automatically.

Tiered Context Loading (L0/L1/L2)

Every memory is automatically indexed at three depth levels β€” no LLM needed:

Level Size Content Flag
L0 ~100 tokens First sentence + key terms (abstract) --depth 0 (default)
L1 ~500 tokens First paragraph + structure --depth 1
L2 Full Original content --depth 2
# Default: L0 abstracts β€” fastest, saves tokens
cortex search "vector search"

# L1 overviews β€” good balance
cortex search "vector search" --depth 1

# L2 full β€” when you need every detail
cortex search "vector search" --depth 2

This is huge for agents: retrieve 10 L0 abstracts to scan what's relevant, then fetch full content only for the 1-2 memories you actually need.

Namespaces (Brain Regions)

Organize memories into hierarchical namespaces β€” like a filesystem for knowledge:

Namespace Purpose
health Food, workouts, sleep, body metrics
projects/myapp App-specific memories
projects/trading Trading-specific memories
user/preferences Personal settings and preferences
user/people Info about specific people
personal Relationships, reflections, plans
daily Raw daily logs
learnings Mistakes, corrections, patterns
general Default / uncategorized (backward compatible)
cortex save "Ran 5k in 24:30" --namespace health
cortex save "MyApp uses React" --namespace projects/myapp

# Exact namespace match
cortex search "running times" --namespace health

# Subtree search β€” matches all projects/* namespaces
cortex search "project decisions" --namespace projects/ --namespace-prefix

Memory Maintenance

Decay

Memories that haven't been accessed lose importance over time (configurable half-life):

cortex decay --dry-run              # Preview what would decay
cortex decay --apply --half-life 30 # Apply with 30-day half-life

Consolidation

Merge similar memories into summaries (like sleep consolidating episodic β†’ semantic memory):

cortex consolidate --dry-run        # Find clusters
cortex consolidate --apply          # Merge and summarize

Audit & Health

cortex audit    # Find duplicates (cosine sim > 0.95), stale memories, namespace distribution
cortex health   # Overall brain health: namespace balance, avg importance, staleness

CLI Reference

Save Memories

cortex save "I prefer TypeScript over JavaScript" --namespace learnings
cortex save "Deploy with Vercel" --type procedural --tags "deploy,vercel"
cortex save "Had a great meeting with the team" --type episodic -i 0.8
cortex save "Use pnpm for monorepos" --project myapp --namespace projects/myapp

Search (Hybrid Retrieval)

cortex search "what programming languages"
cortex search "deployment" --type procedural --limit 3
cortex search "meeting notes" --min-importance 0.7
cortex search "database setup" --project myapp --namespace projects/myapp

# Tiered depth (token-efficient retrieval)
cortex search "decisions" --depth 0          # L0 abstracts (default)
cortex search "decisions" --depth 1          # L1 overviews
cortex search "decisions" --depth 2          # L2 full content

# Hierarchical namespace prefix (subtree search)
cortex search "deployment" --namespace projects/ --namespace-prefix
cortex search "who is" --namespace user/ --namespace-prefix

Extract Memories from a Transcript

# Dry run β€” preview what would be extracted
cortex extract transcript.md --dry-run

# Save to a specific namespace
cortex extract meeting-notes.txt --namespace projects/myapp

# Save to general (default)
cortex extract conversation.md

cortex extract reads a conversation transcript (plain text or markdown) and automatically saves key facts, decisions, lessons, and person mentions as separate typed memories. Zero LLM calls β€” pure regex/heuristic extraction.

Ingest Files & Folders

# Ingest a single markdown file
cortex import MEMORY.md

# Ingest an entire folder of notes
cortex ingest ~/notes
cortex ingest ~/notes --recursive              # include subdirectories
cortex ingest ~/notes --smart                   # extract high-signal lines (Decision:, Lesson:, etc.)
cortex ingest ~/docs --ext .md,.txt,.org        # custom file extensions

# Smart mode recognizes prefixed lines:
#   Decision: Use PostgreSQL for the main database  β†’ type: decision, importance: 0.9
#   Lesson: Always test with real data              β†’ type: lesson, importance: 0.85
#   Bug: Race condition in the queue worker         β†’ type: lesson, importance: 0.75

Manage & Curate

cortex status                    # Database overview
cortex curate                    # Find low-value memories to clean up
cortex curate --auto             # Auto-delete low-value entries
cortex export --type semantic    # Export as markdown
cortex delete <memory-id>        # Delete by ID

MCP Server (Claude Desktop / Cursor / OpenClaw)

Cortex ships with a built-in Model Context Protocol server, so AI agents can save and search memories directly.

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "cortex": {
      "command": "npx",
      "args": ["cortex-memory", "mcp"]
    }
  }
}

Cursor

Add to .cursor/mcp.json:

{
  "mcpServers": {
    "cortex": {
      "command": "npx",
      "args": ["cortex-memory", "mcp"]
    }
  }
}

OpenClaw

Add to your OpenClaw MCP config:

{
  "cortex": {
    "command": "npx",
    "args": ["cortex-memory", "mcp"]
  }
}

MCP Tools Available

Tool Description
memory_save Save a new memory with type, importance, and tags
memory_search Hybrid search across all memories
memory_context Get memory stats and overview
memory_forget Delete a memory by ID
memory_reflect Get AI-powered reflection on stored memories

Architecture

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚   CLI/MCP   │────▢│ Memory Engine │────▢│   LanceDB   β”‚
β”‚   Client    │◀────│  (scoring)   │◀────│  (embedded) β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                           β”‚
                    β”Œβ”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”
                    β”‚  Embeddings β”‚
                    β”‚ (MiniLM-L6) β”‚
                    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Storage: LanceDB embedded vector database at ~/.cortex/lance_db/

Embeddings: Xenova/all-MiniLM-L6-v2 β€” 384-dimensional vectors, runs locally via ONNX Runtime. No API keys.

Full-text search: Tantivy-powered BM25 scoring, automatically indexed.

Hybrid scoring formula:

score = 0.35Γ—vector + 0.30Γ—bm25 + 0.15Γ—recency + 0.10Γ—importance + 0.05Γ—access_frequency + 0.05Γ—type_boost

This means Cortex doesn't just find semantically similar results β€” it factors in how recently something was accessed, how important it is, and whether it matches exact keywords.

Memory Types

Type Use Case Example
semantic Facts, preferences, knowledge "I prefer dark mode"
episodic Events, experiences "Met with client on Tuesday"
procedural How-to, processes "To deploy: run npm build then vercel"
decision Choices made and why "Chose PostgreSQL over MySQL for JSONB support"
lesson Things learned the hard way "Always test with real data, not mocks"
fact Verified information "The API rate limit is 100 req/min"
preference Personal preferences "Prefers tabs over spaces"
project-state Current state of a project "v2 API is in beta, launching next week"
person Info about people "Sarah is the frontend lead, prefers React"

Programmatic Usage

import { MemoryEngine } from 'cortex-memory';

const engine = new MemoryEngine();

// Save
await engine.save({
  content: "Always validate user input server-side",
  type: "lesson",
  importance: 0.9,
  tags: ["security", "validation"],
});

// Search
const results = await engine.search({
  query: "input validation",
  limit: 5,
});

for (const r of results) {
  console.log(`[${r.score.toFixed(3)}] ${r.memory.content}`);
}

// Stats
const stats = await engine.stats();
console.log(`Total memories: ${stats.totalMemories}`);

Agent Integration

Cortex is designed to be the memory backbone for AI agents. See the docs for full integration guides:

  • Agent Integration Guide β€” How any AI agent should use Cortex: search, save, ingest, curate, and best practices
  • OpenClaw Integration Guide β€” Specific setup for OpenClaw agents: AGENTS.md config, session ingestion, heartbeat maintenance

Quick Start for Agents

# Search before answering (mandatory dual-search with memory_search)
npx cortex search "deployment steps" --limit 5

# Save lessons immediately when recognized
npx cortex save --type belief --tags "calibration" "Always verify sub-agent output independently"

# Ingest sessions after significant work days
npx cortex ingest-sessions

# Weekly maintenance
npx cortex curate
npx cortex health

Development

git clone https://github.com/ryanfrigo/cortex.git
cd cortex
npm install
npm run build
node bin/cortex.js status

Testing

npm test              # Run unit tests (vitest)
npm run bench         # Run recall benchmarks against real DB
npm run bench:regression  # Compare with previous benchmark run
  • Unit tests (tests/): engine, scoring, decay, consolidation β€” uses temp DB
  • Benchmarks (bench/): 22 ground-truth queries across health, projects, personal, learnings, factual, and semantic categories
  • Regression guard: flags if recall drops >5% between runs

Contributing

See CONTRIBUTING.md for guidelines.

License

MIT

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured