MCP Servers

Cortex

Local-first AI memory layer with hybrid retrieval and brain-inspired namespaces. Enables agents to save, search, and manage memories directly via MCP tools.

README

Cortex 🧠

Local-first AI memory layer with hybrid retrieval and brain-inspired namespaces. No API keys needed.

Give your AI agent persistent memory that runs entirely on your machine. Cortex combines vector similarity, full-text search, recency, and importance scoring into a single embedded database — no cloud services, no API keys, no monthly bills.

v0.4.0: L0/L1/L2 tiered context loading, hierarchical namespace subtrees, auto-session extraction (cortex extract). v0.3.0: Namespaced collections (brain regions), memory decay, consolidation, audit & health commands.

Quick Start (2 minutes)

# Install globally
npm install -g cortex-memory

# Save memories to brain regions
cortex save "I prefer TypeScript over JavaScript" --namespace learnings
cortex save "Ran 5k in 24:30" --namespace health
cortex save "Sarah prefers async communication" --namespace people

# Search within a namespace
cortex search "programming languages" --namespace learnings

# Search everything
cortex search "programming languages"

# Ingest a folder of notes
cortex ingest ~/notes --recursive --namespace daily

# Check brain health
cortex health
cortex status

That's it. Memories are stored locally at ~/.cortex/lance_db/.

Why Cortex?

Feature	Cortex	Plain vector DB	Cloud memory APIs
100% local	✅	✅	❌
Hybrid retrieval (vector + BM25 + recency + importance)	✅	❌	Sometimes
Zero config	✅	❌	❌
MCP server built-in	✅	❌	Sometimes
No API keys	✅	✅	❌
Free forever	✅	✅	❌

Install

npm install -g cortex-memory

Requires Node.js 18+. First run downloads the embedding model (~30MB) automatically.

Tiered Context Loading (L0/L1/L2)

Every memory is automatically indexed at three depth levels — no LLM needed:

Level	Size	Content	Flag
L0	~100 tokens	First sentence + key terms (abstract)	`--depth 0` (default)
L1	~500 tokens	First paragraph + structure	`--depth 1`
L2	Full	Original content	`--depth 2`

# Default: L0 abstracts — fastest, saves tokens
cortex search "vector search"

# L1 overviews — good balance
cortex search "vector search" --depth 1

# L2 full — when you need every detail
cortex search "vector search" --depth 2

This is huge for agents: retrieve 10 L0 abstracts to scan what's relevant, then fetch full content only for the 1-2 memories you actually need.

Namespaces (Brain Regions)

Organize memories into hierarchical namespaces — like a filesystem for knowledge:

Namespace	Purpose
`health`	Food, workouts, sleep, body metrics
`projects/myapp`	App-specific memories
`projects/trading`	Trading-specific memories
`user/preferences`	Personal settings and preferences
`user/people`	Info about specific people
`personal`	Relationships, reflections, plans
`daily`	Raw daily logs
`learnings`	Mistakes, corrections, patterns
`general`	Default / uncategorized (backward compatible)

cortex save "Ran 5k in 24:30" --namespace health
cortex save "MyApp uses React" --namespace projects/myapp

# Exact namespace match
cortex search "running times" --namespace health

# Subtree search — matches all projects/* namespaces
cortex search "project decisions" --namespace projects/ --namespace-prefix

Memory Maintenance

Decay

Memories that haven't been accessed lose importance over time (configurable half-life):

cortex decay --dry-run              # Preview what would decay
cortex decay --apply --half-life 30 # Apply with 30-day half-life

Consolidation

Merge similar memories into summaries (like sleep consolidating episodic → semantic memory):

cortex consolidate --dry-run        # Find clusters
cortex consolidate --apply          # Merge and summarize

Audit & Health

cortex audit    # Find duplicates (cosine sim > 0.95), stale memories, namespace distribution
cortex health   # Overall brain health: namespace balance, avg importance, staleness

CLI Reference

Save Memories

cortex save "I prefer TypeScript over JavaScript" --namespace learnings
cortex save "Deploy with Vercel" --type procedural --tags "deploy,vercel"
cortex save "Had a great meeting with the team" --type episodic -i 0.8
cortex save "Use pnpm for monorepos" --project myapp --namespace projects/myapp

Search (Hybrid Retrieval)

cortex search "what programming languages"
cortex search "deployment" --type procedural --limit 3
cortex search "meeting notes" --min-importance 0.7
cortex search "database setup" --project myapp --namespace projects/myapp

# Tiered depth (token-efficient retrieval)
cortex search "decisions" --depth 0          # L0 abstracts (default)
cortex search "decisions" --depth 1          # L1 overviews
cortex search "decisions" --depth 2          # L2 full content

# Hierarchical namespace prefix (subtree search)
cortex search "deployment" --namespace projects/ --namespace-prefix
cortex search "who is" --namespace user/ --namespace-prefix

Extract Memories from a Transcript

# Dry run — preview what would be extracted
cortex extract transcript.md --dry-run

# Save to a specific namespace
cortex extract meeting-notes.txt --namespace projects/myapp

# Save to general (default)
cortex extract conversation.md

cortex extract reads a conversation transcript (plain text or markdown) and automatically saves key facts, decisions, lessons, and person mentions as separate typed memories. Zero LLM calls — pure regex/heuristic extraction.

Ingest Files & Folders

# Ingest a single markdown file
cortex import MEMORY.md

# Ingest an entire folder of notes
cortex ingest ~/notes
cortex ingest ~/notes --recursive              # include subdirectories
cortex ingest ~/notes --smart                   # extract high-signal lines (Decision:, Lesson:, etc.)
cortex ingest ~/docs --ext .md,.txt,.org        # custom file extensions

# Smart mode recognizes prefixed lines:
#   Decision: Use PostgreSQL for the main database  → type: decision, importance: 0.9
#   Lesson: Always test with real data              → type: lesson, importance: 0.85
#   Bug: Race condition in the queue worker         → type: lesson, importance: 0.75

Manage & Curate

cortex status                    # Database overview
cortex curate                    # Find low-value memories to clean up
cortex curate --auto             # Auto-delete low-value entries
cortex export --type semantic    # Export as markdown
cortex delete <memory-id>        # Delete by ID

MCP Server (Claude Desktop / Cursor / OpenClaw)

Cortex ships with a built-in Model Context Protocol server, so AI agents can save and search memories directly.

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "cortex": {
      "command": "npx",
      "args": ["cortex-memory", "mcp"]
    }
  }
}

Cursor

Add to .cursor/mcp.json:

{
  "mcpServers": {
    "cortex": {
      "command": "npx",
      "args": ["cortex-memory", "mcp"]
    }
  }
}

OpenClaw

Add to your OpenClaw MCP config:

{
  "cortex": {
    "command": "npx",
    "args": ["cortex-memory", "mcp"]
  }
}

MCP Tools Available

Tool	Description
`memory_save`	Save a new memory with type, importance, and tags
`memory_search`	Hybrid search across all memories
`memory_context`	Get memory stats and overview
`memory_forget`	Delete a memory by ID
`memory_reflect`	Get AI-powered reflection on stored memories

Architecture

┌─────────────┐     ┌──────────────┐     ┌─────────────┐
│   CLI/MCP   │────▶│ Memory Engine │────▶│   LanceDB   │
│   Client    │◀────│  (scoring)   │◀────│  (embedded) │
└─────────────┘     └──────────────┘     └─────────────┘
                           │
                    ┌──────┴──────┐
                    │  Embeddings │
                    │ (MiniLM-L6) │
                    └─────────────┘

Storage: LanceDB embedded vector database at ~/.cortex/lance_db/

Embeddings: Xenova/all-MiniLM-L6-v2 — 384-dimensional vectors, runs locally via ONNX Runtime. No API keys.

Full-text search: Tantivy-powered BM25 scoring, automatically indexed.

Hybrid scoring formula:

score = 0.35×vector + 0.30×bm25 + 0.15×recency + 0.10×importance + 0.05×access_frequency + 0.05×type_boost

This means Cortex doesn't just find semantically similar results — it factors in how recently something was accessed, how important it is, and whether it matches exact keywords.

Memory Types

Type	Use Case	Example
`semantic`	Facts, preferences, knowledge	"I prefer dark mode"
`episodic`	Events, experiences	"Met with client on Tuesday"
`procedural`	How-to, processes	"To deploy: run npm build then vercel"
`decision`	Choices made and why	"Chose PostgreSQL over MySQL for JSONB support"
`lesson`	Things learned the hard way	"Always test with real data, not mocks"
`fact`	Verified information	"The API rate limit is 100 req/min"
`preference`	Personal preferences	"Prefers tabs over spaces"
`project-state`	Current state of a project	"v2 API is in beta, launching next week"
`person`	Info about people	"Sarah is the frontend lead, prefers React"

Programmatic Usage

import { MemoryEngine } from 'cortex-memory';

const engine = new MemoryEngine();

// Save
await engine.save({
  content: "Always validate user input server-side",
  type: "lesson",
  importance: 0.9,
  tags: ["security", "validation"],
});

// Search
const results = await engine.search({
  query: "input validation",
  limit: 5,
});

for (const r of results) {
  console.log(`[${r.score.toFixed(3)}] ${r.memory.content}`);
}

// Stats
const stats = await engine.stats();
console.log(`Total memories: ${stats.totalMemories}`);

Agent Integration

Cortex is designed to be the memory backbone for AI agents. See the docs for full integration guides:

Agent Integration Guide — How any AI agent should use Cortex: search, save, ingest, curate, and best practices
OpenClaw Integration Guide — Specific setup for OpenClaw agents: AGENTS.md config, session ingestion, heartbeat maintenance

Quick Start for Agents

# Search before answering (mandatory dual-search with memory_search)
npx cortex search "deployment steps" --limit 5

# Save lessons immediately when recognized
npx cortex save --type belief --tags "calibration" "Always verify sub-agent output independently"

# Ingest sessions after significant work days
npx cortex ingest-sessions

# Weekly maintenance
npx cortex curate
npx cortex health

Development

git clone https://github.com/ryanfrigo/cortex.git
cd cortex
npm install
npm run build
node bin/cortex.js status

Testing

npm test              # Run unit tests (vitest)
npm run bench         # Run recall benchmarks against real DB
npm run bench:regression  # Compare with previous benchmark run

Unit tests (tests/): engine, scoring, decay, consolidation — uses temp DB
Benchmarks (bench/): 22 ground-truth queries across health, projects, personal, learnings, factual, and semantic categories
Regression guard: flags if recall drops >5% between runs

Contributing

See CONTRIBUTING.md for guidelines.

License

MIT

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured