Marrow

Marrow

Marrow is a persistent multi-project memory backend that provides AI agents with MCP tools for semantic code navigation, task backlog management, and versioned artifact storage. It enables seamless, token-optimized context sharing and asynchronous handoffs between different AI models working on the same codebase.

Category
Visit Server

README

Marrow MCP server

Marrow

A persistent, multi-project intelligence backend for AI coding agents.

Marrow gives AI agents structured, long-lived memory over your codebase and projects — served over the Model Context Protocol (MCP). It exposes a unified API surface covering task management, versioned document storage, semantic code navigation, session state, and a build pipeline.

At its core, a background daemon watches your source files in real time, extracts structural skeletons using language-aware grammars, generates vector embeddings, and keeps a semantic index always in sync. The result: agents can navigate your code by meaning, not just by filename.


Why Marrow?

AI coding agents are stateless by nature. Every new session starts cold — no memory of what was decided, what was built, or where things stand. Marrow solves this by acting as a persistent, structured workspace that any agent can plug into via MCP and immediately orient itself.

Without Marrow With Marrow
Agent forgets context between sessions Full session state persisted and recoverable
Agent searches code by filename Agent searches code by semantic meaning
Notes and plans live in chat history Versioned artifact storage with history and rollback
Tasks tracked in external tools Native task backlog with semantic search
Build context assembled manually Declarative build manifests assemble context automatically

Use Case: Multi-Agent Handoff

Marrow acts as the single source of truth for heterogeneous agent workflows. You can use Claude for heavy architectural lifting, let it save state into Marrow, and then spin up a cheaper local model to write unit tests. The second agent immediately aligns itself using get_session_context and semantic task backlogs.


Architecture

Marrow is composed of three packages:

marrow_server/     — MCP + REST API server (the main service)
marrow_worker/     — Background file watcher and skeleton indexer
marrow_common/     — Shared schema (skeleton_schema.py)

marrow_server

A FastAPI + FastMCP application that exposes 22 structured MCP tools and a REST API for the worker. Storage uses LanceDB for vector embeddings and metadata, and Markdown blobs for task and artifact content. Transport is Streamable HTTP MCP (protocol version 2025-03-26).

marrow_worker

A standalone background daemon that:

  1. Watches source files using filesystem events
  2. Debounces rapid changes
  3. Parses modified files with tree-sitter grammars (multi-language)
  4. Extracts structural skeletons: classes, methods, namespaces, properties
  5. Generates vector embeddings via a lazy-loaded encoder
  6. Delivers skeleton chunks to marrow_server via a resilient batched outbox with retry logic

marrow_common

Shared Pydantic schema (SkeletonChunk, SCHEMA_VERSION) used as the data contract between worker and server.


MCP Tool Reference

All tools are available to any MCP-compatible client (Claude, Cursor, custom agents, etc.).

🗒️ Task Tools

Tool Description
add_tasks Adds a list of tasks to the project backlog
search_tasks Semantic search over tasks
get_task_details Returns full task details by ID
update_task Updates task fields (status, priority, etc.)
complete_tasks Atomically closes tasks and auto-unblocks dependents

📄 Artifact Tools

Tool Description
read_project_artifacts Reads one or more markdown artifacts
save_project_artifacts Creates or updates artifacts (patch, replace, append)
list_project_artifacts Lists files in artifact storage
move_project_artifact Moves or renames an artifact
delete_project_artifact Safely deletes an artifact
search_project_artifacts Global semantic search across all artifacts
get_project_artifact_outline Extracts table of contents from a markdown file
list_artifact_history Lists version history for an artifact
restore_project_artifact Restores a previous artifact version

🧠 Code Intelligence Tools

Tool Description
search_code_skeletons Semantic search over indexed source code skeletons
get_file_skeleton Retrieves a token-optimized structural outline of a file
view_file_source Reads a precise line range from the live source repository
get_project_map Returns a live directory tree of all indexed files

📁 Session & Project Tools

Tool Description
list_projects Returns a list of all available projects
get_session_context Reads session state and returns phase-appropriate guidelines for the active agent role
get_guideline Assembles and returns the full context bundle (guidelines + ADRs) for any named agent role — use for mid-session role switches without disturbing pipeline state

🛠️ Build Tools

Tool Description
run_project_build Executes a YAML build manifest to assemble context payloads

Requirements

  • Python 3.12+
  • LanceDB (installed via pip)
  • tree-sitter with language wheels (see ADR-0022)
  • A sentence-transformer compatible embedding model

Quickstart

Option A — Docker (recommended)

Prerequisites: Docker and Docker Compose

1. Clone and configure

git clone https://github.com/desikai-lab/Marrow.git
cd Marrow
cp marrow_server/.env.example marrow_server/.env

Open marrow_server/.env and set the two required values:

SECRET_TOKEN=your-strong-random-secret
TASKS_DIR=/data                          # leave as-is for Docker; data persists in a named volume

2. Start both services

docker compose up

Marrow server starts on port 8000. The worker starts automatically once the server is healthy (allow ~20 seconds on first run for the embedding model to download).

MCP endpoint: http://localhost:8000/mcp

3. Initialize your first project

docker exec marrow-server python src/cli/admin_cli.py project-init --project MyProject

This creates a structured workspace at /data/MyProject/. Open MyProject/spec.md and fill in your tech stack before your first agent session.

4. Connect your agent

Add Marrow to your MCP client configuration:

{
  "mcpServers": {
    "marrow": {
      "url": "http://localhost:8000/mcp",
      "headers": {
        "Authorization": "Bearer your-strong-random-secret"
      }
    }
  }
}

For Cursor: add the same block under mcp in your ~/.cursor/mcp.json.


Option B — Manual / Development Setup

<details> <summary>Expand for manual setup instructions</summary>

1. Clone the repository

git clone https://github.com/desikai-lab/Marrow.git
cd Marrow

2. Set up marrow_server

cd marrow_server
python -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate
pip install -e .

Copy and configure the environment file:

cp .env.example .env
# Edit .env — set SECRET_TOKEN and TASKS_DIR (required)

Start the server:

python src/marrow_server.py

The MCP server will be available at http://localhost:8000/mcp by default.

2b. Initialize your first project

python src/cli/admin_cli.py project-init --project MyProject

This copies the built-in project template into your TASKS_DIR/MyProject/ workspace. Open MyProject/spec.md and fill in your tech stack before your first agent session.

3. Set up marrow_worker

In a separate terminal:

cd marrow_worker
pip install -e .
cp .env.example .env
# Edit .env — set SECRET_TOKEN (must match server) and WATCH_ROOT
python main.py

4. Connect your agent

Add Marrow to your MCP client configuration:

{
  "mcpServers": {
    "marrow": {
      "url": "http://localhost:8000/mcp",
      "headers": {
        "Authorization": "Bearer your-strong-random-secret"
      }
    }
  }
}

For Cursor: add the same block under mcp in your ~/.cursor/mcp.json.

</details>


Configuration

Both services are configured via environment variables (.env files).

marrow_server

Variable Description Default
SECRET_TOKEN Bearer token for MCP and REST API authentication Required
TASKS_DIR Absolute path where project workspaces are stored Required
EMBEDDING_MODEL_CODE Sentence-transformer model for code skeleton embeddings BAAI/bge-small-en-v1.5
EMBEDDING_MODEL_TEXT Sentence-transformer model for text/artifact embeddings sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
EMBEDDING_DIMENSIONS Embedding vector dimensions — must match the chosen model 384
MAX_EMBED_CHARS Maximum characters to embed per chunk 2000
VECT_DEBOUNCE_SECONDS Debounce delay before vectorizing a changed file 0.5
PORT HTTP server port 8000

marrow_worker

Variable Description Default
WATCH_ROOT Root path to watch (all subdirectories are scanned) Required
SECRET_TOKEN Must match the token set in marrow_server Required
SERVER_URL URL of the running marrow_server http://localhost:8000
DEBOUNCE_SECONDS File change debounce interval in seconds 1.0
BATCH_SIZE Max skeleton chunks per delivery batch 50
EMBEDDING_MODEL_CODE Must match marrow_server embedding model BAAI/bge-small-en-v1.5
EMBEDDING_DIMENSIONS Must match marrow_server embedding dimensions 384

Project Structure (Agent Workspace)

Each project managed by Marrow has a structured workspace in TASKS_DIR:

{project_name}/
├── session.md              # Session state — current focus, pipeline phase
├── spec.md                 # Project specification and architectural constants
├── builds/                 # YAML build manifests
└── docs/
    ├── decisions/adr/        # Architectural Decision Records
    ├── features/
    │   ├── active/           # Features currently in development
    │   └── archive/          # Completed work history
    ├── manuals/              # Operational guidelines and docs
    └── templates/            # Standardization blueprints

Build Engine

Marrow includes a declarative build system for assembling complex context payloads from multiple artifact sources. Define a YAML manifest and run it via MCP:

# builds/my_context.yaml
name: feature_context
version: "1.0.0"
output:
  format: single_file
  filename: "context_{{DATE}}.md"
steps:
  - action: include_artifact
    path: session.md
    mode: full
  - action: include_artifact
    path: docs/decisions/adr/0034-product-name-marrow.md
    mode: section
    section_name: "Decision"

Run via MCP tool run_project_build, or locally:

python run_build.py --project MyProject --build my_context

Roadmap

✅ v1.0.0 — The Foundation

Core MCP server, LanceDB storage, artifact management, task backlog, code skeleton indexing, build engine, session continuity.

🟡 v1.1.0 — The Sandbox & Sync (Active)

Workflow hardening, dynamic reindexing, phase-aware agent guidelines, handoff optimization.

🔴 v2.0.0 — The AI Orchestrator (Planned)

Declarative handoff, context sanitization, branch-aware indexing, diff intelligence, multi-agent orchestration.


Contributing

See CONTRIBUTING.md for development setup, coding standards, and the pull request process.


License

MIT — see LICENSE.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured