agent-engrams-mcp
An MCP server providing durable, shared memory for coding agents, storing engineering knowledge as structured markdown engrams with semantic search via embeddings.
README
Agent Engrams MCP Server
Table of contents
- Introduction
- Architecture
- On-disk layout
- Quick Start
- Tools
- Configuration Reference
- Engram Format
- Transport Modes
- Development
- License
Introduction
What this is. Agent Engrams MCP is a Model Context Protocol server that gives coding agents a durable, shared memory for transferable engineering knowledge. Agents store lessons as structured markdown documents called engrams on your machine; the server indexes them with embeddings so agents can search by meaning, not just filenames. Your IDE (Cursor, VS Code, or any MCP-aware client) talks to the server over stdio; the server reads and writes files locally and calls an OpenAI-compatible embedding API when indexing and searching.
Why it matters. Agent sessions are short and context windows are finite. Without a memory that survives across tasks and projects, every session starts from zero: the same debugging tricks, API quirks, and architectural lessons get rediscovered-or missed-again and again. A shared engram store turns one-off insights into reusable knowledge: your agent (and others using the same store) can recall what already worked before inventing a new dead end.
The learning flywheel. The engram system is designed around a loop that gets stronger the more it is used honestly:
- Recall - Before diving into a non-trivial task, search the store. Prior work may already answer the question.
- Learn - During the task, notice transferable patterns (would this help on a different project?). That is engram-worthy.
- Write - Capture those insights in structured engrams so the next recall is richer.
Each high-quality write makes the next search more useful; that encourages more search, which surfaces more opportunities to learn and write. That self-reinforcing loop is the flywheel effect. It stalls if the store fills with noise (low-quality writes), if agents skip search (duplicate effort), or if everything is written indiscriminately (diluted results). Quality beats quantity.
flowchart LR
Recall["Recall<br/>Search engrams<br/>before you act"]
Learn["Learn<br/>Spot transferable<br/>knowledge"]
Write["Write<br/>Capture durable<br/>engrams"]
Recall --> Learn
Learn --> Write
Write -->|"Richer store →<br/>better recall"| Recall
The flywheel is a habit, not a one-time setup: the MCP server is the machinery that stores, embeds, and retrieves engrams so that loop can run every day.
Architecture
graph LR
IDE["IDE<br/>(Cursor · VS Code)"]
MCP["agent-engrams-mcp<br/>MCP Server"]
FS["Local store<br/>ENGRAMS_DIR"]
EMB["Embedding Model<br/>(OpenAI-compatible API)"]
IDE -- "stdio / JSON-RPC" --> MCP
MCP -- "read / write under<br/>.../docs/*.md" --> FS
MCP -- "POST /v1/embeddings" --> EMB
The IDE spawns the MCP server as a child process. When an agent writes or searches engrams, the server reads and writes markdown under the store's docs/ folder and calls an OpenAI-compatible embedding endpoint to build query vectors and score matches.
On-disk layout
ENGRAMS_DIR (and the dir field in mcp.json) is the store root, not the folder that holds markdown directly. The server creates this layout on startup if it is missing:
| Path | Purpose |
|---|---|
$ENGRAMS_DIR/docs/ |
Engram markdown files (.md) - this is what gets indexed and searched. |
$ENGRAMS_DIR/index.json |
Persisted vector index (embeddings + excerpts + metadata). Same general shape as pi-agent-engrams (dimensions, embeddingModelId, providerFingerprint, entries keyed by absolute .md paths; we also write version: 3 for their tooling). Load succeeds only when the file's top-level fields match the current config: dimensions, embeddingModelId, and providerFingerprint must all match; otherwise the file is skipped and embeddings are rebuilt from docs/. The version field is not used when loading. |
$ENGRAMS_DIR/index/ |
Reserved directory (optional); not where index.json lives. |
If you previously set ENGRAMS_DIR to a path ending in /docs, that still works: the server treats it as a legacy docs-only path (root = parent directory, docs = that path).
Quick Start
Prerequisites
- Node.js 20+
- An OpenAI-compatible embedding API (local or remote). Examples: Ollama, LM Studio, vLLM, OpenAI.
1. Install
npm install -g agent-engrams-mcp
2. Configure your IDE
Pick your editor and paste the JSON block into the indicated file. Adjust the env values to match your embedding provider.
Cursor
Add to ~/.cursor/mcp.json (global) or <project>/.cursor/mcp.json (per-project):
{
"mcpServers": {
"agent-engrams": {
"command": "npx",
"args": ["agent-engrams-mcp", "--stdio"],
"env": {
"ENGRAMS_DIR": "~/.config/agent-engrams-mcp",
"EMBEDDER_DIMENSIONS": "512",
"EMBEDDER_TYPE": "openai",
"EMBEDDER_BASE_URL": "http://localhost:8000/v1",
"EMBEDDER_API_KEY": "your-api-key",
"EMBEDDER_MODEL": "Qwen3-Embedding-0.6B-4bit-DWQ"
}
}
}
}
Visual Studio Code
Add to your VS Code settings (JSON):
{
"mcp.servers": {
"agent-engrams": {
"command": "npx",
"args": ["agent-engrams-mcp", "--stdio"],
"env": {
"ENGRAMS_DIR": "~/.config/agent-engrams-mcp",
"EMBEDDER_DIMENSIONS": "512",
"EMBEDDER_TYPE": "openai",
"EMBEDDER_BASE_URL": "http://localhost:8000/v1",
"EMBEDDER_API_KEY": "your-api-key",
"EMBEDDER_MODEL": "Qwen3-Embedding-0.6B-4bit-DWQ"
}
}
}
}
3. Restart your IDE
After saving the config, restart (or reload MCP servers) so the IDE picks up the new server. You should see agent-engrams listed among your MCP servers.
That's it - your agent now has persistent memory.
Tools
The server exposes three tools to the agent:
| Tool | Description |
|---|---|
| write-engram | Capture a piece of transferable knowledge as a structured markdown file. |
| search-engrams | Semantic search across all engrams by natural-language query, with optional metadata filters (category, tags, scope, durability). |
| reindex | Force a full re-index of every engram on disk. |
Three seed engram resources ship with the server to guide agents on writing and searching effectively.
Configuration Reference
All settings can be provided via environment variables (shown above in the IDE snippets), a JSON config file, or CLI arguments. Environment variables take precedence over the config file.
Environment Variables
| Variable | Description | Default |
|---|---|---|
ENGRAMS_DIR |
Store root directory (docs/ holds markdown; index/ reserved) |
~/.config/agent-engrams-mcp |
EMBEDDER_TYPE |
Provider type: openai, bedrock, ollama |
openai |
EMBEDDER_BASE_URL |
Base URL for the embedding API | - |
EMBEDDER_MODEL |
Embedding model name | Qwen3-Embedding-0.6B-4bit-DWQ |
EMBEDDER_API_KEY |
API key for the embedding provider | - |
EMBEDDER_DIMENSIONS |
Embedding vector dimensions | 512 |
MCP_CONFIG |
Path to a JSON config file (overrides XDG default) | ~/.config/agent-engrams-mcp/mcp.json |
XDG_CONFIG_HOME |
Base config directory | ~/.config |
USE_STDIO |
Force stdio transport | - |
PORT |
HTTP server port (HTTP mode only) | 3000 |
Config File
If you prefer a config file over env vars, create ~/.config/agent-engrams-mcp/mcp.json:
{
"dir": "~/.config/agent-engrams-mcp",
"dimensions": 512,
"provider": {
"type": "openai",
"model": "Qwen3-Embedding-0.6B-4bit-DWQ",
"baseUrl": "http://localhost:11434/v1",
"apiKey": "your-api-key"
},
"minSearchScore": 0.40
}
A starter file is included in the repo:
mkdir -p ~/.config/agent-engrams-mcp
cp mcp.json.example ~/.config/agent-engrams-mcp/mcp.json
Provider Examples
<details> <summary>OpenAI-compatible (default)</summary>
{
"type": "openai",
"model": "text-embedding-3-small",
"baseUrl": "https://api.openai.com/v1",
"apiKey": "sk-..."
}
</details>
<details> <summary>Bedrock</summary>
{
"type": "bedrock",
"profile": "default",
"region": "us-east-1",
"model": "amazon.titan-embed-text-v2:0"
}
</details>
<details> <summary>Ollama</summary>
{
"type": "ollama",
"url": "http://localhost:11434",
"model": "nomic-embed-text"
}
</details>
CLI Arguments
Override any setting when starting the server directly:
npm start -- --dir=/path/to/store-root --dimensions=768
npm start -- --provider='{"type":"openai","model":"text-embedding-3-small","baseUrl":"https://api.openai.com/v1","apiKey":"sk-..."}'
Engram Format
Engrams are markdown files with YAML frontmatter:
---
Category: debugging
Tags: async, testing, jest
Durability: permanent
Scope: universal
Agent: system
Date: 2024-01-01
Source: Task #123
---
# Title of the Engram
## Context
What situation triggered this learning? Include specific details.
## Insight
What was learned? What is the non-obvious part? Be specific.
## Application
**Trigger:** When to apply this knowledge
**Anti-trigger:** When NOT to apply this knowledge
## Supersedes
None
Transport Modes
| Mode | How to run | Use case |
|---|---|---|
| stdio (default for IDEs) | npx agent-engrams-mcp --stdio |
Cursor, VS Code, Claude Desktop |
| HTTP | npm start (port 3000) |
Remote or multi-client setups |
Architecture
The application follows a clean architecture pattern with clear separation of concerns:
Core Components
src/index.ts- Entry point with CLI argument parsing, config loading, and service instantiationsrc/mcp-server-service.ts- MCP server service that encapsulates server initialization and tool registrationsrc/engram-service.ts- Application service layer for engram operationssrc/abstractions.ts- Interface definitions for external dependencies (file system, HTTP fetch)src/embedder.ts- Embedding provider implementations (OpenAI, Bedrock, Ollama)src/index-store.ts- Index management and search logicsrc/config.ts- Configuration loading and managementsrc/frontmatter.ts- Markdown parsing and engram rendering
Dependency Injection
External dependencies are injected via interfaces:
IFileSystem- File system operations (defaults toNodeFileSystem)IHttpFetch- HTTP fetch operations (defaults tocreateHttpFetch())
This enables easy mocking for unit tests.
Testability
The refactored code is designed for easy testing:
- No global state - all state is encapsulated within class instances
- All external dependencies are injectable via interfaces
- Test helpers provide easy setup for test instances
- Tests can run without network access or file system
Development
npm install
npm run build # compile TypeScript
npm run typecheck # type check only
npm run lint # lint
npm run format # check formatting
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.