engrams
Gives AI assistants persistent, queryable project memory for decisions, patterns, and rules, reducing the need to re-explain context in every prompt.
README
<div align="center">
<img src="https://raw.githubusercontent.com/stevebrownlee/engrams/refs/heads/main/static/engram.sh.png" style="height:150px;" />
Engrams
Persistent Project Memory for AI Assistants
</div>
Engrams is an MCP server that gives AI assistants structured, queryable memory for your projects — decisions, patterns, progress, and team rules — so you stop re-explaining your stack in every prompt.
Installation
Prerequisites
- Python 3.8+
- uv — Install
Add to Your MCP Client
Add the following to your MCP settings file (mcp.json or your IDE's MCP configuration):
{
"mcpServers": {
"engrams": {
"command": "uvx",
"args": [
"--reinstall",
"--from",
"engrams-mcp",
"engrams-mcp",
"--mode",
"stdio",
"--log-level",
"INFO"
]
}
}
}
Works with Roo Code, Cline, Cursor, Windsurf, Claude Code, and any MCP-compatible client.
Workspace detection is automatic — no
--workspace_idflag required. Engrams locates your project root per-call using.git,package.json, and similar indicators.
Add Custom Instructions to Your Project
After installing, run engrams init from your project root to scaffold the strategy file for your AI tool:
engrams init --tool roo # → .roo/rules/engrams_strategy + .roomodes + system prompts
engrams init --tool cline # → .clinerules
engrams init --tool cursor # → .cursorrules
engrams init --tool windsurf # → .windsurfrules
engrams init --tool claude-code # → CLAUDE.md
engrams init --tool generic # → engrams_strategy.md
engrams init --list # See all supported tools
Reducing Token Costs
Every time you start a new AI session, you're re-paying to re-establish project context: your stack, your conventions, your current task. With a large project and frequent sessions, this adds up fast.
Engrams stores that context once in a local SQLite database. Your AI assistant retrieves only what's relevant to the current task — not everything, not nothing.
Before Engrams — you prompt with background context every session:
You: We use FastAPI with SQLAlchemy and PostgreSQL. JWT auth is required on all
routes. Follow the existing error handling pattern. Now add a /medications
endpoint that...
After Engrams — the context is already there:
You: Add a /medications endpoint that...
AI: Retrieving relevant context...
Decision #23: PostgreSQL primary database
Decision #24: SQLAlchemy for all DB operations
Decision #7: JWT auth required on all routes
Pattern #5: Structured error handling
Implementing with SQLAlchemy, JWT middleware, and structured error responses...
The more you log, the less you repeat. Features like codebase bindings and context budgeting ensure that only the relevant context for each task is loaded — keeping prompt sizes (and costs) low even as the knowledge base grows.
Establishing Team Rules
Engrams has a two-layer governance system: team-level rules that apply to everyone, and individual-level rules for personal workflow preferences.
Log a team decision
You: "The team has decided to use PostgreSQL for the primary database because
we need ACID guarantees and complex query support. This is a team-level
architectural decision."
AI: Decision #23 logged:
Use PostgreSQL for primary database
Rationale: ACID guarantees, complex query support, mature ecosystem
Scope: Team Architecture
Tags: database, architecture
Block conflicting proposals automatically
Once a team rule is in place, Engrams enforces it:
You: "I want to use MongoDB for the analytics module"
AI: ⚠ Governance Conflict Detected
Your proposal conflicts with Team Decision #23:
"Standardize on PostgreSQL for all data storage"
Scope: Core Architecture (Team-level, hard block)
Would you like to propose an amendment instead?
Allow individual overrides
Individual developers can still log personal workflow strategies without affecting team rules:
You: "When I'm ready to create a PR, compare my branch with main, generate
a summary of my changes, run autoflake on the API, and fix any critical
lint errors."
AI: Individual strategy #4 created and saved.
See the Governance documentation for scopes, amendment workflows, and compliance checks.
Learn More
The Engrams documentation site covers all features in detail:
- How It Works
- Semantic Search
- Codebase Bindings
- Context Budgeting
- Knowledge Graph
- Project Onboarding
- Knowledge Dashboard
- MCP Tool Reference
- Contributing
License
Apache 2.0 · Forked from GreatScottyMac/context-portal v0.3.13
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.