cogmemai-mcp
Persistent cloud memory for AI coding assistants. 28 MCP tools for semantic search, auto-learning, task tracking, correction patterns, knowledge graphs, and session replay across Claude Code, Cursor, Windsurf, Cline, and any MCP client. Encrypted at rest. Team shared memory with author attribution.
README
<p align="center"> <img src="assets/hero.png" alt="CogmemAi — Why Claude Code Forgets Everything (And How to Fix It)" width="800"> </p>
CogmemAi — Cognitive Memory for Ai Coding Assistants
<p align="center"> <img src="assets/demo.svg" alt="CogmemAi demo — your Ai assistant remembers your project across sessions" width="800"> </p>
Your Ai coding assistant forgets everything between sessions. CogmemAi fixes that.
One command. Your assistant remembers your architecture, patterns, decisions, bugs, and preferences — permanently. Works with Claude Code, Cursor, Windsurf, Cline, Continue, and any MCP-compatible tool. Switch editors, switch models, switch machines — your knowledge stays.
What's New — Intelligence Engine (v3.4)
CogmemAi now gets smarter every time you use it. The Intelligence Engine is a self-improving memory system that learns what matters, connects related knowledge automatically, and synthesizes answers from your entire memory — no manual curation needed.
- Self-improving recall — memories that consistently help you rank higher over time; memories you never use fade naturally. Your recall quality improves automatically with every session
- Auto-linking knowledge graph — related memories are automatically connected when you save them. Your knowledge builds into a web of relationships, not a flat list
- Contradiction detection — when recalled memories conflict with each other, CogmemAi flags the contradiction so you catch stale or outdated information before it causes problems
- Context-aware ranking — tell CogmemAi what you're doing (debugging, planning, reviewing) and it boosts the right types of memories. Debugging? Bug reports and patterns surface first. Planning? Architecture decisions lead
- Query synthesis — ask a question and get one coherent answer synthesized from all your relevant memories, not just a list of matches. Like asking a teammate who's read everything
- Cross-project intelligence — patterns that appear across 3+ projects are automatically promoted to global scope. Your best practices follow you everywhere without manual effort
- Proactive insights — at session start, CogmemAi tells you what you should know before you ask. Stale critical memories, duplicate subjects that need merging, patterns ready for promotion
Also in v3
- Memory health score — 0-100 score with actionable factors
- Session replay — pick up exactly where you left off with automatic session summaries
- Self-tuning memory — importance adjusts based on real usage; stale memories auto-archive
- Auto-ingest README — learn from your README on new projects instantly
- Smart recall — relevant memories surface automatically as you switch topics
- Auto-learning — CogmemAi learns from your sessions automatically
- Task tracking — persistent tasks with status and priority
- Correction learning — teach your assistant to avoid repeated mistakes
- Session reminders — nudges that surface at the start of your next session
- 29 tools — the most complete memory toolkit for Ai coding assistants
Quick Start
npx cogmemai-mcp setup
That's it. The setup wizard verifies your API key, configures Claude Code, installs automatic context recovery, and you're ready. Start Claude Code by typing claude and your memories are ready.
Don't have an API key yet? Get one free at hifriendbot.com/developer.
The Problem
Every time you start a new session, you lose context. You re-explain your tech stack, your architecture decisions, your coding preferences. Built-in memory in tools like Claude Code is a flat file with no search, no structure, and no intelligence.
CogmemAi gives your Ai assistant a real memory system:
- Semantic search — finds relevant memories by meaning, not keywords
- Ai-powered extraction — automatically identifies facts worth remembering from your conversations
- Smart deduplication — detects duplicate and conflicting memories automatically
- Privacy controls — auto-detects API keys, tokens, and secrets before storing
- Document ingestion — feed in READMEs and docs to instantly build project context
- Project scoping — memories tied to specific repos, plus global preferences that follow you everywhere
- Smart context — intelligently ranked for maximum relevance to your current work
- Compaction recovery — survives Claude Code context compaction automatically
- Token-efficient — compact context loading that won't bloat your conversation
- Zero setup — no databases, no Docker, no Python, no vector stores
Why Cloud Memory?
Local memory solutions come with maintenance overhead: database management, version conflicts, storage growth, and setup complexity. CogmemAi runs extraction and search server-side. Your MCP server is a thin HTTP client — zero local databases, zero RAM issues, zero maintenance. All memories are encrypted at rest, so your data is just as secure as a local database — with cloud portability and team features on top.
Your memory follows you everywhere. Memories created in Claude Code are instantly available in Cursor, Windsurf, Cline, and any MCP-compatible tool. Switch between Opus, Sonnet, Haiku, or any model your editor supports — your memories persist regardless. New laptop? New OS? Log in and your full project knowledge is waiting. A local SQLite file dies with your machine. Cloud memory is permanent.
Teams and collaboration. Cloud memory is the only way to share project knowledge across teammates. When one developer saves an architecture decision or documents a bug fix, every team member's Ai assistant knows about it instantly. No syncing, no merge conflicts, no stale local databases. Whether it's two developers or twenty, everyone's assistant has the same up-to-date context. This is impossible with local-only memory solutions.
Compaction Recovery
When your Ai assistant compacts your context, conversation history gets compressed and context is lost. CogmemAi handles this automatically — your context is preserved before compaction and seamlessly restored afterward. No re-explaining, no manual prompting.
The npx cogmemai-mcp setup command configures everything automatically.
Skill
CogmemAi includes a Claude Skill that teaches Claude best practices for memory management — when to save, importance scoring, memory types, and session workflows.
Claude Code:
/skill install https://github.com/hifriendbot/cogmemai-mcp/tree/main/skill/cogmemai-memory
Claude.ai: Upload the skill/cogmemai-memory folder in Settings > Skills.
Claude API: Use the Skills API to attach the skill to your requests.
CLI Commands
npx cogmemai-mcp setup # Interactive setup wizard
npx cogmemai-mcp setup <key> # Setup with API key
npx cogmemai-mcp verify # Test connection and show usage
npx cogmemai-mcp --version # Show installed version
npx cogmemai-mcp help # Show all commands
Manual Setup
If you prefer to configure manually instead of using npx cogmemai-mcp setup:
Option A — Per project (add .mcp.json to your project root):
{
"mcpServers": {
"cogmemai": {
"command": "cogmemai-mcp",
"env": {
"COGMEMAI_API_KEY": "cm_your_api_key_here"
}
}
}
}
Option B — Global (available in every project):
claude mcp add cogmemai cogmemai-mcp -e COGMEMAI_API_KEY=cm_your_api_key_here --scope user
Works With
Claude Code (Recommended)
Automatic setup:
npx cogmemai-mcp setup
Cursor
Add to ~/.cursor/mcp.json:
{
"mcpServers": {
"cogmemai": {
"command": "npx",
"args": ["-y", "cogmemai-mcp"],
"env": { "COGMEMAI_API_KEY": "cm_your_api_key_here" }
}
}
}
Windsurf
Add to ~/.codeium/windsurf/mcp_config.json:
{
"mcpServers": {
"cogmemai": {
"command": "npx",
"args": ["-y", "cogmemai-mcp"],
"env": { "COGMEMAI_API_KEY": "cm_your_api_key_here" }
}
}
}
Cline (VS Code)
Open VS Code Settings > Cline > MCP Servers, add:
{
"cogmemai": {
"command": "npx",
"args": ["-y", "cogmemai-mcp"],
"env": { "COGMEMAI_API_KEY": "cm_your_api_key_here" }
}
}
Continue
Add to ~/.continue/config.yaml:
mcpServers:
- name: cogmemai
command: npx
args: ["-y", "cogmemai-mcp"]
env:
COGMEMAI_API_KEY: cm_your_api_key_here
CogmemUI Cockpit
CogmemUI Cockpit is a free multi-model Ai workspace with built-in CogmemAi memory. Add your CogmemAi API key in Settings > API Keys and your memory is instantly available. CogmemUI also supports connecting any MCP-compatible tool server via Settings > MCP Servers — add endpoints, auto-discover tools, and use them in chat.
Get your free API key at hifriendbot.com/developer.
Tools
CogmemAi provides 29 tools that your Ai assistant uses automatically:
| Tool | Description |
|---|---|
save_memory |
Store a fact explicitly (architecture decision, preference, etc.) |
recall_memories |
Search memories using natural language (semantic search) |
extract_memories |
Ai extracts facts from a conversation exchange automatically |
get_project_context |
Load top memories at session start (with smart ranking, health score, and session replay) |
list_memories |
Browse memories with filters (paginated, with untyped filter) |
update_memory |
Update content, importance, scope, type, category, subject, and tags |
delete_memory |
Permanently delete a memory |
bulk_delete |
Delete up to 100 memories at once |
bulk_update |
Update up to 50 memories at once (content, type, category, tags, etc.) |
get_usage |
Check your usage stats and tier info |
export_memories |
Export all memories as JSON for backup or transfer |
import_memories |
Bulk import memories from a JSON array |
ingest_document |
Feed in a document (README, API docs) to auto-extract memories |
save_session_summary |
Save a summary of what was accomplished in this session |
list_tags |
View all tags in use across your memories |
link_memories |
Connect related memories with named relationships |
get_memory_links |
Explore the knowledge graph around a memory |
get_memory_versions |
View edit history of a memory |
get_analytics |
Memory health dashboard with self-tuning insights (filterable by project) |
promote_memory |
Promote a project memory to global scope |
consolidate_memories |
Merge related memories into comprehensive summaries using Ai |
save_task |
Create a persistent task with status and priority tracking |
get_tasks |
Retrieve tasks for the current project — pick up where you left off |
update_task |
Change task status, priority, or description as you work |
save_correction |
Store a "wrong approach → right approach" pattern to avoid repeated mistakes |
set_reminder |
Set a reminder that surfaces at the start of your next session |
get_stale_memories |
Find memories that may be outdated for review or cleanup |
get_file_changes |
See what files changed since your last session |
feedback_memory |
Signal whether a recalled memory was useful or irrelevant to improve future recall |
SDKs
Build your own integrations with the CogmemAi API:
- JavaScript/TypeScript:
npm install cogmemai-sdk— npm · GitHub - Python:
pip install cogmemai— PyPI · GitHub
Memory Types
Memories are categorized for better organization and retrieval:
- identity — Who you are, your role, team
- preference — Coding style, tool choices, conventions
- architecture — System design, tech stack, file structure
- decision — Why you chose X over Y
- bug — Known issues, fixes, workarounds
- dependency — Version constraints, package notes
- pattern — Reusable patterns, conventions
- context — General project context
- task — Persistent tasks with status and priority tracking
- correction — Wrong approach → right approach patterns
- reminder — Next-session nudges that auto-expire
Scoping
- Project memories — Architecture, decisions, bugs specific to one repo. Auto-detected from your repository.
- Global memories — Your coding preferences, identity, tool choices. Available in every project.
Pricing
| Free | Pro | Team | Enterprise | |
|---|---|---|---|---|
| Price | $0 | $14.99/mo | $39.99/mo | $99.99/mo |
| Memories | 500 | 2,000 | 10,000 | 50,000 |
| Extractions/mo | 500 | 2,000 | 5,000 | 20,000 |
| Projects | 5 | 20 | 50 | 200 |
Start free. Upgrade when you need more. Or pay per operation with USDC on-chain — no credit card required.
Privacy & Security
- Encryption at rest. All memories are encrypted before they touch the database. Even in a data breach, your data is unreadable.
- No source code leaves your machine. We store extracted facts (short sentences), never raw code.
- API keys cryptographically hashed (irreversible) server-side.
- All traffic over HTTPS.
- No model training on your data. Ever.
- Delete everything instantly via dashboard or MCP tool.
- No cross-user data sharing.
Read our full privacy policy.
Environment Variables
| Variable | Required | Description |
|---|---|---|
COGMEMAI_API_KEY |
Yes | Your API key (starts with cm_) |
COGMEMAI_API_URL |
No | Custom API URL (default: hifriendbot.com) |
Support
- Issues: GitHub Issues
- Docs: hifriendbot.com/developer
License
MIT — see LICENSE
Built by HiFriendbot — Better Friends, Better Memories, Better Ai.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.