subconscious-mcp
Local-first semantic memory layer for MCP agents. Recall, remember, forget, stats over stdio. ChromaDB plus sentence-transformers, all on-machine.
README
subconscious-mcp
A local-first, learning, semantic memory layer for any MCP-compatible LLM agent.
The server runs as an MCP stdio process on your machine and exposes four tools ā recall, remember, forget, stats ā that let an agent ask "have I seen this task before?" and, if so, get back the previous answer in milliseconds without re-running the work.
Embeddings come from sentence-transformers/all-MiniLM-L6-v2 (384-dim, runs on CPU). Storage is a persistent local ChromaDB collection. No data leaves your machine.
<!-- mcp-name: io.github.vishaltorc/subconscious-mcp -->
Install
# Once published:
pip install subconscious-mcp
# Local development:
git clone https://github.com/vishaltorc/subconscious-mcp
cd subconscious-mcp
pip install -e ".[dev]"
After install you can run the server from anywhere:
subconscious-mcp --help
The first time a tool is called, the embedding model (~80MB) is downloaded into the local Hugging Face cache. Subsequent starts are fast.
Configure your MCP client
Claude Desktop
Edit your config file:
- macOS:
~/Library/Application Support/Claude/claude_desktop_config.json - Windows:
%APPDATA%\Claude\claude_desktop_config.json
Add an mcpServers entry:
{
"mcpServers": {
"subconscious": {
"command": "subconscious-mcp",
"args": []
}
}
}
If subconscious-mcp isn't on Claude Desktop's PATH, use the absolute path printed by which subconscious-mcp, e.g. "command": "/Users/you/.local/bin/subconscious-mcp".
Then quit and restart Claude Desktop. The new tools appear under the š indicator.
Claude Code
Option A ā register from the CLI (recommended):
claude mcp add subconscious-mcp -- subconscious-mcp
Option B ā edit ~/.claude.json (or your project's .mcp.json) and add:
{
"mcpServers": {
"subconscious-mcp": {
"command": "subconscious-mcp",
"args": [],
"type": "stdio"
}
}
}
Reload the Claude Code session and the four tools become available.
A copy-pasteable file is in examples/claude_desktop_config.json and examples/claude_code_config.json.
Tools
recall(task, threshold=0.85, top_k=1)
Semantic search for a previously remembered task.
| arg | type | default | meaning |
|---|---|---|---|
task |
str | ā | the task description to look up |
threshold |
float | 0.85 | minimum cosine similarity for a hit |
top_k |
int | 1 | how many candidates to consider |
Returns:
{
"hit": true,
"similarity": 0.91,
"answer": "...",
"task_text": "...",
"entry_id": "uuid",
"stored_at": 1731000000.0,
"tags": ["..."]
}
On a miss, hit is false, answer is null, and similarity is the best similarity observed in top_k ā so callers can see how close they came.
remember(task, answer, tags=[], ttl_seconds=null)
Persist a (task, answer) pair. Returns {stored, entry_id, embedding_dim}.
ttl_seconds=null means never expire. Pass an integer to have the entry filtered out of future recalls after that many seconds.
forget(entry_id)
Delete the entry with this id. Returns {"forgotten": true} if it existed, else false.
stats()
Returns {"total_entries", "last_hit_at", "hit_rate_last_100"}. hit_rate_last_100 is a sliding window over the most recent 100 recall calls ā useful to see whether memory is actually paying off.
Configuration
Configuration is resolved in priority order:
- Environment variables (highest)
~/.subconscious-mcp/config.json- Built-in defaults
| key | default | env var |
|---|---|---|
storage_dir |
~/.subconscious-mcp/data |
SUBCONSCIOUS_STORAGE_DIR |
embedding_model |
all-MiniLM-L6-v2 |
SUBCONSCIOUS_EMBEDDING_MODEL |
default_threshold |
0.85 |
SUBCONSCIOUS_DEFAULT_THRESHOLD |
default_ttl_seconds |
null |
ā |
log_level |
INFO |
SUBCONSCIOUS_LOG_LEVEL |
Inspect the resolved config without starting the server:
subconscious-mcp --print-config
Files written on disk
~/.subconscious-mcp/
āāā config.json (optional, user-edited)
āāā data/ ChromaDB collection (sqlite + parquet)
āāā logs/server.log rotating, 2MB x 3 backups
To wipe your memory: rm -rf ~/.subconscious-mcp/data.
Demo session
See examples/demo_session.md for a worked example of an agent calling recall (miss ā remember), then on a later turn calling recall again with a paraphrase and getting a hit.
Architecture
See docs/architecture.md for the layered design (server / tools / memory / config), the rationale behind ChromaDB + cosine similarity, and the TTL strategy.
Troubleshooting
subconscious-mcp: command not found after install
Your shell's PATH doesn't include the install location. Try python -m subconscious_mcp.server --help to confirm the package works, then use the absolute path in your MCP client config.
Claude Desktop says "Server disconnected"
Check ~/.subconscious-mcp/logs/server.log for the traceback. Most common causes:
- The model download failed (offline at first launch) ā re-run with network connectivity.
- The
storage_diris on a read-only volume.
First recall is slow The first invocation lazily loads the sentence-transformer model (~5s on a modest CPU). Subsequent calls reuse the loaded model and respond in milliseconds.
Recall keeps missing on obvious paraphrases
Lower the threshold (recall(task=..., threshold=0.7)) or raise top_k to see candidates. all-MiniLM-L6-v2 is small and fast ā for higher-quality matching set SUBCONSCIOUS_EMBEDDING_MODEL=all-mpnet-base-v2.
Tests fail with a sentence-transformers download error
You're offline or behind a proxy. Set HF_HUB_OFFLINE=1 once you've pre-downloaded the model, or run python -c "from sentence_transformers import SentenceTransformer; SentenceTransformer('all-MiniLM-L6-v2')" once with connectivity.
License
MIT Ā© 2026 Vishal Jayaprakash
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.