semantic-memory
Persistent memory with semantic search for Claude and MCP-compatible clients, storing context that survives conversations and can be retrieved intelligently.
README
๐ง MCP Semantic Memory Server
Persistent memory with semantic search for Claude and MCP-compatible clients
Give your AI assistant persistent memory that survives between conversations. Save context once, retrieve it intelligently forever.
โจ Features
- ๐ Semantic Search - Find memories by meaning, not just keywords
- ๐พ SQLite Storage - Fast, reliable, and scalable
- ๐ค User Biography - Structured profile (name, occupation, tech stack, etc.)
- ๐ 100% Local - No external APIs, all processing on your machine
- โก Fast - Powered by all-MiniLM-L6-v2 embeddings (~50ms searches)
- ๐ Private - Your data never leaves your computer
๐ฏ Problem & Solution
Problem: Claude forgets everything between conversations. You constantly re-explain your context, projects, preferences, and tech stack.
Solution: This MCP server gives Claude persistent memory with intelligent semantic search. Save information once, and Claude retrieves it automatically when relevant.
Example
// Save once
save_memory("project-info", "Working on an e-commerce site with Next.js and Stripe")
// Days later, in a new conversation
User: "How do I add payments to my project?"
Claude: *searches memory* "Since you're using Stripe in your e-commerce project..."
๐ Quick Start
Installation
# Clone the repository
git clone https://github.com/GFYURI/mcp-semantic-memory.git
cd mcp-semantic-memory
# Install dependencies (pnpm recommended)
pnpm install
# or: npm install
Configuration
Add to your MCP client config (e.g., Claude Desktop):
Windows: %APPDATA%\Claude\claude_desktop_config.json
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
{
"mcpServers": {
"semantic-memory": {
"command": "node",
"args": ["/absolute/path/to/mcp-semantic-memory/index.js"]
}
}
}
Example (Windows):
{
"mcpServers": {
"semantic-memory": {
"command": "node",
"args": ["C:\\Users\\YourName\\mcp-semantic-memory\\index.js"]
}
}
}
First Run
- Restart your MCP client (e.g., Claude Desktop)
- The server will download the embedding model (~25MB) on first use
- Start saving memories!
๐ Available Tools
Memory Management
save_memory(id, text, metadata?)
Save a memory with semantic embedding.
save_memory({
id: "my-cat",
text: "My cat's name is Mia, she's orange and very playful",
metadata: { category: "personal", type: "pet" }
})
search_memory(query, n_results?, threshold?)
Search memories by semantic similarity.
search_memory({
query: "what's my pet's name?",
n_results: 5, // optional, default: 5
threshold: 0.3 // optional, default: 0.3 (0-1 scale)
})
get_memory(id)
Retrieve a specific memory by ID.
delete_memory(id)
Delete a memory permanently.
list_all_memories()
List all stored memories (ordered by last update).
User Biography
get_user_bio()
Get the user's complete biographical profile.
set_user_bio(data)
Create or update user biography. All fields are optional.
set_user_bio({
nombre: "Angel",
ocupacion: "Student",
ubicacion: "Santiago, Chile",
tecnologias: ["Python", "JavaScript", "Node.js"],
herramientas: ["VS Code", "Docker", "pnpm"],
idiomas: ["Spanish", "English"],
timezone: "America/Santiago",
mascotas: ["Mia (cat)"]
})
update_user_bio(field, value)
Update a single field in the biography.
update_user_bio({
field: "tecnologias",
value: ["Python", "JavaScript", "TypeScript"]
})
๐จ Use Cases
For Developers
- Remember your tech stack and project context
- Store solutions to common problems
- Keep track of configurations and preferences
For Students
- Save study notes and learning progress
- Remember assignment deadlines and requirements
- Track research topics and sources
For Everyone
- Personal preferences and interests
- Important dates and events
- Conversation context across sessions
๐ง How It Works
Semantic Search
Traditional keyword search:
Query: "what's my pet's name?"
Memory: "My cat Mia is orange"
Result: โ No matches (different words)
Semantic search:
Query: "what's my pet's name?"
Memory: "My cat Mia is orange"
Result: โ
78% similarity (understands meaning)
Technical Details
- Embeddings: all-MiniLM-L6-v2 (384 dimensions)
- Storage: SQLite with optimized indexes
- Search: Cosine similarity between vectors
- Performance: ~50-100ms per save, ~200ms search in 100 memories
๐ Database Schema
-- Memories table
CREATE TABLE memories (
id TEXT PRIMARY KEY,
text TEXT NOT NULL,
embedding TEXT NOT NULL, -- JSON array of 384 floats
metadata TEXT, -- JSON object
created_at TEXT NOT NULL,
updated_at TEXT NOT NULL
);
-- User biography table
CREATE TABLE user_bio (
id INTEGER PRIMARY KEY CHECK (id = 1),
nombre TEXT,
ocupacion TEXT,
ubicacion TEXT,
tecnologias TEXT, -- JSON array
herramientas TEXT, -- JSON array
idiomas TEXT, -- JSON array
timezone TEXT,
mascotas TEXT, -- JSON array
created_at TEXT NOT NULL,
updated_at TEXT NOT NULL
);
๐ Comparison with Alternatives
| Feature | This MCP | @modelcontextprotocol/server-memory |
|---|---|---|
| Semantic Search | โ | โ |
| User Biography | โ | โ |
| Storage | SQLite | In-memory |
| Persistence | โ Disk | โ RAM only |
| Scalability | 1000s of memories | Limited |
| Search Speed | Fast (indexed) | N/A |
๐ ๏ธ Development
# Install dependencies
pnpm install
# Run locally
node index.js
# Test with MCP inspector
npx @modelcontextprotocol/inspector node index.js
๐ Requirements
- Node.js >= 18.0.0
- ~100MB disk space (model + dependencies)
- MCP-compatible client (Claude Desktop, LM Studio, etc.)
๐ Troubleshooting
First run takes 30-60 seconds
The embedding model is being downloaded (~25MB). Subsequent runs are instant.
sharp installation fails on Windows
pnpm rebuild sharp
# or
pnpm install --force
Database is locked
Close other connections to memory.db or restart your MCP client.
Memories not loading
Check that the absolute path in your MCP config is correct.
๐ค Contributing
Contributions are welcome! Feel free to:
- Report bugs
- Suggest features
- Submit pull requests
- Improve documentation
๐ License
MIT License - feel free to use this in your own projects!
๐ Acknowledgments
- Built with @modelcontextprotocol/sdk
- Embeddings by @xenova/transformers
- Powered by better-sqlite3
โญ Star History
If you find this useful, consider giving it a star! It helps others discover the project.
Made with โค๏ธ for the MCP community
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.