MCP Memory Server

MCP Memory Server

An MCP server that gives Claude persistent memory by storing conversation context, entities, and enabling semantic search across sessions.

Category
Visit Server

README

๐Ÿง  MCP Memory Server

TypeScript Node.js Azure Cosmos DB OpenAI

Advanced Memory System for Claude Desktop - Transform Claude into an AI assistant with photographic memory using MCP (Model Context Protocol).

โœจ What it does

Imagine Claude with persistent memory that:

  • ๐Ÿง  Remembers everything from your conversations
  • ๐Ÿ” Automatically retrieves context when you reference past topics
  • ๐Ÿค– Understands references like "that project", "this company", "he/she"
  • ๐Ÿ“ˆ Builds knowledge over time across all your sessions
  • ๐Ÿ› ๏ธ Auto-captures results from web searches and other tools

๐Ÿš€ Quick Start

Prerequisites

  • Node.js 18+
  • Azure Cosmos DB account
  • OpenAI API key
  • Claude Desktop

Installation

git clone https://github.com/PlumyCat/mcp-memory-server.git
cd mcp-memory-server
npm install
npm run build

Configuration

  1. Environment setup:
cp .env.example .env
# Edit .env with your API keys
  1. Claude Desktop configuration:
{
  "mcpServers": {
    "memory": {
      "command": "node",
      "args": ["/path/to/mcp-memory-server/dist/index.js"],
      "cwd": "/path/to/mcp-memory-server"
    }
  }
}
  1. Test the magic:
You: "I'm working on a TypeScript project using CosmosDB"
Claude: [Responds normally + automatic background storage]

# Later...
You: "What was that project we discussed?"
Claude: "You mentioned working on a TypeScript project using CosmosDB..."

๐ŸŽฏ Key Features

๐Ÿง  Intelligent Memory Storage

  • Automatic entity extraction (people, companies, projects, tools)
  • Semantic storage with OpenAI embeddings
  • Conversation context preservation
  • Smart deduplication

๐Ÿ” Advanced Search & Retrieval

  • Semantic similarity search
  • Entity relationship mapping
  • Timeline-based retrieval
  • Context-aware responses

๐Ÿค– Entity Resolution

  • Automatic pronoun resolution ("he" โ†’ "John Smith")
  • Reference understanding ("that company" โ†’ "Microsoft")
  • Cross-conversation entity linking
  • Confidence scoring

๐Ÿ“Š Analytics & Insights

  • Conversation pattern analysis
  • Entity interaction timelines
  • Knowledge growth tracking
  • Usage statistics

๐Ÿ› ๏ธ Available Tools

The server provides 6 MCP tools for Claude:

Tool Description Example Usage
memory_store Store information with auto entity extraction Automatically triggered during conversations
memory_search Semantic search through stored memories "Find all discussions about React"
context_inject Get relevant context for current query "What did we discuss about this project?"
entity_resolve Resolve references to actual entities "Who is 'he' referring to?"
conversation_analyze Analyze conversation patterns "Show my discussion statistics"
memory_timeline Get timeline of entity interactions "Timeline of Microsoft mentions"

๐Ÿ“ Project Structure

mcp-memory-server/
โ”œโ”€โ”€ src/
โ”‚   โ”œโ”€โ”€ config/          # Azure Cosmos DB configuration
โ”‚   โ”œโ”€โ”€ memory/          # Core memory system (RAG, storage, graph)
โ”‚   โ”œโ”€โ”€ types/           # TypeScript type definitions
โ”‚   โ”œโ”€โ”€ utils/           # Entity extraction, context injection
โ”‚   โ””โ”€โ”€ server.ts        # Main MCP server implementation
โ”œโ”€โ”€ scripts/             # Maintenance and health check scripts
โ”œโ”€โ”€ tests/               # Unit and integration tests
โ”œโ”€โ”€ docs/                # Technical documentation
โ””โ”€โ”€ dist/                # Compiled JavaScript (generated)

๐Ÿ—๏ธ Architecture

Core Components

  • RAG System: Vector similarity search with OpenAI embeddings
  • Entity Extractor: NLP-based entity recognition with custom patterns
  • Memory Storage: Optimized CosmosDB integration with smart indexing
  • Context Injector: Intelligent context retrieval for conversations
  • Graph Engine: Entity relationship mapping and traversal

Data Flow

graph TD
    A[User Message] --> B[Entity Extraction]
    B --> C[Embedding Generation]
    C --> D[CosmosDB Storage]
    D --> E[Semantic Search]
    E --> F[Context Injection]
    F --> G[Enhanced Claude Response]

๐Ÿ”ง Configuration

Environment Variables

# Azure Cosmos DB
COSMOS_ENDPOINT=https://your-account.documents.azure.com:443/
COSMOS_KEY=your-primary-key
COSMOS_DATABASE_NAME=memory-db
COSMOS_CONTAINER_CONVERSATIONS=conversations
COSMOS_CONTAINER_ENTITIES=entities

# OpenAI
OPENAI_API_KEY=your-openai-api-key

# Optional
NODE_ENV=production
LOG_LEVEL=info
MEMORY_RETENTION_DAYS=30

Advanced Configuration

See Configuration Guide for detailed setup options.

๐Ÿงช Testing

# Run all tests
npm test

# Health check
npm run health-check

# Test memory functionality
npm run test-memory

๐Ÿ“Š Performance

  • Storage: Optimized CosmosDB indexing for sub-100ms queries
  • Search: Vector similarity with 95%+ accuracy
  • Memory: Efficient entity deduplication and compression
  • Scalability: Handles 1000+ entities with consistent performance

๐Ÿ›ฃ๏ธ Roadmap

โœ… Completed

  • Core memory storage and retrieval
  • Entity extraction and resolution
  • Semantic search with embeddings
  • CosmosDB integration
  • MCP server implementation

๐Ÿ”„ In Progress

  • [ ] Intelligent entity deduplication
  • [ ] Auto-capture of all MCP tool results
  • [ ] Enhanced entity classification patterns
  • [ ] Contradiction detection system

๐Ÿ”ฎ Planned

  • [ ] Multi-user memory isolation
  • [ ] Graph traversal with Gremlin queries
  • [ ] Advanced analytics dashboard
  • [ ] Memory compression and archiving

See Roadmap for detailed feature planning.

๐Ÿค Contributing

We welcome contributions! Please see our Contributing Guidelines for details.

Development Setup

git clone https://github.com/PlumyCat/mcp-memory-server.git
cd mcp-memory-server
npm install
npm run dev

๐Ÿ“š Documentation

๐Ÿ†˜ Support

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

๐Ÿ™ Acknowledgments

โญ Star History

Star History Chart


Made with โค๏ธ for the Claude Desktop community

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured