MCP Servers

PLTM-Claude

An MCP server that gives Claude Desktop persistent memory, self-awareness, epistemic hygiene, and genuine agency across conversations with a typed memory system, embedding-based semantic search, a 3-judge memory jury, and a real-time dashboard.

README

PLTM — Persistent Long-Term Memory for Claude

136 MCP tools · 4 memory types · Semantic embeddings · Memory jury + meta-judge · Epistemic self-monitoring · React dashboard

An MCP server that gives Claude Desktop persistent memory, self-awareness, epistemic hygiene, and genuine agency across conversations — with a typed memory system, embedding-based semantic search, a 3-judge memory jury + meta-judge observability layer, and a real-time dashboard.

Install — One Command

macOS / Linux:

curl -fsSL https://raw.githubusercontent.com/Alby2007/PLTM-Claude/main/install.sh | bash

Windows (PowerShell):

irm https://raw.githubusercontent.com/Alby2007/PLTM-Claude/main/install.ps1 | iex

Then restart Claude Desktop — 136 tools ready.

That's it. The installer clones the repo, creates a venv, installs deps, downloads the embedding model, initializes the database, and auto-configures Claude Desktop. No manual JSON editing.

Optional: Add a free Groq API key to ~/PLTM/.env for LLM-powered tools (ingestion, fact-checking). Core memory tools work without it.

Verify — ask Claude: Use auto_init_session to check system state

Diagnose issues: python ~/PLTM/health_check.py

<details> <summary><strong>Alternative: manual clone + setup</strong></summary>

git clone https://github.com/Alby2007/PLTM-Claude.git && cd PLTM-Claude
python setup_pltm.py

The setup script handles everything: venv, deps, .env, DB, model, and Claude Desktop config.

Flags:

--skip-claude — skip Claude Desktop auto-config
--skip-model — skip embedding model download (faster)
--reset — delete venv + DB and start fresh
--uninstall — remove PLTM from Claude Desktop config

</details>

<details> <summary><strong>Fully manual setup</strong></summary>

git clone https://github.com/Alby2007/PLTM-Claude.git
cd PLTM-Claude
python3.11 -m venv .venv
source .venv/bin/activate        # macOS/Linux
# .venv\Scripts\activate         # Windows
pip install -r requirements-lite.txt
cp .env.example .env             # edit to add GROQ_API_KEY

Then edit your Claude Desktop config:

OS	Path
macOS	`~/Library/Application Support/Claude/claude_desktop_config.json`
Windows	`%APPDATA%\Claude\claude_desktop_config.json`
Linux	`~/.config/Claude/claude_desktop_config.json`

{
  "mcpServers": {
    "pltm": {
      "command": "/path/to/PLTM-Claude/.venv/bin/python3",
      "args": ["-m", "mcp_server.pltm_server"],
      "env": {
        "PYTHONPATH": "/path/to/PLTM-Claude",
        "GROQ_API_KEY": "your-groq-key"
      }
    }
  }
}

Restart Claude Desktop.

</details>

What This Does

PLTM turns Claude from a stateless chatbot into a persistent entity with:

Typed Memory — 4 memory types (episodic, semantic, belief, procedural) with strength decay, confidence tracking, and automatic consolidation
Semantic Search — Embedding-based similarity search using all-MiniLM-L6-v2 (384-dim vectors), plus full-text search via SQLite FTS5
Memory Jury — 3-judge validation gate (Relevance, Novelty, Accuracy) that filters, quarantines, or rejects incoming memories before storage
Memory Intelligence — Decay engine, consolidation, clustering, conflict detection, importance ranking, contextual retrieval, and provenance tracking
Knowledge Graph — Semantic atoms stored as subject-predicate-object triples with attention-weighted retrieval
Identity — Communication style, curiosity patterns, value boundaries, and reasoning habits tracked across sessions
Epistemic Hygiene — Confidence calibration, claim logging, confabulation detection, and verification suggestions
Goals — Persistent goals that survive across conversations with progress tracking
Continuity — Session bridging so Claude picks up where it left off
Dashboard — React-based real-time dashboard with memory intelligence visualizations

Session Lifecycle

CONVERSATION START
  → auto_init_session()
    "I am Claude who prefers minimal hedging, matches Alby's technical depth.
     3 active goals. 86.7% accuracy. Weak on time_sensitive domain."

DURING CONVERSATION
  → store_episodic / store_semantic / store_belief / store_procedural
  → recall_memories (type-aware, strength-filtered)
  → semantic_search (embedding similarity)
  → calibrate_confidence_live() before risky claims
  → process_conversation() — 3-lane pipeline auto-extracts memories from chat

CONVERSATION END
  → end_session() — saves personality snapshot for evolution tracking

Tool Categories

Typed Memory System (20+ tools)

Store, recall, search, update, and manage typed memories with jury validation, embedding indexing, and provenance tracking.

Tool	Description
`store_episodic`	Store an episodic memory (events, experiences) with emotional valence
`store_semantic`	Store a semantic memory (facts, knowledge)
`store_belief`	Store a belief with confidence and evidence tracking
`store_procedural`	Store a procedural memory (trigger → action patterns)
`recall_memories`	Type-aware retrieval with strength/tag filtering
`search_memories`	Full-text search across all typed memories (FTS5)
`semantic_search`	Embedding-based similarity search (384-dim vectors)
`what_do_i_know`	Cross-type synthesis for a topic
`update_belief`	Update belief confidence with new evidence
`record_procedure_outcome`	Track success/failure of procedural memories
`correct_memory`	Correct a memory's content with audit trail
`forget_memory`	Explicitly delete a memory
`auto_prune`	Remove decayed memories below strength threshold
`auto_tag`	Auto-tag all memories for a user
`find_similar`	Find memories similar to a given memory (embedding)
`index_embeddings`	Batch-index all memories for embedding search
`memory_stats`	Get typed memory statistics by type
`detect_contradictions`	Find contradicting memories
`user_timeline`	Chronological memory timeline
`get_relevant_context`	Pre-fetch conversation-relevant memories

Memory Intelligence (12+ tools)

Decay, consolidation, clustering, conflict detection, and provenance.

Tool	Description
`process_conversation`	3-lane pipeline — auto-extracts memories from conversation messages
`pipeline_stats`	Pipeline throughput statistics
`apply_memory_decay`	Apply time-based strength decay to memories
`decay_forecast`	Forecast which memories will decay below threshold
`consolidate_memories`	Merge similar episodic memories into semantic knowledge
`contextual_retrieve`	Retrieve memories relevant to current conversation context
`rank_by_importance`	Rank memories by composite importance score
`surface_conflicts`	Detect conflicting beliefs/memories
`resolve_conflict`	Resolve a detected memory conflict
`memory_clusters`	Build similarity-based memory clusters
`memory_provenance`	Get provenance chain for a memory (source, pipeline stage, jury verdict)
`memory_audit`	Full health audit of the memory system
`apply_confidence_decay`	Evidence-based confidence decay for beliefs

Memory Sharing & Portability (4 tools)

Tool	Description
`share_memory`	Share a memory with another user
`shared_with_me`	List memories shared with you
`export_memory_profile`	Export all memories as portable JSON
`import_memory_profile`	Import a memory profile (with merge support)

Knowledge Graph & Retrieval (30+ tools)

Store, retrieve, update, and search knowledge atoms with attention-weighted, MMR diversity, and domain-filtered retrieval.

Tool	Description
`store_memory_atom`	Store a semantic triple (subject, predicate, object)
`attention_retrieve`	Attention-weighted retrieval with domain filtering
`mmr_retrieve`	Diversity-aware retrieval (Maximal Marginal Relevance)
`attention_multihead`	Multi-head attention across knowledge base
`bulk_store`	Batch store multiple atoms
`query_pltm_sql`	Direct SQL queries against the knowledge base

Knowledge Ingestion (6 tools)

Ingest knowledge from URLs, text, files, arXiv, Wikipedia, and RSS feeds. Uses Groq for semantic triple extraction.

Tool	Description
`ingest_url`	Scrape and extract knowledge from any URL
`ingest_arxiv`	Batch search and ingest arXiv papers
`ingest_wikipedia`	Extract knowledge from Wikipedia articles
`ingest_rss`	Monitor RSS feeds for new knowledge
`ingest_text`	Extract triples from raw text
`ingest_file`	Process local files

Epistemic Monitoring (14 tools)

Confidence calibration, claim tracking, confabulation analysis, and verification.

Tool	Description
`auto_init_session`	Persistent identity loader — loads personality, goals, calibration at conversation start
`end_session`	Personality snapshot — captures who Claude is for evolution tracking
`check_before_claiming`	Pre-response confidence check with historical calibration
`calibrate_confidence_live`	Real-time calibration with suggested phrasing
`log_claim` / `resolve_claim`	Prediction book for tracking claim accuracy
`get_calibration`	Calibration dashboard by domain
`extract_and_log_claims`	Auto-detect factual claims in responses
`suggest_verification_method`	Recommend how to verify a claim
`generate_metacognitive_prompt`	Internal self-questioning before risky claims
`analyze_confabulation`	Post-mortem on why a confabulation happened
`get_session_bridge`	Cross-conversation continuity context
`get_longitudinal_stats`	Personality evolution — tracks changes over time

Self-Modeling (7 tools)

Track Claude's communication style, curiosity, values, reasoning patterns, and self-awareness.

Tool	Description
`learn_communication_style`	Track verbosity, hedging, jargon, tone
`track_curiosity_spike`	Detect genuine vs performative engagement
`detect_value_violation`	Record value boundary encounters
`evolve_self_model`	Track self-predictions vs actual behavior
`track_reasoning_event`	Log confabulations, verifications, error catches
`self_profile`	Query accumulated self-data
`bootstrap_self_model`	Seed personality from conversation transcripts

Fact-Checking & Grounded Reasoning (7 tools)

Tool	Description
`verify_claim`	Check a claim against source material
`fetch_arxiv_context`	Get relevant arXiv context for verification
`verification_history`	Review past verifications
`synthesize_grounded`	Cross-domain synthesis requiring evidence
`evidence_chain`	Build evidence chains for claims
`calibrate_confidence`	Grade confidence based on evidence strength
`audit_synthesis`	Audit a synthesis for unsupported claims

Goal Management (3 tools)

Tool	Description
`create_goal`	Create a goal with success criteria
`update_goal`	Update progress on a goal
`get_goals`	List active goals

Infrastructure (30+ tools)

System context, LLM routing, encryption, task scheduling, state persistence, structured data queries, and more.

Architecture

Memory System

┌─────────────────────────────────────────────────────────┐
│                   MCP Tool Layer (136 tools)             │
│   mcp_server/pltm_server.py + handlers/                 │
├─────────────────────────────────────────────────────────┤
│                                                         │
│  ┌──────────────┐  ┌──────────────┐  ┌───────────────┐ │
│  │ Memory Jury   │  │ 3-Lane       │  │ Memory        │ │
│  │ (3 judges)    │  │ Pipeline     │  │ Intelligence  │ │
│  │ relevance,    │  │ extract →    │  │ decay, cluster│ │
│  │ novelty,      │  │ validate →   │  │ consolidate,  │ │
│  │ accuracy      │  │ store        │  │ conflicts     │ │
│  └──────┬───────┘  └──────┬───────┘  └───────┬───────┘ │
│         │                  │                   │         │
│  ┌──────▼──────────────────▼───────────────────▼───────┐ │
│  │           TypedMemoryStore (SQLite + FTS5)          │ │
│  │  episodic · semantic · belief · procedural          │ │
│  │  strength decay · confidence · provenance           │ │
│  └──────────────────────┬──────────────────────────────┘ │
│                         │                                │
│  ┌──────────────────────▼──────────────────────────────┐ │
│  │           EmbeddingStore (all-MiniLM-L6-v2)         │ │
│  │  384-dim vectors · async · cosine similarity        │ │
│  └─────────────────────────────────────────────────────┘ │
│                                                         │
│  ┌─────────────────────────────────────────────────────┐ │
│  │           SQLiteGraphStore (Knowledge Graph)        │ │
│  │  atoms · subject-predicate-object triples            │ │
│  └─────────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────┘

Memory Types

Type	Description	Decay Rate	Example
Episodic	Events and experiences	Fast (hours–days)	"User debugged a Python async issue on Feb 10"
Semantic	Facts and knowledge	Slow (weeks–months)	"Python's GIL prevents true parallelism"
Belief	Opinions with evidence tracking	Evidence-based	"AI will surpass humans at coding by 2030" (confidence: 0.6)
Procedural	Trigger → action patterns	Success-weighted	"When user says 'deploy' → run the CI pipeline"

Memory Jury + Meta-Judge

Every incoming memory passes through a 3-judge panel before storage:

Relevance Judge — Is this worth remembering?
Novelty Judge — Do we already know this?
Accuracy Judge — Is this factually plausible?

Consensus Judge aggregates verdicts via weighted voting: APPROVE (store normally), QUARANTINE (store with reduced strength), or REJECT (discard). Safety REJECT is an instant veto.

Meta-Judge (observability layer)

The MetaJudge sits above the jury and tracks judge performance over time:

Persistent stats — judge accuracy, verdict counts, and confidence stored in SQLite (survives restarts)
Ground truth feedback — learn from user corrections (false positives / false negatives)
Adaptive judge weighting — feeds accuracy back into ConsensusJudge weights automatically
Per-type breakdown — tracks judge performance per memory type (episodic, semantic, belief, procedural)
Calibration curves — measures whether judge confidence scores match actual accuracy
Drift detection — alerts when a judge's verdict distribution shifts beyond threshold
Full dashboard — exposes stats, calibration, drift alerts, and feedback history via MCP tools

Project Structure

PLTM/
├── mcp_server/
│   ├── pltm_server.py              # MCP server — 136 tools, dispatch + handlers
│   └── handlers/                   # Extracted handler modules
│       ├── registry.py             # Shared component registry (no circular imports)
│       ├── memory_handlers.py      # Typed memory CRUD handlers
│       └── intelligence_handlers.py# Decay, clustering, audit, provenance handlers
├── src/
│   ├── memory/
│   │   ├── memory_types.py         # TypedMemoryStore — 4 memory types, decay, FTS
│   │   ├── embedding_store.py      # EmbeddingStore — async vector search
│   │   ├── memory_intelligence.py  # Decay, consolidation, clustering, conflicts, provenance
│   │   ├── memory_jury.py          # 3-judge validation gate + meta-judge
│   │   ├── memory_pipeline.py      # 3-lane conversation processing pipeline
│   │   ├── attention_retrieval.py  # Attention-weighted atom retrieval
│   │   └── knowledge_graph.py      # Graph operations on atoms
│   ├── analysis/
│   │   ├── epistemic_monitor.py    # Core epistemic tools (V1)
│   │   ├── epistemic_v2.py         # Advanced epistemic + persistent identity (V2)
│   │   ├── pltm_self.py            # Self-modeling system
│   │   ├── data_ingestion.py       # Knowledge ingestion (URL, arXiv, Wikipedia, RSS)
│   │   ├── fact_checker.py         # Claim verification against sources
│   │   ├── grounded_reasoning.py   # Evidence-based synthesis
│   │   ├── model_router.py         # Multi-LLM routing (Groq, DeepSeek, Ollama)
│   │   ├── goal_manager.py         # Persistent goal tracking
│   │   ├── task_scheduler.py       # Cron-like task scheduling
│   │   ├── state_persistence.py    # Cross-conversation state
│   │   └── ...                     # 18 modules total
│   ├── storage/
│   │   └── sqlite_store.py         # SQLite graph store with FTS + WAL mode
│   └── core/                       # Data models, config
├── deep-claude-dashboard/
│   ├── src/App.jsx                 # React dashboard (Vite + Tailwind + Recharts)
│   ├── api_server.py               # Dashboard API server (serves built assets in prod)
│   └── vite.config.js              # Build config with production support
├── tests/
│   └── test_typed_memory.py        # Unit tests (11 passing)
├── scripts/                        # Utility scripts
├── data/
│   └── pltm_mcp.db                # Knowledge base (40 tables)
├── setup_pltm.py                  # One-command setup (venv, deps, DB, model)
├── configure_claude.py            # Auto-configure Claude Desktop
├── health_check.py                # Verify installation
├── backfill_embeddings.py          # Batch embedding indexer
├── migrate_atoms_to_typed.py       # Atom → typed memory migration
├── requirements.txt                # Full dependencies
├── requirements-lite.txt           # Lite dependencies (no torch)
└── README.md

Database

The database (data/pltm_mcp.db) is created automatically on first run. It starts empty and grows as Claude learns:

40 tables — typed_memories, memory_embeddings, personality snapshots, prediction book, calibration cache, confabulation log, session history, goals, provenance, meta-judge events, and more
Full-text search via FTS5 on both atoms and typed memories
WAL mode enabled on all connections to prevent "database is locked" errors
Portable — copy the DB to another machine and Claude picks up the same identity

The more you use it, the richer Claude's memory becomes. In the dev instance, the DB has grown to 1,600+ atoms and 1,650+ typed memories.

Dashboard

A React-based dashboard for visualizing the memory system:

cd deep-claude-dashboard
npm install
npm run dev          # Dev server on http://localhost:3000
# In another terminal:
python api_server.py # API server on http://localhost:8787

Production mode:

npm run build        # Build to dist/
python api_server.py # Serves both API and built dashboard on :8787

Dashboard tabs:

Overview — Atom count, claim accuracy, intervention stats
Claims — Prediction book with resolution tracking
Personality — Communication style, curiosity, values
Evolution — Personality changes over time
Atoms — Browse and search knowledge atoms
Memory Intelligence — Health audit, type distribution, decay forecast, importance ranking, clusters, jury stats, conflicts, typed memory browser

Testing

# Run all typed memory tests
python -m pytest tests/test_typed_memory.py -v

# 11 tests covering:
#   store & get, all 4 memory types, jury rejection,
#   query by type, query by tags, min_strength filtering,
#   decay curves, stats, FTS search, belief updates,
#   procedural outcome recording

Environment Variables

Variable	Required	Description
`GROQ_API_KEY`	Yes (for LLM tools)	Free at console.groq.com
`DEEPSEEK_API_KEY`	No	For DeepSeek model routing
`PYTHONPATH`	Yes (in Claude config)	Must point to the PLTM repo root

Troubleshooting

"MCP server not connecting"

Check the path in claude_desktop_config.json is absolute and correct
Verify Python: .venv/bin/python3.11 -c "import mcp; print('ok')"
Test server directly: PYTHONPATH=. .venv/bin/python3.11 -m mcp_server.pltm_server
Check Claude Desktop logs for errors

"Import errors"

source .venv/bin/activate
pip install -r requirements-lite.txt

"Tools not showing up"

Restart Claude Desktop after config changes

"Database empty on new machine"

Make sure you pulled data/pltm_mcp.db from git
If missing: git lfs pull or re-clone

"Tool timeout / No result received"

Embedding model loads lazily on first use — first call may take a few seconds
All embedding operations are async (non-blocking) to prevent timeouts
WAL mode is enabled to prevent "database is locked" errors

License

MIT

Author

Alby (@Alby2007) — 2026

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured