MCP Servers

lore

Semantic search across Claude Code conversations. Hybrid vector + keyword search, fully local, background indexing.

README

lore

Semantic search across your Claude Code conversations. Find anything you've ever discussed -- across all projects, all sessions, any branch.

Features

Hybrid search (vector + keyword) Combines multilingual-e5-small embeddings with FTS5/BM25 via Reciprocal Rank Fusion. Finds results by meaning and exact terms.
Fully local, zero API keys Everything runs on your machine. ONNX Runtime for embedding, sqlite-vec for storage. No data leaves your device.
Background indexing Index triggers return instantly. Monitor progress while you keep working. Search what's already indexed while the rest catches up.
Project-selective Register only the projects you care about. Add or remove anytime. Unregistering deletes indexed data to keep things clean. Browsing your session inventory also makes it easy to spot stale or unnecessary sessions you may want to clean up.
Conversation-aware chunking Splits by logical turns (user question + full assistant response chain), not arbitrary token windows. Handles tool-use chains, thinking blocks, and multi-step interactions correctly.
100+ languages Korean, Japanese, Chinese, English, and 90+ more. CJK-aware token estimation for accurate chunking.

Quick Start

Add to Claude Code

# No install needed — always runs latest version
claude mcp add -s user lore -- npx getlore

# Or for a single project only
claude mcp add -s project lore -- npx getlore

Add to OpenAI Codex CLI

# No install needed
codex mcp add lore -- npx getlore

<details> <summary>Alternative: global install (faster startup, works offline)</summary>

npm install -g getlore

# Then register with your tool:
claude mcp add -s user lore -- getlore   # Claude Code
codex mcp add lore -- getlore            # Codex CLI

# Manage your install:
getlore --version   # Check installed version
getlore update      # Update to latest

</details>

Usage

Once connected, the AI can use lore's tools directly:

You: "What did we discuss about auth refactoring last week?"

Claude: [calls lore search] Found 3 relevant conversations...
        In your "my-webapp" project on March 15, you decided to...

First time setup:

Browse projects -- lore shows all your Claude Code projects
Register -- pick which ones to index
Index -- runs in background, takes ~15 seconds per project
Search -- ask anything about past conversations

Tools

Tool	Purpose
`manage_projects`	Register/unregister projects for indexing
`index`	Start background indexing. Modes: `incremental` (default), `full` (requires `confirm: true`), `cancel`
`status`	Check indexing progress, ETA, skip reasons, DB health
`search`	Semantic + keyword search across conversations
`get_context`	Expand search results with surrounding conversation
`list_sessions`	Browse indexed sessions by project

full mode requires confirm: true as a safety gate — the AI doesn't know about this parameter, so it has to ask you before triggering a destructive reindex.

Why This Exists

Claude Code stores every conversation as a JSONL transcript in ~/.claude/projects/. After a few weeks, you have hundreds of sessions across dozens of projects -- discussions about architecture decisions, debugging sessions, code reviews, and design explorations.

But there's no way to search through them. You can't ask "what approach did we take for the auth middleware?" or "which project had that database migration discussion?"

Existing tools either require cloud APIs, spawn zombie processes, or treat conversations as generic documents. lore is purpose-built for Claude Code sessions: it understands turn boundaries, tool-use chains, and thinking blocks. It runs entirely locally with zero dependencies beyond Node.js.

How It Works

~/.claude/projects/*/*.jsonl
        |
   JSONL Parser (extracts user/assistant messages, skips noise)
        |
   Turn-pair Chunker (groups by logical conversation turns)
        |
   Transformers.js (multilingual-e5-small, INT8 quantized, 384d)
        |
   sqlite-vec + FTS5 (hybrid vector + keyword storage)
        |
   Reciprocal Rank Fusion (combines both signals for ranking)

Storage: Single SQLite file at ~/.lore/lore.db with WAL mode for concurrent reads.

Config: Project registration stored in ~/.lore/config.json.

<details> <summary>Configuration</summary>

Environment Variables

Variable	Default	Description
`LORE_DIR`	`~/.lore`	Data directory
`LORE_DB`	`~/.lore/lore.db`	Database path
`CLAUDE_PROJECTS_DIR`	`~/.claude/projects`	Claude Code transcripts location

</details>

<details> <summary>Performance</summary>

Measured on Apple Silicon (M-series):

Metric	Value
Search latency	7-15ms
Index speed	~10 sessions/sec
First search (cold model load)	~5s
DB size	~0.1MB per 10 sessions
Model size (downloaded once)	~112MB

</details>

<details> <summary>Troubleshooting</summary>

"No projects registered"

Run manage_projects with action list to see available projects, then add the ones you want.

Stale lock file

If indexing was interrupted, the lock file auto-cleans on next run (PID-based detection).

DB corruption

Delete ~/.lore/lore.db and re-index. Your source data (~/.claude/projects/) is never modified.

</details>

Development

git clone https://github.com/hyunjae-labs/lore.git
cd lore
npm install
npm run build
npm test          # 114 tests

Tech Stack

Model Context Protocol SDK -- stdio transport
@huggingface/transformers -- multilingual-e5-small (INT8)
better-sqlite3 + sqlite-vec -- embedded vector DB
Reciprocal Rank Fusion -- hybrid search ranking

License

MIT

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured