second-brain-mcp
Turns an Obsidian vault into semantic memory for coding agents, providing read-only semantic search and a human-approved write workflow via MCP.
README
second-brain-mcp
Turn your Obsidian vault into semantic memory for any MCP-capable coding agent.
v1.0.0 — early release. Works end-to-end on macOS and Linux; published on PyPI on 2026-04-15. Windows is untested. Feedback, bug reports, and testing notes are very welcome — please open an issue if anything misbehaves or if the docs are unclear.
What it does. Ships a stdio MCP server with four read-only tools
(obsidian_overview, obsidian_search, obsidian_read, obsidian_backlinks)
plus one prompt (to_obsidian) that drives a human-in-the-loop write
workflow. Works with Claude Code, Cursor, Zed, or any client that speaks MCP.
What it is not. No auto-writes — nothing is written to the vault
without an explicit /to_obsidian invocation. No background extraction,
no reflection or compaction loops running on your behalf. The LLM does
the extraction work when you call /to_obsidian, but every candidate
note requires your explicit per-note approval before it is written.
You control what enters the vault; the server only reads from it.
How this is different from other Obsidian MCP servers. Existing
Obsidian MCP servers (e.g. variants of mcp-obsidian) typically talk
to the running Obsidian app through its Local-REST plugin and expose
file-level tools — list_files, get_file, append_to_note, etc.
second-brain-mcp is offline and plugin-free: it reads the vault as
plain files from disk, builds a local semantic index with bge-m3, and
exposes semantic search rather than path-based CRUD. Obsidian does
not need to be running. The write path is a curated workflow with
per-note human approval, not a raw write_file tool.
30-second quick start
# 1. Install (uvx — no venv pollution)
uvx second-brain-mcp serve --help
# 2. Point at your vault
export OBSIDIAN_VAULT=$HOME/obsidian/vault
# 3. Register with Claude Code
claude mcp add -s user second-brain \
-e OBSIDIAN_VAULT="$OBSIDIAN_VAULT" \
-- uvx second-brain-mcp serve
# 4. Restart Claude Code, then ask: "call obsidian_overview"
First run downloads the bge-m3 embedder (~2.3 GB) on the first tool call — roughly 5 seconds after that. See docs/CUSTOMIZE.md for lighter models, or to point at an OpenAI-compatible embeddings API (Cloud.ru FM API, OpenAI, self-hosted Infinity) instead of the local model.
Vault requirements
Hard minimum: a directory with .md files somewhere (any nesting).
Everything else is optional:
- YAML frontmatter (
type,verified,confidence) enables filtering [[wikilinks]]enable backlinks navigation_index.mdat the vault root is returned whole inobsidian_overview
Start with your existing vault — unused features simply stay inactive until you add the relevant structure.
| Feature | No frontmatter | No wikilinks | No _index.md |
|---|---|---|---|
| semantic search | ✅ | ✅ | ✅ |
type_filter in search |
❌ | ✅ | ✅ |
obsidian_read body |
✅ | ✅ | ✅ |
obsidian_read frontmatter |
empty dict | ✅ | ✅ |
obsidian_read outlinks/backlinks |
✅ | empty | ✅ |
obsidian_backlinks tool |
✅ | empty | ✅ |
obsidian_overview index_md |
✅ | ✅ | placeholder |
Why editorial, not archival
Most agent-memory systems default to the archival model: capture everything — raw conversation turns, every tool call, every message — then rely on semantic search to pull the right thing back later. Comparative retrieval tests against one such system (MemPalace, with its exchange-pair chunking and multi-layer palace) surfaced a clear trade-off: on realistic queries a small set of human-approved notes outperformed a much larger raw conversation archive sitting in the same index.
The reason is simple. In any given session roughly 95% of what's said is working noise — code, syntactic back-and-forth, tactical detail that expires with the task. The 5% that survives — atomic facts, decisions, insights — is what you actually want to find six months later. Archival memory keeps both and leans on the embedder to separate them, and that separation is hard to get right in practice.
second-brain-mcp takes the editorial position: memory is what
you chose to remember. Nothing reaches the vault by accident. The
loop:
- You work a session normally. The agent reads from the vault through the four read-only tools but writes nothing on its own.
- When you're done, you invoke
to_obsidian. The agent walks the session, pulls out candidate facts, frames each as an atomic statement, and checks for duplicates against the existing vault. - It shows you the list. You approve, reject, merge, or rewrite each candidate individually.
- Only the approved notes land in the vault.
The LLM does the extraction work — it is good at abstracting and generalising. You are the editor — you know which of the candidates actually matter. The vault ends up small, dense, and almost entirely signal. That curated remnant is what makes retrieval surface the right thing instead of the loudest thing.
This is why there is no reflection loop, no background extraction, no auto-writes. The approval gate is the whole point.
Documentation
- docs/INSTALL.md — per-client setup (Claude Code, Cursor, Zed, generic)
- docs/ARCHITECTURE.md — components, data flow, principles
- docs/WRITE-WORKFLOW.md —
to_obsidianexplained - docs/SECURITY.md — read-only guarantees, path-traversal
- docs/CUSTOMIZE.md — alternative embedders, env vars
- docs/TROUBLESHOOT.md — common errors and fixes
- ROADMAP.md — v1.1+ planned features
License
MIT. See LICENSE.
Credits
The pattern of embedding a PROTOCOL string in the first read-tool's response
is borrowed from MemPalace's
tool_status. The design was shaped by empirical comparisons with raw
conversation archives — curation beats volume.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.