recall-mcp

recall-mcp

Local semantic search MCP server for markdown docs and Granola meeting transcripts that runs entirely on your machine with no API keys, no cloud, and no cost.

Category
Visit Server

README

recall-mcp

TL;DR: Local semantic search MCP server for markdown docs and Granola meeting transcripts. Ask Claude "what did we decide about X?" and it finds relevant content by meaning. Runs entirely on your machine - no API keys, no cloud, no cost.

Give Claude Code long-term memory over your notes and meetings. It searches by meaning, runs locally, and costs nothing.


What this is

A Model Context Protocol server that indexes your markdown files and Granola meeting transcripts into a local vector database. Claude Code can then search across everything by meaning, not just keywords.

You ask something like "what did we decide about the onboarding flow?" and it surfaces relevant chunks from your docs and meeting recordings, even if those exact words never appeared.

How it works:

Your markdown files + Granola meetings
           |
           v
  Ollama (nomic-embed-text)    <- local embedding model, no API key
           |
           v
       ChromaDB                <- local vector database on localhost:8000
           |
           v
    5 MCP tools                <- Claude Code calls these during conversations

Auto-indexing runs every 15 minutes in the background via a macOS LaunchAgent.

Example

You ask Claude Code: "what did we decide about the onboarding flow?"

recall-mcp searches your docs and meeting transcripts by meaning and returns the relevant chunks:

Source: meetings/product-sync-2026-03-14.md (similarity: 0.82)
"We agreed to cut the walkthrough video and ship a checklist instead.
Jamie owns the copy, targeting next sprint."

Source: docs/onboarding/decisions.md (similarity: 0.79)
"Checklist approach approved — three steps max, no modal,
inline in the dashboard."

Claude uses these results to answer your question with the actual context from your notes and meetings.

Works with

recall-mcp uses the standard MCP stdio transport, so it works with any MCP-compatible client:

Client Config file
Claude Code (CLI) ~/.claude.json
Claude Desktop ~/Library/Application Support/Claude/claude_desktop_config.json
Cursor Cursor settings > MCP
Windsurf ~/.codeium/windsurf/mcp_config.json

The MCP config block is the same across all of them. This README uses Claude Code as the example, but the server doesn't care which client connects to it.

Note: Setup requires comfort with the terminal, or the ability to point your AI tool at this repo and ask it to set things up for you. For example, you can paste this README into Claude Code or Claude Desktop and ask it to clone the repo and run the setup script on your behalf.


Prerequisites

  • macOS (LaunchAgent setup is macOS-specific; the core server works anywhere)
  • Node.js 18+ - brew install node
  • Ollama - ollama.com or brew install ollama
  • uv (for running ChromaDB) - brew install uv
  • Granola desktop app - granola.ai - only needed if you want meeting transcript search

Setup

1. Clone and build

git clone https://github.com/thetenzinwoser/recall-mcp.git
cd recall-mcp
npm install
npm run build

2. Install the embedding model

brew services start ollama
ollama pull nomic-embed-text

First pull is ~274MB. After that it's cached.

3. Run setup

./scripts/setup.sh

This will:

  • Ask where your markdown docs live (DOCS_PATH)
  • Write a .env file
  • Generate LaunchAgent plists from templates and load them
  • Start ChromaDB and schedule auto-indexing
  • Run an initial index of your docs and Granola transcripts
  • Print the MCP config block to paste into Claude Code

4. Add to your MCP client

Add this to your client's MCP config (see the "Works with" table above for file locations):

"recall": {
  "type": "stdio",
  "command": "node",
  "args": ["/absolute/path/to/recall-mcp/dist/server.js"]
}

Use the exact path printed at the end of setup.sh. Then restart your client. In Claude Code, run /mcp to confirm it connected.


Tools

Tool What it does
semantic_search Search docs and transcripts by meaning. Accepts query, optional limit (1-20), optional sourceTypes filter.
get_transcript Fetch a full meeting transcript. Accepts meetingId, searchTitle (with optional date), or listRecent.
reindex_docs Re-scan your docs folder. Incremental - only processes changed files.
index_granola_transcripts Pull and index new Granola meetings. Accepts limit and clearExisting.
index_status Show chunk counts by source type.

get_transcript details

  • Title search with date: { searchTitle: "team sync March 14" } - extracts the date and fuzzy-matches the title
  • Explicit date: { searchTitle: "team sync", date: "2026-03-14" } - when multiple meetings share a title
  • Partial ID: First 8 characters of a meeting UUID is enough
  • List recent: { listRecent: 20 } to browse recent meetings with IDs

Configuration

All settings have sensible defaults. The only one most people need is DOCS_PATH.

Copy .env.example to .env and edit:

cp .env.example .env
Variable Default Purpose
DOCS_PATH ~/docs Folder to scan for markdown files (recursive)
OLLAMA_URL http://localhost:11434 Ollama endpoint
EMBEDDING_MODEL nomic-embed-text Model to use for embeddings

ChromaDB runs on localhost:8000 by default. This is not configurable without editing the LaunchAgent plist directly.


Granola integration

recall-mcp reads your Granola auth token from the Granola desktop app's local data. No separate API key or login needed - just have Granola installed and signed in.

Auth token location: ~/Library/Application Support/Granola/supabase.json

If you don't use Granola, the doc indexing still works fine. Transcript-related tools will return errors, which Claude handles gracefully.


Manual commands

Reindex now (instead of waiting for the 15-min interval):

cd /path/to/recall-mcp && npx tsx src/scripts/auto-index.ts

Check ChromaDB is running:

curl -s http://localhost:8000/api/v2/heartbeat

Check what's indexed:

# Use the index_status MCP tool, or check logs:
tail -f /path/to/recall-mcp/logs/auto-index.log

Rebuild after pulling updates:

npm run build

Restart ChromaDB:

launchctl unload ~/Library/LaunchAgents/com.recall.chromadb.plist
launchctl load ~/Library/LaunchAgents/com.recall.chromadb.plist

Troubleshooting

semantic_search returns nothing / MCP not connecting

  • Run /mcp in Claude Code to check server status
  • Make sure dist/server.js exists (npm run build)
  • Verify the path in ~/.claude.json is absolute and correct

ChromaDB not running

curl -s http://localhost:8000/api/v2/heartbeat
# Should return: {"nanosecond heartbeat": ...}

If it fails, check logs/chromadb-stderr.log or reload the LaunchAgent.

Ollama errors

ollama list
# nomic-embed-text should appear

If not: ollama pull nomic-embed-text

Granola auth fails Make sure the Granola desktop app is installed and you're logged in. The token file should exist at ~/Library/Application Support/Granola/supabase.json.

Results feel stale Force a reindex: npx tsx src/scripts/auto-index.ts. The auto-indexer only picks up file changes and new meetings - if you renamed files or restructured folders, a manual run helps.


Stack

Component Version Purpose
@modelcontextprotocol/sdk ^1.27.0 MCP server framework
chromadb ^1.10.0 Vector database client
ollama (nomic-embed-text) - Local embeddings, 768 dimensions
glob ^10.0.0 File scanning
zod ^3.23.8 Config validation

License

MIT

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured