io.github.onetrueclaude-creator/mcp-knowledge-gaps
Scans markdown vaults to identify concepts mentioned but not defined, ranks gaps by priority, and generates research questions or random long-tail topics to fill knowledge gaps.
README
<!-- mcp-name: io.github.onetrueclaude-creator/mcp-knowledge-gaps -->
mcp-knowledge-gaps
Find what your knowledge base mentions but doesn't actually explain.
Find concepts mentioned but never defined in your markdown knowledge base (Obsidian vault, Logseq graph, any folder of .md files). Uses fuzzy canonicalization to avoid false positives, ranks gaps by frequency × region-diversity × novelty, generates prioritized research questions, and samples from the long tail via sortition to break confirmation bias in your research queue.
Install
pip install mcp-knowledge-gaps
# or
uvx mcp-knowledge-gaps
Usage
Claude Code
claude mcp add mcp-knowledge-gaps -- mcp-knowledge-gaps
Claude Desktop
Add to claude_desktop_config.json:
{
"mcpServers": {
"knowledge_gaps": {
"command": "uvx",
"args": ["mcp-knowledge-gaps"]
}
}
}
MCP Tools
| Tool | Tier | Description |
|---|---|---|
find_gaps |
Free | Scan a markdown vault and return concepts mentioned in multiple notes but without their own dedicated note. Applies fuzzy canonicalization and noise filtering. |
list_gaps_by_priority |
Free | Return gaps ranked by priority: frequency × diversity × novelty (higher = fill this gap first). |
generate_research_questions |
Pro | Generate prioritized research questions for the top N gaps. Each question comes with a priority score and factor breakdown. |
surprise_research_topic |
Pro | Sortition sampling — pick a random gap from the LOW-priority long tail. Breaks confirmation bias by surfacing topics you'd never pick yourself. |
export_review_queue |
Pro | Export a CSV of top-priority gap concepts, suitable for Anki or other spaced-repetition tools. Writes to output_csv and returns the row count. |
Pro tier
Unlocks research question generation with RL-weighted ranking, sortition sampling of long-tail gaps, and CSV review queue export.
License activation — any one of these works:
# 1. Environment variable
export KNOWLEDGE_GAPS_LICENSE="eyJhbGc..."
# 2. CLI flag
mcp-knowledge-gaps --license-key "eyJhbGc..."
# 3. Config file
echo "eyJhbGc..." > ~/.mcp-knowledge-gaps/license.jwt
Licenses are verified fully offline — no phone-home, no activation server. Get a license at https://github.com/onetrueclaude-creator/mcp-knowledge-gaps#pro-tier.
Requirements
- Python 3.10+
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.