docpack MCP Server
Enables querying and exploring a bundled knowledge base through tools for manifest, table of contents, node retrieval, and full-text search.
README
docpack
Bundle a directory of Markdown files into a portable, queryable knowledge base.
docpack bundle --input ./docs --output ./mykb --home ./docs/toc.md
docpack toc ./mykb "getting-started" --depth 2
docpack search ./mykb "authentication AND OAuth" --limit 5
Single binary. CLI, TypeScript library, and MCP server.
Quick start
# Bundle a directory of Markdown files
docpack bundle --input ./docs --output ./mykb --home ./docs/toc.md
# Explore the knowledge base
docpack manifest ./mykb
docpack toc ./mykb "toc" --depth 2
docpack search ./mykb "keyword" --limit 10
# Package as an agent skill
docpack skill ./mykb --use-when "Use for project docs" --output ./my-skill
# Start an MCP server
docpack serve ./mykb --mcp
Requirements
- Node.js >= 20
- better-sqlite3 -- native module. Prebuilt binaries are downloaded automatically for common platforms.
Usage
From CLI
npx @rlemaigre/docpack manifest ./mykb
Or install
npm install @rlemaigre/docpack
docpack manifest ./mykb
From TypeScript
import { bundle, query, summarize, generateSkill } from "@rlemaigre/docpack";
As an AI Skill
AI agents can install the query skill directly using:
npx skills add rlemaigre/docpack
Output
Bundle command produces two files:
mykb/
docpack.db # SQLite knowledge base
docpack.yaml # human-readable manifest and entry points
Input
The bundler reads files as Markdown text (UTF-8). It recursively walks the input directory, parses ATX headings (# through ######) to build a document hierarchy, and stores everything in SQLite with an FTS5 full-text index.
Conversion from other formats (PDF, DOCX, etc.) is the caller's responsibility — preprocess your files into Markdown before bundling.
Cheat sheet
| Command | Output | Use |
|---|---|---|
bundle --input <dir> --output <dir> --home <file> |
files | Create KB from Markdown files |
manifest <kb> |
YAML | KB metadata (version, home, stats) |
toc <kb> <slug> --depth N |
YAML | Hierarchy with clipped subtree summaries |
get <kb> <slug> |
XML | Document content + full subtree |
search <kb> "query" --limit N --offset O |
YAML | FTS5 search with snippet excerpts |
summarize <kb> --summaries <file> |
n/a | Import summaries from JSONL |
summarize <kb> --mode llm --model <name> --endpoint <url> |
n/a | Generate summaries via LLM fold |
skill <kb> --use-when "<text>" --output <dir> |
files | Package KB as a self-contained agent skill |
serve <kb> --mcp |
stdio | Long-lived MCP server for AI agents |
Architecture
flowchart LR
A[Markdown files/] --> B[bundle]
B --> C[docpack.db]
B --> D[docpack.yaml]
C --> E[query / search]
D --> E
E --> F[CLI / MCP / TS library]
The bundler walks the filesystem, reads each file as Markdown, parses headings into a document tree, and stores everything in SQLite with an FTS5 index. The query side reads from the same database.
Document hierarchy
graph TD
F1[file]
F2[file]
F1 --> S1[section]
F1 --> S2[section]
S1 --> L1[leaf]
S1 --> L2[leaf]
S2 --> L3[leaf]
All ingested files are root documents — directory structure is discarded. Two document types:
- file -- ingested Markdown document, root document, may contain sections
- section -- Markdown heading, child of a file, may contain subsections
Every Document has a slug (globally unique), title, chunk (self content), and children. The same Document shape applies at every level — files, sections, and leaves are all documents. Cross-file navigation uses docpack://slug links rewritten by the bundler.
CLI reference
bundle
docpack bundle --input <path> --output <path> --home <path>
| Option | Required | Description |
|---|---|---|
--input |
yes | Directory of Markdown files to bundle |
--output |
yes | Output directory (creates docpack.db + docpack.yaml) |
--home |
yes | Path to the primary entry file (Markdown TOC) |
--description |
no | Human-readable description of the KB |
--url |
no | Source URL (wiki, website, etc.) |
--exported-at |
no | Date of source data export (ISO 8601) |
Progress to stderr. Stats as JSON to stdout.
manifest
docpack manifest <kb>
Returns YAML with version, aggregate statistics, and metadata (home, description, url, exportedAt). No file enumeration.
toc
docpack toc <kb> <slug> [--depth <mode>]
| Depth mode | Behavior |
|---|---|
N (number) |
Unfold N levels, clip with Summary |
full |
Complete tree, no clipping |
For clipped subtrees, children Documents are replaced with a Summary object: chunkCount, totalBytes, depth, and optional summary text.
get
docpack get <kb> <slug>
Returns XML with the document's chunk and its full subtree. Attributes include slug, title, level, depth, parent, prev, next.
<document slug="api-auth" title="Authentication" level="2" depth="0" parent="api" prev="api-overview" next="api-billing">
<chunk>...</chunk>
<children>
<document slug="api-auth-oauth" title="OAuth" level="3" depth="0" parent="api-auth" prev="" next="api-auth-apikey">
<chunk>...</chunk>
</document>
</children>
</document>
search
docpack search <kb> "query" [--limit N] [--offset O]
FTS5 full-text search over titles and chunk content. Query language supports:
- Plain words:
authentication - Phrases:
"DataWindow painter" - Boolean:
DataWindow AND painter,error OR warning - Negation:
DataWindow NOT painter - Prefix:
GetSeries* - Column-specific:
title:DataWindow
Results ordered by BM25 score. Each hit carries a snippet excerpt (~30 tokens around matched terms with <b>/</b> markers). total gives full result set size.
Embeddings and reranking : TBD (requires AI).
summarize
docpack summarize <kb> --summaries <path>
docpack summarize <kb> --mode llm --model <name> --endpoint <url> --prompt <path>
Post-processing pass. Two modes:
JSONL file mode — import summaries from a JSONL file (one {"slug":"...","summary":"..."} per line):
docpack summarize ./mykb --summaries ./summaries.jsonl
LLM fold mode — built-in bottom-up tree fold with an OpenAI-compatible endpoint:
docpack summarize ./mykb \
--mode llm \
--model qwen3-8b \
--endpoint http://localhost:8000/v1 \
--prompt ./prompt.txt \
--concurrency 32 \
--min-content-length 200
Docpack traverses the document tree bottom-up, level by level. At each document it fills the prompt template with the document's content and its children's summaries, then sends a POST /chat/completions request. Parents always wait for all children to finish — siblings at the same depth are processed in parallel (bounded by --concurrency).
Tree folding algorithm:
- Find all leaf documents (no children). Process them in parallel.
- Move up one level. For each parent, fill the prompt template with its chunk + children summaries. Process in parallel.
- Repeat until the root is reached.
Prompt template variables:
| Variable | Description |
|---|---|
{title} |
Document's own title |
{slug} |
Document's own slug |
{chunk} |
Document's own content (Markdown). |
{children_titles} |
Ordered list of children titles, one per line |
{children_summaries} |
Ordered list of title: summary pairs, one per line |
{children_count} |
Number of children |
Pass-through optimization (--min-content-length):
If a leaf document has no chunk, or its chunk is shorter than --min-content-length, the LLM call is skipped. The chunk is used as-is if present, or the document is skipped. This avoids wasting LLM calls on trivial leaves and reduces hallucination risk on tiny inputs.
Options:
| Option | Required | Description |
|---|---|---|
--mode llm |
yes | Select LLM fold mode |
--model <name> |
yes | Model name sent to the endpoint |
--endpoint <url> |
yes | Base URL of an OpenAI-compatible server (e.g. http://localhost:8000/v1) |
--prompt <path> |
yes | Path to a prompt template file |
--concurrency <n> |
no | Max parallel LLM requests per level (default: 8) |
--min-content-length <n> |
no | Skip LLM call for leaf documents shorter than this (default: 0 = disabled) |
--api-key <key> |
no | API key for cloud endpoints |
Works with any OpenAI-compatible endpoint: vLLM, Ollama, LM Studio, cloud OpenAI.
Both modes use upsert semantics — existing summaries for untouched slugs are preserved.
skill
docpack skill <kb> --use-when "<description>" --output <dir>
Package an existing KB as a self-contained agent skill directory:
<output>/
SKILL.md # auto-generated skill instructions
references/
docpack.db
docpack.yaml
scripts/
docpack.mjs # wrapper script (pins docpack version)
| Option | Required | Description |
|---|---|---|
<kb> |
yes | Path to existing KB directory |
--use-when |
yes | When to use the skill (becomes SKILL.md description) |
--output |
yes | Output skill directory |
serve
docpack serve <kb> --mcp
Starts an MCP server over stdio, exposing a knowledge base with four tools: manifest, toc, get, search.
TypeScript API
Bundle
import { bundle } from "@rlemaigre/docpack";
const stats = bundle({
input: "./docs",
output: "./mykb",
home: "./docs/toc.md",
description: "My project documentation",
onProgress: (path, done, total) => console.log(`${done}/${total}`),
onError: (path, err) => console.error(err),
});
console.log(stats);
// { filesProcessed: 10, totalChunks: 85, totalBytes: 133714 }
Query
import { query } from "@rlemaigre/docpack";
const kb = query("./mykb");
// Discover entry point
const manifest = kb.manifest();
console.log(manifest.home); // "toc"
// Navigate with clipped summaries
const toc = kb.toc(manifest.home!, 2);
// Get full subtree
const doc = kb.get("api-auth");
// Search
const results = kb.search({
query: "authentication AND OAuth",
limit: 10,
offset: 0,
});
kb.close();
Summarize
JSONL file mode — import summaries from a JSONL file:
import { summarize } from "@rlemaigre/docpack";
await summarize({
input: "./mykb",
summaries: "./summaries.jsonl", // one {"slug":"...","summary":"..."} per line
});
LLM fold mode — built-in bottom-up tree fold with an LLM endpoint:
await summarize({
input: "./mykb",
mode: "llm",
model: "qwen3-8b",
endpoint: "http://localhost:8000/v1",
prompt: fs.readFileSync("./prompt.txt", "utf8"),
concurrency: 32,
minContentLength: 200,
});
Both modes use upsert semantics — existing summaries for untouched slugs are preserved.
Generate skill
import { generateSkill } from "@rlemaigre/docpack";
generateSkill({
kb: "./mykb",
useWhen: "Use when building PowerBuilder applications",
output: "./my-skill",
});
Reads the KB manifest and home TOC, renders a SKILL.md template, copies the KB to references/, and generates a scripts/docpack.mjs wrapper script that pins the docpack version.
Data model
Document
Document = {
type: "file" | "section",
title: string,
slug: string,
index: number,
chunk: string?, // self content (Markdown)
summary: string?, // subtree overview
children: Document[] | Summary
}
The Document shape is uniform across all levels — a file, a section, and a leaf section all share the same structure.
Summary
Summary = {
chunkCount: number, // descendants with content
totalBytes: number, // total chunk bytes in subtree
depth: number, // max depth below this document
text?: string // AI-generated overview
}
Storage
SQLite with FTS5. Schema is an internal detail and may change.
nodes-- document tree with slug, type, title, parent, chunk, summarynodes_fts-- FTS5 index on title and chunkclosure-- materialized transitive closure for subtree queries
Notes
- The bundler runs entirely synchronous -- no async, no streaming. Single SQLite transaction.
- Input files are read as Markdown (UTF-8). Conversion from other formats is the caller's responsibility.
toc()is the primary discovery tool. Clipped subtrees carrySummaryobjects that let you aggregate overviews across branches without loading full content.get()returns the full subtree. Usetoc()to find the slug you want, thenget()to read it.search()bypasses the slug gate -- use it for keyword discovery when you don't know the structure.- Summaries are optional post-processing. The bundler produces data; the summarizer produces overviews.
- The MCP server keeps the DB connection open across tool calls. Use it for multi-turn agent sessions.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.