WikiStrata
Enables querying and interacting with a Markdown wiki generated from Confluence pages, with search and MCP server for LLM access.
Build a Markdown wiki and MCP layer from a Confluence tree.
WikiStrata is a public-facing, Confluence-first toolkit for turning internal documentation into structured Markdown, searchable indexes, and agent-friendly MCP context.
This is a prototype and a concept. Take it, adapt it, make it yours.
WikiStrata is currently Confluence-first.
It is built to help a team connect a Confluence tree, ingest it into immutable source files, rebuild it into Markdown, and expose that wiki through MCP.
Goals
- ingest Confluence page trees as immutable Layer 1 source data
- rebuild them into a structured Layer 2 wiki using schema-driven rules
- expose the wiki through stdio and HTTP MCP transports
- provide reusable onboarding via init-style schema suggestion workflows
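The two-layer idea in the goals above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names `write_layer1` and `build_layer2`, the page dictionary shape, and the label-to-folder schema format are hypothetical and are not WikiStrata's actual API or schema.

```python
import hashlib
import json
import stat
from pathlib import Path


def write_layer1(source_dir, page):
    """Store one raw page as a read-only JSON file and never overwrite it.

    Hypothetical helper: the file name combines the page id with a content
    hash, so changed content produces a new file instead of a rewrite.
    """
    payload = json.dumps(page, sort_keys=True, ensure_ascii=False).encode("utf-8")
    digest = hashlib.sha256(payload).hexdigest()[:12]
    path = Path(source_dir) / f"{page['id']}-{digest}.json"
    if not path.exists():
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_bytes(payload)
        path.chmod(stat.S_IREAD)  # mark the source file read-only
    return path


def build_layer2(source_dir, wiki_dir, schema):
    """Rebuild Markdown pages from Layer 1 files using label-to-folder rules.

    Hypothetical schema shape: {"folders": {label: folder}, "default_folder": str}.
    Titles are used as file names unchanged, which a real build would sanitize.
    """
    written = []
    for raw in sorted(Path(source_dir).glob("*.json")):
        page = json.loads(raw.read_text(encoding="utf-8"))
        folder = schema["default_folder"]
        for label in page.get("labels", []):
            if label in schema["folders"]:
                folder = schema["folders"][label]
                break
        target = Path(wiki_dir) / folder / f"{page['title']}.md"
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_text(f"# {page['title']}\n\n{page['body']}\n", encoding="utf-8")
        written.append(target)
    return written
```

Because Layer 2 is derived only from Layer 1 files and the schema, it can be deleted and rebuilt at any time without contacting Confluence again.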
Current public-safe scope
- generic Confluence ingest and attachment extraction
- schema-driven sample build pipeline
- Whoosh-backed search indexing
- public CLI for init, ingest, build, index, stats, and MCP entrypoints
- stdio MCP server
- HTTP MCP server with CIDR and token gates
- synthetic sample schema and test coverage
- a tiny committed sample vault for GitHub browsing
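The CIDR and token gates listed above can be pictured with a short sketch. This is not WikiStrata's implementation: the function name `is_request_allowed` and its parameters are assumptions for illustration, and only the Python standard library is used.

```python
import hmac
import ipaddress


def is_request_allowed(client_ip, token, allowed_cidrs, expected_token):
    """Return True only if the client IP is in an allowed CIDR and the token matches.

    Hypothetical gate: both checks must pass before a request is served.
    """
    try:
        address = ipaddress.ip_address(client_ip)
    except ValueError:
        return False  # unparseable address: reject

    networks = [ipaddress.ip_network(cidr, strict=False) for cidr in allowed_cidrs]
    in_allowed_network = any(address in network for network in networks)

    # Constant-time comparison avoids leaking token contents through timing.
    token_matches = hmac.compare_digest(
        token.encode("utf-8"), expected_token.encode("utf-8")
    )
    return in_allowed_network and token_matches
```

For example, with `allowed_cidrs=["10.0.0.0/8"]`, a request from `10.1.2.3` carrying the right token passes, while the same token sent from `192.168.1.5` is rejected.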
Screenshots
- CLI
- Generated Summary
- Obsidian Graph View
Quickstart
cd <project-root>
python -m venv .venv
.venv\Scripts\activate
python -m pip install -e .
Then:
wikistrata init --write-starter-schema
wikistrata sample-project .\wikistrata-sample
cd .\wikistrata-sample
wikistrata build-wiki
wikistrata build-index
wikistrata search "refund policy"
wikistrata rebuild
wikistrata-mcp
Documentation
- docs/CONFLUENCE_FIRST.md
- docs/QUICKSTART.md
- docs/SCHEMA_GUIDE.md
- docs/SAMPLE_DATASET.md
- docs/SCREENSHOT_GUIDE.md
- docs/INIT_ONBOARDING_DESIGN.md
- docs/REPO_BOUNDARY.md
- docs/RELEASE_CHECKLIST.md
- docs/PUBLIC_CONVERSION_AUDIT.md
Included samples
- examples/sample-vault/: a very small committed Markdown sample for GitHub browsing
- wikistrata sample-project ...: the fuller synthetic demo flow for screenshots, Obsidian, and CLI demos
What is happening in this workspace
This workspace is being built with a whitelist strategy. Only generic engine code, generic schema examples, synthetic sample data, and public documentation should land here.
See:
- docs/PUBLIC_CONVERSION_AUDIT.md
- docs/WHITELIST.md
- docs/MIGRATION_PLAN.md
Release gates
The project should not be published until all of the following are true.
- no company names, brand names, internal URLs, page IDs, or internal examples remain
- schema files are synthetic and domain-neutral
- tests use synthetic fixtures only
- documentation is rewritten for public users
- the package name, CLI help text, and MCP server metadata are public-ready
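The first release gate above lends itself to an automated check. The sketch below is one possible approach, not part of WikiStrata: the denylist patterns (`AcmeCorp`, `internal.example`, `pageId=`) are placeholders a team would replace with its own terms, and `find_violations` and `scan_tree` are hypothetical helper names.

```python
import re
from pathlib import Path

# Placeholder denylist: swap in real company names, internal hosts, and ID formats.
DENYLIST_PATTERNS = [
    re.compile(r"https?://[\w.-]*\.internal\.example\b", re.IGNORECASE),
    re.compile(r"\bAcmeCorp\b", re.IGNORECASE),
    re.compile(r"\bpageId=\d+"),
]


def find_violations(text):
    """Return (line_number, matched_text) for every denylisted match in text."""
    violations = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        for pattern in DENYLIST_PATTERNS:
            for match in pattern.finditer(line):
                violations.append((line_number, match.group(0)))
    return violations


def scan_tree(root, suffixes=(".md", ".py", ".yaml", ".yml", ".json")):
    """Scan text files under root and return {path: violations} for files with hits."""
    results = {}
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix in suffixes:
            text = path.read_text(encoding="utf-8", errors="replace")
            violations = find_violations(text)
            if violations:
                results[str(path)] = violations
    return results
```

A check like this could run in CI and fail the build when `scan_tree(".")` returns a non-empty result. It complements a manual review rather than replacing one, since pattern matching cannot catch every internal example.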
Immediate priorities
- Move only generic core modules into src/wikistrata.
- Replace internal schema and fixtures with synthetic examples.
- Rewrite public docs around setup, schema customization, and MCP usage.
- Rename the package and CLI surface from internal names to public names.
Deferred work
These items do not block the split itself.
- build_wiki.py refactor
- production LLM synthesis automation
- PPT image extraction
- auth modernization beyond Basic Auth
Project Status
WikiStrata is an early public release.
The public surface is intentionally narrow and strong:
- Confluence-first ingest
- Markdown wiki generation
- Whoosh-based search
- stdio and HTTP MCP access
Broader multi-source onboarding and richer automation can come later.