WikiStrata

WikiStrata

Enables querying and interacting with a Markdown wiki generated from Confluence pages, with search and MCP server for LLM access.

Category
Visit Server

README

WikiStrata

Build a Markdown wiki and MCP layer from a Confluence tree.

WikiStrata is a public-facing, Confluence-first toolkit for turning internal documentation into structured Markdown, searchable indexes, and agent-friendly MCP context.

Language versions:

This is a prototype and a concept. Take it, adapt it, make it yours.

WikiStrata is currently Confluence-first. It is built to help a team connect a Confluence tree, ingest it into immutable source files, rebuild it into Markdown, and expose that wiki through MCP.

Goals

  • ingest Confluence page trees as immutable Layer 1 source data
  • rebuild them into a structured Layer 2 wiki using schema-driven rules
  • expose the wiki through stdio and HTTP MCP transports
  • provide reusable onboarding via init-style schema suggestion workflows

Current public-safe scope

  • generic Confluence ingest and attachment extraction
  • schema-driven sample build pipeline
  • Whoosh-backed search indexing
  • public CLI for init, ingest, build, index, stats, and MCP entrypoints
  • stdio MCP server
  • HTTP MCP server with CIDR and token gates
  • synthetic sample schema and test coverage
  • a tiny committed sample vault for GitHub browsing

Screenshots

CLI

WikiStrata CLI

Generated Summary

WikiStrata Summary

Obsidian Graph View

WikiStrata Graph View

Quickstart

cd <project-root>
python -m venv .venv
.venv\Scripts\activate
python -m pip install -e .

Then:

wikistrata init --write-starter-schema
wikistrata sample-project .\wikistrata-sample
cd .\wikistrata-sample
wikistrata build-wiki
wikistrata build-index
wikistrata search "refund policy"
wikistrata rebuild
wikistrata-mcp

Documentation

  • docs/CONFLUENCE_FIRST.md
  • docs/QUICKSTART.md
  • docs/SCHEMA_GUIDE.md
  • docs/SAMPLE_DATASET.md
  • docs/SCREENSHOT_GUIDE.md
  • docs/INIT_ONBOARDING_DESIGN.md
  • docs/REPO_BOUNDARY.md
  • docs/RELEASE_CHECKLIST.md
  • docs/PUBLIC_CONVERSION_AUDIT.md

Included samples

  • examples/sample-vault/
    • a very small committed markdown sample for GitHub browsing
  • wikistrata sample-project ...
    • the fuller synthetic demo flow for screenshots, Obsidian, and CLI demos

What is happening in this workspace

This workspace is being built with a whitelist strategy. Only generic engine code, generic schema examples, synthetic sample data, and public documentation should land here.

See:

  • docs/PUBLIC_CONVERSION_AUDIT.md
  • docs/WHITELIST.md
  • docs/MIGRATION_PLAN.md

Release gates

The project should not be published until all of the following are true.

  • no company names, brand names, internal URLs, page IDs, or internal examples remain
  • schema files are synthetic and domain-neutral
  • tests use synthetic fixtures only
  • documentation is rewritten for public users
  • the package name, CLI help text, and MCP server metadata are public-ready

Immediate priorities

  1. Move only generic core modules into src/wikistrata.
  2. Replace internal schema and fixtures with synthetic examples.
  3. Rewrite public docs around setup, schema customization, and MCP usage.
  4. Rename the package and CLI surface from internal names to public names.

Deferred work

These items do not block the split itself.

  • build_wiki.py refactor
  • production LLM synthesis automation
  • PPT image extraction
  • auth modernization beyond Basic Auth

Project Status

WikiStrata is an early public release.

The public surface is intentionally narrow and strong:

  • Confluence-first ingest
  • Markdown wiki generation
  • Whoosh-based search
  • stdio and HTTP MCP access

Broader multi-source onboarding and richer automation can come later.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured