Echoes MCP Server
Enables AI integration with the Echoes storytelling platform, providing narrative knowledge graph extraction, semantic search, and arc isolation.
README
Echoes MCP Server
Model Context Protocol server for AI integration with Echoes storytelling platform.
Features
- Narrative Knowledge Graph: Automatically extracts characters, locations, events, and their relationships using Gemini AI
- Semantic Search: Find relevant chapters using natural language queries
- Entity Search: Search for characters, locations, and events
- Relation Search: Explore relationships between entities
- Arc Isolation: Each arc is a separate narrative universe - no cross-arc contamination
- Statistics: Aggregate word counts, POV distribution, and more
- Dynamic Prompts: Reusable prompt templates with placeholder substitution
Installation
npm install -g @echoes-io/mcp-server
Or run directly with npx:
npx @echoes-io/mcp-server --help
Requirements
- Node.js 20+
- Gemini API key (for entity extraction)
Usage
CLI
# Count words in a markdown file
echoes words-count ./content/arc1/ep01/ch001.md
# Index timeline content
echoes index ./content
# Index only a specific arc
echoes index ./content --arc bloom
# Get statistics
echoes stats
echoes stats --arc arc1 --pov Alice
# Search (filters by arc to avoid cross-arc contamination)
echoes search "primo incontro" --arc bloom
echoes search "Alice" --type entities --arc bloom
# Check narrative consistency
echoes check-consistency bloom
echoes check-consistency bloom --rules kink-firsts,outfit-claims
MCP Server
Configure in your MCP client (e.g., Claude Desktop, Kiro):
{
"mcpServers": {
"echoes": {
"command": "npx",
"args": ["@echoes-io/mcp-server"],
"cwd": "/path/to/timeline",
"env": {
"GEMINI_API_KEY": "your_api_key"
}
}
}
}
Environment Variables
| Variable | Required | Default | Description |
|---|---|---|---|
GEMINI_API_KEY |
Yes | - | API key for Gemini entity extraction |
ECHOES_GEMINI_MODEL |
No | gemini-2.5-flash |
Gemini model for extraction |
ECHOES_EMBEDDING_MODEL |
No | Xenova/e5-small-v2 |
HuggingFace embedding model |
ECHOES_EMBEDDING_DTYPE |
No | fp32 |
Quantization level: fp32, q8, q4 (see Performance Notes) |
HF_TOKEN |
No | - | HuggingFace token for gated models |
Available Tools
| Tool | Description |
|---|---|
words-count |
Count words and statistics in a markdown file |
index |
Index timeline content into LanceDB |
search |
Search chapters, entities, or relations |
stats |
Get aggregate statistics |
check-consistency |
Analyze arc for narrative inconsistencies |
timeline-overview |
Quick overview of all arcs: status, chapters, words, POVs |
graph-export |
Export knowledge graph in various formats |
history |
Query character/arc history (kinks, outfits, locations, relations) |
review-generate |
Generate review file for pending entity/relation extractions |
review-status |
Show review statistics for an arc |
review-apply |
Apply corrections from review file to database |
Available Prompts
| Prompt | Arguments | Description |
|---|---|---|
arc-resume |
arc, episode?, lastChapters? | Load complete context for resuming work on an arc |
new-chapter |
arc, chapter | Create a new chapter |
revise-chapter |
arc, chapter | Revise an existing chapter |
expand-chapter |
arc, chapter, target | Expand chapter to target word count |
new-character |
name | Create a new character sheet |
new-episode |
arc, episode | Create a new episode outline |
new-arc |
name | Create a new story arc |
revise-arc |
arc | Review and fix an entire arc |
Architecture
Content Hierarchy
Timeline (content directory)
└── Arc (story universe)
└── Episode (story event)
└── Chapter (individual .md file)
Arc Isolation
Each arc is treated as a separate narrative universe:
- Entities are scoped to arcs:
bloom:CHARACTER:Alice≠work:CHARACTER:Alice - Relations are internal to arcs
- Searches can be filtered by arc to avoid cross-arc contamination
Data Flow
┌─────────────────────────────────────────────────────────────┐
│ INDEXING PHASE │
├─────────────────────────────────────────────────────────────┤
│ 1. Scan content/*.md (filesystem scanner) │
│ 2. Parse frontmatter + content (gray-matter) │
│ 3. For each chapter: │
│ a. Extract entities/relations with Gemini API │
│ b. Generate embeddings (Transformers.js ONNX) │
│ c. Calculate word count and statistics │
│ 4. Save everything to LanceDB │
└─────────────────────────────────────────────────────────────┘
Development
# Install dependencies
npm install
# Run tests
npm test
# Run tests with coverage
npm run test:coverage
# Lint
npm run lint
# Type check
npm run typecheck
# Build
npm run build
Tech Stack
| Purpose | Tool |
|---|---|
| Runtime | Node.js 20+ |
| Language | TypeScript |
| Vector DB | LanceDB |
| Embeddings | @huggingface/transformers (ONNX) |
| Entity Extraction | Gemini AI |
| MCP SDK | @modelcontextprotocol/sdk |
| Testing | Vitest |
| Linting | Biome |
Performance Notes
Embedding Quantization
The default embedding model (Xenova/e5-small-v2) supports different quantization levels via ECHOES_EMBEDDING_DTYPE:
| Level | Speed | Quality | Memory | Recommendation |
|---|---|---|---|---|
fp32 |
Baseline | Best (100%) | High | Production with ample resources |
q8 |
2-3x faster | Excellent (99.6%) | 50% less | Recommended - optimal balance |
q4 |
3-4x faster | Good (99.1%) | 75% less | Resource-constrained environments |
Note: Some models like onnx-community/embeddinggemma-300m-ONNX don't support fp16. Always check model documentation.
Recommended setting:
export ECHOES_EMBEDDING_DTYPE=q8
License
MIT
Part of the Echoes project - a multi-POV digital storytelling platform.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.