Cheap Research
A bounded evidence review engine that ingests documents, extracts evidence for a given claim, detects contradictions, and produces auditable evidence packets without hallucinations or open-web research.
README
Cheap Research v1.0.0
A bounded evidence review engine. Give it a claim and a document corpus - it extracts relevant evidence, detects contradictions, and produces auditable evidence packets. No hallucinations, no global truth claims, no open-web research.
What it does
- Ingest PDF, TXT, MD, DOCX, DOC, HTML, RTF, EPUB documents
- Extract evidence spans relevant to your claim
- Detect contradictions between claim and corpus
- Render markdown evidence packets with caveats
- Audit immutable hash-chained audit trail
Trust posture
- Fails closed - enforcement engine blocks invalid authority transitions
- Bounded corpus - cannot make claims outside provided documents
- Explicit caveats - every packet states limitations
- No global truth - "not established in corpus" not "false"
- Human review required - claims need explicit reviewer sign-off
MCP Tools
| Tool | Description |
|---|---|
research_start |
Ingest corpus, assess claim, produce evidence packet |
research_status |
Get run state and evidence summary |
research_getEvidence |
Get top relevant spans + contradictions |
research_getPacket |
Get full markdown assessment report |
research_audit |
Run compliance integrity audit |
corpus_topics |
List available paper topics for download |
corpus_list |
List all papers in curated library (filterable) |
corpus_load |
Download seminal ML papers to corpus directory |
Quick Start: Download Papers
Use corpus_load to download a curated set of foundational ML papers:
// Download ALL papers to your corpus
corpus_load({ corpus_path: "/path/to/cheap-research/corpus" })
// Or download specific topics
corpus_load({
corpus_path: "/path/to/corpus",
topics: ["transformers", "optimization"]
})
// Or specific papers by ID
corpus_load({
corpus_path: "/path/to/corpus",
paper_ids: ["attention", "bert", "resnet"]
})
Available topics: transformers, foundations, optimization, reinforcement, generative, representations, efficiency
Papers included: Attention Is All You Need, BERT, GPT, GPT-3, ResNet, AlexNet, LSTM, Adam, Batch Norm, Dropout, GANs, VAE, Diffusion Models, Vision Transformer, Word2Vec, DQN, Knowledge Distillation, and more (22 papers total).
Install
git clone https://github.com/your-org/cheap-research.git
cd cheap-research
npm install
npm run build
Adding to Your AI Assistant
This server uses the Model Context Protocol (MCP). Here's how to connect it to your AI:
Claude (Claude Code CLI / Claude Desktop)
-
Open your MCP config file:
- Claude Code:
~/.claude.json - Claude Desktop:
~/Library/Application Support/Claude/claude_desktop_config.json(macOS) or%APPDATA%\Claude\claude_desktop_config.json(Windows)
- Claude Code:
-
Add the server to
mcpServers:
{
"mcpServers": {
"cheap-research": {
"command": "node",
"args": [
"/ABSOLUTE/PATH/TO/cheap-research/node_modules/.bin/tsx",
"--tsconfig",
"/ABSOLUTE/PATH/TO/cheap-research/tsconfig.base.json",
"/ABSOLUTE/PATH/TO/cheap-research/apps/mcp-server/src/index.ts"
]
}
}
}
-
Replace
/ABSOLUTE/PATH/TO/with the actual path where you cloned the repo. -
Restart Claude Code or Claude Desktop.
-
Type
/mcpin Claude Code to verify the server is connected.
Other MCP-compatible AI Tools
Any AI tool that supports MCP can use this server. Point it to run the npm run mcp command or use the config format above. Check your tool's documentation for MCP server configuration.
Run as MCP Server (manual)
npm run mcp
This starts the server in stdio mode, ready for MCP clients.
Example
// Start research on a claim
research_start({
task_type: "bounded_doc_claim_assessment",
target_claim: "Attention mechanisms enable Transformers to model long-range dependencies",
entity: "Transformer architecture",
feature: "attention mechanism",
scope: "Assess based on provided neural network papers",
corpus_path: "/path/to/corpus"
})
// Returns: { run_id, status: "completed", assessment: "qualified", ... }
Supported Formats
| Format | Extension | Parser |
|---|---|---|
.pdf |
pdf-parse | |
| Text | .txt |
native |
| Markdown | .md |
native |
| Word | .docx |
mammoth |
| Word (legacy) | .doc |
mammoth + fallback |
| HTML | .html, .htm |
cheerio |
| RTF | .rtf |
rtf-parser |
| EPUB | .epub |
epub2 |
What's NOT included (intentional)
- Open-web research
- Autonomous recommendations
- Confidence scores
- Free-form LLM synthesis
- Automatic operational acceptance
- Vector embeddings / semantic search
Architecture
corpus/ → ingestCorpus() → MemoryStore
↓
claim + evidence → findRelevantSpans() → scored spans
↓
→ detectContradictions() → flagged spans
↓
→ createClaimCandidate() → enforcement check
↓
→ renderPacketMarkdown() → evidence packet
↓
→ publishPacket() → audit log
State Machines
Claims, findings, packets all have strict state transitions enforced by the enforcement engine. Illegal transitions are blocked with explicit blockers.
Audit Trail
Hash-chained immutable audit log. Every action recorded with:
- Actor context (who)
- Object reference (what)
- Event type (action)
- Payload (details)
- Chain hash (integrity)
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.