MCP-Deep-Researcher
A multi-agent research system that decomposes complex queries into targeted sub-questions, searches the web in parallel, scores source credibility, and synthesizes findings into structured markdown reports.
README
π MCP Deep Researcher
A multi-agent research system that decomposes complex queries into targeted sub-questions, searches the web in parallel, scores source credibility, and synthesizes findings into structured markdown reports β accessible via Streamlit UI, MCP tool (Claude Code / Cursor), or Python API.
What It Does
Give it a broad research question. It returns a structured report with executive summary, key findings, knowledge gaps, and cited sources β in about 10 seconds.
"What are the latest developments in multi-agent AI systems?"
β
# Research Report: Multi-Agent AI Systems β Latest Developments
## Executive Summary
...
## Key Findings
### How are multi-agent frameworks evolving in 2026?
... [source](https://...) [credibility: high]
## Knowledge Gaps
- No peer-reviewed benchmarks comparing LangGraph vs CrewAI at scale
- ...
## Sources
1. https://arxiv.org/... [high]
2. https://techcrunch.com/... [medium]
Architecture
Three-node LangGraph pipeline with typed state, parallel search, and 24-hour result caching:
ββββββββββββ ββββββββββββββββββ ββββββββββββββββ
β Planner ββββββΆβ Searcher ββββββΆβ Synthesizer β
β (GPT-4o) β β (Tavily Γ5) β β (GPT-4o) β
ββββββββββββ ββββββββββββββββββ ββββββββββββββββ
β β β
βΌ βΌ βΌ
3-5 targeted Parallel searches Markdown report
sub-questions with credibility with citations
scoring and knowledge gaps
Planner β Decomposes the query into 3β5 non-overlapping sub-questions using GPT-4o structured output (Pydantic). Context-aware: follow-up queries build on prior research instead of repeating it.
Searcher β Fires all searches concurrently via ThreadPoolExecutor. Each result is tagged with a credibility score (high / medium / unverified) based on domain authority. Graceful per-question error handling.
Synthesizer β Analyzes all evidence, weights high-credibility sources when findings conflict, and produces a structured report at temperature=0.2 for consistency.
Cache β SHA-256 hash of (query + context + search depth). 24-hour TTL. Repeat queries return in <100ms.
Quick Start
Prerequisites
Install
git clone https://github.com/sid12super/MCP-Deep-Researcher.git
cd MCP-Deep-Researcher
uv sync
cp .env.example .env
# Add your API keys to .env
Usage
Streamlit UI
streamlit run app.py
Opens at http://localhost:8501 with:
- Real-time progress tracking (planning β searching β synthesizing)
- Search depth toggle (basic / advanced)
- Multi-turn conversation with context carry-over
- Export full research conversation as HTML, PDF, or JSON
- "New Research Topic" button to reset context
MCP Server (Claude Code / Cursor)
The MCP server exposes the full research pipeline as a tool with Pydantic-validated input, search depth control, and multi-turn conversation support.
Start the server:
uv run server.py
Configure your MCP client β create .mcp.json in your project root:
{
"mcpServers": {
"deep_researcher_mcp": {
"command": "uv",
"args": [
"--directory",
"/absolute/path/to/MCP-Deep-Researcher",
"run",
"server.py"
],
"env": {
"OPENAI_API_KEY": "sk-...",
"TAVILY_API_KEY": "tvly-..."
}
}
}
}
Use it in Claude Code:
Use deep_researcher_research to find the latest developments in multi-agent AI systems
Use deep_researcher_research with search_depth "basic" for a quick comparison of LangGraph vs CrewAI
Use deep_researcher_research to follow up on that β pass the previous report as conversation_context
The tool accepts three parameters:
| Parameter | Type | Default | Description |
|---|---|---|---|
query |
string | required | Research question (3β2000 chars) |
search_depth |
"basic" | "advanced" |
"advanced" |
Speed vs. thoroughness tradeoff |
conversation_context |
string | "" |
Prior research for multi-turn follow-ups |
Python API
from agents import run_research
# Simple query
report = run_research("What are the latest AI trends in 2026?")
# With options
report = run_research(
"How does this compare to 2025?",
conversation_context="Previous findings: ...",
search_depth="basic",
)
# Full state (for programmatic access)
result = run_research("Your query", return_full_state=True)
# result["report"], result["query"], result["research_questions"]
Features
Search Depth Control
Toggle between basic (faster, ~8s) and advanced (comprehensive, ~14s) from the Streamlit sidebar or as an MCP parameter. Each depth caches separately.
Multi-turn Conversation
Follow-up queries automatically receive prior research context. The planner generates deeper, non-redundant questions instead of repeating covered ground. Reset anytime with "New Research Topic."
Export Formats
Download the full research conversation (all queries and reports) as:
- HTML β styled, web-ready
- PDF β professional, print-ready (via ReportLab)
- JSON β structured, machine-readable
Source Credibility Scoring
Every source is automatically classified:
- High β peer-reviewed, government, major outlets (arxiv.org, reuters.com, nih.gov, etc.)
- Medium β established tech/business (techcrunch.com, wikipedia.org, bloomberg.com, etc.)
- Unverified β everything else
The synthesizer weights high-credibility sources more heavily when findings conflict.
Real-time Progress
Streamlit UI shows live status updates as each pipeline stage completes β questions generated, results retrieved, report synthesized. Cache hits display instantly.
Caching
Results are cached by SHA-256 hash of (query + conversation context + search depth) with a 24-hour TTL. Identical requests return in <100ms at zero cost.
Project Structure
βββ agents.py # LangGraph pipeline, nodes, caching, credibility scoring
βββ server.py # MCP server (FastMCP, Pydantic input, async)
βββ app.py # Streamlit UI (chat, exports, progress, sidebar)
βββ pyproject.toml # Dependencies (uv)
βββ .env.example # API key template
βββ .mcp.json # MCP client config (gitignored β contains keys)
βββ CLAUDE.md # Claude Code development context
βββ test/ # Import, integration, and pipeline tests
Performance
| Stage | Time | Notes |
|---|---|---|
| Planner | ~2-3s | GPT-4o structured output |
| Searcher | ~2-3s | Parallel via ThreadPoolExecutor |
| Synthesizer | ~5-8s | GPT-4o at temperature=0.2 |
| Total | ~9-14s | First run |
| Cached | <100ms | Repeat queries within 24h |
Cost per unique query: ~$0.06-0.10 (GPT-4o + Tavily) Cost per cached query: $0.00
Troubleshooting
| Issue | Fix |
|---|---|
ModuleNotFoundError |
Run uv sync |
OpenAI API key not found |
Check .env exists with OPENAI_API_KEY |
Tavily API error |
Verify key at app.tavily.com |
| Port 8501 in use | streamlit run app.py --server.port 8502 |
| MCP server not found | Ensure .mcp.json is at project root (not inside .claude/) |
| MCP server failed | Test with uv run server.py directly to see errors |
Tech Stack
| Component | Technology |
|---|---|
| Agent orchestration | LangGraph |
| LLM | OpenAI GPT-4o |
| Web search | Tavily |
| MCP server | FastMCP (Python SDK) |
| Web UI | Streamlit |
| PDF export | ReportLab |
| Dependency management | uv |
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.