Chain of Draft (CoD) MCP Server
Implements the Chain of Draft reasoning approach to generate minimalistic intermediate reasoning outputs while solving tasks, significantly reducing token usage while maintaining accuracy.
stat-guy
Tools
chain_of_draft_solve
Solve a reasoning problem using Chain of Draft approach
math_solve
Solve a math problem using Chain of Draft reasoning
code_solve
Solve a coding problem using Chain of Draft reasoning
logic_solve
Solve a logic problem using Chain of Draft reasoning
get_performance_stats
Get performance statistics for CoD vs CoT approaches
get_token_reduction
Get token reduction statistics for CoD vs CoT
analyze_problem_complexity
Analyze the complexity of a problem
README
Chain of Draft (CoD) MCP Server
Overview
This MCP server implements the Chain of Draft (CoD) reasoning approach as described in the research paper "Chain of Draft: Thinking Faster by Writing Less". CoD is a novel paradigm that allows LLMs to generate minimalistic yet informative intermediate reasoning outputs while solving tasks, significantly reducing token usage while maintaining accuracy.
Key Benefits
- Efficiency: Significantly reduced token usage (as little as 7.6% of standard CoT)
- Speed: Faster responses due to shorter generation time
- Cost Savings: Lower API costs for LLM calls
- Maintained Accuracy: Similar or even improved accuracy compared to CoT
- Flexibility: Applicable across various reasoning tasks and domains
Features
-
Core Chain of Draft Implementation
- Concise reasoning steps (typically 5 words or less)
- Format enforcement
- Answer extraction
-
Performance Analytics
- Token usage tracking
- Solution accuracy monitoring
- Execution time measurement
- Domain-specific performance metrics
-
Adaptive Word Limits
- Automatic complexity estimation
- Dynamic adjustment of word limits
- Domain-specific calibration
-
Comprehensive Example Database
- CoT to CoD transformation
- Domain-specific examples (math, code, biology, physics, chemistry, puzzle)
- Example retrieval based on problem similarity
-
Format Enforcement
- Post-processing to ensure adherence to word limits
- Step structure preservation
- Adherence analytics
-
Hybrid Reasoning Approaches
- Automatic selection between CoD and CoT
- Domain-specific optimization
- Historical performance-based selection
-
OpenAI API Compatibility
- Drop-in replacement for standard OpenAI clients
- Support for both completions and chat interfaces
- Easy integration into existing workflows
Setup and Installation
Prerequisites
- Python 3.10+ (for Python implementation)
- Node.js 18+ (for JavaScript implementation)
- Anthropic API key
Python Installation
- Clone the repository
- Install dependencies:
pip install -r requirements.txt
- Configure API keys in
.env
file:ANTHROPIC_API_KEY=your_api_key_here
- Run the server:
python server.py
JavaScript Installation
- Clone the repository
- Install dependencies:
npm install
- Configure API keys in
.env
file:ANTHROPIC_API_KEY=your_api_key_here
- Run the server:
node index.js
Claude Desktop Integration
To integrate with Claude Desktop:
-
Install Claude Desktop from claude.ai/download
-
Create or edit the Claude Desktop config file:
~/Library/Application Support/Claude/claude_desktop_config.json
-
Add the server configuration (Python version):
{ "mcpServers": { "chain-of-draft": { "command": "python3", "args": ["/absolute/path/to/cod/server.py"], "env": { "ANTHROPIC_API_KEY": "your_api_key_here" } } } }
Or for the JavaScript version:
{ "mcpServers": { "chain-of-draft": { "command": "node", "args": ["/absolute/path/to/cod/index.js"], "env": { "ANTHROPIC_API_KEY": "your_api_key_here" } } } }
-
Restart Claude Desktop
You can also use the Claude CLI to add the server:
# For Python implementation
claude mcp add chain-of-draft -e ANTHROPIC_API_KEY="your_api_key_here" "python3 /absolute/path/to/cod/server.py"
# For JavaScript implementation
claude mcp add chain-of-draft -e ANTHROPIC_API_KEY="your_api_key_here" "node /absolute/path/to/cod/index.js"
Available Tools
The Chain of Draft server provides the following tools:
Tool | Description |
---|---|
chain_of_draft_solve |
Solve a problem using Chain of Draft reasoning |
math_solve |
Solve a math problem with CoD |
code_solve |
Solve a coding problem with CoD |
logic_solve |
Solve a logic problem with CoD |
get_performance_stats |
Get performance stats for CoD vs CoT |
get_token_reduction |
Get token reduction statistics |
analyze_problem_complexity |
Analyze problem complexity |
Developer Usage
Python Client
If you want to use the Chain of Draft client directly in your Python code:
from client import ChainOfDraftClient
# Create client
cod_client = ChainOfDraftClient()
# Use directly
result = await cod_client.solve_with_reasoning(
problem="Solve: 247 + 394 = ?",
domain="math"
)
print(f"Answer: {result['final_answer']}")
print(f"Reasoning: {result['reasoning_steps']}")
print(f"Tokens used: {result['token_count']}")
JavaScript Client
For JavaScript/Node.js applications:
import { Anthropic } from "@anthropic-ai/sdk";
import dotenv from "dotenv";
// Load environment variables
dotenv.config();
// Create the Anthropic client
const anthropic = new Anthropic({
apiKey: process.env.ANTHROPIC_API_KEY,
});
// Import the Chain of Draft client
import chainOfDraftClient from './lib/chain-of-draft-client.js';
// Use the client
async function solveMathProblem() {
const result = await chainOfDraftClient.solveWithReasoning({
problem: "Solve: 247 + 394 = ?",
domain: "math",
max_words_per_step: 5
});
console.log(`Answer: ${result.final_answer}`);
console.log(`Reasoning: ${result.reasoning_steps}`);
console.log(`Tokens used: ${result.token_count}`);
}
solveMathProblem();
Implementation Details
The server is available in both Python and JavaScript implementations, both consisting of several integrated components:
Python Implementation
- AnalyticsService: Tracks performance metrics across different problem domains and reasoning approaches
- ComplexityEstimator: Analyzes problems to determine appropriate word limits
- ExampleDatabase: Manages and retrieves examples, transforming CoT examples to CoD format
- FormatEnforcer: Ensures reasoning steps adhere to word limits
- ReasoningSelector: Intelligently chooses between CoD and CoT based on problem characteristics
JavaScript Implementation
- analyticsDb: In-memory database for tracking performance metrics
- complexityEstimator: Analyzes problems to determine complexity and appropriate word limits
- formatEnforcer: Ensures reasoning steps adhere to word limits
- reasoningSelector: Automatically chooses between CoD and CoT based on problem characteristics and historical performance
Both implementations follow the same core principles and provide identical MCP tools, making them interchangeable for most use cases.
License
This project is open-source and available under the MIT license.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
MCP Package Docs Server
Facilitates LLMs to efficiently access and fetch structured documentation for packages in Go, Python, and NPM, enhancing software development with multi-language support and performance optimization.
Claude Code MCP
An implementation of Claude Code as a Model Context Protocol server that enables using Claude's software engineering capabilities (code generation, editing, reviewing, and file operations) through the standardized MCP interface.
@kazuph/mcp-taskmanager
Model Context Protocol server for Task Management. This allows Claude Desktop (or any MCP client) to manage and execute tasks in a queue-based system.
Linear MCP Server
Enables interaction with Linear's API for managing issues, teams, and projects programmatically through the Model Context Protocol.
mermaid-mcp-server
A Model Context Protocol (MCP) server that converts Mermaid diagrams to PNG images.
Jira-Context-MCP
MCP server to provide Jira Tickets information to AI coding agents like Cursor
Crypto Price & Market Analysis MCP Server
A Model Context Protocol (MCP) server that provides comprehensive cryptocurrency analysis using the CoinCap API. This server offers real-time price data, market analysis, and historical trends through an easy-to-use interface.
MCP PubMed Search
Server to search PubMed (PubMed is a free, online database that allows users to search for biomedical and life sciences literature). I have created on a day MCP came out but was on vacation, I saw someone post similar server in your DB, but figured to post mine.