nikhilnt
Track LLM token costs across Claude, GPT and Gemini. MCP server + CLI with optimization hints and $ savings estimates.
README
TokenBurnRate
See where your AI tokens go — and how to spend less of them.
TokenBurnRate is an MCP server + CLI that logs every Claude / GPT / Gemini API call locally, shows you a cost dashboard in your terminal, and tells you exactly how to reduce that cost.
⬡ token-tracker LAST 7 DAYS
────────────────────────────────────────────────────────────────────────
Overview
Total cost $18.7421 (~$80.59/month est.)
API calls 347
Input tokens 4.82M
Cache reads 620K (11.4% hit rate)
Daily Cost
Mon Jun 02 ████████████░░░░░░░░░░░░ $2.14
Tue Jun 03 ████████████████████░░░░ $3.82
Thu Jun 05 ████████████████████████ $4.51
💡 Optimization Hints saves $31.20/mo
● CRIT Prompt cache barely used
Cache hit rate: 11.4% — target is 30–60%
Action: Move static content to top of messages
Est. saving: $12.40/month
● HIGH Expensive model doing test generation
104 test-gen calls on Sonnet costing $2.25/week
Action: Route to claude-haiku-4-5, saves 80%
Est. saving: $7.80/month
Install
npm install -g token-tracker-mcp
Or run without installing:
npx token-tracker-mcp report
Add to Claude Desktop
node scripts/setup.js
Or add manually to ~/Library/Application Support/Claude/claude_desktop_config.json:
{
"mcpServers": {
"token-tracker": {
"command": "token-tracker",
"args": ["serve"]
}
}
}
Restart Claude Desktop. Done.
CLI Commands
| Command | Description |
|---|---|
token-tracker report |
Full 7-day dashboard |
token-tracker report --period month |
Monthly report |
token-tracker today |
Today only |
token-tracker hints |
Optimization hints ranked by $ saving |
token-tracker hint <id> |
Deep-dive on one hint |
token-tracker status |
One-line: cost · cache % · top hint |
token-tracker budget |
Budget gauges |
token-tracker models |
Pricing table for all models |
token-tracker export > out.csv |
Raw CSV export |
MCP Tools (use inside Claude)
| Tool | Description |
|---|---|
log_usage |
Log an API call — auto-calculates cost |
get_summary |
Summary for today / week / month / all |
get_hints |
Ranked optimization hints with $ savings |
get_hint_detail |
Deep-dive on a specific hint |
set_budget |
Set a daily / weekly / monthly spend limit |
list_sessions |
Sessions ranked by cost |
list_models |
Pricing table |
export_csv |
CSV dump |
Optimization Hints Engine
8 deterministic rules — no LLM calls, runs instantly on your local data:
| Hint | Triggers when |
|---|---|
cache-utilization |
Cache hit rate < 30% |
model-swap-testing |
Test gen running on Sonnet / Opus |
model-swap-debug |
Debugging on Opus |
verbose-outputs |
Output / input ratio > 0.35 |
session-spike |
Any session costs 3× your average |
context-bloat |
Avg tokens / call > 8K |
retry-loops |
Sessions with 30+ high-token calls |
single-model-dependency |
100% traffic on one expensive model |
Each hint includes severity · evidence · recommended action · estimated monthly saving.
Privacy
All data stored at ~/.token-tracker/usage.db (SQLite).
Nothing leaves your machine. No telemetry, no account required.
Development
git clone https://github.com/nikhilnt1234/TokenBurnRate.git
cd TokenBurnRate
npm install --ignore-scripts
npx tsup
npm test
Roadmap
- [ ] Team / Supabase backend (multi-user shared dashboard)
- [ ] Weekly email digest
- [ ] Slack / webhook alerts
- [ ] macOS menubar app
License
MIT © 2026 Nikhil T
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.