agent-cost-mcp
Tracks AI agent token usage and spending in real time, with budget alerts, per-task cost breakdown, and a visual dashboard.
README
agent-cost-mcp
MCP server that tracks AI agent token usage and spending in real time. Budget alerts, per-task cost breakdown, visual dashboard, daily/weekly/monthly reports.
Works with any MCP client: Claude Code, Cursor, Windsurf, Codex, Gemini CLI, and more.


Why?
Every developer using AI agents worries about spending. Most tools don't tell you what each message costs until the bill arrives.
This MCP server tracks it in real time — per message, per model, per day. Set a budget, get alerts, see exactly where your money goes.
Features
- Visual dashboard — dark-themed web UI with spending charts, budget gauges, and activity log
- Per-message cost logging — see what each AI interaction costs instantly
- Budget alerts — warns when approaching daily/monthly limits
- Cost reports — today, this week, this month, all time
- Model breakdown — donut chart showing which model eats your budget
- Spending trends — 14-day bar chart with color-coded spending
- 15+ models supported — Claude, GPT, DeepSeek, Gemini, Llama
- Estimate before running — check cost before expensive tasks
- Local storage — all data stays on your machine (
~/.agent-cost-mcp/) - Auto-refresh — dashboard updates every 30 seconds
Quick Start
1. Install
pip install agent-cost-mcp
Or with uv:
uv pip install agent-cost-mcp
2. Add to your AI tool
Claude Code — add to ~/.claude/settings.json:
{
"mcpServers": {
"agent-cost": {
"command": "agent-cost-mcp"
}
}
}
Cursor — add to .cursor/mcp.json:
{
"mcpServers": {
"agent-cost": {
"command": "agent-cost-mcp"
}
}
}
Windsurf — add to MCP config:
{
"mcpServers": {
"agent-cost": {
"command": "agent-cost-mcp"
}
}
}
3. Open the dashboard
open dashboard.html
Or serve it locally:
cd ~/.agent-cost-mcp && python3 -m http.server 3456
# Open http://localhost:3456/dashboard.html
The dashboard reads from ~/.agent-cost-mcp/cost-log.json and auto-refreshes every 30 seconds. Leave it open in a browser tab while you work.
MCP Tools
These tools are available to any connected MCP client:
| Tool | What it does | Example |
|---|---|---|
log_cost |
Log token usage and cost for a task | log_cost(model="claude-sonnet-4-6", tokens_in=1500, tokens_out=800, task="code review") |
cost_report |
Get spending report | cost_report(period="today") — also: week, month, all |
set_budget |
Set daily/monthly budget limits | set_budget(daily_limit=5.00, monthly_limit=50.00) |
cost_trend |
Show daily spending chart | cost_trend(days=7) |
estimate_cost |
Estimate cost without logging | estimate_cost(model="claude-opus-4-6", tokens_in=5000, tokens_out=3000) |
supported_models |
List all models + pricing | supported_models() |
How It Works
You use Claude Code / Cursor / Windsurf normally
↓
MCP server logs each interaction (model, tokens, cost)
↓
Data saved to ~/.agent-cost-mcp/cost-log.json
↓
Dashboard reads the JSON and shows charts
↓
Budget alerts warn you before you overspend
The MCP server runs as a background process alongside your AI tool. You don't need to do anything extra — it tracks automatically when tools call log_cost.
Example Session
> How much did that last message cost?
Logged: $0.0165 (1,500 in / 800 out, claude-sonnet-4-6)
> Show my spending for today
# Cost Report — Today (2026-03-27)
- Messages: 26
- Tokens: 187,000 (118,000 in / 69,000 out)
- Total cost: $2.14
- Avg cost/message: $0.082
## By Model
claude-opus-4-6: $0.99 (46%)
claude-sonnet-4-6: $0.93 (43%)
gpt-5.4: $0.19 (9%)
deepseek-v3: $0.01 (1%)
gemini-2.5-flash: $0.00 (<1%)
## Budget
Daily: $2.14 / $5.00 (43%)
Monthly: $12.43 / $50.00 (25%)
> Set my daily budget to $3
Budget set: $3.00/day, $50.00/month
Supported Models
| Model | Input ($/1M) | Output ($/1M) |
|---|---|---|
| claude-opus-4-6 | $15.00 | $75.00 |
| claude-sonnet-4-6 | $3.00 | $15.00 |
| claude-haiku-4-5 | $0.80 | $4.00 |
| gpt-5.4 | $2.50 | $10.00 |
| gpt-5.2 | $1.50 | $6.00 |
| gpt-5.1 | $0.60 | $2.40 |
| gpt-4o | $2.50 | $10.00 |
| gpt-4o-mini | $0.15 | $0.60 |
| deepseek-v3 | $0.27 | $1.10 |
| deepseek-r1 | $0.55 | $2.19 |
| gemini-2.5-pro | $1.25 | $10.00 |
| gemini-2.5-flash | $0.15 | $0.60 |
| llama-4-maverick | $0.20 | $0.60 |
Missing a model? Open an issue or PR.
Data Storage
All data stored locally at ~/.agent-cost-mcp/cost-log.json. Nothing is sent to external services. Your spending data never leaves your machine.
Contributing
PRs welcome. Areas to improve:
- Add more model pricing
- Auto-detect token counts from MCP protocol metadata
- Export reports to CSV/PDF
- Slack/Discord alert integrations
License
MIT
Author
Built by Ha Le — University of Central Florida
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.