Train in Silence
Enables users to describe their LLM fine-tuning job once and get the cheapest, fastest, and most balanced GPU options across a dozen cloud providers in seconds.
<p align="center"> <h1 align="center">Train in Silence</h1> The first Task-Aware MCP server for LLM fine-tuning. Stop comparing GPU prices. Start training. </p> <p align="center"> <a href="https://github.com/hlpun/Train-in-Silence/blob/main/LICENSE"><img src="https://img.shields.io/badge/license-MIT-blue.svg" alt="License"></a> <a href="https://www.python.org/downloads/"><img src="https://img.shields.io/badge/python-3.10+-blue.svg" alt="Python"></a> <a href="https://github.com/hlpun/Train-in-Silence/blob/main/README_zh.md">中文</a> </p>
You want to fine-tune an LLM. You open Vast.ai, RunPod, AWS, etc. -- a dozen tabs, a dozen pricing models, a dozen different ways to describe a GPU. Which option can actually run your job, and which one is cheapest or fastest? An hour later you're still in a spreadsheet and haven't written a single line of training code.
Train in Silence is the first Task-Aware MCP server for LLM fine-tuning. It doesn't just list prices; it understands your workload. Describe your training job once, and it calculates the required VRAM/FLOPs to return the cheapest, fastest, and most balanced hardware options across a dozen cloud providers -- in seconds.
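To give a feel for what a VRAM/FLOPs calculation can look like, here is a minimal sketch. It is not the TIS estimator: the `FineTuneJob` class, the function names, the 16 bytes-per-parameter figure (mixed-precision Adam for full fine-tuning), the 1.2x activation padding, and the 40% utilization are all assumptions chosen for illustration. The 6 FLOPs per parameter per token rule is a widely used approximation for training compute.

```python
from dataclasses import dataclass


@dataclass
class FineTuneJob:
    params_billion: float          # model size, in billions of parameters
    tokens_billion: float          # total training tokens (dataset tokens x epochs), in billions
    bytes_per_param: float = 16.0  # assumption: bf16 weights + grads, fp32 master weights + Adam states


def estimate_vram_gb(job: FineTuneJob, activation_overhead: float = 1.2) -> float:
    """Rough memory for weights, gradients and optimizer state, padded for activations."""
    # (params_billion * 1e9 params) * bytes_per_param / (1e9 bytes per GB)
    return job.params_billion * job.bytes_per_param * activation_overhead


def estimate_train_flops(job: FineTuneJob) -> float:
    """Common approximation: about 6 FLOPs per parameter per training token."""
    return 6.0 * job.params_billion * 1e9 * job.tokens_billion * 1e9


def estimate_gpu_hours(flops: float, peak_tflops: float, utilization: float = 0.4) -> float:
    """GPU-hours needed if the hardware sustains `utilization` of its peak throughput."""
    return flops / (peak_tflops * 1e12 * utilization) / 3600.0


job = FineTuneJob(params_billion=7.0, tokens_billion=0.1)
vram_gb = estimate_vram_gb(job)
flops = estimate_train_flops(job)
gpu_hours = estimate_gpu_hours(flops, peak_tflops=312.0)  # 312 TFLOPS: A100 dense bf16 peak
print(f"~{vram_gb:.0f} GB VRAM, {flops:.2e} FLOPs, ~{gpu_hours:.1f} A100-hours")
```

Numbers like these are only a first filter: parameter-efficient methods such as LoRA, quantization, and sharding across GPUs all change the memory picture substantially.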
Quickstart
Option A: Ask Claude Code (recommended)
Install the library and register it as a tool in Claude Code:
pip install train-in-silence
claude mcp add tis --scope user -- tis-mcp
Then just ask in natural language:
> I want to run the fine-tuning code in my current directory and finish within 20 hours.
> Find me the best GPU options across Vast.ai, RunPod, and Lambda.
Claude Code calls TIS behind the scenes and returns a structured recommendation -- no YAML, no config files, no manual comparison.
Option B: CLI
pip install train-in-silence
tis recommend examples/request.yaml
$ tis recommend examples/request.yaml
Found 5 viable configurations
Lowest cost: $4.32 | Fastest runtime: 2.1 hours
#1 [cheapest] RunPod 1x A6000 (48 GB) $4.32 / 6.8 h
#2 [fastest] Vast.ai 2x A100 (80 GB) $9.10 / 2.1 h
#3 [balanced] RunPod 1x A100 (80 GB) $6.40 / 3.2 h
...
Note: Output above is illustrative. Actual results depend on live market data.
Use It Your Way
| Channel | Command | Docs |
|---|---|---|
| CLI | tis recommend request.yaml | CLI Guide |
| REST API | uvicorn tis.api.server:app | API Reference |
| Claude Code | claude mcp add tis --scope user -- tis-mcp | MCP Guide |
| Claude Desktop | Add tis-mcp to claude_desktop_config.json | MCP Guide |
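For Claude Desktop, MCP servers are registered under the `mcpServers` key of `claude_desktop_config.json`. The sketch below builds such an entry and prints the resulting JSON. It assumes `tis-mcp` is on your `PATH` and needs no arguments; the server name `tis` simply mirrors the Claude Code command, and the `add_server` helper is hypothetical. See the MCP Guide for the project's own instructions.

```python
import json

# Entry for the "mcpServers" section of claude_desktop_config.json.
# Assumption: `tis-mcp` is installed on PATH and takes no arguments.
tis_entry = {"command": "tis-mcp", "args": []}


def add_server(config: dict, name: str, entry: dict) -> dict:
    """Return a copy of `config` with `entry` registered under mcpServers[name]."""
    servers = dict(config.get("mcpServers", {}))
    servers[name] = entry
    return {**config, "mcpServers": servers}


existing = {}  # assumption: replace with the parsed contents of your current config file
updated = add_server(existing, "tis", tis_entry)
print(json.dumps(updated, indent=2))
```

Merging into a copy keeps any servers you have already configured and leaves the original dictionary untouched until you decide to write the file back.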
Market Providers
TIS aggregates live pricing across a dozen GPU clouds. API keys are optional: without them, TIS falls back in sequence to the universal live aggregators (GPUHunt/GPUFinder) and then to bundled sample data.
| Provider Class | Included Platforms | Auth Required |
|---|---|---|
| Dedicated | Vast.ai, RunPod | Optional (Highly Recommended) |
| Aggregated | Vast.ai, RunPod, AWS, CoreWeave, Lambda Labs, Tensordock, Vultr, GCP, Azure, OCI, Nebius, CloudRift, Cudo Compute, Verda | None (Auto-fallback) |
Every recommendation clearly identifies its Source of Truth (e.g., live:official, live:gpuhunt, live:gpufinder, or sample) so you always know how fresh the data is. -> Provider details
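The fallback-with-provenance idea can be sketched as an ordered list of sources, where the first one that returns data wins and its label is stamped on every offer. This is an illustration only, not TIS's code: the fetcher functions, the dictionary fields, the prices, and the exact ordering of GPUHunt before GPUFinder are assumptions. Only the source labels come from the text above.

```python
from typing import Callable, Optional

Offers = list[dict]
Fetcher = Callable[[], Optional[Offers]]


def first_available(sources: list[tuple[str, Fetcher]]) -> tuple[str, Offers]:
    """Try each source in order; return the label and offers of the first that works."""
    for label, fetch in sources:
        try:
            offers = fetch()
        except Exception:
            continue  # treat any failure as "source unavailable" and move on
        if offers:
            # Stamp every offer with where it came from.
            return label, [{**offer, "source": label} for offer in offers]
    raise RuntimeError("no pricing source returned data")


# Hypothetical fetchers standing in for real network calls.
def official_api() -> Optional[Offers]:
    return None  # stands in for "no API key configured"


def gpuhunt() -> Optional[Offers]:
    raise TimeoutError("aggregator unreachable")


def gpufinder() -> Optional[Offers]:
    return [{"gpu": "A100 80GB", "usd_per_hour": 1.89}]  # made-up price


def bundled_sample() -> Optional[Offers]:
    return [{"gpu": "A100 80GB", "usd_per_hour": 2.00}]  # made-up price


label, offers = first_available([
    ("live:official", official_api),
    ("live:gpuhunt", gpuhunt),
    ("live:gpufinder", gpufinder),
    ("sample", bundled_sample),
])
print(label, offers)
```

Because the label travels with each offer, a downstream ranking step can show it next to the price instead of hiding which tier of the fallback chain produced it.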
Architecture at a Glance
YAML request -> Estimator (VRAM/FLOPs) -> Market Aggregator (10+ GPU clouds) -> Optimizer (cost vs. time) -> Pareto Frontier -> Ranked Output
Each recommendation shows where the data came from (live or sample) and flags any estimated fields -- no silent guesswork. -> Architecture deep-dive
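The last two stages of the pipeline can be illustrated with a small cost-versus-time Pareto filter. This is a sketch under stated assumptions, not the project's optimizer: the `Option` class, both function names, and the "balanced" scoring rule (smallest sum of normalized cost and normalized time) are hypothetical. The first three data points reuse the illustrative CLI output from the Quickstart, and the fourth is invented to show a dominated option being dropped.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Option:
    name: str
    cost_usd: float
    hours: float


def pareto_frontier(options: list[Option]) -> list[Option]:
    """Keep options that no other option matches or beats on both cost and time."""
    def dominated(a: Option) -> bool:
        return any(
            b.cost_usd <= a.cost_usd and b.hours <= a.hours
            and (b.cost_usd < a.cost_usd or b.hours < a.hours)
            for b in options
        )
    return [o for o in options if not dominated(o)]


def pick_labels(frontier: list[Option]) -> dict[str, Option]:
    """Label the cheapest, the fastest, and a 'balanced' option on the frontier."""
    max_cost = max(o.cost_usd for o in frontier)
    max_hours = max(o.hours for o in frontier)
    return {
        "cheapest": min(frontier, key=lambda o: o.cost_usd),
        "fastest": min(frontier, key=lambda o: o.hours),
        # Assumed rule: minimize normalized cost plus normalized time.
        "balanced": min(frontier, key=lambda o: o.cost_usd / max_cost + o.hours / max_hours),
    }


options = [
    Option("RunPod 1x A6000", 4.32, 6.8),
    Option("Vast.ai 2x A100", 9.10, 2.1),
    Option("RunPod 1x A100", 6.40, 3.2),
    Option("Hypothetical 1x A100", 7.00, 3.5),  # costs more and is slower than RunPod 1x A100
]
frontier = pareto_frontier(options)
picked = pick_labels(frontier)
for label, opt in picked.items():
    print(f"[{label}] {opt.name}: ${opt.cost_usd:.2f} / {opt.hours} h")
```

Filtering to the frontier first means every labeled option is a real trade-off: nothing in the ranked output is both slower and more expensive than another candidate.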
Known Limitations
- The estimation model is fixed, with no built-in calibration; future versions will calibrate against real runtimes.
- Changes to upstream provider API schemas will require matching updates to the field mappings.
🚧 Project Status & Contribution
This project is currently experimental.
- Issues & Suggestions: If you encounter any bugs, inaccurate estimations, or have suggestions for improvement, please feel free to submit a GitHub Issue.
- Contribute: If you'd like to improve the code or supplement hardware metadata, Pull Requests are highly welcome! We look forward to refining this LLM hardware planner with the community.