sarvam-mcp
A type-safe MCP server for the Sarvam AI API that enables chat, translation, transliteration, language detection, speech-to-text, and text-to-speech across 10 Indic languages and English.
README
sarvam-mcp
sarvam-mcp/ v0.1 ──────────────────────────────────────────────────
A focused, type-safe MCP server for the Sarvam AI API. Lets Claude Code (or any MCP-compatible client) speak to Sarvam's models — chat, translation, transliteration, language detection, speech-to-text, and text-to-speech — across 10 Indic languages plus English.
[!IMPORTANT] This project is not affiliated with, endorsed by, or associated with Sarvam AI. It's an unofficial MCP wrapper that calls the public Sarvam API using your own API key. No keys, audio, or text are routed through any third-party server.
[!NOTE] Requires a Sarvam AI API key. Get one at dashboard.sarvam.ai. Free tier is generous enough to test all six tools.
What it does
Wraps Sarvam's public REST surface in a small set of well-defined MCP tools. Tools are typed end-to-end with Zod schemas, validated at the boundary, and surface useful error messages when something goes wrong.
┌───────────────┐
│ Claude Code │
│ (or any MCP │
│ client) │
└───────┬───────┘
│ stdio (MCP)
▼
┌───────────────┐
│ sarvam-mcp │
└───────┬───────┘
│ HTTPS · api.sarvam.ai
▼
┌───────────────┐
│ Sarvam API │
│ (Sarvam-105B │
│ · Mayura │
│ · Saarika │
│ · Bulbul) │
└───────────────┘
Why another wrapper?
Sarvam ships a great official SDK for Python and Node. This is the MCP-shaped surface that lets agentic clients call it directly — no SDK code, no glue layer.
- 6 tools, deliberately scoped. Every tool is documented, typed, and tested.
- Strict TypeScript. No
any, no implicit returns,noUncheckedIndexedAccesson. - Zod-validated boundaries. Every input is parsed before the network call.
- Typed errors.
SarvamApiError,SarvamRateLimitError,SarvamConfigError,SarvamValidationError— never bareError. sarvam-mcp doctor. A diagnostic command that tells you exactly what's wrong with your setup.
Use this if you want a small, predictable surface you can read in an afternoon.
The 6 tools
| Tool | What it does | Sarvam endpoint |
|---|---|---|
sarvam_chat |
Chat completions (Sarvam-30B / Sarvam-105B), JSON mode supported | POST /v1/chat/completions |
sarvam_translate |
Bidirectional EN ↔ Indic and Indic ↔ Indic translation | POST /translate |
sarvam_transliterate |
Script conversion (Roman ↔ Devanagari, etc.) | POST /transliterate |
sarvam_detect_language |
Identify language and script of input text | POST /text-lid |
sarvam_speech_to_text |
Saarika v2.5 / Saaras v3 transcription from local audio file | POST /speech-to-text |
sarvam_text_to_speech |
Bulbul v3 synthesis. Returns base64 audio + optional WAV file | POST /text-to-speech |
Document Intelligence (Sarvam Vision) is on the v0.2 roadmap — it's an async batch flow that deserves its own treatment.
Install
Requires Node.js 20+.
npm install -g sarvam-mcp
# or, in a project:
npm install sarvam-mcp
For development:
git clone https://github.com/harshil1502/sarvam-mcp.git
cd sarvam-mcp
npm install
npm run build
Setup — three steps
1. Get a Sarvam API key
dashboard.sarvam.ai → create key. Free tier is enough to test all six tools.
2. Export it
export SARVAM_API_KEY=<your_key>
(Or put it in a .env file and source it before launching the MCP client — see .env.example.)
3. Verify
sarvam-mcp doctor
You should see something like:
sarvam-mcp · doctor
─────────────────────────────────────────────
[ ok ] SARVAM_API_KEY env var
Set (40 chars).
[ ok ] Language ID endpoint
Detected: hi-IN (expected hi-IN).
[ ok ] Chat completions
Got: ok
─────────────────────────────────────────────
all green — sarvam-mcp is ready.
If something fails, the doctor prints the exact reason.
Use with Claude Code
Add to your ~/.claude.json MCP servers:
{
"mcpServers": {
"sarvam": {
"command": "sarvam-mcp",
"env": { "SARVAM_API_KEY": "your_key_here" }
}
}
}
Then in Claude:
> translate "I'll be there at 9pm" to Tamil
[claude calls sarvam_translate]
→ "நான் இரவு 9 மணிக்கு அங்கு வருவேன்"
Examples
Chat in Hindi
// tool call: sarvam_chat
{
"messages": [
{ "role": "system", "content": "You are a helpful assistant. Reply in Hindi." },
{ "role": "user", "content": "What's the difference between Marathi and Hindi?" }
],
"model": "sarvam-105b",
"temperature": 0.4
}
Translate code-mixed text
// tool call: sarvam_translate
{
"input": "मेरा नाम Harshil है and I work in AI",
"source_language_code": "auto",
"target_language_code": "en-IN",
"mode": "code-mixed"
}
// → { "translated_text": "My name is Harshil and I work in AI" }
Transliterate a Hindi name to Roman script
// tool call: sarvam_transliterate
{
"input": "हर्षिल पटेल",
"source_language_code": "hi-IN",
"target_language_code": "en-IN"
}
// → { "transliterated_text": "Harshil Patel" }
Identify language
// tool call: sarvam_detect_language
{ "input": "வணக்கம் உலகம்" }
// → { "language_code": "ta-IN", "script_code": "Taml" }
Transcribe a voice memo
// tool call: sarvam_speech_to_text
{
"audio_path": "/Users/me/Downloads/voice-memo.m4a",
"language_code": "hi-IN",
"model": "saaras:v3",
"with_diarization": true
}
Generate Tamil speech and save it
// tool call: sarvam_text_to_speech
{
"inputs": ["நான் உங்களுக்கு உதவ முடியும்."],
"target_language_code": "ta-IN",
"speaker": "anushka",
"output_path": "/tmp/reply.wav"
}
// → audio also written to /tmp/reply.wav
Architecture
Four files, four responsibilities:
src/
├── client/sarvam.ts ← single fetch wrapper, both auth schemes
├── tools/
│ ├── chat.ts ← sarvam_chat
│ ├── translate.ts ← translate · transliterate · detect_language
│ └── speech.ts ← speech_to_text · text_to_speech
├── server.ts ← MCP transport + tool registration
├── cli/doctor.ts ← setup diagnostics
└── index.ts ← entry point + argv routing
When the next Sarvam API release breaks something, drift is contained to one file.
Strict TypeScript across the agent boundary
// tsconfig.json (excerpts)
{
"strict": true,
"noImplicitAny": true,
"noUncheckedIndexedAccess": true,
"noUnusedLocals": true,
"noUnusedParameters": true,
"noImplicitReturns": true
}
Agents will happily call your tool with null and then narrate the resulting error. Don't let them.
Errors
| Class | When it fires |
|---|---|
SarvamConfigError |
Missing SARVAM_API_KEY, unreadable audio file. |
SarvamApiError |
Any 4xx/5xx from Sarvam (other than 429). Includes status, endpoint, and body excerpt. |
SarvamRateLimitError |
429. Includes retryAfterSeconds if Sarvam sets Retry-After. |
SarvamValidationError |
Zod validation failed — the LLM passed bad arguments. |
Retries are intentionally not auto-implemented. The LLM gets the error and decides what to do.
Contributing / roadmap
v0.2 — Document Intelligence (Sarvam Vision):
sarvam_doc_extract_start— kick off async OCR jobsarvam_doc_extract_status— pollsarvam_doc_extract_results— fetch structured output
v0.3 — streaming (WebSocket TTS / STT) for low-latency voice agents.
PRs welcome. Run npm run typecheck && npm test before opening one.
License
MIT © 2026 Harshil Patel
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.