MCP Servers

sarvam-mcp

A type-safe MCP server for the Sarvam AI API that enables chat, translation, transliteration, language detection, speech-to-text, and text-to-speech across 10 Indic languages and English.

README

sarvam-mcp

sarvam-mcp/  v0.1   ──────────────────────────────────────────────────

A focused, type-safe MCP server for the Sarvam AI API. Lets Claude Code (or any MCP-compatible client) speak to Sarvam's models — chat, translation, transliteration, language detection, speech-to-text, and text-to-speech — across 10 Indic languages plus English.

[!IMPORTANT] This project is not affiliated with, endorsed by, or associated with Sarvam AI. It's an unofficial MCP wrapper that calls the public Sarvam API using your own API key. No keys, audio, or text are routed through any third-party server.

[!NOTE] Requires a Sarvam AI API key. Get one at dashboard.sarvam.ai. Free tier is generous enough to test all six tools.

What it does

Wraps Sarvam's public REST surface in a small set of well-defined MCP tools. Tools are typed end-to-end with Zod schemas, validated at the boundary, and surface useful error messages when something goes wrong.

                  ┌───────────────┐
                  │ Claude Code   │
                  │ (or any MCP   │
                  │  client)      │
                  └───────┬───────┘
                          │ stdio (MCP)
                          ▼
                  ┌───────────────┐
                  │  sarvam-mcp   │
                  └───────┬───────┘
                          │ HTTPS · api.sarvam.ai
                          ▼
                  ┌───────────────┐
                  │  Sarvam API   │
                  │  (Sarvam-105B │
                  │   · Mayura    │
                  │   · Saarika   │
                  │   · Bulbul)   │
                  └───────────────┘

Why another wrapper?

Sarvam ships a great official SDK for Python and Node. This is the MCP-shaped surface that lets agentic clients call it directly — no SDK code, no glue layer.

6 tools, deliberately scoped. Every tool is documented, typed, and tested.
Strict TypeScript. No any, no implicit returns, noUncheckedIndexedAccess on.
Zod-validated boundaries. Every input is parsed before the network call.
Typed errors. SarvamApiError, SarvamRateLimitError, SarvamConfigError, SarvamValidationError — never bare Error.
sarvam-mcp doctor. A diagnostic command that tells you exactly what's wrong with your setup.

Use this if you want a small, predictable surface you can read in an afternoon.

The 6 tools

Tool	What it does	Sarvam endpoint
`sarvam_chat`	Chat completions (Sarvam-30B / Sarvam-105B), JSON mode supported	`POST /v1/chat/completions`
`sarvam_translate`	Bidirectional EN ↔ Indic and Indic ↔ Indic translation	`POST /translate`
`sarvam_transliterate`	Script conversion (Roman ↔ Devanagari, etc.)	`POST /transliterate`
`sarvam_detect_language`	Identify language and script of input text	`POST /text-lid`
`sarvam_speech_to_text`	Saarika v2.5 / Saaras v3 transcription from local audio file	`POST /speech-to-text`
`sarvam_text_to_speech`	Bulbul v3 synthesis. Returns base64 audio + optional WAV file	`POST /text-to-speech`

Document Intelligence (Sarvam Vision) is on the v0.2 roadmap — it's an async batch flow that deserves its own treatment.

Install

Requires Node.js 20+.

npm install -g sarvam-mcp
# or, in a project:
npm install sarvam-mcp

For development:

git clone https://github.com/harshil1502/sarvam-mcp.git
cd sarvam-mcp
npm install
npm run build

Setup — three steps

1. Get a Sarvam API key

dashboard.sarvam.ai → create key. Free tier is enough to test all six tools.

2. Export it

export SARVAM_API_KEY=<your_key>

(Or put it in a .env file and source it before launching the MCP client — see .env.example.)

3. Verify

sarvam-mcp doctor

You should see something like:

sarvam-mcp · doctor
─────────────────────────────────────────────
[ ok ] SARVAM_API_KEY env var
       Set (40 chars).
[ ok ] Language ID endpoint
       Detected: hi-IN (expected hi-IN).
[ ok ] Chat completions
       Got: ok
─────────────────────────────────────────────
all green — sarvam-mcp is ready.

If something fails, the doctor prints the exact reason.

Use with Claude Code

Add to your ~/.claude.json MCP servers:

{
  "mcpServers": {
    "sarvam": {
      "command": "sarvam-mcp",
      "env": { "SARVAM_API_KEY": "your_key_here" }
    }
  }
}

Then in Claude:

> translate "I'll be there at 9pm" to Tamil

[claude calls sarvam_translate]
→ "நான் இரவு 9 மணிக்கு அங்கு வருவேன்"

Examples

Chat in Hindi

// tool call: sarvam_chat
{
  "messages": [
    { "role": "system", "content": "You are a helpful assistant. Reply in Hindi." },
    { "role": "user", "content": "What's the difference between Marathi and Hindi?" }
  ],
  "model": "sarvam-105b",
  "temperature": 0.4
}

Translate code-mixed text

// tool call: sarvam_translate
{
  "input": "मेरा नाम Harshil है and I work in AI",
  "source_language_code": "auto",
  "target_language_code": "en-IN",
  "mode": "code-mixed"
}
// → { "translated_text": "My name is Harshil and I work in AI" }

Transliterate a Hindi name to Roman script

// tool call: sarvam_transliterate
{
  "input": "हर्षिल पटेल",
  "source_language_code": "hi-IN",
  "target_language_code": "en-IN"
}
// → { "transliterated_text": "Harshil Patel" }

Identify language

// tool call: sarvam_detect_language
{ "input": "வணக்கம் உலகம்" }
// → { "language_code": "ta-IN", "script_code": "Taml" }

Transcribe a voice memo

// tool call: sarvam_speech_to_text
{
  "audio_path": "/Users/me/Downloads/voice-memo.m4a",
  "language_code": "hi-IN",
  "model": "saaras:v3",
  "with_diarization": true
}

Generate Tamil speech and save it

// tool call: sarvam_text_to_speech
{
  "inputs": ["நான் உங்களுக்கு உதவ முடியும்."],
  "target_language_code": "ta-IN",
  "speaker": "anushka",
  "output_path": "/tmp/reply.wav"
}
// → audio also written to /tmp/reply.wav

Architecture

Four files, four responsibilities:

src/
├── client/sarvam.ts    ← single fetch wrapper, both auth schemes
├── tools/
│   ├── chat.ts         ← sarvam_chat
│   ├── translate.ts    ← translate · transliterate · detect_language
│   └── speech.ts       ← speech_to_text · text_to_speech
├── server.ts           ← MCP transport + tool registration
├── cli/doctor.ts       ← setup diagnostics
└── index.ts            ← entry point + argv routing

When the next Sarvam API release breaks something, drift is contained to one file.

Strict TypeScript across the agent boundary

// tsconfig.json (excerpts)
{
  "strict": true,
  "noImplicitAny": true,
  "noUncheckedIndexedAccess": true,
  "noUnusedLocals": true,
  "noUnusedParameters": true,
  "noImplicitReturns": true
}

Agents will happily call your tool with null and then narrate the resulting error. Don't let them.

Errors

Class	When it fires
`SarvamConfigError`	Missing `SARVAM_API_KEY`, unreadable audio file.
`SarvamApiError`	Any 4xx/5xx from Sarvam (other than 429). Includes status, endpoint, and body excerpt.
`SarvamRateLimitError`	429. Includes `retryAfterSeconds` if Sarvam sets `Retry-After`.
`SarvamValidationError`	Zod validation failed — the LLM passed bad arguments.

Retries are intentionally not auto-implemented. The LLM gets the error and decides what to do.

Contributing / roadmap

v0.2 — Document Intelligence (Sarvam Vision):

sarvam_doc_extract_start — kick off async OCR job
sarvam_doc_extract_status — poll
sarvam_doc_extract_results — fetch structured output

v0.3 — streaming (WebSocket TTS / STT) for low-latency voice agents.

PRs welcome. Run npm run typecheck && npm test before opening one.

License

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured