MCP Servers

ElevenLabs MCP Enhanced

An enhanced server for ElevenLabs that enables high-quality text-to-speech, voice cloning, and multi-speaker dialogue management. It features advanced conversational tools for transcript retrieval, history tracking, and emotional audio synthesis using the v3 model.

README

ElevenLabs MCP Enhanced

</div>

Enhanced fork of the official ElevenLabs MCP server with additional conversational AI features including conversation history and transcript retrieval.

This enhanced version is developed and maintained by Boris Djordjevic and the 199 Longevity team.

🚀 What's New in This Fork

This enhanced version adds critical conversational AI features missing from the original:

🤖 AI-Friendly Improvements (v1.0.0)

✅ Official v3 API: Now uses official ElevenLabs endpoints - no proxy needed!
🎯 Smart Voice Defaults: search_voices() now returns common working voices instantly
📚 Educational Error Messages: Errors guide AI agents to success with examples
💡 Clear Tool Guidance: No more confusion about single vs multi-speaker tools
🎤 Accurate v3 Voice IDs: All 20 v3-optimized voices now have correct IDs and descriptions
🏯 Auto-Split Long Dialogues: Automatically splits dialogues over 3000 chars into multiple files
🎯 Auto-Adjust Stability: Invalid stability values auto-round to nearest valid option (0.0, 0.5, 1.0)
🏷️ Smart Tag Simplification: Complex tags auto-convert to valid v3 tags for better quality
⏱️ Dynamic Timeouts: Prevents timeouts on complex dialogues by calculating appropriate wait times

🆕 ElevenLabs v3 Model Support (Official)

🎭 Enhanced Expressiveness: Use the official v3 model with model="v3" parameter
🎤 Audio Tags: Add emotions and sound effects like [thoughtful], [crying], [laughing], [piano]
👥 Multi-Speaker Dialogue: Generate natural conversations between multiple speakers
✨ Dialogue Enhancement: Automatically enhance your dialogue with proper formatting and tags
🌍 70+ Languages: v3 supports multilingual synthesis with emotional control
✅ Official API: Now uses the official ElevenLabs text-to-dialogue endpoint

🎙️ Conversational AI Features

Conversation History: Retrieve full conversation details including transcripts
📝 Transcript Access: Get conversation transcripts in multiple formats (plain, timestamps, JSON)
⏳ Real-time Monitoring: Wait for ongoing conversations to complete and retrieve results
🔍 Conversation Search: List and filter conversations by agent, status, and more
🎨 Improved Formatting: Consistent formatting across all list operations

About

This is an enhanced fork of the official ElevenLabs <a href="https://github.com/modelcontextprotocol">Model Context Protocol (MCP)</a> server that enables interaction with powerful Text to Speech and audio processing APIs. This server allows MCP clients like <a href="https://www.anthropic.com/claude">Claude Desktop</a>, <a href="https://www.cursor.so">Cursor</a>, <a href="https://codeium.com/windsurf">Windsurf</a>, <a href="https://github.com/openai/openai-agents-python">OpenAI Agents</a> and others to generate speech, clone voices, transcribe audio, manage conversational AI agents, and now retrieve conversation history.

🚀 Quick Install

Zero Install (Recommended)

No installation required! Just use npx:

npx elevenlabs-mcp-enhanced --api-key YOUR_API_KEY

Global Install

Install once, use everywhere:

npm install -g elevenlabs-mcp-enhanced
elevenlabs-mcp-enhanced --api-key YOUR_API_KEY

Environment Variable

Set your API key once:

export ELEVENLABS_API_KEY="your-api-key"
npx elevenlabs-mcp-enhanced

📋 Requirements

Node.js 16+ (for npm/npx)
Python 3.11+ (automatically managed by the npm package)
ElevenLabs API Key - Get one at elevenlabs.io

Quickstart with Claude Desktop

Option 1: Using npm/npx (Recommended - No installation required!)

Get your API key from ElevenLabs. There is a free tier with 10k credits per month.
Go to Claude > Settings > Developer > Edit Config > claude_desktop_config.json to include the following:

{
  "mcpServers": {
    "ElevenLabs": {
      "command": "npx",
      "args": ["elevenlabs-mcp-enhanced"],
      "env": {
        "ELEVENLABS_API_KEY": "<insert-your-api-key-here>"
      }
    }
  }
}

That's it! No installation needed - npx will automatically download and run the server.

Option 2: Using Python (Original method)

If you prefer the original Python installation:

Get your API key from ElevenLabs.

Install from GitHub:

pip install git+https://github.com/199-biotechnologies/elevenlabs-mcp-enhanced.git

Configure Claude Desktop with:

{
  "mcpServers": {
    "ElevenLabs": {
      "command": "python",
      "args": ["-m", "elevenlabs_mcp"],
      "env": {
        "ELEVENLABS_API_KEY": "<insert-your-api-key-here>"
      }
    }
  }
}

If you're using Windows, you will have to enable "Developer Mode" in Claude Desktop to use the MCP server. Click "Help" in the hamburger menu at the top left and select "Enable Developer Mode".

Other MCP clients

Using npm/npx:

For other clients like Cursor and Windsurf, you can run the server directly:

npx elevenlabs-mcp-enhanced --api-key YOUR_API_KEY

Using Python:

pip install elevenlabs-mcp
python -m elevenlabs_mcp --api-key={{PUT_YOUR_API_KEY_HERE}} --print to get the configuration. Paste it into appropriate configuration directory specified by your MCP client.

That's it. Your MCP client can now interact with ElevenLabs through these tools:

Example usage

⚠️ Warning: ElevenLabs credits are needed to use these tools.

Try asking Claude:

"Create an AI agent that speaks like a film noir detective and can answer questions about classic movies"
"Generate three voice variations for a wise, ancient dragon character, then I will choose my favorite voice to add to my voice library"
"Convert this recording of my voice to sound like a medieval knight"
"Create a soundscape of a thunderstorm in a dense jungle with animals reacting to the weather"
"Turn this speech into text, identify different speakers, then convert it back using unique voices for each person"

🆕 v3 Model - Quick Start Guide

🎯 DECISION TREE:

Single speaker? → Use text_to_speech with model="v3"
Multiple speakers? → Use text_to_dialogue (automatically v3)
Need tag examples? → Call fetch_v3_tags() first

📋 RECOMMENDED WORKFLOW FOR AI:

1. User: "Create an emotional story with sound effects"
2. AI: fetch_v3_tags() → Gets list of available tags
3. AI: search_voices("v3") → Gets v3-optimized voices
4. AI: text_to_dialogue(...) → Creates the story

Single Speaker Examples (text_to_speech):

"Generate: '[thoughtful] The universe is vast... [piano] ...and full of mysteries.'"
"Create narration with: '[whispering] Secret message [footsteps] [door creaking]'"

Multi-Speaker Examples (text_to_dialogue - ALWAYS v3):

# Simple conversation
inputs = [
    {"text": "How are you?", "voice_name": "James"},
    {"text": "I'm great!", "voice_name": "Jane"}
]

# With emotion tags
inputs = [
    {"text": "[excited] I found treasure!", "voice_name": "James"},
    {"text": "[skeptical] Really? [pause] Where?", "voice_name": "Jane"}
]

⚠️ v3 Requirements:

Stability: MUST be 0.0, 0.5, or 1.0 (no other values!)
Best voices: James, Jane, Sarah, Mark, etc. (search "v3" to find them)
Always check fetch_v3_tags() for available audio tags

🆕 New Conversation Features

With the enhanced conversation tools, you can now:

"Get the conversation transcript from conversation ID abc123" (automatically waits for completion)
"List all conversations from my agent and show me the completed ones"
"Get conversation xyz789 immediately without waiting" (set wait_for_completion=false)
"Show me all conversations in JSON format with timestamps"
"Get the conversation history including analysis data"

Note: The get_conversation tool now waits for conversations to complete by default (up to 5 minutes), ensuring you always get the full transcript.

Optional features

You can add the ELEVENLABS_MCP_BASE_PATH environment variable to the claude_desktop_config.json to specify the base path MCP server should look for and output files specified with relative paths.

✅ v3 Model - Now Officially Available!

The v3 model is now officially available through the ElevenLabs API! No proxy or special access needed - just use your regular API key.

What's New:

Official eleven_v3 model ID
Text-to-dialogue endpoint at /v1/text-to-dialogue
70+ language support
3,000 character limit per request
Enhanced emotional expressiveness

Usage: Simply set model="v3" in text_to_speech() or use text_to_dialogue() for multi-speaker content. The server now uses the official API endpoints.

Contributing

If you want to contribute or run from source:

Clone the repository:

git clone https://github.com/elevenlabs/elevenlabs-mcp
cd elevenlabs-mcp

Create a virtual environment and install dependencies using uv:

uv venv
source .venv/bin/activate
uv pip install -e ".[dev]"

Copy .env.example to .env and add your ElevenLabs API key:

cp .env.example .env
# Edit .env and add your API key

Run the tests to make sure everything is working:

./scripts/test.sh
# Or with options
./scripts/test.sh --verbose --fail-fast

Install the server in Claude Desktop: mcp install elevenlabs_mcp/server.py
Debug and test locally with MCP Inspector: mcp dev elevenlabs_mcp/server.py

Troubleshooting

Logs when running with Claude Desktop can be found at:

Windows: %APPDATA%\Claude\logs\mcp-server-elevenlabs.log
macOS: ~/Library/Logs/Claude/mcp-server-elevenlabs.log

Timeouts when using certain tools

Certain ElevenLabs API operations, like voice design and audio isolation, can take a long time to resolve. When using the MCP inspector in dev mode, you might get timeout errors despite the tool completing its intended task.

This shouldn't occur when using a client like Claude.

MCP ElevenLabs: spawn uvx ENOENT

If you encounter the error "MCP ElevenLabs: spawn uvx ENOENT", confirm its absolute path by running this command in your terminal:

which uvx

Once you obtain the absolute path (e.g., /usr/local/bin/uvx), update your configuration to use that path (e.g., "command": "/usr/local/bin/uvx"). This ensures that the correct executable is referenced.

Credits

Enhanced Fork

Boris Djordjevic - Lead Developer
199 Longevity Team - Development and Testing

Original ElevenLabs MCP Server

Jacek Duszenko - jacek@elevenlabs.io
Paul Asjes - paul.asjes@elevenlabs.io
Louis Jordan - louis@elevenlabs.io
Luke Harries - luke@elevenlabs.io

This enhanced fork builds upon the excellent foundation created by the ElevenLabs team, adding critical conversational AI features for improved agent interaction and monitoring.

License

This project maintains the same MIT license as the original ElevenLabs MCP server. See LICENSE for details.

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

E2B

Using MCP to run code via e2b.

Official

Featured