ElevenLabs MCP Server
Enables text-to-speech conversion using ElevenLabs API with voice management, streaming support, and multiple models.
README
ElevenLabs MCP Server
A Model Context Protocol (MCP) server that provides text-to-speech functionality using the ElevenLabs API. This server allows you to convert text to natural-sounding speech with various voices and models.
Features
- Text-to-Speech Conversion: Convert any text to high-quality speech
- Voice Management: List and get detailed information about available voices
- Multiple Models: Support for various ElevenLabs TTS models
- Streaming Support: Real-time audio generation for longer texts
- Flexible Output: Multiple audio formats and custom voice settings
- Automatic File Management: Smart file naming and directory creation
Installation
- Clone this repository:
git clone <repository-url>
cd elevan-labs-mcp
- Install dependencies:
pip install -e .
- Set up your ElevenLabs API key:
export ELEVENLABS_API_KEY="your_api_key_here"
Or create a .env file in the project root:
cp .env.example .env
# Edit .env and add your API key
You can get your API key from ElevenLabs Dashboard.
Usage
Running the Server
python main.py
Or using the installed script:
elevan-labs-mcp
š¤ LLM Integration
MCP Configuration
The mcp.json file provides configuration with clients:
{
"servers": {
"elevenlabs-mcp": {
"type": "stdio",
"command": "uvx",
"args": [
"--directory",
"${workspaceFolder}",
"elevan-labs-mcp"
],
"envFile": "${workspaceFolder}/.env"
}
}
}
Available Tools
1. elevenlabs_text_to_speech
Convert text to speech with extensive customization options.
Parameters:
text(required): The text to convert to speechvoice_id(optional): Voice ID to use (defaults to George voice)model_id(optional): Model to use (defaults to eleven_multilingual_v2)output_format(optional): Audio format (defaults to mp3_44100_128)output_file(optional): Custom output file pathvoice_settings(optional): Custom voice settings (stability, similarity_boost, etc.)
Example:
{
"text": "Hello, this is a test of the ElevenLabs text-to-speech API.",
"voice_id": "JBFqnCBsd6RMkjVDRZzb",
"output_format": "mp3_44100_128"
}
2. elevenlabs_list_voices
Get all available voices from your ElevenLabs account.
Parameters: None
Returns: List of voices with their IDs, names, categories, and settings.
3. elevenlabs_get_voice_info
Get detailed information about a specific voice.
Parameters:
voice_id(required): ID of the voice to query
Returns: Detailed voice information including samples and settings.
4. elevenlabs_stream_text_to_speech
Convert text to speech with streaming for real-time generation.
Parameters:
text(required): The text to convertvoice_id(optional): Voice ID to usemodel_id(optional): Model to useoutput_file(required): Output file path
5. elevenlabs_get_models
Get all available text-to-speech models.
Parameters: None
Returns: List of models with their capabilities and limitations.
Supported Audio Formats
- MP3:
mp3_44100_128,mp3_44100_192,mp3_44100_64 - PCM:
pcm_16000,pcm_22050,pcm_24000,pcm_44100 - WAV:
wav_22050,wav_44100,wav_48000 - μ-law:
ulaw_8000
Voice Settings
You can customize voice generation with these settings:
stability(0-1): Controls consistency between generationssimilarity_boost(0-1): Enhances similarity to the original voicestyle(0-1): Amplifies the style of the original speakeruse_speaker_boost(boolean): Boost similarity to the original speaker
Error Handling
The server includes comprehensive error handling for:
- Missing API keys
- Invalid voice IDs
- API rate limits
- Network issues
- File system errors
Configuration
Environment Variables
ELEVENLABS_API_KEY: Your ElevenLabs API key (required)
Default Settings
- Default Voice: George (
JBFqnCBsd6RMkjVDRZzb) - Default Model:
eleven_multilingual_v2 - Default Format:
mp3_44100_128 - Default Output Directory:
./audio_output
Examples
Basic Text-to-Speech
{
"tool": "elevenlabs_text_to_speech",
"arguments": {
"text": "Welcome to the ElevenLabs MCP server!"
}
}
Custom Voice and Settings
{
"tool": "elevenlabs_text_to_speech",
"arguments": {
"text": "This is a custom voice example.",
"voice_id": "your_voice_id_here",
"voice_settings": {
"stability": 0.75,
"similarity_boost": 0.8,
"style": 0.6
}
}
}
List Available Voices
{
"tool": "elevenlabs_list_voices",
"arguments": {}
}
Development
Project Structure
elevan-labs-mcp/
āāā elevan_labs_mcp/
ā āāā __init__.py
ā āāā server.py # Main MCP server implementation
āāā main.py # Entry point
āāā pyproject.toml # Project configuration
āāā README.md # This file
āāā .gitignore # Git ignore rules
Dependencies
mcp>=1.0.0: Model Context Protocol SDKelevenlabs>=1.0.0: ElevenLabs Python SDKhttpx>=0.25.0: HTTP clientpydantic>=2.0.0: Data validation
Troubleshooting
Common Issues
- Missing API Key: Make sure
ELEVENLABS_API_KEYis set in your environment - Voice Not Found: Use
elevenlabs_list_voicesto see available voices - Rate Limits: ElevenLabs has rate limits based on your subscription tier
- File Permissions: Ensure the output directory is writable
Getting Help
- Check the ElevenLabs API Documentation
- Review the MCP Specification
License
This project is licensed under the MIT License - see the LICENSE file for details.
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.