ElevenLabs MCP Server

ElevenLabs MCP Server

Enables text-to-speech conversion using ElevenLabs API with voice management, streaming support, and multiple models.

Category
Visit Server

README

ElevenLabs MCP Server

A Model Context Protocol (MCP) server that provides text-to-speech functionality using the ElevenLabs API. This server allows you to convert text to natural-sounding speech with various voices and models.

Features

  • Text-to-Speech Conversion: Convert any text to high-quality speech
  • Voice Management: List and get detailed information about available voices
  • Multiple Models: Support for various ElevenLabs TTS models
  • Streaming Support: Real-time audio generation for longer texts
  • Flexible Output: Multiple audio formats and custom voice settings
  • Automatic File Management: Smart file naming and directory creation

Installation

  1. Clone this repository:
git clone <repository-url>
cd elevan-labs-mcp
  1. Install dependencies:
pip install -e .
  1. Set up your ElevenLabs API key:
export ELEVENLABS_API_KEY="your_api_key_here"

Or create a .env file in the project root:

cp .env.example .env
# Edit .env and add your API key

You can get your API key from ElevenLabs Dashboard.

Usage

Running the Server

python main.py

Or using the installed script:

elevan-labs-mcp

šŸ¤– LLM Integration

MCP Configuration

The mcp.json file provides configuration with clients:

{
    "servers": {
        "elevenlabs-mcp": {
            "type": "stdio",
            "command": "uvx",
            "args": [
                "--directory",
                "${workspaceFolder}",
                "elevan-labs-mcp"
                
            ],
            "envFile": "${workspaceFolder}/.env"
        }
    }
}

Available Tools

1. elevenlabs_text_to_speech

Convert text to speech with extensive customization options.

Parameters:

  • text (required): The text to convert to speech
  • voice_id (optional): Voice ID to use (defaults to George voice)
  • model_id (optional): Model to use (defaults to eleven_multilingual_v2)
  • output_format (optional): Audio format (defaults to mp3_44100_128)
  • output_file (optional): Custom output file path
  • voice_settings (optional): Custom voice settings (stability, similarity_boost, etc.)

Example:

{
  "text": "Hello, this is a test of the ElevenLabs text-to-speech API.",
  "voice_id": "JBFqnCBsd6RMkjVDRZzb",
  "output_format": "mp3_44100_128"
}

2. elevenlabs_list_voices

Get all available voices from your ElevenLabs account.

Parameters: None

Returns: List of voices with their IDs, names, categories, and settings.

3. elevenlabs_get_voice_info

Get detailed information about a specific voice.

Parameters:

  • voice_id (required): ID of the voice to query

Returns: Detailed voice information including samples and settings.

4. elevenlabs_stream_text_to_speech

Convert text to speech with streaming for real-time generation.

Parameters:

  • text (required): The text to convert
  • voice_id (optional): Voice ID to use
  • model_id (optional): Model to use
  • output_file (required): Output file path

5. elevenlabs_get_models

Get all available text-to-speech models.

Parameters: None

Returns: List of models with their capabilities and limitations.

Supported Audio Formats

  • MP3: mp3_44100_128, mp3_44100_192, mp3_44100_64
  • PCM: pcm_16000, pcm_22050, pcm_24000, pcm_44100
  • WAV: wav_22050, wav_44100, wav_48000
  • μ-law: ulaw_8000

Voice Settings

You can customize voice generation with these settings:

  • stability (0-1): Controls consistency between generations
  • similarity_boost (0-1): Enhances similarity to the original voice
  • style (0-1): Amplifies the style of the original speaker
  • use_speaker_boost (boolean): Boost similarity to the original speaker

Error Handling

The server includes comprehensive error handling for:

  • Missing API keys
  • Invalid voice IDs
  • API rate limits
  • Network issues
  • File system errors

Configuration

Environment Variables

  • ELEVENLABS_API_KEY: Your ElevenLabs API key (required)

Default Settings

  • Default Voice: George (JBFqnCBsd6RMkjVDRZzb)
  • Default Model: eleven_multilingual_v2
  • Default Format: mp3_44100_128
  • Default Output Directory: ./audio_output

Examples

Basic Text-to-Speech

{
  "tool": "elevenlabs_text_to_speech",
  "arguments": {
    "text": "Welcome to the ElevenLabs MCP server!"
  }
}

Custom Voice and Settings

{
  "tool": "elevenlabs_text_to_speech",
  "arguments": {
    "text": "This is a custom voice example.",
    "voice_id": "your_voice_id_here",
    "voice_settings": {
      "stability": 0.75,
      "similarity_boost": 0.8,
      "style": 0.6
    }
  }
}

List Available Voices

{
  "tool": "elevenlabs_list_voices",
  "arguments": {}
}

Development

Project Structure

elevan-labs-mcp/
ā”œā”€ā”€ elevan_labs_mcp/
│   ā”œā”€ā”€ __init__.py
│   └── server.py          # Main MCP server implementation
ā”œā”€ā”€ main.py                # Entry point
ā”œā”€ā”€ pyproject.toml         # Project configuration
ā”œā”€ā”€ README.md              # This file
└── .gitignore            # Git ignore rules

Dependencies

  • mcp>=1.0.0: Model Context Protocol SDK
  • elevenlabs>=1.0.0: ElevenLabs Python SDK
  • httpx>=0.25.0: HTTP client
  • pydantic>=2.0.0: Data validation

Troubleshooting

Common Issues

  1. Missing API Key: Make sure ELEVENLABS_API_KEY is set in your environment
  2. Voice Not Found: Use elevenlabs_list_voices to see available voices
  3. Rate Limits: ElevenLabs has rate limits based on your subscription tier
  4. File Permissions: Ensure the output directory is writable

Getting Help

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured