MCP Servers

ElevenLabs MCP Server

Enables text-to-speech conversion using ElevenLabs API with voice management, streaming support, and multiple models.

README

ElevenLabs MCP Server

A Model Context Protocol (MCP) server that provides text-to-speech functionality using the ElevenLabs API. This server allows you to convert text to natural-sounding speech with various voices and models.

Features

Text-to-Speech Conversion: Convert any text to high-quality speech
Voice Management: List and get detailed information about available voices
Multiple Models: Support for various ElevenLabs TTS models
Streaming Support: Real-time audio generation for longer texts
Flexible Output: Multiple audio formats and custom voice settings
Automatic File Management: Smart file naming and directory creation

Installation

Clone this repository:

git clone <repository-url>
cd elevan-labs-mcp

Install dependencies:

pip install -e .

Set up your ElevenLabs API key:

export ELEVENLABS_API_KEY="your_api_key_here"

Or create a .env file in the project root:

cp .env.example .env
# Edit .env and add your API key

You can get your API key from ElevenLabs Dashboard.

Usage

Running the Server

python main.py

Or using the installed script:

elevan-labs-mcp

🤖 LLM Integration

MCP Configuration

The mcp.json file provides configuration with clients:

{
    "servers": {
        "elevenlabs-mcp": {
            "type": "stdio",
            "command": "uvx",
            "args": [
                "--directory",
                "${workspaceFolder}",
                "elevan-labs-mcp"
                
            ],
            "envFile": "${workspaceFolder}/.env"
        }
    }
}

Available Tools

1. `elevenlabs_text_to_speech`

Convert text to speech with extensive customization options.

Parameters:

text (required): The text to convert to speech
voice_id (optional): Voice ID to use (defaults to George voice)
model_id (optional): Model to use (defaults to eleven_multilingual_v2)
output_format (optional): Audio format (defaults to mp3_44100_128)
output_file (optional): Custom output file path
voice_settings (optional): Custom voice settings (stability, similarity_boost, etc.)

Example:

{
  "text": "Hello, this is a test of the ElevenLabs text-to-speech API.",
  "voice_id": "JBFqnCBsd6RMkjVDRZzb",
  "output_format": "mp3_44100_128"
}

2. `elevenlabs_list_voices`

Get all available voices from your ElevenLabs account.

Parameters: None

Returns: List of voices with their IDs, names, categories, and settings.

3. `elevenlabs_get_voice_info`

Get detailed information about a specific voice.

Parameters:

voice_id (required): ID of the voice to query

Returns: Detailed voice information including samples and settings.

4. `elevenlabs_stream_text_to_speech`

Convert text to speech with streaming for real-time generation.

Parameters:

text (required): The text to convert
voice_id (optional): Voice ID to use
model_id (optional): Model to use
output_file (required): Output file path

5. `elevenlabs_get_models`

Get all available text-to-speech models.

Parameters: None

Returns: List of models with their capabilities and limitations.

Supported Audio Formats

MP3: mp3_44100_128, mp3_44100_192, mp3_44100_64
PCM: pcm_16000, pcm_22050, pcm_24000, pcm_44100
WAV: wav_22050, wav_44100, wav_48000
μ-law: ulaw_8000

Voice Settings

You can customize voice generation with these settings:

stability (0-1): Controls consistency between generations
similarity_boost (0-1): Enhances similarity to the original voice
style (0-1): Amplifies the style of the original speaker
use_speaker_boost (boolean): Boost similarity to the original speaker

Error Handling

The server includes comprehensive error handling for:

Missing API keys
Invalid voice IDs
API rate limits
Network issues
File system errors

Configuration

Environment Variables

ELEVENLABS_API_KEY: Your ElevenLabs API key (required)

Default Settings

Default Voice: George (JBFqnCBsd6RMkjVDRZzb)
Default Model: eleven_multilingual_v2
Default Format: mp3_44100_128
Default Output Directory: ./audio_output

Examples

Basic Text-to-Speech

{
  "tool": "elevenlabs_text_to_speech",
  "arguments": {
    "text": "Welcome to the ElevenLabs MCP server!"
  }
}

Custom Voice and Settings

{
  "tool": "elevenlabs_text_to_speech",
  "arguments": {
    "text": "This is a custom voice example.",
    "voice_id": "your_voice_id_here",
    "voice_settings": {
      "stability": 0.75,
      "similarity_boost": 0.8,
      "style": 0.6
    }
  }
}

List Available Voices

{
  "tool": "elevenlabs_list_voices",
  "arguments": {}
}

Development

Project Structure

elevan-labs-mcp/
├── elevan_labs_mcp/
│   ├── __init__.py
│   └── server.py          # Main MCP server implementation
├── main.py                # Entry point
├── pyproject.toml         # Project configuration
├── README.md              # This file
└── .gitignore            # Git ignore rules

Dependencies

mcp>=1.0.0: Model Context Protocol SDK
elevenlabs>=1.0.0: ElevenLabs Python SDK
httpx>=0.25.0: HTTP client
pydantic>=2.0.0: Data validation

Troubleshooting

Common Issues

Missing API Key: Make sure ELEVENLABS_API_KEY is set in your environment
Voice Not Found: Use elevenlabs_list_voices to see available voices
Rate Limits: ElevenLabs has rate limits based on your subscription tier
File Permissions: Ensure the output directory is writable

Getting Help

Check the ElevenLabs API Documentation
Review the MCP Specification

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured

ElevenLabs MCP Server

README

ElevenLabs MCP Server

Features

Installation

Usage

Running the Server

🤖 LLM Integration

MCP Configuration

Available Tools

1. elevenlabs_text_to_speech

2. elevenlabs_list_voices

3. elevenlabs_get_voice_info

4. elevenlabs_stream_text_to_speech

5. elevenlabs_get_models

Supported Audio Formats

Voice Settings

Error Handling

Configuration

Environment Variables

Default Settings

Examples

Basic Text-to-Speech

Custom Voice and Settings

List Available Voices

Development

Project Structure

Dependencies

Troubleshooting

Common Issues

Getting Help

License

Contributing

Recommended Servers

1. `elevenlabs_text_to_speech`

2. `elevenlabs_list_voices`

3. `elevenlabs_get_voice_info`

4. `elevenlabs_stream_text_to_speech`

5. `elevenlabs_get_models`