ElevenLabs Image Generation MCP Server
Generates images from text prompts by driving ElevenLabs' Image & Video web interface through browser automation, with support for multiple models and persistent authentication.
An MCP (Model Context Protocol) server that generates images using ElevenLabs' Image & Video feature via Playwright browser automation.
Features
- generate_image - Generate images from text prompts using ElevenLabs
- list_models - List available image generation models
- get_session_status - Check authentication status
Available Models
| Model ID | Name | Description |
|---|---|---|
| gpt-image-1.5 | GPT Image 1.5 (default) | OpenAI - precise, high-quality generation |
| gpt-image-1 | GPT Image 1 | OpenAI - text-based creation and editing |
| flux-kontext-pro | Flux 1 Kontext Pro | Professional style control via reference images |
| seedream-4 | Seedream 4 | Multi-shot sequences with stable physics |
| nano-banana | Nano Banana (Google) | High-speed iterations |
| wan-2.5 | Wan 2.5 | Strong prompt fidelity |
Installation
# Clone the repository
git clone git@github.com:ddunford/elevenlabMCP.git
cd elevenlabMCP
# Install dependencies
npm install
# Build
npm run build
# Install Playwright browser
npx playwright install chromium
Quick Setup (One-Liner)
Add to Claude Code globally:
claude mcp add elevenlabs-image -s user -- node /path/to/elevenlabMCP/dist/index.js
Or manually add to ~/.claude.json:
{
  "mcpServers": {
    "elevenlabs-image": {
      "type": "stdio",
      "command": "node",
      "args": ["/path/to/elevenlabMCP/dist/index.js"],
      "env": {}
    }
  }
}
Usage
In Claude Code
After adding the MCP server, restart Claude Code. The tools will be available as:
- mcp__elevenlabs-image__generate_image
- mcp__elevenlabs-image__list_models
- mcp__elevenlabs-image__get_session_status
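The tool names above follow the pattern mcp__<server>__<tool>, where the server part is the name given to `claude mcp add`. As a minimal sketch, the names can be derived like this; `qualifiedToolName` is a hypothetical helper for illustration, not part of this project:

```typescript
// Server name as registered with `claude mcp add`, plus the three tools from Features.
const SERVER_NAME = "elevenlabs-image";
const TOOL_NAMES = ["generate_image", "list_models", "get_session_status"];

// Hypothetical helper: builds a name in the mcp__<server>__<tool> form shown above.
function qualifiedToolName(serverName: string, toolName: string): string {
  return `mcp__${serverName}__${toolName}`;
}

// One qualified name per tool, in the same order as TOOL_NAMES.
const qualifiedNames = TOOL_NAMES.map((tool) => qualifiedToolName(SERVER_NAME, tool));
```

If the server is registered under a different name, the middle segment changes accordingly.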
generate_image
Generate an image from a text prompt.
Parameters:
| Parameter | Required | Description |
|---|---|---|
| prompt | Yes | Text description of the image to generate |
| model | No | Model ID (default: gpt-image-1.5) |
| savePath | No | Directory to save image (default: assets/) |
| aspectRatio | No | e.g., "1:1", "16:9", "9:16" |
| negativePrompt | No | What to avoid in the image |
| email | No | ElevenLabs account email (for first-time auth) |
| password | No | ElevenLabs account password (for first-time auth) |
Example:
Generate an image of a cyberpunk city at night with neon lights
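To make the parameter table concrete, here is a rough sketch of how the arguments and their defaults might be modelled. The interface and function names are hypothetical, and the "W:H" aspect-ratio check is an assumption based on the listed examples; the server's own validation may differ.

```typescript
// Hypothetical argument shape, mirroring the parameter table above.
interface GenerateImageArgs {
  prompt: string;
  model?: string;
  savePath?: string;
  aspectRatio?: string;
  negativePrompt?: string;
  email?: string;
  password?: string;
}

// Same shape, but with the documented defaults filled in.
interface NormalizedImageArgs extends GenerateImageArgs {
  model: string;
  savePath: string;
}

// Defaults taken from the parameter table.
const DEFAULT_MODEL = "gpt-image-1.5";
const DEFAULT_SAVE_PATH = "assets/";

// Assumption: ratios are written as "W:H" with positive integers, e.g. "16:9".
const ASPECT_RATIO_PATTERN = /^[1-9]\d*:[1-9]\d*$/;

function normalizeGenerateImageArgs(args: GenerateImageArgs): NormalizedImageArgs {
  const prompt = args.prompt.trim();
  if (prompt.length === 0) {
    throw new Error("prompt is required");
  }
  if (args.aspectRatio !== undefined && !ASPECT_RATIO_PATTERN.test(args.aspectRatio)) {
    throw new Error(`invalid aspectRatio: ${args.aspectRatio}`);
  }
  // Optional fields that were not supplied fall back to the documented defaults.
  return {
    ...args,
    prompt,
    model: args.model ?? DEFAULT_MODEL,
    savePath: args.savePath ?? DEFAULT_SAVE_PATH,
  };
}
```

Calling `normalizeGenerateImageArgs({ prompt: "a cyberpunk city at night with neon lights" })` would leave the prompt unchanged and fill in the default model and save directory.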
list_models
Returns all available image generation models with their capabilities.
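As an illustration only, the data behind list_models could be held in a small registry like the one below. The entries are copied from the Available Models table; the `ImageModel` type and the `listModels` and `findModel` helpers are hypothetical names, and the real response format may differ.

```typescript
// Hypothetical record type for one row of the Available Models table.
interface ImageModel {
  id: string;
  name: string;
  description: string;
}

// Entries copied from the Available Models table.
const IMAGE_MODELS: ImageModel[] = [
  { id: "gpt-image-1.5", name: "GPT Image 1.5", description: "OpenAI - precise, high-quality generation" },
  { id: "gpt-image-1", name: "GPT Image 1", description: "OpenAI - text-based creation and editing" },
  { id: "flux-kontext-pro", name: "Flux 1 Kontext Pro", description: "Professional style control via reference images" },
  { id: "seedream-4", name: "Seedream 4", description: "Multi-shot sequences with stable physics" },
  { id: "nano-banana", name: "Nano Banana (Google)", description: "High-speed iterations" },
  { id: "wan-2.5", name: "Wan 2.5", description: "Strong prompt fidelity" },
];

const DEFAULT_MODEL_ID = "gpt-image-1.5";

// Returns every model, flagging the one used when no model is specified.
function listModels(): Array<ImageModel & { isDefault: boolean }> {
  return IMAGE_MODELS.map((model) => ({ ...model, isDefault: model.id === DEFAULT_MODEL_ID }));
}

// Looks up a model by ID; returns undefined for unknown IDs.
function findModel(id: string): ImageModel | undefined {
  return IMAGE_MODELS.find((model) => model.id === id);
}
```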
get_session_status
Check if currently logged in to ElevenLabs.
Authentication
On first use, provide your ElevenLabs credentials via the email and password parameters. The session is persisted in .auth/ so subsequent calls don't require credentials.
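The sketch below shows one plausible way credentials could be resolved for a first-time login: tool parameters first, then the ELEVENLABS_EMAIL and ELEVENLABS_PASSWORD variables described under Environment Variables. The precedence order and the `resolveCredentials` helper are assumptions for illustration, not confirmed behaviour of this server.

```typescript
// Hypothetical types for illustration.
interface LoginCredentials {
  email: string;
  password: string;
}
type EnvMap = Record<string, string | undefined>;

// Assumed precedence: explicit tool parameters win over environment variables.
// Returns undefined when either value is missing or empty, in which case the
// caller would have to rely on a session already persisted in .auth/.
function resolveCredentials(
  params: { email?: string; password?: string },
  env: EnvMap,
): LoginCredentials | undefined {
  const email = params.email ?? env["ELEVENLABS_EMAIL"];
  const password = params.password ?? env["ELEVENLABS_PASSWORD"];
  if (!email || !password) {
    return undefined;
  }
  return { email, password };
}
```

In a Node.js process, `process.env` could be passed as the second argument.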
How It Works
- Uses Playwright to automate the ElevenLabs web interface (no API available for image generation)
- Maintains a persistent browser session for authentication
- Navigates to the Image & Video page
- Enters prompts and generates images
- Downloads generated images from the History page
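The steps above can be sketched as a small orchestration function. Everything here is hypothetical: `ImagePageDriver` stands in for whatever Playwright-backed code the project uses, and its method names are invented for illustration. The sketch only shows the order of operations, not the real selectors or page interactions.

```typescript
// Hypothetical abstraction over the browser automation layer.
interface ImagePageDriver {
  isLoggedIn(): Promise<boolean>;
  login(): Promise<void>;
  openImagePage(): Promise<void>;
  submitPrompt(prompt: string): Promise<void>;
  downloadLatestFromHistory(saveDir: string): Promise<string>;
}

// Runs the documented flow in order and resolves to the saved file path.
async function runImageGeneration(
  driver: ImagePageDriver,
  prompt: string,
  saveDir: string,
): Promise<string> {
  // Reuse the persisted session when possible; log in only if it is missing.
  if (!(await driver.isLoggedIn())) {
    await driver.login();
  }
  // Navigate to the Image & Video page, then enter the prompt and generate.
  await driver.openImagePage();
  await driver.submitPrompt(prompt);
  // Fetch the finished image from the History page.
  return driver.downloadLatestFromHistory(saveDir);
}
```

Keeping the browser calls behind an interface like this would also allow the flow to be exercised with a fake driver, without launching Chromium.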
Project Structure
elevenlabMCP/
├── src/
│ ├── index.ts # Entry point
│ ├── server.ts # MCP server setup
│ ├── tools/ # MCP tool implementations
│ ├── browser/ # Playwright automation
│ ├── auth/ # Authentication handling
│ └── config/ # Configuration
├── dist/ # Compiled JavaScript
├── assets/ # Generated images output
└── .auth/ # Session storage (gitignored)
Environment Variables (Optional)
Create a .env file:
ELEVENLABS_EMAIL=your@email.com
ELEVENLABS_PASSWORD=yourpassword
HEADLESS=true # Set to false for debugging
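As a rough sketch of how the HEADLESS flag might be interpreted, the helper below treats an unset value as headless and only the string "false" as a request for a visible browser. Both the `parseHeadless` name and this exact parsing rule are assumptions; the project may accept other values.

```typescript
// Hypothetical helper: maps the HEADLESS variable to a boolean.
// Assumption: unset means headless (the documented default), and only the
// case-insensitive string "false" switches to a visible browser.
function parseHeadless(value: string | undefined): boolean {
  if (value === undefined) {
    return true;
  }
  return value.trim().toLowerCase() !== "false";
}
```

In a Node.js process this could be called as `parseHeadless(process.env.HEADLESS)`.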
Notes
- Image generation typically takes 30-60 seconds
- Generated images are saved as WebP files
- The browser runs headless by default; set HEADLESS=false to see the browser
License
MIT