MMAudio MCP
Enables AI-powered video-to-audio and text-to-audio generation using MMAudio's API. Create synchronized audio from video content or generate audio from text descriptions with configurable parameters.
README
MMAudio MCP
AI-powered video-to-audio and text-to-audio generation using MMAudio's advanced AI technology. This Desktop Extension implements the Model Context Protocol (MCP) to provide seamless integration with Cursor, Claude Desktop, and other MCP-compatible clients.
š Features
- Video-to-Audio Generation: Transform video content into synchronized audio with AI-powered analysis
- Text-to-Audio Generation: Create professional audio content from text descriptions
- API Key Validation: Verify MMAudio API credentials and check account status
- MCP Protocol: Full Model Context Protocol implementation for universal compatibility
- Error Handling: Comprehensive error handling with detailed feedback
- Type Safety: Full TypeScript support with Zod schema validation
- Configurable: Flexible configuration options for different use cases
š Quick Start
Prerequisites
- Node.js >= 18.0.0
- MMAudio API key (Get one here)
- MCP-compatible client (Cursor, Claude Desktop, etc.)
Installation
-
Clone or download this extension:
cd mcp npm install -
Configure your API key:
# Set environment variable export MMAUDIO_API_KEY="sk-your-api-key-here" # Or create a config.js file (copy from config.example.js) cp config.example.js config.js # Edit config.js with your API key -
Test the installation:
npm start -
Link the package:
npm link
Usage with Cursor
-
Add to your Cursor MCP configuration:
{ "mcpServers": { "mmaudio": { "command": "npx", "args": ["-y", "mmaudio-mcp@latest"], "env": { "MMAUDIO_API_KEY": "sk-your-api-key-here" } } } } -
Restart Cursor and the MMAudio tools will be available in your AI assistant.
Usage with Claude Desktop
-
Add to your Claude Desktop configuration (
claude_desktop_config.json):{ "mcpServers": { "mmaudio": { "command": "npx", "args": ["-y", "mmaudio-mcp@latest"], "env": { "MMAUDIO_API_KEY": "sk-your-api-key-here" } } } } -
Restart Claude Desktop to load the extension.
š ļø Available Tools
1. Video-to-Audio Generation
Generate AI-powered audio from video content.
Tool Name: video_to_audio
Parameters:
video_url(required): URL of the video fileprompt(required): Description of the audio you want to generatenegative_prompt(optional): What to avoid in the generated audioduration(optional): Audio duration in seconds (1-30, default: 8)num_steps(optional): Number of inference steps (1-50, default: 25)cfg_strength(optional): Guidance strength (1-10, default: 4.5)seed(optional): Random seed for reproducible results
Example:
Generate audio for this video: https://example.com/video.mp4 with the prompt "peaceful forest sounds with birds chirping and gentle wind"
2. Text-to-Audio Generation
Create audio content from text descriptions.
Tool Name: text_to_audio
Parameters:
prompt(required): Description of the audio you want to generateduration(optional): Audio duration in seconds (1-30, default: 8)num_steps(optional): Number of inference steps (1-50, default: 25)cfg_strength(optional): Guidance strength (1-10, default: 4.5)negative_prompt(optional): What to avoid in the generated audioseed(optional): Random seed for reproducible results
Example:
Create audio with the description "coffee shop ambiance with gentle chatter and espresso machine sounds"
3. API Key Validation
Validate your MMAudio API key and check account status.
Tool Name: validate_api_key
Parameters:
api_key(optional): API key to validate (uses configured key if not provided)
Example:
Validate my MMAudio API key
āļø Configuration
Environment Variables
| Variable | Description | Required | Default |
|---|---|---|---|
MMAUDIO_API_KEY |
Your MMAudio API key | Yes | - |
MMAUDIO_BASE_URL |
Base URL for MMAudio API | No | https://mmaudio.net |
MMAUDIO_TIMEOUT |
Request timeout in milliseconds | No | 60000 |
LOG_LEVEL |
Log level (error, warn, info, debug) | No | error |
DEBUG |
Enable debug mode | No | false |
Configuration File
You can also use a config.js file for configuration:
export const config = {
apiKey: "sk-your-api-key-here",
baseUrl: "https://mmaudio.net",
timeout: 60000,
// ... other options
};
š Example Usage
Video-to-Audio Example
// In your MCP client (Cursor, Claude Desktop, etc.)
"Please generate audio for this video URL: https://example.com/nature_video.mp4";
"I want forest sounds with birds chirping and a gentle breeze";
"Duration should be 10 seconds";
Text-to-Audio Example
// Create ambient audio
"Generate 15 seconds of coffee shop ambiance with gentle background chatter";
// Create sound effects
"Create the sound of rain falling on a wooden roof for 8 seconds";
// Create atmospheric audio
"Generate futuristic sci-fi ambient sounds for a space station";
š§ Development
Project Structure
mcp/
āāā server/index.js # Main MCP server implementation
āāā package.json # Node.js dependencies and scripts
āāā manifest.json # DXT extension manifest
āāā config.example.js # Configuration example
āāā README.md # This file
Testing
-
Test the server directly:
npm start -
Test with a simple MCP client:
echo '{"jsonrpc": "2.0", "method": "tools/list", "id": 1}' | npm start
Development Mode
Run the server in development mode with debugging:
npm run dev
This enables additional logging and the Node.js inspector for debugging.
šØ Troubleshooting
Common Issues
1. "API key is required" Error
Problem: The extension can't find your API key.
Solution:
- Ensure you've set the
MMAUDIO_API_KEYenvironment variable - Or create a
config.jsfile with your API key - Verify the API key is correct and active
2. "Connection refused" or Network Errors
Problem: Can't connect to MMAudio API.
Solution:
- Check your internet connection
- Verify the base URL is correct
- Check if there are any firewall restrictions
3. "Insufficient credits" Error
Problem: Your MMAudio account doesn't have enough credits.
Solution:
- Check your account balance at mmaudio.net/dashboard
- Purchase additional credits if needed
- Use the
validate_api_keytool to check your account status
4. MCP Client Not Detecting Extension
Problem: Your MCP client (Cursor, Claude Desktop) doesn't show MMAudio tools.
Solution:
- Verify the configuration path in your MCP client settings
- Ensure Node.js is in your PATH
- Check the server logs for errors
- Restart your MCP client
Debug Mode
Enable debug mode for detailed logging:
DEBUG=true MMAUDIO_API_KEY=your-key node server/index.js
Logs
The server logs to stderr. You can redirect logs to a file:
node server/index.js 2> mmaudio.log
š API Reference
Response Format
All tools return responses in this format:
{
"content": [
{
"type": "text",
"text": "{\"success\": true, \"message\": \"...\", \"result\": {...}}"
}
]
}
Success Response Example
{
"success": true,
"message": "Audio generated successfully from text",
"result": {
"audio_url": "https://example.com/generated_audio.wav",
"content_type": "audio/wav",
"file_name": "generated_audio.wav",
"file_size": 1024000,
"duration": 8,
"prompt": "coffee shop ambiance"
}
}
Error Response Example
{
"success": false,
"error": "Invalid API key. Please check your MMAudio API key.",
"code": "INVALID_REQUEST"
}
š Security
- API keys are handled securely and not logged
- All requests use HTTPS
- The extension runs in a sandboxed environment
- No sensitive data is stored locally
š Related Documentation
- MMAudio API Documentation
- Model Context Protocol Specification
- Cursor MCP Integration Guide
- Claude Desktop MCP Setup
š¤ Contributing
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests if applicable
- Submit a pull request
š License
This project is licensed under the MIT License - see the LICENSE file for details.
š¬ Support
Made with ā¤ļø by the MMAudio Team
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
E2B
Using MCP to run code via e2b.