mcp-video
A cinema-grade video production MCP server that enables automated website recording, editing, and AI-powered narration using ffmpeg and Playwright. It provides tools for color grading, captioning, and converting videos into social media formats through natural language commands.
README
mcp-video
Cinema-grade video production MCP server. 8 tools for recording, editing, effects, captions, TTS, and smart screenshots.
Built on ffmpeg and Playwright. Works with any MCP client (Claude Code, Claude Desktop, custom agents).
Features
| Tool | Operations | Description |
|---|---|---|
video_record |
cinema, scroll, multi-device | Record websites at 60fps with frame-by-frame capture |
video_edit |
speed, crop, reverse, keyframe, pip | Edit clips with zoom/pan, PiP, slow-mo |
video_color |
grade, effect, lut, chroma | Color grading, 22 LUT presets, green screen |
video_audio |
extract, music, ducking, mix, voice | Audio extraction, mixing, 9 voice effects |
video_text |
subtitles, caption, overlay, animate | Burn SRT, Whisper auto-caption, 15 text animations |
video_compose |
concat, intro, social, beat-sync, templates | Join clips, social format conversion, beat sync |
video_speech |
generate, voices, narrated | ElevenLabs/OpenAI TTS, full narrated videos |
video_screenshot |
capture, detect | Element-aware screenshots, page feature detection |
Highlights
- 60fps frame-by-frame capture — Playwright screenshots every frame, ffmpeg encodes. Zero frame drops.
- Cinema easing curves — 16 easing options including
cinematicandshowcasefor buttery smooth scrolling. - Smart screenshots — Auto-detects 15+ UI elements (chat widgets, pricing sections, booking forms, etc.).
- Narrated videos — Provide a URL + script, get a professional video with synchronized AI voiceover.
- 22 LUT presets — Film-grade color grading (teal-orange, noir, vintage, cyberpunk, etc.).
- Social format export — One-click conversion to Instagram Reel, TikTok, YouTube Short, LinkedIn.
- Dual transport — Stdio (default) or HTTP mode for persistent microservice deployment.
Prerequisites
- Node.js >= 18
- ffmpeg and ffprobe (validated on startup)
- Playwright browsers (
npx playwright install chromium) - Optional:
ELEVENLABS_API_KEYfor ElevenLabs TTS - Optional:
OPENAI_API_KEYfor Whisper captions and OpenAI TTS
Quick Start
With Claude Code (stdio)
{
"mcpServers": {
"video": {
"command": "npx",
"args": ["-y", "mcp-video"]
}
}
}
With npx
npx mcp-video
From source
git clone https://github.com/studiomeyer-io/mcp-video.git
cd mcp-video
npm install
npx playwright install chromium
npm run build
npm start
HTTP mode
# Start as HTTP microservice
npx mcp-video --http --port=9847
# Or via environment variables
MCP_HTTP=1 MCP_PORT=9847 npx mcp-video
Configuration
| Environment Variable | Default | Description |
|---|---|---|
VIDEO_OUTPUT_DIR |
./output |
Directory for generated files |
ELEVENLABS_API_KEY |
— | ElevenLabs TTS API key |
OPENAI_API_KEY |
— | OpenAI API key (Whisper + TTS) |
MCP_HTTP |
false |
Enable HTTP transport |
MCP_PORT |
9847 |
HTTP port |
MCP_HOST |
127.0.0.1 |
HTTP bind address |
MCP_VIDEO_DEBUG |
false |
Enable debug logging |
Usage Examples
Record a website
Use video_record with type "cinema" to record https://example.com
with a smooth scroll and hover over the navbar.
Create a narrated explainer video
Use video_speech with type "narrated" to create a narrated video of
https://example.com with these segments:
1. "Welcome to our homepage" — pause on hero section
2. "Check out our features" — scroll to features
3. "Get started today" — hover over CTA button
Auto-caption a video
Use video_text with type "caption" to add auto-generated captions
to /path/to/video.mp4
Export for social media
Use video_compose with type "social-all" to convert
/path/to/video.mp4 to all social media formats.
Smart screenshot
Use video_screenshot with type "capture" to screenshot the chat widget
and pricing section on https://example.com
Architecture
src/
server.ts Entry point, 8 consolidated MCP tools
lib/ Logger, types, dual transport
handlers/ Tool handlers (video, editing, post-production, tts, screenshots)
schemas/ JSON Schema definitions for legacy tool format
tools/
engine/ Core engines
capture.ts Frame-by-frame recording (Playwright → PNG → ffmpeg)
encoder.ts ffmpeg encoding pipeline
scenes.ts Scene execution (scroll, hover, click, type, wait)
cursor.ts Visible cursor simulation
smart-screenshot.ts Element-aware screenshot engine
tts.ts ElevenLabs + OpenAI TTS with fallback
narrated-video.ts Full narration pipeline
social-format.ts Social media format conversion
concat.ts Video concatenation with transitions
lut-presets.ts 22 cinema LUT presets
...and more
Development
npm run dev # Start with tsx (hot reload)
npm run typecheck # Type check
npm test # Run tests
npm run check # Verify ffmpeg/ffprobe installed
License
MIT
Credits
Built by StudioMeyer. Part of our open-source toolkit for AI-powered content creation.
- ai-shield — LLM security middleware for TypeScript
- agent-fleet — Multi-agent orchestration for Claude Code
- darwin-agents — Self-evolving agent framework
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.