MCP Servers

Doubao Image/Video Generation MCP Server

Enables AI image generation using Doubao Seedream models and video generation using Doubao Seedance models through Volcano Engine's API, supporting text-to-image, image-to-image, text-to-video, and task status queries.

README

豆包图片/视频生成 MCP 服务器

一个支持豆包图片生成和视频生成的 MCP (Model Context Protocol) 服务器

功能

图片生成: 使用豆包 Seedream 模型生成高质量图片
视频生成: 使用豆包 Seedance 模型生成视频
任务查询: 查询视频生成任务状态

获取 API Key

访问火山引擎豆包大模型平台
注册/登录账号
在控制台获取 API Key

创建推理接入点

访问火山引擎控制台
进入「开通管理」页面，参与「协作奖励计划」享免费资源包
授权图片生成接入点（Doubao-Seedream-4.5 模型）
授权视频生成接入点（Doubao-Seedance-1.0-lite-t2v 模型）
进入「安全管理」页面，点击内容安全，复制接入点 ID 到配置文件

PS: 图像生成和视频生成配置不同的接入点

安装

本地安装

cd doubao-image-video-mcp
pnpm install
pnpm build

配置

Claude Code 中使用 (推荐)

在 Claude Code 配置文件中添加：

macOS/Linux: ~/.claude/config.json Windows: %APPDATA%\claude\config.json

{
  "mcpServers": {
    "doubao-giv": {
      "command": "npx",
      "args": ["-y", "doubao-image-video-mcp@latest"],
      "env": {
        "DOUBAO_API_KEY": "your_api_key_here",
        "DOUBAO_IMAGE_ENDPOINT_ID": "ep-20241227-xxxxxxxxxxxxx",
        "DOUBAO_VIDEO_ENDPOINT_ID": "ep-20241227-xxxxxxxxxxxxx"
      }
    }
  }
}

通用 MCP 客户端配置

{
  "mcpServers": {
    "doubao-giv": {
      "command": "npx",
      "args": ["-y", "doubao-image-video-mcp@latest"],
      "env": {
        "DOUBAO_API_KEY": "your_api_key_here",
        "DOUBAO_IMAGE_ENDPOINT_ID": "ep-20241227-xxxxxxxxxxxxx",
        "DOUBAO_VIDEO_ENDPOINT_ID": "ep-20241227-xxxxxxxxxxxxx"
      }
    }
  }
}

使用 npx -y 可以自动安装最新版本，无需手动管理本地文件。

本地开发配置

如果从源码运行：

{
  "mcpServers": {
    "doubao-giv": {
      "command": "node",
      "args": ["/absolute/path/to/doubao-image-video-mcp/dist/index.js"],
      "env": {
        "DOUBAO_API_KEY": "your_api_key_here",
        "DOUBAO_IMAGE_ENDPOINT_ID": "ep-20241227-xxxxxxxxxxxxx",
        "DOUBAO_VIDEO_ENDPOINT_ID": "ep-20241227-xxxxxxxxxxxxx"
      }
    }
  }
}

环境变量

环境变量	必需	默认值	说明
`DOUBAO_API_KEY`	✅	-	API 密钥
`DOUBAO_IMAGE_ENDPOINT_ID`	✅	-	图片生成推理接入点 ID
`DOUBAO_VIDEO_ENDPOINT_ID`	✅	-	视频生成推理接入点 ID
`DOUBAO_DEFAULT_IMAGE_MODEL`	❌	`doubao-seedream-4-5`	默认图片生成模型
`DOUBAO_DEFAULT_VIDEO_MODEL`	❌	`doubao-seedance-1.0-lite-t2v`	默认视频生成模型

重要：必须在火山引擎控制台创建推理接入点并配置接入点 ID，直接使用模型名称可能因权限问题导致调用失败。

支持的工具

1. generate_image

生成单张图片。

参数：

prompt (必需): 图片描述文本
model (可选): 模型选择，默认 doubao-seedream-4-5
size (可选): 图片尺寸，默认 1024x1024
image_url (可选): 参考图片 URL (图生图)
ref_image_urls (可选): 多张参考图片 URL 数组 (多图融合)
req_key (可选): 请求标识

2. generate_video

生成视频（异步任务）。

参数：

prompt (必需): 视频描述文本 (最大 500 字符)
model (可选): 模型选择，默认 doubao-seedance-1.0-lite-t2v
video_duration (可选): 视频时长 (秒)，默认 5
fps (可选): 帧率，默认 24
resolution (可选): 分辨率，默认 1080p
first_frame_image_url (可选): 首帧图片 URL (图生视频)
ref_image_urls (可选): 参考图片 URL 数组
req_key (可选): 请求标识

返回： 任务 ID，需要使用 query_video_task 查询结果

3. query_video_task

查询视频生成任务状态。

参数：

task_id (必需): 视频生成任务 ID

返回： 任务状态和结果

支持的模型

图片生成模型

模型	说明
`doubao-seedream-4-5`	最新 4.0 模型，支持 4K 分辨率
`doubao-seedream-3-0-t2i`	3.0 文生图模型

视频生成模型

模型	说明
`doubao-seedance-1.0-pro`	专业版，高质量视频生成
`doubao-seedance-1.0-pro-fast`	专业版快速生成
`doubao-seedance-1.0-lite-t2v`	轻量版，快速生成

API 文档

常见问题

Q: 视频生成需要多长时间？

A: 视频生成是异步任务，通常需要几秒到几分钟，具体取决于视频时长和模型选择。

Q: 如何获取推理接入点 Endpoint ID？

A: 在火山引擎控制台创建推理接入点后获取，使用 endpoint_id 参数比直接使用 model 名称更稳定。

Q: 支持哪些图片尺寸？

A: 图片生成支持多种尺寸，推荐使用 1920x2160、1920x2560、2160x3840 等高分辨率尺寸（豆包 API 要求图片至少 3686400 像素）。

参与贡献

欢迎提交 Issue 和 Pull Request！

Fork 本仓库
创建特性分支 (git checkout -b feature/AmazingFeature)
提交更改 (git commit -m 'Add some AmazingFeature')
推送到分支 (git push origin feature/AmazingFeature)
开启 Pull Request

贡献指南

遵循现有代码风格
添加必要的测试和文档
提交前运行 pnpm build 确保构建成功

发布流程

项目使用 GitHub Actions 自动发布到 npm：

更新 package.json 中的版本号
创建并推送 Git 标签：
```
git tag v1.0.0
git push origin v1.0.0
```
GitHub Actions 自动构建并发布到 npm

Star 历史

License

MIT

如果这个项目对你有帮助，请给一个 Star ⭐️

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured