Doubao Vision MCP Server

Enables image understanding using Doubao vision models via MCP, supporting local file paths and URLs with customizable prompts.

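An MCP server for the Doubao vision models on Volcengine Ark (火山方舟), compatible with both preset inference endpoints and model inference endpoints.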

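Features

  • 🖼️ Supports local image file paths and remote URLs
  • 💬 Custom prompts
  • 🎯 Dual-mode compatibility: model name (preset inference endpoint) or ep-ID (model inference endpoint)
  • 🔄 Switch models by changing a single environment variable, with no code changes
  • ⚡ One-line deployment with npx
  • 💰 Works with the free tokens granted by Volcengine Ark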

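Supported models

Doubao-Seed-2.0-Mini (recommended)

| Item | Description |
| --- | --- |
| Model ID | doubao-seed-2-0-mini-260428 |
| Inference endpoint | Preset inference endpoint (nothing to create; use the model name directly) |
| Context | 256K |
| Max output | 128K |
| Input price | ¥0.2 / million tokens |
| Output price | ¥2 / million tokens |
| Modalities | Text + image + audio + video (four modalities) |
| Positioning | Lightweight and balanced; suited to general multimodal parsing, high concurrency, and cost-sensitive scenarios |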

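Doubao-Seed-1.6-Vision (alternative)

| Item | Description |
| --- | --- |
| Model ID | doubao-seed-1-6-vision-250815 |
| Inference endpoint | Model inference endpoint (must be created; use the ep-xxxxx ID) |
| Context | 256K |
| Max output | 32K~64K |
| Input price | ¥0.8 / million tokens (0~32K) |
| Output price | ¥8 / million tokens (0~32K) |
| Positioning | Deep-thinking vision model; suited to complex visual reasoning tasks |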

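Quick comparison

| | Doubao-Seed-2.0-Mini | Doubao-Seed-1.6-Vision |
| --- | --- | --- |
| Endpoint type | Preset (use the model name) | Custom (use the ep-ID) |
| Context | 256K | 256K |
| Max output | 128K | 32K~64K |
| Input price (per million tokens) | ¥0.2 | ¥0.8 |
| Output price (per million tokens) | ¥2 | ¥8 |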
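
To make the price gap concrete, the sketch below estimates the cost of one workload on each model from the per-million-token prices listed above. `estimateCost` and the `PRICES` map are hypothetical names introduced here for illustration, not part of the server, and the Doubao-Seed-1.6-Vision prices are the 0~32K tier figures only.

```javascript
// Prices in CNY per million tokens, copied from the tables above.
// Assumption: the Doubao-Seed-1.6-Vision figures are the 0~32K tier only.
const PRICES = {
  "doubao-seed-2-0-mini-260428": { input: 0.2, output: 2 },
  "doubao-seed-1-6-vision-250815": { input: 0.8, output: 8 },
};

// Hypothetical helper: returns the estimated cost in CNY.
function estimateCost(modelId, inputTokens, outputTokens) {
  const price = PRICES[modelId];
  if (!price) {
    throw new Error(`No price listed for model: ${modelId}`);
  }
  return (inputTokens * price.input + outputTokens * price.output) / 1e6;
}

// Compare both models on the same illustrative workload:
// 1,000,000 input tokens and 100,000 output tokens.
for (const modelId of Object.keys(PRICES)) {
  const cost = estimateCost(modelId, 1000000, 100000);
  console.log(`${modelId}: ¥${cost.toFixed(4)}`);
}
```

With both prices differing by the same factor, the estimate for Doubao-Seed-1.6-Vision comes out at four times the Doubao-Seed-2.0-Mini figure for any workload that stays inside the 0~32K tier.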

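Environment variables

| Variable | Required | Default | Description |
| --- | --- | --- | --- |
| ARK_API_KEY | | | Volcengine Ark API key. Get it from: Console → API Key management |
| DOUBAO_MODEL | | doubao-seed-2-0-mini-260428 | Model name (preset endpoint) or ep-xxxxx (model inference endpoint) |
| DOUBAO_BASE_URL | | https://ark.cn-beijing.volces.com/api/v3 | API base URL; usually does not need to be changed |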
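
As a rough illustration of how these variables could be resolved at startup, here is a minimal sketch. `loadConfig` is a hypothetical helper, not the server's actual code. It assumes that `ARK_API_KEY` is mandatory (it has no default) and that a `DOUBAO_MODEL` value starting with `ep-` denotes a model inference endpoint, as the naming in this README suggests.

```javascript
// Defaults taken from the environment variable table above.
const DEFAULT_MODEL = "doubao-seed-2-0-mini-260428";
const DEFAULT_BASE_URL = "https://ark.cn-beijing.volces.com/api/v3";

// Hypothetical helper; the real server may resolve its settings differently.
function loadConfig(env) {
  const apiKey = env.ARK_API_KEY;
  if (!apiKey) {
    // Assumption: the key is required because no default is listed for it.
    throw new Error("ARK_API_KEY is not set");
  }
  const model = env.DOUBAO_MODEL || DEFAULT_MODEL;
  return {
    apiKey,
    model,
    baseUrl: env.DOUBAO_BASE_URL || DEFAULT_BASE_URL,
    // Assumption: IDs beginning with "ep-" are model inference endpoints;
    // anything else is treated as a preset endpoint model name.
    endpointType: model.startsWith("ep-") ? "custom" : "preset",
  };
}

// Only the key is set here, so the model and base URL fall back to defaults.
console.log(loadConfig({ ARK_API_KEY: "ark-xxxxxxxxx-xxxxx" }));
```

In a real process the argument would be `process.env`; passing a plain object keeps the sketch easy to test.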

Quick start

Using a preset inference endpoint (recommended, nothing to create)

{
  "mcpServers": {
    "doubao-vision": {
      "command": "npx",
      "args": ["-y", "doubao-vision-mcp-server"],
      "env": {
        "ARK_API_KEY": "ark-xxxxxxxxx-xxxxx",
        "DOUBAO_MODEL": "doubao-seed-2-0-mini-260428"
      }
    }
  }
}

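Using a model inference endpoint (ep-)

First create an inference endpoint in the Volcengine Ark console, then set its ID as DOUBAO_MODEL: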

{
  "mcpServers": {
    "doubao-vision": {
      "command": "npx",
      "args": ["-y", "doubao-vision-mcp-server"],
      "env": {
        "ARK_API_KEY": "ark-xxxxxxxxx-xxxxx",
        "DOUBAO_MODEL": "ep-20260607xxxxx-xxxxx"
      }
    }
  }
}

Using a local path (development and debugging)

{
  "mcpServers": {
    "doubao-vision": {
      "command": "node",
      "args": ["E:\\Projects\\Claude\\MCP\\doubao\\doubao-vision-mcp-server\\src\\index.js"],
      "env": {
        "ARK_API_KEY": "ark-xxxxxxxxx-xxxxx",
        "DOUBAO_MODEL": "doubao-seed-2-0-mini-260428"
      }
    }
  }
}

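Tools

doubao_vision_understand

| Parameter | Required | Default | Description |
| --- | --- | --- | --- |
| image | | | Local image path (e.g. C:/img.png) or URL (https://...) |
| prompt | | | Instruction for the image; the more specific, the better |
| detail | | auto | Image detail level: auto / low / high |
| max_tokens | | 4096 | Maximum number of output tokens |
| temperature | | 1 | Sampling temperature (0~2) |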
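
The sketch below shows one plausible way these parameters could be mapped onto a request body. It assumes the Ark endpoint accepts an OpenAI-style chat completions payload with an `image_url` content part, and that local files are sent as base64 data URLs; neither is confirmed by this README. `toImageUrl`, `buildRequestBody`, and `MIME_TYPES` are hypothetical names, not the server's actual implementation.

```javascript
const fs = require("node:fs");
const path = require("node:path");

// Minimal extension-to-MIME map for the data URL (illustrative, not exhaustive).
const MIME_TYPES = {
  ".png": "image/png",
  ".jpg": "image/jpeg",
  ".jpeg": "image/jpeg",
  ".webp": "image/webp",
  ".gif": "image/gif",
};

// Hypothetical helper: pass http(s) URLs through, inline local files as data URLs.
function toImageUrl(image) {
  if (/^https?:\/\//i.test(image)) {
    return image;
  }
  const ext = path.extname(image).toLowerCase();
  const mime = MIME_TYPES[ext] || "application/octet-stream";
  const base64 = fs.readFileSync(image).toString("base64");
  return `data:${mime};base64,${base64}`;
}

// Hypothetical helper: applies the defaults from the parameter table above.
// Assumption: the payload follows the OpenAI-style chat completions shape.
function buildRequestBody(model, args) {
  const { image, prompt, detail = "auto", max_tokens = 4096, temperature = 1 } = args;
  return {
    model,
    max_tokens,
    temperature,
    messages: [
      {
        role: "user",
        content: [
          { type: "image_url", image_url: { url: toImageUrl(image), detail } },
          { type: "text", text: prompt },
        ],
      },
    ],
  };
}

const body = buildRequestBody("doubao-seed-2-0-mini-260428", {
  image: "https://example.com/doc.png",
  prompt: "Recognize the text in this image",
});
console.log(JSON.stringify(body, null, 2));
```

Keeping the image conversion separate from the body construction means the URL and local-file paths can be checked independently.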

Usage examples

Analyze this UI screenshot: C:\screenshot.png, and describe its layout and color scheme

Recognize the text in this image: https://example.com/doc.png
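
Behind a natural-language request like the ones above, an MCP client would typically send a `tools/call` JSON-RPC message to the server. The sketch below only illustrates that message shape using the tool name and parameters from this README; `buildToolCall` is a hypothetical helper, and the exact arguments a given client chooses are an assumption.

```javascript
// Hypothetical helper that builds an MCP "tools/call" JSON-RPC request.
function buildToolCall(id, args) {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: {
      name: "doubao_vision_understand",
      arguments: args,
    },
  };
}

// Roughly what the second usage example could translate to.
const request = buildToolCall(1, {
  image: "https://example.com/doc.png",
  prompt: "Recognize the text in this image",
});
console.log(JSON.stringify(request, null, 2));
```

Optional parameters such as `detail`, `max_tokens`, and `temperature` would go into the same `arguments` object when the client sets them.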

Local development

git clone https://github.com/kira4094/doubao-vision-mcp-server.git
cd doubao-vision-mcp-server
npm install
node src/index.js

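Verification results

Tested successfully with both endpoint types:

| Type | DOUBAO_MODEL | Token usage | Status |
| --- | --- | --- | --- |
| Preset inference endpoint | doubao-seed-2-0-mini-260428 | 1314 in → 295 out | Passed |
| Model inference endpoint | ep-xxxxxxx-xxxxx | 683 in → 277 out | Passed |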
