Doubao Image Description MCP Server

Doubao Image Description MCP Server

Enables visual understanding and image description capabilities for iFlow CLI and Claude Desktop using Doubao's vision model. It supports 18 image formats and features automatic optimization, caching, and Chinese language optimization.

Category
Visit Server

README

豆包图片描述 MCP 服务器

让 iFlow CLI 和 Claude Desktop 支持豆包视觉理解

🚀 快速开始(3 步)

1. 安装依赖

pip install -r requirements.txt

2. 配置 API Key

方式 1:环境变量(推荐)

# Windows
set VOLCENGINE_API_KEY=你的豆包API Key

# Linux/Mac
export VOLCENGINE_API_KEY=你的API Key

方式 2:配置文件

cp config.example.json config.json
# 编辑 config.json,填入你的 API Key

3. 添加到 iFlow CLI

iflow mcp add doubao-image-mcp python C:\Users\15579\doubao-image-mcp\server.py

4. 测试

iflow
> 描述: https://picsum.photos/800/600

📖 完整文档

  • INSTALL_GUIDE.md - 详细安装和配置指南
  • 论坛帖子.md - 完整的开发历程和技术细节

✨ 特性

  • 快速:20-30 秒响应
  • 💾 缓存:重复查询 < 0.1 秒
  • 🖼️ 18 种格式:JPG、PNG、GIF、WEBP、HEIC 等
  • 🗜️ 自动优化:大图片自动压缩
  • 🌏 中文优化:专为豆包模型优化

📂 项目包含

  • server.py - MCP 服务器主程序
  • config.example.json - 配置示例
  • requirements.txt - Python 依赖
  • compress.py - 图片压缩工具
  • diagnose_speed.py - 速度诊断工具
  • 完整文档

🆘 获取 API Key

  1. 访问 火山引擎控制台
  2. 开通"火山方舟"服务
  3. 创建 API Key
  4. 确保有视觉理解模型权限

💡 使用示例

# 描述本地图片
描述: D:/download/photo.jpg

# 描述网络图片
描述: https://example.com/image.jpg

# 自定义提示词
描述: photo.jpg,请用英文描述

📄 License

MIT License - 自由使用、修改、分发

🔗 相关链接


版本: 1.0.0
更新: 2026-02-03
分享者: [Your Name]

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured