Doubao Image Description MCP Server
Enables visual understanding and image description capabilities for iFlow CLI and Claude Desktop using Doubao's vision model. It supports 18 image formats and features automatic optimization, caching, and Chinese language optimization.
README
豆包图片描述 MCP 服务器
让 iFlow CLI 和 Claude Desktop 支持豆包视觉理解
🚀 快速开始(3 步)
1. 安装依赖
pip install -r requirements.txt
2. 配置 API Key
方式 1:环境变量(推荐)
# Windows
set VOLCENGINE_API_KEY=你的豆包API Key
# Linux/Mac
export VOLCENGINE_API_KEY=你的API Key
方式 2:配置文件
cp config.example.json config.json
# 编辑 config.json,填入你的 API Key
3. 添加到 iFlow CLI
iflow mcp add doubao-image-mcp python C:\Users\15579\doubao-image-mcp\server.py
4. 测试
iflow
> 描述: https://picsum.photos/800/600
📖 完整文档
- INSTALL_GUIDE.md - 详细安装和配置指南
- 论坛帖子.md - 完整的开发历程和技术细节
✨ 特性
- ⚡ 快速:20-30 秒响应
- 💾 缓存:重复查询 < 0.1 秒
- 🖼️ 18 种格式:JPG、PNG、GIF、WEBP、HEIC 等
- 🗜️ 自动优化:大图片自动压缩
- 🌏 中文优化:专为豆包模型优化
📂 项目包含
server.py- MCP 服务器主程序config.example.json- 配置示例requirements.txt- Python 依赖compress.py- 图片压缩工具diagnose_speed.py- 速度诊断工具- 完整文档
🆘 获取 API Key
- 访问 火山引擎控制台
- 开通"火山方舟"服务
- 创建 API Key
- 确保有视觉理解模型权限
💡 使用示例
# 描述本地图片
描述: D:/download/photo.jpg
# 描述网络图片
描述: https://example.com/image.jpg
# 自定义提示词
描述: photo.jpg,请用英文描述
📄 License
MIT License - 自由使用、修改、分发
🔗 相关链接
版本: 1.0.0
更新: 2026-02-03
分享者: [Your Name]
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.