gemini-image-mcp
MCP server that generates images using Gemini models via an OpenAI-compatible gateway.
README
gemini-image-mcp
一个基于 MCP (Model Context Protocol) 的 Node.js 服务:通过本地 OpenAI-compatible 网关(默认 http://127.0.0.1:8317)调用 Gemini 图片模型 gemini-3-pro-image-preview 生成图片,供 Claude Code/Claude Desktop 等 IDE 工具动态使用。
1) 安装
npm install
2) 环境变量(可选)
OPENAI_BASE_URL:OpenAI-compatible 服务地址(默认http://127.0.0.1:8317)OPENAI_API_KEY:如果你的网关需要鉴权就填(可留空)OPENAI_MODEL:默认gemini-3-pro-image-previewOPENAI_IMAGE_SIZE:可选,仅作为未传入size时的默认值;建议让客户端在调用generate_image时自己传sizeOPENAI_IMAGE_MODE:chat|images|auto,默认chat(CLIProxyAPI 这类网关通常用/v1/chat/completions出图;若你的网关支持/v1/images/generations可设为images)OPENAI_IMAGE_RETURN:path|image,默认path(path会把图片保存到本地并返回文件路径,避免 base64 导致 token 暴涨;image返回 MCPimagecontent)OPENAI_IMAGE_OUT_DIR:保存目录(默认debug-output/;相对路径以项目根目录为基准)OPENAI_DEBUG:设为1时会在 stderr 打印上游请求信息(不打印 key)OPENAI_TIMEOUT_MS:默认120000
可参考 .env.example。
3) 本地调试(不用放进 Claude Code)
推荐把 .env.example 复制成 .env,然后在 .env 里填好 OPENAI_API_KEY(.gitignore 已忽略 .env)。
- 直连上游调试(确认你的
http://127.0.0.1:8317是否能出图):npm run debug:upstream -- --prompt "A beautiful sunset over mountains" --size 1024x1024
- 走 MCP 工具调试(等价于 Claude Code 调用
generate_image):npm run debug:mcp -- --prompt "A beautiful sunset over mountains" --n 1 --size 1024x1024
图片会输出到 debug-output/。
4) 作为 MCP Server 使用(stdio)
该项目是 stdio 传输方式的 MCP Server,不建议直接在终端手动运行(会等待客户端请求)。
在 Claude Code / Claude Desktop 的 MCP 配置里添加类似如下(按你的实际路径修改):
{
"mcpServers": {
"gemini-image": {
"command": "node",
"args": ["d:/task/myself/nodejs/geminiimagemcp/src/index.js"],
"env": {
"OPENAI_BASE_URL": "http://127.0.0.1:8317",
"OPENAI_API_KEY": "<YOUR_KEY>",
"OPENAI_MODEL": "gemini-3-pro-image-preview"
}
}
}
}
也可以直接参考 mcp.example.json。
5) 可用工具
generate_image- 入参:
prompt(必填),size(可选),n(可选,1-4),output(可选:path|image),outDir(可选) - 返回:默认返回保存后的图片文件路径(多行);
output=image时返回 MCPimagecontent(base64 + mimeType)
- 入参:
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.