Nano Banana MCP Server
Enables image generation, editing, and analysis using Google's Gemini 2.5 Flash and Gemini 3 Pro models, with support for batch processing, style templates, and high-resolution output.
README
🎨 Nano Banana MCP Server
A powerful Model Context Protocol (MCP) server for advanced image generation, editing, and analysis using Google's Gemini 2.5 Flash Image Preview (aka "nano-banana") and Gemini 3 Pro Image Preview (aka "nano-banana pro") models.
English | 日本語
✨ Features
Core Capabilities
- 🖼️ Image Generation - Create images from text prompts with advanced customization
- ✏️ Image Editing - Edit existing images using natural language (supports file paths or base64)
- 🔍 Image Analysis - Analyze and answer questions about images using Gemini
- 🎭 Multi-Image Processing - Combine, style transfer, or edit multiple images at once
- 📦 Batch Generation - Generate multiple images from an array of prompts
- 🎨 Style Templates - Use pre-defined templates for consistent styling
- 🔄 Variations - Generate variations of existing images with controlled randomness
- 🔬 Image Comparison - Compare and analyze differences between images
Advanced Features
- Dual Model Support - Choose between Gemini 2.5 Flash Image Preview or Gemini 3 Pro Image Preview
- High Resolution Output - Generate up to 4K images (4096x4096) with Gemini 3 Pro
- Generation Control - Fine-tune with temperature, topP, topK parameters
- Parallel Processing - Batch operations with optional parallel execution
- Smart Templates - 10+ professional style presets
- Flexible Input - Accept both file paths and base64 encoded images
- Auto-save - Automatically saves generated images with timestamps
Model Comparison
| Feature | Gemini 2.5 Flash Image | Gemini 3 Pro Image |
|---|---|---|
| Max Resolution | 1K (1024px) | 1K / 2K / 4K |
| Aspect Ratios | All supported | All supported |
| Text Rendering | Good | Excellent |
| Speed | Fast | Moderate |
| Quality | High | Premium |
| Best For | Quick iterations | Final production |
Supported Aspect Ratios: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9
🚀 Quick Start
Prerequisites
- Node.js 18.0.0 or higher
- Valid Gemini API key
- MCP-compatible client (like Claude Code)
Installation
- Clone the repository:
git clone https://github.com/Riti0208/nano-banana-mcp.git
cd nano-banana-mcp
- Install dependencies:
npm install
- Build the project:
npm run build
Configuration
Add to your Claude Code MCP settings (claude_code_config.json):
{
"mcpServers": {
"nano-banana": {
"command": "node",
"args": ["./dist/index.js"],
"cwd": "/path/to/nano-banana-mcp",
"env": {
"GEMINI_API_KEY": "your-api-key-here"
}
}
}
}
📖 Usage Examples
Generate an Image (Standard)
generate_image({
prompt: "A serene mountain landscape at sunset",
config: {
temperature: 0.8,
topP: 0.95
}
})
Generate a High-Resolution Image (4K with Gemini 3 Pro)
generate_image({
prompt: "Ultra detailed cyberpunk city at night with neon signs",
model: "gemini-3-pro-image-preview",
resolution: "4K",
aspectRatio: "16:9",
config: {
temperature: 0.9
}
})
Edit an Image
edit_image({
prompt: "Add a rainbow in the sky",
imagePath: "./landscape.jpg",
model: "gemini-3-pro-image-preview",
resolution: "2K",
aspectRatio: "4:3",
config: {
temperature: 0.5
}
})
Generate Multiple Variations
generate_variations({
imagePath: "./original.png",
count: 3,
variationStrength: "moderate"
})
Batch Generation
batch_generate({
prompts: [
"A red apple",
"A green apple",
"A golden apple"
],
parallel: true
})
Use Style Templates
generate_with_template({
template: "photorealistic",
customization: "A vintage coffee shop interior"
})
Compare Images
compare_images({
image1Path: "./before.png",
image2Path: "./after.png",
compareType: "differences"
})
🛠️ Available Tools
| Tool | Description |
|---|---|
generate_image |
Generate images from text prompts |
edit_image |
Edit existing images with natural language |
analyze_image |
Analyze images and answer questions |
multi_image_edit |
Process multiple images together |
batch_generate |
Generate multiple images at once |
generate_variations |
Create variations of an image |
generate_with_template |
Use predefined style templates |
compare_images |
Compare two images |
🎨 Style Templates
- photorealistic - Ultra-realistic photography
- artistic - Artistic interpretation
- logo - Clean logo design
- portrait - Professional portraits
- landscape - Breathtaking landscapes
- product - Product photography
- architectural - Architectural photography
- fashion - Fashion photography
- food - Food photography
- abstract - Abstract art
⚙️ Configuration Parameters
| Parameter | Range | Description |
|---|---|---|
temperature |
0.0-2.0 | Controls randomness (lower = more focused) |
topP |
0.0-1.0 | Nucleus sampling threshold |
topK |
1-40 | Top-k sampling |
maxOutputTokens |
- | Maximum response length |
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
🙏 Acknowledgments
- Google Gemini team for the amazing image generation API
- Model Context Protocol team for the MCP framework
- All contributors who help improve this project
🔗 Links
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.