Nano Banana MCP Server
Enables generating, editing, and manipulating images using Google Gemini Flash 2.5 through natural language prompts. Supports text-to-image generation, image editing, multi-image composition, and batch processing with direct file management.
README
Nano Banana MCP Server

Nano Banana is an MCP server that exposes Google Gemini Flash 2.5 Image Generation via a clean, focused interface. It is not a general multiโmodel server โ it is a thin wrapper around Gemini Flash 2.5 image generation only.
โจ What It Does
Generate, edit, and manipulate images using natural language via any MCPโcompatible client.
๐ Compatible Tools
Works with any MCPโcompatible client. Below we document Gemini CLI (recommended) and a minimal generic MCP configuration.
Features
๐จ Image Generation (Powered by Gemini Flash 2.5)
- Text-to-Image Generation: Create images from text descriptions
- Image Editing: Modify existing images with text prompts
- Multi-Image Composition: Combine multiple images or transfer styles
- Batch Generation: Generate multiple variations at once
๐ ๏ธ Image Manipulation (Powered by Sharp)
- Combine Images: Stitch images into panoramas, grids, or strips
- Transform Images: Resize, crop, rotate, flip, and flop
- Adjust Images: Blur, sharpen, grayscale, tint, brightness, saturation
- Composite Images: Layer images with blend modes and positioning
- Batch Processing: Apply operations to entire directories
๐ง Developer Features
- Smart Path Handling: Automatically creates directories and handles file paths
- Image Validation: Verify generated images are valid and meet size requirements
- Comprehensive Error Handling: Clear feedback on API errors, quota issues, and failures
- MCP Protocol: Works with any MCP-compatible AI client
๐ Quick Install (1 Minute!)
Option 1: Global Install (Recommended)
# Install globally
npm install -g @lyalindotcom/nano-banana-mcp
# Run setup wizard
nano-banana setup
Option 2: NPX (No Install)
# Run on demand via NPX (no global install)
# Great for Gemini CLI and generic MCP config
npx -y -p @lyalindotcom/nano-banana-mcp nano-banana --version
Set your API key once in your shell (required):
export GEMINI_API_KEY="your-api-key"
Gemini CLI configuration (~/.gemini/settings.json):
{
"mcpServers": {
"nano-banana": {
"command": "npx",
"args": ["-y", "-p", "@lyalindotcom/nano-banana-mcp", "nano-banana-server"],
"env": { "GEMINI_API_KEY": "${GEMINI_API_KEY}" },
"timeout": 60000,
"trust": true
}
}
}
Generic MCP configuration (for other clients using stdio):
{
"mcpServers": {
"nano-banana": {
"command": "npx",
"args": ["-y", "-p", "@lyalindotcom/nano-banana-mcp", "nano-banana-server"],
"env": { "GEMINI_API_KEY": "${GEMINI_API_KEY}" }
}
}
}
Option 3: From Source
# Clone the repo
git clone https://github.com/LyalinDotCom/nano-banana-mcp.git
cd nano-banana-mcp
# Run the quickstart script - it does EVERYTHING!
./quickstart.sh
# That's it! Start using it:
gemini chat
> "Create a robot holding a banana at ./robot.png"
The quickstart script will:
- โ Check Node.js version
- โ Install Gemini CLI if needed
- โ Build the project
- โ Configure your API key
- โ Set up Gemini CLI integration
- โ Verify everything works
Minimal Setup (from source)
- Configure API key:
cp .env.example .env
# Edit .env with your GEMINI_API_KEY
- Build:
npm run build
- Configure Gemini CLI using the JSON shown above.
Get your API key from: https://aistudio.google.com/apikey
CLI Commands
The Nano Banana CLI provides powerful management tools:
# Interactive setup wizard (configures Gemini CLI integration)
nano-banana setup
# Create .env file with API key (does NOT configure Gemini CLI)
nano-banana init --api-key YOUR_KEY
# Start the MCP server directly (for manual testing)
nano-banana serve
# Check installation status
nano-banana status
# Diagnose any issues
nano-banana doctor
# Safely remove configuration
nano-banana remove
Important: Use setup to configure with Gemini CLI. The init command only creates a .env file.
Usage
Once configured, use with your MCP client of choice. Simply describe what you want:
- "Generate a cyberpunk city at ./city.png"
- "Create 5 potion icons at ./items/potions.png"
- "Add a sunset to ./photo.jpg"
- "Combine these images into a panorama"
๐ Why Nano Banana?
- ๐จ Full Gemini Flash 2.5 Power: Access the latest image generation capabilities
- ๐ Natural Language Interface: Just describe what you want
- ๐ง Flexible Integration: Works with Gemini CLI and other MCP clients
- ๐ Direct File Management: Images save exactly where you need them
- ๐ฏ Smart Context: One tool handles generation, editing, and composition
- โก Batch Operations: Generate up to 10 variations at once
๐ Documentation
This README contains the supported setup paths. Nano Banana targets the Gemini Flash 2.5 image generation model only.
Available Tools
generate_image
Generate, edit, or compose images using Gemini Flash 2.5.
Parameters:
prompt(optional): Text description for generationimages(optional): Array of input images (base64 or file paths)outputPath: Where to save the generated image(s)count(optional): Number of variations to generate (1-10)options(optional): Additional Gemini model options
Examples:
Text-to-image:
{
"prompt": "A cyberpunk city at night with neon lights",
"outputPath": "./assets/backgrounds/city.png"
}
Image editing:
{
"prompt": "Add a rainbow in the sky",
"images": [{ "data": "./photos/landscape.jpg" }],
"outputPath": "./photos/landscape-rainbow.jpg"
}
Batch generation:
{
"prompt": "Fantasy potion bottles, different colors",
"outputPath": "./items/potion.png",
"count": 5
}
validate_image
Check if an image file exists and is valid.
Parameters:
path: File path to validate
Returns:
exists: Whether the file existsvalid: Whether it's a valid imagedimensions: Image width and heightformat: Image format (png, jpeg, etc.)fileSize: File size in byteserror: Error message if validation failed
How It Works
-
Flexible Input: The server intelligently determines the operation mode:
- Text only โ Text-to-image generation
- Text + 1 image โ Image editing
- Text + multiple images โ Composition/style transfer
-
Path Management: Automatically creates directories and handles both absolute and relative paths
-
Batch Support: When
count> 1, generates multiple variations with numbered suffixes -
Validation: Uses Sharp to verify images are properly generated and meet minimum size requirements
-
No Overwrites: The server never overwrites existing files. If the requested
outputPath(or any batch variation path) already exists, the server returns an error and does not write. Choose a new path or remove the file before retrying.
Error Handling
The server provides detailed error information:
INVALID_API_KEY: Authentication failedQUOTA_EXCEEDED: API limits reachedAPI_ERROR: General API failureINVALID_INPUT: Bad parametersFILE_WRITE_ERROR: Cannot save to pathVALIDATION_FAILED: Image corrupt or too smallFILE_EXISTS: Output file already exists; choose a new path
Testing
Run the example test script:
npm run dev examples/test.ts
This will test:
- Basic text-to-image generation
- Image validation
- Batch generation
Project Structure
nano-banana-mcp/
โโโ src/
โ โโโ index.ts # MCP server entry point
โ โโโ tools.ts # Tool implementations
โ โโโ gemini-client.ts # Gemini API wrapper
โโโ examples/
โ โโโ test.ts # Example usage
โโโ .env.example # Environment template
โโโ README.md # This file
๐ฎ Real-World Use Cases
Game Development
> Generate a complete set of 16-bit RPG sprites: warrior, mage, archer at ./sprites/
> Create terrain tiles for a top-down game: grass, stone, water at ./tiles/
> Design UI elements: health bars, mana bars, inventory slots at ./ui/
Web Development
> Create a hero section background with gradients at ./public/hero.jpg
> Generate a set of feature icons for my SaaS landing page at ./icons/
> Design social media cards for my blog posts at ./social/
Content Creation
> Generate YouTube thumbnail about "AI Revolution" at ./thumbnails/ai.jpg
> Create Instagram carousel about productivity tips at ./instagram/
> Design presentation diagrams for cloud architecture at ./slides/
๐ ๏ธ Requirements
- Node.js 18 or higher
- Gemini API Key with image generation access (Get one here)
- TypeScript 5.0+ (for development)
๐ง Project Status
This is an experimental sample project. While GitHub issues are welcome for bug reports and feedback, this project is not actively seeking contributions and long-term maintenance is not guaranteed at this time.
๐ License
MIT License - see LICENSE file for details
๐ Acknowledgments
- Built with the Model Context Protocol
- Powered by Gemini Flash 2.5
- Inspired by the amazing MCP community
๐ Links
<p align="center"> Made with โค๏ธ and ๐ by a Nano Banana fan </p>
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.