Gemini Nano Banana MCP

Gemini Nano Banana MCP

An MCP server for AI-powered image generation, editing, and video generation using Google Gemini and Veo models.

Category
Visit Server

README

Gemini Nano Banana MCP

An MCP (Model Context Protocol) server for AI-powered image generation, editing, and video generation using Google Gemini and Veo. Works with Claude Code, Cursor, and any MCP-compatible client.

Features

  • Text-to-Image Generation - Generate images from text prompts via Gemini AI
  • Image Editing - Edit existing images with natural language instructions
  • Reference Images - Use reference images for style and content guidance
  • Text-to-Video Generation - Generate videos from text prompts using Veo (veo-3.1, veo-3, veo-2)
  • Image-to-Video - Use an image as the first frame for video generation
  • First & Last Frame Interpolation - Generate videos between two keyframe images
  • Session Memory - Continue editing the last image without re-specifying the path
  • Configurable Models - Choose any Gemini model for images, any Veo model for videos
  • Media History - Track and browse recently generated images and videos
  • Cross-Platform - Works on macOS, Windows, and Linux

Quick Start

1. Get a Gemini API Key

Get your free API key from Google AI Studio.

2. Install

npm install -g @seungmanchoi/nano-banana-mcp

Or install from source:

git clone https://github.com/seungmanchoi/nano-banana-mcp.git
cd nano-banana-mcp
npm install
npm run build

3. Configure Your MCP Client

Claude Code

Add to ~/.claude/settings.json:

{
  "mcpServers": {
    "nano-banana": {
      "command": "npx",
      "args": ["-y", "@seungmanchoi/nano-banana-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}

Cursor

Add to your MCP settings:

{
  "mcpServers": {
    "nano-banana": {
      "command": "npx",
      "args": ["-y", "@seungmanchoi/nano-banana-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}

From Source

If installed from source, use the absolute path:

{
  "mcpServers": {
    "nano-banana": {
      "command": "node",
      "args": ["/absolute/path/to/nano-banana-mcp/dist/index.js"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}

You can also skip the env field and configure the API key at runtime using the configure_api_key tool.

Authentication Modes

This server supports two authentication modes. The default is API key mode (above).

Mode A — API key (official, default)

Uses a Gemini API key from Google AI Studio. Supports image generation, image editing, and video (Veo). Note that image generation models are largely a paid feature.

Mode B — Free Google-cookie mode (consumer Gemini, unofficial)

Drives your logged-in gemini.google.com session via its session cookies instead of an API key. Free, and supports image generation + editing only (no video).

⚠️ This mode is unofficial. It talks to an undocumented internal endpoint, not the official API. It may break when Google changes things, cookies expire periodically and must be re-extracted, and use is a Terms-of-Service gray area. Intended for personal use with your own account.

1. Extract your cookies from a browser where you're logged into gemini.google.com:

  1. Open https://gemini.google.com and sign in.
  2. DevTools (F12) → ApplicationCookieshttps://gemini.google.com.
  3. Copy the value of __Secure-1PSID (required) and __Secure-1PSIDTS (recommended).

2a. Configure via environment variables:

{
  "mcpServers": {
    "nano-banana": {
      "command": "npx",
      "args": ["-y", "@seungmanchoi/nano-banana-mcp"],
      "env": {
        "GEMINI_AUTH_MODE": "gemini-web",
        "GEMINI_SECURE_1PSID": "your-__Secure-1PSID-value",
        "GEMINI_SECURE_1PSIDTS": "your-__Secure-1PSIDTS-value"
      }
    }
  }
}

2b. Or configure at runtime with the configure_google_login tool:

Use configure_google_login with secure1psid "<...>" and secure1psidts "<...>"

This switches the active mode to gemini-web and persists to ~/.nano-banana/config.json. Run configure_api_key again at any time to switch back to API key mode.

Model Configuration

Image Models

The default image model is gemini-2.0-flash-preview-image-generation. You can change it in several ways:

Option 1: Environment Variable

Set GEMINI_MODEL in your MCP client config:

{
  "mcpServers": {
    "nano-banana": {
      "command": "npx",
      "args": ["-y", "@seungmanchoi/nano-banana-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here",
        "GEMINI_MODEL": "gemini-2.0-flash-preview-image-generation"
      }
    }
  }
}

Option 2: Runtime Tool

Use the configure_model tool to change the default model at runtime. The setting persists across sessions in ~/.nano-banana/config.json.

Set the model to imagen-3.0-generate-002

Option 3: Per-Request Override

Pass the model parameter directly to generate_image, edit_image, or continue_editing to override the default for a single request:

Generate an image of a cat using model imagen-3.0-generate-002

Model Priority

  1. Per-request model parameter (highest priority)
  2. GEMINI_MODEL environment variable
  3. Config file (~/.nano-banana/config.json)
  4. Default: gemini-2.0-flash-preview-image-generation

Available Image Models

Model Tier Description
gemini-2.0-flash-preview-image-generation Free Default. Native image generation via Gemini 2.0 Flash.
imagen-3.0-generate-002 Paid Best quality. Google's dedicated image generation model.
imagen-3.0-fast-generate-001 Paid Fast variant of Imagen 3, optimized for speed.

Note: Free-tier API keys support gemini-2.0-flash-preview-image-generation. Imagen models require billing enabled on your Google Cloud project.

Available Video Models

Model Description
veo-3.1-generate-preview Latest. Native audio, scene extension, reference images, 4K support.
veo-3-generate-preview Previous generation with audio support.
veo-2-generate-preview Older generation, stable.

For the latest list of models, see Google AI documentation.

Tools

Image Tools

Tool Description
configure_api_key Set or update the Gemini API key (switches to apiKey mode). Persists across sessions.
configure_google_login Switch to free, unofficial gemini-web mode using consumer Gemini cookies (secure1psid, optional secure1psidts). Image generation + editing only.
configure_model Set the default Gemini model for images (apiKey mode). Persists across sessions.
generate_image Generate a new image from a text description. Supports optional model override.
edit_image Edit an existing image with text instructions, optional reference images, and optional model override.
continue_editing Continue editing the last generated/edited image in the session. Supports optional model override.
list_history List recently generated and edited images with prompts and timestamps.

Video Tools

Tool Description
generate_video Generate a video from a text prompt. Supports text-to-video, image-to-video, and frame interpolation.
list_video_history List recently generated videos with prompts, models, and timestamps.

Utility Tools

Tool Description
get_status Check configuration status, active models, output directories, and last image/video info.

Usage Examples

Image Generation

Generate an image of a sunset over mountains with a lake reflection
Edit the image at ~/nano-banana-images/gen_2025-01-01.png to add a boat on the lake
Continue editing - make the sky more vibrant with orange and pink tones

Video Generation

Generate a video of ocean waves crashing on a rocky shore at sunset
Generate a video of a cat playing with yarn, model veo-3.1-generate-preview, resolution 1080p, duration 8 seconds

Image-to-Video (First Frame)

Generate a video starting from the image at ~/nano-banana-images/gen_2025-01-01.png showing the scene coming to life with wind blowing through the trees

First + Last Frame Interpolation

Generate a video transitioning from the image at ~/images/start.png to ~/images/end.png with a smooth camera pan

Portrait Video

Generate a video of a person walking through a garden, aspect ratio 9:16

History & Status

Show me the last 5 images I generated
Show me recent video history
Check the current status
Switch to imagen-3.0-generate-002 model for higher quality

Video Generation Details

Configuration Options

Parameter Options Default Description
model veo-3.1-generate-preview, veo-3-generate-preview, veo-2-generate-preview veo-3.1-generate-preview Veo model to use
aspectRatio 16:9, 9:16 16:9 Landscape or portrait
resolution 720p, 1080p, 4k 720p Output resolution
durationSeconds 4, 6, 8 (number) Varies by model Video length
numberOfVideos 1+ 1 Number of variants
negativePrompt Any text - Elements to avoid

Generation Modes

  1. Text-to-Video: Provide only a prompt
  2. Image-to-Video: Provide prompt + imagePath (used as first frame)
  3. Frame Interpolation: Provide prompt + imagePath (first frame) + lastFramePath (last frame)

Important Notes

  • Video generation takes 1-6 minutes depending on load
  • Generated videos are saved as .mp4 files
  • Videos are watermarked with SynthID technology
  • Pricing: $0.75 per second of generated video
  • Videos are retained on Google servers for 2 days after generation

API Key Configuration

The server loads the API key in the following priority order:

  1. Environment variable - GEMINI_API_KEY
  2. Config file - ~/.nano-banana/config.json
  3. Runtime - via the configure_api_key tool

File Storage

Images

Platform Path
macOS / Linux ~/nano-banana-images/
Windows Documents\nano-banana-images\

Videos

Platform Path
macOS / Linux ~/nano-banana-videos/
Windows Documents\nano-banana-videos\

Project Structure

src/
├── index.ts              # Entry point
├── server.ts             # MCP server setup and request routing
├── config/
│   └── settings.ts       # API key and model management
├── services/
│   ├── gemini.ts         # Google Gemini & Veo API client
│   └── storage.ts        # Image/video file I/O and history tracking
├── tools/
│   ├── definitions.ts    # MCP tool schemas
│   └── handlers.ts       # Tool request handlers
└── types/
    └── index.ts          # TypeScript type definitions

Development

npm run dev          # Run with tsx (no build needed)
npm run build        # Compile TypeScript
npm run typecheck    # Type check without emitting
npm run lint         # Run ESLint

Tech Stack

  • Runtime: Node.js
  • Language: TypeScript (strict mode, ES2022)
  • MCP SDK: @modelcontextprotocol/sdk
  • AI Models: Google Gemini (images) + Veo (videos)
  • Validation: Zod

License

MIT

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured