Gemini MCP

Gemini MCP

Remote MCP server that exposes Google Gemini's text, image, video (Veo), and audio transcription capabilities as tools any MCP client can call directly.

Category
Visit Server

README

Gemini MCP

Remote MCP server that exposes Google Gemini's text, image, video (Veo), and audio transcription capabilities as tools any MCP client (Claude.ai, Claude Code, Cowork) can call directly. Same architecture as the Bosta and Salestrail MCP connectors: Node.js + Express, deployed as a Render web service, talking to Gemini's REST API.

Tools

Tool What it does
generate_text Text generation — captions, copy, summaries, translation, analysis. Optional system_instruction.
generate_image Generate an image from a prompt, or edit/compose an existing one via reference_image_base64 (Nano Banana / Gemini image model).
start_video_generation Kicks off a Veo video job from a prompt (+ optional reference image for image-to-video). Returns an operation_name — generation takes ~1-6 min.
check_video_status Polls an operation_name from start_video_generation. Returns {"status":"pending"} or the finished video as an embedded base64 resource.
transcribe_audio Transcribes audio (m4a/mp3/wav/aac/ogg/flac) from base64, with an optional instruction (e.g. "transcribe in Egyptian Arabic").

All tools return MCP error results (isError: true) on failure rather than crashing the server — confirmed against the live Gemini API during build (an invalid key correctly came back as a tool error, not a connection drop).

Known limitations / things to verify with a real key

  • Audio inline size limit: requests with inline base64 data top out around ~20MB total. Longer call recordings will need the Gemini File API (upload first, then reference by URI) — not implemented yet. Worth adding if your typical .m4a files are long.
  • .m4a MIME type: the tool defaults to audio/mp4 for .m4a files, since that's the container format (AAC inside MP4). This has not been verified against a real audio file yet. If Gemini rejects it, try audio/aac instead — pass it explicitly via the mime_type argument.
  • Veo response shape: check_video_status tries a couple of known response shapes (generateVideoResponse.generatedSamples[0].video and predictions[0]). Worth confirming against a real job once you have Veo access, since Google has changed this shape across API versions before.
  • Veo access: Veo 3 may require allowlisting / billing on your Gemini API key — check the Gemini API console if start_video_generation errors out.

Local setup

npm install
cp .env.example .env   # fill in GEMINI_API_KEY
npm start

Health check: curl http://localhost:3000/health

Deploying to Render

  1. Push this folder to a GitHub repo.
  2. Render dashboard → New → Web Service → connect the repo.
  3. Build command: npm install
  4. Start command: npm start
  5. Add environment variable GEMINI_API_KEY (and optionally the model overrides from .env.example) in Render's Environment tab.
  6. Once deployed, your MCP endpoint is https://<your-app>.onrender.com/mcp.

Connecting in Claude

Add it as a custom connector pointing at https://<your-app>.onrender.com/mcp, the same way Bosta MCP / Salestrail MCP are connected.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured