gemini-search-mcp
MCP server for web search powered by Google AI Mode (Gemini). Enables any AI agent to search the web in real-time for free and without rate limits.
README
gemini-search-mcp
<p align="center"> <img src="banner.png" width="700" alt="gemini-search-mcp"> </p>
<p align="center"> <img src="logo.png" width="120" alt="logo"> </p>
<p align="center"> MCP server for web search powered by Google AI Mode (Gemini). Free, unlimited, no API key. </p>
What is this
An MCP server that gives any AI agent (Claude, Cursor, Windsurf, etc.) the ability to search the web in real-time using Google's AI Mode — the same Gemini-powered search that lives in the "AI Mode" tab on Google Search.
Think of it as a free, unlimited alternative to Grok MCP / Tavily / SerpAPI, backed by Google's search index.
Features
- Free: No API key, no subscription, no quota
- Unlimited: 60+ requests/min with zero rate limiting
- Google quality: Powered by Gemini + Google Search (grounded in real web results)
- MCP native: Works with Claude Desktop, Claude Code, Cursor, Windsurf, Cline
- Also ships OpenAI API:
/v1/chat/completionsfor non-MCP clients - Fast: ~1.5s average response time
Quick Start
pip install playwright fastapi uvicorn "mcp[cli]"
playwright install chrome
MCP Server (for AI agents)
python mcp_server.py
OpenAI-compatible API
python -m gemini_search --port 8080
MCP Integration
Claude Code
claude mcp add gemini-search -- python /path/to/gemini-search-mcp/mcp_server.py
Claude Desktop
Add to claude_desktop_config.json:
{
"mcpServers": {
"gemini-search": {
"command": "python",
"args": ["/path/to/gemini-search-mcp/mcp_server.py"],
"env": {
"CDP_URL": "http://127.0.0.1:9222"
}
}
}
}
Cursor / Windsurf
Same pattern — point to mcp_server.py as an stdio MCP server.
MCP Tools
| Tool | Description |
|---|---|
web_search(query) |
Search the web and get a synthesized answer grounded in real-time results |
ask(prompt) |
General question — AI Mode auto-decides whether to search the web |
Examples
web_search("latest AI regulation news 2026")
→ "The EU AI Act enforcement began on June 1, 2026, requiring..."
web_search("Bitcoin price today")
→ "As of June 30, 2026, Bitcoin is trading at $59,687 USD..."
ask("what is 1847 * 293")
→ "541171"
OpenAI API Usage
curl http://localhost:8080/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{"model":"gemini-search","messages":[{"role":"user","content":"What happened in the news today?"}]}'
| Field | Value |
|---|---|
| Base URL | http://localhost:8080/v1 |
| API Key | anything |
| Model | gemini-search |
Environment Variables
| Variable | Default | Description |
|---|---|---|
CDP_URL |
(none) | Chrome DevTools URL. If set, connects to existing Chrome instead of launching one |
BROWSER_CHANNEL |
chrome |
Browser to use: chrome, msedge, chromium |
HEADLESS |
1 |
Set to 0 to show browser window |
How It Works
Google rate-limits by TLS fingerprint quality — not by IP. Automated HTTP clients (curl, requests, httpx) get throttled after a few requests. But a real Chrome browser's fetch() calls are trusted unconditionally.
This tool runs a single Playwright page and executes all queries as fetch() inside it, giving every request an authentic Chrome TLS/HTTP2 fingerprint. Google sees normal browser traffic and applies no rate limits.
Agent calls web_search("query")
→ Playwright page.evaluate(fetch)
→ Google Search AI Mode (token extraction + folwr endpoint)
→ Parse answer from HTML response
→ Return to agent
Comparison
| gemini-search-mcp | Grok MCP | Tavily | |
|---|---|---|---|
| Cost | Free | xAI API key ($) | API key ($) |
| Rate limit | None | API quota | API quota |
| Search backend | Google Search | Grok + web | Proprietary |
| Answer quality | Gemini synthesized | Grok synthesized | Extracted snippets |
| Setup | Chrome + playwright | API key | API key |
Docker
docker compose up -d
Requirements
- Python 3.10+
- Chrome, Edge, or Chromium
playwright,fastapi,uvicorn,mcp[cli]
Limitations
- Requires Chrome/Edge/Chromium installed
- No conversation memory between requests
- Answer extraction relies on Google's DOM structure (may break on updates)
- Streaming is chunked, not per-token
Acknowledgments
- GenericAgent — 本项目核心开发依仗 GA 提供的 AI 能力
- linux.do community
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.