Search MCP Server
Enables AI agents to perform unified web searches, GitHub, and GitLab searches with caching, reranking, and fallback across multiple providers.
README
Search MCP Server
Unified MCP search tools for AI agents — abstracts providers, caching, reranking, and rate limits behind a few simple tools.
Tools
| Tool | Purpose |
|---|---|
search |
Universal web search with caching, reranking, and fallback across 6 providers |
github_search |
Search GitHub repos, code, issues, and users |
gitlab_search |
Search GitLab projects, issues, MRs, and code blobs |
status |
Server diagnostics, provider health, budget state |
Quick Start
git clone <repo-url> search-mcp
cd search-mcp
npm install
cp .env.example .env
npm run build
npm start
Configuration
Copy .env.example to .env and set your API keys. No keys are required — DuckDuckGo and Bing work out of the box.
| Provider | Key Required | Tier | Rate Limit |
|---|---|---|---|
| DuckDuckGo | No | 1 | 10 req/min |
| Bing | No | 1 | — |
| Brave | BRAVE_API_KEY |
2 | 2000/month |
| Tavily | TAVILY_API_KEY |
2 | 1000/month |
| Exa | EXA_API_KEY |
3 | Trial |
| Firecrawl | FIRECRAWL_API_KEY |
3 | Trial |
Query Formats
| Tool | Pattern | Example |
|---|---|---|
search |
General web search | typescript tutorial, how to install docker |
search |
GitHub-oriented | repo:vercel/next.js, stars:>1000 language:rust |
search |
Docs-oriented | express api reference, docker compose guide |
search |
News-oriented | react 19 release notes, latest ai news |
github_search |
Native GitHub search syntax | repo:org/name, user:vercel, language:typescript |
gitlab_search |
Native GitLab search syntax | project:org/name, scope filter via type param |
Tool Reference
search
search({
query: string,
})
Returns merged results from the first 2 healthy providers (parallel), deduplicated and reranked by relevance.
Intent classification runs automatically (no intent parameter). The server detects query intent
(github / docs / news / web) via NLI zero-shot (DeBERTa-v3-xsmall).
NLI Reranking
Every search result is scored against the query using the same NLI model:
- Model:
Xenova/nli-deberta-v3-xsmall(177M params, ONNX-optimized) - Inference: ~10–15ms per result on CPU, ~300ms for 20 results (parallelized)
- Score:
0.9 * NLI(query, snippet) + 0.04 * domain + 0.03 * freshness + 0.03 * position - Config: Override via
INTENT_CLASSIFIER_MODELin.env
The NLI model loads lazily on first request and is shared between intent classification and reranking.
github_search
github_search({
query: string,
type?: "repositories" | "code" | "issues" | "users",
language?: string,
stars?: string, // e.g. ">1000", "500..5000"
page?: number
})
Rate limit: 60 req/hr without token, 5000 req/hr with GITHUB_TOKEN.
gitlab_search
gitlab_search({
query: string,
scope?: "projects" | "issues" | "merge_requests" | "blobs",
page?: number
})
Requires GITLAB_TOKEN with read_api scope.
status
status()
Returns provider health, cache stats, budget state, uptime.
Architecture
Core pipeline: Budget Check → Normalize → Classify (intent + freshness) → Cache → Router (parallel 2) → Rerank → Cache → Respond
Full docs:
If you want to learn more about the decisions made, check out the Architecture Decisions document.
Donations / Support the Project
-
EVM Address (USDT, USDC, ETH, BNB):
0x3acf78e721aa065bd1509735a3ace630fcd0f452Supported networks: BNB Smart Chain (BEP20), Polygon, Arbitrum One, Ethereum (ERC20) <details> <summary>Show EVM QR Code</summary> <img src=".github/images/evm.jpg" width="300" alt="EVM Deposit QR"> </details>
-
USDT (TRC20):
TMG1BZJswd8UacHANKVi9Veq5FqSbaMjAQSupported network: Tron (TRC20) <details> <summary>Show TRC20 QR Code</summary> <img src=".github/images/trc20.jpg" width="300" alt="TRC20 Deposit QR"> </details>
-
TON / USDT (TON):
UQDjf84AZSwuCPC3UWFv28p-V44zI-1lfhqCMfaex2Q-jDsYSupported network: The Open Network (TON) <details> <summary>Show TON QR Code</summary> <img src=".github/images/ton.jpg" width="300" alt="TON Deposit QR"> </details>
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.