Search MCP Server

Search MCP Server

Enables AI agents to perform unified web searches, GitHub, and GitLab searches with caching, reranking, and fallback across multiple providers.

Category
Visit Server

README

Search MCP Server

Unified MCP search tools for AI agents — abstracts providers, caching, reranking, and rate limits behind a few simple tools.

Tools

Tool Purpose
search Universal web search with caching, reranking, and fallback across 6 providers
github_search Search GitHub repos, code, issues, and users
gitlab_search Search GitLab projects, issues, MRs, and code blobs
status Server diagnostics, provider health, budget state

Quick Start

git clone <repo-url> search-mcp
cd search-mcp
npm install
cp .env.example .env
npm run build
npm start

Configuration

Copy .env.example to .env and set your API keys. No keys are required — DuckDuckGo and Bing work out of the box.

Provider Key Required Tier Rate Limit
DuckDuckGo No 1 10 req/min
Bing No 1
Brave BRAVE_API_KEY 2 2000/month
Tavily TAVILY_API_KEY 2 1000/month
Exa EXA_API_KEY 3 Trial
Firecrawl FIRECRAWL_API_KEY 3 Trial

Query Formats

Tool Pattern Example
search General web search typescript tutorial, how to install docker
search GitHub-oriented repo:vercel/next.js, stars:>1000 language:rust
search Docs-oriented express api reference, docker compose guide
search News-oriented react 19 release notes, latest ai news
github_search Native GitHub search syntax repo:org/name, user:vercel, language:typescript
gitlab_search Native GitLab search syntax project:org/name, scope filter via type param

Tool Reference

search

search({
  query: string,
})

Returns merged results from the first 2 healthy providers (parallel), deduplicated and reranked by relevance.

Intent classification runs automatically (no intent parameter). The server detects query intent (github / docs / news / web) via NLI zero-shot (DeBERTa-v3-xsmall).

NLI Reranking

Every search result is scored against the query using the same NLI model:

  • Model: Xenova/nli-deberta-v3-xsmall (177M params, ONNX-optimized)
  • Inference: ~10–15ms per result on CPU, ~300ms for 20 results (parallelized)
  • Score: 0.9 * NLI(query, snippet) + 0.04 * domain + 0.03 * freshness + 0.03 * position
  • Config: Override via INTENT_CLASSIFIER_MODEL in .env

The NLI model loads lazily on first request and is shared between intent classification and reranking.

github_search

github_search({
  query: string,
  type?: "repositories" | "code" | "issues" | "users",
  language?: string,
  stars?: string,             // e.g. ">1000", "500..5000"
  page?: number
})

Rate limit: 60 req/hr without token, 5000 req/hr with GITHUB_TOKEN.

gitlab_search

gitlab_search({
  query: string,
  scope?: "projects" | "issues" | "merge_requests" | "blobs",
  page?: number
})

Requires GITLAB_TOKEN with read_api scope.

status

status()

Returns provider health, cache stats, budget state, uptime.

Architecture

Pipeline

Core pipeline: Budget Check → Normalize → Classify (intent + freshness) → Cache → Router (parallel 2) → Rerank → Cache → Respond

Full docs:

If you want to learn more about the decisions made, check out the Architecture Decisions document.

Donations / Support the Project

  • EVM Address (USDT, USDC, ETH, BNB): 0x3acf78e721aa065bd1509735a3ace630fcd0f452

    Supported networks: BNB Smart Chain (BEP20), Polygon, Arbitrum One, Ethereum (ERC20) <details> <summary>Show EVM QR Code</summary> <img src=".github/images/evm.jpg" width="300" alt="EVM Deposit QR"> </details>

  • USDT (TRC20): TMG1BZJswd8UacHANKVi9Veq5FqSbaMjAQ

    Supported network: Tron (TRC20) <details> <summary>Show TRC20 QR Code</summary> <img src=".github/images/trc20.jpg" width="300" alt="TRC20 Deposit QR"> </details>

  • TON / USDT (TON): UQDjf84AZSwuCPC3UWFv28p-V44zI-1lfhqCMfaex2Q-jDsY

    Supported network: The Open Network (TON) <details> <summary>Show TON QR Code</summary> <img src=".github/images/ton.jpg" width="300" alt="TON Deposit QR"> </details>

License

MIT

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured