MCP-AI-Gateway

MCP-AI-Gateway

Unified local MCP AI Gateway that routes across Groq, OpenRouter, Mistral, and local Ollama providers, with OpenAI-compatible APIs, MCP tools, fallback/racing router, monitoring, and web dashboard.

Category
Visit Server

README

MCP-AI-Gateway

Python FastAPI Docker MCP CI

Unified local MCP AI Gateway that routes across Groq, OpenRouter, Mistral, and local Ollama providers, with OpenAI-compatible APIs, MCP tools, fallback/racing router, monitoring, and web dashboard.

Web UI

๐Ÿš€ Features

  • Async Python gateway with provider abstraction layer
  • Built-in providers:
    • Groq (openai/gpt-oss-120b default, plus moonshotai/kimi-k2-instruct-0905)
    • OpenRouter (openai/gpt-4o-mini default)
    • Mistral (codestral-latest default, plus devstral-medium-latest)
    • Ollama
  • Intelligent router:
    • priority routing
    • fallback routing
    • parallel racing mode
    • rate-limit aware provider scoring
  • MCP server capabilities:
    • tool registration/discovery
    • prompt template discovery/rendering
    • schema validation
    • SSE and HTTP transport
  • OpenAI-compatible API:
    • POST /v1/chat/completions
    • POST /v1/completions
    • GET /v1/models
  • Web UI (React + Vite + Tailwind):
    • Dashboard
    • Providers
    • Models
    • Chat
    • Logs
    • Stats
    • Runtime gateway API key generation
  • Observability:
    • Prometheus metrics (/metrics)
    • structured logs
    • SQLite request statistics
  • Extensible plugin system for additional providers
  • CLI: mcp-ai start|stop|status|providers|models
  • Docker + docker-compose support
  • GitHub Actions CI for backend lint/tests and frontend build

๐Ÿ—‚๏ธ Repository Structure

mcp-ai-gateway/
  api/
  providers/
  router/
  mcp_server/
  plugins/
  database/
  web_ui/
  config/
  cli/
  tests/
  Dockerfile
  docker-compose.yml
  README.md

โšก Quick Start (Local)

1) ๐Ÿ“ฆ Prerequisites

  • Python 3.11+
  • Node.js 20+
  • Optional: local Ollama daemon at http://localhost:11434

2) ๐Ÿ Install backend

python3 -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -e .[dev]

3) ๐Ÿ” Configure environment

cp .env.example .env
# edit .env values

4) โš™๏ธ Edit config

config/config.yaml controls providers, routing, security, and default models.

5) โ–ถ๏ธ Start backend

uvicorn api.main:app --host 0.0.0.0 --port 8000 --reload

6) ๐Ÿ–ฅ๏ธ Start web UI

cd web_ui
npm install
VITE_API_BASE=http://localhost:8000 VITE_API_KEY=change-me npm run dev

Open http://localhost:3000

๐Ÿณ Docker Deployment

docker compose up --build

Services:

๐Ÿงฉ API Overview

OpenAI-Compatible ๐Ÿค

  • POST /v1/chat/completions
  • POST /v1/completions
  • GET /v1/models

Use header X-API-Key: <key> (or Authorization: Bearer <key>).

MCP Endpoints ๐Ÿ”Œ

  • JSON-RPC MCP transport: POST /mcp
  • SSE streaming output: POST /mcp/stream

Supported MCP methods:

  • initialize
  • notifications/initialized
  • ping
  • tools/list
  • tools/call
  • prompts/list
  • prompts/get

Dashboard APIs ๐Ÿ“Š

  • GET /api/dashboard
  • GET /api/providers
  • POST /api/providers/{name}/toggle
  • POST /api/providers/{name}/priority
  • POST /api/providers/{name}/config
  • GET /api/models
  • GET /api/settings
  • POST /api/settings/default-model
  • POST /api/settings/api-key/generate
  • GET /api/logs
  • GET /api/stats

Monitoring ๐Ÿ“ˆ

  • GET /health
  • GET /metrics

๐Ÿ› ๏ธ CLI

mcp-ai start
mcp-ai status
mcp-ai providers
mcp-ai models
mcp-ai stop

๐Ÿงญ Warp MCP Integration

Add this to Warp MCP config:

{
  "mcpServers": {
    "mcp-ai-gateway": {
      "url": "http://localhost:8000/mcp",
      "headers": {
        "Authorization": "Bearer REPLACE_WITH_MCP_GATEWAY_API_KEY"
      }
    }
  }
}

๐Ÿงฑ Configuration Example

providers:
  groq:
    enabled: true
    api_key: ${GROQ_API_KEY}
    priority: 1
    default_model: openai/gpt-oss-120b
  mistral:
    enabled: false
    api_key: ${MISTRAL_API_KEY}
    priority: 12
    default_model: codestral-latest
  openrouter:
    enabled: true
    priority: 2
    default_model: openai/gpt-4o-mini
  ollama:
    enabled: true
    priority: 3
routing:
  fallback_enabled: true
  racing_mode: false

๐Ÿงช Plugin Providers

New providers can be added under plugins/ by implementing ProviderPlugin and enabling provider config in config/config.yaml.

Included plugin examples:

  • openai_provider.py
  • together_provider.py
  • openrouter_provider.py

โœ… Testing

pytest

CI runs on pushes and pull requests to main via .github/workflows/ci.yml:

  • Backend job: ruff check . and pytest -q
  • Frontend job: npm install and npm run build in web_ui/

๐Ÿ”„ Release Workflow

  • Open PR from dev/* branch to main
  • Ensure CI is green
  • Merge using GitHub CLI or GitHub UI

๐Ÿ“š Developer Guide

Detailed guide: docs/developer-guide.md

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured