semantic-mcp
A semantic router that enables discovery, management, and execution of tools across multiple MCP servers using natural language.
README
Semantic MCP
Semantic router for MCP ecosystems - Discover, manage, and execute tools across multiple MCP servers with progressive disclosure.
Overview
semantic-mcp is a FastMCP-based MCP server that provides semantic discovery and lifecycle management for other MCP servers. It connects to a discovery service for semantic search and manages server lifecycles locally via ZMQ-based IPC.
LLM Client (Claude/Cline)
│ MCP Protocol
▼
┌─────────────────────────────┐
│ semantic-mcp │
│ (FastMCP MCP Server) │
├─────────────────────────────┤
│ Discovery → mcp-index API │
│ Execution → ZMQ + Sessions │
└─────────────────────────────┘
│ │
▼ ▼
mcp-index MCP Servers
(Elasticsearch) (stdio/http)
Related Projects
- mcp-index - Elasticsearch-based semantic discovery service for MCP servers. Required backend for
semantic-mcpto enable semantic search and server registry.
Installation
Option 1: uvx (Recommended)
uvx semantic-mcp serve --transport stdio
Option 2: pip/uv
# Install from PyPI
pip install semantic-mcp
# Or with uv
uv pip install semantic-mcp
# Run
semantic-mcp serve --transport stdio
Option 3: Docker
docker pull milkymap/semantic-mcp:0.2
docker run -d \
-p 8001:8001 \
-e DISCOVERY_URL=http://your-discovery-service \
-e DISCOVERY_API_KEY=your-key \
milkymap/semantic-mcp:0.2 serve --transport streamable-http --port 8001
Option 4: From source
git clone https://github.com/milkymap/semantic-mcp
cd semantic-mcp
uv sync
uv run semantic-mcp serve
Configuration
Environment Variables
| Variable | Description | Default |
|---|---|---|
DISCOVERY_URL |
Discovery service API URL | http://localhost:8000 |
DISCOVERY_API_KEY |
API key for discovery authentication | None |
DISCOVERY_ENCRYPTION_KEY |
Key to decrypt sensitive env vars in server configs | None |
TOOL_OFFLOADED_DATA_PATH |
Path for large result offloading | /tmp/mcp_offloaded |
MAX_RESULT_TOKENS |
Max tokens before content offloading | 4096 |
BACKGROUND_QUEUE_SIZE |
Max background tasks in queue | 100 |
OPENAI_API_KEY |
OpenAI API key (for image descriptions) | None |
MCP Client Integration
Claude Code / Cline (uvx)
Add to your .mcp.json or MCP config:
{
"mcpServers": {
"semantic-mcp": {
"command": "uvx",
"args": ["semantic-mcp", "serve", "--transport", "stdio"],
"env": {
"DISCOVERY_URL": "https://your-discovery-service",
"DISCOVERY_API_KEY": "your-api-key"
}
}
}
}
Claude Desktop (Docker)
{
"mcpServers": {
"semantic-mcp": {
"command": "docker",
"args": [
"run", "-i", "--rm",
"-e", "DISCOVERY_URL", "-e", "DISCOVERY_API_KEY",
"--add-host=host.docker.internal:host-gateway",
"milkymap/semantic-mcp:0.2", "serve", "--transport", "stdio"
],
"env": {
"DISCOVERY_URL": "http://host.docker.internal:8000",
"DISCOVERY_API_KEY": "your-key"
}
}
}
}
Remote HTTP Server
Start the server:
semantic-mcp serve --transport streamable-http --host 0.0.0.0 --port 8001
Client configuration:
{
"mcpServers": {
"semantic-mcp": {
"url": "http://your-server:8001/mcp"
}
}
}
Available Operations
semantic-mcp exposes a single semantic_router tool with these operations:
Discovery (lightweight)
| Operation | Description |
|---|---|
search_tools |
Search for tools using natural language |
search_servers |
Search for servers using natural language |
list_servers |
List all registered servers |
get_server_tools |
List tools on a server |
get_statistics |
Get server/tool counts |
Exploration (full details)
| Operation | Description |
|---|---|
get_server_info |
Get detailed server information |
get_tool_details |
Get full tool schema and description |
Lifecycle
| Operation | Description |
|---|---|
manage_server |
Start or shutdown a server |
list_running_servers |
List currently running servers |
Execution
| Operation | Description |
|---|---|
execute_tool |
Execute a tool on a running server |
poll_task_result |
Check background task status |
cancel_task |
Cancel a running background task |
list_tasks |
List all background tasks |
get_content |
Retrieve offloaded content by reference ID |
Workflow
1. DISCOVER search_tools("your need") → Find relevant tools
↓
2. EXPLORE get_server_info(server) → Check capabilities
get_server_tools(server) → List available tools
↓
3. UNDERSTAND get_tool_details(server, tool) → Get full schema (REQUIRED)
↓
4. START manage_server(server, "start") → Start the MCP server
↓
5. EXECUTE execute_tool(server, tool, args) → Run the tool
↓
6. CLEANUP manage_server(server, "shutdown") → Stop when done (optional)
Important rules:
- Always call
get_tool_detailsbeforeexecute_toolto understand the schema - Always call
manage_server(start)before executing tools - Use
in_background=truefor long-running operations, thenpoll_task_result - Large responses are automatically offloaded; use
get_content(ref_id)to retrieve
Architecture
| Component | Description |
|---|---|
| RuntimeEngine | Core runtime managing ZMQ communication and server lifecycle |
| DiscoveryClient | HTTP client for discovery service API |
| ContentManager | Large result offloading (text chunking, images) |
| BackgroundTasks | Priority queue for async tool execution |
| FastMCP | MCP server framework exposing tools to LLMs |
Development
# Install with dev dependencies
uv sync --group dev
# Run tests
uv run pytest tests/ -v
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.