ML Experiment Tracker MCP

ML Experiment Tracker MCP

Query your MLflow experiments in plain English using Claude Desktop. No dashboards, no SQL — just ask.

Category
Visit Server

README

ML Experiment Tracker MCP

Query your MLflow experiments in plain English using Claude Desktop. No dashboards, no SQL — just ask.

FastMCP MLflow Python


What It Does

Connect Claude Desktop to your MLflow experiment tracker via an MCP server. Ask natural language questions and Claude queries your experiments automatically.

Example questions you can ask Claude:

  • "List all my ML experiments"
  • "Which run had the best val_f1 in pothole-detector?"
  • "What hyperparameters did the best model use?"
  • "Compare the top 5 runs in road-crack-detection by accuracy"
  • "Show me the metric history for run ID xyz"

Tools Exposed

Tool Description
list_experiments Lists all experiments in MLflow
compare_runs Ranks top N runs by a given metric
get_best_model Returns the single best run for a metric
fetch_metric_history Returns epoch-by-epoch metric history for a run

Prerequisites

Before starting, make sure you have:

  • Python 3.11+ installed
  • uv package manager (pip install uv)
  • Claude Desktop downloaded from https://claude.ai/download
  • Git installed

Installation

1. Clone the repository

git clone https://github.com/yourusername/experiments-mcp.git
cd experiments-mcp

2. Create virtual environment with Python 3.11

uv venv --python /opt/homebrew/opt/python@3.11/bin/python3.11
source .venv/bin/activate

Windows: use .venv\Scripts\activate instead

3. Install dependencies

uv pip install -r requirements.txt

requirements.txt contains:

fastmcp
mlflow
pandas

4. Verify installation

python --version    # Should show Python 3.11.x
mlflow --version    # Should show mlflow, version 3.x.x

Seed Dummy Experiment Data To Test MCP (Optional)

If you don't have real MLflow experiments, run the seed script to populate fake but realistic data:

python seed_mlflow.py

This creates 2 experiments with 35 total runs:

  • pothole-detector — 20 runs
  • road-crack-detection — 15 runs

Each run has params (learning_rate, batch_size, backbone, optimizer) and metrics (val_f1, val_accuracy, val_loss, train_f1).


Connect to Claude Desktop

1. Find your Python path

which python
# Example output: /Users/yourname/experiments-mcp/.venv/bin/python

2. Find your project path

pwd
# Example output: /Users/yourname/experiments-mcp

3. Edit Claude Desktop config

Open the config file:

# Mac
open ~/Library/Application\ Support/Claude/

# Windows
# %APPDATA%\Claude\claude_desktop_config.json

Add the mcpServers block to claude_desktop_config.json:

{
  "mcpServers": {
    "ml-experiment-tracker": {
      "command": "/Users/yourname/experiments-mcp/.venv/bin/python",
      "args": [
        "/Users/yourname/experiments-mcp/server.py"
      ]
    }
  }
}

Replace /Users/yourname/experiments-mcp with your actual paths from steps 1 and 2.

4. Restart Claude Desktop

Cmd + Q  →  reopen Claude Desktop

5. Verify connection

Click the + button in the chat input → Add plugins → your server should appear as connected.

Or simply type in chat:

List all my ML experiments

Project Structure

experiments-mcp/
├── server.py           # MCP server — all 4 tools
├── mlflow_client.py    # MLflow SDK wrapper
├── seed_mlflow.py      # Fake data generator
├── requirements.txt    # Dependencies
└── README.md

Using Your Own MLflow Data

By default the server points to a SQLite DB at ~/mcp/experiments-mcp/mlflow.db.

To point it to your own MLflow instance, edit the top of mlflow_client.py:

# Local SQLite (default)
mlflow.set_tracking_uri("sqlite:///path/to/your/mlflow.db")

# Remote MLflow server
mlflow.set_tracking_uri("http://your-mlflow-server:5000")

Common Errors & Fixes

PermissionError: Operation not permitted: .venv/pyvenv.cfg

Cause: Project is inside ~/Desktop — Claude Desktop cannot access Desktop on Mac due to macOS security.

Fix: Move the project out of Desktop:

mv ~/Desktop/experiments-mcp ~/experiments-mcp
cd ~/experiments-mcp

Update the paths in claude_desktop_config.json accordingly.


ModuleNotFoundError: No module named 'mlflow'

Cause: Wrong Python being used — system Python instead of venv Python.

Fix:

# Check which python is active
which python

# If it shows /usr/bin/python or /opt/homebrew/bin/python — wrong one
# Re-activate your venv:
source /full/path/to/experiments-mcp/.venv/bin/activate

# Then reinstall
uv pip install -r requirements.txt

MlflowException: filesystem tracking backend is in maintenance mode

Cause: MLflow 3.x dropped file-based storage (mlruns/ folder). Requires SQLite.

Fix: Make sure mlflow_client.py uses SQLite URI:

import os
mlflow.set_tracking_uri(
    f"sqlite:///{os.path.expanduser('~/experiments-mcp/mlflow.db')}"
)

Also add the same line to seed_mlflow.py before running it.


sqlite3.OperationalError: unable to open database file

Cause: The .db file doesn't exist yet — seed script hasn't been run, or was run from a different directory.

Fix:

cd ~/experiments-mcp
source .venv/bin/activate
python seed_mlflow.py

NameError: name 'os' is not defined

Cause: Missing import os at the top of mlflow_client.py.

Fix: Add to the very first line of mlflow_client.py:

import os

Server disconnected in Claude Desktop

Cause: Could be any Python error in server.py or mlflow_client.py.

Fix: Check the logs:

tail -f ~/Library/Logs/Claude/mcp-server-ml-experiment-tracker.log

The last error in the log will tell you exactly what went wrong.


zsh: command not found: python

Cause: On newer Macs, python is not aliased — use python3 or activate venv properly.

Fix:

# Option 1: use python3
python3 seed_mlflow.py

# Option 2: recreate venv with correct python
uv venv --python /opt/homebrew/opt/python@3.11/bin/python3.11
source .venv/bin/activate
# now 'python' works

Do I Need MLflow UI Running?

No. The MCP server reads directly from the SQLite database. You do NOT need to keep mlflow ui running for Claude to query your experiments.


Tech Stack

  • FastMCP — MCP server framework
  • MLflow 3.x — Experiment tracking (SQLite backend)
  • pandas — DataFrame processing for run results
  • Claude Desktop — MCP client

License

MIT

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured