homelab-ai

homelab-ai

Self-hosted AI orchestrator that monitors and manages homelab services, exposing them as MCP tools for LLMs to drive.

Category
Visit Server

README

homelab-ai

Self-hosted AI orchestrator for your homelab — monitors your services, self-heals when they break, and exposes everything you run as MCP tools any LLM can drive.

CI PyPI Docker Python 3.11+ License: MIT Self-hosted

A FastAPI + Ollama-powered control plane that sits in front of your *arr stack, Jellyfin, qBittorrent, Paperless, Immich, Home Assistant — whatever you run — and gives you:

  • Proactive monitoring with a plugin-based agent loop and SQLite failure memory (no alert spam, no flapping fixes).
  • 3-tier auto-repair: cheap rule checks → small LLM with safe tools → big LLM with file edits, full audit log, backups before every change.
  • A single AI agent (Ollama, OpenAI-compatible, anything that speaks tool-calling) that can drive every service through one HTTP surface.
  • MCP server so Claude Desktop, Open WebUI, Cursor, or any MCP client can plug into your homelab natively.
  • Mobile PWA with chat, dashboard, and per-service deep links — installable, works on LAN or over Tailscale.
  • No vendor lock-in: local-first, your data and credentials stay on your hardware.

Status: beta. The core architecture is stable and battle-tested in a production homelab; expect rough edges around new-service plugins and the PWA polish.


Why this exists

The selfhosted scene has great individual tools — Sonarr knows about TV, Immich knows about photos, Paperless knows about scans — but nothing that lets an LLM drive all of them coherently. You can ask ChatGPT for a recipe; you can't ask your homelab "is anything broken, and if so fix it" or "find me an audiobook by Brandon Sanderson and download it" without writing custom glue per service.

homelab-ai is the glue: a config-driven plugin system where every service you run becomes both a monitored entity and a callable tool, with an AI agent that does the routing and an auto-repair loop that fixes the boring stuff while you sleep.


Quick start

Recommended — Docker:

# 1) Generate config.yaml interactively (detects Ollama, scans for services, makes an API key)
docker run --rm -it --network host -v "$PWD/data:/data" \
    ghcr.io/jeremiahm37/homelab-ai:latest --config /data/config.yaml init

# 2) Run for real
docker run -d --name homelab-ai \
    -p 9105:9105 \
    -v "$PWD/data:/data" \
    -v /var/run/docker.sock:/var/run/docker.sock:ro \
    ghcr.io/jeremiahm37/homelab-ai:latest

Open http://<your-host>:9105/app. The PWA asks for the API key on first load — paste it from data/config.yaml.

Or use docker-compose.example.yml.

Try with zero setup — demo mode:

docker run --rm -p 9105:9105 ghcr.io/jeremiahm37/homelab-ai:latest demo

Open http://localhost:9105/app — five mock services, working chat with tool-call cards, real history, no Ollama needed.

Python install (for development or running as a systemd service):

pip install homelab-ai
homelab-ai --config config.yaml init
homelab-ai --config config.yaml run

The pip path is mainly for contributors and people running homelab-ai as a host-level systemd service. For day-to-day self-hosting, use Docker — dependency isolation, single-command upgrades, and the only thing you need to back up is ./data.

LLM backend — bring your own

Works with anything that speaks the Ollama or OpenAI API:

llm:
  backend: openai_compat            # or "ollama", or "auto"
  url: https://api.openai.com/v1    # or http://localhost:11434 for Ollama
  api_key: ${OPENAI_API_KEY}
  small_model: gpt-4o-mini
  smart_model: gpt-4o
  embed_model: text-embedding-3-small

Tested with Ollama, vLLM, LiteLLM proxy, OpenAI, Anthropic-via-LiteLLM, OpenRouter, Groq, LM Studio.

Auth

Set auth.enabled: true in config.yaml (the wizard does this for you) and every request needs an X-Api-Key header or a session cookie. Optional username/password users go in auth.users with bcrypt hashes (or PBKDF2 if bcrypt isn't installed).

auth:
  enabled: true
  api_key: hk_<generated-32-char-token>
  users:
    admin: "$2b$12$..."

Modular features

Everything beyond the agent/AI core is opt-in. Toggle features in config.yaml:

features:
  metrics: {enabled: true}              # /metrics endpoint
  ntfy: {enabled: true, url: "..."}     # push notifications
  scheduler: {enabled: true, schedules: [...]}
  webhooks: {enabled: true, receivers: {...}}
  multi_llm: {enabled: true, ...}       # cheap local + paid for hard problems
  history: {enabled: true}              # /api/history/{scans,ai,fixes}
  rag: {enabled: true}                  # ChromaDB-backed document search
  mcp_http: {enabled: true}             # /mcp JSON-RPC for HTTP MCP clients

A feature that's off has zero cost — its module is never imported, its routes are never registered, its deps don't need to be installed. Heavy deps live behind pip extras:

pip install homelab-ai[metrics]    # adds prometheus-client
pip install homelab-ai[rag]        # adds chromadb
pip install homelab-ai[scheduler]  # adds croniter
pip install homelab-ai[all]        # everything

See docs/features.md for the full menu.


What it looks like

┌─────────────────────────────────────────────────────────┐
│ Mobile PWA   /  AI Chat   /  MCP Client (Claude/etc.)   │
└─────────────────────┬───────────────────────────────────┘
                      │ HTTP + tool calls
┌─────────────────────▼───────────────────────────────────┐
│        FastAPI core  (REST, MCP, WebUI, Settings)       │
│        │              │                  │              │
│        ▼              ▼                  ▼              │
│   Tool router    Service plugins    Settings store      │
│   (semantic)    (Sonarr, Radarr,   (YAML + secrets)    │
│                  Jellyfin, ...)                         │
└─────────────────────┬───────────────────────────────────┘
                      │
┌─────────────────────▼───────────────────────────────────┐
│  Agent scan loop  ──►  Tier-1 rules  ──►  Tier-2 LLM    │
│                                            │            │
│  Failure memory (SQLite) ◄─────────────────┘            │
│                                            │            │
│                                            ▼            │
│                                       Tier-3 smart      │
│                                       fixer + audit     │
└─────────────────────────────────────────────────────────┘

Features

Agent & auto-repair

  • Pluggable scan modules (container health, disk, services). Add a class, drop it in agent/modules/, done.
  • SQLite failure memory: same error doesn't fire the same fix twice in a row.
  • Tier 1 — rule-based fixes (restart container, clear cache, retry indexer).
  • Tier 2 — small LLM (≈4B params) with a limited tool catalog: enough to investigate logs and call repair tools, not enough to delete things.
  • Tier 3 — smart fixer with a bigger model (≈30B). Can read/write files, but every edit is backed up to backups/ and logged in audit_log.md for human review.
  • The agent never pushes to remote git, posts to social, or runs destructive commands without a config flag set by the operator.

AI surface

  • One agent endpoint (/api/ai/agent) that picks tools semantically and streams responses.
  • Built-in tools for the popular *arr stack, qBittorrent/Transmission/SABnzbd, Jellyfin/Plex, Immich, Paperless, Ollama.
  • Drop a Python file in mcp/custom_tools/ to add your own.
  • Tool descriptions are embedded once at startup; the router picks the smallest relevant set per query (no 50-tool wall slowing every prompt).

Service plugins

  • Each service is one class with health(), restart(), and optional tools() methods.
  • The agent uses health() for monitoring; the AI uses tools() for actions.
  • Built-in plugins cover the common *arr stack — see docs/adding-services.md to wire in anything that has an HTTP API.

Mobile PWA

  • Installable, works on LAN or Tailscale.
  • Live dashboard, AI chat, per-service status, settings editor.
  • No login by default — put it behind your existing reverse proxy / SSO (Authelia, Caddy basic-auth, Cloudflare Access).

Verification

  • python -m homelab_ai.verify runs a flow-test suite against your live config.
  • Failures write a fix-request.md file the AI agent or you can pick up.
  • Use it as a nightly systemd timer or a pre-deploy gate.

Configuration

All configuration lives in one config.yaml. Environment variables override file values, so secrets can stay out of the file. See config.example.yaml for the annotated reference.

Minimal example:

ollama:
  url: http://localhost:11434
  small_model: qwen3.5:4b    # tool-calling / intent
  smart_model: qwen3.6:35b   # repair / chat

agent:
  scan_interval: 300         # seconds
  notify:
    discord_webhook: ""      # optional

services:
  sonarr:
    url: http://sonarr:8989
    api_key: ${SONARR_API_KEY}
  radarr:
    url: http://radarr:7878
    api_key: ${RADARR_API_KEY}
  jellyfin:
    url: http://jellyfin:8096
    api_key: ${JELLYFIN_API_KEY}

Service entries map to plugins by name. Unknown names are loaded from homelab_ai/services/ or the user ~/.config/homelab-ai/services/ directory.


Adding any service (no code)

For any service that speaks HTTP-JSON, you can add support without writing Python — just a config block:

services:
  my_thing:
    plugin: generic_http
    url: http://my-thing:8080
    auth:
      type: bearer
      key: ${MY_THING_TOKEN}
    health:
      path: /healthz
    tools:
      - name: list_widgets
        description: List widgets from My Thing.
        path: /api/widgets
        params:
          limit: {type: integer, default: 10}
      - name: get_widget
        description: Get one widget by ID.
        path: /api/widgets/{id}
        params:
          id: {type: string, required: true, in: path}

Restart, and the agent monitors my_thing health every scan while the AI gains two new tools. See docs/declarative-services.md for the full schema.

The 30 built-in Python plugins (sonarr, radarr, jellyfin, nut, etc.) remain the finished build — they handle wire protocols, multi-step auth, and complex transforms. generic_http is the blueprint for everything else.

Adding a Python plugin (for cases generic_http can't handle)

# ~/.config/homelab-ai/services/my_thing.py
from homelab_ai.services.base import Service, ToolSpec

class MyThing(Service):
    name = "my_thing"

    async def health(self) -> dict:
        r = await self.http.get(f"{self.config['url']}/health")
        return {"ok": r.status == 200}

    def tools(self) -> list[ToolSpec]:
        return [
            ToolSpec(
                name="my_thing_status",
                description="Get current status of My Thing.",
                handler=self._status,
                params={},
            ),
        ]

    async def _status(self) -> dict:
        r = await self.http.get(f"{self.config['url']}/status")
        return await r.json()

Add to config.yaml:

services:
  my_thing:
    url: http://my-thing:8080

Restart. The agent will start monitoring it; the AI will gain a my_thing_status tool.

Full guide: docs/adding-services.md.


Adding your own tool (no service)

If you just want to expose a function to the AI without wiring up a whole service:

# ~/.config/homelab-ai/tools/weather.py
from homelab_ai.mcp.decorators import tool

@tool(description="Get the current temperature for a city.")
async def get_weather(city: str) -> dict:
    ...

Full guide: docs/adding-tools.md.


API overview

Endpoint Purpose
GET /api/health Liveness probe
GET /api/overview All-services snapshot for dashboards
GET /api/services List of configured services and their health
POST /api/ai/agent One-shot agent call: prompt in, tool calls + answer out
POST /api/ai/agent/stream Same, server-sent-events streaming
GET /api/agent/status Last scan, recent fixes, queued escalations
POST /api/agent/scan Trigger an immediate scan
GET /api/settings / PUT /api/settings Read / update config (mirrored to disk)
GET /mcp MCP server endpoint (for Claude Desktop, Open WebUI, etc.)
GET /app Mobile PWA
GET /docs OpenAPI Swagger UI

The full OpenAPI spec is at /openapi.json. The MCP tool catalog is at /mcp/tools.


Comparison

homelab-ai Home Assistant + LLM Vision n8n + Ollama Open WebUI alone
Native *arr / media plugins manual
Auto-repair / self-healing partial manual
MCP server (Claude/Cursor) partial
Mobile PWA out of the box
Local-first / no cloud calls
Selfhosted-LLM-friendly

Pick the one whose primitives match what you do most. homelab-ai's primitive is "a service that's monitored and AI-callable", which is the right shape for media/storage/scan workflows. Home Assistant's primitive is "a device with state and triggers", which is the right shape for IoT.


Roadmap

Tracked in GitHub issues. Near-term focuses:

  • More built-in service plugins (Home Assistant, AdGuard, Nextcloud, NUT)
  • OpenAI-compatible LLM backend (not just Ollama)
  • Web installer / config wizard for first-time users
  • Optional remote-access tunnel templates (Tailscale, Cloudflare Tunnel)
  • HA-style automation triggers ("when X then call tool Y")

Contributing

See CONTRIBUTING.md. TL;DR: discuss large changes in an issue first; new services and tools should be plugins, not core patches; no personal IPs or secrets in code.

License

MIT.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured