open-monitor
Multi-machine system monitor with a built-in MCP server that enables AI agents to query health metrics, manage processes, schedule cron jobs, and run diagnostics across local and remote machines.
README
open-monitor
System monitor for AI agents and humans — watch multiple machines, detect problems, take action.
Features
- 🖥️ Multi-machine — monitor local, SSH, and EC2 machines from one place
- 📊 Metrics — CPU, memory, disk, GPU, load average, uptime, processes
- 🤖 MCP server — AI agents can query health, kill processes, manage cron jobs, and run doctor checks
- 🔍 Doctor — built-in health checks with actionable diagnostics and auto-remediation
- 💀 Process manager — detect zombies and orphans, smart kill policy with signal selection and safe list
- 📦 Container monitoring — inspect Docker/Podman/Nerdctl containers, resource usage, and logs
- 🚪 Port scanner — see which TCP/UDP ports are listening on one machine or across the fleet
- 🔄 Cron jobs — schedule actions per machine with full cron syntax
- 📨 Fleet reports — generate daily/weekly health summaries and deliver them via open-conversations or open-emails
- 🌐 Web dashboard — dark-themed real-time gauges served at
http://localhost:3848(like NVIDIA DGX Dashboard) - 🔎 Full-text search — search across machines, alerts, and processes
- 🔗 Integrations — open-todos, open-conversations, open-mementos, open-emails
- 💾 SQLite by default — zero-config persistence; optional PostgreSQL for production
- 🐚 Shell completions — zsh and bash completions included
Install
bun install -g @hasna/monitor
Or with npm:
npm install -g @hasna/monitor
Quick Start
# Check local machine
monitor status
# Add a remote SSH machine
monitor add linux-node-a --type ssh --host linux-node-a.example.com --user ubuntu --key ~/.ssh/id_ed25519
# Show all machines
monitor machines
# Run health checks
monitor doctor
# List processes, filter for zombies
monitor ps --filter zombies
# Search across everything
monitor search "high cpu"
# Start the web dashboard
monitor serve --web
MCP Setup
# Claude Code (recommended)
claude mcp add --scope user monitor -- monitor-mcp
# Or manually add to ~/.claude.json mcpServers:
# "monitor": { "type": "stdio", "command": "monitor-mcp", "args": [] }
# Codex — add to ~/.codex/config.toml:
# [mcp_servers.monitor]
# command = "monitor-mcp"
HTTP mode
Run a shared Streamable HTTP MCP server (stateless, 127.0.0.1 only):
monitor-mcp --http
# or: MCP_HTTP=1 monitor-mcp
# default port: 8826 (override with --port or MCP_HTTP_PORT)
Endpoints: GET /health, POST /mcp (Streamable HTTP).
Available MCP Tools
| Tool | Description |
|---|---|
monitor_snapshot |
Get current metrics snapshot (CPU, memory, disk, GPU) |
monitor_health |
Run health checks and return pass/warn/fail status |
monitor_processes |
List running processes with optional filters |
monitor_apps |
List installed apps/packages or compare inventories across machines |
monitor_service |
List or control system services and detected dev servers |
monitor_containers |
List containers and resource usage on one or all machines |
monitor_container_logs |
Fetch recent logs for one container |
monitor_ports |
List listening TCP/UDP ports on one or all machines |
monitor_tailscale |
Show Tailscale peer status, IPs, health, and latency |
monitor_temperature |
Show CPU/GPU temperatures, fan speeds, and thermal alerts |
monitor_mcp_health |
Check Claude MCP registration health and dead tmux panes |
monitor_mcp_status |
Show MCP server health with best-effort matched process details |
monitor_mcp_restart |
Restart a matched MCP process and re-check health |
monitor_kill |
Kill a process by PID with configurable signal |
monitor_machines |
List all configured machines |
monitor_add_machine |
Add a new machine to monitor |
monitor_alerts |
List recent alerts for a machine |
monitor_cron_jobs |
List scheduled cron jobs |
monitor_doctor |
Run the doctor and get remediation suggestions |
monitor_search |
Full-text search across machines, alerts, processes |
monitor_register_agent |
Register an AI agent for heartbeat tracking |
monitor_heartbeat |
Send a heartbeat to indicate agent is alive |
monitor_set_focus |
Set current focus/task for an agent |
monitor_list_agents |
List all registered agents and their status |
monitor_configure_integrations |
Update integration settings |
monitor_send_feedback |
Submit feedback |
CLI Reference
monitor <command> [options]
| Command | Description |
|---|---|
status [machine] |
Show current system snapshot (CPU, memory, disk, GPU) |
machines |
List all configured machines |
add <name> |
Add a machine to monitor |
doctor [machine] |
Run health checks with diagnostics |
ps [machine] |
List processes, with optional filter |
kill <pid> |
Kill a process by PID |
alerts [machine] |
Show recent alerts |
apps [machine] |
List installed apps/packages or compare them across machines |
compare-apps |
Compare installed apps across all configured machines |
service <action> [name] |
List or control system services and detected dev servers |
containers [machine] |
Show containers and resource usage |
ports [machine] |
Show listening TCP/UDP ports |
tailscale [machine] |
Show Tailscale peer status, IPs, health, and latency |
temperature [machine] |
Show CPU/GPU temperatures, fan speeds, and thermal alerts |
mcp-health [machine] |
Check Claude MCP registration health and dead tmux panes |
mcp-status [machine] |
Show MCP health plus best-effort matched process PIDs, memory, and uptime |
mcp-restart <name> |
Restart a matched MCP process if one is running, then re-check health |
report |
Build a daily fleet health report |
report --send |
Deliver the current report via configured integrations |
| `report --schedule daily | weekly` |
cron list |
List scheduled cron jobs |
cron add <name> <schedule> <command> |
Add a cron job |
cron run <job-id> |
Run a cron job immediately |
search <query> |
Full-text search |
migrate |
Run database migrations |
integrations list |
List integration status |
integrations test <name> |
Test an integration |
serve |
Start the API server |
serve --web |
Start the API + web dashboard |
mcp |
Start the MCP server (stdio) |
sync push |
Push metrics to remote store |
sync pull |
Pull metrics from remote store |
sync status |
Show sync status |
completions zsh |
Print zsh completions |
completions bash |
Print bash completions |
completions install |
Auto-install shell completions |
mcp-status and monitor_mcp_status use live process snapshots, so stdio servers can report connected with processCount: 0 when no long-lived child is present at the instant of collection.
monitor add
# Local machine (default)
monitor add mybox
# SSH machine
monitor add linux-node-a \
--type ssh \
--host linux-node-a.example.com \
--user ubuntu \
--key ~/.ssh/id_ed25519 \
--port 22
# EC2 machine (uses AWS SSM)
monitor add prod-api \
--type ec2 \
--instance-id i-0abc123def456789 \
--region us-east-1 \
--profile my-aws-profile
monitor ps
monitor ps # all processes
monitor ps --filter zombies # zombie processes only
monitor ps --filter orphans # orphan processes
monitor ps --limit 20 # top 20 by CPU
monitor ps --json # raw JSON output
monitor kill
monitor kill 1234 # SIGTERM (default)
monitor kill 1234 --signal SIGKILL
monitor kill 1234 --signal 9
monitor apps
monitor apps # local package/app inventory
monitor apps macos-node-b # one remote machine
monitor apps --all # inventories for all configured machines
monitor apps --compare # highlight missing/version-skewed/root-owned installs
monitor compare-apps # dedicated cross-machine consistency report
monitor apps --json
monitor service
monitor service list # show system services plus detected dev servers
monitor service list --machine macos-node-b # inspect one remote machine
monitor service start postgresql # systemd / brew / launchctl start
monitor service restart nginx # systemd / brew / launchctl restart
monitor service stop vite:12345 # stop a detected dev server by PID-backed name
monitor service list --json
monitor temperature
monitor temperature # local thermal snapshot
monitor temperature linux-node-a # one remote machine
monitor temperature --all # inspect all configured machines
monitor temperature --json
monitor containers
monitor containers # local container list
monitor containers linux-node-a # remote machine containers
monitor containers --all # all configured machines
monitor containers --logs api # recent logs for one container
monitor containers --logs api --tail 200
monitor containers --json
monitor ports
monitor ports # local listeners
monitor ports linux-node-a # one remote machine
monitor ports --all # scan all configured machines
monitor ports --protocol tcp # filter by protocol
monitor ports --json # raw JSON output
monitor tailscale
monitor tailscale # local Tailscale graph
monitor tailscale linux-node-a # one remote machine
monitor tailscale --all # inspect all configured machines
monitor tailscale --json # raw JSON output
monitor report
monitor report # preview daily fleet report
monitor report --period weekly # preview weekly fleet report
monitor report --send # send via configured conversations/emails integrations
monitor report --schedule daily # create/update a 9:00 daily cron report
monitor report --schedule weekly # create/update a Monday 9:00 weekly cron report
Web Dashboard
monitor serve --web
# Opens: http://localhost:3848
The dashboard shows:
- Real-time CPU, memory, and disk gauges per machine
- Recent alerts with severity indicators
- Process table with sort and filter
- Doctor check results
- Cron job schedule
Default port is 3848. Override in config: "webPort": 9000.
Configuration
Config is stored at ~/.hasna/monitor/config.json. Set MONITOR_CONFIG_DIR
to use a different config/database directory for CI, tests, or isolated agent
runs.
{
"machines": [
{
"id": "local",
"label": "Local Machine",
"type": "local",
"pollIntervalSecs": 30,
"tags": ["dev"]
},
{
"id": "linux-node-a",
"label": "Spark Node 01",
"type": "ssh",
"ssh": {
"host": "linux-node-a.example.com",
"port": 22,
"username": "ubuntu",
"privateKeyPath": "~/.ssh/id_ed25519"
},
"pollIntervalSecs": 60,
"tags": ["production", "spark"]
},
{
"id": "prod-api",
"label": "Prod API Server",
"type": "ec2",
"ec2": {
"instanceId": "i-0abc123def456789",
"region": "us-east-1",
"profile": "my-aws-profile"
}
}
],
"thresholds": {
"cpuPercent": 90,
"memPercent": 90,
"diskPercent": 85,
"loadAvg": 10
},
"dbPath": "~/.hasna/monitor/monitor.db",
"apiPort": 3847,
"webPort": 3848,
"integrations": {
"todos": {
"enabled": true,
"project_id": "my-project-id"
},
"conversations": {
"enabled": true,
"space_id": "my-space-id"
},
"emails": {
"enabled": true,
"to": "alerts@example.com"
}
}
}
Alert Thresholds
| Field | Default | Description |
|---|---|---|
cpuPercent |
90 | Alert when CPU exceeds this % |
memPercent |
90 | Alert when memory exceeds this % |
diskPercent |
85 | Alert when any disk exceeds this % |
loadAvg |
10 | Alert when 1-min load average exceeds this |
Integrations
open-monitor integrates with the open-* ecosystem to surface alerts where you're already working.
open-todos
Creates tasks when critical alerts fire.
"todos": {
"enabled": true,
"project_id": "your-project-id",
"base_url": "http://localhost:3000"
}
open-conversations
Posts alerts to a team space.
"conversations": {
"enabled": true,
"space_id": "your-space-id",
"base_url": "http://localhost:3001"
}
open-mementos
Stores alert history as memories for AI agent context.
"mementos": {
"enabled": true,
"base_url": "http://localhost:3002"
}
open-emails
Sends email notifications for critical alerts.
"emails": {
"enabled": true,
"to": "alerts@example.com",
"from": "monitor@yourdomain.com",
"base_url": "http://localhost:3003"
}
Test any integration:
monitor integrations test todos
monitor integrations test conversations
monitor integrations test emails
Shell Completions
# Install automatically (detects your shell)
monitor completions install
# Or manually:
monitor completions zsh >> ~/.zshrc
monitor completions bash >> ~/.bashrc
Database
By default uses SQLite at ~/.hasna/monitor/monitor.db. For production or multi-agent setups, use PostgreSQL:
Set DATABASE_URL environment variable:
export DATABASE_URL="postgres://user:pass@localhost:5432/monitor"
monitor migrate
Security
- Process command lines are automatically redacted before being returned to AI agents — passwords, tokens, API keys, and secrets are replaced with
*** - See SECURITY.md for the security policy and responsible disclosure process
License
Apache-2.0 — see LICENSE for details.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.