MCP Servers

berserk-mcp

Enables LLMs to answer Berserk observability questions by calling verified KQL tools instead of hand-authoring queries, with role-based tool filtering and automated query discovery.

README

berserk-mcp

An MCP server that lets an LLM answer Berserk observability questions by calling tools instead of hand-authoring KQL.

Why this matters: when you hand a model a raw query language, it guesses — wrong table names, wrong field names, subtly broken aggregations — and you pay for the retries. Every tool here wraps a verified Kusto/KQL query, so the model picks an intent (top_cpu, errors_by_service, sre_host_headroom) and the query is fixed. Determinism is the whole point. In practice this makes even small/cheap models answer observability questions reliably.

Zero dependencies. Pure Python standard library — nothing to pip beyond the package itself.
Single file. berserk_mcp.py is the entire server. Easy to read, audit, and vendor.
Cross-platform. Runs anywhere the bzrk CLI is installed, Windows included.
Safe by construction. Fixed queries, input validation on free-text tools, no shell=True, and the Berserk token never touches this code.

⚠️ Disclaimer — please read

This is an unofficial, community-built project. It is not affiliated with, sponsored by, endorsed by, or supported by the Berserk project or its maintainers. It interacts with Berserk only through the public bzrk CLI — no internal APIs, no reverse engineering.

Provided as-is, with no warranty and no liability for any use, outcome, downtime, data loss, cost incurred, or other consequence (see LICENSE). You run it at your own risk against your own infrastructure. If you point it at a production Berserk, that's your call.

Bugs, feature requests, and questions about this server: open an issue here. Anything about Berserk itself goes to the Berserk project — not us.

Why this exists

Berserk is a self-hosted observability backend: it ingests logs, metrics, and traces over OTLP and lets you query them with a Kusto-style language (KQL) through the bzrk CLI or its web UI. It's the storage and the query engine, and it assumes a human who already knows KQL.

The gap. A raw query language is the one thing LLMs are reliably bad at. Point a model at bzrk and it invents table names, mistypes fields, and burns tokens on retries. The obvious fixes — pasting the schema into the prompt, few-shot KQL examples — were tried first and didn't hold: the model kept guessing. Hardcoding the queries did.

What berserk-mcp adds. It's a translation layer in front of Berserk that exposes observability intents as MCP tools (top_cpu, errors_by_service, sre_service_health, …). Each wraps a query already verified against the live schema, so the model never authors KQL — it picks an intent and a time window. It does not replace Berserk's storage, query engine, or UI; it makes them agent-accessible and reliable on small / cheap / local models.

Beyond the fixed tools, the server adds three layers that don't exist in default Berserk:

Role lanes — tool visibility filtered by job function so each agent sees only what it needs
Discovery queue + auto-KQL worker — automated onboarding of new telemetry sources
Amendments log — every save_query write is tracked so a worker can post changelogs and keep the query store auditable

Approach	Result
Berserk web UI / `bzrk` CLI	Great for a human who knows KQL; not usable by an agent.
Point an LLM at the raw CLI + schema docs	Unreliable — models guess table/field names and pay for retries.
A generic "text-to-KQL" MCP	Still authors queries → same guessing problem, one layer up.
berserk-mcp	Fixed, verified queries → deterministic answers, even from a 7B local model.

What this adds vs. default Berserk

Berserk is a great human-facing observability backend on its own. This server doesn't replace any of it — it sits next to it and adds the agent-facing surface. Concretely:

Capability	Default Berserk	berserk-mcp
Ingest OTLP logs / metrics / traces	✅ core	reuses
KQL query engine + storage	✅ core	reuses (read-only)
Web UI + `bzrk` CLI for humans	✅ core	reuses
Token auth, profiles	✅ core	reuses (`bzrk` holds the token)
MCP surface for LLMs / agents	—	✅
Common questions answered without authoring KQL	requires correct Kusto → small models fail	✅ fixed verified tools
Role-aware tool filtering (SRE / SOC / Claude / Ops lanes)	—	✅ `BERSERK_MCP_ROLE` env var
Role primers injected at `initialize`	—	✅ KQL rules, thresholds, routing guidance per lane
Telemetry-shape discovery	partial (`.show tables`)	✅ `list_metrics` · `discover_schema` · `container_hosts`
Custom-query persistence as named, reusable tools	UI has a Query Library, but Berserk documents no API or CLI verb to create, list, or share a saved query programmatically	✅ `save_query` (verify-before-persist) → `run_saved`, agent-readable
Automated source onboarding	—	✅ `request_discovery` → worker → saved query, no KQL authoring needed
Query changelog / amendments log	—	✅ every `save_query` write tracked; worker posts Discord diff
Two-lane cost model (cheap default · on-demand `@deep`)	—	✅ tool descriptions + annotations make this safe
KQL-injection guards on free-text inputs	n/a (humans)	✅ service-name allowlist · `claude_search` reject-list

Architecture

How the lanes talk to each other and to Berserk

flowchart TB
  classDef user      fill:#0d1117,stroke:#58a6ff,color:#c9d1d9
  classDef cheap     fill:#0d3a1d,stroke:#3fb950,color:#c9d1d9
  classDef deep      fill:#3a1d0d,stroke:#d29922,color:#c9d1d9
  classDef mcp       fill:#161b22,stroke:#8b949e,color:#c9d1d9
  classDef berserk   fill:#1d1d3a,stroke:#a371f7,color:#c9d1d9
  classDef store     fill:#0d1117,stroke:#8b949e,color:#c9d1d9,stroke-dasharray:3 3

  User([User · Slack bot · agent framework]):::user

  subgraph H["MCP Host  (Claude Code · Claude Desktop · LangChain · ChatOps bot)"]
    direction TB
    Cheap["⚡ DEFAULT lane\ncheap / local model\ngpt-4.1-mini · Qwen2.5-7B · Haiku\nonly picks tools + time windows"]:::cheap
    Deep["🧠 @deep / scheduled lane\ncapable model\nsonnet · GPT-class\nauthors + verifies KQL"]:::deep
  end

  subgraph M["berserk-mcp  (stdio · JSON-RPC 2.0 · zero-dep stdlib Python)"]
    direction TB
    RoleFilter["Role filter  BERSERK_MCP_ROLE=sre|soc|claude|ops|all\ntools/list filtered at protocol level\nprimer injected at initialize"]:::mcp
    Tools["Fixed tools — verified KQL\ntop_cpu · errors_by_service · host_cpu\nlogs_for_service · sre_* · soc_* · claude_*"]:::mcp
    Disc["Discovery tools\nlist_metrics · discover_schema\ncontainer_hosts · list_services · schema"]:::mcp
    Learn["Learning loop\nsearch → save_query → run_saved\nverify-before-persist · amendments log · 500 cap"]:::mcp
    Queue[("discovery_queue.json\nknown_sources.json\namendments_log.json")]:::store
    Store[("learned.json\n~/.config/berserk-mcp")]:::store
  end

  Worker["discover-worker\ndrains queue · authors KQL · posts Discord\nruns via daily cron"]:::deep
  Bzrk["bzrk CLI\nbearer token lives only in bzrk's own 0600 config\nMCP never reads or stores it"]:::berserk

  subgraph B["Your Berserk instance"]
    direction TB
    Gw["Berserk gateway · KQL engine"]:::berserk
    Tbl[("default table\nOTLP logs · metrics · traces")]:::berserk
  end

  User -- "natural-language Q" --> Cheap
  User -- "@deep prompt · once-a-day cron" --> Deep

  Cheap -- "tools/call — role-filtered tools" --> RoleFilter
  RoleFilter --> Tools
  RoleFilter --> Disc
  RoleFilter --> Learn

  Deep -- "discover → search KQL → save_query" --> Learn
  Deep -- "request_discovery" --> Queue

  Queue --> Worker
  Worker -- "save_query per source" --> Learn
  Worker -- "Discord summary" --> User

  Tools -. "argv list (no shell)" .-> Bzrk
  Disc  -. "argv list (no shell)" .-> Bzrk
  Learn -. "verifies query before persist" .-> Bzrk
  Learn <-->|persist · reuse| Store

  Bzrk -- "read-only KQL over bearer auth" --> Gw
  Gw --> Tbl

  Learn -. "saved queries reusable by Cheap forever" .-> Cheap

Three things the diagram makes clear:

The bearer token never enters this code. bzrk owns it in its own 0600 config; the MCP shells out via an argv list (no shell, no token in process memory, no logs).
The learning loop closes back into the cheap lane. Pay the capable model once to author + verify a query; the cheap lane runs it free forever via run_saved.
The worker is the automation bridge. When request_discovery queues a new source, the worker drains it autonomously — discovers, authors KQL, saves — without operator KQL authoring.

Role lanes

Set BERSERK_MCP_ROLE to scope what an agent sees. The filter applies at the MCP protocol level — unrelated tools never appear in tools/list, so they can't be called accidentally or injected into context.

Role	`BERSERK_MCP_ROLE`	Gets	Typical agent
SRE	`sre`	Core tools + SRE tools (error rate, host headroom, ingest health, service health, top errors)	On-call Slack bot, editor assistant
SOC	`soc`	Core tools + SOC tools (high-severity logs, log spike, new services, repeated errors, incident timeline)	Security monitoring agent
Claude Code	`claude`	Core tools + Claude Code telemetry tools (sessions, tool histogram, errors, full-text search)	Developer workflow assistant
Ops	`ops`	All tools (full visibility)	Operator shell, admin scripts
Default	`all` (or unset)	All tools	Development, evaluation

Role primers

When a lane connects, the server injects a markdown primer into the MCP initialize response before the standard instructions. Primers carry:

Tool routing table — which tool to reach for first for each intent
Escalation thresholds — e.g. CPU load > 2.0, mem > 85%, error rate > 10/min, ingest lag > 30 s
KQL authoring rules — time window defaults, field name conventions, aggregation patterns
Discovery flow guidance — when to call request_discovery vs authoring ad-hoc KQL

This means no prompt engineering is needed in the agent config; the routing knowledge travels with the server.

Primers live in primers/<role>.md adjacent to the server file (or at BERSERK_MCP_PRIMERS_DIR if set). The all / ops roles receive no primer — they're expected to route from the tool descriptions directly.

Tools

Core tools (all lanes)

Tool	What it answers
`list_containers`	Containers currently sending metrics (with sample counts).
`top_cpu`	Containers ranked by CPU %. Use for container-specific questions; for host CPU use `host_cpu`.
`top_memory`	Containers ranked by memory (MB). Use for container-specific questions; for host memory use `host_memory`.
`errors_by_service`	ERROR-level log counts grouped by service.
`list_services`	All services/sources, with log vs metric breakdown.
`list_hosts`	All hosts reporting telemetry (HermesRuntime, OpenClaw, ESXi, …).
`host_cpu`	Per-host CPU (1-minute load average). Default for ambiguous whole-machine CPU questions.
`host_memory`	Per-host memory used (GB). Default for ambiguous whole-machine memory questions.
`container_hosts`	Which host/VM each container runs on (join key for container↔host questions).
`logs_for_service`	Recent log lines for one service.
`schema`	Live tables + column schema introspection.
`list_metrics`	Every metric name being ingested, with counts (discovery).
`discover_schema`	Sample rows to learn an unknown source's `resource`/`attributes` shape.
`bzrk_query_perf`	Berserk query engine latency percentiles (p50/p95/p99 in µs).
`search`	Run arbitrary KQL (escape hatch; `save_query` the result once it works).

Every query tool takes an optional since argument ("15m ago", "1h ago", "2d ago", …) with a sensible per-tool default.

Per-host vs per-container: host_cpu/host_memory report per host; top_cpu/top_memory report per container. The descriptions cross-reference each other so the model picks the right one. For ambiguous whole-machine questions ("what's hammering the server?") always prefer the host tools.

SRE tools (`sre` lane only)

Tool	What it answers
`sre_error_rate`	Error log events by service grouped per minute — "is the error rate climbing?"
`sre_host_headroom`	CPU load and memory by host — "which VM is saturated?"
`sre_ingest_health`	Berserk ingest lag and dropped data — "is observability lagging?"
`sre_service_health`	Full health summary for one named service: event volume, error count, log/metric split, last seen.
`sre_top_error_messages`	Most-repeated error messages by service — "what error should I investigate first?"

SOC tools (`soc` lane only)

Tool	What it answers
`soc_high_severity_logs`	Recent CRITICAL/FATAL log lines with service and message text.
`soc_log_spike`	Services with the largest minute-level log bursts — "anything spiking?"
`soc_new_services`	Recently first-seen services and sources — "what is new?"
`soc_repeated_errors`	Error messages that repeat persistently — probes, loops, stuck processes.
`soc_timeline`	Full incident timeline for one named service: timestamps, severity, metric names, message snippets.

Claude Code tools (`claude` lane only)

If you ship Claude Code session logs into Berserk (service name claude-code), these tools mine that data. See docs/claude-code.md for the pipeline.

Tool	What it answers
`claude_recent`	Recent Claude Code events — type, role, model, tool names, error flag.
`claude_sessions`	Sessions rollup — event counts, first/last seen, assistant turns, tool turns, error count.
`claude_tools`	Tool-use histogram — how many times each tool (Bash, Edit, Read, …) was called.
`claude_errors`	Failed tool results with message snippets.
`claude_search`	Full-text search across Claude Code message and tool bodies.

Learning loop tools (all lanes)

Tool	What it answers / does
`list_saved`	List saved queries visible to the current role. Check here before authoring new KQL.
`run_saved`	Run a saved query by name — deterministic, no KQL authoring.
`save_query`	Verify a KQL query runs, then persist it under a name (with optional role tag). Logs every write to the amendments log.

Discovery tools (all lanes)

Tool	What it does
`request_discovery`	Queue a newly-added service or metric for automated onboarding. Validates the source exists in Berserk before accepting.
`discovery_status`	List pending and completed discovery jobs.

Self-extending: discovery + learning

The fixed tools cover known telemetry. For data the server doesn't have a tool for yet — a log source you just started shipping — a two-stage loop extends the server without hand-editing code, while keeping the cheap lane deterministic.

Stage 1: Discovery queue

QUEUE    request_discovery(service="haproxy")   →  validates source, queues job
WORKER   discover-worker drains queue at 06:00  →  authors KQL by role/kind
SAVE     save_query (verify-before-persist)      →  permanent, named query
REUSE    run_saved("sre_haproxy_service")        →  cheap model, free, forever

request_discovery does one thing before accepting a job: it calls list_services (or list_metrics) to confirm the source is actually visible in Berserk. An unknown source is rejected with a clear error, so the queue never fills with phantom jobs.

The discover-worker (discover-worker.py, runs as a daily cron) drains the queue:

Chooses the right KQL template per role (sre gets a health summary, soc gets an incident timeline, claude gets a health rollup, metric kind gets a drilldown aggregation)
Calls save_query to verify and persist the result
Updates known_sources.json so the same source is never re-queued
Posts a Discord summary of completed and failed jobs

Stage 2: @deep amendments and improvements

When a capable model (@deep, a scheduled agent, or an operator) improves or corrects an existing query via save_query, the server:

Detects whether the query name already existed (action=updated) or is new (action=created)
Appends a timestamped entry to amendments_log.json with the name, description, KQL preview, role, and action
The worker reads and clears this log on the next drain run, posting a Discord changelog: ✏️ for updates, ✨ for new entries

This means the query store is auditable — every improvement made by an autonomous agent is surfaced in the team channel automatically, without any operator action.

The intended division of labour (cost-efficient):

A capable model does the rare, hard part — discover the new shape, author + verify the query, save_query it. Trigger it two ways: on a schedule (a daily job that checks the discovery queue), or on demand ("I just added HAProxy to Berserk — add support").
The cheap model reaps the result — every saved query is reusable for free via run_saved, deterministically. Authoring KQL is the one thing small models are weak at, so gate this behind the stronger model; save_query verifies the query runs before persisting as a guardrail.

This scales because learned queries live behind list_saved/run_saved, not as first-class tools — so you can learn dozens of new sources without growing the routing surface that keeps the cheap model reliable.

Worked examples

Concrete prompts you can paste into any MCP-aware client. Each shows the natural-language ask, which tools the model ends up calling, and the kind of answer you get. These all work on the cheap default lane — no frontier model required.

ChatOps: "any errors in the last hour?" (SRE lane)

Have there been any errors in the last hour, and from which service?

Calls errors_by_service (since="1h ago"). The model replies with the per-service error count, or "no errors recorded" when empty. On the SRE lane, the primer nudges it toward sre_error_rate for a time-series view if the count is above threshold.

On-call triage: "is api-gateway healthy?" (SRE lane)

Is api-gateway healthy? What's the error rate and when was it last seen?

Calls sre_service_health(service="api-gateway"). Returns total events, error count, log/metric split, and last-seen timestamp in one round trip. If error count is high, the primer's threshold guidance nudges the model to follow up with sre_top_error_messages.

SOC investigation: "what happened on journal-forwarder?" (SOC lane)

Reconstruct what happened with journal-forwarder over the last 2 hours.

Calls soc_timeline(service="journal-forwarder", since="2h ago"). Returns timestamped events with severity, metric names, and message snippets ordered newest-first — a ready-made incident narrative without any KQL authoring.

Security sweep: "anything new or anomalous?" (SOC lane)

Anything unusual in the last 30 minutes? Spikes, new sources, repeated errors?

Calls soc_log_spike, soc_new_services, and soc_repeated_errors in one pass. The SOC primer tells the model to scan all three before summarising.

Developer workflow: "what tools is Claude Code using?" (Claude lane)

What tools has Claude Code used most this week, and were there any errors?

Calls claude_tools(since="7d ago") + claude_errors. Only works if you ship Claude Code session logs into Berserk via an OTLP forwarder — see docs/claude-code.md.

Onboarding a new source

I just added HAProxy logs to Berserk. Integrate it.

(With SOUL.md / system prompt configured.) Agent calls request_discovery(service="haproxy", role_hint="sre"). The discovery worker runs overnight, authors and saves sre_haproxy_service. Next morning run_saved answers HAProxy questions on the cheap lane, permanently.

Autonomous daily health digest (cron / scheduled agent)

You are an on-call assistant. Use the Berserk MCP to:
1) Check load per host (host_cpu, host_memory) over the last 6 hours.
2) Count errors per service over the last 24 hours (errors_by_service).
3) List the top 5 noisiest containers (top_memory).
Write a 10-line digest, flag anything anomalous, and stop.

Deterministic enough to run unattended overnight on gpt-4.1-mini or a local Qwen2.5-7B. Wire it to a cron job; the answer is short and parseable.

Requirements

Python 3.8+
The bzrk CLI, installed and authenticated (bzrk -P <profile> search "..." works). The bearer token lives in bzrk's own config — this server never reads or stores it.

Install

pip install berserk-mcp
# or, isolated:
pipx install berserk-mcp
# or run without installing:
uvx berserk-mcp

From source:

git clone https://github.com/ssi0202/berserk-mcp
cd berserk-mcp
pip install .

The single file has no dependencies, so you can also just drop berserk_mcp.py somewhere and run python berserk_mcp.py.

Configure

All configuration is via environment variables — all optional:

Variable	Default	Purpose
`BZRK_BIN`	`bzrk`	Path/name of the Berserk CLI binary.
`BZRK_PROFILE`	`local`	The `bzrk` profile to query.
`BZRK_TIMEOUT`	`120`	Per-query timeout, seconds.
`BERSERK_TABLE`	`default`	The Berserk table to query.
`BERSERK_MCP_LEARNED_PATH`	platform config dir	Where saved queries persist (`~/.config/berserk-mcp/learned.json` on Linux).
`BERSERK_MCP_ROLE`	`all`	Active role lane: `sre`, `soc`, `claude`, `ops`, or `all`. Controls tool visibility and primer injection.
`BERSERK_MCP_PRIMERS_DIR`	adjacent `primers/` dir	Directory containing `<role>.md` primer files.

Connect it to a client

Claude Desktop

Add to claude_desktop_config.json (Settings → Developer → Edit Config):

{
  "mcpServers": {
    "berserk-q": {
      "command": "berserk-mcp",
      "env": {
        "BZRK_PROFILE": "local",
        "BERSERK_MCP_ROLE": "sre"
      }
    }
  }
}

If you didn't pip install it, point at the file instead:

{
  "mcpServers": {
    "berserk-q": {
      "command": "python",
      "args": ["/absolute/path/to/berserk_mcp.py"],
      "env": {
        "BZRK_PROFILE": "local",
        "BERSERK_MCP_ROLE": "sre"
      }
    }
  }
}

Claude Code

claude mcp add berserk-q -- berserk-mcp
# or from source:
claude mcp add berserk-q -- python /absolute/path/to/berserk_mcp.py

Set the role in your shell or .env:

BERSERK_MCP_ROLE=sre claude mcp add berserk-q -- berserk-mcp

Any MCP client

Launch berserk-mcp (or python berserk_mcp.py) as a stdio MCP server. It speaks newline-delimited JSON-RPC 2.0 over stdio, MCP protocol version 2025-06-18.

Choosing a model

The whole point of the fixed-query design is that the model never writes KQL — it only picks a tool and a time window. That collapses the capability bar from "can author correct Kusto" down to "can do basic tool-calling," which is what makes cheap and local models viable. Lead with the cheapest thing that works:

Local (preferred). Any Ollama/LM-Studio model with solid tool-calling: the Qwen2.5-Instruct family (7B is the sweet spot), Llama 3.1/3.3, or Mistral-Small. Tiny models (≤2B) and CPU-only prefill struggle with agentic tool-call loops — prefer GPU and ≥7B for unattended use.
Cheap API. gpt-4.1-mini, Claude Haiku, or Gemini Flash — strong tool use at a fraction of frontier cost. Good for latency-sensitive ChatOps replies.
Frontier models are rarely necessary; save them for open-ended investigations that lean on search and save_query.

Biggest reliability lever regardless of model: the tool descriptions. They're written to be narrow and unambiguous so a small model routes correctly — keep them that way if you add tools.

Security

Injection guards. logs_for_service and sre_service_health/soc_timeline validate the service name against [A-Za-z0-9._-]. claude_search rejects quotes, pipe, backslash, and backtick. Both are interpolated into KQL string literals, so this blocks single-quote injection. All other tools use fully fixed queries with no interpolation.
Read-only by construction. Every tool is annotated (readOnlyHint) and only issues read KQL. The sole exceptions are save_query (writes a local query file, never Berserk) and request_discovery (writes a local queue file).
No shell. subprocess is always invoked with an argument list (never shell=True); there is no eval.
No secrets in this code. The Berserk bearer token lives only in bzrk's own config. This server never reads, stores, or logs it.
Note on output. Tool results are whatever your telemetry contains. If logs in Berserk hold sensitive values, logs_for_service/search can surface them — redact at ingest, not here.

To report a vulnerability, see SECURITY.md.

Testing

python -m pytest tests/ -q
# 70 tests: 51 core + 19 routing eval — no live Berserk required

The tests stub the bzrk CLI, so they verify: KQL content and lock strings, default time windows, role isolation (which tools appear in which lane), injection guards, since validation, tool annotations, JSON-RPC protocol, learning loop, discovery queue deduplication, and amendments log behaviour.

Live-verified, not just unit-tested

The stubbed suite proves the server's logic is internally consistent; it can't prove the KQL actually executes correctly against a real cluster. Separately, every SRE and SOC tool has been run through this server's real dispatch path against a live Berserk deployment — and that process caught two real bugs unit tests alone couldn't surface:

soc_new_services's default 7-day window had no shard-field filter, so it scanned unindexed and timed out under real data volume. Narrowed to 24h ago; confirmed returning full results in ~28s.
sre_host_headroom returned raw bytes for memory instead of converting to GB (unlike host_memory, which already did) — summarized by a model as "1.61 billion bytes." Fixed: memory now reports in GB with an explicit unit column distinguishing it from the CPU load-average rows.

Both fixes are in the current release.

Extending — add a new tool in five minutes

The whole point of this server is fixed, verified queries — so adding a tool is a small, mechanical ritual. Aim to keep the routing surface small (~20 core tools) and let the long tail accumulate behind save_query/run_saved via the learning loop.

1. Find the KQL on a live instance. Iterate with bzrk until the query returns clean rows — names, units, sort order. Don't ship a query you haven't seen succeed against real data.

bzrk -P local search "default | where metric_name == 'system.network.io' \
  | summarize bytes=sum(value) by host=tostring(resource['host.name'])" \
  --since "1h ago"

2. Add the tool entry:

TOOLS.append({
    "name": "host_network",
    "roles": ["sre"],          # omit to make visible to all lanes
    "description": "Total network bytes (sum) per host. Per-HOST; for per-container "
                   "network use `search` for now.",
    "inputSchema": {"type": "object", "properties": _since()},
})
TITLES["host_network"] = "Per-Host Network I/O"

Wire it to the dispatcher (fixed cmd key), and add a KQL constant for the test.

3. Lock the query string with a test:

def test_q_host_net_locked(self):
    self.assertIn("system.network.io", bm.Q_HOST_NET)

4. Run the suite + re-register:

python -m pytest tests/ -q
claude mcp remove berserk-q && claude mcp add berserk-q -- berserk-mcp

A tool that touches free-text input (a service name) needs an allowlist (see logs_for_service). A tool needing two bzrk round-trips can follow discover_schema's pattern. Both are in the source as templates.

Contributing

Issues, ideas, and PRs are all welcome — see CONTRIBUTING.md for the short version. The bar is low: if the tests pass, the description is narrow, and the query has been seen working against real data, it's mergeable.

Good first contributions:

A new fixed-query tool for telemetry you actually care about
A worked example for your stack (Kubernetes, ECS, Nomad, …) under docs/
Sharpening a tool description that confused your model — the descriptions are the router; a clearer one is a real correctness improvement
Filing an issue when you hit something the server should have a tool for

License

MIT.

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured

berserk-mcp

README

berserk-mcp

⚠️ Disclaimer — please read

Why this exists

What this adds vs. default Berserk

Architecture

How the lanes talk to each other and to Berserk

Role lanes

Role primers

Tools

Core tools (all lanes)

SRE tools (sre lane only)

SOC tools (soc lane only)

Claude Code tools (claude lane only)

Learning loop tools (all lanes)

Discovery tools (all lanes)

Self-extending: discovery + learning

Stage 1: Discovery queue

Stage 2: @deep amendments and improvements

Worked examples

ChatOps: "any errors in the last hour?" (SRE lane)

On-call triage: "is api-gateway healthy?" (SRE lane)

SOC investigation: "what happened on journal-forwarder?" (SOC lane)

Security sweep: "anything new or anomalous?" (SOC lane)

Developer workflow: "what tools is Claude Code using?" (Claude lane)

Onboarding a new source

Autonomous daily health digest (cron / scheduled agent)

Requirements

Install

Configure

Connect it to a client

Claude Desktop

Claude Code

Any MCP client

Choosing a model

Security

Testing

Live-verified, not just unit-tested

Extending — add a new tool in five minutes

Contributing

License

Recommended Servers

SRE tools (`sre` lane only)

SOC tools (`soc` lane only)

Claude Code tools (`claude` lane only)