yo-bug
An MCP server that gives AI coding assistants QA superpowers, enabling users to report bugs by pointing, clicking, or typing while automatically capturing diagnostic data for AI-driven test-feedback-fix loops.
README
yo-bug ๐
English ยท ็ฎไฝไธญๆ
"Yo, bug!" โ Point at bugs, AI fixes them.

MCP Server that gives AI coding assistants QA superpowers. One install, then your AI handles the entire test-feedback-fix loop.
In vibe coding, the bottleneck is testing: humans find bugs but struggle to describe them. yo-bug solves this by letting users point, click, or type โ while the AI automatically receives element locations, console errors, network failures, action recordings, and annotated screenshots.
What it does
| For the Human | For the AI |
|---|---|
| Click a broken element โ done | Gets: CSS selector + computed styles + React/Vue component name |
| Draw on a screenshot โ done | Gets: annotated PNG as inline image content |
| Type a behavior bug ("scroll doesn't auto-jump") โ done | Gets: structured text feedback linked to a checklist case |
| Check off a multi-step test case | Gets: which cases passed/failed, with linked feedback per case |
| Just use the app normally | Gets: last 100 user actions + 50 console errors + failed network calls |
The AI drives the entire workflow through MCP tools. Humans never need to learn commands, configure proxies, or modify their code.
Install
npx yo-bug install
(or npm install -g yo-bug && yo-bug install if you prefer a global install)
This auto-detects your AI tool (Claude Code / Cursor / Windsurf) and writes the MCP config. One time, done forever.
How it works
AI writes code
โ AI calls start_test_session()
โ Reverse proxy starts, injects test SDK into HTML responses
โ Browser opens (zero code changes to your project)
โ AI pushes grouped test cases (8 QA dimensions, multi-step scenarios)
โ Human runs each case; pass/fail at the case level
โ When failing, picks feedback mode: element / screenshot / text
โ AI calls list_feedbacks() + get_feedback() โ sees diagnostic data
โ AI must state root cause BEFORE writing fix (enforced by tool response)
โ AI fixes โ calls resolve_feedback() โ browser shows verify card
โ Human clicks Fixed / Locate (jump to element) / Still broken
โ Loop until done
MCP Tools (9 total)
Session Control
| Tool | Description |
|---|---|
start_test_session(port?, open?, storage?) |
Start test mode: auto-detect dev server, launch reverse proxy with SDK injection, open browser. storage defaults to "project" (saves under <cwd>/.yo-bug/feedback/); pass "home" for ~/.yo-bug/ |
stop_test_session() |
Stop test mode, return session summary (feedback stats, checklist results, weak dimensions, whether dev server was auto-closed) |
Feedback
| Tool | Description |
|---|---|
list_feedbacks(status?, type?, limit?, all_sessions?) |
List submitted feedback. Defaults to current session only |
get_feedback(id) |
Full details: element info, console errors, network errors, action steps, annotated screenshot. Response includes a mandatory diagnostic protocol that forces the AI to identify root cause before writing code |
resolve_feedback(id) |
Mark as fixed โ pushes a rich verification card to the browser (with screenshot thumbnail, element selector, page URL, locate button). Human confirms |
Test Checklist (Grouped Test Cases)
| Tool | Description |
|---|---|
create_checklist(title, cases) |
Push structured test cases to browser. Each case has a title, sequential steps[], expected result, priority, and QA dimension |
get_checklist_status() |
See which cases passed/failed and any user feedback per case |
Test History
| Tool | Description |
|---|---|
save_test_record(module, ...) |
Save test results per module. Accumulates history for future reference |
get_test_history(module) |
Get historical test records. Shows frequently failing scenarios so AI prioritizes coverage |
8 QA Test Dimensions
The create_checklist tool embeds professional QA methodology. AI is guided to cover:
- Happy path โ Core functionality works end-to-end
- Empty/boundary โ Empty inputs, special chars, max length, zero/negative values
- Error states โ Offline, server errors, timeouts, recovery
- Duplicate ops โ Double-click, re-submit, concurrent requests
- State recovery โ Refresh, back/forward, deep links, tab close/reopen
- Loading/async โ Loading states, failed loads, stale data
- Responsive โ 375px mobile width, touch targets, overflow
- Interaction detail โ Tab order, Enter/Escape, disabled states, focus
Each case is multi-step (open page โ fill form โ click submit โ verify result), so the human marks a whole scenario passed/failed rather than micro-managing each step.
Three Feedback Modes
When a checklist case fails, users pick a mode that fits the bug:
| Mode | Shortcut | When to use |
|---|---|---|
| Element | Alt+E |
Visual / component bug โ click the broken element, AI gets selector + styles + framework component name |
| Screenshot | Alt+S |
Layout / styling bug โ drag to select a region, annotate with arrow / rect / circle / freehand / text |
| Text | (Checklist โ) |
Behavior / logic bug ("doesn't auto-scroll", "wrong order") โ pure description, AI uses console/network data to find cause |
Other shortcuts: Alt+X toggle mode bar, Esc exit anything.
In-Browser Feedback Management
Above the floating bug button you'll see a list icon with a red badge showing open feedback count. Click it to:
- See all open feedback you've submitted this session
- Edit description inline (click description text)
- Delete a feedback (trash icon, confirms first)
- View screenshot thumbnails
When all feedback is resolved, the button auto-hides.
Auto-Captured Context
Every feedback submission automatically includes:
- Console errors โ last 50
console.error/console.warnentries with stack traces - Network failures โ last 50 failed
fetch/XHRrequests with status, URL, duration - Unhandled exceptions โ last 50
window.onerror+unhandledrejectionevents - Action recording โ last 100 user actions (click / input / navigate / scroll / keypress) with timestamps
- Element info โ CSS selector, tag, text content, bounding rect, computed styles, React/Vue component name (when in element mode)
Verify-Fix Flow
When AI calls resolve_feedback(), a rich verify card appears on the left:
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ โผ VERIFY FIX [3] โ โ collapsible header with count
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ [thumbnail*] "Button click โ
โ does nothing" โ
โ bug ยท element โ
โ ๐ <button> #submit-btn โ โ only when element info exists
โ ๐ /orders โ
โ โฑ 5 min ago โ
โ [๐ฏ Locate*] [โ] [โ] โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
* thumbnail only if a screenshot was attached
* Locate only if an element selector was captured
- ๐ฏ Locate scrolls to the element + flashes a red pulse twice. If the verify card was submitted on a different URL, it navigates there first. If the element no longer exists on the current page, shows a toast
- โ Fixed confirms the fix โ status: resolved
- โ Still broken sends it back โ status: open (AI sees it in next
list_feedbackscall) - Click the header bar to collapse the whole stack when it gets in the way
Theming
- Auto-detects
prefers-color-scheme(light or dark) - All UI uses design tokens, contrast-verified (WCAG AA)
- Glass-morphism panels with backdrop blur; clearly distinct from white host pages
Works Inside Modal Dialogs
If your app opens a <dialog>.showModal() (which traps focus and inerts the rest of the page), yo-bug still works: it detects the open modal via the :modal selector plus a MutationObserver, and relocates the SDK host inside the dialog so focus is shared.
This means you can submit text feedback even when a modal is open โ the textarea will accept input normally. When the dialog closes, the host moves back out.
i18n
SDK auto-detects <html lang="...">:
- Any value starting with
zh(zh,zh-CN,zh-Hans, ...) โ Chinese interface - Everything else โ English interface
MCP tool descriptions are in English (AI translates to the user's language naturally in chat).
Architecture
Browser โ Reverse Proxy (localhost:3695+) โ Dev Server (localhost:5173)
โ
โโ Auto-injects SDK into HTML responses
โโ WebSocket passthrough (HMR works normally)
โโ Feedback API (POST/GET/PATCH/DELETE)
โโ Checklist API (push/poll/update)
โโ Verify API (push/confirm)
MCP Server (stdio) โ AI Tool (Claude Code / Cursor / Windsurf)
โ
โโ start/stop_test_session โ controls proxy lifecycle
โโ feedback tools โ reads user submissions w/ diagnostic protocol
โโ checklist tools โ pushes grouped test cases, reads results
โโ history tools โ persists test records per module
The proxy auto-detects dev server framework (Vite, Next.js, CRA, Webpack, Nuxt, Angular, Svelte, Astro) and port. If the dev server is already running, it connects. If not, it starts one. On stop_test_session it cleans up โ and tells the AI whether the dev server was auto-closed or left running (because the user had started it themselves).
If port 3695 is taken, yo-bug auto-increments to 3696, 3697... up to 3795. Multiple projects can run yo-bug simultaneously without conflict.
Security
- All data stays local โ feedback files in
<project>/.yo-bug/feedback/by default (auto-.gitignore'd), or~/.yo-bug/if you opt in - Feedback IDs validated against path traversal
- Input fields whitelist-filtered and length-limited
- Network interceptor uses exact pathname matching (no substring false positives)
- No data sent to any external service
Requirements
- Node.js >= 18
- Any MCP-compatible AI tool
- A current-version browser (Chrome / Edge / Firefox / Safari)
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.