webclaw-mcp

webclaw-mcp

MCP browser agent that runs as a Chrome extension. Control your real Chrome browser with 17 MCP tools — uses accessibility tree with @ref labels for low token usage. Supports WebMCP native tools.

Category
Visit Server

README

WebClaw

CI npm License: MIT

A WebMCP-native browser agent that runs inside your real Chrome — control it from Claude, Cursor, and any MCP client.

<p align="center"> <img src="docs/images/demo.gif" alt="WebClaw demo" width="860"> </p>

Why WebClaw?

WebClaw browser-use Playwright MCP
Runs in real Chrome Yes (extension) No (CDP) No (Playwright)
User's logged-in sessions Yes No No
Bot detection Resistant Vulnerable Vulnerable
WebMCP native tools Yes No No
Page understanding @ref a11y tree Screenshots DOM / Screenshots

WebClaw runs as a Chrome extension, so it sees exactly what you see — cookies, logins, extensions, and all. No headless browser, no CDP injection, no bot flags.

Quick Start

1. Configure your MCP client

<details open> <summary><b>Claude Desktop</b></summary>

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "webclaw": {
      "command": "npx",
      "args": ["-y", "webclaw-mcp"]
    }
  }
}

Config file locations:

  • macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
  • Linux: ~/.config/Claude/claude_desktop_config.json
  • Windows: %APPDATA%\Claude\claude_desktop_config.json

</details>

<details> <summary><b>Claude Code</b></summary>

claude mcp add webclaw -- npx -y webclaw-mcp

</details>

<details> <summary><b>Cursor</b></summary>

Add to .cursor/mcp.json in your project root:

{
  "mcpServers": {
    "webclaw": {
      "command": "npx",
      "args": ["-y", "webclaw-mcp"]
    }
  }
}

</details>

<details> <summary><b>VS Code (Copilot)</b></summary>

Add to .vscode/mcp.json in your project root:

{
  "servers": {
    "webclaw": {
      "type": "stdio",
      "command": "npx",
      "args": ["-y", "webclaw-mcp"]
    }
  }
}

</details>

<details> <summary><b>Windsurf</b></summary>

Add to ~/.codeium/windsurf/mcp_config.json:

{
  "mcpServers": {
    "webclaw": {
      "command": "npx",
      "args": ["-y", "webclaw-mcp"]
    }
  }
}

</details>

2. Load the Chrome extension

  1. Download webclaw-extension-v0.7.2.zip and unzip
  2. Open chrome://extensions/ → enable Developer mode
  3. Click Load unpacked → select the dist/ folder

<details> <summary>Or build from source</summary>

git clone https://github.com/kuroko1t/webclaw.git && cd webclaw && pnpm install && pnpm build

Then load packages/extension/dist/ as above.

</details>

3. Try it

Restart your MCP client and ask:

"Go to wikipedia.org and search for Model Context Protocol"

Chrome launches automatically on the first tool call. The Side Panel shows all activity in real-time.

How It Works

flowchart LR
    LLM["AI Assistant"]
    MCP["MCP Server"]
    EXT["Chrome Extension"]
    CS["Content Script"]

    LLM -- "stdio" --> MCP -- "WebSocket" --> EXT --> CS

    style LLM fill:#7b2d8b,stroke:#333,color:#fff
    style MCP fill:#1a5276,stroke:#333,color:#fff
    style EXT fill:#1e8449,stroke:#333,color:#fff
    style CS fill:#b7950b,stroke:#333,color:#fff
  1. The AI calls MCP tools (navigate_to, page_snapshot, click, ...)
  2. The MCP server forwards requests to the Chrome extension via WebSocket
  3. The content script reads the page as a compact accessibility tree with @ref labels
  4. The AI picks the right @ref and acts — no pixel coordinates, no CSS selectors

page_snapshot output

[page "Wikipedia, the free encyclopedia"]
  [search]
    [form]
      [@e3 searchbox "Search Wikipedia"]    ← AI targets this
      [@e4 button "Search"]                 ← and clicks this
  [main]
    [heading[1] "Welcome to Wikipedia"]
    [group]
      [heading[2] "From today's featured article"]
      ...

Every interactive element gets a stable @ref label. The AI reads the tree, picks the right @ref, and calls click / type_text / select_option.

MCP Tools (19)

<details> <summary><b>Page Interaction</b> — navigate, snapshot, click, hover, type, select, drop files, screenshot</summary>

Tool Parameters Description
navigate_to url, tabId? Navigate to a URL
page_snapshot maxTokens?, tabId? Get a compact accessibility tree with @ref labels
click ref, snapshotId, tabId? Click an element by its @ref label
hover ref, snapshotId, tabId? Hover over an element to reveal hidden UI (dropdowns, tooltips)
type_text ref, text, snapshotId, tabId? Type text into an input/textarea by @ref
select_option ref, value, snapshotId, tabId? Select a dropdown option by @ref
drop_files ref, snapshotId, files, tabId? Drop files onto an element (e.g., upload images to GitHub issues)
screenshot tabId? Capture the visible area of the active tab

</details>

<details> <summary><b>Tab Management</b> — new, list, switch, close</summary>

Tool Parameters Description
new_tab url? Open a new tab (optionally with a URL)
list_tabs List all open tabs with tabId, URL, title, and active state
switch_tab tabId Switch to a specific tab
close_tab tabId Close a specific tab

</details>

<details> <summary><b>Navigation</b> — back, forward, reload, wait, scroll</summary>

Tool Parameters Description
go_back tabId? Go back to the previous page
go_forward tabId? Go forward to the next page
reload tabId?, bypassCache? Reload the current page
wait_for_navigation tabId?, timeoutMs? Wait for the page to finish loading
scroll_page direction?, amount?, ref?, snapshotId?, tabId? Scroll the page or scroll to a specific element

</details>

<details> <summary><b>WebMCP</b> — discover and invoke page-declared tools</summary>

Tool Parameters Description
list_webmcp_tools tabId? Discover tools on the page (native WebMCP + auto-synthesized from buttons/forms)
invoke_webmcp_tool toolName, args?, tabId? Invoke a discovered tool (native or synthesized)

</details>

Development

git clone https://github.com/kuroko1t/webclaw.git
cd webclaw
pnpm install
pnpm build        # Build all packages
pnpm test         # Run all tests
pnpm dev          # Watch mode

Project Structure

packages/
  shared/          Type definitions, Zod schemas, utilities
  mcp-server/      MCP server with 19 tools, WebSocket bridge
  extension/       Chrome MV3 extension (service worker, content scripts, side panel)
examples/
  webmcp-demo-site/  WebMCP-enabled Todo app for testing native tool discovery

Multi-Session Support

WebClaw automatically scans ports 18080–18089, so you can run up to 10 MCP server instances simultaneously (e.g., multiple Claude Code sessions). Each instance binds to the first available port, and the Chrome extension connects to all of them.

Environment Variables

Variable Default Description
WEBCLAW_PORT 18080 WebSocket port for MCP server ↔ extension communication. When set, disables auto-scanning and uses only this port.

<details> <summary><b>Troubleshooting</b></summary>

Chrome extension not connected

  • Chrome is auto-launched when a tool is called. If it fails, start Chrome manually with the extension loaded
  • Check the Service Worker console (chrome://extensions/ → Details → Service Worker) for Connected to MCP server
  • Verify the MCP server is running (look for WebSocket server listening on 127.0.0.1:<port> in stderr)

MCP client cannot connect

  • Ensure npx webclaw-mcp runs successfully from your terminal
  • Check for port conflicts: WebClaw auto-scans 18080–18089. If all are in use, set WEBCLAW_PORT to a custom port
  • Restart your MCP client after updating the config

Content script not injecting

  • Content scripts only inject into http:// and https:// pages (not chrome://, file://, etc.)
  • Try refreshing the page after loading the extension

WebMCP tools not discovered

  • Native WebMCP requires Chrome 146+ with the navigator.modelContext API
  • Verify the page declares tools via navigator.modelContext.addTool() or <link rel="webmcp-manifest">

</details>

Contributing

Contributions are welcome! Please open an issue to discuss your idea before submitting a pull request.

License

MIT

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured