mcp-unified-automation
A custom MCP server for Playwright browser automation and system control, providing 25 tools to enable Claude Code to perform web interactions and system tasks via natural language.
README
mcp-unified-automation
Custom MCP server — Playwright browser automation + system control, 25 tools, stdio transport.
Launched via RUN_MCP_FAST.bat on George's Desktop. Connects to Claude Code as a local MCP server.
What it does
Runs a headed Chrome instance using George's actual Chrome profile (User Data dir), so the browser has real auth cookies — logged into Google, Search Console, Gmail, everything. Claude operates it through 25 registered MCP tools.
Tool groups:
| Group | Count | What it covers |
|---|---|---|
browser_* |
7 | Backward-compatible Playwright tools (navigate, click, type, screenshot, etc.) |
system_* |
6 | Shell commands, file read/write, process control |
session_* |
2 | Named browser session management (multiple tabs/contexts) |
task_* |
5 | Task DSL engine — plan, run, resume, pause, commit |
observe_* |
1 | DOM observation / mutation watching |
network_* |
2 | Request blocking + API endpoint discovery |
evidence_* |
1 | Export + hash-chain verification of browser actions |
metrics_* |
1 | Step-level timing and performance reporting |
Setup
git clone https://github.com/gugosf114/mcp-unified-automation
cd mcp-unified-automation
npm install
cp .env.example .env
# Edit .env if your Chrome User Data path differs
npm run build
Then double-click RUN_MCP_FAST.bat (or right-click → Run as administrator if system tools need elevation).
.env config
BROWSER_HEADED=true # Show the browser window
BROWSER_BLOCK_MEDIA=false # Block images/video to speed up crawls
HUMAN_DELAY_MIN=50 # ms — min delay between actions
HUMAN_DELAY_MAX=200 # ms — max delay (human-like pacing)
FAST_MODE=true # Skip networkidle waits + nav waits on click (10x faster)
CHROME_USER_DATA_DIR=C:\Users\georg\AppData\Local\mcp-unified-automation\chrome-profile
CHROME_USER_DATA_DIR points to a dedicated Chrome profile so sessions, cookies, and logins carry over. Must NOT be Chrome's default user-data-dir (Chrome blocks DevTools on it).
FAST_MODE skips Playwright's networkidle wait after every navigation and click. Modern sites (GA4, Gmail, etc.) never truly go network-idle due to analytics pings and websockets, so the default behavior just burns 10-30 seconds hitting the timeout. With FAST_MODE, pages are interactable as soon as the DOM loads.
Claude Code config
Add to your Claude Code MCP config (claude_desktop_config.json or equivalent):
{
"mcpServers": {
"unified-automation": {
"command": "node",
"args": ["C:\\path\\to\\mcp-unified-automation\\dist\\index.js"]
}
}
}
Or just run the BAT — it handles the node invocation.
Architecture
src/
index.ts — server entry, registers all tool groups
kernel.ts — wires up shared managers (session, task, observer, network, evidence, metrics)
tools/ — one file per tool group
cdp/ — Chrome DevTools Protocol bridge
session/ — named session + page slot management
task/ — task DSL (plan/run/resume/pause/commit)
observer/ — DOM mutation bus
network/ — request interceptor + API discovery
evidence/ — action ledger + hash chain
metrics/ — step timing engine
checkpoint/ — mid-task state snapshots
recovery/ — resume after crash/disconnect
policy/ — action policy enforcement
types/ — shared TypeScript types
Stack
- Node.js (ESM)
- TypeScript 5.7
- Playwright 1.52
@modelcontextprotocol/sdk1.12- Zod (schema validation)
What's New (March 2026)
Features you can use
-
Domain readiness profiles (
src/readiness.ts) — The server now knows how to wait for LinkedIn, Gmail, Google Search, GitHub, Yelp, Facebook, and Instagram. Instead of waiting fornetworkidle(which times out on every modern site), it waits for the actual content selector that proves the page loaded. You don't call this directly — it fires automatically on everynavigate,goto, andwarmcall. If you add a new site you automate frequently, add its selector toreadiness.ts. -
Selective evidence mode — New
.envsettingEVIDENCE_MODE=selectiverecords screenshots/DOM snapshots only on errors, approval gates, and the first/last step of a task. Set tofullfor audit trails,noneto skip entirely. Controlled in.env, no code changes needed. -
Semantic tool returns —
browser_navigate,browser_click, andsession_opennow return richer data:readyState, whether navigation occurred, whether the readiness selector matched, the domain profile used. This means Claude doesn't need a follow-upgetPageInfocall after every action — the info comes back in the tool response. Fewer round trips = faster task execution. -
Task management tools renamed —
task.list→task_list,task.status→task_status,task.cancel→task_cancel. Claude Desktop rejects dots in tool names. If you had prompts referencing the old names, update them.
Performance & safety (automatic, no action needed)
-
FAST_MODE — When
FAST_MODE=truein.env: zero human delays, skipnetworkidlewaits, sparse checkpoint writes (every 3rd step instead of every step), evidence recording followsEVIDENCE_MODEsetting. Already enabled in your.env. -
Stable path resolution — Data directories (
data/checkpoints/,data/evidence/) now resolve from the module's own location (import.meta.url), notprocess.cwd(). This fixed the crash when Claude Desktop spawned the server fromC:\WINDOWS\system32. Centralized insrc/env.tsasDATA_ROOT. -
Retry with exponential backoff (
src/action/retry.ts) — Network-sensitive actions auto-retry on transient failures. -
Conditional steps & parallel runner — Task DSL now supports
conditionfields on steps and a parallel step executor for independent actions.
Notes
browser_*tools use a”default”page slot for backward compatibility — existing Claude prompts that callbrowser_navigateetc. still work unchanged.- The task engine supports mid-run pause/resume — useful for long multi-step jobs that might hit context limits.
- Evidence ledger produces a hash-chained audit trail of every browser action. Useful for compliance work.
- Recovery module handles reconnect after Claude Code crashes or context resets.
Execution Policy
This server runs in operator-first mode.
Reversible automation and read/write navigation steps are allowed by default — no approval gate required. Human approval is required only for:
- Financial actions (payments, subscriptions, transactions)
- Destructive actions (deletes, overwrites, sends that cannot be undone)
- Explicit pproval_gate checkpoints defined in the task DSL
The operator, not the client UI, is the final authority over execution policy. Approval prompts are task-level constructs, not UI-level guardrails.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.