cloakbrowser-mcp
Stealth browser automation for AI agents, using source-patched Chromium to bypass bot detection systems like Cloudflare, reCAPTCHA, and FingerprintJS.
README
CloakBrowser MCP
<!-- mcp-name: io.github.ColorSource/cloakbrowser-mcp -->
Stealth browser automation for AI agents — a Model Context Protocol server combining CloakBrowser's anti-detection with Playwright MCP-inspired architecture.
CloakBrowser is a source-level patched Chromium that passes Cloudflare Turnstile, reCAPTCHA v3 (0.9 score), FingerprintJS, BrowserScan, and 30+ bot detection services.
Why CloakBrowser MCP?
| Feature | Playwright MCP | CloakBrowser MCP |
|---|---|---|
| Anti-detection | ❌ None | ✅ Source-patched Chromium |
| Cloudflare bypass | ❌ | ✅ |
| reCAPTCHA v3 | ❌ | ✅ 0.9 score |
| Snapshot-first | ✅ | ✅ |
| Markdown extraction | ❌ | ✅ Readability-style |
| Annotated screenshots | ❌ | ✅ browser-use style |
| Smart page settling | Basic | ✅ MutationObserver + networkidle |
| Auto-retry clicks | ❌ | ✅ |
| Humanized input | ❌ | ✅ Mouse curves, keyboard timing |
| Capability gating | ✅ --caps | ✅ --caps |
Quick Start
Install
pip install cloakbrowser-mcp
Use with Claude Desktop
Add to your claude_desktop_config.json:
{
"mcpServers": {
"cloakbrowser": {
"command": "cloakbrowser-mcp"
}
}
}
Use with VS Code / Cursor
Add to .vscode/mcp.json:
{
"servers": {
"cloakbrowser": {
"command": "cloakbrowser-mcp",
"args": ["--caps", "all"]
}
}
}
Use with Hermes Agent
Add to ~/.hermes/config.yaml:
mcp_servers:
cloakbrowser:
command: cloakbrowser-mcp
args: ["--caps", "all"]
timeout: 120
How It Works
Snapshot-First Architecture
CloakBrowser MCP uses accessibility tree snapshots as the primary way for AI models to understand web pages — not screenshots, not raw HTML.
1. browser_launch() → Start stealth browser
2. browser_navigate(pid, url) → Go to page (auto-waits for settle)
3. browser_snapshot(pid) → Get interactive elements with [@eN] refs
4. browser_click(pid, '@e5') → Click element by ref
5. browser_type(pid, '@e3', 'hello') → Type into input
6. browser_read_page(pid) → Get content as clean markdown
7. browser_close() → Done
Each interactive element gets a [@eN] ref ID. All interaction tools use these refs — no CSS selectors needed.
Three Ways to See a Page
browser_snapshot()— Accessibility tree with[@eN]refs. Fast, cheap, reliable. Use this.browser_read_page()— Clean markdown extraction. For reading content, not interacting.browser_screenshot()— Annotated screenshot with element indices. For visual context (images, charts, CAPTCHAs).
Stealth by Default
All anti-detection features are ON by default:
- Source-patched Chromium binary (not Playwright patches — actual Chromium source modifications)
- Human-like mouse curves, keyboard timing, and scroll patterns (
humanize=True) - Stealth fingerprint arguments (consistent canvas, WebGL, audio fingerprints)
- Proxy support with GeoIP-based timezone/locale detection
Tools
Core Tools (25 — always available)
| Tool | Description |
|---|---|
browser_launch |
Start stealth browser (all anti-detection ON) |
browser_close |
Close browser and release resources |
browser_snapshot |
PRIMARY — accessibility tree with [@eN] refs |
browser_click |
Click element by ref (auto-retry) |
browser_type |
Humanized type into input by ref |
browser_select |
Select dropdown option by ref |
browser_hover |
Hover over element by ref |
browser_drag |
Humanized drag from one ref to another |
browser_check |
Check/uncheck checkbox by ref |
browser_read_page |
Page content as clean markdown |
browser_extract_links |
Extract page links as structured JSON |
browser_get_images |
Extract page images as structured JSON |
browser_screenshot |
Annotated screenshot with element indices |
browser_navigate |
Go to URL (auto-waits for settle) |
browser_back |
Navigate back in history |
browser_forward |
Navigate forward in history |
browser_press_key |
Press keyboard key |
browser_scroll |
Scroll page up/down |
browser_wait |
Wait for page to settle |
browser_wait_for_text |
Wait for page text to appear |
browser_wait_for_ref |
Wait for an element ref state |
browser_evaluate |
Execute JavaScript in page |
browser_new_page |
Open new page/tab |
browser_list_pages |
List all open pages |
browser_close_page |
Close a specific page |
Capability-Gated Tools (enabled via --caps)
Enable with cloakbrowser-mcp --caps network,cookies,pdf,console or --caps all.
| Tool | Capability | Description |
|---|---|---|
browser_network_intercept |
network | Block/mock/passthrough requests |
browser_network_continue |
network | Remove interception rule |
browser_get_cookies |
cookies | Get all cookies |
browser_set_cookies |
cookies | Set cookies |
browser_pdf |
Save page as PDF | |
browser_console |
console | Get browser console output |
Configuration
CLI Options
cloakbrowser-mcp [--caps CAPS] [--transport {stdio,sse}] [--port PORT]
--caps: Comma-separated capabilities:network,cookies,pdf,console,all--transport: MCP transport —stdio(default) orsse--port: Port for SSE transport (default: 8931)
Environment Variables
| Variable | Default | Description |
|---|---|---|
CLOAKBROWSER_LOG_LEVEL |
INFO |
Log level |
CLOAKBROWSER_LOG_FILE |
~/.cloakbrowser/logs/server.log |
Log file path |
CLOAKBROWSER_LOG_STDERR |
false |
Also log to stderr |
Launch Options
browser_launch(
headless=True, # False for headed mode (some sites require it)
proxy="http://...", # Residential proxy recommended
humanize=True, # Human-like input (ON by default)
stealth_args=True, # Stealth fingerprints (ON by default)
timezone="America/New_York",
locale="en-US",
geoip=False, # Auto-detect from proxy IP
fingerprint_seed="my-identity", # Consistent fingerprint across sessions
user_data_dir="/path", # Persistent profile
)
Architecture
cloakbrowsermcp/
├── server.py # FastMCP server, tool registration, error handling
├── session.py # Browser lifecycle, page management, ref storage
├── snapshot.py # Accessibility tree JS, ref resolution
├── markdown.py # Readability-style HTML-to-markdown extraction
├── vision.py # Annotated screenshots with element indices
├── waiting.py # Smart wait, page settle, retry logic
├── stealth.py # Stealth config inspection
├── network.py # Network intercept, cookies (capability-gated)
├── __init__.py
└── __main__.py
Design Principles
- Snapshot-first — Tool descriptions steer models to use
browser_snapshot()as the primary page understanding tool - Ref-based only — No CSS selector tools. All interaction via
[@eN]refs from snapshots - Stealth by default — Anti-detection, humanization, and stealth args all ON without configuration
- Auto-snapshot after actions — Click, type, navigate all return an updated snapshot
- Smart waiting — Auto-wait on navigation (networkidle + MutationObserver settle), auto-retry on failed clicks
- Capability gating — Advanced tools (network, cookies, PDF) off by default to keep tool count low
- Clean content extraction — Markdown for reading, snapshot for interaction, annotated screenshots for vision
Development
git clone https://github.com/overtimepog/CloakMCP
cd CloakMCP
pip install -e ".[dev]"
pytest
License
Apache-2.0
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.