Chrome Extension MCP Bridge
Enables AI agents to control the Google Chrome browser through a Node.js WebSocket bridge and a dedicated browser extension. It provides tools for capturing screenshots, executing JavaScript, managing tabs, and extracting page content via the MCP protocol.
README
Chrome Extension MCP Bridge
AI Agent controls Chrome browser via MCP protocol + Chrome Extension + WebSocket bridge
Architecture
[AI Agent]
⇅ (MCP Tools via stdio)
[Node.js App — MCP Server + WebSocket Server]
⇅ (WebSocket on port 7890)
[Chrome Extension (Manifest V3)]
⇅ (chrome.* APIs)
[Browser Tabs]
How It Works
- AI Agent calls MCP tools (e.g.,
capture_screenshot,execute_js) via the MCP protocol (stdio transport) - Node.js Server receives the MCP tool call, generates a
requestId, and sends a JSON command over WebSocket to the Chrome Extension - Chrome Extension (service worker) receives the command, executes the browser action using Chrome APIs, and sends back a JSON response with the same
requestId - Node.js Server resolves the pending Promise and returns the result to the AI Agent
Project Structure
/chrome-extension-mcp
├── extension/ # Chrome Extension (Manifest V3)
│ ├── manifest.json # Extension manifest
│ ├── background.js # Service worker — WS client + action handlers
│ └── content.js # Content script (placeholder)
├── server/ # Node.js backend
│ ├── index.js # Entry point — starts WS + MCP servers
│ ├── wsServer.js # WebSocket server — client management + command dispatch
│ ├── mcpServer.js # MCP server — tool registration via @modelcontextprotocol/sdk
│ ├── cli.js # CLI tool for testing commands
│ ├── tools/ # MCP tool definitions
│ │ ├── captureScreenshot.js
│ │ ├── getHtml.js
│ │ ├── executeJs.js
│ │ ├── openTab.js
│ │ ├── listTabs.js
│ │ └── closeTab.js
│ └── package.json # Server dependencies
├── package.json # Root scripts
└── README.md # This file
MCP Tools
| Tool | Description | Input |
|---|---|---|
capture_screenshot |
Capture a tab screenshot as base64 PNG | { tabId?, title? } |
get_html |
Get full HTML source of a page | { tabId? } |
execute_js |
Execute JavaScript in page context | { tabId?, script } |
open_tab |
Open a new browser tab | { url } |
list_tabs |
List all open tabs | {} |
close_tab |
Close a tab by ID | { tabId } |
Additional Extension Actions (available via CLI/WebSocket)
| Action | Description | Payload |
|---|---|---|
focus_tab |
Focus/activate a tab | { tabId } |
get_text |
Extract visible text from a page | { tabId? } |
get_element |
Get element by CSS selector | { tabId?, selector } |
Communication Protocol
Request (Server → Extension)
{
"requestId": "550e8400-e29b-41d4-a716-446655440000",
"action": "capture_screenshot",
"payload": {
"tabId": 123
}
}
Response (Extension → Server)
{
"requestId": "550e8400-e29b-41d4-a716-446655440000",
"success": true,
"data": {
"tabId": 123,
"format": "png",
"base64": "iVBORw0KGgo..."
},
"error": null
}
Error Response
{
"requestId": "550e8400-e29b-41d4-a716-446655440000",
"success": false,
"data": null,
"error": "No active tab found"
}
Setup Instructions
Prerequisites
- Node.js v18+ (LTS recommended)
- Google Chrome browser
- npm package manager
1. Install Server Dependencies
cd server
npm install
Or from the project root:
npm run install:server
2. Start the Node.js Server
The server runs two components simultaneously:
- WebSocket Server on port
7890(configurable viaWS_PORTenv var) - MCP Server on stdio (for AI Agent communication)
cd server
node index.js
Or from the project root:
npm start
You should see:
[WS] WebSocket server attached to HTTP server
[Server] HTTP + WebSocket server listening on port 7890
[MCP] Server started on stdio transport
3. Load the Chrome Extension
- Open Chrome and navigate to
chrome://extensions/ - Enable Developer mode (toggle in the top-right corner)
- Click "Load unpacked"
- Select the
extension/folder from this project - The extension will load and automatically connect to the WebSocket server
You can verify the connection:
- Open the extension's service worker console (click "Inspect views: service worker" on the extensions page)
- You should see:
[MCP Extension] Connected to server
4. Verify the Health Check
curl http://localhost:7890/health
Expected response:
{
"status": "ok",
"connectedClients": 1,
"clients": ["<client-uuid>"]
}
Testing MCP Tools
Using the CLI Tool
The CLI tool connects directly to the WebSocket server and sends commands to the extension.
Single Command Mode
# List all open tabs
cd server
node cli.js list_tabs
# Open a new tab
node cli.js open_tab '{"url":"https://www.google.com"}'
# Capture screenshot of active tab
node cli.js capture_screenshot
# Capture screenshot by tab title
node cli.js capture_screenshot '{"title":"Google"}'
# Get HTML of active tab
node cli.js get_html
# Execute JavaScript
node cli.js execute_js '{"script":"return document.title"}'
# Close a tab (use tabId from list_tabs)
node cli.js close_tab '{"tabId":123}'
# Get visible text
node cli.js get_text
# Get element by selector
node cli.js get_element '{"selector":"h1"}'
Interactive Mode
cd server
node cli.js
This starts a REPL where you can type commands:
mcp> list_tabs
mcp> open_tab {"url":"https://github.com"}
mcp> capture_screenshot {"title":"GitHub"}
mcp> execute_js {"script":"return document.title"}
mcp> exit
Using with an AI Agent (MCP Client)
Configure your MCP client (e.g., Claude Desktop, Cursor, etc.) to use this server:
Claude Desktop Configuration
Add to your claude_desktop_config.json:
{
"mcpServers": {
"chrome-browser": {
"command": "node",
"args": ["/absolute/path/to/chrome-extension-mcp/server/index.js"],
"env": {
"WS_PORT": "7890"
}
}
}
}
Cursor Configuration
Add to your .cursor/mcp.json:
{
"mcpServers": {
"chrome-browser": {
"command": "node",
"args": ["/absolute/path/to/chrome-extension-mcp/server/index.js"],
"env": {
"WS_PORT": "7890"
}
}
}
}
After configuration, the AI Agent will have access to all 6 MCP tools to control your Chrome browser.
Configuration
Environment Variables
| Variable | Default | Description |
|---|---|---|
WS_PORT |
7890 |
WebSocket + HTTP server port |
WS_URL |
ws://localhost:7890 |
WebSocket URL (CLI only) |
Extension Settings
The WebSocket URL is hardcoded in extension/background.js:
const WS_URL = 'ws://localhost:7890';
Change this if your server runs on a different host/port.
Security Considerations
- Local-only by default: The WebSocket server binds to
localhost. The extension only connects tolocalhost. - No arbitrary code execution without explicit tool call: The
execute_jstool requires an explicitscriptparameter — the extension won't execute code without a proper request. - Request-response correlation: Every command uses a unique
requestId(UUID v4) to prevent response spoofing. - Timeout protection: Commands timeout after 30 seconds to prevent hanging.
- Client identification: Each extension identifies itself with a unique
clientIdon connection.
Production Recommendations
- Add authentication tokens to WebSocket connections
- Restrict
execute_jsto a whitelist of allowed scripts - Use WSS (WebSocket Secure) with TLS certificates
- Add rate limiting to prevent abuse
Error Handling
Server-side
- No clients connected: Returns
"No Chrome extension clients connected" - Request timeout: Returns
"Request timed out after 30000ms" - Send failure: Returns
"Failed to send command: ..."
Extension-side
- Unknown action: Returns
"Unknown action: <action>" - Missing required fields: Returns
"Missing required field: <field>" - Tab not found: Returns
"No tab found matching title: <title>" - Script execution error: Returns the error message from the page context
Development
Debugging the Extension
- Go to
chrome://extensions/ - Find "Chrome Extension MCP Bridge"
- Click "Inspect views: service worker" to open DevTools
- All logs are prefixed with
[MCP Extension]
Debugging the Server
All server logs go to stderr (so they don't interfere with MCP stdio communication):
[WS]— WebSocket server events[MCP]— MCP server events[Server]— General server events
Adding New Tools
- Create a new file in
server/tools/following the existing pattern - Export a factory function that takes
wsServerand returns{ name, description, inputSchema, handler } - Import and register it in
server/mcpServer.js - Add the corresponding action handler in
extension/background.js
Troubleshooting
| Problem | Solution |
|---|---|
| Extension won't connect | Make sure the server is running first (node server/index.js) |
| "No Chrome extension clients connected" | Check that the extension is loaded and the service worker is active |
| Screenshot returns error | Some pages (chrome://, file://) don't allow screenshots |
| execute_js fails | Some pages have CSP restrictions that block injected scripts |
| Service worker goes inactive | Chrome may suspend idle service workers; the auto-reconnect handles this |
| Port 7890 already in use | Change WS_PORT environment variable or kill the existing process |
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.