Remote Control MCP (Electrobun)
Enables remote desktop control with screen capture and mouse/keyboard simulation through an MCP server exposed via a WebSocket tunnel.
README
Remote Control MCP (Electrobun)
A desktop remote-control MCP server built with Electrobun. It captures a user-selected screen and exposes mouse/keyboard control + screenshots as an MCP (Model Context Protocol) server, reachable from anywhere through a WebSocket tunnel — so external agents and tools can drive the machine.
Native input simulation currently targets Linux (via
/dev/uinput). See Setup & permissions.
Working on this codebase with an AI agent? Read AGENTS.md first — it documents the architecture and the non-obvious gotchas.
Features
- Screen capture — pick a display via the Web
getDisplayMediaAPI and take JPEG screenshots (90% quality). - Synthetic mouse — click at absolute pixel coordinates (left/right/middle).
- Synthetic keyboard — type text (with human-like timing) and press named keys with modifiers.
- Remote MCP server — 5 tools (
get_system_info,screenshot,click,type_text,press_key) exposed over a WebSocket tunnel with a stable public URL. - In-app test panel — exercise clicks/typing/keys locally before going remote.
Architecture (short version)
Electrobun runs two processes that talk over RPC:
- Bun process (
src/bun/) — OS access:/dev/uinput(FFI),xrandr, persistent storage. - Webview (
src/mainview/) — UI, screenshots, and the MCP server + WebSocket tunnel client.
Native actions are forwarded from the webview to Bun via RPC; screenshots are captured in the webview from the active screen-share stream. Full detail and the data-flow diagram live in AGENTS.md.
src/
├── bun/
│ ├── index.ts # RPC schema + handlers, window, screen-size detection, KV store
│ └── uinput.ts # FFI → /dev/uinput: VirtualMouse + VirtualKeyboard
└── mainview/
├── index.html # UI
├── index.css
├── index.ts # ScreenCaptureApp + McpPanel
└── mcp/ # tunnel client + MCP server bridge + tool definitions
Getting started
bun install
bun start # run (electrobun dev)
bun dev # run with file watching
bun run build:canary # build / verify it compiles
In the app, click Select Screen & Start Remote, choose a display, and the Remote MCP panel will show the public URL once the tunnel connects.
Setup & permissions
Input simulation creates kernel-level virtual devices through /dev/uinput, which
is root-only by default. On startup the app offers to grant access via pkexec
(graphical password prompt), running:
chmod 666 /dev/uinput
This is session-only (does not survive reboot). Manual equivalent:
sudo chmod 666 /dev/uinput
Do not run the whole app with sudo.
Remote control via MCP
The app exposes a Streamable HTTP MCP endpoint at the public tunnel URL. Tools:
| Tool | Description |
|---|---|
get_system_info |
Host details: screen size (the click coordinate space), OS/distro, kernel, arch, session (Wayland/X11), hostname, time. |
screenshot |
JPEG (90% quality) of the shared screen (requires active screen sharing). |
click |
Click at absolute pixel x,y (button: left/right/middle). |
type_text |
Type a string (US layout; non-ASCII skipped + reported). |
press_key |
Press a named key / char, optional ctrl/shift/alt/meta. |
Smoke test
smoke-mcp.sh drives a demo sequence (click → type "chrome" → down → enter):
./smoke-mcp.sh 'https://layerz.me:4433/mcp/<session>'
# or
MCP_URL='https://layerz.me:4433/mcp/<session>' ./smoke-mcp.sh
Paste the URL from the app's Remote MCP panel.
Security considerations
- Screen sharing requires explicit user consent; the tunnel only connects after consent.
- Once connected, the public URL lets remote clients control mouse/keyboard and read the screen — treat it like a remote-access credential and disconnect when not in use.
- The tunnel never autostarts; it is tied to the screen-selection action.
Notes
- US QWERTY keyboard layout assumed for synthetic typing.
- Tunnel server:
wss://layerz.me:4433/connect(already deployed; configurable intunnel.ts).
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.