MCP Desktop Tools
An MCP server that provides Claude with comprehensive desktop automation capabilities including browser control, window management, and native mouse/keyboard input on Windows. It enables users to capture screenshots, launch applications, and interact with the system clipboard through natural language.
README
MCP Desktop Tools
An MCP server that gives Claude desktop automation capabilities — browser control, screenshots, mouse/keyboard input, window management, and clipboard access.
Built with TypeScript, Playwright, and native Windows APIs.
Tools
Browser
| Tool | Description |
|---|---|
browser_open |
Launch Chromium and navigate to a URL |
browser_navigate |
Navigate to a URL with configurable wait conditions |
browser_click |
Click elements by CSS selector |
browser_type |
Type into input fields, optionally clear or press Enter |
browser_read |
Read page content (text, HTML, title, URL, or specific elements) |
browser_screenshot |
Capture viewport or full-page screenshots |
browser_close |
Close the browser |
Screenshots
| Tool | Description |
|---|---|
screenshot_fullscreen |
Capture entire screen (multi-monitor supported) |
screenshot_region |
Capture a rectangular region by coordinates |
screenshot_window |
Capture a specific window by title (partial match) |
Desktop
| Tool | Description |
|---|---|
desktop_mouse_click |
Click at screen coordinates |
desktop_mouse_move |
Move cursor (instant or smooth animation) |
desktop_keyboard_type |
Type text via simulated keystrokes |
desktop_keyboard_hotkey |
Press keyboard shortcuts (e.g. ctrl+c, alt+tab) |
desktop_window_list |
List all visible windows with positions and sizes |
desktop_window_focus |
Focus a window by title |
desktop_window_resize |
Move and/or resize a window |
desktop_app_launch |
Launch apps by path, name, or URI |
desktop_clipboard_read |
Read clipboard text |
desktop_clipboard_write |
Write text to clipboard |
Setup
npm install
npm run build
npx playwright install chromium
Claude Code Configuration
Add to your Claude Code MCP settings (~/.claude/settings.json):
{
"mcpServers": {
"desktop-tools": {
"command": "node",
"args": ["C:/Users/<you>/mcp-desktop-tools/dist/index.js"]
}
}
}
Restart Claude Code to pick up the new server.
Requirements
- Windows 10/11
- Node.js 18+
- PowerShell (used for native window/mouse/keyboard operations)
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
E2B
Using MCP to run code via e2b.