Total PC Control

Total PC Control MCP server - v2 with fixes and compression

jasondsmith72

An MCP (Model Context Protocol) server that provides control over your screen, mouse, and keyboard using nut.js.

⚠️ Warning: Use with Caution

This software enables programmatic control of your mouse, keyboard, and other system operations. By using this software, you acknowledge and accept that:

  • Giving AI models direct control over your computer through this tool can lead to unintended consequences
  • The software can control your mouse, keyboard, and other system functions
  • You are using this software entirely at your own risk
  • The creators and contributors of this project accept NO responsibility for any damage, data loss, or other consequences that may arise from using this software

Features

  • 📷 Screen Capture: Capture screenshots of your entire screen or specific regions
  • 🖱️ Mouse Control: Move the mouse cursor, click, double-click, and scroll
  • ⌨️ Keyboard Input: Type text and press keyboard shortcuts
  • 🪟 Window Management: Find, focus, and manipulate application windows
  • 📋 Clipboard Access: Copy and paste text

Prerequisites

  • Node.js 16 or higher
  • npm or yarn
  • cmake-js (for building native dependencies)

Installation

  1. Clone the repository:
git clone https://github.com/jasondsmith72/total-pc-control.git
cd total-pc-control
  2. Install cmake-js globally (required for building native dependencies):
npm install -g cmake-js
  3. Install the libnut core library (required for nut.js):
git clone https://github.com/nut-tree/libnut.git libnut-core
cd libnut-core
npm install
cmake-js rebuild
cd ..
  4. Install dependencies and build the project:
npm install
npm run build

Using with Claude for Desktop

  1. Edit your Claude for Desktop configuration file:
    • macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
    • Windows: %APPDATA%\Claude\claude_desktop_config.json
  2. Add the following to your configuration:
{
  "mcpServers": {
    "total-pc-control": {
      "command": "node",
      "args": [
        "/ABSOLUTE/PATH/TO/total-pc-control/build/index.js"
      ]
    }
  }
}

Replace /ABSOLUTE/PATH/TO/ with the actual path to where you cloned the repository. On Windows, write the path with forward slashes or doubled backslashes (\\), because a single backslash starts an escape sequence inside a JSON string.

  3. Restart Claude for Desktop.
  4. Look for the hammer icon in the Claude interface, which indicates that tools are available.
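
If the configuration file already lists other servers, the entry has to be merged into the existing "mcpServers" object rather than pasted over it. A minimal TypeScript sketch of that merge, assuming the file has the shape shown above; the addServer helper, the ClaudeConfig type, and the "other" server entry are hypothetical and not part of this project:

```typescript
// Hypothetical types: only the "mcpServers" shape from the README is assumed.
interface ServerEntry {
  command: string;
  args: string[];
}

interface ClaudeConfig {
  mcpServers?: Record<string, ServerEntry>;
  [key: string]: unknown;
}

// Returns a new config with the total-pc-control entry added,
// leaving any servers that are already configured untouched.
function addServer(config: ClaudeConfig, entryPath: string): ClaudeConfig {
  return {
    ...config,
    mcpServers: {
      ...(config.mcpServers ?? {}),
      "total-pc-control": { command: "node", args: [entryPath] },
    },
  };
}

// Example input with one made-up server already present.
const existing: ClaudeConfig = {
  mcpServers: { other: { command: "node", args: ["/path/to/other.js"] } },
};

const merged = addServer(
  existing,
  "/ABSOLUTE/PATH/TO/total-pc-control/build/index.js",
);

// JSON.stringify escapes backslashes, so Windows paths stay valid JSON.
console.log(JSON.stringify(merged, null, 2));
```

The sketch only builds the merged object; reading and writing the actual file is left to the caller.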

Available Tools

Screen Capture

  • capture_screen: Capture the entire screen as an image. Supports format (png/jpeg) and quality (jpeg only) parameters.
  • capture_region: Capture a specific region of the screen. Requires left, top, width, height. Supports format and quality.
  • get_screen_size: Get the dimensions (width and height) of the screen.
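
MCP clients invoke tools through a JSON-RPC "tools/call" request. The sketch below shows what such a request for capture_region might look like, using the parameter names listed above. The captureRegionRequest helper is hypothetical, and the quality value of 80 is an assumption, since the README does not state the accepted range:

```typescript
// Shape of an MCP "tools/call" JSON-RPC request.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

interface Region {
  left: number;
  top: number;
  width: number;
  height: number;
}

// Hypothetical helper: builds a capture_region call.
// quality is only included for jpeg, because the README
// describes it as a jpeg-only parameter.
function captureRegionRequest(
  id: number,
  region: Region,
  format: "png" | "jpeg" = "png",
  quality?: number,
): ToolCallRequest {
  const args: Record<string, unknown> = { ...region, format };
  if (format === "jpeg" && quality !== undefined) {
    args["quality"] = quality;
  }
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: "capture_region", arguments: args },
  };
}

const request = captureRegionRequest(
  1,
  { left: 0, top: 0, width: 800, height: 600 },
  "jpeg",
  80, // assumed scale; not documented in the README
);
console.log(JSON.stringify(request, null, 2));
```

In practice an MCP client library usually constructs this envelope itself; the sketch is only meant to make the argument names concrete.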

Mouse Control

  • move_mouse: Move the mouse cursor to a specific x, y position.
  • get_mouse_position: Get the current x, y position of the mouse cursor.
  • click_mouse: Click the mouse at the current position. Optional button (left, middle, right).
  • click_at: Click the mouse at a specific x, y position. Optional button.
  • double_click: Double-click the mouse at the current position.
  • double_click_at: Double-click the mouse at a specific x, y position.
  • scroll_mouse: Scroll the mouse wheel. Requires direction (up/down). Optional amount.
  • drag_mouse: Drag the mouse from the current position to a target x, y position.
  • drag_mouse_from_to: Drag the mouse from a startX, startY position to an endX, endY position.
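
A common pattern is to click the middle of a known rectangle, for example a region that was previously captured. The sketch below computes that point and clamps it to the screen size reported by get_screen_size. It assumes pixel coordinates with the origin at the top-left corner, which the README does not state explicitly; clickAtCenter is a hypothetical helper:

```typescript
interface Size {
  width: number;
  height: number;
}

interface Rect {
  left: number;
  top: number;
  width: number;
  height: number;
}

type MouseButton = "left" | "middle" | "right";

function clamp(value: number, min: number, max: number): number {
  return Math.min(Math.max(value, min), max);
}

// Hypothetical helper: returns the tool name and arguments for a
// click_at call aimed at the center of rect, kept inside the screen.
function clickAtCenter(rect: Rect, screen: Size, button: MouseButton = "left") {
  const x = clamp(Math.round(rect.left + rect.width / 2), 0, screen.width - 1);
  const y = clamp(Math.round(rect.top + rect.height / 2), 0, screen.height - 1);
  return { name: "click_at", arguments: { x, y, button } };
}

const screen: Size = { width: 1920, height: 1080 };
const call = clickAtCenter({ left: 100, top: 200, width: 50, height: 20 }, screen);
console.log(call); // arguments: x = 125, y = 210, button = "left"
```

Clamping avoids sending coordinates that fall outside the screen when the rectangle overlaps an edge.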

Keyboard Input

  • type_text: Type text at the current cursor position. Requires text.
  • type_text_with_delay: Type text with a delay between keystrokes. Requires text. Optional delayMs.
  • press_key: Press a specific keyboard key. Requires key.
  • press_key_shortcut: Press a keyboard shortcut (combination of keys). Requires keys array.
  • hold_key: Hold down a keyboard key. Requires key.
  • release_key: Release a held keyboard key. Requires key.
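
Because hold_key and release_key are separate tools, every hold should be matched by a release. The sketch below builds such a paired sequence of tool calls. The key names ("LeftShift", "Down", "LeftControl", "S") are assumptions for illustration only, since the README does not list the accepted key names, and withHeldKey is a hypothetical helper:

```typescript
interface ToolStep {
  name: string;
  arguments: Record<string, unknown>;
}

// Hypothetical helper: wraps steps in a hold_key / release_key pair
// so the held key is always released at the end of the sequence.
function withHeldKey(key: string, steps: ToolStep[]): ToolStep[] {
  return [
    { name: "hold_key", arguments: { key } },
    ...steps,
    { name: "release_key", arguments: { key } },
  ];
}

// Extend a selection downward by two lines while Shift is held.
// Key names here are assumed, not taken from the README.
const plan = withHeldKey("LeftShift", [
  { name: "press_key", arguments: { key: "Down" } },
  { name: "press_key", arguments: { key: "Down" } },
]);

// A shortcut takes all of its keys in a single call instead.
const save: ToolStep = {
  name: "press_key_shortcut",
  arguments: { keys: ["LeftControl", "S"] },
};

console.log(plan.map((step) => step.name).join(" -> "));
console.log(JSON.stringify(save));
```

If a step in the middle fails, a client should still send the final release_key call so the key is not left held down.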

Clipboard Operations

  • get_clipboard_text: Get text from the clipboard.
  • set_clipboard_text: Set text to the clipboard. Requires text.
  • copy_selected_text: Copy selected text to clipboard and return it (uses Ctrl+C/Cmd+C).
  • paste_text: Paste text at current cursor position (uses Ctrl+V/Cmd+V). Requires text.
  • get_clipboard_image: Get image from the clipboard (if available) as base64 data.
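
The README does not say which image format get_clipboard_image returns. One way to find out on the client side is to look at the first characters of the base64 string, since the PNG and JPEG file signatures always encode to the same prefixes. A minimal sketch; sniffImageKind and toDataUri are hypothetical helpers:

```typescript
type ImageKind = "png" | "jpeg" | "unknown";

// Detects the image format from the start of a base64 string.
function sniffImageKind(base64: string): ImageKind {
  // PNG signature bytes 89 50 4E 47 0D 0A 1A 0A encode to "iVBORw0KGgo".
  if (base64.startsWith("iVBORw0KGgo")) return "png";
  // JPEG signature bytes FF D8 FF encode to "/9j/".
  if (base64.startsWith("/9j/")) return "jpeg";
  return "unknown";
}

// Builds a data URI for display, or returns null for unknown formats.
function toDataUri(base64: string): string | null {
  const kind = sniffImageKind(base64);
  return kind === "unknown" ? null : `data:image/${kind};base64,${base64}`;
}

console.log(sniffImageKind("iVBORw0KGgoAAAANSUhEUg")); // png
console.log(sniffImageKind("/9j/4AAQSkZJRg")); // jpeg
console.log(toDataUri("AAAA")); // null
```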

UI Automation Tools (Windows Only)

These tools use Windows UI Automation via PowerShell to interact with UI elements.

  • get_ui_element_info: Finds a UI element within a specified window and returns its properties (Name, AutomationId, ClassName, ControlType, BoundingRectangle, IsEnabled, IsOffscreen, Value, Children).
    • Requires windowTitle (can be partial match).
    • To find a specific element, provide at least one of elementName, automationId, or className.
    • If no element identifier is provided, the tool lists the direct children of the window instead.
  • invoke_ui_element_action: Performs an action on a specified UI element.
    • Requires windowTitle.
    • Requires action (Click, SetValue, or Focus).
    • Requires at least one of elementName, automationId, or className.
    • Requires valueToSet (string) if action is SetValue.
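
The argument rules for invoke_ui_element_action can be checked on the client before the call is sent. The sketch below encodes the four requirements listed above; validateInvokeArgs and the InvokeArgs type are hypothetical, and the window title and automationId values are made up for the example:

```typescript
type UiAction = "Click" | "SetValue" | "Focus";

interface InvokeArgs {
  windowTitle: string;
  action: UiAction;
  elementName?: string;
  automationId?: string;
  className?: string;
  valueToSet?: string;
}

// Returns a list of problems; an empty list means the arguments
// satisfy the rules stated in the README.
function validateInvokeArgs(args: InvokeArgs): string[] {
  const problems: string[] = [];
  if (args.windowTitle.trim() === "") {
    problems.push("windowTitle is required");
  }
  if (!args.elementName && !args.automationId && !args.className) {
    problems.push("one of elementName, automationId, or className is required");
  }
  if (args.action === "SetValue" && args.valueToSet === undefined) {
    problems.push("valueToSet is required when action is SetValue");
  }
  return problems;
}

// Made-up example values.
const ok = validateInvokeArgs({
  windowTitle: "Notepad",
  action: "SetValue",
  automationId: "15",
  valueToSet: "hello",
});
const bad = validateInvokeArgs({ windowTitle: "Notepad", action: "SetValue" });

console.log(ok); // []
console.log(bad); // two problems: missing identifier, missing valueToSet
```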

Development

To run the server in development mode:

npm run dev

To run tests:

npm test

License

This project is licensed under the MIT License - see the LICENSE file for details.
