hyprland-mcp-server

hyprland-mcp-server

Enables LLMs to capture screenshots, inspect windows, and interact with the Hyprland desktop environment through mouse clicks and keyboard input.

Category
Visit Server

README

hyprland-mcp-server

MCP server (Model Context Protocol) for capturing screenshots and querying window/monitor/workspace state in the Hyprland Wayland compositor.

Enables LLMs to see your screen, inspect open windows, and navigate your desktop layout.

Requirements

Binary Source
hyprctl Bundled with Hyprland
grim https://sr.ht/~emersion/grim/ (pacman -S grim / apt install grim)

The server must run inside an active Hyprland session with WAYLAND_DISPLAY set.

Installation

cd hyprland-mcp-server
npm install
npm run build

MCP Client Configuration

Claude Desktop / opencode

{
  "mcpServers": {
    "hyprland": {
      "command": "node",
      "args": ["/absolute/path/to/hyprland-mcp-server/dist/index.js"]
    }
  }
}

Run manually (for testing)

node dist/index.js
# or during development:
npm run dev

Tools

Tool Description
hyprland_list_windows List all open windows with class, title, address, geometry, workspace
hyprland_get_active_window Get metadata for the currently focused window
hyprland_screenshot_window Capture a PNG of a specific window (by address, class, title, or PID)
hyprland_screenshot_active_window Capture a PNG of the currently focused window
hyprland_screenshot_monitor Capture a full monitor screenshot
hyprland_list_monitors List monitors with resolution, refresh rate, position, scale
hyprland_list_workspaces List workspaces with window counts and monitor assignments
hyprland_click_at Simulates a mouse click at absolute global coordinates in Hyprland. Useful for programmatically interacting with specific screen positions.
hyprland_click_window Targets the center of a specific window in Hyprland for a mouse click, with optional offsets.
hyprland_click_window_relative Clicks inside a selected window using window-local coordinates (local_x,local_y) with optional bounds validation.
hyprland_click_and_screenshot_active Performs click with optional bounded retries and returns active-window screenshot for immediate verification.
hyprland_move_cursor Moves the mouse cursor to specified global coordinates without clicking.
hyprland_get_cursor_pos Retrieves the current cursor position in the global layout coordinates.
hyprland_type_text Types a text string into the currently focused window using Wayland-native input injection.
hyprland_press_key Sends a single keypress event (e.g., Enter, Tab, Escape) to the currently focused window.
hyprland_send_key Sends a key or a key combination (e.g., "Ctrl+C") to a specific window or the active window.
hyprland_focus_window Brings keyboard and compositor focus to a specific window, enabling further interactions.

Example Usage

"Screenshot the terminal window"
  -> hyprland_screenshot_window(class="Alacritty")

"What apps are open on workspace 2?"
  -> hyprland_list_windows(workspace_id=2)

"Take a screenshot of my screen"
  -> hyprland_screenshot_monitor()

"What am I looking at right now?"
  -> hyprland_get_active_window()

Screenshot Response

Screenshot tools return:

  • A text block with geometry, file path, and window info
  • An image block (base64 PNG) so the LLM can visually inspect the content
  • A structuredContent block with JSON metadata

Screenshots are saved to a temporary directory under /tmp/hyprland-mcp-*.

Click Precision Notes

  • hyprland_click_at expects global monitor coordinates.
  • hyprland_screenshot_window is naturally read as window-local pixels.
  • Convert local → global when needed:
global_x = window_x + local_x
global_y = window_y + local_y

To avoid coordinate-space mistakes, prefer:

  1. hyprland_click_window_relative when targeting UI points from a window screenshot.
  2. hyprland_click_window when clicking near window center with offsets.
  3. hyprland_click_and_screenshot_active when you want click+verify loop in one call.

Development

npm run dev      # tsx watch mode
npm run build    # compile TypeScript
npm run clean    # remove dist/

Architecture

src/
  index.ts              # Server init, dependency check, tool registration
  types.ts              # TypeScript interfaces for Hyprland IPC structs
  schemas/index.ts      # Shared Zod schemas (ResponseFormat, WindowSelector)
  services/
    hyprland.ts         # hyprctl wrapper, grim capture, window/monitor queries
  tools/
    list-windows.ts
    get-active-window.ts
    screenshot-window.ts
    screenshot-active-window.ts
    screenshot-monitor.ts
    list-monitors.ts
    list-workspaces.ts

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured