Screenshot MCP Server

Screenshot MCP Server

Enables AI assistants running in WSL to capture Windows screenshots of full desktops, specific monitors, or individual windows by title or process name. It features DPI awareness, automatic path conversion between WSL and Windows, and customizable storage options.

Category
Visit Server

README

This repo is in Read Only mode and has been replaced

Use the WSLSnapIt-MCP repo instead (works better, more features!)

Screenshot MCP Server

A Model Context Protocol (MCP) server that enables AI assistants running in WSL to capture Windows screenshots with advanced features like monitor selection and window-specific capture.

Features

  • 📸 Full Desktop Capture - Capture all monitors (default behavior)
  • 🖥️ Monitor Selection - Capture specific monitors (primary, 1, 2, etc.)
  • 🪟 Window Capture - Capture specific windows by title match with DPI awareness
  • 🚀 Process Capture - Capture windows by process name (e.g., notepad.exe)
  • 📂 Custom Save Locations - Save to any folder using WSL or Windows paths
  • 🔄 Automatic Path Conversion - Converts WSL paths to Windows paths
  • 📁 Organized Storage - Screenshots saved to workspace/screenshots/ by default
  • 🎯 DPI Aware - Proper scaling for high-DPI displays
  • 🖼️ Full Window Capture - Includes window shadows and borders without clipping

Prerequisites

  • Windows with WSL (Windows Subsystem for Linux)
  • Node.js installed in WSL
  • Claude Desktop or Claude Code with MCP support

Installation

  1. Clone this repository:
git clone https://github.com/peterparker57/screenshot-mcp.git
cd screenshot-mcp
  1. Install dependencies:
npm install
# or with bun:
bun install
  1. Add to your Claude configuration (~/.claude.json or ~/.claude/claude_desktop_config.json):
{
  "mcpServers": {
    "screenshot": {
      "command": "node",
      "args": [
        "/path/to/screenshot-mcp/index.js"
      ],
      "disabled": false,
      "alwaysAllow": [
        "take_screenshot"
      ]
    }
  }
}
  1. Restart Claude Desktop/Code

Usage

Once installed, you can ask Claude to take screenshots:

Basic Usage

Take a screenshot

Capture Specific Monitor

Take a screenshot of monitor 1
Take a screenshot of the primary monitor

Capture Specific Window by Title

Take a screenshot of the "Chrome" window
Take a screenshot of window titled "Visual Studio Code"

Capture Specific Window by Process Name

Take a screenshot of notepad.exe
Take a screenshot of the chrome process
Take a screenshot of process "Code"

Custom Filename

Take a screenshot and save it as "test.png"

Save to Custom Folder

Take a screenshot and save it to /mnt/c/Users/username/Pictures/
Take a screenshot and save to C:\Users\username\Desktop\
Take a screenshot of monitor 1 and save to folder "../docs/images"

API Reference

The MCP server provides a single tool:

take_screenshot

Parameters:

  • filename (optional): Name for the screenshot file (default: "screenshot.png")
  • monitor (optional): Which monitor to capture
    • "all" - Capture all monitors (default)
    • "primary" - Capture primary monitor only
    • 1, 2, etc. - Capture specific monitor by index
  • windowTitle (optional): Capture a specific window by its title (partial match supported)
  • processName (optional): Capture a specific window by process name (e.g., "notepad.exe" or "notepad")
  • folder (optional): Custom folder path to save the screenshot
    • Supports WSL paths: /mnt/c/Users/...
    • Supports Windows paths: C:\Users\...
    • Supports relative paths: ../images
    • Default: workspace/screenshots/

Returns:

  • Success message with the file path
  • Error message if capture fails

Notes:

  • If both windowTitle and processName are provided, windowTitle takes precedence
  • Custom folders are created automatically if they don't exist
  • Path formats are automatically converted between WSL and Windows as needed

Technical Details

Architecture

  • MCP Server: Node.js with @modelcontextprotocol/sdk
  • Screenshot Capture: PowerShell with .NET Windows Forms
  • Communication: Executes PowerShell commands from WSL
  • Encoding: Base64 encoding for reliable command execution

Window Capture Features

  • DPI Awareness: Automatically handles high-DPI displays for crisp captures
  • Window Padding: Adds 10px padding to capture window shadows and borders
  • Render Wait: Waits 200ms after focusing window to ensure complete rendering
  • Bounds Checking: Prevents negative coordinates when windows are near screen edges
  • Process Matching: Intelligent process name matching (strips .exe extension automatically)

How It Works

  1. MCP server receives screenshot request from Claude
  2. Determines save location (custom folder or default)
  3. Converts paths between WSL and Windows formats as needed
  4. Constructs appropriate PowerShell script based on parameters
  5. Encodes script in base64 to avoid escaping issues
  6. Executes PowerShell command from WSL
  7. PowerShell captures screenshot using Windows Forms APIs
  8. Image is saved to the specified location

Error Handling

  • Filters PowerShell CLIXML output (verbose logging, not errors)
  • Validates monitor indices
  • Provides clear error messages for missing windows or processes
  • Lists available windows when capture fails
  • Automatically creates directories if needed
  • Handles both WSL and Windows path formats

Troubleshooting

Screenshots folder not created

The server automatically creates folders as needed. Ensure you have write permissions to the target location.

PowerShell execution errors

Check your PowerShell execution policy:

Get-ExecutionPolicy

Window not found errors

  • Ensure the window is open and not minimized
  • The title match is case-insensitive and supports partial matches
  • Try using a more specific window title
  • When searching by process, the tool will list all available windows to help you identify the correct one

Window capture is clipped

The latest version includes automatic padding and DPI awareness. If you still experience clipping:

  • Ensure you're using the latest version
  • Try maximizing the window before capture
  • Check if the window has unusual rendering (some apps use custom chrome)

Path conversion issues

The server automatically converts between WSL and Windows path formats:

  • WSL paths like /mnt/c/... are converted to C:\... for PowerShell
  • Windows paths like C:\... are converted to /mnt/c/... for file verification
  • Ensure your paths are accessible from both WSL and Windows

Recent Updates

v1.2.0

  • Added custom folder support with the folder parameter
  • Supports both WSL paths (/mnt/...) and Windows paths (C:\...)
  • Automatic path conversion between WSL and Windows formats
  • Creates custom directories automatically if they don't exist
  • Maintains backward compatibility with default screenshots folder

v1.1.0

  • Added process name capture support
  • Can now capture windows by process name (e.g., "notepad.exe")
  • Intelligent .exe extension handling
  • Enhanced error messages showing available windows

v1.0.1

  • Fixed window capture clipping issues
  • Added DPI awareness for high-DPI displays
  • Added padding to capture window shadows
  • Improved window rendering wait time
  • Added bounds checking for edge cases

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

MIT License - see LICENSE file for details

Acknowledgments

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured