🖼️ MCP Screenshot Server

🖼️ MCP Screenshot Server

A lightweight MCP-compatible Python server for capturing Windows screenshots via REST API. Supports full screen, region-based, or window-specific captures. Ideal for AI agent integrations and automation workflows.

margusmartsepp

Developer Tools
Visit Server

README

🖼️ MCP Screenshot Server

A lightweight, MCP-compatible screenshot microservice built with FastAPI for Windows.
It allows AI agents and automation tools to capture full-screen, region-based, or window-specific screenshots via simple HTTP calls.

MCP Compatible License: MIT Python


🔧 Features

  • 📸 Capture full-screen screenshots
  • 🪟 Capture specific window by title
  • 🔲 Capture custom regions [x, y, width, height]
  • 🧠 MCP-compliant REST API
  • 🖼️ Returns images as PNG or base64
  • 🚀 Built with FastAPI, ready for production or LLM use

🧠 Use Cases

  • Integrating with LLMs using Model Context Protocol (MCP)
  • QA test automation pipelines
  • Monitoring and remote capture tools
  • Visual logging/debugging tools for agents

📦 Installation

git clone https://github.com/yourusername/mcp-screenshot-server.git
cd mcp-screenshot-server
python -m venv .venv
source .venv/bin/activate   # or .venv\Scripts\activate on Windows
pip install -r requirements.txt
uvicorn main:app --reload

🔌 API Usage

POST /screenshot

Request JSON body:

{
  "region": [0, 0, 1280, 720],        // optional
  "window_title": "Untitled - Notepad", // optional
  "base64": true                      // optional (default: false)
}

Response (base64 mode):

{
  "status": "ok",
  "mode": "region",
  "image_format": "base64",
  "image": "<base64-encoded-image>"
}

🛠️ Tech Stack

  • Python 3.11+
  • FastAPI
  • mss or pyautogui for screenshot
  • pillow for image processing
  • pygetwindow for window matching (optional)

📄 License

MIT License.
Feel free to use, fork, and integrate — commercial or personal.
See LICENSE for details.


📬 Contributing

Pull requests and issues welcome!
Open a PR to add features or improve compatibility across platforms (e.g., Mac/Linux support).


🙋 FAQ

  • Does it work on Linux/macOS?
    Not yet. This version is Windows-focused, but you’re welcome to extend it.

  • Is it MCP-certified?
    This project aims to follow the MCP spec as closely as possible for maximum compatibility with LLM agents.


🧠 Inspired By



Would you like me to tailor a specific section to emphasize AI agent use (e.g., “how to use with o1 or GPT-4o via plugin”)?

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
MCP Package Docs Server

MCP Package Docs Server

Facilitates LLMs to efficiently access and fetch structured documentation for packages in Go, Python, and NPM, enhancing software development with multi-language support and performance optimization.

Featured
Local
TypeScript
Claude Code MCP

Claude Code MCP

An implementation of Claude Code as a Model Context Protocol server that enables using Claude's software engineering capabilities (code generation, editing, reviewing, and file operations) through the standardized MCP interface.

Featured
Local
JavaScript
@kazuph/mcp-taskmanager

@kazuph/mcp-taskmanager

Model Context Protocol server for Task Management. This allows Claude Desktop (or any MCP client) to manage and execute tasks in a queue-based system.

Featured
Local
JavaScript
Linear MCP Server

Linear MCP Server

Enables interaction with Linear's API for managing issues, teams, and projects programmatically through the Model Context Protocol.

Featured
JavaScript
mermaid-mcp-server

mermaid-mcp-server

A Model Context Protocol (MCP) server that converts Mermaid diagrams to PNG images.

Featured
JavaScript
Jira-Context-MCP

Jira-Context-MCP

MCP server to provide Jira Tickets information to AI coding agents like Cursor

Featured
TypeScript
Linear MCP Server

Linear MCP Server

A Model Context Protocol server that integrates with Linear's issue tracking system, allowing LLMs to create, update, search, and comment on Linear issues through natural language interactions.

Featured
JavaScript
Sequential Thinking MCP Server

Sequential Thinking MCP Server

This server facilitates structured problem-solving by breaking down complex issues into sequential steps, supporting revisions, and enabling multiple solution paths through full MCP integration.

Featured
Python