🖼️ MCP Screenshot Server
A lightweight MCP-compatible Python server for capturing Windows screenshots via REST API. Supports full screen, region-based, or window-specific captures. Ideal for AI agent integrations and automation workflows.
margusmartsepp
README
🖼️ MCP Screenshot Server
A lightweight, MCP-compatible screenshot microservice built with FastAPI for Windows.
It allows AI agents and automation tools to capture full-screen, region-based, or window-specific screenshots via simple HTTP calls.
🔧 Features
- 📸 Capture full-screen screenshots
- 🪟 Capture specific window by title
- 🔲 Capture custom regions
[x, y, width, height]
- 🧠 MCP-compliant REST API
- 🖼️ Returns images as PNG or base64
- 🚀 Built with FastAPI, ready for production or LLM use
🧠 Use Cases
- Integrating with LLMs using Model Context Protocol (MCP)
- QA test automation pipelines
- Monitoring and remote capture tools
- Visual logging/debugging tools for agents
📦 Installation
git clone https://github.com/yourusername/mcp-screenshot-server.git
cd mcp-screenshot-server
python -m venv .venv
source .venv/bin/activate # or .venv\Scripts\activate on Windows
pip install -r requirements.txt
uvicorn main:app --reload
🔌 API Usage
POST /screenshot
Request JSON body:
{
"region": [0, 0, 1280, 720], // optional
"window_title": "Untitled - Notepad", // optional
"base64": true // optional (default: false)
}
Response (base64 mode):
{
"status": "ok",
"mode": "region",
"image_format": "base64",
"image": "<base64-encoded-image>"
}
🛠️ Tech Stack
- Python 3.11+
- FastAPI
mss
orpyautogui
for screenshotpillow
for image processingpygetwindow
for window matching (optional)
📄 License
MIT License.
Feel free to use, fork, and integrate — commercial or personal.
See LICENSE for details.
📬 Contributing
Pull requests and issues welcome!
Open a PR to add features or improve compatibility across platforms (e.g., Mac/Linux support).
🙋 FAQ
-
Does it work on Linux/macOS?
Not yet. This version is Windows-focused, but you’re welcome to extend it. -
Is it MCP-certified?
This project aims to follow the MCP spec as closely as possible for maximum compatibility with LLM agents.
🧠 Inspired By
- Anthropic’s Model Context Protocol
- Real-world automation use cases powered by LLMs and Python
Would you like me to tailor a specific section to emphasize AI agent use (e.g., “how to use with o1 or GPT-4o via plugin”)?
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
MCP Package Docs Server
Facilitates LLMs to efficiently access and fetch structured documentation for packages in Go, Python, and NPM, enhancing software development with multi-language support and performance optimization.
Claude Code MCP
An implementation of Claude Code as a Model Context Protocol server that enables using Claude's software engineering capabilities (code generation, editing, reviewing, and file operations) through the standardized MCP interface.
@kazuph/mcp-taskmanager
Model Context Protocol server for Task Management. This allows Claude Desktop (or any MCP client) to manage and execute tasks in a queue-based system.
Linear MCP Server
Enables interaction with Linear's API for managing issues, teams, and projects programmatically through the Model Context Protocol.
mermaid-mcp-server
A Model Context Protocol (MCP) server that converts Mermaid diagrams to PNG images.
Jira-Context-MCP
MCP server to provide Jira Tickets information to AI coding agents like Cursor

Linear MCP Server
A Model Context Protocol server that integrates with Linear's issue tracking system, allowing LLMs to create, update, search, and comment on Linear issues through natural language interactions.

Sequential Thinking MCP Server
This server facilitates structured problem-solving by breaking down complex issues into sequential steps, supporting revisions, and enabling multiple solution paths through full MCP integration.