rudycanshoot
An MCP server that enables AI assistants to capture and view screenshots, supporting multiple AI tools and platforms.
README
rudycanshoot
An MCP server + CLI that lets AI assistants take and view screenshots. Works with Claude Code, Cursor, Windsurf, Codex CLI, Gemini CLI, OpenCode, Continue, Cline, Aider, and GitHub Copilot.
Install
npm install -g rudycanshoot
Or run without installing:
npx rudycanshoot serve
Quick Start
1. Auto-configure your AI tools
# All tools at once
rudycanshoot install --all
# Or a specific tool
rudycanshoot install --tool claude-code
rudycanshoot install --tool cursor
rudycanshoot install --tool windsurf
rudycanshoot install --tool codex
rudycanshoot install --tool gemini
rudycanshoot install --tool opencode
rudycanshoot install --tool continue
rudycanshoot install --tool cline
2. Restart your AI tool
The MCP server will now appear in your AI assistant's tool list.
3. Use it
Ask your AI: "Take a screenshot and show me what's on screen."
MCP Tools
| Tool | Description |
|---|---|
take_screenshot |
Capture fullscreen, active window, or a region |
read_screenshot |
Read a saved image so the AI can view it |
list_screenshots |
List recent captures |
take_screenshot parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
mode |
fullscreen|window|area |
fullscreen |
What to capture |
area |
string | — | x,y,width,height — required when mode=area |
filename |
string | auto | Output filename |
outputDir |
string | ~/.rudycanshoot/captures/ |
Where to save |
CLI
# Take a screenshot
rudycanshoot capture
rudycanshoot capture --mode window
rudycanshoot capture --mode area --area 0,0,1920,1080
rudycanshoot capture --output /tmp/snap.png
# List recent screenshots
rudycanshoot list
# Start MCP server (used by AI tools — usually run automatically)
rudycanshoot serve
# Configure AI tools
rudycanshoot install --all
Supported AI Tools
| Tool | Config location | What's installed |
|---|---|---|
| Claude Code | ~/.claude/settings.json |
MCP server entry + /screenshot command |
| Cursor | ~/.cursor/mcp.json |
MCP server entry |
| Windsurf | ~/.codeium/windsurf/mcp_config.json |
MCP server entry |
| Codex CLI | ~/AGENTS.md |
Tool documentation |
| Gemini CLI | ~/.gemini/settings.json |
MCP server entry |
| OpenCode | ~/.config/opencode/opencode.json |
MCP server + agent |
| Continue | ~/.continue/config.json |
MCP server entry |
| Cline | ~/.clinerules/ |
Rules file |
| Aider | ~/.aider.conf.yml |
Comment reference |
| GitHub Copilot | ~/.github/copilot-instructions.md |
Instructions |
Screenshot Backends
Linux
Installed automatically when available, in priority order:
| Tool | Display | Install |
|---|---|---|
grim |
Wayland | sudo apt install grim |
scrot |
X11 | sudo apt install scrot |
maim |
X11 | sudo apt install maim |
import |
X11 | sudo apt install imagemagick |
xwd |
X11 | sudo apt install x11-apps |
macOS
Uses the built-in screencapture command — no extra install needed.
Windows
Uses PowerShell + System.Windows.Forms — no extra install needed.
Project Config Files (for contributors)
When you clone this repo, your AI tool will auto-discover:
| File | Tool |
|---|---|
CLAUDE.md |
Claude Code |
AGENTS.md |
Codex CLI, OpenCode |
GEMINI.md |
Gemini CLI |
.github/copilot-instructions.md |
GitHub Copilot |
.cursor/mcp.json |
Cursor |
.mcp.json |
Claude Code (project-level) |
.windsurfrules |
Windsurf |
.clinerules/ |
Cline |
.claude/commands/screenshot.md |
Claude Code /screenshot command |
.opencode/agents/screenshot.md |
OpenCode agent |
License
MIT
Image Processing
| Function | Description |
|---|---|
annotateImage |
Add text label to a screenshot |
diffScreenshots |
Highlight/heatmap/side-by-side diff |
compareScreenshots |
Pixel-level similarity metrics |
highlightRegions |
Color overlays with labels |
redactRegions |
Fill or blur sensitive areas |
addWatermark |
Corner text watermark |
addBorder |
Solid border with optional radius |
cropImage |
Crop to a region |
resizeImage |
Resize preserving aspect ratio |
stitchImages |
Horizontal or vertical concat |
makeGrid |
N×M grid composite |
makeGif |
Animated GIF from PNG frames |
ocrImage |
Extract text via Tesseract |
Pipeline API
import { Pipeline } from "rudycanshoot";
const path = await Pipeline.capture({ mode: "fullscreen" })
.annotate("CT-6101 Boson capture", { position: "bottom" })
.redact([{ x: 0, y: 0, w: 1920, h: 30 }], { style: "blur" })
.watermark("CONFIDENTIAL", { corner: "br" })
.save("/tmp/final.png");
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.