ScreenMonitorMCP

ScreenMonitorMCP

Enables real-time screen monitoring, UI element analysis, and predictive user behavior learning for AI assistants.

Category
Visit Server

README

๐Ÿš€ Revolutionary Screen Monitor MCP Server

CI License: MIT Python 3.9+ MCP Compatible

A REVOLUTIONARY Model Context Protocol (MCP) server! Gives AI real-time vision capabilities, UI intelligence, and predictive behavior learning power. This isn't just screen capture - it gives AI the power to truly "see" and understand your digital world!

๐ŸŒŸ WHY ScreenMonitorMCP?

  • ๐Ÿ”ฅ First & Only: Real-time continuous screen monitoring feature
  • ๐Ÿง  AI Intelligence: AI that understands UI elements and can interact with them
  • ๐Ÿ”ฎ Predictive: System that learns and predicts user behaviors
  • โšก Proactive: Assistant that offers help before you need it
  • ๐ŸŽฏ Natural: AI that understands commands like "Click the save button"

๐Ÿ”ฅ REVOLUTIONARY FEATURES

๐Ÿ”„ Real-Time Continuous Monitoring

  • AI's Eyes Never Close: 2-5 FPS continuous screen monitoring
  • Smart Change Detection: Distinguishes between small, major, and critical changes
  • Proactive Analysis: AI automatically analyzes important changes
  • Adaptive Performance: Smart frame rate adjustment

๐ŸŽฏ UI Element Intelligence

  • Computer Vision UI Detection: Automatically recognizes buttons, forms, menus
  • OCR Text Extraction: Reads text from anywhere on the screen
  • Smart Click System: Natural language commands like "Click the save button"
  • Interaction Mapping: AI knows exactly where and how to interact

๐Ÿง  Predictive Intelligence

  • Behavior Learning: AI learns your usage patterns and habits
  • Intent Prediction: Predicts what you'll do next based on context
  • Proactive Help: Offers help before you ask
  • Workflow Optimization: Suggests improvements in your work patterns

๐Ÿ› ๏ธ REVOLUTIONARY MCP TOOLS

๐Ÿ”„ Real-Time Monitoring Tools

  • start_continuous_monitoring() - Starts AI's continuous vision capability
  • stop_continuous_monitoring() - Stops continuous monitoring
  • get_monitoring_status() - Real-time status information and statistics
  • get_recent_changes() - Recently detected screen changes

๐ŸŽฏ UI Intelligence Tools

  • analyze_ui_elements() - Recognizes and maps all UI elements on screen
  • smart_click() - Smart clicking with natural language commands ("Click the save button")
  • extract_text_from_screen() - OCR text extraction from screen

๐Ÿง  Predictive AI Tools

  • learn_user_patterns() - Learns and analyzes user behavior patterns
  • predict_user_intent() - Predicts user intent based on current context
  • proactive_assistance() - Offers proactive help before user requests
  • record_user_action() - Records user actions and feeds learning system

๐Ÿ“ธ Traditional Tools

  • capture_and_analyze() - Screen capture and AI analysis (enhanced)
  • list_tools() - MCP standard compliant lists all tools (categorized, detailed information)

๐ŸŽฏ USAGE SCENARIOS

๐Ÿ” Real-Time Monitoring

# Start AI's continuous vision capability
await start_continuous_monitoring(fps=3, change_threshold=0.1)

# Check monitoring status
status = await get_monitoring_status()

# View recent changes
changes = await get_recent_changes(limit=5)

๐ŸŽฏ UI Intelligence

# Analyze all UI elements on screen
ui_analysis = await analyze_ui_elements()

# Smart clicking with natural language
await smart_click("Click the save button")

# Extract text from screen
text_data = await extract_text_from_screen()

๐Ÿง  Predictive AI

# Learn user behavior patterns
patterns = await learn_user_patterns()

# Predict user intent
intent = await predict_user_intent()

# Get proactive assistance
assistance = await proactive_assistance()

๐Ÿš€ INSTALLATION

1. Prepare Project Files

# Navigate to project directory
cd ScreenMonitorMCP

# Install required libraries
pip install -r requirements.txt

2. Configure Environment Variables

Edit the .env file:

# Server Configuration
HOST=127.0.0.1
PORT=7777
API_KEY=your_secret_key

# AI Configuration
OPENAI_API_KEY=your_openai_api_key
OPENAI_BASE_URL=https://api.openai.com/v1
DEFAULT_OPENAI_MODEL=gpt-4o

3. Standalone Testing (Optional)

# Test the server
python main.py

# Test revolutionary features
python test_revolutionary_features.py

๐Ÿ”ง MCP CLIENT SETUP

Claude Desktop / MCP Client Configuration

Add the following JSON to your MCP client's configuration file:

๐ŸŽฏ Simple Configuration (Recommended)

{
  "mcpServers": {
    "screenMonitorMCP": {
      "command": "python",
      "args": ["/path/to/ScreenMonitorMCP/main.py"],
      "cwd": "/path/to/ScreenMonitorMCP"
    }
  }
}

๐Ÿ”ง Advanced Configuration

{
  "mcpServers": {
    "screenMonitorMCP": {
      "command": "python",
      "args": [
        "/path/to/ScreenMonitorMCP/main.py"
      ],
      "cwd": "/path/to/ScreenMonitorMCP",
      "env": {
        "OPENAI_API_KEY": "your-api-key-here"
      }
    }
  }
}

๐Ÿ›ก๏ธ Secure Configuration

{
  "mcpServers": {
    "screenMonitorMCP": {
      "command": "python",
      "args": [
        "/path/to/ScreenMonitorMCP/main.py",
        "--api-key", "your-secret-key"
      ],
      "cwd": "/path/to/ScreenMonitorMCP"
    }
  }
}

๐ŸชŸ Windows Example

{
  "mcpServers": {
    "screenMonitorMCP": {
      "command": "python",
      "args": ["C:/path/to/ScreenMonitorMCP/main.py"],
      "cwd": "C:/path/to/ScreenMonitorMCP"
    }
  }
}

โš ๏ธ Important Notes

  1. File Path: Update /path/to/ScreenMonitorMCP/main.py path according to your project directory
  2. Python Path: Make sure Python is in PATH or use full path: "C:/Python311/python.exe"
  3. Working Directory: cwd parameter is important for proper .env file reading
  4. API Keys: All settings are automatically read from .env file

๐Ÿงช USAGE EXAMPLES

๐Ÿ”„ Starting Real-Time Monitoring

# Start AI's continuous vision capability
result = await start_continuous_monitoring(
    fps=3,
    change_threshold=0.1,
    smart_detection=True
)

# Check monitoring status
status = await get_monitoring_status()

# View recent changes
changes = await get_recent_changes(limit=10)

# Stop monitoring
await stop_continuous_monitoring()

๐ŸŽฏ Using UI Intelligence

# Analyze all UI elements on screen
ui_elements = await analyze_ui_elements(
    detect_buttons=True,
    extract_text=True,
    confidence_threshold=0.7
)

# Smart clicking with natural language
await smart_click("Click the save button", dry_run=False)

# Extract text from specific region
text_data = await extract_text_from_screen(
    region={"x": 100, "y": 100, "width": 500, "height": 300}
)

๐Ÿง  Predictive Intelligence

# Learn user behavior patterns
patterns = await learn_user_patterns()

# Predict user intent
intent = await predict_user_intent(
    current_context={"current_app": "VSCode"}
)

# Get proactive assistance
assistance = await proactive_assistance()

# Record user action
await record_user_action(
    action_type="click",
    target="save_button",
    app_context="VSCode"
)

๐Ÿ“ธ Traditional Screen Capture

# Enhanced screen capture and analysis
result = await capture_and_analyze(
    capture_mode="all",
    analysis_prompt="What do you see on this screen?",
    max_tokens=500
)

# List all tools
tools = await list_tools()

๐Ÿš€ REVOLUTIONARY CAPABILITIES

This MCP server gives AI the following capabilities:

  • ๐Ÿ‘๏ธ Continuous Vision: AI can monitor the screen non-stop
  • ๐Ÿง  Smart Understanding: Recognizes UI elements and interacts with them
  • ๐Ÿ”ฎ Future Prediction: Learns and predicts user behaviors
  • โšก Proactive Help: Offers help before you need it
  • ๐ŸŽฏ Natural Interaction: Understands commands like "Click the save button"

๐Ÿ”ง TROUBLESHOOTING

Common Issues and Solutions

  1. Unicode/Encoding Error (Windows)

    UnicodeEncodeError: 'charmap' codec can't encode character
    

    Solution: โœ… This error is fixed! Server automatically uses UTF-8 encoding.

  2. JSON Configuration Error

    // โŒ Wrong
    {
      "command": "python",
      "args": ["path/to/main.py",]  // Trailing comma is wrong
    }
    
    // โœ… Correct
    {
      "command": "python",
      "args": ["path/to/main.py"]
    }
    
  3. Python Path Issue

    {
      "command": "C:/Python311/python.exe",  // Use full path
      "args": ["C:/path/to/ScreenMonitorMCP/main.py"]
    }
    
  4. Missing Dependencies

    cd ScreenMonitorMCP
    pip install -r requirements.txt
    
  5. OCR Issues

    # Install Tesseract (optional)
    # EasyOCR installs automatically
    
  6. MCP Connection Closed Error

    MCP error -32000: Connection closed
    

    Solution: Check file paths and add cwd parameter.

๐Ÿ“ LICENSE

This project is licensed under the MIT License.


๐Ÿš€ Revolutionary MCP server that gives AI real "eyes"! ๐Ÿ”ฅ Next-generation AI-human interaction starts here!

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured