web-automation-mcp

An MCP server that enables Claude Code CLI to interact with Cloud Desktop, ChatGPT, and Gemini web interfaces through browser automation.

Web Automation MCP Server

An MCP (Model Context Protocol) server that enables Claude Code CLI to interact with Cloud Desktop, ChatGPT, and Gemini web interfaces through browser automation.

Features

  • Multi-Service Support: Interact with Cloud Desktop, ChatGPT, and Gemini
  • Browser Automation: Uses Playwright for reliable web automation
  • Session Persistence: Maintains browser sessions across requests
  • Response Extraction: Automatically extracts and returns responses
  • Conversation Management: Clear conversation history when needed

Installation

  1. Clone the repository and change into its directory:
cd ~/web-automation-mcp
  2. Install dependencies:
npm install
  3. Build the project:
npm run build

Configuration

Add the server to your Claude MCP settings (on macOS, ~/Library/Application Support/Claude/claude_desktop_config.json), replacing YOUR_USERNAME with your own account name:

{
  "mcpServers": {
    "web-automation": {
      "command": "node",
      "args": ["/Users/YOUR_USERNAME/web-automation-mcp/dist/index.js"]
    }
  }
}
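
If you would rather script this step than edit the file by hand, the merge can be sketched as below. The helper name addMcpServer and its types are illustrative assumptions, not part of this project, and reading and writing the file is left to the caller.

```typescript
// Hypothetical helper for illustration; web-automation-mcp does not ship this.
interface McpServerEntry {
  command: string;
  args: string[];
}

type JsonObject = Record<string, unknown>;

// Returns a copy of the parsed config with one server entry added.
// Existing top-level keys and existing servers are preserved.
function addMcpServer(
  config: JsonObject,
  name: string,
  entry: McpServerEntry
): JsonObject {
  const servers = (config["mcpServers"] ?? {}) as Record<string, McpServerEntry>;
  return { ...config, mcpServers: { ...servers, [name]: entry } };
}

const updated = addMcpServer({ mcpServers: {} }, "web-automation", {
  command: "node",
  // Replace YOUR_USERNAME with your own account name.
  args: ["/Users/YOUR_USERNAME/web-automation-mcp/dist/index.js"],
});

console.log(JSON.stringify(updated, null, 2));
```

Returning a copy rather than mutating the input keeps any other servers already listed in the file intact.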

Usage

Natural Language Support

The MCP server accepts natural language queries through the web_automation_query tool:

# Ask ChatGPT a question
"Ask ChatGPT: What is quantum computing?"

# Send to Gemini
"Send to Gemini: Explain machine learning"

# Clear conversation
"Clear ChatGPT conversation"

# Get latest response
"Get the latest response from Gemini"

# Browser control
"Initialize browser in visible mode"
"Close browser"

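How the server parses these phrases is not documented here. Purely as an illustration, a keyword-based router for the example phrases above might look like the sketch below. The function routeQuery, the ToolCall shape, and the matching rules are all assumptions, not the project's actual implementation.

```typescript
type Service = "clouddesktop" | "chatgpt" | "gemini";

type ToolCall =
  | { tool: "send_to_service"; service: Service; content: string }
  | { tool: "get_latest_response"; service: Service }
  | { tool: "clear_conversation"; service: Service }
  | { tool: "initialize_browser"; headless: boolean }
  | { tool: "cleanup_browser" };

// Assumed spellings of each service name in a query.
const SERVICE_NAMES: Array<[RegExp, Service]> = [
  [/chatgpt/i, "chatgpt"],
  [/gemini/i, "gemini"],
  [/cloud\s*desktop/i, "clouddesktop"],
];

function findService(text: string): Service | undefined {
  for (const [pattern, service] of SERVICE_NAMES) {
    if (pattern.test(text)) return service;
  }
  return undefined;
}

function routeQuery(query: string): ToolCall | undefined {
  const q = query.trim();
  if (/^close browser/i.test(q)) return { tool: "cleanup_browser" };
  if (/^initialize browser/i.test(q)) {
    // Headless only when asked for; "visible mode" keeps the default (false).
    return { tool: "initialize_browser", headless: /headless/i.test(q) };
  }
  // Only the part before the first colon names the command and the service,
  // so a service mentioned inside the message body is ignored.
  const colon = q.indexOf(":");
  const head = colon === -1 ? q : q.slice(0, colon);
  const service = findService(head);
  if (service === undefined) return undefined;
  if (/^clear\b/i.test(head)) return { tool: "clear_conversation", service };
  if (/latest response/i.test(head)) return { tool: "get_latest_response", service };
  if (colon === -1) return undefined; // no message text to send
  return { tool: "send_to_service", service, content: q.slice(colon + 1).trim() };
}

console.log(routeQuery("Ask ChatGPT: What is quantum computing?"));
```

This sketch only handles the "Service: message" form. A free-form phrase without a colon would need a more capable parser.
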
Available Tools

  1. web_automation_query - Process natural language queries
    • query: string - Natural language description of what you want to do

  2. initialize_browser - Initialize the browser
    • headless: boolean (default: false) - Run in headless mode

  3. send_to_service - Send input to a service and get response
    • service: "clouddesktop" | "chatgpt" | "gemini"
    • content: string - The message to send

  4. get_latest_response - Get the latest response from a service
    • service: "clouddesktop" | "chatgpt" | "gemini"

  5. clear_conversation - Clear conversation history
    • service: "clouddesktop" | "chatgpt" | "gemini"

  6. cleanup_browser - Close all browser windows
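
The parameter list above can be written down as a small table and checked before a call is made. The sketch below is one way to do that. The names TOOLS and checkArgs are hypothetical, and treating every parameter without a documented default as required is an assumption.

```typescript
type ParamType = "string" | "boolean" | "service";

interface ParamSpec {
  type: ParamType;
  required: boolean; // assumption: required unless a default is documented
}

const SERVICES: readonly string[] = ["clouddesktop", "chatgpt", "gemini"];

const TOOLS: Record<string, Record<string, ParamSpec>> = {
  web_automation_query: { query: { type: "string", required: true } },
  initialize_browser: { headless: { type: "boolean", required: false } },
  send_to_service: {
    service: { type: "service", required: true },
    content: { type: "string", required: true },
  },
  get_latest_response: { service: { type: "service", required: true } },
  clear_conversation: { service: { type: "service", required: true } },
  cleanup_browser: {},
};

// Returns a list of problems; an empty list means the arguments look valid.
function checkArgs(tool: string, args: Record<string, unknown>): string[] {
  const spec = TOOLS[tool];
  if (spec === undefined) return [`unknown tool: ${tool}`];
  const errors: string[] = [];
  for (const name of Object.keys(spec)) {
    const param = spec[name];
    if (param === undefined) continue;
    const value = args[name];
    if (value === undefined) {
      if (param.required) errors.push(`missing parameter: ${name}`);
      continue;
    }
    const ok =
      param.type === "service"
        ? typeof value === "string" && SERVICES.indexOf(value) !== -1
        : typeof value === param.type;
    if (!ok) errors.push(`invalid value for ${name}`);
  }
  return errors;
}

console.log(checkArgs("send_to_service", { service: "chatgpt", content: "Hello" }));
```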

Example Usage in Claude Code

Natural language examples:

# Ask ChatGPT
Use web_automation_query with query: "Ask ChatGPT about the history of computing"

# Send to Gemini
Use web_automation_query with query: "Send to Gemini: write a Python function to sort a list"

# Clear conversation
Use web_automation_query with query: "Clear ChatGPT conversation"

Direct tool usage:

# Initialize browser (visible mode)
Use the initialize_browser tool with headless: false

# Send a message to ChatGPT
Use the send_to_service tool with service: "chatgpt" and content: "What is the capital of France?"

# Clean up when done
Use the cleanup_browser tool
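
On the wire, an MCP client turns each of these instructions into a tools/call JSON-RPC request. The sketch below builds the message for the ChatGPT example above. The helper name buildToolCall is chosen for illustration, and in practice Claude Code constructs and sends these requests for you.

```typescript
// Shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Illustrative helper: wraps a tool name and its arguments in a request.
function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

const request = buildToolCall(1, "send_to_service", {
  service: "chatgpt",
  content: "What is the capital of France?",
});

console.log(JSON.stringify(request, null, 2));
```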

Important Notes

  1. Manual Login Required: The first time you use each service, you must log in manually in the browser window.

  2. Service URLs: If a service's URL changes, update it in the corresponding handler file:
    • src/handlers/cloudDesktop.ts - Cloud Desktop URL
    • src/handlers/chatgpt.ts - ChatGPT URL
    • src/handlers/gemini.ts - Gemini URL

  3. Selectors: The CSS selectors in the handlers may need updating if a service changes its UI.
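
One way to soften selector breakage is to keep several candidate selectors per element and use the first one that matches. The sketch below assumes a hypothetical handler config shape. The URL and selectors are placeholders, not values taken from this project's handlers.

```typescript
// Hypothetical per-service settings; the real handlers may be organized differently.
interface ServiceConfig {
  url: string;
  inputSelectors: string[]; // tried in order, most specific first
}

// Placeholder values for illustration only.
const exampleConfig: ServiceConfig = {
  url: "https://example.com/chat",
  inputSelectors: ["textarea#prompt", "div[contenteditable='true']"],
};

// Returns the first selector for which `exists` resolves to true.
// `exists` abstracts the page lookup; with Playwright it could wrap
// `(await page.locator(selector).count()) > 0`.
async function firstMatching(
  selectors: string[],
  exists: (selector: string) => Promise<boolean>
): Promise<string | undefined> {
  for (const selector of selectors) {
    if (await exists(selector)) return selector;
  }
  return undefined;
}

// Stand-in lookup that pretends only the second selector is on the page.
firstMatching(exampleConfig.inputSelectors, async (s) => s.startsWith("div")).then(
  (hit) => console.log(hit ?? "no selector matched")
);
```

Passing the lookup in as a function keeps the fallback logic independent of the browser, so it can be exercised without launching one.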

Development

To run in development mode:

npm run dev

To modify service handlers, edit the files in src/handlers/.

Troubleshooting

  • Login Required Error: Initialize the browser with headless: false and log in manually
  • Selector Not Found: The service UI may have changed; update the selectors in the corresponding handler
  • Timeout Errors: Increase the timeout values in the handlers for slower connections

License

MIT
