web-automation-mcp
An MCP server that enables Claude Code CLI to interact with Cloud Desktop, ChatGPT, and Gemini web interfaces through browser automation.
README
Web Automation MCP Server
An MCP (Model Context Protocol) server that enables Claude Code CLI to interact with Cloud Desktop, ChatGPT, and Gemini web interfaces through browser automation.
Features
- Multi-Service Support: Interact with Cloud Desktop, ChatGPT, and Gemini
- Browser Automation: Uses Playwright for reliable web automation
- Session Persistence: Maintains browser sessions across requests
- Response Extraction: Automatically extracts and returns responses
- Conversation Management: Clear conversation history when needed
Installation
- Clone the repository:
cd ~/web-automation-mcp
- Install dependencies:
npm install
- Build the project:
npm run build
Configuration
Add the server to your Claude MCP settings (~/Library/Application Support/Claude/claude_desktop_config.json):
{
"mcpServers": {
"web-automation": {
"command": "node",
"args": ["/Users/YOUR_USERNAME/web-automation-mcp/dist/index.js"]
}
}
}
Usage
Natural Language Support
The MCP server now supports natural language queries through the web_automation_query tool:
# Ask ChatGPT a question
"Ask ChatGPT: What is quantum computing?"
# Send to Gemini
"Send to Gemini: Explain machine learning"
# Clear conversation
"Clear ChatGPT conversation"
# Get latest response
"Get the latest response from Gemini"
# Browser control
"Initialize browser in visible mode"
"Close browser"
Available Tools
-
web_automation_query - Process natural language queries
query: string - Natural language description of what you want to do
-
initialize_browser - Initialize the browser
headless: boolean (default: false) - Run in headless mode
-
send_to_service - Send input to a service and get response
service: "clouddesktop" | "chatgpt" | "gemini"content: string - The message to send
-
get_latest_response - Get the latest response from a service
service: "clouddesktop" | "chatgpt" | "gemini"
-
clear_conversation - Clear conversation history
service: "clouddesktop" | "chatgpt" | "gemini"
-
cleanup_browser - Close all browser windows
Example Usage in Claude Code
Natural language examples:
# Ask ChatGPT
Use web_automation_query with query: "Ask ChatGPT about the history of computing"
# Send to Gemini
Use web_automation_query with query: "Send to Gemini: write a Python function to sort a list"
# Clear conversation
Use web_automation_query with query: "Clear ChatGPT conversation"
Direct tool usage:
# Initialize browser (visible mode)
Use the initialize_browser tool with headless: false
# Send a message to ChatGPT
Use the send_to_service tool with service: "chatgpt" and content: "What is the capital of France?"
# Clean up when done
Use the cleanup_browser tool
Important Notes
-
Manual Login Required: You must manually log in to each service in the browser window the first time
-
Service URLs: Update the URLs in the handler files if they change:
src/handlers/cloudDesktop.ts- Update Cloud Desktop URLsrc/handlers/chatgpt.ts- ChatGPT URLsrc/handlers/gemini.ts- Gemini URL
-
Selectors: The CSS selectors in handlers may need updates if the services change their UI
Development
To run in development mode:
npm run dev
To modify service handlers, edit the files in src/handlers/.
Troubleshooting
- Login Required Error: Open the browser in non-headless mode and log in manually
- Selector Not Found: The service UI may have changed - update selectors in the handler
- Timeout Errors: Increase timeout values in the handlers for slower connections
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.