Selenium MCP Server

Selenium MCP Server

Enables AI assistants to automate web browser interactions through Selenium WebDriver. Supports multi-browser automation, element interaction, navigation, and web testing capabilities.

Category
Visit Server

README

Selenium MCP Server

A Model Context Protocol (MCP) server for Selenium that provides comprehensive Selenium WebDriver automation tools for AI assistants and applications. This server enables automated web browser interactions, testing, and scraping through a standardized interface.

🚀 Overview

The Selenium MCP Server bridges the gap between AI models and web automation by providing a robust set of tools for browser control, element interaction, and web testing. Built with TypeScript and modern ES modules, it offers type-safe browser automation capabilities through the Model Context Protocol.

✨ Key Features

  • Multi-Browser Support: Chrome, Firefox, and Edge browser automation
  • Comprehensive Element Interaction: Click, type, hover, drag & drop, file uploads
  • Advanced Navigation: Forward, backward, refresh, window management
  • Wait Strategies: Intelligent waiting for elements and page states
  • Type Safety: Full TypeScript implementation with Zod validation

🤝 Integration

MCP Client Integration

Configure your MCP client to connect to the Selenium server:

Standard Configuration (applicable to Windsurf, Warp, Gemini CLI etc)

{
  "servers": {
    "Selenium": {
      "command": "npx",
      "args": ["-y", "selenium-webdriver-mcp"]
    }
  }
}

Installation in VS Code

Update your mcp.json in VS Code with below configuration

NOTE: If you're new to MCP servers, follow this link Use MCP servers in VS Code

Example 'stdio' type connection

{
  "servers": {
    "Selenium": {
      "command": "npx",
      "args": [
        "-y",
        "selenium-webdriver-mcp"
      ],
      "type": "stdio"
    }
  },
  "inputs": []
}

Example 'http' type connection

{
  "servers": {
    "Selenium": {
      "url": "https://smithery.ai/server/@pshivapr/selenium-mcp",
      "type": "http"
    }
  },
  "inputs": []
}

After installation, the Selenium MCP server will be available for use with your GitHub Copilot agent in VS Code.

To install the Selenium MCP server using the VS Code CLI

# For VS Code
code --add-mcp '{\"name\":\"Selenium\",\"command\": \"npx\",\"args\": [\"selenium-webdriver-mcp\"]}'

# For VS Code Insiders
vscode-insiders --add-mcp '{\"name\":\"Selenium\",\"command\": \"npx\",\"args\": [\"selenium-webdriver-mcp\"]}'

To install the package using either npm, or Smithery

Using npm:

npm install -g selenium-webdriver-mcp

Using smithery badge

To install Selenium MCP for Claude Desktop automatically via Smithery:

npx @smithery/cli install @pshivapr/selenium-mcp --client claude

Claude Desktop Integration

Add to your Claude Desktop configuration:

{
  "mcpServers": {
    "Selenium": {
      "command": "npx",
      "args": ["-y", "selenium-webdriver-mcp"]
    }
  }
}

Screenshot

Selenium + Claude

🛠️ MCP Available Tools

Browser Management Tools

Tool Description Parameters
browser_open Open a new browser session browser, options
browser_navigate Navigate to a URL url
browser_navigate_back Navigate back in history None
browser_navigate_forward Navigate forward in history None
browser_title Get the current page title None
browser_refresh Refresh the current page None
browser_resize Resize browser window width, height
browser_switch_tab_or_window Switch to a tab or window handle
browser_switch_to_original_window Switch to the original window None
browser_close Close current browser session None

Element Interaction Tools

Tool Description Parameters
browser_find_element Find an element on the page by, value, timeout
browser_click Click on an element by, value, timeout
browser_type Type text into an element by, value, text, timeout
browser_get_element_text Get text content of element by, value, timeout
browser_file_upload Upload file via input element by, value, filePath, timeout
browser_clear Clear text from an element by, value, timeout
browser_get_attribute Get element attribute value by, value, attribute, timeout
browser_element_is_displayed Check if element is displayed by, value, timeout
browser_switch_to_frame Switch to a frame by locator by, value, timeout, timeout

Advanced Action Tools

Tool Description Parameters
browser_hover Hover over an element by, value, timeout
browser_double_click Double-click on an element by, value, timeout
browser_right_click Right-click (context menu) by, value, timeout
browser_drag_and_drop Drag from source to target by, value, targetBy, targetValue, timeout
browser_wait_for_element Wait for element to appear by, value, timeout
browser_scroll_to_element Scroll element into view by, value, timeout
browser_execute_script Execute JavaScript code script, args
browser_screenshot Take a screenshot filename (optional)
browser_select_dropdown_by_text Select dropdown option by visible text by, value, text, timeout
browser_select_dropdown_by_value Select dropdown option by value by, value, dropdownValue, timeout
browser_key_press Press a keyboard key in the browser key, timeout

Element Locator Strategies

  • id: Find by element ID
  • css: Find by CSS selector
  • xpath: Find by XPath expression
  • name: Find by name attribute
  • tag: Find by HTML tag name
  • class: Find by CSS class name

📋 Requirements

  • Node.js: Version 18.0.0 or higher
  • Browsers: Chrome, Firefox, or Edge installed
  • WebDrivers: Automatically managed by selenium-webdriver
  • Operating System: Windows, macOS, or Linux

🚦 Development

Getting Started

  1. Clone the repository:

    git clone https://github.com/pshivapr/selenium-mcp.git
    cd selenium-mcp
    
  2. Install dependencies:

    npm install
    
  3. Build the project:

    npm run build
    

Running the Server

Production Mode

npm start

Development Mode (with auto-reload)

npm run dev

Direct Execution

node dist/index.js

Using as CLI Tool

After building, you can use the server as a global command:

npx selenium-webdriver-mcp

📝 License

MIT License - see LICENSE file for details.

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

📊 Version History

  • 0.2.0 - Selenium MCP Server
    • Multi-browser support (Chrome, Firefox, Edge)
    • Complete element interaction toolset
    • Advanced action capabilities
    • Type-safe TypeScript implementation
    • MCP protocol compliance

Built with ❤️ for the Model Context Protocol ecosystem

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured