Selenium MCP Server
Enables AI assistants to automate web browser interactions through Selenium WebDriver. Supports multi-browser automation, element interaction, navigation, and web testing capabilities.
README
Selenium MCP Server
A Model Context Protocol (MCP) server for Selenium that provides comprehensive Selenium WebDriver automation tools for AI assistants and applications. This server enables automated web browser interactions, testing, and scraping through a standardized interface.
🚀 Overview
The Selenium MCP Server bridges the gap between AI models and web automation by providing a robust set of tools for browser control, element interaction, and web testing. Built with TypeScript and modern ES modules, it offers type-safe browser automation capabilities through the Model Context Protocol.
✨ Key Features
- Multi-Browser Support: Chrome, Firefox, and Edge browser automation
- Comprehensive Element Interaction: Click, type, hover, drag & drop, file uploads
- Advanced Navigation: Forward, backward, refresh, window management
- Wait Strategies: Intelligent waiting for elements and page states
- Type Safety: Full TypeScript implementation with Zod validation
🤝 Integration
MCP Client Integration
Configure your MCP client to connect to the Selenium server:
Standard Configuration (applicable to Windsurf, Warp, Gemini CLI etc)
{
"servers": {
"Selenium": {
"command": "npx",
"args": ["-y", "selenium-webdriver-mcp"]
}
}
}
Installation in VS Code
Update your mcp.json in VS Code with below configuration
NOTE: If you're new to MCP servers, follow this link Use MCP servers in VS Code
Example 'stdio' type connection
{
"servers": {
"Selenium": {
"command": "npx",
"args": [
"-y",
"selenium-webdriver-mcp"
],
"type": "stdio"
}
},
"inputs": []
}
Example 'http' type connection
{
"servers": {
"Selenium": {
"url": "https://smithery.ai/server/@pshivapr/selenium-mcp",
"type": "http"
}
},
"inputs": []
}
After installation, the Selenium MCP server will be available for use with your GitHub Copilot agent in VS Code.
To install the Selenium MCP server using the VS Code CLI
# For VS Code
code --add-mcp '{\"name\":\"Selenium\",\"command\": \"npx\",\"args\": [\"selenium-webdriver-mcp\"]}'
# For VS Code Insiders
vscode-insiders --add-mcp '{\"name\":\"Selenium\",\"command\": \"npx\",\"args\": [\"selenium-webdriver-mcp\"]}'
To install the package using either npm, or Smithery
Using npm:
npm install -g selenium-webdriver-mcp
To install Selenium MCP for Claude Desktop automatically via Smithery:
npx @smithery/cli install @pshivapr/selenium-mcp --client claude
Claude Desktop Integration
Add to your Claude Desktop configuration:
{
"mcpServers": {
"Selenium": {
"command": "npx",
"args": ["-y", "selenium-webdriver-mcp"]
}
}
}
Screenshot

🛠️ MCP Available Tools
Browser Management Tools
| Tool | Description | Parameters |
|---|---|---|
browser_open |
Open a new browser session | browser, options |
browser_navigate |
Navigate to a URL | url |
browser_navigate_back |
Navigate back in history | None |
browser_navigate_forward |
Navigate forward in history | None |
browser_title |
Get the current page title | None |
browser_refresh |
Refresh the current page | None |
browser_resize |
Resize browser window | width, height |
browser_switch_tab_or_window |
Switch to a tab or window | handle |
browser_switch_to_original_window |
Switch to the original window | None |
browser_close |
Close current browser session | None |
Element Interaction Tools
| Tool | Description | Parameters |
|---|---|---|
browser_find_element |
Find an element on the page | by, value, timeout |
browser_click |
Click on an element | by, value, timeout |
browser_type |
Type text into an element | by, value, text, timeout |
browser_get_element_text |
Get text content of element | by, value, timeout |
browser_file_upload |
Upload file via input element | by, value, filePath, timeout |
browser_clear |
Clear text from an element | by, value, timeout |
browser_get_attribute |
Get element attribute value | by, value, attribute, timeout |
browser_element_is_displayed |
Check if element is displayed | by, value, timeout |
browser_switch_to_frame |
Switch to a frame by locator | by, value, timeout, timeout |
Advanced Action Tools
| Tool | Description | Parameters |
|---|---|---|
browser_hover |
Hover over an element | by, value, timeout |
browser_double_click |
Double-click on an element | by, value, timeout |
browser_right_click |
Right-click (context menu) | by, value, timeout |
browser_drag_and_drop |
Drag from source to target | by, value, targetBy, targetValue, timeout |
browser_wait_for_element |
Wait for element to appear | by, value, timeout |
browser_scroll_to_element |
Scroll element into view | by, value, timeout |
browser_execute_script |
Execute JavaScript code | script, args |
browser_screenshot |
Take a screenshot | filename (optional) |
browser_select_dropdown_by_text |
Select dropdown option by visible text | by, value, text, timeout |
browser_select_dropdown_by_value |
Select dropdown option by value | by, value, dropdownValue, timeout |
browser_key_press |
Press a keyboard key in the browser | key, timeout |
Element Locator Strategies
id: Find by element IDcss: Find by CSS selectorxpath: Find by XPath expressionname: Find by name attributetag: Find by HTML tag nameclass: Find by CSS class name
📋 Requirements
- Node.js: Version 18.0.0 or higher
- Browsers: Chrome, Firefox, or Edge installed
- WebDrivers: Automatically managed by selenium-webdriver
- Operating System: Windows, macOS, or Linux
🚦 Development
Getting Started
-
Clone the repository:
git clone https://github.com/pshivapr/selenium-mcp.git cd selenium-mcp -
Install dependencies:
npm install -
Build the project:
npm run build
Running the Server
Production Mode
npm start
Development Mode (with auto-reload)
npm run dev
Direct Execution
node dist/index.js
Using as CLI Tool
After building, you can use the server as a global command:
npx selenium-webdriver-mcp
📝 License
MIT License - see LICENSE file for details.
🤝 Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature) - Commit your changes (
git commit -m 'Add some AmazingFeature') - Push to the branch (
git push origin feature/AmazingFeature) - Open a Pull Request
📊 Version History
- 0.2.0 - Selenium MCP Server
- Multi-browser support (Chrome, Firefox, Edge)
- Complete element interaction toolset
- Advanced action capabilities
- Type-safe TypeScript implementation
- MCP protocol compliance
Built with ❤️ for the Model Context Protocol ecosystem
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.