MCP Servers

MCP Macaco Playwright

Enables comprehensive browser automation and web interaction through Playwright with 50+ specialized functions for navigation, form filling, data extraction, and Chrome DevTools Protocol support. Designed specifically for AI agents to perform complex web workflows including scraping, testing, and automated browsing tasks.

README

MCP Macaco Playwright

Enhanced Playwright Tools for Model Context Protocol (MCP) with Chrome DevTools Protocol (CDP) Support

Overview

MCP Macaco Playwright is a comprehensive browser automation server that provides AI agents with powerful web interaction capabilities through the Model Context Protocol. It combines Playwright's robust browser automation with CDP integration for advanced debugging and control scenarios.

Features

Complete Browser Automation: Navigate, interact, and extract data from web pages
Chrome DevTools Protocol (CDP) Support: Connect to existing browser instances
AI-Optimized: Designed specifically for AI agents and automated workflows
Comprehensive Tool Set: 50+ specialized functions for web automation
Multi-Browser Support: Chrome, Firefox, Safari, and Edge
Screenshot & Snapshot Capabilities: Visual and accessibility-based page capture
Form Automation: Complete form filling and submission workflows
Network Monitoring: Track requests, responses, and console messages

Installation

npm install mcp-macaco-playwright

Quick Start

import { createConnection } from 'mcp-macaco-playwright';

// Create MCP server connection
const server = await createConnection();

// Use with MCP client
await client.callTool({
  name: 'browser_navigate',
  arguments: { url: 'https://example.com' }
});

Function Reference

Navigation Functions

`browser_navigate`

Navigate to a specific URL.

Parameters:

url (string, required): The URL to navigate to

Example:

await client.callTool({
  name: 'browser_navigate',
  arguments: { url: 'https://github.com' }
});

`browser_navigate_back`

Go back to the previous page in browser history.

Parameters: None

Example:

await client.callTool({
  name: 'browser_navigate_back',
  arguments: {}
});

`browser_navigate_forward`

Go forward to the next page in browser history.

Parameters: None

Example:

await client.callTool({
  name: 'browser_navigate_forward',
  arguments: {}
});

Page Analysis Functions

`browser_snapshot`

Capture an accessibility snapshot of the current page for analysis and interaction.

Parameters: None

Example:

await client.callTool({
  name: 'browser_snapshot',
  arguments: {}
});

`browser_take_screenshot`

Take a visual screenshot of the page or specific element.

Parameters:

type (string, optional): Image format ('png' or 'jpeg', default: 'png')
filename (string, optional): Custom filename for the screenshot
element (string, optional): Human-readable element description
ref (string, optional): Element reference from snapshot
fullPage (boolean, optional): Capture full scrollable page

Example:

await client.callTool({
  name: 'browser_take_screenshot',
  arguments: { 
    type: 'png',
    fullPage: true,
    filename: 'homepage.png'
  }
});

Element Interaction Functions

`browser_click`

Click on a specific element on the page.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot

Example:

await client.callTool({
  name: 'browser_click',
  arguments: {
    element: 'Sign in button',
    ref: 'button-signin-123'
  }
});

`browser_double_click`

Perform a double-click on an element.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot

Example:

await client.callTool({
  name: 'browser_double_click',
  arguments: {
    element: 'File icon',
    ref: 'file-icon-456'
  }
});

`browser_right_click`

Perform a right-click to open context menu.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot

Example:

await client.callTool({
  name: 'browser_right_click',
  arguments: {
    element: 'Image thumbnail',
    ref: 'img-thumb-789'
  }
});

Text Input Functions

`browser_type`

Type text into an editable element.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot
text (string, required): Text to type
submit (boolean, optional): Press Enter after typing
slowly (boolean, optional): Type character by character

Example:

await client.callTool({
  name: 'browser_type',
  arguments: {
    element: 'Search input field',
    ref: 'search-input-123',
    text: 'playwright automation',
    submit: true
  }
});

`browser_press_key`

Press a specific key on the keyboard.

Parameters:

key (string, required): Key name (e.g., 'Enter', 'ArrowLeft', 'a')

Example:

await client.callTool({
  name: 'browser_press_key',
  arguments: { key: 'Escape' }
});

Form Functions

`browser_select_option`

Select options in a dropdown menu.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot
values (array, required): Array of values to select

Example:

await client.callTool({
  name: 'browser_select_option',
  arguments: {
    element: 'Country dropdown',
    ref: 'country-select-456',
    values: ['United States']
  }
});

`browser_check_checkbox`

Check or uncheck a checkbox element.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot
checked (boolean, required): Whether to check (true) or uncheck (false)

Example:

await client.callTool({
  name: 'browser_check_checkbox',
  arguments: {
    element: 'Terms and conditions checkbox',
    ref: 'terms-checkbox-789',
    checked: true
  }
});

`browser_select_radio`

Select a radio button.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot

Example:

await client.callTool({
  name: 'browser_select_radio',
  arguments: {
    element: 'Payment method: Credit Card',
    ref: 'payment-radio-cc'
  }
});

`browser_clear_input`

Clear the content of an input field.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot

Example:

await client.callTool({
  name: 'browser_clear_input',
  arguments: {
    element: 'Email input field',
    ref: 'email-input-123'
  }
});

Data Extraction Functions

`browser_get_text`

Extract text content or attribute values from elements.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot
attribute (string, optional): Specific attribute to extract (e.g., 'href', 'src')

Example:

await client.callTool({
  name: 'browser_get_text',
  arguments: {
    element: 'Product price',
    ref: 'price-display-456'
  }
});

`browser_get_elements`

Get multiple elements matching a selector.

Parameters:

selector (string, required): CSS selector to find elements
attribute (string, optional): Attribute to extract from each element

Example:

await client.callTool({
  name: 'browser_get_elements',
  arguments: {
    selector: '.product-card h3',
    attribute: 'textContent'
  }
});

Scrolling and Focus Functions

`browser_scroll_to`

Scroll to a specific element or coordinate position.

Parameters:

element (string, optional): Human-readable element description
ref (string, optional): Element reference from page snapshot
x (number, optional): X coordinate to scroll to
y (number, optional): Y coordinate to scroll to
behavior (string, optional): Scroll behavior ('auto' or 'smooth')

Example:

await client.callTool({
  name: 'browser_scroll_to',
  arguments: {
    element: 'Footer section',
    ref: 'footer-section-123',
    behavior: 'smooth'
  }
});

`browser_get_scroll_position`

Get the current scroll position of the page.

Parameters: None

Example:

await client.callTool({
  name: 'browser_get_scroll_position',
  arguments: {}
});

`browser_focus_element`

Set focus on a specific element.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot

Example:

await client.callTool({
  name: 'browser_focus_element',
  arguments: {
    element: 'Search input',
    ref: 'search-input-456'
  }
});

`browser_blur_element`

Remove focus from a specific element.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot

Example:

await client.callTool({
  name: 'browser_blur_element',
  arguments: {
    element: 'Email input',
    ref: 'email-input-789'
  }
});

Wait Functions

`browser_wait_for`

Wait for specific conditions to be met.

Parameters:

time (number, optional): Time to wait in seconds
text (string, optional): Text to wait for to appear
textGone (string, optional): Text to wait for to disappear

Example:

await client.callTool({
  name: 'browser_wait_for',
  arguments: {
    text: 'Loading complete',
    time: 5
  }
});

Tab Management Functions

`browser_tab_list`

List all open browser tabs.

Parameters: None

Example:

await client.callTool({
  name: 'browser_tab_list',
  arguments: {}
});

`browser_tab_new`

Open a new browser tab.

Parameters:

url (string, optional): URL to navigate to in the new tab

Example:

await client.callTool({
  name: 'browser_tab_new',
  arguments: { url: 'https://example.com' }
});

`browser_tab_close`

Close a browser tab.

Parameters:

index (number, optional): Index of tab to close (closes current if not specified)

Example:

await client.callTool({
  name: 'browser_tab_close',
  arguments: { index: 1 }
});

`browser_tab_select`

Switch to a specific tab by index.

Parameters:

index (number, required): Index of the tab to select

Example:

await client.callTool({
  name: 'browser_tab_select',
  arguments: { index: 0 }
});

Network and Console Functions

`browser_network_requests`

Get all network requests made since page load.

Parameters: None

Example:

await client.callTool({
  name: 'browser_network_requests',
  arguments: {}
});

`browser_console_messages`

Get all console messages from the page.

Parameters: None

Example:

await client.callTool({
  name: 'browser_console_messages',
  arguments: {}
});

Chrome DevTools Protocol (CDP) Functions

`browser_connect_cdp`

Connect to an existing browser instance via CDP.

Parameters:

endpoint (string, required): CDP endpoint URL
timeout (number, optional): Connection timeout in milliseconds (default: 30000)

Example:

await client.callTool({
  name: 'browser_connect_cdp',
  arguments: {
    endpoint: 'ws://localhost:9222',
    timeout: 30000
  }
});

`browser_get_cdp_endpoints`

Discover available CDP endpoints from running browsers.

Parameters:

port (number, optional): CDP port to check (default: 9222)
host (string, optional): Host to check (default: 'localhost')

Example:

await client.callTool({
  name: 'browser_get_cdp_endpoints',
  arguments: {
    port: 9222,
    host: 'localhost'
  }
});

`browser_disconnect_cdp`

Disconnect from the current CDP connection.

Parameters: None

Example:

await client.callTool({
  name: 'browser_disconnect_cdp',
  arguments: {}
});

JavaScript Evaluation Functions

`browser_evaluate`

Execute JavaScript code in the browser context.

Parameters:

script (string, required): JavaScript code to execute

Example:

await client.callTool({
  name: 'browser_evaluate',
  arguments: {
    script: 'document.title'
  }
});

Dialog Handling Functions

`browser_handle_dialog`

Handle browser dialogs (alert, confirm, prompt).

Parameters:

action (string, required): Action to take ('accept' or 'dismiss')
text (string, optional): Text to enter for prompt dialogs

Example:

await client.callTool({
  name: 'browser_handle_dialog',
  arguments: {
    action: 'accept',
    text: 'User input'
  }
});

File Functions

`browser_upload_file`

Upload files to file input elements.

Parameters:

element (string, required): Human-readable element description
ref (string, required): Element reference from page snapshot
files (array, required): Array of file paths to upload

Example:

await client.callTool({
  name: 'browser_upload_file',
  arguments: {
    element: 'File upload input',
    ref: 'file-input-123',
    files: ['/path/to/document.pdf']
  }
});

PDF Functions

`browser_save_pdf`

Save the current page as a PDF.

Parameters:

filename (string, optional): Custom filename for the PDF
format (string, optional): Page format (e.g., 'A4', 'Letter')
landscape (boolean, optional): Use landscape orientation

Example:

await client.callTool({
  name: 'browser_save_pdf',
  arguments: {
    filename: 'report.pdf',
    format: 'A4',
    landscape: false
  }
});

Configuration

The MCP server can be configured with various options:

const config = {
  browser: {
    headless: false,
    viewport: { width: 1280, height: 720 },
    cdpEndpoint: 'ws://localhost:9222' // Optional CDP connection
  },
  capabilities: ['core', 'vision'] // Enable specific tool capabilities
};

const server = await createConnection(config);

Common Usage Patterns

Web Scraping Workflow

// Navigate to page
await client.callTool({
  name: 'browser_navigate',
  arguments: { url: 'https://example.com' }
});

// Take snapshot to analyze page structure
await client.callTool({
  name: 'browser_snapshot',
  arguments: {}
});

// Extract data from elements
await client.callTool({
  name: 'browser_get_text',
  arguments: {
    element: 'Product title',
    ref: 'product-title-123'
  }
});

Form Automation Workflow

// Navigate to form page
await client.callTool({
  name: 'browser_navigate',
  arguments: { url: 'https://example.com/contact' }
});

// Fill form fields
await client.callTool({
  name: 'browser_type',
  arguments: {
    element: 'Name input',
    ref: 'name-input-123',
    text: 'John Doe'
  }
});

// Submit form
await client.callTool({
  name: 'browser_click',
  arguments: {
    element: 'Submit button',
    ref: 'submit-btn-456'
  }
});

CDP Integration Workflow

// Connect to existing browser
await client.callTool({
  name: 'browser_connect_cdp',
  arguments: { endpoint: 'ws://localhost:9222' }
});

// Work with existing tabs
await client.callTool({
  name: 'browser_tab_list',
  arguments: {}
});

// Disconnect when done
await client.callTool({
  name: 'browser_disconnect_cdp',
  arguments: {}
});

Troubleshooting

CDP Connection Issues

If you encounter CDP connection problems:

Start Chrome with debugging enabled:

google-chrome --remote-debugging-port=9222 --remote-debugging-address=0.0.0.0

Verify the endpoint is accessible:

curl http://localhost:9222/json/version

Check for firewall or network restrictions

Browser Launch Issues

Ensure Playwright browsers are installed: npx playwright install
Check system dependencies: npx playwright install-deps
Verify sufficient system resources for browser instances

License

Apache License 2.0 - see LICENSE file for details.

Contributing

Contributions are welcome! Please see the contributing guidelines for more information.

Support

For issues and questions:

GitHub Issues: https://github.com/macacoai/mcp-playwright/issues
Email: gaston@macaco.ai

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

E2B

Using MCP to run code via e2b.

Official

Featured