SlimAtlas AI
An MCP server for lightweight browser automation using Lightpanda with Chrome fallback, providing LLM-optimized accessibility tree snapshots for precise element targeting.
README
<div align="center"> <img src="img/slim-atlas-logo.svg" alt="SlimAtlas AI Logo" width="180"> </div>
<h1 align="center">SlimAtlas AI</h1>
<p align="center"> A Model Context Protocol (MCP) server for extremely lightweight browser automation. </p>
<p align="center"> Works with <strong>macOS</strong> and <strong>Linux</strong>. </p>
<p align="center"> Built on top of <a href="https://pptr.dev/">Puppeteer</a> and <a href="https://github.com/lightpanda-io/browser">Lightpanda</a>. </p>
Features
- Browser Automation: Navigate, click, type, fill forms, and evaluate JavaScript
- Page Snapshots: Get compact YAML accessibility tree snapshots with unique node IDs for precise element targeting
- LLM-Optimized Context: Snapshots are stripped to semantic essentials, keeping context usage tiny so you can fit more pages and longer sessions into the same window
- Node Inspection: View specific nodes by ID to inspect text content or images
- History Navigation: Go back, go forward, and reload pages
- Lightweight by Default: Uses Lightpanda browser (9x less memory than Chrome, 11x faster)
- Automatic Chrome Fallback: If Lightpanda crashes or fails, automatically switches to Chrome for reliability
- Session Management: Reuse sessions across multiple operations with unique session IDs
- Cross-Platform: Works on macOS and Linux (Lightpanda), with automatic Chrome fallback when needed
Installation
# Install dependencies
npm install
Lightpanda binary will be downloaded automatically on first run. You can also manually download it:
# Linux x86_64
curl -L -o lightpanda https://github.com/lightpanda-io/browser/releases/download/nightly/lightpanda-x86_64-linux && chmod a+x ./lightpanda
# macOS aarch64 (Apple Silicon)
curl -L -o lightpanda https://github.com/lightpanda-io/browser/releases/download/nightly/lightpanda-aarch64-macos && chmod a+x ./lightpanda
# macOS x86_64
curl -L -o lightpanda https://github.com/lightpanda-io/browser/releases/download/nightly/lightpanda-x86_64-macos && chmod a+x ./lightpanda
Chrome Fallback: If Lightpanda crashes or a page triggers bot detection, the server automatically switches to Chrome (via Puppeteer) for that session. Chrome/Chromium can be installed locally, or Puppeteer can download it automatically.
Usage
Run the MCP Server
# Run with npm
npm run dev
Configuration
Add to your MCP client configuration:
{
"mcpServers": {
"slimatlas": {
"command": "npx",
"args": ["tsx", "path/to/mcp/src/index.ts"]
}
}
}
Tip: Set CHROME_ENABLED=false to disable Chrome fallback and use Lightpanda only.
CLI Flags
Every environment variable in .env.example can also be passed as a lower-case CLI flag in --flag=value form. CLI flags override environment variables.
npx tsx src/index.ts --chrome-enabled=false --lightpanda-pool-size=3 --navigate-timeout=60000
To use flags from an MCP client, append them to the args array:
{
"mcpServers": {
"slimatlas": {
"command": "npx",
"args": [
"tsx",
"path/to/mcp/src/index.ts",
"--chrome-enabled=false",
"--lightpanda-pool-size=3"
]
}
}
}
Usage Workflow
# Example: Navigate, snapshot, and interact with a page
# 1. Navigate to a URL (creates a new session automatically)
result = mcp.call("browser_navigate", {"url": "https://example.com"})
# Returns: session_id: abc1, result: [lightpanda] Navigated to https://example.com. Title: Example Domain
# 2. Take a snapshot to see the page structure
snapshot = mcp.call("browser_snapshot", {"session_id": "abc1"})
# Returns YAML with node IDs like: 0: {type: div, children: ...}
# 3. Click a node by ID (from the snapshot)
mcp.call("browser_click", {"session_id": "abc1", "nodeId": 2})
# 4. Type into a search box
mcp.call("browser_type", {"session_id": "abc1", "nodeId": 5, "text": "search query"})
# 5. Close the session when done
mcp.call("browser_close", {"session_id": "abc1"})
Available Tools
| Tool | Description | Value |
|---|---|---|
browser_navigate |
Navigate to a URL with configurable wait strategy | Entry point for all web interactions. Supports load, domcontentloaded, networkidle0, networkidle2 |
browser_snapshot |
Get YAML accessibility tree with unique node IDs | Structured page representation ideal for LLM understanding. Node IDs enable precise targeting for clicks/types |
browser_view_node |
View specific node content by ID (text or image) | Inspect individual elements without full page re-read. Returns images as base64 for visual verification |
browser_click |
Click element by node ID or CSS selector | Node ID (from snapshot) is recommended over CSS selectors for reliability and simplicity |
browser_type |
Type text into element with optional keystroke delay | Simulates human typing. Use for search boxes, forms, and text inputs |
browser_fill |
Fill input element with a value instantly | Faster than browser_type for form fields. Clears existing value before filling |
browser_go_back |
Navigate back in browser history | Essential for multi-step workflows and correcting navigation mistakes |
browser_go_forward |
Navigate forward in browser history | Complements browser_go_back for bidirectional navigation |
browser_reload |
Reload the current page | Refresh dynamic content or recover from stale page state |
browser_get_page_info |
Get current page URL and title | Quick way to verify navigation success and current context |
browser_close |
Close browser session and free resources | Important for cleanup. Sessions auto-close on timeout, but explicit closing is recommended |
Running Tests
# Run all tests
npm test
# Run tests in watch mode
npx vitest
Requirements
- Node.js 18+
- Lightpanda browser (macOS/Linux, downloads automatically) or Chrome/Chromium (fallback)
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.