MinionWorks – Modular browser agents that work for bananas 🍌


<!-- Light/Dark Mode Banner Hack --> <p align="center"> <img src="static/minion-works-cover-light.png#gh-light-mode-only" alt="Minion Works" width="100%" /> <img src="static/minion-works-cover-dark.png#gh-dark-mode-only" alt="Minion Works" width="100%" /> </p>

<h1 align="center"> MinionWorks – Modular browser agents that work for bananas 🍌</h1>

<p align="center"> <em>Modular. Extensible. AI-native browser agents for modern web automation.</em> </p>


🚀 Overview

Minion Works is a modular AI agent framework that connects to your browser and executes complex tasks autonomously. Built for developers, researchers, and curious builders.

✨ Features

  • 🌐 Perform Google searches and scrape content
  • 🤖 Use LLMs (like GPT-4) to plan actions
  • 🔗 Modular architecture for plug-and-play use cases
  • 🔎 DOM interaction & content extraction
  • 🔄 Run workflows via Python or UI

🛠️ Installation

  1. Install the package

    pip install minion-agent
    
  2. Set up environment variables

    cp .env.example .env
    # Edit your .env file with OpenAI or other API keys
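
Before starting an agent, it can help to fail fast when a key is missing. The sketch below uses only the standard library. `missing_env_vars` is a hypothetical helper, not part of minion-agent, and `OPENAI_API_KEY` is the variable that `langchain-openai` reads by default; other providers would need their own variable names.

```python
import os
from typing import Iterable, List, Mapping, Optional


def missing_env_vars(
    required: Iterable[str], env: Optional[Mapping[str, str]] = None
) -> List[str]:
    """Return the names in `required` that are unset or blank in `env`.

    Hypothetical helper for illustration; defaults to the process environment.
    """
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name, "").strip()]


# Demonstration with explicit mappings, so no real keys are needed.
print(missing_env_vars(["OPENAI_API_KEY"], {"OPENAI_API_KEY": "sk-example"}))  # → []
print(missing_env_vars(["OPENAI_API_KEY"], {"OPENAI_API_KEY": "  "}))  # → ['OPENAI_API_KEY']
```

In a real script, calling `load_dotenv()` first and then `missing_env_vars(["OPENAI_API_KEY"])` would check the values loaded from `.env`.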
    

🧪 Quick Start

Here’s a complete example using MinionAgent with langchain-openai:

import asyncio

from dotenv import load_dotenv
from langchain_openai import ChatOpenAI
from minion_agent.browser import MinionAgent

# Load environment variables from .env file
load_dotenv()

async def main():
    # Example 1: Using environment variables for OpenAI configuration
    agent1 = MinionAgent(
        task="Compare the price of GPT-4 and DeepSeek-V3",
        llm=ChatOpenAI(model="gpt-4o"),
        headless=True
    )
    result1 = await agent1.run()
    print("Result 1:", result1)

    # Example 2: Same LLM, but with a visible browser window (headless=False)
    agent2 = MinionAgent(
        task="Search for the latest news about AI",
        llm=ChatOpenAI(model="gpt-4o"),
        headless=False  # Show browser window
    )
    result2 = await agent2.run()
    print("Result 2:", result2)

if __name__ == "__main__":
    asyncio.run(main())
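
Browser tasks can run for a long time, so one option is to bound each run with a timeout. The sketch below assumes only what the example above shows, namely that `run()` is awaitable. `run_with_timeout` and `FakeAgent` are hypothetical names: `FakeAgent` stands in for `MinionAgent` so the snippet runs without a browser or API key.

```python
import asyncio


async def run_with_timeout(agent, timeout_s: float):
    """Await agent.run(), returning None if it takes longer than timeout_s seconds."""
    try:
        return await asyncio.wait_for(agent.run(), timeout=timeout_s)
    except asyncio.TimeoutError:
        # wait_for cancels the pending run() before raising.
        return None


class FakeAgent:
    """Hypothetical stand-in for MinionAgent; sleeps, then returns a canned answer."""

    def __init__(self, delay_s: float, answer: str):
        self.delay_s = delay_s
        self.answer = answer

    async def run(self) -> str:
        await asyncio.sleep(self.delay_s)
        return self.answer


async def demo():
    fast = await run_with_timeout(FakeAgent(0.01, "done"), timeout_s=1.0)
    slow = await run_with_timeout(FakeAgent(1.0, "late"), timeout_s=0.05)
    return fast, slow


if __name__ == "__main__":
    print(asyncio.run(demo()))  # → ('done', None)
```

Whether a real agent cleans up its browser session when cancelled is not stated in this README, so treat the timeout as a guard for scripts rather than a guaranteed clean shutdown.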

🧠 Example Use Case

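# Run inside an async function, e.g. main() from the Quick Start above,
# with the same ChatOpenAI and MinionAgent imports.
agent = MinionAgent(
    task="Find the top 3 ML conferences in 2025 and summarize each.",
    llm=ChatOpenAI(model="gpt-4")
)
result = await agent.run()
print(result)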

🧪 Testing

pytest --maxfail=1 --disable-warnings -q

Ensure you’re in the root folder where tests/ lives.
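
For contributors adding tests, the sketch below shows one way to exercise async agent code under plain pytest without a browser or API key. `StubAgent` and `summarize_run` are hypothetical names and not part of minion-agent; the only assumption carried over from this README is that `run()` is awaitable.

```python
import asyncio


class StubAgent:
    """Hypothetical stand-in for MinionAgent; returns a canned result."""

    def __init__(self, task: str, canned: str):
        self.task = task
        self.canned = canned

    async def run(self) -> str:
        return self.canned


async def summarize_run(agent) -> dict:
    """Hypothetical helper under test: run an agent and package the outcome."""
    result = await agent.run()
    return {"task": agent.task, "result": result, "ok": bool(result)}


def test_summarize_run_reports_success():
    # asyncio.run lets a synchronous pytest test drive a coroutine
    # without needing an async test plugin.
    agent = StubAgent(task="Search for the latest news about AI", canned="3 headlines")
    summary = asyncio.run(summarize_run(agent))
    assert summary == {
        "task": "Search for the latest news about AI",
        "result": "3 headlines",
        "ok": True,
    }
```

Saved as a file such as `tests/test_example.py` (an illustrative path), this would be collected by the pytest command above.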


🀝 Contributing

We welcome PRs, feedback, and creative ideas!

  1. Fork β†’ Branch β†’ Commit
  2. Add tests
  3. Submit a Pull Request
  4. Tell your friends πŸš€

📖 Citation

@software{minion_works2025,
  author = {Sairaam and Aman and Cheena},
  title = {Minion Works: Let AI take the helm of your browser.},
  year = {2025},
  publisher = {GitHub},
  url = {https://github.com/minionworks/minions}
}
