Scraper.is

Scraper.is

Contribute to Ai-Quill/scraperis-mcp development by creating an account on GitHub.

Ai-Quill

Content Fetching
Database Interaction
AI Content Generation
Visit Server

README

Scraper.is MCP

npm version License: MIT

A Model Context Protocol (MCP) integration for Scraper.is - A powerful web scraping tool for AI assistants.

This package allows AI assistants to scrape web content through the MCP protocol, enabling them to access up-to-date information from the web.

Features

  • 🌐 Web Scraping: Extract content from any website
  • 📸 Screenshots: Capture visual representations of web pages
  • 📄 Multiple Formats: Get content in markdown, HTML, or JSON
  • 🔄 Progress Updates: Real-time progress reporting during scraping operations
  • 🔌 MCP Integration: Seamless integration with MCP-compatible AI assistants

Installation

npm install -g scraperis-mcp

Or with yarn:

yarn global add scraperis-mcp

Prerequisites

You need a Scraper.is API key to use this package.

Getting Your API Key

  1. Sign up or log in at scraper.is
  2. Navigate to the API Keys section in your dashboard: https://www.scraper.is/dashboard/apikeys
  3. Create a new API key or copy your existing key
  4. Store this key securely as you'll need it to use this package

Usage

Environment Setup

Create a .env file with your Scraper.is API key:

SCRAPERIS_API_KEY=your_api_key_here

Claude Desktop Integration

To use this package with Claude Desktop:

  1. Install the package globally:

    npm install -g scraperis-mcp
    
  2. Add the following configuration to your claude_desktop_config.json file:

    {
      "mcpServers": {
        "scraperis_scraper": {
          "command": "scraperis-mcp",
          "args": [],
          "env": {
            "SCRAPERIS_API_KEY": "your-api-key-here",
            "DEBUG": "*"
          }
        }
      }
    }
    
  3. Replace your-api-key-here with your actual Scraper.is API key.

  4. Restart Claude Desktop to apply the changes.

Running with MCP Inspector

For development and testing, you can use the MCP Inspector:

npx @modelcontextprotocol/inspector scraperis-mcp

Integration with AI Assistants

This package is designed to be used with AI assistants that support the Model Context Protocol (MCP). When properly configured, the AI assistant can use the following tools:

Scrape Tool

The scrape tool allows the AI to extract content from websites. It supports various formats:

  • markdown: Returns the content in markdown format
  • html: Returns the content in HTML format
  • screenshot: Returns a screenshot of the webpage
  • json: Returns structured data in JSON format

Example prompt for the AI:

Can you scrape the latest news from techcrunch.com and summarize it for me?

API Reference

Tools

scrape

Scrapes content from a webpage based on a prompt.

Parameters:

  • prompt (string): The prompt describing what to scrape, including the URL
  • format (string): The format to return the content in (markdown, html, screenshot, json, quick)

Example:

{
  "prompt": "Get me the top 10 products from producthunt.com",
  "format": "markdown"
}

Development

Setup

  1. Clone the repository:

    git clone https://github.com/Ai-Quill/scraperis-mcp.git
    cd scraperis-mcp
    
  2. Install dependencies:

    npm install
    
  3. Build the project:

    npm run build
    

Scripts

  • npm run build: Build the project
  • npm run watch: Watch for changes and rebuild
  • npm run dev: Run with MCP Inspector for development
  • npm run test: Run tests
  • npm run lint: Run ESLint

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgements

Recommended Servers

Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Mult Fetch MCP Server

Mult Fetch MCP Server

A versatile MCP-compliant web content fetching tool that supports multiple modes (browser/node), formats (HTML/JSON/Markdown/Text), and intelligent proxy detection, with bilingual interface (English/Chinese).

Featured
Local
AIO-MCP Server

AIO-MCP Server

🚀 All-in-one MCP server with AI search, RAG, and multi-service integrations (GitLab/Jira/Confluence/YouTube) for AI-enhanced development workflows. Folk from

Featured
Local
Persistent Knowledge Graph

Persistent Knowledge Graph

An implementation of persistent memory for Claude using a local knowledge graph, allowing the AI to remember information about users across conversations with customizable storage location.

Featured
Local
Hyperbrowser MCP Server

Hyperbrowser MCP Server

Welcome to Hyperbrowser, the Internet for AI. Hyperbrowser is the next-generation platform empowering AI agents and enabling effortless, scalable browser automation. Built specifically for AI developers, it eliminates the headaches of local infrastructure and performance bottlenecks, allowing you to

Featured
Local
React MCP

React MCP

react-mcp integrates with Claude Desktop, enabling the creation and modification of React apps based on user prompts

Featured
Local
Any OpenAI Compatible API Integrations

Any OpenAI Compatible API Integrations

Integrate Claude with Any OpenAI SDK Compatible Chat Completion API - OpenAI, Perplexity, Groq, xAI, PyroPrompts and more.

Featured
Exa MCP

Exa MCP

A Model Context Protocol server that enables AI assistants like Claude to perform real-time web searches using the Exa AI Search API in a safe and controlled manner.

Featured
AI 图像生成服务

AI 图像生成服务

可用于cursor 集成 mcp server

Featured