jcrawl4ai-mcp-server

jcrawl4ai-mcp-server

jcrawl4ai-mcp-server

Category
Visit Server

README

jcrawl4ai-mcp-server

Java implementation of MCP Server for interacting with Crawl4ai API.

Project Overview

jcrawl4ai-mcp-server is a Spring Boot-based MCP server that interacts with the Crawl4ai API to perform web crawling. The main functionalities include:

  • Crawling specified URLs using a given strategy, maximum depth, and output format.
  • Getting the crawl result by a given task ID.

Configuration

application.properties

Configure the following properties in the src/main/resources/application.properties file:

  • cawl4ai.base-url: Base URL of the Crawl4ai server.
  • cawl4ai.api-token: API token for the Crawl4ai server.

Example configuration:

cawl4ai.base-url=http://your-cral4ai-server-url:11235
cawl4ai.api-token=your-api-token

Dependencies

The project depends on the following libraries:

  • Spring AI MCP Server
  • Spring Boot
  • Hutool

Running the Project

Build and run the project using Maven:

mvn clean install
java -jar target/jcawl4ai-mcp-server-1.0.0.jar

APIs

Crawl4aiApi

crawl Method

  • Description: Call the Crawl4ai API to crawl the specified URLs.
  • Parameters:
    • urls: Array of target website URLs.
    • strategy: Crawl strategy.
    • max_depth: Maximum depth.
    • output_format: Output format.
  • Return Value: JSON string of the crawl result.

task Method

  • Description: Get the crawl result by a given task ID.
  • Parameters:
    • taskId: Task ID.
  • Return Value: JSON string of the crawl result.

Logging

Log file path: ./target/mcp-stdio-server.log.

MCP Server Configuration

{
  "mcpServers": {
    "jcawl4ai-mcp-server": {
      "autoApprove": [
        "crawl",
        "task"
      ],
      "disabled": false,
      "timeout": 60,
      "command": "java",
      "args": [
        "-jar",
        "/path/to/your/jar/file/jcawl4ai-mcp-server-1.0.0.jar"
      ],
      "transportType": "stdio"
    }
  }
}

Contact

If you have any questions or suggestions, please contact Ken Ye.


jcrawl4ai-mcp-server

Java 实现的 MCP 服务器,用于与 Crawl4ai API 进行交互。

项目概述

jcrawl4ai-mcp-server 是一个基于 Spring Boot 的 MCP 服务器,用于调用 Crawl4ai API 进行网页爬取。该项目的主要功能包括:

  • 使用指定的策略、最大深度和输出格式对给定的 URL 进行爬取。
  • 根据给定的任务 ID 获取爬取结果。

配置

application.properties

src/main/resources/application.properties 文件中配置以下属性:

  • cawl4ai.base-url:Crawl4ai 服务器的基础 URL。
  • cawl4ai.api-token:Crawl4ai 服务器的 API 令牌。

示例配置:

cawl4ai.base-url=http://your-cral4ai-server-url:11235
cawl4ai.api-token=your-api-token

依赖

项目依赖于以下库:

  • Spring AI MCP Server
  • Spring Boot
  • Hutool

启动

使用 Maven 构建并运行项目:

mvn clean install
java -jar target/jcawl4ai-mcp-server-1.0.0.jar

接口

Crawl4aiApi

crawl 方法

  • 描述:调用 Crawl4ai API 爬取指定的 URL。
  • 参数
    • urls:目标网站的 URL 数组。
    • strategy:爬取策略。
    • max_depth:最大深度。
    • output_format:输出格式。
  • 返回值:爬取结果的 JSON 字符串。

task 方法

  • 描述:根据给定的任务 ID 获取爬取结果。
  • 参数
    • taskId:任务 ID。
  • 返回值:爬取结果的 JSON 字符串。

日志

日志文件路径为 ./target/mcp-stdio-server.log

MCP Server 配置

{
  "mcpServers": {
    "jcawl4ai-mcp-server": {
      "autoApprove": [
        "crawl",
        "task"
      ],
      "disabled": false,
      "timeout": 60,
      "command": "java",
      "args": [
        "-jar",
        "/path/to/your/jar/file/jcawl4ai-mcp-server-1.0.0.jar"
      ],
      "transportType": "stdio"
    }
  }
}

联系

如果您有任何问题或建议,请联系 Ken Ye

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured