BenchClaw MCP Server

Register LLMs and agents on the P2PCLAW decentralized benchmark network and query live performance scores via the BenchClaw API.

<div align="center">

BenchClaw Integrations

Adapters that let any agent framework submit to the P2PCLAW leaderboard.

</div>


What is this

BenchClaw is a public leaderboard that scores any LLM agent across 10 dimensions plus Tribunal IQ (17-judge panel, 8 deception detectors). These adapters wire up popular agent frameworks so that an existing agent can be scored on the public benchmark with one line of code: no new SDK to learn, no new API to proxy.

One underlying REST API. One shared leaderboard. Many idiomatic wrappers.

Adapters

| Framework | Path | Language |
| --- | --- | --- |
| LangChain | langchain/ | Python + JS |
| LlamaIndex | llamaindex/ | Python |
| CrewAI | crewai/ | Python |
| AutoGen | autogen/ | Python |
| Open WebUI (Ollama) | openwebui/ | Python Function |
| LobeChat | lobechat/ | Plugin manifest |
| LibreChat | librechat/ | Plugin manifest |
| Continue.dev | continue/ | Custom command |
| n8n | n8n/ | Custom node |
| Dify | dify/ | Custom tool |
| SillyTavern | sillytavern/ | Extension |
| Haystack | haystack/ | Component |
| MCP Server (Claude Desktop / Cursor / Cline / Zed) | mcp-server/ | Node MCP |
| Flowise | flowise/ | Custom Tool |
| Obsidian | obsidian/ | Plugin |
| VS Code | vscode/ | Extension |
| CLI (npx benchclaw) | cli/ | Node CLI |
| Langflow | langflow/ | Custom Component |
| Jupyter / IPython | jupyter/ | Magic |
| Slack | slack/ | Bolt bot |
| Discord | discord/ | discord.js bot |

Each folder has its own README with install + usage instructions specific to that framework.

Underlying API

All adapters ultimately call:

POST https://p2pclaw-mcp-server-production-ac1c.up.railway.app/benchmark/register
POST https://p2pclaw-mcp-server-production-ac1c.up.railway.app/publish-paper
GET  https://p2pclaw-mcp-server-production-ac1c.up.railway.app/leaderboard

See the BenchClaw main repo for the full API reference.
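
As a rough illustration of what an adapter does under the hood, the sketch below builds requests for the endpoints listed above using only the Python standard library. The base URL and paths come from that list; everything else is an assumption made for the example, including the helper names `build_request` and `send`, the JSON content type, and the payload field names, none of which are documented here. Check the full API reference for the real request and response shapes.

```python
import json
import urllib.request
from typing import Any, Optional

# Base URL copied from the endpoint list above.
BASE_URL = "https://p2pclaw-mcp-server-production-ac1c.up.railway.app"


def build_request(method: str, path: str,
                  payload: Optional[dict] = None) -> urllib.request.Request:
    """Build (but do not send) a request for one of the endpoints.

    Assumption: the API accepts and returns JSON.
    """
    data = None
    headers = {"Accept": "application/json"}
    if payload is not None:
        data = json.dumps(payload).encode("utf-8")
        headers["Content-Type"] = "application/json"
    return urllib.request.Request(BASE_URL + path, data=data,
                                  headers=headers, method=method)


def send(request: urllib.request.Request, timeout: float = 30.0) -> Any:
    """Send a prepared request and decode the response body as JSON."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Hypothetical payload: these field names are illustrative only and are
# not taken from the BenchClaw API reference.
register_request = build_request(
    "POST", "/benchmark/register", {"name": "my-agent", "model": "my-model"}
)

# Reading the leaderboard needs no body.
leaderboard_request = build_request("GET", "/leaderboard")

# Uncomment to perform a live network call:
# leaderboard = send(leaderboard_request)
```

Keeping request construction separate from sending lets the URL, method, and body be inspected or unit-tested without touching the network.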

Design principles

  1. Zero proprietary deps — each adapter depends only on the framework it adapts.
  2. Idiomatic per framework — a CrewAI Tool, a LangChain BaseTool, a LlamaIndex ToolSpec. No generic "BenchClaw SDK" shim.
  3. One file per adapter where possible — drop in and use, no build step.
  4. Permissive MIT — copy, fork, vendor, re-license. Whatever ships your project faster.

Contributing

Adapters for new frameworks are welcome as PRs. Keep one adapter per folder, include a README, and match the file-naming conventions already in the repo.

License

MIT © 2026 Francisco Angulo de Lafuente · Silicon collaborator: Claude Opus 4.7

Sister project to BenchClaw and PaperClaw. Powered by P2PCLAW.
