diffbot-mcp

diffbot-mcp

A Diffbot MCP server that provides tools for extracting structured web content, searching the web with accuracy-focused ranking, and enhancing entity data from the Diffbot Knowledge Graph.

Category
Visit Server

README

Diffbot MCP Server

A Diffbot MCP server with a variety of helpful web data handling tools for your agent or LLM pipeline.

What is Diffbot? Diffbot is a small research company building AI that structure the web. Our products connect apps to the structured web automatically, like an API for all of the public web.

Tools

🧬 extract

Web fetch tool alternative. Conventional fetch tools return either 500k+ tokens of HTML or a markdown approximation of the page's content. Extract classifies the page it is fetching and extracts content into a meaningful and consistent JSON structure. Responses are optimized for minimal token usage while maintaining data fidelity. Powered by Diffbot Extract API.

🔎 search_web

Web search tool that ranks accuracy, not popularity. Built on Diffbot's first party web index. Ranking model heavily favors primary sources over secondary sources (e.g. press release > news piece on said press release). Returns markdown formatted content chunks for source citation. Optimized token management built-in.

🪄 enhance

Web lookup tool for organizations and people. Unlike web search, enhance will return structured data from the Diffbot Knowledge Graph on an organization or person (by name, url, or other acceptable input). Helpful for verifying facts, market research, or sales intelligence workflows.

Authentication

A free Diffbot token is required for tool use. Get one here.

Get Started

Quick Start

The easiest way to get started is to connect your MCP client directly to the remote MCP server hosted by Diffbot.

https://mcp.diffbot.com/mcp?token=<YOUR_DIFFBOT_TOKEN>

This repo deploys directly to the remote server.

Local Installation

Clone the repository

git clone git@github.com:diffbot/diffbot-mcp.git

Install requirements

pip install .

You can now run the server with python3 diffbot_mcp_server.py, or skip this step if you will be setting this server up with an MCP client.

(Optional) Docker Install

Build the image

docker build -t diffbot-mcp .

Setup with MCP Clients

To setup Claude Desktop, follow the official installation guide and use this example claude_desktop_config.json configuration (replace variables as needed).

This standard config works with most IDEs and environments.

{
    "mcpServers": {
        "diffbot-mcp": {
            "command": "python3",
            "args": [
                "/<YOUR_PROJECT_FOLDER>/diffbot-mcp/diffbot_mcp_server.py"
            ],
            "env": {
                "DIFFBOT_TOKEN": "<YOUR_DIFFBOT_TOKEN>"
            }
        }
    }
}

For Docker installs, follow this configuration.

{
    "mcpServers": {
        "diffbot-mcp": {
            "command": "docker",
            "args": [
                "run",
                "--rm",
                "-i",
                "-e", "DIFFBOT_TOKEN=<YOUR_DIFFBOT_TOKEN>",
                "diffbot-mcp"
            ]
        }
    }
}

<details> <summary>Visual Studio Code / Copilot</summary>

For remote setup, simply enter the remote server URL and include "http" as the transport type.

{
  "servers": {
    "diffbot-mcp": {
      "type": "http",
      "url": "https://mcp.diffbot.com/mcp/?token=<YOUR_DIFFBOT_TOKEN>"
    }
  }
}

For local setup, try this modified config for better secrets management.

{
	"servers": {
        "diffbot-mcp": {
            "command": "python3",
            "args": [
                "/<YOUR_PROJECT_FOLDER>/diffbot-mcp/diffbot_mcp_server.py"
            ],
            "env": {
                "DIFFBOT_TOKEN": "${input:diffbot-api-key}"
            }
        }
	},
	"inputs": [
		{
			"password": true,
			"id": "diffbot-api-key",
			"type": "promptString",
			"description": "Diffbot API Key"
		}
	]
}

</details>

Help & Support

This project is built and maintained by @jeromechoo. For support, file an issue and he'll get back to you as soon as he can. For faster answers, write to sales@diffbot.com 😜.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured