diffbot-mcp
A Diffbot MCP server that provides tools for extracting structured web content, searching the web with accuracy-focused ranking, and enhancing entity data from the Diffbot Knowledge Graph.
README
Diffbot MCP Server
A Diffbot MCP server with a variety of helpful web data handling tools for your agent or LLM pipeline.
What is Diffbot? Diffbot is a small research company building AI that structure the web. Our products connect apps to the structured web automatically, like an API for all of the public web.
Tools
🧬 extract
Web fetch tool alternative. Conventional fetch tools return either 500k+ tokens of HTML or a markdown approximation of the page's content. Extract classifies the page it is fetching and extracts content into a meaningful and consistent JSON structure. Responses are optimized for minimal token usage while maintaining data fidelity. Powered by Diffbot Extract API.
🔎 search_web
Web search tool that ranks accuracy, not popularity. Built on Diffbot's first party web index. Ranking model heavily favors primary sources over secondary sources (e.g. press release > news piece on said press release). Returns markdown formatted content chunks for source citation. Optimized token management built-in.
🪄 enhance
Web lookup tool for organizations and people. Unlike web search, enhance will return structured data from the Diffbot Knowledge Graph on an organization or person (by name, url, or other acceptable input). Helpful for verifying facts, market research, or sales intelligence workflows.
Authentication
A free Diffbot token is required for tool use. Get one here.
Get Started
Quick Start
The easiest way to get started is to connect your MCP client directly to the remote MCP server hosted by Diffbot.
https://mcp.diffbot.com/mcp?token=<YOUR_DIFFBOT_TOKEN>
This repo deploys directly to the remote server.
Local Installation
Clone the repository
git clone git@github.com:diffbot/diffbot-mcp.git
Install requirements
pip install .
You can now run the server with python3 diffbot_mcp_server.py, or skip this step if you will be setting this server up with an MCP client.
(Optional) Docker Install
Build the image
docker build -t diffbot-mcp .
Setup with MCP Clients
To setup Claude Desktop, follow the official installation guide and use this example claude_desktop_config.json configuration (replace variables as needed).
This standard config works with most IDEs and environments.
{
"mcpServers": {
"diffbot-mcp": {
"command": "python3",
"args": [
"/<YOUR_PROJECT_FOLDER>/diffbot-mcp/diffbot_mcp_server.py"
],
"env": {
"DIFFBOT_TOKEN": "<YOUR_DIFFBOT_TOKEN>"
}
}
}
}
For Docker installs, follow this configuration.
{
"mcpServers": {
"diffbot-mcp": {
"command": "docker",
"args": [
"run",
"--rm",
"-i",
"-e", "DIFFBOT_TOKEN=<YOUR_DIFFBOT_TOKEN>",
"diffbot-mcp"
]
}
}
}
<details> <summary>Visual Studio Code / Copilot</summary>
For remote setup, simply enter the remote server URL and include "http" as the transport type.
{
"servers": {
"diffbot-mcp": {
"type": "http",
"url": "https://mcp.diffbot.com/mcp/?token=<YOUR_DIFFBOT_TOKEN>"
}
}
}
For local setup, try this modified config for better secrets management.
{
"servers": {
"diffbot-mcp": {
"command": "python3",
"args": [
"/<YOUR_PROJECT_FOLDER>/diffbot-mcp/diffbot_mcp_server.py"
],
"env": {
"DIFFBOT_TOKEN": "${input:diffbot-api-key}"
}
}
},
"inputs": [
{
"password": true,
"id": "diffbot-api-key",
"type": "promptString",
"description": "Diffbot API Key"
}
]
}
</details>
Help & Support
This project is built and maintained by @jeromechoo. For support, file an issue and he'll get back to you as soon as he can. For faster answers, write to sales@diffbot.com 😜.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.