free-web-search-ultimate
Universal Search-First Knowledge Acquisition Plugin for LLMs. Enables real-time web search and deep page browsing via MCP or CLI. Zero-cost, privacy-first, supports DuckDuckGo, Bing, Google, Brave, Wikipedia, Arxiv, YouTube, Reddit and more.
README
<div align="center"> <img src="assets/banner.png" alt="Cross-Validated Search" width="100%"/> <h1>π Cross-Validated Search</h1> <p><strong>The Only Search Skill That Prevents LLM Hallucinations</strong></p> <p><em>Multi-source verification. Zero hallucinations. Free forever.</em></p>
<br/>
Stop hallucinations. Start verification.
Every LLM hallucinates factsβClaude, GPT-4, Gemini, Llamaβall of them. This plugin introduces Cross-Validated Search: every claim is verified against multiple independent sources before being presented as fact.
ποΈ Architecture
<div align="center"> <img src="assets/architecture.png" alt="Architecture Diagram" width="90%"/> </div>
One plugin, every ecosystem. Whether you use Claude Desktop, Cursor, OpenClaw, or a custom LangChain agent, this plugin connects your LLM to verified facts through cross-validation.
π The Cross-Validation Paradigm
| Old Paradigm (Standard LLM) | New Paradigm (Cross-Validated Search) |
|---|---|
| Answers from training data | Answers from real-time web search |
| May hallucinate facts | Cross-validates facts across sources |
| Single knowledge source | Multiple independent sources |
| No confidence score | Confidence score per fact: β π’ π‘ π΄ |
| User must trust blindly | Cites all sources for verification |
When this plugin is installed, the AI agent:
- Never Claims Unverified Facts β Every factual claim is checked against multiple sources
- Cross-Validates β Facts must appear in 2+ sources to be marked as verified
- Assigns Confidence β Each fact gets a confidence score based on source agreement
- Cites All Sources β Every claim comes with verifiable URLs
π¦ Installation
pip install cross-validated-search
Requirements: Python 3.10+
Or install from source:
git clone https://github.com/wd041216-bit/cross-validated-search.git
cd cross-validated-search
pip install -e .
π Enhanced Version (with API Providers)
Need more search providers? Check out the with-api-providers branch:
# Install enhanced version with Tavily support
pip install git+https://github.com/wd041216-bit/cross-validated-search@with-api-providers
# Or with Tavily extra
pip install "cross-validated-search[tavily] @ git+https://github.com/wd041216-bit/cross-validated-search@with-api-providers"
Enhanced features:
- π Tavily API β Premium search quality (requires
TAVILY_API_KEY) - π Parallel execution β Multiple engines running concurrently
- π Better relevance β Advanced search depth options
Note: The enhanced version requires API keys. The main branch remains free and keyless.
π Integration Guide
Claude Desktop & Cursor (via MCP)
Add to your claude_desktop_config.json or Cursor MCP settings:
{
"mcpServers": {
"cross-validated-search": {
"command": "cross-validated-mcp",
"args": []
}
}
}
OpenClaw (via CLI-Anything)
# Install β the skill is auto-discovered from the bundled SKILL.md
pip install cross-validated-search
LangChain / Custom Agents
from langchain.tools import Tool
import subprocess, json
def cross_validate_search(query: str) -> str:
result = subprocess.run(
["cross-validate", query, "--json"],
capture_output=True, text=True
)
data = json.loads(result.stdout)
return data.get("answer", "No results found.")
search_tool = Tool(
name="cross_validate_search",
func=cross_validate_search,
description="Search the web with cross-validation. Every fact is verified against multiple sources."
)
OpenAI Function Calling
tools = [
{
"type": "function",
"function": {
"name": "cross_validate_search",
"description": "Search the web with multi-source cross-validation. Prevents hallucinations by verifying facts across independent sources.",
"parameters": {
"type": "object",
"properties": {
"query": {"type": "string", "description": "The search query"},
"type": {"type": "string", "enum": ["text", "news", "images", "videos", "books"], "default": "text"}
},
"required": ["query"]
}
}
}
]
π» CLI Usage
cross-validate β Cross-Validated Web Search
# General knowledge (3+ sources, cross-validated)
cross-validate "What is the population of Tokyo?"
# Breaking news (multiple news sources)
cross-validate "OpenAI GPT-5" --type news --timelimit w
# Images (verified sources)
cross-validate "neural network diagram" --type images
# Chinese search
cross-validate "δΊΊε·₯ζΊθ½ζζ°θΏε±" --region zh-cn
# JSON output for programmatic use
cross-validate "quantum computing" --json
browse-page β Deep Page Reading
# Read full page content
browse-page "https://arxiv.org/abs/2303.08774"
# JSON output
browse-page "https://example.com/article" --json
π― Confidence Scoring System
| Score | Meaning | When to Use |
|---|---|---|
| β Verified | 3+ sources agree, high authority | Cite as fact |
| π’ Likely True | 2 sources agree, medium confidence | Cite with confidence note |
| π‘ Uncertain | Single source or minor conflicts | Flag as unverified |
| π΄ Likely False | Major contradictions or no sources | Do not use |
π Why This Over Alternatives?
| Feature | Cross-Validated Search | Tavily API | Serper API | Bing Search API |
|---|---|---|---|---|
| Cost | Free | $0.01/req | $0.001/req | $3/1000 req |
| Cross-Validation | Yes | No | No | No |
| Confidence Score | Yes | No | No | No |
| Hallucination Prevention | Yes | No | No | No |
| API Key Required | No | Yes | Yes | Yes |
| MCP Support | Yes | Partial | No | No |
| CLI-Anything | Yes | No | No | No |
π License
MIT License β free for personal and commercial use.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.