free-web-search-ultimate

free-web-search-ultimate

Universal Search-First Knowledge Acquisition Plugin for LLMs. Enables real-time web search and deep page browsing via MCP or CLI. Zero-cost, privacy-first, supports DuckDuckGo, Bing, Google, Brave, Wikipedia, Arxiv, YouTube, Reddit and more.

Category
Visit Server

README

<div align="center"> <img src="assets/banner.png" alt="Cross-Validated Search" width="100%"/> <h1>πŸ” Cross-Validated Search</h1> <p><strong>The Only Search Skill That Prevents LLM Hallucinations</strong></p> <p><em>Multi-source verification. Zero hallucinations. Free forever.</em></p>

PyPI version Python Version MCP Ready CLI-Anything OpenClaw License </div>

<br/>

Stop hallucinations. Start verification.

Every LLM hallucinates factsβ€”Claude, GPT-4, Gemini, Llamaβ€”all of them. This plugin introduces Cross-Validated Search: every claim is verified against multiple independent sources before being presented as fact.

πŸ—οΈ Architecture

<div align="center"> <img src="assets/architecture.png" alt="Architecture Diagram" width="90%"/> </div>

One plugin, every ecosystem. Whether you use Claude Desktop, Cursor, OpenClaw, or a custom LangChain agent, this plugin connects your LLM to verified facts through cross-validation.

🌟 The Cross-Validation Paradigm

Old Paradigm (Standard LLM) New Paradigm (Cross-Validated Search)
Answers from training data Answers from real-time web search
May hallucinate facts Cross-validates facts across sources
Single knowledge source Multiple independent sources
No confidence score Confidence score per fact: βœ… 🟒 🟑 πŸ”΄
User must trust blindly Cites all sources for verification

When this plugin is installed, the AI agent:

  1. Never Claims Unverified Facts β€” Every factual claim is checked against multiple sources
  2. Cross-Validates β€” Facts must appear in 2+ sources to be marked as verified
  3. Assigns Confidence β€” Each fact gets a confidence score based on source agreement
  4. Cites All Sources β€” Every claim comes with verifiable URLs

πŸ“¦ Installation

pip install cross-validated-search

Requirements: Python 3.10+

Or install from source:

git clone https://github.com/wd041216-bit/cross-validated-search.git
cd cross-validated-search
pip install -e .

πŸš€ Enhanced Version (with API Providers)

Need more search providers? Check out the with-api-providers branch:

# Install enhanced version with Tavily support
pip install git+https://github.com/wd041216-bit/cross-validated-search@with-api-providers

# Or with Tavily extra
pip install "cross-validated-search[tavily] @ git+https://github.com/wd041216-bit/cross-validated-search@with-api-providers"

Enhanced features:

  • πŸ” Tavily API β€” Premium search quality (requires TAVILY_API_KEY)
  • πŸ”„ Parallel execution β€” Multiple engines running concurrently
  • πŸ“Š Better relevance β€” Advanced search depth options

Note: The enhanced version requires API keys. The main branch remains free and keyless.

πŸ”Œ Integration Guide

Claude Desktop & Cursor (via MCP)

Add to your claude_desktop_config.json or Cursor MCP settings:

{
  "mcpServers": {
    "cross-validated-search": {
      "command": "cross-validated-mcp",
      "args": []
    }
  }
}

OpenClaw (via CLI-Anything)

# Install β€” the skill is auto-discovered from the bundled SKILL.md
pip install cross-validated-search

LangChain / Custom Agents

from langchain.tools import Tool
import subprocess, json

def cross_validate_search(query: str) -> str:
    result = subprocess.run(
        ["cross-validate", query, "--json"],
        capture_output=True, text=True
    )
    data = json.loads(result.stdout)
    return data.get("answer", "No results found.")

search_tool = Tool(
    name="cross_validate_search",
    func=cross_validate_search,
    description="Search the web with cross-validation. Every fact is verified against multiple sources."
)

OpenAI Function Calling

tools = [
    {
        "type": "function",
        "function": {
            "name": "cross_validate_search",
            "description": "Search the web with multi-source cross-validation. Prevents hallucinations by verifying facts across independent sources.",
            "parameters": {
                "type": "object",
                "properties": {
                    "query": {"type": "string", "description": "The search query"},
                    "type": {"type": "string", "enum": ["text", "news", "images", "videos", "books"], "default": "text"}
                },
                "required": ["query"]
            }
        }
    }
]

πŸ’» CLI Usage

cross-validate β€” Cross-Validated Web Search

# General knowledge (3+ sources, cross-validated)
cross-validate "What is the population of Tokyo?"

# Breaking news (multiple news sources)
cross-validate "OpenAI GPT-5" --type news --timelimit w

# Images (verified sources)
cross-validate "neural network diagram" --type images

# Chinese search
cross-validate "δΊΊε·₯ζ™Ίθƒ½ζœ€ζ–°θΏ›ε±•" --region zh-cn

# JSON output for programmatic use
cross-validate "quantum computing" --json

browse-page β€” Deep Page Reading

# Read full page content
browse-page "https://arxiv.org/abs/2303.08774"

# JSON output
browse-page "https://example.com/article" --json

🎯 Confidence Scoring System

Score Meaning When to Use
βœ… Verified 3+ sources agree, high authority Cite as fact
🟒 Likely True 2 sources agree, medium confidence Cite with confidence note
🟑 Uncertain Single source or minor conflicts Flag as unverified
πŸ”΄ Likely False Major contradictions or no sources Do not use

πŸ† Why This Over Alternatives?

Feature Cross-Validated Search Tavily API Serper API Bing Search API
Cost Free $0.01/req $0.001/req $3/1000 req
Cross-Validation Yes No No No
Confidence Score Yes No No No
Hallucination Prevention Yes No No No
API Key Required No Yes Yes Yes
MCP Support Yes Partial No No
CLI-Anything Yes No No No

πŸ“„ License

MIT License β€” free for personal and commercial use.

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured