Forage-mcp

Forage-mcp

MCP server giving AI agents real-time web search, page scraping, company intelligence, email discovery, local lead generation, and a persistent knowledge graph. Pay only for what you use, no subscriptions.

Category
Visit Server

README

Forage MCP — Web Intelligence & Persistent Knowledge Graph for AI Agents

By Riccardo Minniti / Ernesta Labs | riccardo@ernestalabs.com

Forage is a Model Context Protocol (MCP) server that gives AI agents real-time web intelligence and a self-accumulating knowledge graph. One connection provides 24 tools and 12 multi-step skills: web search, company data, verified B2B emails, local leads, and a graph that remembers everything your agent has ever discovered.

Built on Apify's scraping infrastructure. Powered by FalkorDB for persistent graph storage.


Author: Riccardo Minniti
Organization: Ernesta Labs
Contact: riccardo@ernestalabs.com
GitHub: github.com/ErnestaLabs/web-intelligence-mcp
Apify: apify.com/ernesta_labs/forage


The Knowledge Graph — Your Agent's Memory

Every tool call automatically feeds a private knowledge graph that grows smarter over time. No other MCP server does this.

┌─────────────────────────────────────────────────────────────┐
│                    FORAGE KNOWLEDGE GRAPH                    │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│   [Company A] ──── has_domain ────▶ [domain.com]           │
│       │                                │                    │
│       │ works_at                       │ has_email_pattern  │
│       ▼                                ▼                    │
│   [Person B] ◀─── has_title ──── [Email Pattern]           │
│       │                                │                    │
│       │ located_in                     │ verified_emails    │
│       ▼                                ▼                    │
│   [San Francisco]              [john@domain.com]           │
│       │                                │                    │
│       │ operates_in                    │ linkedin           │
│       ▼                                ▼                    │
│   [SaaS Industry]              [LinkedIn Profile]          │
│                                                             │
│   Claims: "Raised Series A in 2024" (confidence: 89%)      │
│   Signals: Hiring spike in Q4 2025 (+45%)                  │
│   Regime: growth                                           │
│                                                             │
└─────────────────────────────────────────────────────────────┘

What makes it different

Feature Forage Graph Generic RAG/Vector DB
Entity deduplication ✓ SHA-256 identity ✗ Creates duplicates
Relationship tracking ✓ Typed edges (works_at, located_in) ✗ Similarity only
Confidence scoring ✓ Increases with corroboration ✗ Static embeddings
Provenance (claims) ✓ Who said what, when ✗ No source tracking
Time-series signals ✓ Track metrics over time ✗ Snapshot only
Causal inference ✓ Find what drives what ✗ No causality
Regime detection ✓ Normal / stressed / pre-tipping ✗ No state tracking

Graph tools

Tool What it does Price
query_knowledge Search entities by name/type $0.05
enrich_entity Full profile + all relationships $0.08
find_connections Path between two entities $0.12
add_claim Store provenance assertion $0.05
get_claims Retrieve claims for entity $0.05
add_signal Record time-series data point $0.05
get_signals Query metrics over time $0.05
set_regime Label entity state $0.03
get_regime Check entity state $0.03
causal_parents What drives this entity $0.08
causal_children What this entity drives $0.08
causal_path Highest-weight causal path $0.15
simulate Propagate shock/boost through graph $0.25
get_graph_stats Entity/relationship counts Free

The graph is persistent — stored in FalkorDB on our infrastructure. Your agent's research accumulates across sessions. The more you use Forage, the smarter it gets.


Email Verification — How It Actually Works

We don't just guess email patterns. Each find_emails call runs a 4-step verification pipeline:

Step 1: Pattern Discovery

Scrape the target domain for email patterns (e.g., firstname.lastname@domain.com). Extract from:

  • Contact pages, footers, team pages
  • Press releases, blog author pages
  • WHOIS records, SSL certificates

Step 2: Candidate Generation

Generate candidate emails using discovered patterns + LinkedIn data. Cross-reference with:

  • Company employee listings (if public)
  • Job postings with contact info
  • Conference speaker lists

Step 3: SMTP Verification

For each candidate, we perform an SMTP handshake check:

  • Connect to the domain's mail server
  • Verify the recipient exists (RCPT TO)
  • Detect catch-all domains (score penalty)
  • Detect mailboxes that accept then bounce (honeypots)

Step 4: Confidence Scoring

Each email gets a confidence score (0-100) based on:

Signal Weight Example
SMTP accept 40% Mail server accepted RCPT TO
Pattern match 25% Matches known company format
LinkedIn match 20% Name matches LinkedIn profile
Source corroboration 15% Found on multiple public sources

Return format:

{
  "email": "sarah.chen@stripe.com",
  "name": "Sarah Chen",
  "title": "VP of Sales",
  "seniority": "vp",
  "department": "sales",
  "linkedin": "linkedin.com/in/sarahchen",
  "confidence": 94,
  "verified": true,
  "verification_steps": ["smtp_accepted", "linkedin_match", "pattern_match"]
}

What "verified" means

  • Confidence 90-100: SMTP accepted + LinkedIn match + multiple sources. High deliverability.
  • Confidence 70-89: SMTP accepted or strong pattern match. Good for outreach.
  • Confidence 50-69: Pattern-based with partial verification. Use with caution.
  • Below 50: Not returned (filtered out).

This is not a simple mailserver check. It's a multi-source corroboration pipeline that other MCP servers don't offer.


Web Intelligence Tools

Core Tools

Tool What it does Price Why this price
search_web Multi-source search, deduplicated, ranked $0.03 Aggregates Brave, Bing, DuckDuckGo + dedup + rank. Cheaper than calling each API separately ($0.03 total vs $0.06+ if you called 2 search APIs)
scrape_page Extract clean markdown from any URL $0.07 Includes proxy rotation, JavaScript rendering, anti-bot bypass
get_company_info Domain → full company profile $0.08 Aggregates 5+ data sources: website, LinkedIn, Crunchbase patterns, social profiles
find_emails Verified B2B emails with LinkedIn $0.10 4-step pipeline above
find_local_leads Local businesses by niche + location $0.15 Google Maps + enrichment + phone/website extraction
find_leads B2B leads by title/industry/location $0.25/100 leads That's $0.0025 per lead. Try finding 100 leads manually.

Skills (Multi-Step Workflows)

Skills chain multiple tools into one call, returning ready-to-use intelligence packages:

Skill Price Returns
skill_company_dossier $0.50 Full company profile + 10 contacts with emails
skill_prospect_company $0.75 15 decision makers sorted by seniority + emails
skill_outbound_list $3.50 100 verified leads ready for CRM import
skill_local_market_map $0.80 Up to 60 local businesses with contact info
skill_decision_maker_finder $1.00 20 decision makers by seniority tier
skill_competitor_intel $0.80 Pricing, features, reviews, positioning
skill_competitor_ads $0.65 Active ad copy, landing pages, platforms
skill_job_signals $0.55 Hiring trends, open roles, expansion signals
skill_tech_stack $0.45 Technologies used with confidence scores
skill_funding_intel $0.70 Funding rounds, investors, valuation estimates
skill_social_proof $0.55 Reviews, ratings, testimonials aggregated
skill_market_map $1.20 Complete competitor landscape for a market

Why Forage over other MCP search tools?

Capability Forage Brave Search MCP Apify MCP AgentQL
Web search
Page scraping
Email discovery ✓ (4-step verified)
B2B leads Partial
Company intelligence Partial
Local businesses
Persistent knowledge graph
Provenance & claims
Causal analysis
Time-series signals
Multi-step skills ✓ (12 skills)
Actor gateway (1000+)

The knowledge graph is the differentiator. Other tools give you data. Forage gives you accumulated intelligence. Every search, every email lookup, every company profile feeds your private graph. After a week of use, your agent knows more about your market than any single search ever could.


Quick Start

1. Get Your API Token

Go to Apify Console → Settings → Integrations and copy your Personal API Token.

2. Connect to Claude / Cursor / n8n

Claude Desktop (%APPDATA%\Claude\claude_desktop_config.json):

{
  "mcpServers": {
    "forage": {
      "command": "npx",
      "args": [
        "-y", "@anthropic/mcp-proxy",
        "https://ernesta-labs--forage.apify.actor/mcp/sse"
      ],
      "env": {
        "APIFY_API_TOKEN": "YOUR_APIFY_TOKEN"
      }
    }
  }
}

Cursor / Windsurf:

{
  "mcpServers": {
    "forage": {
      "command": "npx",
      "args": [
        "-y", "@anthropic/mcp-proxy",
        "https://ernesta-labs--forage.apify.actor/mcp/sse"
      ],
      "env": {
        "APIFY_API_TOKEN": "YOUR_APIFY_TOKEN"
      }
    }
  }
}

n8n / LangGraph / Custom: Connect to the SSE endpoint at https://ernesta-labs--forage.apify.actor/mcp/sse with your Apify token in the Authorization header.

3. System Prompt (Optional)

Add to your agent's system prompt:

When you need live web data, company info, verified emails, or lead lists, use Forage tools. Each call costs money (shown in responses), so batch operations when possible. Your knowledge graph persists across sessions — check it first before making new web calls.


Examples

Find 20 HVAC leads in Dallas

Call: find_leads
Params: { "industry": "HVAC", "location": "Dallas, TX", "limit": 20 }
Returns: 20 companies with name, phone, website, email, address
Cost: $0.05

Get decision makers at a prospect

Call: skill_prospect_company
Params: { "domain": "stripe.com" }
Returns: 15 decision makers with title, email, LinkedIn, seniority
Cost: $0.75

Build a local market map

Call: skill_local_market_map
Params: { "business_type": "dentist", "location": "London, UK" }
Returns: 60 dentists with address, phone, website, rating, reviews
Cost: $0.80

Track a company's hiring over time

Call: add_signal
Params: { "entity": "Acme Corp", "metric": "job_postings", "value": 45 }
... repeat weekly ...
Call: get_signals
Params: { "entity": "Acme Corp", "metric": "job_postings" }
Returns: Time-series of job postings — hiring trend visible

Find who influenced a deal

Call: find_connections
Params: { "from": "Your Company", "to": "Acme Corp" }
Returns: Path through shared connections, events, technologies

Architecture

┌─────────────────────────────────────────────────────────────┐
│                    YOUR AI AGENT (Claude, Cursor, n8n)      │
└─────────────────────────┬───────────────────────────────────┘
                          │ MCP Protocol
                          ▼
┌─────────────────────────────────────────────────────────────┐
│                    FORAGE MCP SERVER (Apify)                 │
│  ┌────────────┐  ┌────────────┐  ┌──────────────────────┐  │
│  │ Web Search │  │   Scraper  │  │  Email Discovery     │  │
│  │ (3 engines)│  │ (rendered) │  │  (4-step pipeline)   │  │
│  └─────┬──────┘  └─────┬──────┘  └──────────┬───────────┘  │
│        └───────────────┼────────────────────┘              │
│                        ▼                                    │
│              ┌─────────────────┐                            │
│              │  Graph Client   │                            │
│              └────────┬────────┘                            │
└───────────────────────┼────────────────────────────────────┘
                        │ HTTPS
                        ▼
┌─────────────────────────────────────────────────────────────┐
│                FORAGE GRAPH API (Railway)                    │
│  ┌──────────────────────────────────────────────────────┐  │
│  │              FalkorDB (Redis-compatible)              │  │
│  │  Entities ──── RELATES ────▶ Entities                │  │
│  │  Claims (provenance)                                  │  │
│  │  Signals (time-series)                                │  │
│  │  Regimes (state tracking)                             │  │
│  └──────────────────────────────────────────────────────┘  │
└─────────────────────────────────────────────────────────────┘

Pricing

Pay per tool call. No subscription. No minimum. Every response includes the cost.

Your Apify spend Your Forage cost Ratio
$1 ~$0.75 25% markup
$10 ~$7.50 25% markup
$100 ~$75 25% markup

The 25% markup covers: proxy infrastructure, knowledge graph storage, email verification pipeline, multi-engine search aggregation, and ongoing maintenance.

Free trial: New Apify accounts get $5 platform credit. Try Forage risk-free.


Limitations

  • Some sites block scraping — we use proxies + JS rendering, but some sites (LinkedIn, closed social networks) are protected
  • Email accuracy ≠ 100% — confidence scores reflect real verification, but email addresses can change
  • Knowledge graph is persistent but not portable — data lives on our FalkorDB instance (not exported yet)
  • Rate limits — Apify enforces per-account limits; Forage doesn't add extra limits on top

Support & Links


License

MIT License — Copyright (c) 2026 Riccardo Minniti / Ernesta Labs

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured