re-leak-scan
MCP server for detecting publisher telemetry pipeline leaks in binary artifacts by scanning for sensitive strings like Sentry DSNs, Slack tokens, and AWS keys.
README
re-leak-scan
MCP server for detecting publisher telemetry pipeline leaks in binary artifacts. Scans the file's string table for:
- Sentry DSNs (with embedded public auth) — enables forged crash-report submission
- Logstash / log-ingestion URLs — internal observability infrastructure
- Confluence wiki page links — often engineering-only docs / secrets
- Google Drive document URLs — publisher-internal documents
- AWS access key IDs — long-lived credentials
- Slack tokens — long-lived API credentials
- Generic high-entropy hex strings — possible keys / secrets
The output is vendor-neutral: pattern categories describe observable string content, not specific publishers.
Why
The 2026-06-05 stress test surfaced a new attack-surface class that the existing tools did not cover:
- Sample A (
GameAssembly.dll): 16,236 Google Drive URL matches — the bulk are publisher-internal design documents. - Sample B (
CrimsonDesert.exe): a Sentry DSN with embedded auth, a Logstash ingestion URL, an internal dev server URL, and a Confluence wiki page link — all in plaintext, all unprotected by the encrypted-VM bytecode anti-tamper.
re-leak-scan fills that gap. It is pure-Python (no .NET, no system tools), works on any binary file, and is the .re-leak-scan / .re-telemetry-extract foundation for the re-leak-scan and re-telemetry-extract skills.
Tools
| Tool | What it does |
|---|---|
check_leak_scan |
Health check — return pattern catalog + httpx availability |
extract_strings |
Walk the file, extract ASCII + UTF-16LE printable strings |
find_secrets |
Apply the regex catalog over a binary's string table |
scan |
Full pipeline: extract → apply all detectors → return findings |
verify_sentry_dsn |
Parse a Sentry DSN + probe <host>/api/0/projects/.../ to confirm liveness |
verify_confluence_url |
Probe a Confluence URL to confirm reachability + anon-access |
Install
Part of the RE-AI plugin; ./install.sh installs the package. To install standalone:
pip install -e ./servers/re-leak-scan
# Optional: live verification (Sentry / Confluence HTTP probes)
pip install -e './servers/re-leak-scan[verify]'
Run
re-leak-scan # stdio transport (default for MCP)
python -m re_leak_scan # equivalent
Pattern catalog
The 7 patterns are defined in src/re_leak_scan/patterns.py. Adding a new one is a 6-line dataclass entry. The patterns are all vendor-neutral — they match the URL schemes of public infrastructure (Sentry.io, Logstash, Atlassian Confluence, Google Docs) without naming any specific publisher.
Active verification
verify_sentry_dsn and verify_confluence_url make outbound HTTP requests. By default, they are passive — they only check that the endpoint responds. They do not submit forged crash reports, do not authenticate, and do not exfiltrate the leaked data.
If you run these in an air-gapped environment, the verifier returns verified: false, reason: "connection failed: ..." — the leak detection itself is unaffected.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.