re-lief
MCP server exposing LIEF for cross-format binary analysis, enabling parsing, section listing, imports/exports, disassembly, and string extraction for PE, ELF, MachO, DEX, ART, and OAT files.
README
re-lief
MCP server exposing LIEF (Library to Instrument Executable Formats) for cross-format binary analysis. Handles PE, ELF, MachO, COFF, DEX, ART, OAT in a single, normalized API.
Why
LIEF is the Python successor to pefile — same data for PE, plus ELF, MachO, DEX, ART, and OAT. It also handles DWARF/PDB debug info, ObjC metadata, the Dyld Shared Cache, and (optionally) has a built-in disassembler/assembler.
This server is the foundation of the RE-AI plugin: it works without any system tools installed (no rizin, no gdb, no anything), is pure Python, and runs in-process.
Tools
| Tool | What it does |
|---|---|
check_lief |
Health check — return LIEF version, supported formats |
parse_binary |
Auto-detect format and return normalized header + high-level structure |
get_sections |
Section list with permissions (R/W/X), virtual vs raw size, entropy |
get_imports_exports |
Symbol-level import/export tables (per format) |
get_authenticode |
PE signature details (Win) |
get_overlay |
Appended data after the last section |
list_dex_classes |
Android DEX class list |
list_dex_methods |
Methods of a DEX class |
list_oat_art |
Android OAT/ART method list |
disasm_capstone |
Capstone disassembly (works for any LIEF-parsed binary) |
extract_strings |
ASCII + UTF-16LE string extraction with section awareness |
categorize_strings |
ASCII + UTF-16LE string extraction, section-aware, bucketed into keyword categories from data/drm-indicators.yaml::string_categories. Superset of extract_strings. |
get_imphash |
PE import hash (MD5 of normalized import table) |
normalize_for_diff |
Produce a structural snapshot suitable for diffing two binaries |
Install
This server is part of the RE-AI plugin. The plugin's install.sh / install.bat installs it as part of the standard flow.
To install standalone:
pip install -e ./servers/re-lief
Run
re-lief # stdio transport (default for MCP)
python -m re_lief # equivalent
Format support
LIEF auto-detects the format and exposes a polyglot API. Most tools return results shaped by format:
- PE (
.exe,.dll,.sys): full sections, imports/exports, imphash, Authenticode, resources, exceptions, TLS, debug info (PDB path) - ELF (Linux binaries, .so, kernel modules): sections, segments (program headers), dynamic symbols, RELRO, BIND_NOW, NX, PIE, RPATH/RUNPATH, SONAME, dynamic libs
- MachO (macOS/iOS binaries, .dylib, frameworks): load commands, segments, LC_BUILD_VERSION, code signature, dyld info, ObjC metadata
- DEX (Android Dalvik): class list with FQN, method list per class, string pool
- OAT/ART (Android runtime): method list with class/method indices, vdex references
- COFF (Windows object files, EFI): sections, symbols, relocations
Deprecation of pefile
If you're familiar with the v1 re-ai repo, this server supersedes the old pefile-based code. The string-extraction algorithm (ASCII + UTF-16LE) and imphash logic were ported from backend/analysis/native.py; the rest of the API is LIEF-native and works for all formats.
Categorization vocabulary
categorize_strings reads its 11 keyword categories from
data/drm-indicators.yaml::string_categories at MCP-server load
time. The anti_debug and hwid categories inherit their
keyword lists from
drm-indicators.yaml::anti_debug_indicators.checks[].name and
hwid_apis.high_signal[].api via a seed_from: YAML pointer —
when a future agent adds a new HWID API to hwid_apis.high_signal,
the categorizer picks it up automatically on next reload. The
other 9 categories have their keyword lists inline in the YAML
under string_categories.categories[].keywords.
This makes the categorizer idempotent with the catalog: the
YAML is the single source of truth for both the indicator set
that re-drm-fingerprint reads and the keyword set that the
categorizer reads. Both the static analysis and the string
analysis will give consistent answers.
On large binaries (>100 MB, e.g. a Unity IL2CPP GameAssembly.dll
wrapped by an encrypted-VM bytecode interpreter), pass
skip_sections=[".idata", ".xtls", ".xpdata", ".udata", ".xdata", ".didata", ".ecode", ".00cfg"] to skip the encrypted-VM
bytecode regions. Note: on the bundled IL2CPP target sample,
the import-table strings live inside those sections,
so skipping them blinds the categorizer to the imports. Use
skip_sections for memory-bound runs; use the full section walk
for completeness.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.