Regex Toolkit MCP Server
Enables LLM agents to extract, validate, and mask personally identifiable information using deterministic regular expressions, reducing token usage and hallucination risks.
README
Regex Toolkit MCP Server
A specialized, high-performance Model Context Protocol (MCP) server engineered to handle complex Regular Expression operations deterministically. This server equips your LLM agents with the ability to securely extract, validate, and mask Personally Identifiable Information (PII) without relying on token-heavy, hallucination-prone AI pattern matching.
The LLM Pattern Matching Dilemma
Through extensive testing with autonomous data-processing agents, we identified a critical limitation in how Large Language Models handle unstructured text: LLMs are highly inefficient at strict pattern matching.
When tasked with extracting or redacting emails, URLs, or phone numbers from large text blobs (such as chat logs or scraped web pages), an LLM must read every single token. This process:
- Consumes massive context windows, driving up inference costs exponentially.
- Risks Hallucination: The LLM may "invent" emails that look similar, or miss edge-case formatted phone numbers.
- Introduces Privacy Risks: Asking an LLM to process and return raw PII directly exposes sensitive data to the model provider's inference pipeline.
The Regex Toolkit Solution
The Regex Toolkit MCP solves this by shifting pattern matching away from the AI and into a deterministic, sandboxed execution environment. By leveraging native regex engines, this MCP server can scan megabytes of text in milliseconds, perfectly extracting or masking data. The LLM only receives the exact structured data it needs, saving thousands of tokens and ensuring absolute accuracy.
Technical Capabilities
This server exposes three distinct, highly optimized tools for your AI workflows:
-
extract_pattern- Function: Scans a large body of raw text and extracts all unique instances of a specified pattern (
email,url, orphone). - Use Case: Harvesting links from a scraped webpage or compiling a contact list from unstructured meeting transcripts.
- Function: Scans a large body of raw text and extracts all unique instances of a specified pattern (
-
validate_pattern- Function: Strictly validates if a single string perfectly matches a standard email, URL, or international phone format.
- Use Case: Data sanitization pipelines where an agent must verify user input before writing to a database.
-
mask_sensitive_data- Function: Redacts sensitive PII from a text blob by deterministically replacing matches with
[REDACTED]tags. - Use Case: Privacy compliance. An agent can use this tool to sanitize logs or customer messages before passing the text to an external analytics API.
- Function: Redacts sensitive PII from a text blob by deterministically replacing matches with
Run on Vinkius Edge (Free Edge Hosting)
Vinkius provides free, highly available edge hosting using secure V8 isolates. Deploying to the Vinkius Edge is the fastest way to make this MCP server accessible to any AI agent anywhere, with sub-millisecond response times and zero maintenance.
- Clone this repository
- Run the deployment command:
npx mcpfusion deploy
That's it. Your MCP server is now live, secure, and ready to be connected to your agents.
👉 Access the Regex Toolkit MCP on Vinkius
Local Development
Constructed using MCP Fusion for reliable, strictly typed execution.
npm install
npm run dev
Security & Architecture
This server is strictly stateless. It does not store, log, or transmit the text you send it for evaluation. The mask_sensitive_data tool is explicitly designed to help organizations meet GDPR and CCPA compliance requirements by ensuring PII is scrubbed before it hits downstream AI models or storage layers.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.