mhlabs-mcp-tools
Modular MCP server providing text preprocessing and NLP tools for AI agent ecosystems.
README
mhlabs-mcp-tools
mcp-name: io.github.MusaddiqueHussainLabs/mhlabs_mcp_tools
š§ mhlabs-mcp-tools
mhlabs-mcp-tools is a Modular MCP Tools Server built using FastMCP.
It provides an extendable AI tool ecosystem organized into functional categories (Text Preprocessing, NLP Components, Document Analysis, etc.) that can be dynamically loaded and served through MCP (Model Context Protocol) via STDIO transport.
This project is part of the MHLabs AI Agentic Ecosystem, designed to work with mhlabs-mcp-server, mhlabs-mcp-agents, and downstream A2A agent frameworks.
Features
- FastMCP Server: Pure FastMCP implementation supporting multiple transport protocols
- Factory Pattern: Reusable MCP tools factory for easy service management
- Domain-Based Organization: Services organized by business domains (HR, Tech Support, etc.)
- Authentication: Optional Azure AD authentication support
- Multiple Transports: STDIO, HTTP (Streamable), and SSE transport support
- VS Code Integration: Debug configurations and development settings
- Comprehensive Testing: Unit tests with pytest
- Flexible Configuration: Environment-based configuration management
Architecture
mhlabs_mcp_tools/
āāā .gitignore
āāā .vscode/
ā āāā settings.json
āāā CHANGELOG.md
āāā LICENSE
āāā README.md
āāā docs/
ā āāā index.md
āāā examples/
ā āāā example_client.py
ā āāā example_client_http.py
āāā mkdocs.yml
āāā pyproject.toml
āāā requirements.txt
āāā server.json
āāā src/
āāā __init__.py
āāā main.py
āāā mhlabs_mcp_tools/
āāā __init__.py
āāā core/
ā āāā __init__.py
ā āāā config.py
ā āāā constants.py
ā āāā factory.py
ā āāā prompts.py
āāā data/
ā āāā __init__.py
ā āāā external/
ā ā āāā __init__.py
ā āāā interim/
ā ā āāā __init__.py
ā āāā processed/
ā ā āāā __init__.py
ā āāā raw/
ā āāā __init__.py
ā āāā contractions_dict.json
ā āāā custom_substitutions.csv
ā āāā leftovers_dict.json
ā āāā slang_dict.json
āāā handlers/
ā āāā __init__.py
ā āāā custom_exceptions.py
ā āāā output_generator.py
āāā mcp_server.py
āāā models/
ā āāā __init__.py
āāā nlp_components/
ā āāā __init__.py
ā āāā nlp_model.py
āāā services/
ā āāā __init__.py
ā āāā langchain_framework.py
ā āāā spacy_extractor.py
āāā text_preprocessing/
āāā __init__.py
āāā contractions.py
āāā emo_unicode.py
āāā slang_text.py
āāā text_preprocessing.py
Available Services
Currently the package is organized into three primary modules:
1. NLP Components
| Component Type | Description |
|---|---|
| tokenize | Text tokenization |
| pos | Part-of-Speech tagging |
| lemma | Word lemmatization |
| morphology | Study of word forms |
| dep | Dependency parsing |
| ner | Named Entity Recognition |
| norm | Text normalization |
2. Text Preprocessing
This module equips users with an extensive set of text preprocessing tools:
| Function | Description |
|---|---|
| to_lower | Convert text to lowercase |
| to_upper | Convert text to uppercase |
| remove_number | Remove numerical characters |
| remove_itemized_bullet_and_numbering | Eliminate itemized/bullet-point numbering |
| remove_url | Remove URLs from text |
| remove_punctuation | Remove punctuation marks |
| remove_special_character | Remove special characters |
| keep_alpha_numeric | Keep only alphanumeric characters |
| remove_whitespace | Remove excess whitespace |
| normalize_unicode | Normalize Unicode characters |
| remove_stopword | Eliminate common stopwords |
| remove_freqwords | Remove frequently occurring words |
| remove_rarewords | Remove rare words |
| remove_email | Remove email addresses |
| remove_phone_number | Remove phone numbers |
| remove_ssn | Remove Social Security Numbers (SSN) |
| remove_credit_card_number | Remove credit card numbers |
| remove_emoji | Remove emojis |
| remove_emoticons | Remove emoticons |
| convert_emoticons_to_words | Convert emoticons to words |
| convert_emojis_to_words | Convert emojis to words |
| remove_html | Remove HTML tags |
| chat_words_conversion | Convert chat language to standard English |
| expand_contraction | Expand contractions (e.g., "can't" to "cannot") |
| tokenize_word | Tokenize words |
| tokenize_sentence | Tokenize sentences |
| stem_word | Stem words |
| lemmatize_word | Lemmatize words |
| preprocess_text | Combine multiple preprocessing steps into one function |
Quick Start
Development Setup
-
Clone and Navigate:
cd src/mhlabs_mcp_tools -
Install Dependencies:
pip install -r requirements.txt -
Configure Environment:
cp .env.example .env # Edit .env with your configuration -
Start the Server:
# Default STDIO transport (for local MCP clients) python mcp_server.py # HTTP transport (for web-based clients) python mcp_server.py --transport http --port 9000 or after installed mhlabs-mcp-tools python -m mhlabs_mcp_tools.mcp_server --transport http --port 9000 # Using FastMCP CLI (recommended) fastmcp run mcp_server.py -t streamable-http --port 9000 -l DEBUG # Debug mode with authentication disabled python mcp_server.py --transport http --debug --no-auth
Transport Options
1. STDIO Transport (default)
- š§ Perfect for: Local tools, command-line integrations, Claude Desktop
- š Usage:
python mcp_server.pyorpython mcp_server.py --transport stdio
2. HTTP (Streamable) Transport
- š Perfect for: Web-based deployments, microservices, remote access
- š Usage:
python mcp_server.py --transport http --port 9000 - š URL:
http://127.0.0.1:9000/mcp/
3. SSE Transport (deprecated)
- ā ļø Legacy support only - use HTTP transport for new projects
- š Usage:
python mcp_server.py --transport sse --port 9000
FastMCP CLI Usage
# Standard HTTP server
fastmcp run mcp_server.py -t streamable-http --port 9000 -l DEBUG
# With custom host
fastmcp run mcp_server.py -t streamable-http --host 0.0.0.0 --port 9000 -l DEBUG
# STDIO transport (for local clients)
fastmcp run mcp_server.py -t stdio
# Development mode with MCP Inspector
fastmcp dev mcp_server.py -t streamable-http --port 9000
VS Code Development
-
Open in VS Code:
code . -
Use Debug Configurations:
Debug MCP Server (STDIO): Run with STDIO transportDebug MCP Server (HTTP): Run with HTTP transportDebug Tests: Run the test suite
Configuration
Environment Variables
Create a .env file based on .env.example:
# Server Settings
MCP_HOST=0.0.0.0
MCP_PORT=9000
MCP_DEBUG=false
MCP_SERVER_NAME=MHLABS MCP Server
# Authentication Settings
MCP_ENABLE_AUTH=true
AZURE_TENANT_ID=your-tenant-id-here
AZURE_CLIENT_ID=your-client-id-here
AZURE_JWKS_URI=https://login.microsoftonline.com/your-tenant-id/discovery/v2.0/keys
AZURE_ISSUER=https://sts.windows.net/your-tenant-id/
AZURE_AUDIENCE=api://your-client-id
Authentication
When MCP_ENABLE_AUTH=true, the server expects Azure AD Bearer tokens. Configure your Azure App Registration with the appropriate settings.
For development, set MCP_ENABLE_AUTH=false to disable authentication.
Adding New Services
-
Create Service Class:
from core.factory import MCPToolBase, Domain class MyService(MCPToolBase): def __init__(self): super().__init__(Domain.MY_DOMAIN) def register_tools(self, mcp): @mcp.tool(tags={self.domain.value}) async def my_tool(param: str) -> str: # Tool implementation pass @property def tool_count(self) -> int: return 1 # Number of tools -
Register in Server:
# In mcp_server.py (gets registered automatically from services/ directory) factory.register_service(MyService()) -
Add Domain (if new):
# In core/factory.py class Domain(Enum): # ... existing domains MY_DOMAIN = "my_domain"
MCP Client Usage
Python Client
import asyncio
from fastmcp import Client
client = Client("http://localhost:9000/mcp")
async def main():
async with client:
tools = await client.list_tools()
# tools -> list[mcp.types.Tool]
# print(tools)
for tool in tools:
print(f"Tool: {tool.name}")
result = await client.call_tool("textprep.expand_contraction", {"input_text": "The must've SSN is 859-98-0987. The employee's phone number is 555-555-5555."})
print("Result:", result)
asyncio.run(main())
Command Line Testing
# Test the server is running
curl http://localhost:9000/mcp/
# With FastMCP CLI for testing
fastmcp dev mcp_server.py -t streamable-http --port 9000
Quick Test
Test STDIO Transport:
# Start server in STDIO mode
python mcp_server.py --debug --no-auth
# Test with client_example.py
python client_example.py
Test HTTP Transport:
# Start HTTP server
python mcp_server.py --transport http --port 9000 --debug --no-auth
# Test with FastMCP client
python -c "
from fastmcp import Client
import asyncio
async def test():
async with Client('http://localhost:9000/mcp') as client:
result = await client.call_tool("textprep.expand_contraction", {"input_text": "The must've SSN is 859-98-0987. The employee's phone number is 555-555-5555."})
print(result)
asyncio.run(test())
"
Test with FastMCP CLI:
# Start with FastMCP CLI
fastmcp run mcp_server.py -t streamable-http --port 9000 -l DEBUG
# Server will be available at: http://127.0.0.1:9000/mcp/
Troubleshooting
Common Issues
- Import Errors: Make sure you're in the correct directory and dependencies are installed
- Authentication Errors: Check your Azure AD configuration and tokens
- Port Conflicts: Change the port in configuration if 9000 is already in use
- Missing fastmcp: Install with
pip install fastmcp
Debug Mode
Enable debug mode for detailed logging:
python mcp_server.py --debug --no-auth
Or set in environment:
MCP_DEBUG=true
Server Arguments
usage: mcp_server.py [-h] [--transport {stdio,http,streamable-http,sse}]
[--host HOST] [--port PORT] [--debug] [--no-auth]
MHLABS MCP Server
options:
-h, --help show this help message and exit
--transport, -t Transport protocol (default: stdio)
--host HOST Host to bind to for HTTP transport (default: 127.0.0.1)
--port, -p PORT Port to bind to for HTTP transport (default: 9000)
--debug Enable debug mode
--no-auth Disable authentication
š License
MIT License Ā© 2025 MusaddiqueHussain Labs
š¤ Contributing
- Follow the existing code structure and patterns
- Add tests for new functionality
- Update documentation for new features
- Use the provided VS Code configurations for development
š§ Learn More
- MCP Protocol: https://modelcontextprotocol.io
- FastMCP GitHub: https://github.com/fastmcp/fastmcp
- LangGraph Integration Guide (coming soon)
š” Tip
If you want to embed mhlabs-mcp-tools into a larger MCP-based orchestrator:
from fastmcp import StdioServerParameters
server_params = StdioServerParameters(
command="python",
args=["-m", "mhlabs_mcp_tools.server"],
//env={"MHLABS_MCP_CATEGORY": "textprep,nlp"}
)
Developed with ā¤ļø by MusaddiqueHussain Labs
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.