DataFlow MCP Server
A production-grade MCP server for secure MongoDB CRUD operations with filtering, pagination, health monitoring, and rate limiting.
README
DataFlow MCP Server - Production Grade
A secure, production-ready Model Context Protocol (MCP) server with MongoDB integration, featuring comprehensive security controls, CRUD operations, logging, and monitoring.
๐ Features
Security
- โ Input Validation & Sanitization - Prevents NoSQL injection attacks
- โ MongoDB SSL/TLS Support - Secure cloud deployments
- โ Rate Limiting - Protects against abuse (100 req/min default)
- โ Connection Pooling - Optimized for performance
- โ Document Size Limits - Prevents resource exhaustion
- โ Field Name Validation - Blacklists dangerous operators
Operations
- โ CRUD Operations - Create, Read, Update, Delete documents
- โ Filtering & Pagination - Flexible data retrieval with limits
- โ Sorting Support - Sort by any field (ascending/descending)
- โ Bulk Operations Ready - Extensible architecture
Monitoring & Observability
- โ Comprehensive Logging - File & console with rotation
- โ Health Checks - Service health status endpoint
- โ Metrics Tracking - Request counts, success rates
- โ Error Handling - Detailed error reporting
Production Ready
- โ Security First - SSL/TLS support, input validation
- โ Environment Config - 12-factor app ready
- โ Graceful Shutdown - Proper resource cleanup
๐ Quick Start
Prerequisites
- Python 3.12+
- Docker & Docker Compose (optional)
- MongoDB (or use Docker Compose)
Local Development
- Clone and setup:
cd dataflow_mcp
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
pip install -e .
- Configure environment:
cp .env.example .env
# Edit .env with your MongoDB connection
- Run the server:
python main.py
๐ก API Tools
Health Check
Get server status and metrics.
{
"status": "healthy",
"uptime_seconds": 123.45,
"metrics": {
"total_requests": 42,
"successful_requests": 40,
"failed_requests": 2,
"success_rate": 95.24
}
}
Read Collection
Retrieve documents with filtering, pagination, and sorting.
Parameters:
collection_name(required): Collection namefilter_query: JSON string with MongoDB filterlimit: Max documents (default: 100, max: 1000)skip: Skip N documents (default: 0)sort_by: Field to sort by
Example:
{
"collection_name": "users",
"filter_query": "{\"status\": \"active\"}",
"limit": 10,
"skip": 0,
"sort_by": "created_at"
}
Get Document
Retrieve a single document by ID.
Parameters:
collection_name: Collection namedocument_id: MongoDB ObjectId as string
Create Document
Create a new document in a collection.
Parameters:
collection_name: Collection namedocument_json: JSON string representing the document
Example:
{
"collection_name": "users",
"document_json": "{\"name\": \"John\", \"email\": \"john@example.com\", \"status\": \"active\"}"
}
Update Document
Update an existing document.
Parameters:
collection_name: Collection namedocument_id: MongoDB ObjectId as stringupdate_json: JSON with fields to update
Example:
{
"collection_name": "users",
"document_id": "65f8a1b2c3d4e5f6g7h8i9j0",
"update_json": "{\"status\": \"inactive\", \"updated_at\": \"2024-01-01T12:00:00Z\"}"
}
Delete Document
Delete a document from a collection.
Parameters:
collection_name: Collection namedocument_id: MongoDB ObjectId as string
๐ Security Features
Input Validation
- Collection names: Alphanumeric, dash, underscore only
- Field names: Prevents dangerous operators ($where, $function, etc.)
- Filters: Maximum 10KB, blacklist dangerous operations
- Documents: Maximum 1MB, enforced size limits
MongoDB Security
-
Connection Options:
- Connection pooling (default: 10 connections)
- Retry writes enabled
- Write concern: majority
- Journaling enabled
- SSL/TLS for cloud deployments
-
Environment Variables:
MONGO_USE_TLS=true MONGO_CA_CERT_PATH=/path/to/ca.pem MONGO_ALLOW_INVALID_CERTS=false
Rate Limiting
- 100 requests per 60 seconds (configurable)
- Per-client tracking
- Returns clear error on limit exceeded
Error Handling
- Safe error messages (no sensitive data leaks)
- Detailed internal logging
- Graceful degradation
๐ Environment Variables
Required
MONGO_URI=mongodb://user:password@host:port/database
MONGO_DB_NAME=dataflow
Optional (with defaults)
MONGO_TIMEOUT=5000 # Connection timeout (ms)
MONGO_POOL_SIZE=10 # Connection pool size
MONGO_MAX_IDLE_TIME=45000 # Max idle time (ms)
MONGO_USE_TLS=false # Enable TLS
MONGO_CA_CERT_PATH= # CA certificate path
LOGS_DIR=./logs # Log directory
LOG_LEVEL=INFO # Logging level
๐ Project Structure
dataflow_mcp/
โโโ config/
โ โโโ mongodb.py # MongoDB connection with pooling
โ โโโ security.py # Validation and rate limiting
โ โโโ logging_config.py # Logging setup
โโโ tools/
โ โโโ data_manager.py # CRUD operations and update logic (DataManager)
โโโ scripts/
โโโ main.py # MCP server and tools
โโโ pyproject.toml # Dependencies and config
โโโ .env.example # Environment template
๐ง Configuration for Cloud Deployment
AWS Deployment
MONGO_URI=mongodb+srv://user:password@cluster.mongodb.net/dataflow
MONGO_USE_TLS=true
MONGO_ALLOW_INVALID_CERTS=false
Azure Deployment
MONGO_URI=mongodb://user:password@host.mongo.cosmos.azure.com:10255/database
MONGO_USE_TLS=true
MONGO_CA_CERT_PATH=/etc/ssl/certs/ca-certificates.crt
GCP Deployment
MONGO_URI=mongodb://user:password@instance:27017/database
MONGO_USE_TLS=true
๐จ Production Checklist
- [ ] MongoDB backups configured
- [ ] SSL/TLS certificates installed
- [ ] Environment variables set securely (not in code)
- [ ] Logs redirected to centralized logging
- [ ] Health checks configured in load balancer
- [ ] Rate limits adjusted for your use case
- [ ] MongoDB indexes optimized
- [ ] Connection pool size tuned
- [ ] Monitoring/alerting setup
- [ ] Graceful shutdown tested
๐ Performance Optimization
MongoDB Indexes
Pre-created indexes in scripts/mongo-init.js:
- User email: unique constraint
- Timestamps: for sorting and TTL
- Status: for filtering
Connection Pooling
- Default pool size: 10 (adjust via
MONGO_POOL_SIZE) - Min connections: 2 (automatically maintained)
- Max idle time: 45 seconds
Request Limits
- Max filter size: 10KB
- Max document size: 1MB
- Max page size: 1000 documents
- Rate limit: 100 req/min
๐งช Testing & Development
Install dev dependencies:
pip install -e ".[dev]"
Run tests:
pytest --cov=tools --cov=config
Code formatting:
black .
flake8 .
mypy .
๐ Logging
Logs are written to:
- File:
./logs/mcp_server_YYYYMMDD.log(rotated daily, max 10MB) - Console: Real-time output
Log levels:
DEBUG- Detailed diagnostic infoINFO- General eventsWARNING- Warning messagesERROR- Error events
๐ Troubleshooting
MongoDB Connection Failed
Check MONGO_URI and credentials
Verify MongoDB is running: mongosh "mongodb://..."
Check network connectivity and firewall
Rate Limit Exceeded
Default: 100 requests per 60 seconds
Increase MONGO_POOL_SIZE and optimize queries
Implement request queuing on client
High Memory Usage
Reduce MONGO_POOL_SIZE
Lower MONGO_MAX_IDLE_TIME
Check for large result sets (use pagination)
๐ References
๐ License
MIT License - See LICENSE file for details
๐ค Support
For issues and questions:
- Check troubleshooting section
- Review logs in
./logs/ - Check MongoDB connection
- Verify environment variables
Built for production-grade data operations with security-first design.
dataflow_mcp
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.