DataFlow MCP Server

DataFlow MCP Server

A production-grade MCP server for secure MongoDB CRUD operations with filtering, pagination, health monitoring, and rate limiting.

Category
Visit Server

README

DataFlow MCP Server - Production Grade

A secure, production-ready Model Context Protocol (MCP) server with MongoDB integration, featuring comprehensive security controls, CRUD operations, logging, and monitoring.

๐Ÿ“‹ Features

Security

  • โœ… Input Validation & Sanitization - Prevents NoSQL injection attacks
  • โœ… MongoDB SSL/TLS Support - Secure cloud deployments
  • โœ… Rate Limiting - Protects against abuse (100 req/min default)
  • โœ… Connection Pooling - Optimized for performance
  • โœ… Document Size Limits - Prevents resource exhaustion
  • โœ… Field Name Validation - Blacklists dangerous operators

Operations

  • โœ… CRUD Operations - Create, Read, Update, Delete documents
  • โœ… Filtering & Pagination - Flexible data retrieval with limits
  • โœ… Sorting Support - Sort by any field (ascending/descending)
  • โœ… Bulk Operations Ready - Extensible architecture

Monitoring & Observability

  • โœ… Comprehensive Logging - File & console with rotation
  • โœ… Health Checks - Service health status endpoint
  • โœ… Metrics Tracking - Request counts, success rates
  • โœ… Error Handling - Detailed error reporting

Production Ready

  • โœ… Security First - SSL/TLS support, input validation
  • โœ… Environment Config - 12-factor app ready
  • โœ… Graceful Shutdown - Proper resource cleanup

๐Ÿš€ Quick Start

Prerequisites

  • Python 3.12+
  • Docker & Docker Compose (optional)
  • MongoDB (or use Docker Compose)

Local Development

  1. Clone and setup:
cd dataflow_mcp
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -e .
  1. Configure environment:
cp .env.example .env
# Edit .env with your MongoDB connection
  1. Run the server:
python main.py

๐Ÿ“ก API Tools

Health Check

Get server status and metrics.

{
  "status": "healthy",
  "uptime_seconds": 123.45,
  "metrics": {
    "total_requests": 42,
    "successful_requests": 40,
    "failed_requests": 2,
    "success_rate": 95.24
  }
}

Read Collection

Retrieve documents with filtering, pagination, and sorting.

Parameters:

  • collection_name (required): Collection name
  • filter_query: JSON string with MongoDB filter
  • limit: Max documents (default: 100, max: 1000)
  • skip: Skip N documents (default: 0)
  • sort_by: Field to sort by

Example:

{
  "collection_name": "users",
  "filter_query": "{\"status\": \"active\"}",
  "limit": 10,
  "skip": 0,
  "sort_by": "created_at"
}

Get Document

Retrieve a single document by ID.

Parameters:

  • collection_name: Collection name
  • document_id: MongoDB ObjectId as string

Create Document

Create a new document in a collection.

Parameters:

  • collection_name: Collection name
  • document_json: JSON string representing the document

Example:

{
  "collection_name": "users",
  "document_json": "{\"name\": \"John\", \"email\": \"john@example.com\", \"status\": \"active\"}"
}

Update Document

Update an existing document.

Parameters:

  • collection_name: Collection name
  • document_id: MongoDB ObjectId as string
  • update_json: JSON with fields to update

Example:

{
  "collection_name": "users",
  "document_id": "65f8a1b2c3d4e5f6g7h8i9j0",
  "update_json": "{\"status\": \"inactive\", \"updated_at\": \"2024-01-01T12:00:00Z\"}"
}

Delete Document

Delete a document from a collection.

Parameters:

  • collection_name: Collection name
  • document_id: MongoDB ObjectId as string

๐Ÿ”’ Security Features

Input Validation

  • Collection names: Alphanumeric, dash, underscore only
  • Field names: Prevents dangerous operators ($where, $function, etc.)
  • Filters: Maximum 10KB, blacklist dangerous operations
  • Documents: Maximum 1MB, enforced size limits

MongoDB Security

  • Connection Options:

    • Connection pooling (default: 10 connections)
    • Retry writes enabled
    • Write concern: majority
    • Journaling enabled
    • SSL/TLS for cloud deployments
  • Environment Variables:

    MONGO_USE_TLS=true
    MONGO_CA_CERT_PATH=/path/to/ca.pem
    MONGO_ALLOW_INVALID_CERTS=false
    

Rate Limiting

  • 100 requests per 60 seconds (configurable)
  • Per-client tracking
  • Returns clear error on limit exceeded

Error Handling

  • Safe error messages (no sensitive data leaks)
  • Detailed internal logging
  • Graceful degradation

๐Ÿ“Š Environment Variables

Required

MONGO_URI=mongodb://user:password@host:port/database
MONGO_DB_NAME=dataflow

Optional (with defaults)

MONGO_TIMEOUT=5000              # Connection timeout (ms)
MONGO_POOL_SIZE=10              # Connection pool size
MONGO_MAX_IDLE_TIME=45000       # Max idle time (ms)
MONGO_USE_TLS=false             # Enable TLS
MONGO_CA_CERT_PATH=             # CA certificate path
LOGS_DIR=./logs                 # Log directory
LOG_LEVEL=INFO                  # Logging level

๐Ÿ“ Project Structure

dataflow_mcp/
โ”œโ”€โ”€ config/
โ”‚   โ”œโ”€โ”€ mongodb.py           # MongoDB connection with pooling
โ”‚   โ”œโ”€โ”€ security.py          # Validation and rate limiting
โ”‚   โ””โ”€โ”€ logging_config.py    # Logging setup
โ”œโ”€โ”€ tools/
โ”‚   โ””โ”€โ”€ data_manager.py      # CRUD operations and update logic (DataManager)
โ”œโ”€โ”€ scripts/
โ”œโ”€โ”€ main.py                  # MCP server and tools
โ”œโ”€โ”€ pyproject.toml          # Dependencies and config
โ””โ”€โ”€ .env.example            # Environment template

๐Ÿ”ง Configuration for Cloud Deployment

AWS Deployment

MONGO_URI=mongodb+srv://user:password@cluster.mongodb.net/dataflow
MONGO_USE_TLS=true
MONGO_ALLOW_INVALID_CERTS=false

Azure Deployment

MONGO_URI=mongodb://user:password@host.mongo.cosmos.azure.com:10255/database
MONGO_USE_TLS=true
MONGO_CA_CERT_PATH=/etc/ssl/certs/ca-certificates.crt

GCP Deployment

MONGO_URI=mongodb://user:password@instance:27017/database
MONGO_USE_TLS=true

๐Ÿšจ Production Checklist

  • [ ] MongoDB backups configured
  • [ ] SSL/TLS certificates installed
  • [ ] Environment variables set securely (not in code)
  • [ ] Logs redirected to centralized logging
  • [ ] Health checks configured in load balancer
  • [ ] Rate limits adjusted for your use case
  • [ ] MongoDB indexes optimized
  • [ ] Connection pool size tuned
  • [ ] Monitoring/alerting setup
  • [ ] Graceful shutdown tested

๐Ÿ“ˆ Performance Optimization

MongoDB Indexes

Pre-created indexes in scripts/mongo-init.js:

  • User email: unique constraint
  • Timestamps: for sorting and TTL
  • Status: for filtering

Connection Pooling

  • Default pool size: 10 (adjust via MONGO_POOL_SIZE)
  • Min connections: 2 (automatically maintained)
  • Max idle time: 45 seconds

Request Limits

  • Max filter size: 10KB
  • Max document size: 1MB
  • Max page size: 1000 documents
  • Rate limit: 100 req/min

๐Ÿงช Testing & Development

Install dev dependencies:

pip install -e ".[dev]"

Run tests:

pytest --cov=tools --cov=config

Code formatting:

black .
flake8 .
mypy .

๐Ÿ“ Logging

Logs are written to:

  • File: ./logs/mcp_server_YYYYMMDD.log (rotated daily, max 10MB)
  • Console: Real-time output

Log levels:

  • DEBUG - Detailed diagnostic info
  • INFO - General events
  • WARNING - Warning messages
  • ERROR - Error events

๐Ÿ› Troubleshooting

MongoDB Connection Failed

Check MONGO_URI and credentials
Verify MongoDB is running: mongosh "mongodb://..."
Check network connectivity and firewall

Rate Limit Exceeded

Default: 100 requests per 60 seconds
Increase MONGO_POOL_SIZE and optimize queries
Implement request queuing on client

High Memory Usage

Reduce MONGO_POOL_SIZE
Lower MONGO_MAX_IDLE_TIME
Check for large result sets (use pagination)

๐Ÿ“š References

๐Ÿ“„ License

MIT License - See LICENSE file for details

๐Ÿ‘ค Support

For issues and questions:

  1. Check troubleshooting section
  2. Review logs in ./logs/
  3. Check MongoDB connection
  4. Verify environment variables

Built for production-grade data operations with security-first design.

dataflow_mcp

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured