mcp_server
fulong98
README
RunPod Python Code Execution with MCP
This project enables AI assistants to execute Python code on RunPod infrastructure using the Model Context Protocol (MCP). It consists of two main components:
- A RunPod Serverless API that executes Python code
- An MCP server that connects to the RunPod API and provides a standardized interface for AI assistants
System Architecture
┌───────────────┐ ┌───────────────┐ ┌───────────────┐
│ │ │ │ │ │
│ AI Assistant │────▶│ MCP Server │────▶│ RunPod API │
│ (e.g. Cline) │ │ (Python) │ │ (Serverless) │
│ │ │ │ │ │
└───────────────┘ └───────────────┘ └───────────────┘
Interaction Sequence
UML
1. RunPod Serverless Setup
Prerequisites
- Docker installed
- RunPod account with API key
- Basic knowledge of Docker and Python
Step 1: Build and Push Docker Image
The Dockerfile includes common data science libraries and the code handler: current docker image: docker.io/79c2e466/code_handler_serverless
# Clone this repository
git clone https://github.com/yourusername/runpod-python-executor.git
cd runpod-python-executor
# Build the Docker image
docker build -t yourusername/runpod-python-executor:latest .
# Login to Docker Hub
docker login
# Push the image to Docker Hub
docker push yourusername/runpod-python-executor:latest
Step 2: Deploy on RunPod Serverless
- Go to RunPod.io
- Navigate to Serverless → Deploy
- Enter your Docker image URL:
docker.io/79c2e466/code_handler_serverless
- Configure settings:
- Worker Type: CPU or basic GPU
- Min/Max Workers: 1/3 (adjust based on your needs)
- Idle Timeout: 5 minutes
- Click "Deploy"
Step 3: Get Endpoint ID
After deployment, RunPod will generate an endpoint ID. Copy this ID as you will need it to configure the MCP server.
Your Serverless API endpoint will be: https://api.runpod.io/v2/{endpoint_id}/run
You can test your endpoint using:
curl -X POST \
https://api.runpod.io/v2/{endpoint_id}/runsync \
-H 'Content-Type: application/json' \
-H 'Authorization: Bearer YOUR_RUNPOD_API_KEY' \
-d '{
"input": {
"code": "print(\"Hello, World!\")"
}
}'
2. MCP Server Setup
The MCP server connects AI assistants to your RunPod endpoint using the Model Context Protocol.
Prerequisites
- Python 3.8+ installed
- RunPod API key
- RunPod endpoint ID from Step 3 above
Step 1: Install Requirements
Installation Adding MCP to your python project We recommend using uv to manage your Python projects. In a uv managed python project, add mcp to dependencies by:
uv add "mcp[cli]" Alternatively, for projects using pip for dependencies:
pip install mcp Running the standalone MCP development tools To run the mcp command with uv:
uv run mcp
pip install mcp requests
Step 2: Configure the MCP Server
Create a file named runpod_mcp_server.py
with the code from the provided template. Update the following environment variables:
# Update these values
RUNPOD_API_KEY = "your_runpod_api_key"
RUNPOD_ENDPOINT_ID = "your_endpoint_id" # From serverless deployment
Step 3: Run the MCP Server
cline add mcp setting
"PythonServerlessExecutor": {
"command": "xx/.local/bin/uv",
"args": [
"run",
"xxx/mcp_server/runpod_serverless_mcp_server/server.py"
],
"disabled": false,
"autoApprove": []
},
Step 4: Using the MCP Server
Once connected, you can use the following tools:
execute_python_code
: Runs Python code on RunPodcheck_runpod_status
: Checks the connection status with RunPod
Example usage with Cline:
Can you run this Python code?
```python
import numpy as np
import matplotlib.pyplot as plt
x = np.linspace(0, 10, 100)
y = np.sin(x)
plt.plot(x, y)
plt.title("Sine Wave")
plt.savefig("sine_wave.png")
print("Generated sine wave plot")
Cline will use the MCP server to execute this code and return the results.
Sample
How It Works
- The AI assistant (like Cline) receives a request to run Python code
- It invokes the
execute_python_code
tool via the MCP protocol - The MCP server sends the code to the RunPod serverless endpoint
- RunPod executes the code in the Docker container
- Results (stdout, stderr, return code) are returned to the MCP server
- The MCP server formats the results and sends them back to the AI assistant
- The AI assistant displays the results to the user
Security Considerations
- The code execution happens in an isolated container
- Execution time is limited to prevent resource abuse
- You should implement additional security measures for production use
Troubleshooting
- MCP Server Connection Issues: Check your API key and endpoint ID
- Code Execution Timeouts: Your code might be too complex or infinite looping
- Missing Libraries: Check if your code requires libraries not in the Dockerfile
Advanced Configuration
You can modify the following to adjust behavior:
MAX_EXECUTION_TIME
: Change the timeout for code execution- Dockerfile: Add or remove libraries based on your needs
- Error handling: Customize error messages for specific scenarios
Why Serverless Instead of Pod Creation
We chose to use RunPod's serverless infrastructure rather than creating full pods for several important reasons:
- Log Access Limitations: RunPod does not provide a direct API to retrieve logs from created pods. This makes it difficult to capture the output of code execution when using the pod creation approach.
- Connection Complexity: Maintaining persistent connections to pods would require additional infrastructure for SSH or other remote access methods.
- Resource Management: Pods remain running and incur charges even when idle, whereas serverless functions only charge for actual execution time.
Cons of the Serverless Approach
- Cold Start Latency: The first execution might experience a delay as the container spins up (cold start).
- No Persistent Storage: Each execution starts with a fresh environment, making it difficult to maintain state between executions.
- Resource Constraints: Serverless functions often have more limited CPU, memory, and disk allocations compared to full pods. 4.Limited GPU Access: While possible, GPU access in serverless is more limited and typically more expensive than with dedicated pods.
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
MCP Package Docs Server
Facilitates LLMs to efficiently access and fetch structured documentation for packages in Go, Python, and NPM, enhancing software development with multi-language support and performance optimization.
Claude Code MCP
An implementation of Claude Code as a Model Context Protocol server that enables using Claude's software engineering capabilities (code generation, editing, reviewing, and file operations) through the standardized MCP interface.
@kazuph/mcp-taskmanager
Model Context Protocol server for Task Management. This allows Claude Desktop (or any MCP client) to manage and execute tasks in a queue-based system.
Linear MCP Server
Enables interaction with Linear's API for managing issues, teams, and projects programmatically through the Model Context Protocol.
mermaid-mcp-server
A Model Context Protocol (MCP) server that converts Mermaid diagrams to PNG images.
Jira-Context-MCP
MCP server to provide Jira Tickets information to AI coding agents like Cursor

Linear MCP Server
A Model Context Protocol server that integrates with Linear's issue tracking system, allowing LLMs to create, update, search, and comment on Linear issues through natural language interactions.

Sequential Thinking MCP Server
This server facilitates structured problem-solving by breaking down complex issues into sequential steps, supporting revisions, and enabling multiple solution paths through full MCP integration.