OpenAI Image MCP Server
Enables conversational image generation, editing, and refinement through OpenAI models with session memory for iterative creative workflows.
A Model Context Protocol (MCP) server that provides conversational OpenAI image generation capabilities. Generate, edit, and refine images through multi-turn conversations with advanced models like GPT-4o and GPT-4.1.
What Problems Does This Solve?
Traditional Image Generation Pain Points
- Single-shot limitations - "Make it more blue" requires re-describing everything
- No conversation memory - Each request starts from scratch
- Context loss - Can't reference previous images naturally
- Manual workflows - Complex multi-step processes require multiple tools
Our Solution
- Conversational refinement - "Make it more blue" works naturally
- Session memory - Builds on previous context automatically
- Reference awareness - "Use the same style as the previous image"
- Integrated workflows - Single interface for complex creative projects
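To make the idea of session memory concrete, here is a minimal, purely illustrative sketch of how a session could accumulate prompts so that a follow-up such as "make it more blue" is read in context. The `ToySession` class and its methods are hypothetical and are not part of this server's API; the actual server relies on the OpenAI models to carry conversational context.

```python
from dataclasses import dataclass, field


@dataclass
class ToySession:
    """Hypothetical in-memory session; not the server's implementation."""

    description: str
    history: list = field(default_factory=list)

    def add_prompt(self, prompt: str) -> str:
        """Record a prompt and return the accumulated context."""
        self.history.append(prompt)
        return self.effective_prompt()

    def effective_prompt(self) -> str:
        """Join the project description and every prompt so far, in order."""
        return " | ".join([self.description, *self.history])


session = ToySession("Logo design for tech startup")
session.add_prompt("modern tech logo")
context = session.add_prompt("make it more blue")
print(context)
```

The point of the sketch is only that a short refinement prompt becomes meaningful once earlier turns travel with it.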
Key Capabilities
Session-Based Conversations
# Start a focused session
session = create_image_session("Logo design for tech startup")
session_id = session["session_id"]  # assumes the returned data includes a "session_id" field
# Initial generation
result1 = generate_image_in_session(session_id, "modern tech logo")
# Natural refinement - no need to repeat everything
result2 = generate_image_in_session(session_id, "make it more minimalist")
# Build on context
result3 = generate_image_in_session(session_id, "try it in dark blue")
Hybrid Workflows
Start simple, expand when needed:
# Quick one-shot for immediate need
result = generate_image("modern office workspace")
# Later, promote to session for refinement
session = promote_image_to_session(
    result["image_path"],
    "Office workspace refinement project"
)
session_id = session["session_id"]  # assumes the returned data includes a "session_id" field
# Continue with conversational context
generate_image_in_session(session_id, "add more plants and warmer lighting")
Specialized Tools
- Product photography - E-commerce optimized with multiple angles
- UI/UX assets - Design elements with consistent styling
- Reference-based editing - Use existing images as style guides
- Batch processing - Multiple variations with consistent themes
Perfect For
LLM Applications
- Claude Desktop integration - Conversational image workflows
- AI assistants - Contextual image generation capabilities
- Chatbots - Visual content creation with memory
Creative Workflows
- Iterative design - Refine concepts through conversation
- Brand development - Consistent visual identity across assets
- Product visualization - Multiple angles and contexts
- Content creation - Blog headers, social media, presentations
Development Teams
- Rapid prototyping - Quick UI mockups and concepts
- Documentation - Visual aids and diagrams
- Marketing assets - Consistent brand imagery
- User testing - Visual variations for A/B testing
Quick Start
1. Installation
Requirements: Python 3.10 or higher
# Install the package
pip install openai-image-mcp
If you need to upgrade Python:
# Using pyenv (recommended)
pyenv install 3.11.8
pyenv global 3.11.8
pip install openai-image-mcp
# Or using Homebrew (macOS)
brew install python@3.11
python3.11 -m pip install openai-image-mcp
For development installation from source, see DEVELOPMENT.md
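Before installing, it can help to confirm that the interpreter meets the Python 3.10 requirement. The following is a small sketch using only the standard library; the function name `python_is_supported` is hypothetical.

```python
import sys

# Minimum version stated in the requirements above.
MIN_VERSION = (3, 10)


def python_is_supported(version=sys.version_info) -> bool:
    """Return True if the (major, minor) version is at least 3.10."""
    return tuple(version[:2]) >= MIN_VERSION


print("Supported" if python_is_supported() else "Please upgrade Python")
```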
2. Claude Desktop Integration
Add to your Claude Desktop MCP configuration:
{
  "mcpServers": {
    "openai-image-mcp": {
      "command": "sh",
      "args": [
        "-c",
        "openai-image-mcp 2> mcp_server_stderr.log"
      ],
      "env": {
        "OPENAI_API_KEY": "your_openai_api_key_here"
      }
    }
  }
}
For development setup and alternative configurations, see DEVELOPMENT.md
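If you would rather generate this entry than edit it by hand, the sketch below builds the same structure in Python and prints it as JSON. The helper name `build_server_entry` is hypothetical, and the location of the Claude Desktop configuration file varies by platform, so no path is assumed here.

```python
import json


def build_server_entry(api_key: str, log_file: str = "mcp_server_stderr.log") -> dict:
    """Return the mcpServers entry from the configuration shown above."""
    return {
        "mcpServers": {
            "openai-image-mcp": {
                "command": "sh",
                # stderr is redirected to a log file, as in the original entry.
                "args": ["-c", f"openai-image-mcp 2> {log_file}"],
                "env": {"OPENAI_API_KEY": api_key},
            }
        }
    }


config = build_server_entry("your_openai_api_key_here")
print(json.dumps(config, indent=2))
```

Merge the printed object into any existing `mcpServers` section rather than overwriting other servers you have configured.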
3. Start Creating
# Create a session for your project
session = create_image_session("Website hero images")
session_id = session["session_id"]  # assumes the returned data includes a "session_id" field
# Generate with natural language
generate_image_in_session(session_id, "modern tech office with diverse team")
# Refine naturally
generate_image_in_session(session_id, "make the lighting warmer")
# Add context
generate_image_in_session(session_id, "create a mobile version of this scene")
Available Tools
Core Session Management
- create_image_session - Start conversational session
- generate_image_in_session - Generate with context awareness
- get_session_status - View conversation history and progress
- close_session - End session and cleanup
Image Generation & Editing
- generate_image - General purpose (session optional)
- edit_image - Modify existing images
- generate_product_image - E-commerce optimized
- generate_ui_asset - UI/UX design elements
- analyze_and_improve_image - AI-powered image enhancement
Workflow Tools
- promote_image_to_session - Upgrade one-shot to conversational
- list_active_sessions - Manage multiple projects
- get_usage_guide - Comprehensive tool documentation
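Under the hood, an MCP client invokes these tools with JSON-RPC 2.0 `tools/call` requests. The sketch below builds such a request with the standard library only. The helper `tool_call_request` is hypothetical, and the argument name `description` is an assumption for illustration; check the tool schema reported by the server for the real parameter names.

```python
import json
from itertools import count

# Simple monotonically increasing request IDs.
_request_ids = count(1)


def tool_call_request(tool_name: str, arguments: dict) -> dict:
    """Build a JSON-RPC 2.0 `tools/call` request in the shape MCP uses."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# "description" is an assumed argument name, not confirmed by the README.
request = tool_call_request("create_image_session", {"description": "App icon design"})
print(json.dumps(request, indent=2))
```

In practice Claude Desktop or another MCP client builds and sends these messages for you; the sketch only shows what travels over the wire.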
Usage Patterns
Conversational Design Sessions (Recommended)
Best for: Multi-image projects, iterative refinement, brand consistency
session = create_image_session("App icon design")
session_id = session["session_id"]  # assumes the returned data includes a "session_id" field
generate_image_in_session(session_id, "colorful chat app icon")
generate_image_in_session(session_id, "make it more professional")
generate_image_in_session(session_id, "try different color schemes")
Quick One-Shot Generation
Best for: Immediate needs, single images, uncertain scope
generate_image("professional headshot for LinkedIn")
generate_product_image("wireless headphones", background_type="white")
Hybrid Start-Simple-Expand-Later
Best for: Testing concepts, uncertain requirements, flexible workflows
# Start quick
result = generate_image("logo concept for bakery")
# Expand when needed
session = promote_image_to_session(result["image_path"], "Bakery brand development")
session_id = session["session_id"]  # assumes the returned data includes a "session_id" field
generate_image_in_session(session_id, "create business card version")
Example Workflows
Brand Identity Development
session = create_image_session("TechCorp brand identity")
session_id = session["session_id"]  # assumes the returned data includes a "session_id" field
# Logo concepts
generate_image_in_session(session_id, "modern tech company logo")
generate_image_in_session(session_id, "make it more geometric and minimal")
# Expand to brand elements
generate_image_in_session(session_id, "business card design using this logo")
generate_image_in_session(session_id, "website header with the logo")
Product Marketing Suite
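# Product shot (one-shot generation)
product = generate_product_image("premium wireless headphones", angle="45deg")
# Promote the product shot to a session for the campaign
session = promote_image_to_session(product["image_path"], "Wireless headphones marketing")
session_id = session["session_id"]  # assumes the returned data includes a "session_id" field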
# Marketing variations
generate_image_in_session(session_id, "lifestyle shot with person using them")
generate_image_in_session(session_id, "create packaging design mockup")
Documentation
- LLM.md - Comprehensive guide for LLMs using this server
- DEVELOPMENT.md - Technical implementation, testing, and contribution guide
Requirements
- Python 3.10+ (3.11+ recommended)
- OpenAI API key with GPT-4o/GPT-4.1 access
- Poetry for dependency management (development only)
Environment Variables
- OPENAI_API_KEY (required) - Your OpenAI API key
- MCP_MAX_SESSIONS (optional) - Maximum concurrent sessions (default: 100)
- MCP_SESSION_TIMEOUT (optional) - Session timeout in seconds (default: 3600)
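As a rough sketch of how these variables and their defaults fit together, the snippet below reads them into a small settings object. `ServerSettings` and `load_settings` are hypothetical names used for illustration; the server's own configuration code may be organized differently.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class ServerSettings:
    api_key: str
    max_sessions: int = 100       # default for MCP_MAX_SESSIONS
    session_timeout: int = 3600   # default for MCP_SESSION_TIMEOUT, in seconds


def load_settings(env: Optional[Mapping[str, str]] = None) -> ServerSettings:
    """Read settings from `env`, falling back to os.environ when omitted."""
    env = os.environ if env is None else env
    api_key = env.get("OPENAI_API_KEY")
    if not api_key:
        raise RuntimeError("OPENAI_API_KEY is required")
    return ServerSettings(
        api_key=api_key,
        max_sessions=int(env.get("MCP_MAX_SESSIONS", "100")),
        session_timeout=int(env.get("MCP_SESSION_TIMEOUT", "3600")),
    )


# Example with an explicit mapping instead of the real environment.
settings = load_settings({"OPENAI_API_KEY": "sk-example", "MCP_MAX_SESSIONS": "10"})
print(settings.max_sessions, settings.session_timeout)
```

Passing a mapping explicitly keeps the example independent of whatever is set in your shell.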
Contributing
We welcome contributions! Please see DEVELOPMENT.md for:
- Technical architecture details
- Development setup instructions
- Testing guidelines
- Code style requirements
License
This project is licensed under the MIT License - see the LICENSE file for details.
Related Resources
- Model Context Protocol - Protocol specification
- OpenAI Responses API - Underlying API
- Claude Desktop - Primary integration target