claude-imagine
An MCP server that enables Claude Code to generate context-aware images using local GPU-powered diffusion models, either automatically during coding or on-demand via slash commands.
<div align="center"> <h1>Claude-Imagine</h1> <p><strong>Context-aware image generation for Claude Code, powered by your local GPU</strong></p>
<p>An MCP server that lets Claude Code generate real images while it works. No external APIs, no stock photos. Just your GPU and whatever diffusion models you have installed.</p>
<br/>
<a href="https://www.npmjs.com/package/claude-imagine"><img src="https://img.shields.io/npm/v/claude-imagine?style=for-the-badge" alt="NPM Version"></a> <a href="https://github.com/prenats/claude-imagine/blob/main/LICENSE"><img src="https://img.shields.io/github/license/prenats/claude-imagine?style=for-the-badge" alt="License"></a> <a href="https://github.com/prenats/claude-imagine/stargazers"><img src="https://img.shields.io/github/stars/prenats/claude-imagine?style=for-the-badge" alt="Stars"></a> <br/><a href="https://nodejs.org/"><img src="https://img.shields.io/badge/node-%3E%3D20-brightgreen?style=for-the-badge&logo=node.js" alt="Node.js"></a> <a href="https://www.typescriptlang.org/"><img src="https://img.shields.io/badge/typescript-5.x-blue?style=for-the-badge&logo=typescript&logoColor=white" alt="TypeScript"></a> <a href="https://modelcontextprotocol.io/"><img src="https://img.shields.io/badge/protocol-MCP-8A2BE2?style=for-the-badge" alt="MCP"></a> <a href=""><img src="https://img.shields.io/badge/Area-Image-blue?style=for-the-badge&logo=image&logoColor=white" alt="Area Image"></a>
<sub>Runs on</sub><br/> <img src="https://img.shields.io/badge/backend-ComfyUI-8B5CF6?style=for-the-badge" alt="ComfyUI">
<a href="docs/getting-started.md"><img src="https://img.shields.io/badge/docs-getting%20started-green?style=for-the-badge" alt="Getting Started"></a> <a href="docs/architecture.md"><img src="https://img.shields.io/badge/docs-architecture-orange?style=for-the-badge" alt="Architecture"></a> <a href="docs/image-reference.md"><img src="https://img.shields.io/badge/docs-image%20reference-red?style=for-the-badge" alt="Image Reference"></a> </div>
Why I made this
Every time you build something with Claude Code, you hit the same wall: placeholder images. Grey boxes, lorem picsum links, "TODO: add real image here." The layout is done, the code is clean, but the result looks lifeless.
Claude-Imagine changes that. Claude reads the code it's writing (the HTML structure, the component, the page theme, the brand colors) and uses that context to generate an image tailored for that exact spot. A hero section for a wellness brand gets a hero image that matches the palette and mood of the page. A product card for a vintage leather bag gets a photo that fits the store's tone. Images that belong where they are, because they were born from the context they live in.
When you're not coding, there are direct commands too. /claude-imagine:image-generate lets you generate any image on demand with full control over style, mood, lighting, and composition. /claude-imagine:image-suggest analyzes a project and recommends a visual asset plan. Same prompt engineering pipeline, just a different trigger.
Key Features
- Context-Aware Prompts: Claude reads the surrounding code and crafts image prompts tailored to the exact spot where the image will live
- 11 Image Types: from icons to hero images, each with optimized defaults for dimensions, style, mood, and lighting
- Full Creative Control: 15 styles, 12 moods, 12 compositions, 12 lighting options, 8 color palettes
- 3 Quality Tiers: fast / standard / high, auto-mapped to your fastest and best models
- Auto-Detection: discovers your installed models and assigns them to quality tiers
- Model Pinning: pin only the models you want for generation; other models on the server are ignored
- Smart Negative Prompts: auto-generated per type and style (SDXL only)
- Pluggable Backends: ComfyUI today, extensible architecture for future backends
- Flexible Scope: install globally or per-project
Quick Start
Requirements
- Claude Code installed
- Node.js 20+
- ComfyUI running with GPU access
- At least one diffusion model installed
See Getting Started for detailed instructions, tested models, VRAM requirements, and CLIP/VAE setup for Flux models.
Option 1: npx (recommended)
npx claude-imagine@latest
The interactive installer will:
- Ask for install scope (global or per-project)
- Copy skills, commands, and rules to your Claude Code config
- Detect your ComfyUI server and discover installed models
- Let you select which models to use for generation (model pinning)
- Assign quality tiers to selected models
- Generate config and register the MCP server
Tip (local scope): Run npx claude-imagine@latest from inside the project you want to install into. npx uses your current directory automatically, so no path entry is needed. This is the main advantage of npx over the from-source install for per-project setups.
Option 2: From source
git clone https://github.com/prenats/claude-imagine.git
cd claude-imagine
npm install
npm run build
npm test
./install.sh # install from source
Local scope note: Running ./install.sh from inside the cloned repo and choosing Local scope will show a warning and prompt you to enter the target project path. Type an absolute path (e.g. ~/projects/my-app) or press Enter to use the current directory (pwd). See Getting Started for details.
Verify
Open Claude Code and run /mcp to check that the Claude-Imagine MCP server is installed and connected. You should see it listed with a green status.
You can also verify from the terminal:
npx claude-imagine check
First image
/claude-imagine:image-generate a cozy coffee shop on a rainy evening
See Getting Started for the full setup guide, configuration, and verification.
How It Works
While coding (automatic)
You: "Build me a landing page for a sustainable coffee brand"
Claude Code writes the hero section, hits the <img> tag, and:
1. Reads the surrounding context (earth-tone palette, organic theme, warm copy)
2. Engineers a 150-250 word prompt tailored to that exact spot
3. Picks the right model, resolution, and quality tier
4. Sends the workflow to ComfyUI on your GPU
5. Drops the generated PNG into your project and keeps coding
Result: The hero image matches the page, not a random stock photo.
On demand (direct commands)
You: /claude-imagine:image-generate a cozy coffee shop on a rainy evening
Claude Code:
1. Engineers a 150-250 word prompt (scene, lighting, atmosphere, camera, color, materials, style, quality)
2. Selects the right model and resolution for the image type
3. Sends the workflow to ComfyUI
4. Saves the generated PNG to your project
Result: generated/cozy-coffee-shop-rainy-evening.png (1344x768)
Both modes use the same pipeline. The difference is where the context comes from: the code Claude is writing, or the description you provide. Prompt engineering runs on whichever Claude model powers your session. Image generation runs on your GPU.
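The on-demand example saves its result as generated/cozy-coffee-shop-rainy-evening.png. One way such a filename could be derived from a description is sketched below. The helper names, the stop-word list, and the word cap are assumptions made for illustration; the source does not document Claude-Imagine's actual naming logic.

```typescript
// Hypothetical helpers: the stop-word list and the word cap are illustrative guesses.
const STOP_WORDS = new Set<string>(["a", "an", "the", "on", "in", "of", "at", "for", "with"]);

function slugFromDescription(description: string, maxWords = 8): string {
  const words = description
    .toLowerCase()
    .replace(/[^a-z0-9\s-]/g, " ") // replace punctuation with spaces
    .split(/[\s-]+/)
    .filter((word) => word.length > 0 && !STOP_WORDS.has(word));
  // Fall back to a generic name when nothing usable is left.
  return words.length > 0 ? words.slice(0, maxWords).join("-") : "image";
}

function outputPath(description: string, outputDir = "generated"): string {
  return `${outputDir}/${slugFromDescription(description)}.png`;
}

console.log(outputPath("a cozy coffee shop on a rainy evening"));
// prints "generated/cozy-coffee-shop-rainy-evening.png"
```

Dropping filler words keeps filenames short while staying readable in a project tree.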
Image Types
| Type | Resolution | Tier | Use For |
|---|---|---|---|
| ICON | 512x512 | Fast | App icons, UI elements, favicons |
| THUMBNAIL | 768x432 | Fast | Blog cards, video thumbnails |
| BACKGROUND | 1344x768 | Fast | Page/section backgrounds |
| TEXTURE | 1024x1024 | Fast | Tileable patterns, surfaces |
| AVATAR | 768x768 | Standard | Profile photos, team portraits |
| CONTENT | 1024x768 | Standard | Article illustrations |
| BANNER | 1344x384 | Standard | Horizontal promo strips |
| PRODUCT | 896x1152 | Standard | E-commerce product photos |
| LOGO | 1024x1024 | High | Brand logo marks |
| HERO | 1344x768 | High | Full-width hero sections |
| FEATURED | 1024x1024 | High | Featured post/card images |
See Image Reference for all styles, moods, compositions, lighting, palettes, and dimension overrides.
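To make the defaults concrete, here is a minimal sketch of how the image type table could be represented as a lookup with optional dimension overrides. The resolutions and tiers are copied from the table; the object shape and the function name resolveTypeDefaults are assumptions, not the project's actual data model.

```typescript
type Tier = "fast" | "standard" | "high";

interface TypeDefaults {
  width: number;
  height: number;
  tier: Tier;
}

// Values copied from the Image Types table; the structure itself is hypothetical.
const IMAGE_TYPE_DEFAULTS: Record<string, TypeDefaults> = {
  ICON: { width: 512, height: 512, tier: "fast" },
  THUMBNAIL: { width: 768, height: 432, tier: "fast" },
  BACKGROUND: { width: 1344, height: 768, tier: "fast" },
  TEXTURE: { width: 1024, height: 1024, tier: "fast" },
  AVATAR: { width: 768, height: 768, tier: "standard" },
  CONTENT: { width: 1024, height: 768, tier: "standard" },
  BANNER: { width: 1344, height: 384, tier: "standard" },
  PRODUCT: { width: 896, height: 1152, tier: "standard" },
  LOGO: { width: 1024, height: 1024, tier: "high" },
  HERO: { width: 1344, height: 768, tier: "high" },
  FEATURED: { width: 1024, height: 1024, tier: "high" },
};

// Look up a type's defaults and apply any caller-supplied dimension override.
function resolveTypeDefaults(
  type: string,
  override: { width?: number; height?: number } = {},
): TypeDefaults {
  const defaults = IMAGE_TYPE_DEFAULTS[type.toUpperCase()];
  if (!defaults) {
    throw new Error(`Unknown image type: ${type}`);
  }
  return {
    width: override.width ?? defaults.width,
    height: override.height ?? defaults.height,
    tier: defaults.tier,
  };
}

console.log(resolveTypeDefaults("hero"));
console.log(resolveTypeDefaults("BANNER", { width: 1920 }));
```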
Usage
Skills (user-facing)
These are the slash commands you invoke directly in Claude Code:
| Skill | Description |
|---|---|
| /claude-imagine:image-generate | Engineer a detailed prompt and generate an image |
| /claude-imagine:image-suggest | Analyze a project and recommend 4-8 images with types, styles, and rationale |
MCP Tools (what Claude calls under the hood)
Skills call these tools on the MCP server. You don't invoke them directly; Claude does.
| Tool | Description |
|---|---|
| generate_image | Generate a single image with full control over type, style, mood, lighting, composition, palette, quality, dimensions, seed |
| batch_generate | Generate multiple images sequentially (one GPU job at a time) |
| list_capabilities | List all available types, styles, moods, compositions, lighting, palettes, and discovered models |
| check_server | Check if ComfyUI is reachable and report detected backend |
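The "one GPU job at a time" behavior of batch_generate can be pictured with a short sketch. The request and result shapes and the runBatchSequentially name are hypothetical, and the generator here is a stand-in that never contacts ComfyUI; the real tool's schema may differ.

```typescript
// Hypothetical request/result shapes for illustration only.
interface ImageRequest {
  prompt: string;
  type: string;
}

interface ImageResult {
  prompt: string;
  path: string;
}

type GenerateFn = (request: ImageRequest) => Promise<ImageResult>;

async function runBatchSequentially(
  requests: ImageRequest[],
  generate: GenerateFn,
): Promise<ImageResult[]> {
  const results: ImageResult[] = [];
  for (const request of requests) {
    // Await each job before starting the next, so only one job is in flight.
    results.push(await generate(request));
  }
  return results;
}

// Stand-in generator: simulates a short job instead of calling a server.
const fakeGenerate: GenerateFn = async (request) => {
  await new Promise<void>((resolve) => setTimeout(resolve, 5));
  return { prompt: request.prompt, path: `generated/${request.type.toLowerCase()}.png` };
};

runBatchSequentially(
  [
    { prompt: "minimal leaf icon", type: "ICON" },
    { prompt: "misty forest hero", type: "HERO" },
  ],
  fakeGenerate,
).then((results) => console.log(results.map((r) => r.path)));
```

A plain loop with await is preferred over Promise.all here because Promise.all would start every job at once and compete for the same GPU memory.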
CLI
| Command | Description |
|---|---|
| npx claude-imagine@latest | Run interactive setup |
| npx claude-imagine reconfigure | Re-select which models to pin and reassign quality tiers |
| npx claude-imagine check | Verify installation (skills, config, server) |
| npx claude-imagine uninstall | Remove all installed files and MCP registration |
| npx claude-imagine --version | Print version |
Configuration
Config file: ~/.config/claude-imagine/config.json (auto-generated during setup)
| Setting | What it controls |
|---|---|
| server.url | ComfyUI server address |
| models | Discovered models with type, tier, and sampling params |
| pinnedModels | Array of model IDs to use for generation (others are ignored) |
| imageTypes | Which model each image type uses, with optional dimension overrides |
| output.dir | Where generated images are saved (default: generated) |
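As an illustration of how pinnedModels narrows the discovered models, consider the sketch below. The DiscoveredModel shape, the model IDs, and the rule that an empty pin list leaves every model usable are all assumptions; the source only states that unpinned models are ignored.

```typescript
interface DiscoveredModel {
  id: string;
  tier: "fast" | "standard" | "high";
}

function selectPinnedModels(
  models: DiscoveredModel[],
  pinnedModels: string[],
): DiscoveredModel[] {
  // Assumption: an empty pin list means no pinning, so all models stay usable.
  if (pinnedModels.length === 0) {
    return models;
  }
  const pinned = new Set(pinnedModels);
  return models.filter((model) => pinned.has(model.id));
}

// Hypothetical model IDs, not taken from the project.
const discovered: DiscoveredModel[] = [
  { id: "example-fast.safetensors", tier: "fast" },
  { id: "example-standard.safetensors", tier: "standard" },
  { id: "example-high.safetensors", tier: "high" },
];

console.log(selectPinnedModels(discovered, ["example-fast.safetensors", "example-high.safetensors"]));
```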
Environment Variable Overrides
| Variable | Description |
|---|---|
| IMAGINE_SERVER_URL | Override server URL |
| IMAGINE_BACKEND | Override backend (default: comfyui) |
| IMAGINE_OUTPUT_DIR | Override output directory |
| IMAGINE_CONFIG | Override config file path |
Priority: environment variables > config file > hardcoded defaults
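The priority chain can be sketched as a small pure function. The variable names, server.url, output.dir, and the comfyui and generated defaults come from the tables above; the backend key in the config file, the default server URL (ComfyUI's usual local address), and the function names are assumptions.

```typescript
interface FileConfig {
  server?: { url?: string };
  output?: { dir?: string };
  backend?: string; // assumed key; the source documents only the env override
}

interface Settings {
  serverUrl: string;
  backend: string;
  outputDir: string;
}

type Env = Record<string, string | undefined>;

// The server URL default is an assumption, not a confirmed project default.
const DEFAULTS: Settings = {
  serverUrl: "http://127.0.0.1:8188",
  backend: "comfyui",
  outputDir: "generated",
};

// Return the first candidate that is a non-empty string.
function firstSet(...candidates: Array<string | undefined>): string | undefined {
  return candidates.find((candidate) => candidate !== undefined && candidate !== "");
}

// Environment variables win over the config file, which wins over defaults.
function resolveSettings(env: Env, file: FileConfig = {}): Settings {
  return {
    serverUrl: firstSet(env["IMAGINE_SERVER_URL"], file.server?.url) ?? DEFAULTS.serverUrl,
    backend: firstSet(env["IMAGINE_BACKEND"], file.backend) ?? DEFAULTS.backend,
    outputDir: firstSet(env["IMAGINE_OUTPUT_DIR"], file.output?.dir) ?? DEFAULTS.outputDir,
  };
}

const settings = resolveSettings(
  { IMAGINE_OUTPUT_DIR: "public/img" },
  { server: { url: "http://192.168.1.20:8188" }, output: { dir: "assets" } },
);
console.log(settings);
// serverUrl comes from the file, backend from defaults, outputDir from the environment
```

In a Node.js process, process.env could be passed as the first argument; taking it as a parameter keeps the function easy to test.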
See Getting Started for the full config reference, model tuning, quality tiers, and CLIP/VAE setup.
Architecture Overview
Skill (/claude-imagine:image-generate)
│ Claude engineers a 150-250 word prompt
│ Infers type, style, mood, lighting from context
▼
MCP Tool (generate_image)
│ Validates params, resolves model and dimensions
▼
Backend (ComfyUI)
│ Builds workflow → queues prompt → polls for result → downloads PNG
▼
Output
Saves image to project, returns report
The backend is pluggable: any server that implements the ImageBackend interface can be added to Claude-Imagine.
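To show what a pluggable backend might look like, here is a rough sketch. The source names the ImageBackend interface but does not list its members, so every method, the GenerateRequest shape, and the registry functions below are hypothetical; see the Architecture document for the real contract.

```typescript
// Hypothetical request shape; the real interface may take different parameters.
interface GenerateRequest {
  prompt: string;
  width: number;
  height: number;
  seed?: number;
}

// Assumed members: the source only confirms the interface name.
interface ImageBackend {
  readonly name: string;
  checkServer(): Promise<boolean>;
  generate(request: GenerateRequest): Promise<Uint8Array>; // PNG bytes
}

const backends = new Map<string, ImageBackend>();

function registerBackend(backend: ImageBackend): void {
  backends.set(backend.name, backend);
}

function getBackend(name: string): ImageBackend {
  const backend = backends.get(name);
  if (!backend) {
    throw new Error(`Unknown backend: ${name}`);
  }
  return backend;
}

// Stub backend for illustration: returns only the 8-byte PNG signature
// instead of contacting an image generation server.
const stubBackend: ImageBackend = {
  name: "stub",
  async checkServer() {
    return true;
  },
  async generate() {
    return new Uint8Array([0x89, 0x50, 0x4e, 0x47, 0x0d, 0x0a, 0x1a, 0x0a]);
  },
};

registerBackend(stubBackend);
console.log(getBackend("stub").name);
```

Keeping callers dependent on the interface rather than on ComfyUI specifics is what would let a new server be added without touching the MCP tools.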
See Architecture for the full module map, generation flow, backend abstraction, and config chain.
Contributing
Contributions are welcome, whether it's a bug fix, a new backend, or an improvement to prompt engineering. Claude-Imagine is TypeScript end-to-end, with a pluggable backend architecture that makes it straightforward to add support for new image generation servers.
See CONTRIBUTING.md for the full development guide, project structure, and how to extend.
Documentation
| Document | Description |
|---|---|
| Getting Started | Install, configure, verify, generate your first image |
| Architecture | Module map, generation flow, backend abstraction, config chain |
| Image Reference | All types, styles, moods, compositions, lighting, palettes |
| Contributing | Development setup, project structure, how to extend |
| Changelog | Release history |
License
This project is licensed under the MIT License. See the LICENSE file for details.