MCP Image Server
An intelligent MCP server that searches for images, labels them, combines them into a composite image, and uses Google's Gemini 2.0 Flash AI to select the best match for your query.
README
MCP Image Server
An intelligent MCP (Model Context Protocol) server that searches for images, labels them, combines them into a composite image, and uses Google's Gemini 2.0 Flash AI to select the best match for your query.
Features
- š Image Search: Searches multiple sources (Unsplash, Pexels, with fallback to Lorem Picsum)
- š·ļø Smart Labeling: Labels each image (1-20) with black borders for easy identification
- š¼ļø Image Composition: Combines all images into a single grid view
- š¤ AI Selection: Uses Google's Gemini 2.0 Flash to intelligently select the best matching image
- š¦ Base64 Output: Returns images as base64 for easy integration
Installation
-
Clone or download this repository
-
Install dependencies:
npm install -
Copy the environment template:
copy .env.example .env -
Edit
.envand add your API keys:GOOGLE_AI_API_KEY=your_google_ai_api_key_here UNSPLASH_ACCESS_KEY=your_unsplash_access_key_here PEXELS_API_KEY=your_pexels_api_key_here
Getting API Keys
Google AI API Key (Required for AI selection)
- Go to Google AI Studio
- Create a new API key
- Add it to your
.envfile asGOOGLE_AI_API_KEY
Unsplash API Key (Optional but recommended)
- Go to Unsplash Developers
- Create a new application
- Get your Access Key
- Add it to your
.envfile asUNSPLASH_ACCESS_KEY
Pexels API Key (Optional but recommended)
- Go to Pexels API
- Create a free account and get your API key
- Add it to your
.envfile asPEXELS_API_KEY
Usage
Development Mode
npm run dev
Build and Run
npm run build
npm start
Using with Claude Desktop
Add this server to your Claude Desktop configuration:
{
"mcpServers": {
"mcp-image-server": {
"command": "node",
"args": ["C:/path/to/your/project/build/index.js"]
}
}
}
Available Tools
search_and_select_best_image
Searches for images matching your query and returns the AI-selected best match.
Parameters:
query(string, required): The search query for imagesmaxImages(number, optional): Maximum number of images to search (1-20, default: 20)
Example usage in Claude:
"Search for the best image of a sunset over mountains"
How It Works
- Search: Queries multiple image APIs (Unsplash, Pexels) for relevant images
- Download: Downloads up to 20 images and resizes them to thumbnails
- Label: Adds numbered labels (1-20) with black borders to each image
- Compose: Combines all labeled images into a single grid
- AI Analysis: Sends the composite to Google's Gemini 2.0 Flash with your original query
- Select: AI analyzes each numbered image and selects the best match
- Return: Downloads and returns the selected image as base64
Error Handling
- If no API keys are provided, the server uses placeholder images from Lorem Picsum
- If Google AI API is not available, it uses a fallback selection algorithm
- Failed image downloads are replaced with placeholder images
- All errors are logged to stderr for debugging
Project Structure
src/
āāā index.ts # Main MCP server entry point
āāā services/
āāā ImageSearchService.ts # Handles image search from various APIs
āāā ImageProcessingService.ts # Processes and combines images
āāā AISelectionService.ts # Uses Google AI for selection
Development
npm run dev- Run in development mode with auto-restartnpm run build- Build TypeScript to JavaScriptnpm run clean- Clean build directory
License
MIT License
Contributing
Feel free to submit issues and enhancement requests!
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.