MCP Image Analyzer Server

MCP Image Analyzer Server

Analyzes images using a vision AI model to provide detailed descriptions, enabling non-vision models to understand image content.

Category
Visit Server

README

MCP Image Analyzer Server

MCP server yang menganalisis gambar menggunakan vision AI model (mimo-v2.5-free) dan mengembalikan deskripsi detail untuk model tanpa vision.

šŸ“ Struktur

mcp-image-analyzer/
ā”œā”€ā”€ main.py              # MCP server implementation
ā”œā”€ā”€ requirements.txt     # Python dependencies
└── .venv/              # Python virtual environment

šŸ› ļø Tools yang Tersedia

1. analyze_image

Menganalisis gambar dengan instruksi custom atau analisis komprehensif.

Parameters:

  • image_path (required): Path ke file gambar
  • instruction (optional): Instruksi khusus untuk analisis

Contoh Penggunaan:

# Analisis komprehensif (default)
analyze_image("/home/nyx/photo.jpg")

# Ekstrak teks saja
analyze_image("/home/nyx/screenshot.png", "Extract all visible text from this image")

# Deskripsi UI
analyze_image("/home/nyx/ui.png", "Describe the UI layout and components")

# Analisis kode
analyze_image("/home/nyx/code.png", "Extract the code and explain what it does")

2. list_supported_formats

Menampilkan daftar format gambar yang didukung.

šŸ–¼ļø Format Gambar yang Didukung

  • JPG/JPEG
  • PNG
  • GIF
  • BMP
  • WebP
  • SVG
  • TIFF
  • HEIC
  • ICO

āš™ļø Konfigurasi

Sudah otomatis terintegrasi di opencode.json:

"imageAnalyzer": {
  "command": "/home/nyx/.config/opencode/mcp-image-analyzer/.venv/bin/python",
  "args": ["/home/nyx/.config/opencode/mcp-image-analyzer/main.py"],
  "type": "local",
  "enabled": true
}

šŸ”§ Setup & Maintenance

Install Dependencies (jika diperlukan)

cd ~/.config/opencode/mcp-image-analyzer
uv pip install -r requirements.txt

Test Server

~/.config/opencode/mcp-image-analyzer/.venv/bin/python main.py

Update Dependencies

cd ~/.config/opencode/mcp-image-analyzer
uv pip install --upgrade -r requirements.txt

šŸ“Š Workflow

User: @image.png
      ↓
Qwen 3.7 Max (tanpa vision)
      ↓
Memanggil analyze_image tool
      ↓
Image Analyzer MCP
      ↓
Mimo-v2.5-free (Vision Model)
      ↓
Teks deskripsi detail
      ↓
Qwen 3.7 Max memahami isi gambar

šŸŽÆ Use Cases

  1. Screenshot Analysis: Ekstrak teks dan deskripsi UI dari screenshot
  2. Code Review: Analisis kode dari screenshot atau diagram
  3. Design Review: Deskripsi layout dan komponen UI
  4. Document Analysis: Ekstrak konten dari dokumen scan
  5. Image Description: Deskripsi komprehensif untuk konteks

šŸ› Troubleshooting

Error: "File not found"

  • Pastikan path gambar benar
  • Gunakan absolute path atau path relatif dari home directory

Error: "Unsupported image format"

  • Cek ekstensi file
  • Lihat daftar format yang didukung dengan list_supported_formats

Error: "API Error" atau "Network Error"

  • Pastikan vision API server berjalan di http://192.168.1.10:20128/v1
  • Cek koneksi network
  • Verifikasi API key valid

MCP server tidak muncul di opencode

  • Restart opencode
  • Cek konfigurasi di opencode.json
  • Pastikan file main.py executable

šŸ“ Notes

  • Output dalam Bahasa Inggris untuk kompatibilitas maksimal
  • Model Qwen 3.7 Max dapat menentukan sendiri jenis analisis yang dibutuhkan
  • Analisis komprehensif membutuhkan lebih banyak token
  • Timeout API: 60 detik

šŸ”— Related

  • Vision Model: oc/mimo-v2.5-free
  • API Endpoint: http://192.168.1.10:20128/v1
  • FastMCP Documentation: https://gofastmcp.com

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured