visual-hunt-mcp
A local TypeScript MCP server for macOS that connects to Chrome via CDP to search and download high-quality image candidates from curated sources like Unsplash, Pexels, and others for use in posters, wallpapers, and inspiration boards.
README
visual-hunt-mcp
visual-hunt-mcp is a local TypeScript MCP server for macOS that connects to your existing Chrome session over CDP and helps Codex / Claude find visually strong image candidates for posters, wallpapers, and inspiration boards.
v0.4.0 · 11 tools · audited end-to-end · 15/15 behavioural tests pass · 108 images downloaded in a real-world 10-topic stress test · ships with eval suite and full-test harness.
It is designed to:
- use the official
@modelcontextprotocol/sdkwithMcpServerandregisterTool - connect to a local Chrome instance via
chromium.connectOverCDP(...) - reuse your local logged-in browser profile when you choose to do manual-assisted review
- avoid bypassing paywalls, login walls, CAPTCHA, or platform controls
- return candidate metadata and optionally download image files into a project folder created under the MCP server's current working directory
Why CDP
This project uses Chrome DevTools Protocol instead of launching a fresh automation browser so the MCP server can reuse a local Chrome profile you control. That makes Instagram and Xiaohongshu workflows more practical because the browser session can already be logged in, while the MCP server still stays local on your Mac.
Why Instagram / Xiaohongshu Are Manual-Assisted
Instagram and Xiaohongshu often require login, can change layout frequently, and may gate content behind platform controls. This MCP deliberately opens those pages in your connected local Chrome and only extracts visible public image candidates when available. It does not try to bypass login, rate limits, or access restrictions.
Download Scope
This project can now download image files for sources such as Unsplash, Pexels, Pixabay, wallpaper sites, and visible image URLs surfaced from manual-assisted social browsing when your connected Chrome session has access. A few guardrails still apply:
- Google-hosted previews and Google Maps imagery stay blocked
- Instagram and Xiaohongshu still depend on your connected Chrome login state and whatever visible image URLs the page actually exposes
- you are still responsible for checking licensing before reuse
save_candidates_jsonremains metadata-only, whiledownload_candidate_imageswrites image files plus a download manifest
Features
MCP tools (11)
| Tool | Reliability | Notes |
|---|---|---|
search_wallpaper_sites |
★★★★★ | Flagship. Curated 6-source set (Unsplash, Pexels, Pixabay, Wikimedia, Wallhaven, Alpha Coders). 100% topic coverage in stress tests. |
open_xiaohongshu_visual_search |
★★★★★ | Flagship for travel/design topics. 100% topic coverage. |
open_google_maps_visual_search |
★★★ | Manual-assisted. Most images correctly blocked from download by design — use for visual inspiration. |
open_pinterest_visual_search |
★★ | Manual-assisted. Pinterest serves a login modal on most searches; log in via Chrome, then call extract_manual_page_candidates. |
open_instagram_visual_search |
★★ | Manual-assisted. Same login-wall pattern as Pinterest. |
search_google_images |
★★ | Inconsistent due to Google's lazy-loaded udm=2 layout. Prefer search_wallpaper_sites for primary search. |
open_url_and_extract_images |
★★★★ | Generic extractor. Works on Wikipedia, blogs, brand pages. |
extract_manual_page_candidates |
★★★★★ | Re-extract after manual scroll/login on a held page. |
close_manual_page |
★★★★★ | Cleanup. Idempotent. |
save_candidates_json |
★★★★★ | Metadata-only persistence. |
download_candidate_images |
★★★★★ | Download with referer + user-agent from connected Chrome. Google-hosted previews are blocked by design. |
Default source set (v0.4.0 curated)
The default search_wallpaper_sites source list was trimmed in v0.4.0 from 13 sources to the 6 that returned downloadable candidates on >80% of topics in real-world stress testing:
- Unsplash — modern editorial photography
- Pexels — modern editorial photography
- Pixabay — modern editorial photography
- Wikimedia Commons — landmarks, architecture, public-domain
- Wallhaven — top free 4K/8K wallpapers
- Alpha Coders / Wallpaper Abyss — top free 4K/8K wallpapers
Adapters for the dropped sources still ship — pass them explicitly when you need them: sites: ["openverse.org", "loc.gov", "images.nasa.gov", "rawpixel.com", "publicdomainpictures.net", "wallpaperscraft.com", "hdqwalls.com"].
Candidate scoring
Each candidate gets a lightweight scoreHint based on:
- image size
- aspect ratio closeness to common poster / wallpaper ratios
- title or alt text keywords such as
poster,cinematic,wallpaper,4k,8k,travel,film, andlandscape
Install
cd /Users/lanston/Desktop/Codex/visual-hunt-mcp
npm install
Start Chrome For MCP
npm run chrome:debug
This helper binds Chrome remote debugging to 127.0.0.1 only. It does not expose the CDP port publicly.
If you prefer the raw script:
chmod +x scripts/start-chrome-mcp.sh
./scripts/start-chrome-mcp.sh
Validation Workflow
Run the deterministic MCP smoke test after Chrome is up:
npm run smoke
For a comprehensive behavioural sweep covering all 11 tools (schema coercion, query optimization, manual-page lifecycle, login-wall suppression, false-positive filters, security guards):
node scripts/full-test.mjs # 15 cases including live Chrome paths
node scripts/full-test.mjs --skip-live # 10 cases, fast, no browser deps
What npm run smoke validates:
- the local MCP server starts over stdio from the current
dist/output tools/listexposes the expected tool setopen_url_and_extract_imagescan extract visible candidates from a local fixture pageextract_manual_page_candidatesandclose_manual_pageare exposed for manual-assisted follow-up flowssave_candidates_jsonwrites candidate metadata to a temporary output rootdownload_candidate_imagesdownloads an allowed local image and skips a blocked Google-hosted candidate with a clear reason
The smoke test uses a temporary VISUAL_HUNT_OUTPUT_ROOT and cleans it up on success. To keep the artifacts for inspection:
npm run smoke -- --keep-output
For a higher-confidence pass that also hits live Google Images and wallpaper search flows through your connected Chrome session:
npm run smoke:live
Use npm run validate as the stable default validation command. It now runs npm run build first and then the deterministic smoke test.
Run MCP Locally
npm run dev
For production-style use:
npm run build
node dist/index.js
Claude Desktop Config
{
"mcpServers": {
"visual-hunt": {
"command": "node",
"args": [
"/Users/lanston/Desktop/Codex/visual-hunt-mcp/dist/index.js"
],
"env": {
"CHROME_CDP_ENDPOINT": "http://127.0.0.1:9222"
}
}
}
}
Example Prompts
Use visual-hunt to search_google_images for "cinematic Oahu Hawaii sunset travel poster 8k", limit 10.
Use visual-hunt to search_wallpaper_sites for "dark luxury black gold abstract 8k wallpaper", limit 15.
Use visual-hunt to open_xiaohongshu_visual_search for "夏威夷 绝美 海报 壁纸 旅行 摄影", limit 10.
Use visual-hunt to download_candidate_images for project "oahu-poster-board" using the candidates from the last search, limit 5.
Use visual-hunt to extract_manual_page_candidates for the manualPageId from the Xiaohongshu search after I log in or scroll more.
Notes
- Logs are written to
stderronly so MCP JSON-RPC onstdoutstays clean. - Chrome must already be running with remote debugging enabled.
- Output folders are created under the MCP server's current working directory by default. Example:
/path/to/visual-hunt-mcp/<project>/images. - Override the output root with
VISUAL_HUNT_OUTPUT_ROOTif you want downloads somewhere else. save_candidates_jsonwrites metadata to<output-root>/<project>/candidates.json.download_candidate_imageswrites image files to<output-root>/<project>/imagesand a manifest to<output-root>/<project>/downloads.json.scripts/smoke-test.mjsis the quickest way to validate the MCP contract end-to-end against a local server process.scripts/smoke-test.mjsvalidates the local server through the currentdist/output and uses a temporary output root for artifacts.- The server does not store credentials.
- The server does not execute arbitrary shell commands.
Environment variables
| Var | Default | Effect |
|---|---|---|
CHROME_CDP_ENDPOINT |
http://127.0.0.1:9222 |
Chrome remote-debugging endpoint |
VISUAL_HUNT_OUTPUT_ROOT |
process.cwd() |
Root directory for downloaded images and manifests |
VISUAL_HUNT_TIMEOUT_MS |
20000 |
Per-page navigation/extraction timeout |
VISUAL_HUNT_MANUAL_PAGE_TTL_MS |
1800000 (30 min) |
TTL for entries in the manual-page registry; getManualPage refreshes |
VISUAL_HUNT_MANUAL_PAGE_MAX |
50 |
Hard cap on manual-page registry size; oldest evicted first |
VISUAL_HUNT_AUTO_REGISTER |
unset | Set to 1 to rewrite ~/.claude.json with this server entry on every startup. Off by default since v0.3.0. |
Manual Checks Beyond Smoke
When you want to sanity-check the live/manual-assisted parts of the product, these are the highest-value follow-ups after npm run smoke:
npm run smoke:liveopen_instagram_visual_searchwith the same Chrome profile you plan to use day-to-dayopen_xiaohongshu_visual_searchwith that same profile if login is requiredopen_google_maps_visual_searchto confirm the manual-review note and non-bypass behavior still look right
Exact Mac Commands
cd /Users/lanston/Desktop/Codex/visual-hunt-mcp
npm install
npm run chrome:debug
npm run smoke
npm run smoke:live
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.