Replicate

Enables users to access and manage Replicate's AI models and predictions via the Model Context Protocol, offering tools for creating, canceling, and retrieving model predictions and parameters.

deepfates


Tools

search_models

Search for models using semantic search

list_models

List available models with optional filtering

list_collections

List available model collections

get_collection

Get details of a specific collection

create_prediction

Create a new prediction using either a model version (for community models) or model name (for official models)

cancel_prediction

Cancel a running prediction

get_prediction

Get details about a specific prediction

list_predictions

List recent predictions

get_model

Get details of a specific model including available versions

view_image

Display an image in the system's default web browser

clear_image_cache

Clear the image viewer cache

get_image_cache_stats

Get statistics about the image cache

README

Replicate MCP Server

A Model Context Protocol server implementation for Replicate. Run Replicate models through a simple tool-based interface.

Quickstart

  1. Install the server:
npm install -g mcp-replicate
  2. Get your Replicate API token from Replicate.

  3. Configure Claude Desktop:

    • Open Claude Desktop Settings (<kbd>⌘</kbd><kbd>,</kbd>)
    • Select the "Developer" section in the sidebar
    • Click "Edit Config" to open the configuration file
    • Add the following configuration, replacing your_token_here with your actual Replicate API token:
{
  "mcpServers": {
    "replicate": {
      "command": "mcp-replicate",
      "env": {
        "REPLICATE_API_TOKEN": "your_token_here"
      }
    }
  }
}
  4. Start Claude Desktop. You should see a 🔨 hammer icon in the bottom right corner of new chat windows, indicating the tools are available.

(You can also use any other MCP client, such as Cursor, Cline, or Continue.)
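If the configuration file already lists other MCP servers, the replicate entry has to be merged in rather than pasted over them. The following is a minimal sketch of that merge, not part of mcp-replicate itself: the helper name withReplicateServer and the two interfaces are hypothetical, and only the "mcpServers" / "command" / "env" layout is taken from the configuration shown above.

```typescript
// Shape of one entry under "mcpServers", as used in the Quickstart config.
interface McpServerEntry {
  command: string;
  args?: string[];
  env?: Record<string, string>;
}

// Only the "mcpServers" key is modeled; other keys are passed through untouched.
interface ClaudeDesktopConfig {
  mcpServers?: Record<string, McpServerEntry>;
  [key: string]: unknown;
}

// Hypothetical helper: returns a copy of `config` with a "replicate" entry
// added, leaving every other server entry as it was.
function withReplicateServer(
  config: ClaudeDesktopConfig,
  token: string,
): ClaudeDesktopConfig {
  if (token.trim() === "") {
    throw new Error("REPLICATE_API_TOKEN must not be empty");
  }
  return {
    ...config,
    mcpServers: {
      ...(config.mcpServers ?? {}),
      replicate: {
        command: "mcp-replicate",
        env: { REPLICATE_API_TOKEN: token },
      },
    },
  };
}

// Example: an existing config that already has one (made-up) server.
const existingConfig: ClaudeDesktopConfig = {
  mcpServers: { other: { command: "other-server" } },
};
const mergedConfig = withReplicateServer(existingConfig, "your_token_here");
console.log(JSON.stringify(mergedConfig, null, 2));
```

The helper returns a new object instead of mutating its argument, so the original file contents can still be kept as a backup before writing the result.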

Alternative Installation Methods

Install from source

git clone https://github.com/deepfates/mcp-replicate
cd mcp-replicate
npm install
npm run build
npm start

Run with npx

npx mcp-replicate

Features

Models

  • Search models using semantic search
  • Browse models and collections
  • Get detailed model information and versions

Predictions

  • Create predictions with text or structured input
  • Track prediction status
  • Cancel running predictions
  • List your recent predictions
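The create_prediction tool is described as taking either a model version (for community models) or a model name (for official models). As a rough sketch of what "either, but not both" means for the arguments, the snippet below validates a request object. The field names version, model, and input are assumptions chosen for illustration; the tool's actual input schema is not given in this document and should be checked in the client's tool listing.

```typescript
// Assumed argument shapes: exactly one of `version` or `model`, plus `input`.
type CreatePredictionArgs =
  | { version: string; input: Record<string, unknown> }
  | { model: string; input: Record<string, unknown> };

function isNonEmptyString(value: unknown): value is string {
  return typeof value === "string" && value.trim() !== "";
}

// Hypothetical validator: rejects requests that name both or neither target.
function parseCreatePredictionArgs(
  raw: Record<string, unknown>,
): CreatePredictionArgs {
  const hasVersion = isNonEmptyString(raw["version"]);
  const hasModel = isNonEmptyString(raw["model"]);
  if (hasVersion === hasModel) {
    throw new Error("Provide exactly one of `version` or `model`");
  }
  const input = raw["input"];
  if (typeof input !== "object" || input === null || Array.isArray(input)) {
    throw new Error("`input` must be an object of model parameters");
  }
  const typedInput = input as Record<string, unknown>;
  return hasVersion
    ? { version: raw["version"] as string, input: typedInput }
    : { model: raw["model"] as string, input: typedInput };
}

// Example with placeholder values (not a real model name).
const exampleArgs = parseCreatePredictionArgs({
  model: "owner/model-name",
  input: { prompt: "a watercolor fox" },
});
console.log(exampleArgs);
```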

Image Handling

  • View generated images in your browser
  • Manage image cache for better performance

Configuration

The server needs a Replicate API token to work. You can get one at Replicate.

There are two ways to provide the token:

1. In Claude Desktop Config (Recommended)

Add it to your Claude Desktop configuration as shown in the Quickstart section:

{
  "mcpServers": {
    "replicate": {
      "command": "mcp-replicate",
      "env": {
        "REPLICATE_API_TOKEN": "your_token_here"
      }
    }
  }
}

2. As Environment Variable

Alternatively, you can set it as an environment variable if you're using another MCP client:

export REPLICATE_API_TOKEN=your_token_here
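Either way, the token reaches the server process as the REPLICATE_API_TOKEN environment variable. A minimal sketch of how a Node.js process could read and check it at startup follows; the function name readReplicateToken is hypothetical, and this is not necessarily how mcp-replicate performs the check.

```typescript
// A plain record stands in for process.env so the function is easy to test.
type Env = Record<string, string | undefined>;

// Hypothetical startup check: fail early with a clear message if the token
// is missing or blank, instead of failing later on the first API call.
function readReplicateToken(env: Env): string {
  const token = env["REPLICATE_API_TOKEN"]?.trim();
  if (!token) {
    throw new Error(
      "REPLICATE_API_TOKEN is not set; add it to the MCP client config or export it in the shell",
    );
  }
  return token;
}

// In a real Node.js process this would be called as readReplicateToken(process.env).
const sampleToken = readReplicateToken({ REPLICATE_API_TOKEN: "your_token_here" });
console.log(`token length: ${sampleToken.length}`);
```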

Available Tools

Model Tools

  • search_models: Find models using semantic search
  • list_models: Browse available models
  • get_model: Get details about a specific model
  • list_collections: Browse model collections
  • get_collection: Get details about a specific collection

Prediction Tools

  • create_prediction: Run a model with your inputs
  • get_prediction: Check a prediction's status
  • cancel_prediction: Stop a running prediction
  • list_predictions: See your recent predictions
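Taken together, these tools suggest a create-then-poll workflow: start a prediction, then check its status until it reaches a final state. The sketch below illustrates that loop under some assumptions. The status names follow Replicate's HTTP API (starting, processing, succeeded, failed, canceled); the Prediction shape and the getPrediction callback are hypothetical stand-ins for however your MCP client invokes get_prediction.

```typescript
type PredictionStatus =
  | "starting"
  | "processing"
  | "succeeded"
  | "failed"
  | "canceled";

// Assumed minimal shape of a prediction; the real payload has more fields.
interface Prediction {
  id: string;
  status: PredictionStatus;
  output?: unknown;
}

// Stand-in for a call to the get_prediction tool.
type GetPrediction = (id: string) => Promise<Prediction>;

const TERMINAL_STATUSES = new Set<PredictionStatus>([
  "succeeded",
  "failed",
  "canceled",
]);

// Polls until the prediction reaches a terminal status or attempts run out.
async function waitForPrediction(
  id: string,
  getPrediction: GetPrediction,
  options: { intervalMs?: number; maxAttempts?: number } = {},
): Promise<Prediction> {
  const intervalMs = options.intervalMs ?? 1000;
  const maxAttempts = options.maxAttempts ?? 60;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const prediction = await getPrediction(id);
    if (TERMINAL_STATUSES.has(prediction.status)) {
      return prediction;
    }
    if (attempt < maxAttempts) {
      // Wait before the next check so the API is not hammered.
      await new Promise<void>((resolve) => setTimeout(resolve, intervalMs));
    }
  }
  throw new Error(`Prediction ${id} did not finish after ${maxAttempts} checks`);
}
```

Passing the lookup as a callback keeps the loop independent of any particular client, and the attempt limit guards against waiting forever on a stuck prediction.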

Image Tools

  • view_image: Open an image in your browser
  • clear_image_cache: Clean up cached images
  • get_image_cache_stats: Check cache usage

Troubleshooting

Server is running but tools aren't showing up

  1. Check that Claude Desktop is properly configured with the MCP server settings
  2. Ensure your Replicate API token is set correctly
  3. Try restarting both the server and Claude Desktop
  4. Check the server logs for any error messages

Tools are visible but not working

  1. Verify your Replicate API token is valid
  2. Check your internet connection
  3. Look for any error messages in the server output

Development

  1. Install dependencies:
npm install
  2. Start development server (with auto-reload):
npm run dev
  3. Check code style:
npm run lint
  4. Format code:
npm run format

Requirements

  • Node.js >= 18.0.0
  • TypeScript >= 5.0.0
  • Claude Desktop (or another MCP client) for using the tools

License

MIT
