MCP Servers

falai-mcp-server

A FastMCP server that exposes core fal.ai model API operations, enabling model catalogue browsing, search, schema retrieval, inference, queue management, and CDN uploads through natural language.

README

falai-mcp-server

A FastMCP server that exposes core fal.ai model API operations (model catalogue, search, schema retrieval, inference, queue management, CDN uploads). The server can run locally over STDIO or remotely via the Streamable HTTP transport, and now ships with Docker support for easier deployment.

<video controls width="1920" height="512" src="https://github.com/user-attachments/assets/f8cfb202-3d69-4395-959d-76b2a11181e7">Your browser does not support the video tag.</video>

Quick Start

PyPI Installation (Recommended)

pip install falai-mcp-tools

After installation, you can run the server with:

falai-mcp

Manual Installation

Clone the repository:

git clone https://github.com/berkbirkan/falai-mcp.git
cd falai-mcp

Create and activate a virtual environment:

python3 -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install the project in editable mode:
```
pip install -e .
```

Requirements

Python 3.10 or newer
A fal.ai API key: either FAL_KEY or the FAL_KEY_ID/FAL_KEY_SECRET pair
Docker (optional, only if you prefer containerized execution)

Configuration

Environment variables (prefixed with FALAI_) control runtime behaviour:

Variable	Description
`FAL_KEY` or `FAL_KEY_ID`/`FAL_KEY_SECRET`	fal.ai credentials (required for live API calls)
`FALAI_ALLOWED_MODELS`	Comma-separated list of explicit model IDs to expose
`FALAI_MODEL_KEYWORDS`	Comma-separated keywords to pre-filter models when no explicit list is provided
`FALAI_REQUEST_TIMEOUT`	HTTP timeout (seconds) for fal.ai requests (default: `120`)
`FALAI_ENABLE_HTTP`	Set to `true` to run the server with the Streamable HTTP transport
`FALAI_HTTP_HOST` / `FALAI_HTTP_PORT`	Bind address and port when HTTP transport is enabled (defaults: `0.0.0.0` / `8080`)

If you prefer a .env file, place it next to the project root (or mount it into the container) and load it before running the server.

Clients can override credentials and model filters per MCP session through the configure tool. Environment variables supply defaults when the client does not set overrides.

Usage

Local STDIO usage

Ensure your virtual environment is active and credentials are exported:
```
export FAL_KEY=sk_live_...
```
Run the server with the default STDIO transport:
```
falai-mcp
```
Leave the process running; configure your MCP client (Claude, Cursor, etc.) to launch this command via STDIO (see the client integration section).

Remote HTTP usage

Export credentials and enable the HTTP transport:

export FAL_KEY=sk_live_...
export FALAI_ENABLE_HTTP=true
export FALAI_HTTP_PORT=8080  # optional override

Start the server so it listens on the configured host/port:
```
falai-mcp
```
Confirm the HTTP transport is reachable (for example with curl -I http://localhost:8080/mcp/). Clients should connect to http://<host>:<port>/mcp/.

Docker Usage

Build the container image:
```
docker build -t falai-mcp .
```

Run the container with HTTP enabled and publish the port:

docker run \
  --rm \
  -e FAL_KEY=sk_live_... \
  -e FALAI_ENABLE_HTTP=true \
  -e FALAI_HTTP_PORT=8080 \
  -p 8080:8080 \
  falai-mcp

The MCP endpoint is now available at http://localhost:8080/mcp/.

Client integrations

Below are example configurations for popular MCP clients. Adjust paths, environment variables, and identifiers to match your setup.

Claude Desktop

Claude Desktop keeps its configuration in ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or the equivalent path on your platform.

STDIO (local process)

{
  "mcpServers": {
    "falai-local": {
      "command": "falai-mcp",
      "args": [],
      "env": {
        "FAL_KEY": "sk_live_..."
      }
    }
  }
}

Restart Claude Desktop after saving changes. Claude will spawn falai-mcp and communicate over STDIO.

Remote HTTP server

{
  "mcpServers": {
    "falai-remote": {
      "transport": {
        "type": "http",
        "url": "http://localhost:8080/mcp/"
      }
    }
  }
}

Cursor

Cursor reads MCP configuration from ~/.cursor/mcp.json.

STDIO (local process)

{
  "clients": {
    "falai-local": {
      "command": "falai-mcp",
      "args": [],
      "env": {
        "FAL_KEY": "sk_live_..."
      }
    }
  }
}

Remote HTTP server

{
  "clients": {
    "falai-remote": {
      "transport": {
        "type": "http",
        "url": "http://localhost:8080/mcp/"
      }
    }
  }
}

After editing mcp.json, restart Cursor (or reload MCP connections) to pick up the new configuration.

Available tools

Tool	Description
`configure(api_key=None, allowed_models=None, model_keywords=None)`	Override credentials and access scope for the active session
`models(page=None, total=None)`	List available models with optional pagination
`search(keywords)`	Search the model catalogue using space-separated keywords
`schema(model_id)`	Retrieve the OpenAPI schema for a model
`generate(model, parameters, queue=False)`	Run synchronous or queued inference
`result(url)`	Fetch the result of a queued request
`status(url)`	Check the status (optionally with logs) of a queued request
`cancel(url)`	Cancel a queued request
`upload(path)`	Upload a local file to fal.ai CDN

All tools enforce any configured allow-list and respect per-session overrides from the configure tool.

Development

Building for PyPI

Install build tools:
```
pip install build twine
```
Build the package:
```
python -m build
```

Upload to PyPI (test first with TestPyPI):

# Test upload
python -m twine upload --repository testpypi dist/*

# Production upload
python -m twine upload dist/*

Notes

Schema retrieval and queue inspection require valid fal.ai credentials; errors appear as MCP tool errors if credentials are missing or invalid.
Model discovery falls back to the bundled fal-client endpoint catalogue when fal.ai's public APIs are unavailable.
When running remotely, ensure network access between the client and the MCP server (open firewall ports, configure TLS or reverse proxies if needed).

License

This project is licensed under the MIT License - see the LICENSE file for details.

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured