fugu-mcp

fugu-mcp

An MCP server that exposes Sakana Fugu as on-demand tools inside Claude Code for second opinions and critiques.

Category
Visit Server

README

fugu-mcp

An MCP server that exposes Sakana Fugu — an OpenAI-compatible orchestration LLM — as on-demand tools inside Claude Code. Use it to pull a second opinion or run a small multi-model panel on a discrete, self-contained subproblem, without leaving your Claude Code session.

Fugu is a tool, not the driver. This does not make Fugu the main model for Claude Code. Claude Code only drives on Anthropic / Bedrock / Vertex / Foundry models. fugu-mcp lets the Claude main loop call out to Fugu on demand — it is a narrow tool, not a model swap.


What you get

Two tools, both backed by Sakana's POST /v1/chat/completions endpoint:

Tool Use it for
ask_fugu A general second opinion on a hard, self-contained question (tricky algorithm, design trade-off, "is my approach sound?").
fugu_second_opinion A review-shaped critique: hand it content (code, diff, answer, plan) plus a question, get back a skeptical senior-reviewer pass.

Plus a /fugu slash command for manual invocation.


Install

Requires Node 20+ (Node 26 recommended; uses native fetch).

git clone https://github.com/<your-org>/fugu-mcp.git
cd fugu-mcp
npm install
npm run build            # tsc -> dist/

export SAKANA_API_KEY=...   # your real key; never commit it

# run from the repo root so $(pwd) resolves to your clone
claude mcp add fugu -- node "$(pwd)/dist/index.js"

Then in Claude Code, e.g.: "Ask Fugu to cross-check this approach."

claude mcp add inherits your shell environment, so the exported SAKANA_API_KEY (and any FUGU_* overrides) flow through to the server.

Alternative: .mcp.json

To check the registration into a project (so collaborators get it automatically), add a .mcp.json at the repo root instead of running claude mcp add. A ready-to-edit template lives at .mcp.json.example:

{
  "mcpServers": {
    "fugu": {
      "command": "node",
      "args": ["dist/index.js"],
      "env": {
        "SAKANA_API_KEY": "REPLACE_WITH_YOUR_SAKANA_API_KEY",
        "FUGU_BASE_URL": "https://api.sakana.ai/v1",
        "FUGU_DEFAULT_MODEL": "fugu-ultra",
        "FUGU_TIMEOUT_MS": "120000"
      }
    }
  }
}

Do not commit a real SAKANA_API_KEY in .mcp.json. Either keep .mcp.json out of version control, or leave the placeholder and set the real key via the environment (the server reads process.env.SAKANA_API_KEY regardless of where the session is launched).

args uses a relative dist/index.js; this resolves against the directory Claude Code is launched from. Use an absolute path (/path/to/fugu-mcp/dist/index.js) if you run sessions from elsewhere.

Use with OpenClaw

OpenClaw supports MCP natively, so the same server works there — no separate build. After npm run build:

openclaw mcp add fugu \
  --command node \
  --arg "/path/to/fugu-mcp/dist/index.js" \
  --env SAKANA_API_KEY=your_key_here

openclaw mcp add probes the server before saving. Confirm with openclaw mcp list, and the ask_fugu / fugu_second_opinion tools become available to your OpenClaw agents. Run openclaw mcp reload after changing config so the new settings take effect on the next turn.


Tools

ask_fugu

A general second opinion / hard self-contained query. The prompt must contain everything Fugu needs — it cannot see your repo, files, or the current conversation.

Argument Type Required Default Notes
prompt string yes The full, self-contained problem or question.
system_prompt string no Steer Fugu's role/behavior.
model "fugu" | "fugu-ultra" no FUGU_DEFAULT_MODEL Pick a specific model.
max_tokens integer > 0 no 2000 Max output tokens.

Example invocation:

{
  "name": "ask_fugu",
  "arguments": {
    "prompt": "I'm choosing between a single-writer WAL and a per-shard log for a 50k-writes/sec ledger. Walk through the durability/throughput trade-offs and recommend one. Assume NVMe and fsync-per-commit.",
    "system_prompt": "You are a senior distributed-systems engineer.",
    "model": "fugu-ultra",
    "max_tokens": 1500
  }
}

In conversation you can just say: "Use ask_fugu to sanity-check this proof."

fugu_second_opinion

A review-shaped critique. Wraps a skeptical-senior-reviewer system prompt internally, so you only supply the material and the question.

Argument Type Required Default Notes
content string yes The material to review: code, a diff, an answer, a design, or a plan.
question string yes What you want assessed.
model "fugu" | "fugu-ultra" no FUGU_DEFAULT_MODEL Pick a specific model.

Example invocation:

{
  "name": "fugu_second_opinion",
  "arguments": {
    "content": "func transfer(from, to *Account, amt int) {\n  from.mu.Lock(); to.mu.Lock()\n  from.bal -= amt; to.bal += amt\n  from.mu.Unlock(); to.mu.Unlock()\n}",
    "question": "Is this concurrency-safe?",
    "model": "fugu-ultra"
  }
}

In conversation: "Get a fugu_second_opinion on this diff — is the error handling complete?"

/fugu slash command

/fugu <your question> routes your text to ask_fugu and asks Claude to summarize Fugu's answer and note where it agrees or disagrees — treating Fugu as one voice in a panel, not ground truth. See .claude/commands/fugu.md.


Configuration (environment variables)

All config is via env. Copy .env.example to .env for local use (.env is gitignored).

Variable Required Default Purpose
SAKANA_API_KEY yes (at call time) Sent as Authorization: Bearer <key>. Optional at startup so the server can boot and answer tools/list; a missing key surfaces as a clear tool error only when a tool is actually called — it never crashes the process. Env only; never write a real key to a tracked file.
FUGU_BASE_URL no https://api.sakana.ai/v1 OpenAI-compatible base URL. Requests go to <base>/chat/completions.
FUGU_DEFAULT_MODEL no fugu-ultra Model used when a tool call omits model.
FUGU_TIMEOUT_MS no 120000 Per-request timeout in milliseconds. Orchestrators are slow; the default is generous.

A malformed FUGU_BASE_URL (not a URL) or non-positive FUGU_TIMEOUT_MS fails fast at startup with a clear message. A missing key does not.


Hooks (for working on this repo)

This repo ships .claude/settings.json with hooks that keep the build green and guard secrets while you edit fugu-mcp itself:

  • PreToolUse on Write | Edit | Bash → runs scripts/guard-secrets.sh, which scans the payload for an API-key pattern and blocks the operation if a real-looking key is detected (so a key can't be written into a file or slipped into a commit).
  • PostToolUse on Edit | Write | MultiEdit → runs npm run typecheck (non-blocking; keeps types green during edits).

These hooks are for development of fugu-mcp; they are not required to use the MCP server in another project.


MCP_TIMEOUT for the consuming session

Fugu calls can take tens of seconds (up to ~120s). Claude Code's default MCP startup/tool timeout can be too short. In the session that consumes this server, set:

export MCP_TIMEOUT=120000

This raises Claude Code's MCP timeout to match the server's own FUGU_TIMEOUT_MS default, so slow Fugu responses aren't cut off.


Standalone vs integrated

Use Fugu standalone when: you want a one-shot answer and you're happy outside Claude Code. Drive Sakana's OpenAI-compatible API directly from Codex, Cursor, or curl. That path is simpler and avoids a hop.

Use this MCP tool when: you're already mid-task in Claude Code and want a narrow second opinion — a different model's take on a discrete, self-contained subproblem — without breaking flow. fugu-mcp is deliberately small: two tools, second-opinion shaped, no repo access.

Trade-offs to keep in mind:

  • Latency. Going through the Claude orchestrator adds a hop, and Fugu itself is slow (tens of seconds, occasionally up to ~120s). It is not for tight iteration loops.
  • Cost. Fugu output is roughly ~$30 / 1M output tokens. max_tokens defaults are kept modest (ask_fugu defaults to 2000); keep prompts and outputs tight.
  • Region. Sakana's API is blocked in the EU/EEA. If you're in that region you won't be able to call it.
  • Model IDs. This server defaults to fugu-ultra and exposes fugu / fugu-ultra. Verify the exact, current Fugu model IDs and endpoint against the Sakana docs — they can change. If they differ, it's a config edit (FUGU_DEFAULT_MODEL, FUGU_BASE_URL), not a code change.

Development

npm run build        # compile TypeScript to dist/
npm run typecheck    # tsc --noEmit
npm test             # vitest run (mocked fetch; no network, no key needed)
npm start            # node dist/index.js (stdio server)

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured