firecrawl-mcp-server
Enables web scraping and search with batch support, rate limiting, and flexible output modes (direct or file).
README
Firecrawl MCP Server
An MCP server for Firecrawl - web scraping and search with batch support, rate limiting, and flexible output modes.
Features
-
Scrape Tool: Extract content from web pages
- Single URL or batch scraping (up to 100 URLs)
- Multiple output formats: markdown, HTML, links, screenshot
- Smart content extraction (main content only)
- Geographic targeting
-
Search Tool: Search the web
- Single or batch queries (up to 10)
- Web, image, and news results
- Location-based search
- Time filtering (hour/day/week/month/year)
- Optional content scraping from results
-
Rate Limit Handling: Automatic retry with exponential backoff
-
Two Output Modes:
direct: Return results to the agentfile: Save as markdown files with date-based structure
Quick Start
# Install dependencies
bun install
# Set your Firecrawl API key
export FIRECRAWL_API_KEY=fc-your-api-key
# Run in development mode
bun run dev
# Test with MCP Inspector
bun run inspector
Configuration
All configuration via environment variables:
| Variable | Default | Description |
|---|---|---|
FIRECRAWL_API_KEY |
(required) | Your Firecrawl API key |
FIRECRAWL_API_URL |
https://api.firecrawl.dev/v2 |
Firecrawl API base URL |
FIRECRAWL_OUTPUT_MODE |
direct |
Output mode: direct or file |
FIRECRAWL_OUTPUT_DIR |
./firecrawl-output |
Directory for file output |
FIRECRAWL_RATE_LIMIT_RPM |
100 |
Requests per minute |
FIRECRAWL_RATE_LIMIT_RETRY_MS |
1000 |
Base retry delay in ms |
FIRECRAWL_RATE_LIMIT_MAX_RETRIES |
3 |
Maximum retry attempts |
LOG_LEVEL |
info |
Log level: debug, info, warning, error |
Tools
scrape
Scrape web pages and extract content.
{
"urls": "https://example.com",
"formats": ["markdown"],
"onlyMainContent": true,
"outputMode": "direct"
}
Parameters:
urls: Single URL or array of URLs (max 100)formats: Output formats -markdown,html,rawHtml,links,screenshotonlyMainContent: Extract only main content (default: true)includeTags: HTML tags to includeexcludeTags: HTML tags to excludewaitFor: Wait time in ms for dynamic contenttimeout: Request timeout in mslocation: Geographic targeting{ country: "US", languages: ["en"] }outputMode: Override mode -directorfileoutputDir: Override output directory
search
Search the web and optionally scrape results.
{
"queries": "firecrawl web scraping",
"limit": 10,
"scrapeResults": true,
"outputMode": "direct"
}
Parameters:
queries: Single query or array of queries (max 10)limit: Results per query (max 20, default: 10)location: Search location (e.g., "Germany", "United States")tbs: Time filter -qdr:h(hour),qdr:d(day),qdr:w(week),qdr:m(month),qdr:y(year)scrapeResults: Also scrape content from resultsscrapeFormats: Formats for scraped contentoutputMode: Override mode -directorfileoutputDir: Override output directory
Output Modes
Direct Mode (default)
Results are returned directly to the agent in the response.
File Mode
Results are saved as markdown files:
firecrawl-output/
├── scrape/
│ └── 2026-01-25/
│ └── example.com/
│ ├── index.md
│ └── about.md
└── search/
└── 2026-01-25/
└── 14-30-00/
└── firecrawl-web-scraping.md
Each file includes YAML frontmatter with metadata:
---
url: https://example.com
title: Example Domain
scraped_at: 2026-01-25T14:30:00.000Z
---
# Example Domain
This domain is for use in illustrative examples...
Client Configuration
Claude Desktop
Add to claude_desktop_config.json:
{
"mcpServers": {
"firecrawl": {
"command": "bun",
"args": ["run", "/path/to/firecrawl-mcp/src/index.ts"],
"env": {
"FIRECRAWL_API_KEY": "fc-your-api-key",
"FIRECRAWL_OUTPUT_MODE": "direct"
}
}
}
}
Cursor
Add to MCP settings:
{
"firecrawl": {
"command": "bun",
"args": ["run", "/path/to/firecrawl-mcp/src/index.ts"],
"env": {
"FIRECRAWL_API_KEY": "fc-your-api-key"
}
}
}
Rate Limiting
The server includes built-in rate limiting to handle Firecrawl's API limits:
- Token bucket algorithm for request throttling
- Automatic retry on 429 (rate limit) and 5xx errors
- Exponential backoff between retries
- Configurable limits via environment variables
Development
# Lint
bun run lint
# Type check
bun run typecheck
# Format code
bun run format
# Build for distribution
bun run build
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.