groq-docs-mcp
Enables users to search Groq's API documentation using natural language queries through an MCP-compatible interface.
README
Groq Documentation MCP Server
An MCP (Model Context Protocol) server that provides semantic search over Groq's documentation using Cloudflare AI Search (AutoRAG) with R2 as the data source.
Features
search_documentationTool: Query Groq's API documentation using natural language- AI-Powered RAG: Uses Cloudflare AI Search for semantic search and retrieval
- Fast & Scalable: Built on Cloudflare Workers for global edge deployment
- MCP Compatible: Works with Claude Desktop and other MCP clients
Setup Instructions
Prerequisites
- Cloudflare account with Workers enabled
- Wrangler CLI installed:
npm install -g wrangler - Authenticated with Wrangler:
wrangler login
1. Install Dependencies
npm install
2. Install and Configure Rclone
Install rclone for fast bulk uploads:
brew install rclone # macOS
# Or: curl https://rclone.org/install.sh | sudo bash # Linux
Configure rclone for R2:
rclone config
# Choose 'n', name: 'r2', storage: 5, provider: 24
# Enter your Account ID and R2 API Token
3. Create R2 Bucket
wrangler r2 bucket create groq-docs
4. Scrape Documentation
Set up Browser Rendering API credentials:
export CLOUDFLARE_ACCOUNT_ID="your-account-id"
export CLOUDFLARE_API_TOKEN="your-api-token"
Run the scraper:
npm run scrape
This will:
- Use Browser Rendering API for clean content extraction
- Scrape all pages from https://console.groq.com/docs
- Save locally to
./scraped-docs/ - Bulk upload to R2 using rclone
Note: Takes several minutes depending on page count.
5. Configure AI Search (Manual)
In the Cloudflare Dashboard:
- Go to AI > AI Search
- Create a new AI Search instance named
groq-docs-ai-search - Configure the data source:
- Select R2 as the data source
- Choose the
groq-docsbucket
- Select embedding and generation models (use defaults)
- Set up AI Gateway for monitoring
- Assign a Service API token
- Wait for indexing to complete (monitor in the AI Search dashboard)
6. Deploy the Worker
Deploy the MCP server to Cloudflare Workers:
npm run deploy
Your server will be available at: groq-docs-mcp.<your-account>.workers.dev/mcp or groq-docs-mcp.<your-account>.workers.dev/sse
Usage
Connect to Claude Desktop
To use this MCP server with Claude Desktop:
- Open Claude Desktop settings
- Go to Settings > Developer > Edit Config
- Add this configuration:
{
"mcpServers": {
"groq-docs": {
"command": "npx",
"args": [
"mcp-remote",
"https://groq-docs-mcp.<your-account>.workers.dev/sse"
]
}
}
}
- Restart Claude Desktop
Connect to Cloudflare AI Playground
- Go to https://playground.ai.cloudflare.com/
- Enter your deployed MCP server URL:
groq-docs-mcp.<your-account>.workers.dev/sse - Start using the
search_documentationtool!
Example Queries
Try asking:
- "How do I use the Groq API?"
- "What models are available on Groq?"
- "How do I implement streaming with Groq?"
- "What are the rate limits for Groq API?"
- "How do I use OpenAI compatibility with Groq?"
Development
Local Development
Run the server locally:
npm run dev
The server will be available at http://localhost:8787
Type Checking
Generate types for Cloudflare bindings:
npm run cf-typegen
Check types:
npm run type-check
Code Formatting
Format code with Biome:
npm run format
npm run lint:fix
Project Structure
groq-docs-mcp/
├── src/
│ └── index.ts # Main MCP server implementation
├── scripts/
│ └── scrape-groq-docs.js # Documentation scraper script
├── scraped-docs/ # Local cache of scraped docs (git-ignored)
├── package.json # Dependencies and scripts
├── wrangler.jsonc # Cloudflare Worker configuration
└── README.md # This file
How It Works
- Scraping: Uses Cloudflare Browser Rendering API to extract clean markdown from Groq's documentation
- Storage: Documentation is stored as markdown files in R2 (uploaded via rclone)
- Indexing: Cloudflare AI Search indexes the R2 content using embeddings
- Query: The MCP tool queries the AI Search index and returns relevant documentation snippets
- Results: Formatted results include URLs, titles, content, and relevance scores
Troubleshooting
Scraper Issues
If the scraper fails:
- Check your internet connection
- Verify Groq's documentation site is accessible
- Ensure Wrangler is authenticated:
wrangler whoami
AI Search Not Working
If searches return no results:
- Verify the AI Search instance is created and named
groq-docs-ai-search - Check that indexing is complete in the AI Search dashboard
- Ensure the R2 bucket contains the scraped files:
wrangler r2 object list groq-docs
Worker Deployment Issues
If deployment fails:
- Verify Wrangler is up to date:
npm install -g wrangler@latest - Check your Cloudflare account has Workers enabled
- Ensure the R2 bucket exists:
wrangler r2 bucket list
License
MIT
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.