Skrape MCP Server
This server converts webpages into clean, structured Markdown optimized for language model consumption, removing unnecessary content and supporting JavaScript rendering.
skrapeai
Tools
get_markdown
Get markdown content from a webpage using skrape.ai
Convert any webpage into clean, LLM-ready Markdown using skrape.ai. Perfect for feeding web content into LLMs.
This MCP server provides a simple interface to convert web pages to structured, clean Markdown format using the skrape.ai API. It's designed to work seamlessly with Claude Desktop, other LLMs, and MCP-compatible applications.
<a href="https://glama.ai/mcp/servers/7i81qzgkzd"> <img width="380" height="200" src="https://glama.ai/mcp/servers/7i81qzgkzd/badge" /> </a>
Why Use Skrape for LLM Integration?
- Clean, Structured Output: Generates well-formatted Markdown that's ideal for LLM consumption
- Noise Reduction: Automatically removes ads, navigation menus, and other irrelevant content
- Consistent Format: Ensures web content is uniformly structured regardless of the source
- JavaScript Support: Handles dynamic content by rendering JavaScript before conversion
- LLM-Optimized: Output is tuned for feeding web content into models such as Claude and GPT
Features
Tools
get_markdown - Convert any webpage to LLM-ready Markdown
- Takes any input URL and optional parameters
- Returns clean, structured Markdown optimized for LLM consumption
- Supports JavaScript rendering for dynamic content
- Optional JSON response format for advanced integrations
Installation
Installing via Smithery
To install Skrape MCP Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @skrapeai/skrape-mcp --client claude
Manual Installation
- Get your API key from skrape.ai
- Install dependencies:
npm install
- Build the server:
npm run build
- Add the server config to Claude Desktop:
On macOS:
nano ~/Library/Application\ Support/Claude/claude_desktop_config.json
On Windows:
notepad %APPDATA%/Claude/claude_desktop_config.json
Add this configuration (replace paths and API key with your values):
{
  "mcpServers": {
    "skrape": {
      "command": "node",
      "args": ["path/to/skrape-mcp/build/index.js"],
      "env": {
        "SKRAPE_API_KEY": "your-key-here"
      }
    }
  }
}
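If the config file already lists other servers, the `skrape` entry has to be merged in rather than pasted over them. The following is a minimal sketch of that merge, not part of the project itself: `addSkrapeServer`, the two interfaces, and the `other` server entry are hypothetical names used only for illustration, while the keys (`mcpServers`, `command`, `args`, `env`, `SKRAPE_API_KEY`) come from the configuration above.

```typescript
// Shape of the Claude Desktop config file, limited to the fields used here.
interface McpServerEntry {
  command: string;
  args: string[];
  env?: Record<string, string>;
}

interface ClaudeDesktopConfig {
  mcpServers?: Record<string, McpServerEntry>;
  [key: string]: unknown;
}

// Hypothetical helper: returns a copy of `config` with the "skrape" entry
// added, leaving any other configured servers untouched.
function addSkrapeServer(
  config: ClaudeDesktopConfig,
  serverPath: string,
  apiKey: string,
): ClaudeDesktopConfig {
  return {
    ...config,
    mcpServers: {
      ...(config.mcpServers ?? {}),
      skrape: {
        command: "node",
        args: [serverPath],
        env: { SKRAPE_API_KEY: apiKey },
      },
    },
  };
}

// Example: merge into a config that already defines another (made-up) server.
const existingConfig: ClaudeDesktopConfig = {
  mcpServers: { other: { command: "node", args: ["path/to/other/index.js"] } },
};
const updatedConfig = addSkrapeServer(
  existingConfig,
  "path/to/skrape-mcp/build/index.js",
  "your-key-here",
);

// JSON.stringify never emits trailing commas, so the printed text is valid JSON.
console.log(JSON.stringify(updatedConfig, null, 2));
```

Generating the file through `JSON.stringify` sidesteps hand-editing mistakes such as a stray trailing comma, which strict JSON parsers reject.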
Using with LLMs
Here's how to use the server with Claude or other LLMs:
- First, ensure the server is properly configured in your LLM application
- Then, you can ask the LLM to fetch and process any webpage:
Convert this webpage to markdown: https://example.com
Claude will use the MCP tool like this:
<use_mcp_tool>
<server_name>skrape</server_name>
<tool_name>get_markdown</tool_name>
<arguments>
{
"url": "https://example.com",
"options": {
"renderJs": true
}
}
</arguments>
</use_mcp_tool>
The resulting Markdown will be clean, structured, and ready for LLM processing.
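On the wire, an MCP client expresses a tool invocation like the one above as a JSON-RPC 2.0 `tools/call` request. The sketch below shows roughly what that message looks like for `get_markdown`. The helper `buildGetMarkdownCall`, the interface names, and the request id are illustrative assumptions; in practice an MCP client library builds this message for you.

```typescript
// Arguments accepted by get_markdown, as described in this README.
interface GetMarkdownArguments {
  url: string;
  returnJson?: boolean;
  options?: { renderJs?: boolean };
}

// Minimal shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: GetMarkdownArguments };
}

// Hypothetical helper: wraps the tool arguments in a tools/call envelope.
function buildGetMarkdownCall(id: number, args: GetMarkdownArguments): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: "get_markdown", arguments: args },
  };
}

const callRequest = buildGetMarkdownCall(1, {
  url: "https://example.com",
  options: { renderJs: true },
});

// Prints the request as a single line of JSON.
console.log(JSON.stringify(callRequest));
```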
Advanced Options
The get_markdown tool accepts these parameters:
- url (required): Any webpage URL to convert
- returnJson (optional): Set to true to get the full JSON response instead of just Markdown
- options (optional): Additional scraping options
  - renderJs: Whether to render JavaScript before scraping (default: true)
Example with all options:
<use_mcp_tool>
<server_name>skrape</server_name>
<tool_name>get_markdown</tool_name>
<arguments>
{
"url": "https://example.com",
"returnJson": true,
"options": {
"renderJs": false
}
}
</arguments>
</use_mcp_tool>
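The parameters and defaults above can be written down as types, which helps when constructing arguments in code. This is a sketch under stated assumptions rather than the server's actual validation: `resolveGetMarkdownParams` and both interface names are hypothetical, the `false` default for `returnJson` is inferred (the README only says it is optional), and the http/https restriction is an illustrative choice.

```typescript
// Arguments as a caller may supply them: only `url` is required.
interface GetMarkdownParams {
  url: string;
  returnJson?: boolean;
  options?: { renderJs?: boolean };
}

// The same arguments with every default filled in.
interface ResolvedGetMarkdownParams {
  url: string;
  returnJson: boolean;
  options: { renderJs: boolean };
}

// Hypothetical helper: checks the URL and applies defaults.
function resolveGetMarkdownParams(params: GetMarkdownParams): ResolvedGetMarkdownParams {
  // new URL() throws a TypeError when the string is not an absolute URL.
  const parsed = new URL(params.url);
  // Assumption for this sketch: only http(s) pages are accepted.
  if (parsed.protocol !== "http:" && parsed.protocol !== "https:") {
    throw new Error(`Unsupported URL scheme: ${parsed.protocol}`);
  }
  return {
    url: params.url,
    returnJson: params.returnJson ?? false, // assumed default
    options: { renderJs: params.options?.renderJs ?? true }, // documented default
  };
}

// Only the URL is given, so both defaults apply.
console.log(resolveGetMarkdownParams({ url: "https://example.com" }));

// Explicit values override the defaults.
console.log(
  resolveGetMarkdownParams({
    url: "https://example.com",
    returnJson: true,
    options: { renderJs: false },
  }),
);
```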
Development
For development with auto-rebuild:
npm run watch
Debugging
Since MCP servers communicate over stdio, debugging can be challenging. We recommend using the MCP Inspector:
npm run inspector
The Inspector will provide a URL to access debugging tools in your browser.
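When adding your own diagnostics during development, keep in mind that a stdio server's stdout carries the protocol messages, so log output generally needs to go to stderr instead. A minimal sketch of that pattern follows; `formatDebugLine` and `debugLog` are hypothetical helpers, not functions from this project.

```typescript
// Hypothetical helper: formats one diagnostic line with a timestamp and label.
function formatDebugLine(label: string, detail: unknown, now: Date = new Date()): string {
  return `[${now.toISOString()}] ${label}: ${JSON.stringify(detail)}`;
}

// console.error writes to stderr in Node.js, so stdout stays reserved
// for protocol messages.
function debugLog(label: string, detail: unknown): void {
  console.error(formatDebugLine(label, detail));
}

debugLog("get_markdown called", { url: "https://example.com" });
```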