Reddit Scraper

Reddit Scraper

Scrapes and queries Reddit posts, comments, and media from subreddits and user profiles without requiring API keys, storing data locally for offline access and filtering.

Category
Visit Server

README

mcp-reddit

MCP server for scraping Reddit - no API keys required.

Scrapes posts, comments, and media from subreddits and user profiles using old.reddit.com and Libreddit mirrors.

Features

  • No API keys - Scrapes directly, no Reddit API credentials needed
  • Media downloads - Images, videos with audio (requires ffmpeg)
  • Local persistence - Query scraped data offline
  • Rich filtering - By post type, score, keywords
  • Comments included - Full thread scraping

Installation

pip install mcp-reddit

Or with uvx:

uvx mcp-reddit

Configuration

Add to your Claude Desktop or Claude Code settings:

Claude Desktop (~/Library/Application Support/Claude/claude_desktop_config.json)

{
  "mcpServers": {
    "reddit": {
      "command": "uvx",
      "args": ["mcp-reddit"]
    }
  }
}

Claude Code

claude mcp add reddit -- uvx mcp-reddit

Or manually in ~/.claude.json:

{
  "mcpServers": {
    "reddit": {
      "command": "uvx",
      "args": ["mcp-reddit"]
    }
  }
}

Available Tools

Tool Description
scrape_subreddit Scrape posts from a subreddit
scrape_user Scrape posts from a user's profile
scrape_post Fetch a specific post by URL (supports media download)
get_posts Query stored posts with filters
get_comments Query stored comments
search_reddit Search across all scraped data
get_top_posts Get highest scoring posts
list_scraped_sources List all scraped subreddits/users

Example Usage

"Scrape the top 50 posts from r/LocalLLaMA"

"Fetch this post and download the image: https://reddit.com/r/ClaudeAI/comments/abc123/title"

"Search my scraped data for posts about 'fine-tuning'"

"Get the top 10 posts from r/ClaudeAI by score"

Data Storage

Data is stored in ~/.mcp-reddit/data/ by default.

Set MCP_REDDIT_DATA_DIR environment variable to customize:

{
  "mcpServers": {
    "reddit": {
      "command": "uvx",
      "args": ["mcp-reddit"],
      "env": {
        "MCP_REDDIT_DATA_DIR": "/path/to/your/data"
      }
    }
  }
}

Optional: Video with Audio

To download Reddit videos with audio, install ffmpeg:

# macOS
brew install ffmpeg

# Ubuntu/Debian
sudo apt install ffmpeg

# Windows
choco install ffmpeg

Credits

Built on top of reddit-universal-scraper by @ksanjeev284 - a full-featured Reddit scraper with analytics dashboard, REST API, and plugin system.

Releasing

  1. Update version in pyproject.toml and src/mcp_reddit/__init__.py
  2. Commit and push
  3. Create GitHub release → auto-publishes to PyPI

License

MIT

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured