Social Media Scraper - Custom MCP Server
Enables scraping of LinkedIn, Facebook, Instagram profiles and Google search via a Model Context Protocol server, allowing AI assistants to fetch social media data and search results.
README
<h1 align="center">Social Media Scraper - Custom MCP Server</h1>
<br> <p align="center"> <img src="https://img.shields.io/badge/python-FFD43B?style=for-the-badge&logo=python&logoColor=306998" alt="Python"> <img src="https://img.shields.io/badge/fastmcp-FF6B6B?style=for-the-badge&logo=python&logoColor=white" alt="FastMCP"> <img src="https://img.shields.io/badge/rapidapi-7B68EE?style=for-the-badge&logo=rapidapi&logoColor=white" alt="RapidAPI"> <img src="https://img.shields.io/badge/linkedin-20B2AA?style=for-the-badge&logo=linkedin&logoColor=white" alt="LinkedIn API"> <img src="https://img.shields.io/badge/facebook-FF4500?style=for-the-badge&logo=facebook&logoColor=white" alt="Facebook API"> <img src="https://img.shields.io/badge/instagram-FF1493?style=for-the-badge&logo=instagram&logoColor=white" alt="Instagram API"> <img src="https://img.shields.io/badge/google_serper-32CD32?style=for-the-badge&logo=google&logoColor=white" alt="Google Serper API"> </p> <br> A comprehensive Model Context Protocol (MCP) server that provides social media scraping capabilities for LinkedIn, Facebook, Instagram, and Google search functionality.
What is MCP?
Model Context Protocol (MCP) is an open standard that enables AI assistants to securely connect with external data sources and tools. MCP servers act as bridges between AI models and various services, allowing for enhanced capabilities like real-time data access, API integrations, and custom tool execution.
Features
This server exposes the following tools for an AI assistant to use:
- LinkedIn Profile Scraping: Extract personal and company profile data
- Facebook Profile Scraping: Fetch public profile information
- Instagram Profile Scraping: Get profile data and basic information
- Google Search: Perform web searches using Google Serper API
Installation
Step 1: Adding MCP to your Python project
We recommend using uv to manage your Python projects.
If you haven't created a uv-managed project yet, create one:
uv init custom-mcp-server
cd custom-mcp-server
Step 2: Install MCP Dependencies
Then add MCP to your project dependencies:
uv add "mcp[cli]"
This will auto-generate files and folders similar to the project structure mentioned below, also create a .env file to securely store the API keys.
Step 3: Add Project Code
In the files generated look for main.py and copy paste the code given in main.py (repo).
Step 4: Install Additional Dependencies
uv add httpx python-dotenv fastmcp
Environment Configuration
Step 1: API Keys Setup
-
RapidAPI Key:
- Sign up at RapidAPI
- Subscribe to the following APIs (Most Important):
- Fresh LinkedIn Profile Data
- Facebook Scraper3
- Instagram Scraper Stable API
-
Google Serper API Key:
- Sign up at Serper.dev
- Get your API key from the dashboard
Step 2: Environment Variables
Create .env file in your project root with the following variables:
RAPIDAPI_KEY=your_rapidapi_key_here
SERPER_API_KEY=your_serper_api_key_here
Usage
Running with Claude Desktop
Step 1: Install the Server
You can install this server in Claude Desktop and interact with it right away by running:
uv run mcp install main.py
Step 2: Verify Installation
Later, go to Claude AI (desktop version) and you will see changes in the platform similar to the screenshot shown.
Step 3: Start Using the Tools
Paste the URLs of required platform and ask the AI to provide information of the mentioned URLs.
Example Usage
Please scrape this LinkedIn profile: https://linkedin.com/in/example-profile
Get company information for: https://linkedin.com/company/example-company
Troubleshooting
If the MCP tools don't appear in Claude Desktop:
Step 1: End Claude Processes
- Windows: Open Task Manager (Ctrl+Shift+Esc)
- Mac: Open Activity Monitor
- End all Claude-related processes
Step 2: Reinstall the Server
uv run mcp install main.py
Step 3: Restart Claude Desktop
Paste the URLs of required platform and ask the AI to provide information of the mentioned URLs.
Testing with MCP Inspector
Alternatively, you can test it with the MCP Inspector:
uv run mcp dev main.py
Project Structure
custom-mcp-server/
├── __pycache__/ # Python bytecode cache (auto-generated)
├── .venv/ # Virtual environment directory
├── .env # Environment variables (API keys)
├── .python-version # Python version specification
├── main.py # Main MCP server implementation
├── pyproject.toml # Project configuration and dependencies
├── README.md # Project documentation
└── uv.lock # UV lock file for reproducible builds
Response Format
All tools return JSON-formatted strings containing the scraped data. Example response structure:
{
"success": true,
"data": {
"profile": {
"name": "John Doe",
"title": "Software Engineer",
"location": "San Francisco, CA",
"bio": "Passionate about technology..."
}
},
"timestamp": "2024-01-15T10:30:00Z"
}
But using these tools via Claudes makes it readable.
Contributing
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests if applicable
- Submit a pull request
In case of any queries, please leave a message or contact me via the email provided in my profile.
<p align="center"> ⭐ <strong>Star this repository if you found it helpful!</strong> </p>
Recommended Servers
playwright-mcp
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Magic Component Platform (MCP)
An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.
Audiense Insights MCP Server
Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.
VeyraX MCP
Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.
graphlit-mcp-server
The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.
Kagi MCP Server
An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.
E2B
Using MCP to run code via e2b.
Neon Database
MCP server for interacting with Neon Management API and databases
Qdrant Server
This repository is an example of how to create a MCP server for Qdrant, a vector search engine.
Exa Search
A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.