Perfect Web Clone IDE

Perfect Web Clone IDE

Enables pixel-perfect website cloning from within the IDE by extracting real source code, styles, and assets, and assembling them into a React project with live preview.

Category
Visit Server

README

<div align="center">

Perfect Web Clone IDE

Pixel-Perfect Website Cloning for Your IDE

English · 中文

License: MIT Python 3.9+ MCP Playwright

An MCP server that brings pixel-perfect website cloning directly into your IDE.

Clone any website with one command. Get real code, not guesses.


</div>

Why This Tool?

Screenshot-based Tools Perfect Web Clone IDE
Guesses code from pixels Extracts real source code
Approximate CSS Exact computed styles
Broken responsive layouts Preserved responsiveness
Lost animations @keyframes preserved
Complex setup One config file

Inspired by Perfect-Web-Clone: Want to understand the full multi-agent architecture? Check out the original project. This IDE version brings that power directly into your coding workflow.


Features

One-Command Cloning

Just type in your IDE:

Clone https://stripe.com for me
帮我克隆 https://pixso.ai

The AI handles everything—crawling, code extraction, styling, and preview.

Pixel-Perfect Extraction

  • Real HTML Structure: Not guessed from screenshots
  • Exact CSS: Computed styles, variables, @keyframes
  • All Assets: Images, fonts, videos, SVGs
  • Framework Detection: Auto-injects Tailwind, Bootstrap, Bulma CDN

Parallel Processing

Multiple workers process different sections simultaneously:

┌─────────────────────────────────────────────────┐
│  "Clone https://example.com"                    │
│                    ↓                            │
│    ┌─────────┐ ┌─────────┐ ┌─────────┐        │
│    │Worker 1 │ │Worker 2 │ │Worker 3 │  ...   │
│    │Section 0│ │Section 1│ │Section 2│        │
│    └────┬────┘ └────┬────┘ └────┬────┘        │
│         └──────────┼───────────┘              │
│                    ↓                            │
│           Assembled Project                     │
│           + Live Preview                        │
└─────────────────────────────────────────────────┘

IDE Integration (MCP)

Works with any MCP-compatible IDE:

IDE / Tool Status
Claude Code Native support
Cursor Via MCP plugin
VS Code + Continue Via MCP extension
Any MCP Client Standard protocol

Getting Started

Prerequisites

  • Python 3.9+
  • Claude Code CLI (npm install -g @anthropic-ai/claude-code)
  • Playwright (playwright install chromium)

Installation

# Clone the repository
git clone https://github.com/ericshang98/Perfect-Web-Clone-IDE.git
cd Perfect-Web-Clone-IDE

# Create virtual environment
python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Install Playwright browser
playwright install chromium

Configure MCP Server

Add to ~/.claude/settings.json:

{
  "mcpServers": {
    "web-clone": {
      "command": "/path/to/Perfect-Web-Clone-IDE/venv/bin/python",
      "args": ["-m", "cloner"],
      "cwd": "/path/to/Perfect-Web-Clone-IDE"
    }
  }
}

Restart your IDE to load the MCP server.


Usage

Clone a Website

Clone https://example.com

The system will:

  1. Crawl the webpage with Playwright
  2. Extract HTML, CSS, JS, assets
  3. Split into logical sections
  4. Process each section in parallel
  5. Assemble into React project
  6. Preview in your browser

Output Structure

~/ClonedSites/example.com_20260129_123456/
├── output/
│   ├── preview.html          # Standalone preview
│   ├── src/
│   │   ├── app/
│   │   │   ├── page.tsx      # Next.js page
│   │   │   └── layout.tsx
│   │   └── components/
│   │       └── sections/     # React components
│   ├── package.json
│   └── tailwind.config.js
└── sections/                  # Raw extraction data

Run Full Project

cd ~/ClonedSites/example.com_20260129_123456/output
npm install && npm run dev

Architecture

┌──────────────────────────────────────────────────────────────┐
│                     MCP Server                               │
│                                                              │
│  ┌────────────────────────────────────────────────────────┐ │
│  │ clone_webpage(url)                                      │ │
│  │   → Playwright: Full page crawl                         │ │
│  │   → Extract: CSS, JS, computed styles, @keyframes       │ │
│  │   → Detect: Tailwind / Bootstrap / Bulma               │ │
│  │   → Split: Semantic sections                            │ │
│  └────────────────────────────────────────────────────────┘ │
│                            ↓                                 │
│  ┌────────────────────────────────────────────────────────┐ │
│  │ process_sections_batch()                                │ │
│  │                                                          │ │
│  │   ┌─────────┐ ┌─────────┐ ┌─────────┐                  │ │
│  │   │ Claude  │ │ Claude  │ │ Claude  │                  │ │
│  │   │ Worker  │ │ Worker  │ │ Worker  │  (Parallel)      │ │
│  │   └────┬────┘ └────┬────┘ └────┬────┘                  │ │
│  │        ↓           ↓           ↓                        │ │
│  │   Section0.jsx Section1.jsx Section2.jsx               │ │
│  └────────────────────────────────────────────────────────┘ │
│                            ↓                                 │
│  ┌────────────────────────────────────────────────────────┐ │
│  │ assemble_project()                                      │ │
│  │   → Combine components                                  │ │
│  │   → Generate Next.js structure                          │ │
│  │   → Start preview server                                │ │
│  │   → Open browser                                        │ │
│  └────────────────────────────────────────────────────────┘ │
└──────────────────────────────────────────────────────────────┘

MCP Tools

Tool Description
clone_webpage Crawl and split webpage
get_sections List extracted sections
process_section Process single section
process_sections_batch Parallel processing
assemble_project Assemble and preview
get_status Session status

Limitations

  • Login-required pages: Cannot clone authenticated pages
  • Heavy SPAs: Complex client-side rendering may not extract fully
  • Dynamic content: API-loaded content after page load may be missed

Related Projects

  • Perfect-Web-Clone: Full multi-agent system with 40+ tools and web UI. See this to understand the complete architecture.

Star History

Star History Chart


Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

License

MIT License - see LICENSE for details.


<div align="center">

Perfect Web Clone IDE - Extract the real code, not guesses.

Made with ❤️ by Eric Shang

</div>

Recommended Servers

playwright-mcp

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official
Featured
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Official
Featured
Local
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

Official
Featured
Local
TypeScript
VeyraX MCP

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official
Featured
Local
graphlit-mcp-server

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official
Featured
TypeScript
Kagi MCP Server

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official
Featured
Python
E2B

E2B

Using MCP to run code via e2b.

Official
Featured
Neon Database

Neon Database

MCP server for interacting with Neon Management API and databases

Official
Featured
Exa Search

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official
Featured
Qdrant Server

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official
Featured