MCP Servers

MCP Wayback Machine Server

Enables interaction with the Internet Archive's Wayback Machine to save web pages, retrieve archived versions, search historical snapshots, and check archive statistics without requiring API keys.

README

MCP Wayback Machine Server

Build Status

Release Status

An MCP (Model Context Protocol) server and CLI tool for interacting with the Internet Archive's Wayback Machine without requiring API keys.

Built with: MCP TypeScript Template

Overview

This tool can be used in two ways:

As an MCP server - Integrate with Claude Desktop for AI-powered interactions
As a CLI tool - Use directly from the command line with npx or global installation

Features:

Save web pages to the Wayback Machine
Retrieve archived versions of web pages
Check archive status and statistics
Search the Wayback Machine CDX API for available snapshots

Features

🔐 No API keys required - Uses public Wayback Machine endpoints
💾 Save pages - Archive any publicly accessible URL
🔄 Retrieve archives - Get archived versions with optional timestamps
📊 Archive statistics - Get capture counts and yearly statistics
🔍 Search archives - Query available snapshots with date filtering
⏱️ Rate limiting - Built-in rate limiting to respect service limits
💻 Dual mode - Use as MCP server or standalone CLI tool
🎨 Rich CLI output - Colorized output with progress indicators
🔒 TypeScript - Full type safety with Zod validation

Tools

1. save_url

Archive a URL to the Wayback Machine.

Input: url (required) - The URL to save
Output: Success status, archived URL, and timestamp
Handles rate limiting automatically

2. get_archived_url

Retrieve an archived version of a URL.

Input:
- url (required) - The URL to retrieve
- timestamp (optional) - Specific timestamp (YYYYMMDDhhmmss) or "latest"
Output: Archived URL, timestamp, and availability status

3. search_archives

Search for all archived versions of a URL.

Input:
- url (required) - The URL to search for
- from (optional) - Start date (YYYY-MM-DD)
- to (optional) - End date (YYYY-MM-DD)
- limit (optional) - Maximum results (default: 10)
Output: List of snapshots with dates, URLs, status codes, and mime types

4. check_archive_status

Check archival statistics for a URL.

Input: url (required) - The URL to check
Output: Archive status, first/last capture dates, total captures, yearly statistics

Technical Details

Transport: Stdio (for Claude Desktop integration)
HTTP Client: Built-in fetch with timeout support
Rate Limiting: 15 requests per minute (conservative limit)
Error Handling: Graceful handling with detailed error messages
Validation: URL and timestamp validation
TypeScript: Full type safety with Zod schema validation

API Endpoints (No Keys Required)

Save Page Now: https://web.archive.org/save/{url} - Archive pages on demand
- Documentation
Availability API: http://archive.org/wayback/available?url={url} - Check archive status
- Documentation
CDX Server API: http://web.archive.org/cdx/search/cdx?url={url} - Advanced search and filtering
- Documentation
TimeMap API: http://web.archive.org/web/timemap/link/{url} - Get all timestamps for a URL
- Memento Protocol
Metadata API: https://archive.org/metadata/{identifier} - Get Internet Archive item metadata
- Documentation
Search API: https://archive.org/advancedsearch.php?q={query}&output=json - Search collections
- Documentation

Project Structure

mcp-wayback-machine/
├── src/
│   ├── index.ts          # MCP server entry point
│   ├── tools/            # Tool implementations
│   │   ├── save.ts       # save_url tool
│   │   ├── retrieve.ts   # get_archived_url tool
│   │   ├── search.ts     # search_archives tool
│   │   └── status.ts     # check_archive_status tool
│   ├── utils/            # Utilities
│   │   ├── http.ts       # HTTP client with timeout
│   │   ├── validation.ts # URL/timestamp validation
│   │   └── rate-limit.ts # Rate limiting implementation
│   └── *.test.ts         # Test files (alongside source)
├── dist/                 # Built JavaScript files
├── package.json
├── tsconfig.json
└── README.md

Installation

As a CLI Tool (Quick Start)

Use directly with npx (no installation needed):

npx mcp-wayback-machine save https://example.com

Or install globally:

npm install -g mcp-wayback-machine
wayback save https://example.com

As an MCP Server

Install for use with Claude Desktop:

npm install -g mcp-wayback-machine

From Source

git clone https://github.com/Mearman/mcp-wayback-machine.git
cd mcp-wayback-machine
yarn install
yarn build

Usage

CLI Usage

The tool provides a wayback command (or use npx mcp-wayback-machine):

Save a URL

wayback save https://example.com
# or
npx mcp-wayback-machine save https://example.com

Get an archived version

wayback get https://example.com
wayback get https://example.com --timestamp 20231225120000
wayback get https://example.com --timestamp latest

Search archives

wayback search https://example.com
wayback search https://example.com --limit 20
wayback search https://example.com --from 2023-01-01 --to 2023-12-31

Check archive status

wayback status https://example.com

Get help

wayback --help
wayback save --help

Claude Desktop Configuration

Add to your Claude Desktop settings:

Using npm installation

{
  "mcpServers": {
    "wayback-machine": {
      "command": "npx",
      "args": ["mcp-wayback-machine"]
    }
  }
}

Using local installation

{
  "mcpServers": {
    "wayback-machine": {
      "command": "node",
      "args": ["/absolute/path/to/mcp-wayback-machine/dist/index.js"]
    }
  }
}

For development (without building)

{
  "mcpServers": {
    "wayback-machine": {
      "command": "npx",
      "args": ["tsx", "/absolute/path/to/mcp-wayback-machine/src/index.ts"]
    }
  }
}

Development

Available Commands

yarn dev         # Run in development mode with hot reload
yarn test        # Run tests with coverage
yarn test:watch  # Run tests in watch mode
yarn build       # Build for production
yarn start       # Run production build
yarn lint        # Check code style
yarn lint:fix    # Auto-fix code style issues
yarn format      # Format code with Biome

Testing

The project uses Vitest for testing with the following features:

Unit tests for all tools and utilities
Integration tests for CLI commands
Coverage reporting with c8
Tests located alongside source files (.test.ts)

Run tests:

# Run all tests with coverage
yarn test

# Run tests in watch mode during development
yarn test:watch

# Run CI tests with JSON reporter
yarn test:ci

Examples

Using with Claude Desktop

Once configured, you can ask Claude to:

"Save https://example.com to the Wayback Machine"
"Find archived versions of https://example.com from 2023"
"Check if https://example.com has been archived"
"Get the latest archived version of https://example.com"

CLI Script Examples

# Archive multiple URLs
for url in "https://example.com" "https://example.org"; do
  wayback save "$url"
  sleep 5  # Be respectful with rate limiting
done

# Check if a URL was archived today
wayback search "https://example.com" --from $(date +%Y-%m-%d) --to $(date +%Y-%m-%d)

# Export archive data
wayback search "https://example.com" --limit 100 > archives.txt

Troubleshooting

Common Issues

"URL not found in archive": The URL may not have been archived yet. Try saving it first.
Rate limit errors: Add delays between requests or reduce request frequency.
Connection timeouts: Check your internet connection and try again.
Invalid timestamp format: Use YYYYMMDDhhmmss format (e.g., 20231225120000).

Debug Mode

# Enable debug output
DEBUG=* wayback save https://example.com

# Check MCP server logs
DEBUG=* node dist/index.js

Resources

Official Documentation

Rate Limits & Best Practices

No hard rate limits for public APIs
Be respectful - add delays between requests
Use specific date ranges to reduce CDX result sets
Cache responses when possible
Include descriptive User-Agent header

Community

MCP Discord - Get help and share your experience
Internet Archive Forum - Wayback Machine discussions

Authenticated APIs (Not Implemented)

For completeness, here are Internet Archive APIs that require authentication but are not included in this MCP server:

S3-Compatible API (IAS3)

Authentication: S3-style access keys from https://archive.org/account/s3.php
Features: Upload files, modify metadata, create items, manage collections
Documentation:

Authenticated Search API

Authentication: S3 credentials
Features: Advanced search capabilities, higher rate limits
Access: Requires Internet Archive account
Documentation:
- Advanced Search API
- Search API Examples

Save Page Now 2 (SPN2) - Enhanced Features

Authentication: Partnership agreement typically required
Features: Bulk captures, priority processing, higher rate limits
Documentation:
- SPN2 API Guide
- Save Page Now Overview

Partner/Bulk Access APIs

Authentication: Special partnership agreement
Features: Bulk downloads, custom data exports, direct database access
Access: Contact Internet Archive directly
Documentation:
- Researcher Services
- Bulk Access Information

Getting API Keys

Create account at archive.org
Visit S3 API page (requires login)
Generate Access Key and Secret Key pair
Configure using ia configure command or manual configuration

Note: This MCP server focuses on public, keyless APIs to maintain simplicity and avoid credential management.

License

This project is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

You are free to:

Share — copy and redistribute the material in any medium or format
Adapt — remix, transform, and build upon the material

Under the following terms:

Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made
NonCommercial — You may not use the material for commercial purposes
ShareAlike — If you remix, transform, or build upon the material, you must distribute your contributions under the same license

For commercial use or licensing inquiries, please contact the copyright holder.

Recommended Servers

playwright-mcp

A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

Official

Featured

TypeScript

Magic Component Platform (MCP)

An AI-powered tool that generates modern UI components from natural language descriptions, integrating with popular IDEs to streamline UI development workflow.

Audiense Insights MCP Server

Enables interaction with Audiense Insights accounts via the Model Context Protocol, facilitating the extraction and analysis of marketing insights and audience data including demographics, behavior, and influencer engagement.

VeyraX MCP

Single MCP tool to connect all your favorite tools: Gmail, Calendar and 40 more.

Official

Featured

Local

graphlit-mcp-server

The Model Context Protocol (MCP) Server enables integration between MCP clients and the Graphlit service. Ingest anything from Slack to Gmail to podcast feeds, in addition to web crawling, into a Graphlit project - and then retrieve relevant contents from the MCP client.

Official

Featured

TypeScript

Kagi MCP Server

An MCP server that integrates Kagi search capabilities with Claude AI, enabling Claude to perform real-time web searches when answering questions that require up-to-date information.

Official

Featured

Python

E2B

Using MCP to run code via e2b.

Official

Featured

Neon Database

MCP server for interacting with Neon Management API and databases

Official

Featured

Exa Search

A Model Context Protocol (MCP) server lets AI assistants like Claude use the Exa AI Search API for web searches. This setup allows AI models to get real-time web information in a safe and controlled way.

Official

Featured

Qdrant Server

This repository is an example of how to create a MCP server for Qdrant, a vector search engine.

Official

Featured