Wayback MCP. Track Web Content Changes Across Decades

Internet Archive Wayback MCP connects your agent to the world's largest web archive, with over 800 billion archived web pages spanning decades of internet history. Check a URL's current preservation status, analyze its capture timeline, and find specific content, such as images or PDFs, from any year. Researchers can track content changes, legal teams can verify evidence, and developers can study how websites evolved.

Claude

ChatGPT

Cursor

Gemini

Windsurf

VS Code

JetBrains

Vercel

Give Claude and any AI agent real-world access

Check URL availability

Determine if a specific website address has been archived and get the timestamp of its closest preserved version.

Analyze capture timeline

Find out when a page was first captured or what the most recent snapshot is, giving you clear start and end points for content history.

Filter by resource type

Limit searches to specific file types like PDFs, images, or stylesheets to pinpoint necessary historical assets.

Track status codes

Analyze capture records specifically for HTTP error or success codes (like 404 or 200) across a period of time.

Discover domain footprint

Find all archived subdomains associated with a main website, helping map out an entire organization's historical online presence.

What AI agents can do with Internet Archive Wayback: 10 Tools for Deep Web Analysis

These tools give your agent specific ways to query the web archive, letting you filter captures by type, time, or status code instead of just viewing a general history overview.

Make your AI actually useful.

Add this MCP to Claude, Cursor, or Windsurf and your AI stops guessing. It gets real tools to look things up, take action, and handle the stuff you keep doing by hand.

Start using Internet Archive Wayback MCP

Get Captures By Mime Type

Finds archived pages filtered by the specific file type, like showing only PDFs or images from a URL's history.

Get Captures By Status

Filters captured records by HTTP status code (e.g., finding all 404 errors across a...

Get Captures By Year

Retrieves all archived snapshots for a specific calendar year, allowing you to...

Get Cdx Captures

Gets a detailed list of every capture record, including the timestamp, MIME type...

Check Availability

Quickly determines if a URL has been archived and returns the date of the closest...

Get Captures Collapsed

Shows unique page captures for a given URL, eliminating redundant entries so you only see distinct versions.

Get Capture Count

Calculates and returns the total number of times a URL has been archived over its history.

Get First Capture

Identifies and retrieves metadata for the earliest preserved version of a URL...

Get Latest Capture

Gets the most recent archived snapshot of a page, giving you the newest recorded...

Get Subdomain Captures

Maps out an entire domain's historical presence by finding all captured subdomains...
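Get Captures Collapsed is described above as removing redundant entries. As a rough illustration of what that could mean, the sketch below keeps a capture only when its content digest differs from the capture before it. The record shape (dicts with `timestamp` and `digest` keys) mirrors the field names of the public Wayback CDX API and is an assumption here, not the tool's documented output; `collapse_by_digest` is a hypothetical helper.

```python
from typing import Dict, List


def collapse_by_digest(captures: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Keep a capture only when its digest differs from the previous capture's."""
    ordered = sorted(captures, key=lambda c: c["timestamp"])
    distinct: List[Dict[str, str]] = []
    for capture in ordered:
        if not distinct or distinct[-1]["digest"] != capture["digest"]:
            distinct.append(capture)
    return distinct


# Hypothetical sample records, for illustration only.
sample = [
    {"timestamp": "20180101000000", "digest": "AAA"},
    {"timestamp": "20180201000000", "digest": "AAA"},  # unchanged, dropped
    {"timestamp": "20180301000000", "digest": "BBB"},
    {"timestamp": "20180401000000", "digest": "AAA"},  # reverted, kept
]
distinct_versions = collapse_by_digest(sample)
```

Comparing only against the previous capture keeps a page that reverts to older content visible as a separate version, which matters when you are reconstructing a timeline.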

Security and governance baked right in.

Pick your AI client below to get set up. Create a Vinkius account, subscribe, and you're up and running. We handle the entire backend infrastructure, with out-of-the-box support for Streamable HTTP, SSE, and OAuth2, so there's no routing to configure.

Claude AI

Open Claude Settings

Go to claude.ai, click your profile icon, then navigate to Customize → Connectors.

Add Custom Connector

Click the "+" button and select Add custom connector. Paste your Vinkius endpoint URL:

https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp

Replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com. For OAuth-protected servers, expand Advanced settings to add credentials.

Start a conversation

Open a new chat. The Wayback integration is available immediately — no restart needed.

Antigravity

Configure Agent Environment

Open your Antigravity agent's workspace configuration or mcp-servers.json file.

Bind the Endpoint

Add the Vinkius endpoint URL to your agent's MCP connections list:

"mcp_servers": {
  "internet-archive-wayback": {
    "serverUrl": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Provide your secure token in place of [YOUR_TOKEN_HERE] to ensure your agent requests are authenticated.

Execute

Start your Antigravity session. The agent will autonomously discover and utilize the Wayback tools with full Vinkius guardrails applied.

VS Code Copilot

One-Click Install (Recommended)

In your Vinkius Dashboard, simply click the Add to VS Code button for this server. We'll automatically configure your local workspace.

Or configure manually

Open MCP Settings

Open VS Code, press Ctrl/Cmd + Shift + P, and search for MCP: Add Server, or open .vscode/mcp.json in your workspace.

Add Server Config

Add the Vinkius endpoint configuration under the servers key of your mcp.json file:

"servers": {
  "internet-archive-wayback": {
    "type": "http",
    "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp"
  }
}

Ensure you replace [YOUR_TOKEN_HERE] with your token from cloud.vinkius.com.

LangChain

Install Dependencies

Install the LangChain MCP adapters for your environment:

pip install langchain-mcp-adapters

Connect the Server

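Use MultiServerMCPClient from langchain-mcp-adapters to connect to the Vinkius managed endpoint:

import asyncio
from langchain_mcp_adapters.client import MultiServerMCPClient

# Connect to Vinkius (Streamable HTTP is assumed from the /mcp path)
client = MultiServerMCPClient({"internet-archive-wayback": {
    "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp",
    "transport": "streamable_http",
}})
tools = asyncio.run(client.get_tools())  # get_tools() is a coroutine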

CrewAI

Define the Tool

Load the Vinkius MCP tools into your CrewAI agents:

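from crewai import Agent
from crewai_tools import MCPServerAdapter  # pip install "crewai-tools[mcp]"

# Connect securely to Vinkius (Streamable HTTP is assumed from the /mcp path)
server_params = {
    "url": "https://edge.vinkius.com/[YOUR_TOKEN_HERE]/mcp",
    "transport": "streamable-http",
}

with MCPServerAdapter(server_params) as vinkius_tools:
    # Assign to Agent (goal and backstory are required; these values are placeholders)
    researcher = Agent(
        role='Data Researcher',
        goal='Research the archived history of web pages',
        backstory='An analyst who verifies what was published online and when.',
        tools=vinkius_tools,
    )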

Execute Task

Run your CrewAI process. The agent will autonomously route tasks to the Vinkius managed server.

Choose How to Get Started

Build a custom MCP for your own tools, or connect a ready-made integration from our catalog.

Build Your Own

Turn any API into an MCP. Import a spec, define Agent Skills, or deploy with MCPFusion.

Import from OpenAPI, Swagger, or YAML specs
Create Agent Skills with progressive disclosure
Deploy to edge with MCPFusion framework
Built in DLP, auth, and compliance on each call
Real time usage dashboard and cost metering
Publish to catalog or keep private

Start building

Make Your AI Do More

Start with Internet Archive Wayback, then connect any of our 5,200+ other servers whenever your AI needs more. One click, no limits.

Use this MCP plus 5,200+ others, all in one place
Add new capabilities to your AI anytime you want
Connections are secured and governed automatically
Track usage and costs across all your servers
Works with Claude, ChatGPT, Cursor, and more
New servers added to the catalog weekly

Independent Platform Disclaimer: Vinkius is an independent platform and is not affiliated with, endorsed by, sponsored by, verified by, or otherwise authorized by Internet Archive Wayback Machine. All third-party trademarks, logos, and brand names are the property of their respective owners. Their use on this website is strictly for informational purposes to identify service compatibility and interoperability.

VINKIUS CLOUD

Cloud Hosted

Managed infra

V8 Isolated

Sandboxed per request

Zero-Trust Proxy

No stored credentials

DLP Enforced

Policy on each call

GDPR Compliant

EU data residency

Token Compression

~60% cost reduction

Your data is protected. See how we built it.

Finding Web History Is a Click-Heavy Nightmare

Today, digging into old website content means opening the Wayback Machine and manually inputting dates. You copy a URL, check one year, then another. If you want to know if an image was posted in 2015, you have to filter by date, then filter by MIME type (image/jpeg), and hope you didn't miss anything. It’s slow, tedious, and easy to misread.

With this MCP, your agent handles the whole process. You tell it, 'Find all JPEG images for X URL between 2015 and 2017.' Your AI client gets back a clean list of data points, complete with timestamps and status codes. The guesswork is gone.
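To make the example prompt concrete, here is a minimal sketch of how such a request could map onto a query against the public Wayback CDX API, which supports `from`, `to`, and `filter` parameters. The MCP tool's own parameters are not documented on this page, so `build_cdx_query` and its arguments are hypothetical, and `example.com/*` stands in for "X URL".

```python
from urllib.parse import urlencode

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(url: str, year_from: int, year_to: int, mime_type: str) -> str:
    """Build a CDX query URL for captures of one MIME type within a year range."""
    params = [
        ("url", url),
        ("from", str(year_from)),             # inclusive start, e.g. 2015
        ("to", str(year_to)),                 # inclusive end, e.g. 2017
        ("filter", f"mimetype:{mime_type}"),  # keep only this MIME type
        ("output", "json"),
    ]
    return f"{CDX_ENDPOINT}?{urlencode(params)}"


# "Find all JPEG images for X URL between 2015 and 2017"
query = build_cdx_query("example.com/*", 2015, 2017, "image/jpeg")
```

The function only builds the URL and makes no network request, so you can inspect or log the query before anything is sent.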

Accessing Web Archive Details with `get_cdx_captures`

Instead of clicking through dozens of yearly views to piece together a timeline, you run one query. The `get_cdx_captures` tool pulls every available metadata point (timestamp, status code, file size, and MIME type) into one record set.

Now you have the full picture in plain data. It's not just 'archived.' It tells you *how* it was archived, letting you analyze patterns that were previously hidden behind layers of manual clicking.
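As an illustration of what such a record set looks like in practice, the sketch below parses the JSON format returned by the public Wayback CDX API (a header row followed by data rows) into dictionaries. Whether `get_cdx_captures` returns exactly this shape is an assumption; the sample payload and `parse_cdx_json` are hypothetical.

```python
import json
from typing import Dict, List


def parse_cdx_json(payload: str) -> List[Dict[str, str]]:
    """Turn CDX JSON output (header row plus data rows) into a list of dicts."""
    rows = json.loads(payload)
    if not rows:
        return []
    header, *data = rows
    return [dict(zip(header, row)) for row in data]


# Hypothetical response body, shaped like the CDX API's output=json format.
sample_payload = json.dumps([
    ["urlkey", "timestamp", "original", "mimetype", "statuscode", "digest", "length"],
    ["com,example)/", "20150314120000", "http://example.com/", "text/html", "200", "ABC123", "1024"],
    ["com,example)/", "20160101000000", "http://example.com/", "text/html", "404", "DEF456", "512"],
])
records = parse_cdx_json(sample_payload)
```

Each record then exposes the timestamp, status code, MIME type, and size by name, which makes later filtering and counting straightforward.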

Support: 24/7, support@vinkius.com

web-archiving

url-history

snapshot-analysis

digital-preservation

internet-history

cdx-api

What Wayback MCP does for your AI

Need to know what a website looked like five years ago? This MCP connects your AI agent directly to the Internet Archive Wayback Machine. Instead of guessing or relying on single-point snapshots, you can check a URL's full history—a massive archive spanning over 25 years. Your agent verifies if a page was ever archived and finds its most recent snapshot instantly.

You can dig into granular details: Did they change their logo? Find all JPEG images from 2018. Was the site down on a specific date? Check the HTTP status codes for that year. The power of this data is channeled through Vinkius, making historical web analysis available to any compatible client.

It's ideal for journalists tracking deleted content or developers comparing design iterations over time.

Built · Hosted · Managed by Vinkius

Wayback MCP - Track Web History & Archival Status

Server ID 019d75b6-74cf-725b-bd9f-44abe74f65bc

Vinkius Inspector

Compliance Grade A+

Score 100/100

Who uses Wayback MCP

This MCP serves anyone whose job involves tracking content over time. Journalists need proof of what was online and when. Lawyers require verifiable evidence of website text or design for legal action. Developers use it to model product evolution, while academics study the internet's changing face.

Investigative Journalist

They check whether a controversial statement by an official was ever archived online and use get_first_capture to find the date of the earliest capture.

Legal Compliance Officer

They prove that specific website content existed on a certain day, running checks to preserve evidence of deleted or altered material.

Frontend Developer

They compare the structure and design of their site from five years ago against today, using get_captures_by_mime_type for historical CSS files.

Benefits of connecting Wayback MCP

Instantly verify content history. Use check_availability to confirm if a page was ever archived, saving you the manual effort of checking multiple archives.

Analyze site evolution with precision. Instead of guessing, use get_captures_by_year to pull all snapshots from a specific year for comparison.

Track content changes over time. Use get_first_capture and get_latest_capture together to measure the span between a page's earliest and most recent captures.

Build domain maps easily. The get_subdomain_captures tool reveals the full historical footprint of an organization, finding subdomains you didn't know existed.

Filter data for specific evidence. Only need to know whether images were archived? Use get_captures_by_mime_type to exclude HTML pages and other file types.
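The first-versus-latest comparison mentioned above comes down to timestamp arithmetic. Wayback timestamps use a 14-digit `YYYYMMDDhhmmss` form, so a minimal sketch looks like this. The two timestamps are made-up values standing in for what get_first_capture and get_latest_capture might return.

```python
from datetime import datetime, timedelta

WAYBACK_TS_FORMAT = "%Y%m%d%H%M%S"  # 14-digit Wayback timestamp


def parse_wayback_timestamp(ts: str) -> datetime:
    """Parse a 14-digit Wayback timestamp into a datetime."""
    return datetime.strptime(ts, WAYBACK_TS_FORMAT)


def capture_span(first_ts: str, latest_ts: str) -> timedelta:
    """Time elapsed between the earliest and the most recent capture."""
    return parse_wayback_timestamp(latest_ts) - parse_wayback_timestamp(first_ts)


# Hypothetical timestamps, for illustration only.
span = capture_span("20150314120000", "20180314120000")
print(f"Archived over a span of {span.days} days")
```

Keep in mind that the span describes when the archive captured the page, which is not necessarily when the page was first published or last changed.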

Wayback MCP use cases

01

Tracking a Journalist's Claim

A journalist needs proof that a rival company made a claim in 2017. They ask their agent to check the URL, using get_captures_by_year and then get_first_capture. The MCP reports all available snapshots from 2017, allowing them to pinpoint the exact date and status code of the original post.

02

Legal Discovery for a Breach

A compliance officer needs evidence that a specific policy was visible on a website in late 2021. They use get_captures_by_status to filter out error pages and then check the resulting records to confirm the presence of the required text block.

03

Developer Comparing Design Changes

A developer wants to see how a site's structure changed over time. They use get_subdomain_captures first, then run get_captures_by_mime_type for CSS files across multiple years to analyze the evolution of stylesheets.

04

Academic Research on Web Trends

A historian wants to study how a particular industry presented itself online over 20 years. They use get_capture_count and get_captures_by_year repeatedly across different domains to quantify the change in web presence.
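For the research use case, quantifying web presence can be as simple as counting captures per calendar year. The sketch below assumes you already have a list of 14-digit Wayback timestamps collected from the tools; `captures_per_year` and the sample values are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable


def captures_per_year(timestamps: Iterable[str]) -> Dict[str, int]:
    """Count captures by calendar year (the first four digits of each timestamp)."""
    counts = Counter(ts[:4] for ts in timestamps)
    return dict(sorted(counts.items()))


# Hypothetical timestamps, for illustration only.
sample_timestamps = [
    "20030101000000",
    "20030615093000",
    "20100101000000",
    "20230101000000",
]
yearly_counts = captures_per_year(sample_timestamps)
```

Capture counts reflect how often the archive crawled a site as well as how active the site was, so they are better read as a rough proxy than as a direct measure.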

Wayback MCP tradeoffs

What to watch out for, and the recommended way to handle each one.

Guessing content dates

Avoid

Manually searching Google or relying on a single Wayback Machine interface view, which only gives you an estimate and doesn't provide metadata like status codes.

Instead

Use the specific tools. To check for any available date, use check_availability. If you need to analyze every year between 2015 and 2017, call get_captures_by_year for each one.

Missing the scope

Avoid

Thinking a URL's history is limited to just that page. You might miss content from related subdomains or different file types.

Instead

First, use get_subdomain_captures to map out the whole domain. Then, check all found URLs using get_captures_by_mime_type for a complete picture.

Ignoring status codes

Avoid

Assuming that if content was archived, it must have been functional (HTTP 200). You might miss when the site was inaccessible.

Instead

Always run get_captures_by_status alongside your date checks. This reveals exactly when a page returned an error code like 404 or 500.
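A minimal sketch of the status-code check described above: given capture records, list the moments when the page returned an error. The `timestamp` and `statuscode` keys follow the public CDX API's field names, which is an assumption about the tool's output; `error_captures` and the sample records are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple


def error_captures(records: Iterable[Dict[str, str]]) -> List[Tuple[str, int]]:
    """Return (timestamp, status) pairs for captures with a 4xx or 5xx status."""
    errors: List[Tuple[str, int]] = []
    for record in records:
        status = record.get("statuscode", "")
        # Skip records whose status is missing or a non-numeric placeholder.
        if status.isdigit() and int(status) >= 400:
            errors.append((record["timestamp"], int(status)))
    return sorted(errors)


# Hypothetical records, for illustration only.
sample_records = [
    {"timestamp": "20170301000000", "statuscode": "200"},
    {"timestamp": "20170915000000", "statuscode": "503"},
    {"timestamp": "20170601000000", "statuscode": "404"},
    {"timestamp": "20171201000000", "statuscode": "-"},
]
outages = error_captures(sample_records)
```

Sorting by timestamp puts the errors in chronological order, so gaps in availability are easy to spot next to your date checks.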

When to use Wayback MCP

Use this MCP when your goal is historical content verification, tracking how a site evolved, or establishing a timeline of online presence. If you need to prove when something was published or visible, this is the right tool. Don't use it if you only want a page's current status; a standard web request handles that. Reach for its specialized tools, such as get_first_capture or get_captures_by_mime_type, when you need granular data points like capture dates and file types. If your question is purely about the present (e.g., 'Is the current site better than a competitor's?'), standard web scraping is enough; if you need to compare historical versions of sites, you need this MCP.

Frequently asked questions about Wayback MCP

How do I check if a URL was ever on the Internet Archive Wayback using the Internet Archive Wayback MCP? +

You run check_availability with the target URL. This tool immediately tells you if the page has been archived and provides the timestamp of the closest preserved version.

Can I find out when a website was first online using get_first_capture? +

Yes, running get_first_capture gives you the initial metadata for the earliest snapshot available. It includes the timestamp and status code of that very first recorded version.

How do I analyze a domain's full history using get_subdomain_captures? +

Use get_subdomain_captures with the root domain. This tool discovers and lists all associated subdomains that have been captured, letting you map out the entire corporate footprint.

What is the best way to filter for images in a specific year? +

You combine two tools: first, use get_captures_by_year to narrow down the date range. Then, refine that list using get_captures_by_mime_type and specify 'image/jpeg' or similar.

Do I need an API key for Internet Archive Wayback MCP? +

No. This connection is free and public, meaning you don't have to worry about managing credentials; just connect via your preferred AI client.
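To illustrate the first answer above, here is a sketch of reading a "closest snapshot" result. It uses the response shape of the public Wayback availability API (an `archived_snapshots` object with an optional `closest` entry); whether check_availability returns the same structure is an assumption, and `closest_snapshot_timestamp` and both sample payloads are hypothetical.

```python
import json
from typing import Optional


def closest_snapshot_timestamp(payload: str) -> Optional[str]:
    """Return the closest snapshot's timestamp, or None if the URL was never archived."""
    data = json.loads(payload)
    closest = data.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest["timestamp"]
    return None


# Hypothetical responses, shaped like the public availability API's JSON.
archived_payload = json.dumps({
    "archived_snapshots": {
        "closest": {
            "available": True,
            "url": "http://web.archive.org/web/20060101064348/http://example.com/",
            "timestamp": "20060101064348",
            "status": "200",
        }
    }
})
missing_payload = json.dumps({"archived_snapshots": {}})

found = closest_snapshot_timestamp(archived_payload)
not_found = closest_snapshot_timestamp(missing_payload)
```

Returning `None` for an unarchived URL keeps the "never archived" case distinct from an archived page with an error status.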
