🏛️ Wayback & CommonCrawl Recon

Query Wayback Machine & CommonCrawl archives to uncover forgotten endpoints, exposed configs, API keys, and shadow infrastructure.

100% client-side — queries go directly from your browser to archive APIs · Last updated February 10, 2026

How does archived endpoint reconnaissance uncover hidden attack surface?

Max Intel's Wayback & CommonCrawl Recon queries the Internet Archive's Wayback Machine CDX API and the CommonCrawl Index API to retrieve every archived URL recorded for a target domain. Web crawlers capture publicly accessible paths — including configuration files, admin panels, API documentation, and database backups that were briefly exposed before being removed. According to the OWASP Testing Guide v4.2, passive reconnaissance using web archives is a foundational step in application security testing because it reveals the historical attack surface that active scanning misses.
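A CDX query of the kind described above can be sketched as a URL builder. buildCdxQuery is a hypothetical helper, not part of Max Intel, but the parameters it sets (url with a trailing wildcard, output, fl, collapse, from, to) are documented CDX Server API options.

```typescript
// Sketch: build a Wayback Machine CDX API query for a target domain.
// buildCdxQuery is an illustrative helper; the query parameters are
// real CDX Server API options.
function buildCdxQuery(domain: string, fromYear?: string, toYear?: string): string {
  const params = new URLSearchParams({
    url: `${domain}/*`, // trailing wildcard = prefix match on all paths
    output: "json",     // JSON rows instead of plain text
    fl: "timestamp,original,statuscode,mimetype", // fields to return
    collapse: "urlkey", // one row per unique URL key
  });
  if (fromYear) params.set("from", fromYear);
  if (toYear) params.set("to", toYear);
  return `https://web.archive.org/cdx/search/cdx?${params.toString()}`;
}

// Example: fetch(buildCdxQuery("example.com", "2015", "2024"))
```

The response's first JSON row is a header naming the requested fields; every following row is one archived capture.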

Why do archived URLs matter for security assessments?

A file removed from a live server may still be accessible at a different path, cached by a CDN, or retrievable from the archive itself. The SANS Institute penetration testing methodology recommends web archive enumeration as a standard reconnaissance technique because it reveals:

- endpoints that existed during development but were removed before production,
- configuration files that leaked credentials or internal hostnames,
- API versions that were deprecated but never fully decommissioned, and
- backup files that contain source code or database schemas.

A 2024 HackerOne report found that approximately 18% of valid bug bounty submissions involved endpoints discovered through passive reconnaissance rather than active scanning — many of which were found in web archives.

What is the difference between Wayback Machine and CommonCrawl?

The Wayback Machine (operated by the Internet Archive since 2001) provides the largest single web archive, with over 890 billion archived pages. Its CDX API supports time-range filtering and returns HTTP status codes alongside URLs. CommonCrawl is a separate non-profit that publishes monthly web crawls as open datasets. Querying both sources provides broader coverage — CommonCrawl sometimes captures URLs that the Wayback Machine missed, and vice versa. Max Intel deduplicates results across both sources and marks URLs found in both as higher-confidence findings.
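The cross-source deduplication described above can be sketched as follows. The ArchiveHit and Finding shapes and the normalization rules are assumptions for illustration, not Max Intel's actual data model.

```typescript
// Sketch: merge results from both archives, dedupe by normalized URL,
// and mark URLs seen in both sources as higher-confidence findings.
// Field names and normalization rules are illustrative assumptions.
interface ArchiveHit { url: string; source: "wayback" | "commoncrawl"; }
interface Finding { url: string; sources: Set<string>; highConfidence: boolean; }

function dedupe(hits: ArchiveHit[]): Finding[] {
  const byUrl = new Map<string, Finding>();
  for (const hit of hits) {
    // Normalize: URL parsing lower-cases the host; we also drop the
    // fragment and any trailing slash before comparing.
    const u = new URL(hit.url);
    u.hash = "";
    const key = u.toString().replace(/\/$/, "");
    const f = byUrl.get(key) ?? { url: key, sources: new Set<string>(), highConfidence: false };
    f.sources.add(hit.source);
    f.highConfidence = f.sources.size > 1; // found in both archives
    byUrl.set(key, f);
  }
  return [...byUrl.values()];
}
```

A URL that survives normalization in both archives is unlikely to be a crawler artifact, which is why agreement between independent sources is treated as a confidence signal.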

Wayback Machine CDX API
The programmatic interface to the Internet Archive's URL index, returning archived URLs with timestamps, HTTP status codes, and MIME types for any domain. Supports wildcard subdomains, date range filtering, and result collapsing by URL key.
CommonCrawl Index
A publicly accessible index of URLs captured during CommonCrawl's monthly web crawls. Each crawl index covers approximately 3–4 billion pages and can be queried via a REST API that returns URL, status, MIME type, and crawl metadata.
Attack Surface Mapping
The process of identifying all externally accessible endpoints, services, and data stores associated with an organization. Archived endpoint discovery contributes to attack surface mapping by revealing paths that no longer appear in DNS records, sitemaps, or live crawls.
High-Value Target
An archived URL classified as containing potentially sensitive information — such as .env files, database credentials, API documentation (Swagger/OpenAPI), version control metadata (.git/config), or server configuration files that may reveal internal architecture.
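The CommonCrawl Index described above can be queried per crawl. buildCcQuery below is a hypothetical helper; the CC-MAIN-YYYY-WW index naming convention is real, and the list of available crawls is published at index.commoncrawl.org/collinfo.json.

```typescript
// Sketch: build a CommonCrawl Index API query for one monthly crawl.
// buildCcQuery is an illustrative helper; crawl IDs follow the real
// CC-MAIN-YYYY-WW convention (see index.commoncrawl.org/collinfo.json).
function buildCcQuery(crawlId: string, domain: string): string {
  const params = new URLSearchParams({
    url: `${domain}/*`, // trailing wildcard = prefix match on the domain
    output: "json",     // one JSON object per line
  });
  return `https://index.commoncrawl.org/${crawlId}-index?${params.toString()}`;
}

// Example: fetch(buildCcQuery("CC-MAIN-2024-10", "example.com"))
// Each response line carries url, status, mime, and crawl metadata.
```

Because each index covers only one crawl, broad coverage means iterating over several recent crawl IDs and merging the results.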

🏛️ Wayback & CommonCrawl Recon — Frequently Asked Questions

How does Wayback Machine and CommonCrawl reconnaissance work?

Max Intel queries the Wayback Machine CDX API and CommonCrawl index API directly from your browser. Both APIs return lists of URLs that web crawlers have archived for a given domain. The tool deduplicates results across both sources, classifies each URL by type (config files, JS, admin panels, API endpoints, backups), and flags high-value targets like .env files, credentials, and database dumps.
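The type classification step can be sketched with path patterns like the ones below. These buckets mirror the categories named above, but the specific regexes are illustrative assumptions, not Max Intel's actual rule set.

```typescript
// Sketch: classify an archived URL into the type buckets the text
// describes (config, js, admin, api, backup, other). The patterns
// are illustrative examples, not an exhaustive or official rule set.
type UrlType = "config" | "js" | "admin" | "api" | "backup" | "other";

function classifyUrl(url: string): UrlType {
  const path = new URL(url).pathname.toLowerCase();
  if (/\.(env|ini|cfg|conf|ya?ml|config)$/.test(path)) return "config";
  if (/\.(js|mjs)$/.test(path)) return "js";
  if (/\/(admin|wp-admin|administrator|login)(\/|$)/.test(path)) return "admin";
  if (/\/(api|v\d+|graphql|swagger|openapi)(\/|\.|$)/.test(path)) return "api";
  if (/\.(sql|bak|zip|tar\.gz|old|backup)(\.gz)?$/.test(path)) return "backup";
  return "other";
}
```

Checks run from most to least specific, so a config file under an admin path (e.g. /admin/settings.yml) is reported as config rather than admin.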

What types of sensitive files can archived endpoint discovery find?

Common high-value findings include exposed .env files with API keys, wp-config.php with database credentials, .git/config revealing repository structure, Swagger/OpenAPI documentation exposing internal APIs, database backups (.sql, .sql.gz), server configuration files (php.ini, web.config), and admin panel login pages. Even if these files have since been removed from the live site, the archived URLs confirm they once existed.
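Flagging those high-value paths can be sketched as a pattern list. The regexes below mirror the file types named above; they are examples, not a complete detection set.

```typescript
// Sketch: flag high-value archived URLs. These patterns mirror the
// sensitive file types listed in the text; they are examples only.
const HIGH_VALUE_PATTERNS: RegExp[] = [
  /\/\.env($|\.)/,              // exposed environment files (.env, .env.bak)
  /\/wp-config\.php/,           // WordPress database credentials
  /\/\.git\/config/,            // version control metadata
  /\/(swagger|openapi)[^/]*\.(json|ya?ml)$/, // API documentation
  /\.sql(\.gz)?$/,              // database dumps
  /\/(php\.ini|web\.config)$/,  // server configuration files
];

const isHighValue = (url: string): boolean =>
  HIGH_VALUE_PATTERNS.some(p => p.test(new URL(url).pathname.toLowerCase()));
```

Matching against the parsed pathname rather than the raw URL avoids false positives from query strings that merely mention a sensitive filename.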

Is Wayback Machine OSINT reconnaissance legal?

Querying the Wayback Machine and CommonCrawl APIs for publicly archived URLs is legal — these are public data sources that archive the open web. However, using discovered endpoints to access live systems without authorization would violate computer fraud laws. This tool is intended for authorized security testing, bug bounty programs, and attack surface assessment of domains you own or have permission to test.