Python App #08

Web Scraper

Web Scraper performs comprehensive domain research by combining WHOIS lookups with RDAP fallback for maximum coverage. Discover registrar details, nameservers, creation and expiration dates, registrant information (when available), and domain status codes. Optionally integrate with VirusTotal (API key) for reputation scoring and malware detection. The tool also generates direct links to additional domain intelligence services.

whoisrdapvirustotalregistrar

Last updated:

Features

  • WHOIS lookup with parsed results
  • RDAP fallback for additional data
  • Registrar identification
  • Nameserver discovery
  • Domain creation/expiration dates
  • Registrant info (when public)
  • Domain status codes
  • Optional VirusTotal API integration
  • Reputation and malware scoring
  • HTML session export

Quick Start

# 1. Extract the zip and enter the folder
cd 08-Domain-Intel

# 2. Install dependencies
python install_requirements.py

# 3. Launch the tool
python web_scraper.py

Download

Tool: Web Scraper
File: 08-Domain-Intel.zip
Python: 3.8+
OS: Windows, macOS, Linux
Dependencies: customtkinter, requests, python-whois

Downloads are free — clicking the link below opens a short ad that helps support Max Intel. Thank you! Wait 15 seconds after clicking to unlock.

Click here for an ad
15
seconds remaining…
✓ Unlocked — thank you!
← All Python OSINT Apps

How Is Web Scraping Used for OSINT?

Web scraping — programmatically extracting data from websites — is a core OSINT capability. This tool analyzes pages to extract links, emails, technology fingerprints, forms, social media links, and headers. According to SANS SEC487, automated scraping is essential when investigating targets with large digital footprints.

Technology Fingerprinting

Identifying a website's tech stack reveals potential vulnerabilities, per the OWASP Testing Guide v4.2 (Section 4.1). According to Wappalyzer (2024), 43% of websites run WordPress, making WordPress-specific vulnerability knowledge applicable to nearly half the web. The tool also detects CDN providers, analytics tools, and third-party integrations.

Email and Social Extraction

The tool parses source code to extract email addresses (matching RFC 5322 format), social media URLs, and contact information — enabling pivots to Email Lookup, Username Search, and Social Media Search.

🔒 Web Scraper — Frequently Asked Questions

What can the web scraper extract?

The tool extracts page text, links, images, meta tags, headers, and structured data from any URL. It supports CSS selector targeting for specific elements and can follow pagination for multi-page extraction.

Does the web scraping tool require installation?

No. The tool runs entirely in your browser using client-side Python via Pyodide. No installation, API keys, or server-side processing required. Your queries are executed locally for maximum privacy.

What can I do with the web scraping tool?

You can extract structured data, links, and content from web pages. The tool provides a clean interface with exportable results and cross-links to related Max Intel tools for deeper investigation.