Extract structured data from any web page

Data Dumper is a browser extension for scraping, OSINT reconnaissance, and evidence collection. Open source. Available for Chrome and Firefox.

Scraping Features

Two modes of operation. Full Page Scan extracts everything automatically. Manual Selection gives you precise control over what gets captured.

Full Page Scan

One-click extraction of all entities, links, tables, and images on the page. No element picking or configuration needed.

Manual Element Selection

Click elements directly on the page to define data columns. Rename, reorder, or disable columns before scraping.

Pagination

Pick the "Next" button and Data Dumper navigates through pages automatically, collecting and deduplicating results.
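The deduplication step can be sketched as follows. This is a minimal illustration, not Data Dumper's actual code: the Row shape and the rowKey and mergePages helpers are hypothetical names, and it assumes two rows are duplicates when every column matches after trimming whitespace.

```typescript
// Hypothetical row shape: column name -> cell text.
type Row = Record<string, string>;

// Build a stable key from a row. Column names are sorted so that
// property order does not affect equality.
function rowKey(row: Row): string {
  const entries = Object.keys(row)
    .sort()
    .map((column) => [column, (row[column] ?? "").trim()]);
  return JSON.stringify(entries);
}

// Merge the rows collected from each page, keeping the first
// occurrence of every distinct row.
function mergePages(pages: Row[][]): Row[] {
  const seen = new Set<string>();
  const merged: Row[] = [];
  for (const page of pages) {
    for (const row of page) {
      const key = rowKey(row);
      if (!seen.has(key)) {
        seen.add(key);
        merged.push(row);
      }
    }
  }
  return merged;
}
```

Keying on the full row is the most conservative choice. A scraper could instead key on a single identifying column, such as a URL, if one exists.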

Infinite Scroll

Scrolls to the bottom of the page repeatedly, waits for new content to load, and collects everything it finds.

Dynamic Content

Waits for JavaScript-rendered content to settle before extracting. Uses mutation observation to detect when the page is stable.

Evidence Chain

Attaches the source URL, capture timestamp, SHA-256 hash, and user agent to every export, providing a basic chain of custody for collected data.
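A rough sketch of what such an evidence record might look like and how its hash could be checked later. The EvidenceRecord field names and the buildEvidence and verifyEvidence helpers are assumptions for illustration, not the extension's real schema. The sketch uses Node's crypto module so it runs standalone; inside a browser extension the Web Crypto API (crypto.subtle.digest) would be the equivalent.

```typescript
import { createHash } from "node:crypto";

// Hypothetical evidence record; field names are illustrative only.
interface EvidenceRecord {
  sourceUrl: string;
  capturedAt: string; // ISO 8601 timestamp
  sha256: string; // hex digest of the exported payload
  userAgent: string;
}

// Hex-encoded SHA-256 digest of a UTF-8 string.
function sha256Hex(payload: string): string {
  return createHash("sha256").update(payload, "utf8").digest("hex");
}

// Build the evidence record for an exported payload.
function buildEvidence(
  payload: string,
  sourceUrl: string,
  userAgent: string,
  now: Date = new Date(),
): EvidenceRecord {
  return {
    sourceUrl,
    capturedAt: now.toISOString(),
    sha256: sha256Hex(payload),
    userAgent,
  };
}

// Recompute the hash to confirm the payload has not changed since capture.
function verifyEvidence(payload: string, evidence: EvidenceRecord): boolean {
  return sha256Hex(payload) === evidence.sha256;
}
```

Anyone holding the exported file and its evidence record can rerun the hash to confirm the data was not altered after collection.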

OSINT Tools

Standalone tools for reconnaissance and data collection, independent of the scraper.

Entity Extraction

Regex-based scanning for emails, phone numbers, URLs, social handles, IP addresses, and cryptocurrency wallets. Includes false-positive filtering.
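To illustrate regex scanning with false-positive filtering, here is a simplified sketch covering only emails and IPv4 addresses. The patterns, the asset-extension list, and the extractEntities helper are assumptions made for this example; the extension's own patterns and filters may differ.

```typescript
type EntityKind = "email" | "ipv4";

interface Entity {
  kind: EntityKind;
  value: string;
}

// Deliberately simple patterns; real-world matching needs more care.
const EMAIL_RE = /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g;
const IPV4_RE = /\b(?:\d{1,3}\.){3}\d{1,3}\b/g;

// Retina-style asset names such as "logo@2x.png" look like emails.
const ASSET_EXTENSIONS = [".png", ".jpg", ".jpeg", ".gif", ".svg", ".webp"];

function isPlausibleEmail(candidate: string): boolean {
  const lower = candidate.toLowerCase();
  return !ASSET_EXTENSIONS.some((ext) => lower.endsWith(ext));
}

// Reject dotted quads whose octets exceed 255.
function isValidIpv4(candidate: string): boolean {
  return candidate.split(".").every((octet) => Number(octet) <= 255);
}

// Scan text, filter likely false positives, and drop repeated matches.
function extractEntities(text: string): Entity[] {
  const found = new Map<string, Entity>();
  for (const match of text.match(EMAIL_RE) ?? []) {
    if (isPlausibleEmail(match)) {
      found.set(`email:${match.toLowerCase()}`, { kind: "email", value: match });
    }
  }
  for (const match of text.match(IPV4_RE) ?? []) {
    if (isValidIpv4(match)) {
      found.set(`ipv4:${match}`, { kind: "ipv4", value: match });
    }
  }
  return Array.from(found.values());
}
```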

Link Harvester

Collects all links on the page and classifies them as internal, external, mailto, or tel. Results are sorted by domain for quick analysis.
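One way the classification could work is sketched below. It assumes "internal" means the link's hostname matches the page's hostname, which is a guess at the rule rather than a confirmed detail; classifyLink and harvestLinks are hypothetical names. Links with other schemes, such as javascript:, are skipped.

```typescript
type LinkKind = "internal" | "external" | "mailto" | "tel";

interface ClassifiedLink {
  href: string;
  kind: LinkKind;
  domain: string; // empty for mailto and tel links
}

// Resolve an href against the page URL and classify it.
// Returns null for unparseable or unsupported links.
function classifyLink(href: string, pageUrl: string): ClassifiedLink | null {
  let url: URL;
  try {
    url = new URL(href, pageUrl);
  } catch {
    return null;
  }
  if (url.protocol === "mailto:") {
    return { href: url.href, kind: "mailto", domain: "" };
  }
  if (url.protocol === "tel:") {
    return { href: url.href, kind: "tel", domain: "" };
  }
  if (url.protocol !== "http:" && url.protocol !== "https:") {
    return null;
  }
  // Assumption: same hostname as the page means internal.
  const kind: LinkKind =
    url.hostname === new URL(pageUrl).hostname ? "internal" : "external";
  return { href: url.href, kind, domain: url.hostname };
}

// Classify every href and sort the result by domain.
function harvestLinks(hrefs: string[], pageUrl: string): ClassifiedLink[] {
  const links: ClassifiedLink[] = [];
  for (const href of hrefs) {
    const classified = classifyLink(href, pageUrl);
    if (classified) links.push(classified);
  }
  return links.sort((a, b) => a.domain.localeCompare(b.domain));
}
```

A stricter rule could compare registrable domains instead of hostnames, so that subdomains of the same site also count as internal.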

Table Detection

Finds HTML tables on the page, reports their dimensions, and extracts any of them into structured rows with one click.

Image Extraction

Pulls image URLs, alt text, and dimensions. Filter by minimum size or require alt text to narrow results.
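The filtering described above might look something like this. The ImageInfo and ImageFilter shapes and the filterImages helper are hypothetical, and the sketch assumes the minimum size applies to width and height independently.

```typescript
// Hypothetical record for one extracted image.
interface ImageInfo {
  src: string;
  alt: string;
  width: number;
  height: number;
}

// Hypothetical filter options; every field is optional.
interface ImageFilter {
  minWidth?: number;
  minHeight?: number;
  requireAlt?: boolean;
}

// Keep images that meet the size thresholds and, if requested,
// have non-empty alt text.
function filterImages(images: ImageInfo[], filter: ImageFilter): ImageInfo[] {
  const { minWidth = 0, minHeight = 0, requireAlt = false } = filter;
  return images.filter(
    (img) =>
      img.width >= minWidth &&
      img.height >= minHeight &&
      (!requireAlt || img.alt.trim() !== ""),
  );
}
```

A minimum size is a quick way to exclude tracking pixels and small icons from the results.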

Page Snapshot

Captures the full page HTML and a screenshot of the visible area. The snapshot is timestamped and hashed with SHA-256 for integrity verification.

Export Formats

Export collected data as CSV, Excel, XML, or JSON, or copy it to the clipboard.

CSV

Comma-separated values with proper quoting and escaping.
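For reference, quoting and escaping in the common RFC 4180 style can be sketched as below. The escapeCsvField and toCsv helpers are illustrative names, and the choice of CRLF line endings is an assumption rather than a statement about the extension's output.

```typescript
// Quote a field only when it contains a comma, quote, or line break.
// Embedded double quotes are escaped by doubling them.
function escapeCsvField(value: string): string {
  if (/[",\r\n]/.test(value)) {
    return `"${value.replace(/"/g, '""')}"`;
  }
  return value;
}

// Serialize a header row and data rows into CSV text.
function toCsv(headers: string[], rows: string[][]): string {
  return [headers, ...rows]
    .map((row) => row.map(escapeCsvField).join(","))
    .join("\r\n");
}
```

For example, the cell `said "hi", left` is written as `"said ""hi"", left"`, so a spreadsheet reads it back as a single field.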

Excel

XML Spreadsheet format compatible with Excel and LibreOffice. Includes an Evidence sheet when evidence metadata is enabled.

XML

Structured XML with row elements and tagged fields.
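A sketch of how rows could be serialized this way. The rows and row element names, the rule for turning column names into tag names, and the toXml helper are all assumptions for illustration; the extension's actual output layout is not specified here.

```typescript
// Escape the five characters that are special in XML text and attributes.
function escapeXml(text: string): string {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Turn a column name into a safe element name: replace disallowed
// characters with "_" and make sure it starts with a letter or "_".
function toTagName(column: string): string {
  const cleaned = column.trim().replace(/[^A-Za-z0-9_.-]/g, "_");
  return /^[A-Za-z_]/.test(cleaned) ? cleaned : `_${cleaned}`;
}

// Serialize rows as <row> elements with one tagged field per column.
function toXml(rows: Record<string, string>[]): string {
  const body = rows
    .map((row) => {
      const fields = Object.keys(row)
        .map((column) => {
          const tag = toTagName(column);
          return `    <${tag}>${escapeXml(row[column] ?? "")}</${tag}>`;
        })
        .join("\n");
      return `  <row>\n${fields}\n  </row>`;
    })
    .join("\n");
  return `<?xml version="1.0" encoding="UTF-8"?>\n<rows>\n${body}\n</rows>`;
}
```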

JSON

Raw JSON with evidence metadata. Full scans export all categories as separate keys.

Clipboard

Tab-separated text for pasting directly into spreadsheets.

Installation

Load the extension directly from the source code. No store listing required.

Chrome

  1. Navigate to chrome://extensions
  2. Toggle Developer mode on
  3. Click Load unpacked
  4. Select the chrome/ directory

Firefox

  1. Navigate to about:debugging#/runtime/this-firefox
  2. Click Load Temporary Add-on
  3. Select firefox/manifest.json