WebScraper uses the Integrity v8 engine to quickly scan a website, and can output extracted data as CSV or JSON. Plus download images to a folder.
Easy to scan a site - just enter the starting URL and press "Go"
Easy to export - choose the columns you want
Plenty of extraction options, including HTML elements with certain classes or IDs, regular expressions, or entire content in a number of formats (html, plain text, markdown)
'helper' utilities within the app make it easy to find a suitable class / id or produce a regular expression (regex) to extract the data you want
Since v4.1 can download to a folder all images discovered
Configuration of various limits on the crawl and the output file size
What’s new in version 4.15.5
Updated on Oct 11 2022
Important fix for those wanting to output json. backslashes in the data weren't being escaped in the output, potentially leading to invalid json.