About
WebScraper uses the Integrity engine to quickly crawl a website and can output the scraped data (currently) in CSV or JSON format. It can also download images to a folder.
Collect data or archive content from a website.
Features
• Fast and easy site scanning and scraping
• Can use a different IP address, user agent, etc. for each request through the ProxyCrawl service
• Native macOS app running on your desktop
• Many ways to retrieve data: various metadata, content (as text, HTML or Markdown), elements with specific classes/identifiers, regular expressions
• Easy to export data - select the columns you need
• Data output in CSV or JSON format
• Ability to download all images into a folder / collect and export all links
• Ability to output a single text file (designed for archiving text content, as Markdown or plain text)
• Easy setup to extract email addresses from a website
• Lots of options/settings
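
To make the extraction and export features above more concrete, here is a minimal Python sketch of the general idea: pull email addresses out of page text with a regular expression, then write only the selected columns as CSV or JSON. This is not WebScraper's own code (WebScraper is a desktop app configured through its interface). The sample pages, the function names and the simplified email pattern are all assumptions made for illustration.

```python
import csv
import io
import json
import re

# Simplified email pattern, for illustration only (an assumption, not WebScraper's rule).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(text):
    """Return unique email-like strings in order of first appearance."""
    seen = []
    for match in EMAIL_RE.findall(text):
        if match not in seen:
            seen.append(match)
    return seen


def to_csv(rows, columns):
    """Serialize only the chosen columns as CSV text."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=columns, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


def to_json(rows, columns):
    """Serialize only the chosen columns as a JSON array of objects."""
    return json.dumps([{c: row.get(c, "") for c in columns} for row in rows], indent=2)


# Hypothetical scraped pages; a real crawl would supply these.
pages = [
    {"url": "https://example.com/", "title": "Home",
     "body": "Contact sales@example.com or sales@example.com."},
    {"url": "https://example.com/about", "title": "About",
     "body": "Write to info@example.com."},
]

rows = [
    {"url": p["url"], "title": p["title"],
     "emails": ";".join(extract_emails(p["body"]))}
    for p in pages
]

# Export only the columns we care about, leaving "title" out.
csv_text = to_csv(rows, ["url", "emails"])
json_text = to_json(rows, ["url", "emails"])
print(csv_text)
print(json_text)
```

Choosing the column list at export time, rather than at extraction time, mirrors the "select the columns you need" step: the same collected rows can be written out in either format without crawling again.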