Categories
Challenge Featured Review SaaS

Data collectors to scrape tough websites

Recently we encountered a new powerful scraping service called Data Collector [of Bright Data]. The life-test and thorough drill-in are coming soon. Yet now we want to highlight it main features that has badly (in positive sense, strongly) impressed us.

Hundreds of free pre-made agents to gather data of top scrape targets

Data Collector in its nature is a scraping agent that is developed (already!) for a specific task. So, zero-coding-level individuals are welcome to use it. Take a look at the shot of the pre-made free to use data collectors. Eg.:
eCommerce category:

Social media category:

Request for a new collector to be coded or develop it yourself

Besides using premade scraping agents/collectors one might (1) request a custom collector or (2) develop your own (with JavaScript) inside a friendly coding environment (IDE) or (3) edit a pre-made agent to tailor it to own needs.
Take a look at the IDE, the page interaction code being separated from the page parsing code:

Note: the cost of ordering a collector (along with its maintenance) is USD $150.

Unblocking tough sites

How to unblock tough sites as business directories and CloudFlare protected ones? The Data Collector utilizes a huge residential and data center proxy network provided by Bright Data, formerly Luminati. No need therefore to pay for any extra proxy services.

Data delivery & integration

The Bright Data offers various ways to deliver and integrate extracted data.

  • SFTP
  • Email
  • Amazon S3
  • Google Cloud Storage
  • Microsoft Azure Storage
  • API download
  • Webhook

Besides, one may get data not just (1) at a job completion yet also (2) in real time with a single request.

Data retention

The scraped data retention is 1 week only.

Cost

The service provides quite budget pricing. The max cost is USD $5 for collection of 1000 successful pages.

Conclusion

The Data Collector service by Bright Data seems to meet the present web scraping challenges (business directories scrape, data integration) while keeping a moderate pricing and providing a big heap of pre-made scrape agents.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.