Tag: web scraping

Mozenda web scraping and publishing of data to cloud storage

Post author By admin
Post date February 7, 2017
No Comments on Mozenda web scraping and publishing of data to cloud storage

Mozenda is a cloud web scraping service (SaaS), and we’ve already reviewed it. Since our last review, Mozenda has provided more useful utility features for data extraction. Besides multi-threaded extraction & smart data aggregation, Mozenda allows users to publish extracted data to cloud storage such as Dropbox, Amazon, and Microsoft Azure. In this post we will try to explain the new Mozenda extraction and integration capabilities.

Tags service, web scraping

Miscellaneous

Octoparse review

Octoparse is a new modern visual web data extraction software. It provides users a point-&-click UI to develop extraction patterns, so that scrapers can apply these patterns to structured websites. Both experienced and inexperienced users find it easy to use Octoparse to bulk extract information from websites – for most of scraping tasks no coding needed!

Tags free, Octoparse, scraping tool, service, web scraping

Miscellaneous

Data Scraping Studio review

Post author By admin
Post date April 9, 2016
1 Comment on Data Scraping Studio review

Data Scraping Studio (DSS) is a new free, multi-threading studio for effective data extraction. It consists of two parts: (1) the Google Chrome extension with point-&-click interface to setup a web scraping agent and (2) the Desktop app for executing scraping agents.

Tags plugin, web scraping

Development Web Scraping Software

The worthy alternative to dissolving scraping Kimono API

Post author By admin
Post date February 26, 2016
2 Comments on The worthy alternative to dissolving scraping Kimono API

Recently I got notified of Kimono service finishing its work due to kimono team being joining another project. So many data hunters who were using this prominent free API service are now in search for a good alternative.

Tags scraping tool, Sequentum, service, web scraping

Development Web Scraping Software

Dexi.io REST API in php (example)

Post author By admin
Post date October 21, 2015
No Comments on Dexi.io REST API in php (example)

In this post, I’d like to demonstrate how to leverage the Dexi.io (CloudScrape) API along with its PHP Client library (also avail in Ruby and C#).

Tags structured APIs, web scraping

Miscellaneous

Content Grabber self-contained (standalone) agent

Post author By admin
Post date October 21, 2015
No Comments on Content Grabber self-contained (standalone) agent

As web scraping is becoming easier to use, more and more people are able to leverage the world’s web resources. As this trend grows, structured data from the web empower businesses and enable a wave of new business ideas to become a reality. Now there is a new technology on the market called: “self-contained agents” that might just make this a tsunami!

Tags Sequentum, web scraping

Development

Extract browser’s Local Storage with Python

Post author By admin
Post date October 14, 2015
5 Comments on Extract browser’s Local Storage with Python

Some of you may be wondering if it’s possible to extract a web browser’s local storage by web scraping?

Tags Python, web scraping

Development Web Scraping Software

Content Grabber with free proxy account integration for business directories scrape

Post author By admin
Post date September 3, 2015
No Comments on Content Grabber with free proxy account integration for business directories scrape

Professional data extraction requires adequate proxying to keep anonymity of scraping robots. When attempting to extract large data sets (over 1M records, ex. business directories) reliable and fast proxy service is needed.

Sequentum has released the Nohodo proxy service integration for Content Grabber. Nohodo provides a free account for Content Grabber users (up to 5000 requests monthly for free). The feature is available for both trial users and regular customers. Here’s how it works…

Tags free, proxy, scraping tool, Sequentum, web scraping

Featured Web Scraping Software

Dexi.io Review

dexi-medium-height-130px Dexi.io is a powerful scraping suite. This cloud scraping service provides development, hosting and scheduling tools. The suite might be compared with Mozenda for making web scraping projects and runnig them in clouds for user convenience. Yet it includes the API, each scraper being a json definition similar to other services like import.io, kimono lab and parseHub.

Tags service, web scraping

Guest posting Web Scraping Software

Turn any interactive website into an API with ParseHub

Post author By admin
Post date June 22, 2015
No Comments on Turn any interactive website into an API with ParseHub

parsehub Anyone should be able to pull data from the web and access it in the format they want. If a website does not have an API available, scraping is one of the only options to get the data you need. But figuring out how to scrape data in the complicated HTML is a pain.

ParseHub is a new web browser extension that you can use to turn any dynamic and poorly structured website into an API, without writing code. ParseHub is a scraping tool that is designed to work on websites with JavaScript and Ajax; it is similar to web scraping tools such as Import.io and Kimono Labs.

Tags scraping tool, web scraping