Tag: service

Make crawling easy with Real Time Crawler of Oxylabs.io

Post author By admin
Post date November 26, 2018
No Comments on Make crawling easy with Real Time Crawler of Oxylabs.io

Nowadays, it’s hard to imagine our life without search systems. “If you don’t know something, google it!” – is one of the most popular maxims in our life. But how many people use Google in an optimal way? A lot of developers use google commands to get needed answers as fast as it possible.

Even this is not enough today! Large and small companies need terabytes of data to make their business profitable. It’s necessary to automate the search process and make it reliable to satisfy the user with fresh news, updates or posts. In today’s article we will consider a very helpful tool – Real-Time Crawler (RTC) for the collection of fresh data. Let’s start!

Tags crawling, service, web scraping

Review

SquidProxies review

Today we want to share with you about SquidProxies. It is a service offering anonymous HTTP/HTTPS proxies.

SquidProxies offers 2 types of data-center proxy packages, private proxies and shared proxies. The proxies are designated for just about any legal use, and work great to surf to every website. The proxies’ main use are web scraping/web crawling and SEO tools.

Tags proxy, service

Web Scraping Software

Mozenda web scraping and publishing of data to cloud storage

Post author By admin
Post date February 7, 2017
No Comments on Mozenda web scraping and publishing of data to cloud storage

Mozenda is a cloud web scraping service (SaaS), and we’ve already reviewed it. Since our last review, Mozenda has provided more useful utility features for data extraction. Besides multi-threaded extraction & smart data aggregation, Mozenda allows users to publish extracted data to cloud storage such as Dropbox, Amazon, and Microsoft Azure. In this post we will try to explain the new Mozenda extraction and integration capabilities.

Tags service, web scraping

Development

2captcha service to solve reCaptcha v2.0 (python)

Post author By admin
Post date January 30, 2017
8 Comments on 2captcha service to solve reCaptcha v2.0 (python)

In this post we want to show you the code for an automatic connection to 2captcha service for solving google reCaptcha v2.0. Not long ago, google drastically complicated the user-behavior reCaptcha (v2.0). This online service provides a method for solving it.

Tags captcha, Google, Python, service

Miscellaneous

Octoparse review

Octoparse is a new modern visual web data extraction software. It provides users a point-&-click UI to develop extraction patterns, so that scrapers can apply these patterns to structured websites. Both experienced and inexperienced users find it easy to use Octoparse to bulk extract information from websites – for most of scraping tasks no coding needed!

Tags free, Octoparse, scraping tool, service, web scraping

Uncategorized

Reliable rotating proxies for business directories scrape

Post author By admin
Post date June 29, 2016
11 Comments on Reliable rotating proxies for business directories scrape

We’ve already written about suitable proxy servers for web scraping. Now we want to focus our readers on those for the huge/mass quantities data records scrape, particulary from the business directories. When scraping business directories, their web servers can identify repetitive requesting and put you on hold by looking at the IP address that is used for frequent http requests. Proxy rotation web service is the means for repeatedly changing IP address. Thus, target web server can only see the random IP addresses from rotating proxies pool at each request.

Tags business directory, proxy, service

Development Web Scraping Software

The worthy alternative to dissolving scraping Kimono API

Post author By admin
Post date February 26, 2016
2 Comments on The worthy alternative to dissolving scraping Kimono API

Recently I got notified of Kimono service finishing its work due to kimono team being joining another project. So many data hunters who were using this prominent free API service are now in search for a good alternative.

Tags scraping tool, Sequentum, service, web scraping

Data Science

Testing the Filter by TheWebMiner for advanced web content filtering

Post author By admin
Post date February 9, 2016
No Comments on Testing the Filter by TheWebMiner for advanced web content filtering

thewebminer_logo Recently I came across an interesting new tool from TheWebMiner called Filter. The Filter is an attempt by TheWebMiner to sort (categorize) indexed websites and deliver them to users as a content filtering service.

Tags crawling, service

Featured Web Scraping Software

Dexi.io Review

dexi-medium-height-130px Dexi.io is a powerful scraping suite. This cloud scraping service provides development, hosting and scheduling tools. The suite might be compared with Mozenda for making web scraping projects and runnig them in clouds for user convenience. Yet it includes the API, each scraper being a json definition similar to other services like import.io, kimono lab and parseHub.

Tags service, web scraping

Guest posting

EndCaptcha for fast CAPTCHA solving

Post author By admin
Post date July 10, 2015
1 Comment on EndCaptcha for fast CAPTCHA solving

endcaptcha From time to time, web users struggle with “CAPTCHA services” such as DeCaptcher and DBC. And although those services are reliable, often times they’re “overloaded”, meaning the images to be solved get rejected or it takes a lot of time to be decoded (some services might even take 50 seconds to solve a single image!).

But, I recently came across a new service that hopes to fill this (fast CAPTCHA solving) gap. EndCaptcha.com, is a new image digitization service that was built to satisfy the needs of the most demanding consumers. It uses a dedicated team of operators assisted by a smart OCR system. That’s why it’s being considered a Premium CAPTCHA service.

Tags captcha, service