Handy Web Extractor

Handy Web Extractor is a simple tool for everyday web content monitoring. It periodically downloads a web page, extracts the necessary content and displays it in a window on your desktop. One may consider it data extraction software that occupies its own niche among scraping software and plugins.
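To make the "download, extract, display" cycle concrete, here is a minimal Python sketch of the extraction step. It is not how Handy Web Extractor itself works internally; the class name, the `extract_text` and `fetch` helpers and the idea of selecting the content by element `id` are all assumptions made for illustration.

```python
from html.parser import HTMLParser
import urllib.request

# Tags that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ElementTextExtractor(HTMLParser):
    """Collects the text inside the first element with a given id."""

    def __init__(self, target_id):
        super().__init__()
        self.target_id = target_id
        self.depth = 0       # > 0 while the parser is inside the target element
        self.done = False    # True once the target element has been closed
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if self.done or tag in VOID_TAGS:
            return
        if self.depth > 0:
            self.depth += 1
        elif dict(attrs).get("id") == self.target_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self.depth == 0:
            return
        self.depth -= 1
        if self.depth == 0:
            self.done = True

    def handle_data(self, data):
        if self.depth > 0:
            self.chunks.append(data)


def extract_text(html, target_id):
    """Return the whitespace-normalized text of the element with target_id."""
    parser = ElementTextExtractor(target_id)
    parser.feed(html)
    return " ".join(" ".join(parser.chunks).split())


def fetch(url, timeout=10):
    """Download a page as text (hypothetical helper, assumes UTF-8 content)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")
```

A monitoring loop would call `fetch` on a timer, pass the result to `extract_text`, and refresh the display only when the extracted text differs from the previous run.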

What is Crawlera?

I came across this tool a few weeks ago and wanted to share it with you. I have not tested it myself yet, but the concept is simple: safely download web pages without the fear of overloading websites or getting banned. You write a crawler script using Scrapinghub, and they run it through their IP proxies and take care of the technical problems of crawling.

Scrape with Google Apps Script

In this post I want to let you know how I've managed to complete the challenge of scraping a site with Google Apps Script (GAS).

Test ReCaptcha 2.0 solving services

We’ve tested several captcha solving services. The test results are based on 1000 ReCaptchas 2.0 submitted to each service.

Service            Avg. solving time, s   Fastest solving time, s   Performance, %   Notes
DeathByCaptcha     41                     16                        96.8             Dec. 2019
2Captcha           63                     15                        95.2             Dec. 2019
CaptchaSolutions   111                    37                        78               Oct. 2017
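For readers who want to run a similar comparison, the three figures in the table can be derived from raw per-captcha results roughly as follows. This is a sketch in Python, not the code used for the test above. The `(solved, seconds)` record shape is an assumption, and so are the choices to average over solved captchas only and to treat "Performance" as the share of captchas solved.

```python
from statistics import mean


def summarize(results):
    """Summarize one service's run.

    results: list of (solved, seconds) tuples, one per submitted captcha.
    Assumption: timing figures are computed over solved captchas only.
    """
    if not results:
        raise ValueError("no results to summarize")
    solved_times = [seconds for solved, seconds in results if solved]
    if not solved_times:
        return {"avg_seconds": None, "fastest_seconds": None,
                "performance_pct": 0.0}
    return {
        "avg_seconds": round(mean(solved_times), 1),
        "fastest_seconds": min(solved_times),
        # Share of submitted captchas that came back solved, in percent.
        "performance_pct": round(100 * len(solved_times) / len(results), 1),
    }
```

Calling `summarize` once per service, with the same number of submissions each, gives rows that can be compared directly.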
Useful testing codes

2Captcha Test Code (Java)

Simple Java email crawler

In this post we share a simple Java email crawler that crawls an input host (website), searches for all the emails on that host and stores them.

The script uses the jsoup library; the full project can be found here.
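The core of such a crawler is small. The sketch below shows it in Python with only the standard library, rather than in Java with jsoup as in the actual project. The regular expression, the `find_emails` and `same_host` names and the choice to lowercase addresses are assumptions for illustration, and the pattern is deliberately simple rather than fully standards-compliant.

```python
import re
from urllib.parse import urlparse

# A pragmatic pattern: local part, "@", then a dotted domain ending in a TLD.
EMAIL_RE = re.compile(
    r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)*\.[A-Za-z]{2,}"
)


def find_emails(text):
    """Return the unique email addresses found in text, lowercased and sorted."""
    return sorted({match.lower() for match in EMAIL_RE.findall(text)})


def same_host(url, host):
    """True if url points at the host being crawled, so the crawl stays on-site."""
    return urlparse(url).hostname == host
```

A crawler would fetch each page, pass the page source to `find_emails`, and follow only the links for which `same_host` returns true.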

Smartproxy Review

Getting precise, localized data is becoming difficult. For some companies, advanced proxy networks are the only thing that keeps intensive data gathering operations running.

Crawling web pages with Netpeak Spider in conjunction with NetNut and GeoSurf proxies

You will agree that it is hard to overestimate the importance of information: “Master of information, master of situation”. Nowadays we have everything we need to become a “master of situation”, including tools such as spiders and parsers that can scrape various data from websites. Today we will look at scraping Amazon with a web spider equipped with proxy services.

US court stated scraping, even when against TOS, is legal

Last month a legal case took place in a US court where four professors plus a media organization sued the US Government. The District Court for the District of Columbia concluded that moderate scraping, even when against ToS, is legal.

How to insert and configure reCAPTCHA v2 code in PHP

We’ve already introduced you to the theory behind the new NO CAPTCHA reCAPTCHA v2, but now we come to the practical integration part. Here we’ll share how to insert and configure “NO CAPTCHA reCAPTCHA” into a web page.
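The server-side half of the integration comes down to one POST request to Google's `siteverify` endpoint. The sketch below shows that step in Python instead of PHP, so it is not the code from the post itself. The function names are made up for illustration, and `token` stands for the `g-recaptcha-response` value submitted with the form.

```python
import json
import urllib.parse
import urllib.request

VERIFY_URL = "https://www.google.com/recaptcha/api/siteverify"


def build_payload(secret, token, remote_ip=None):
    """Form-encode the fields that the siteverify endpoint expects."""
    fields = {"secret": secret, "response": token}
    if remote_ip:
        fields["remoteip"] = remote_ip  # optional: the end user's IP address
    return urllib.parse.urlencode(fields).encode("ascii")


def is_success(body):
    """Interpret the JSON body returned by the endpoint."""
    return json.loads(body).get("success") is True


def verify(secret, token, remote_ip=None, timeout=10):
    """Ask Google whether the token from the form is valid (network call)."""
    request = urllib.request.Request(
        VERIFY_URL, data=build_payload(secret, token, remote_ip)
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return is_success(response.read().decode("utf-8"))
```

The form handler would call `verify` with the site's secret key and reject the submission when it returns false.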

Search queries in a search engine for scraping

Recently I got a note asking about running search engine queries through web scraping software.

“I’m looking for a scraper program that can initiate search queries in a search engine automatically, using proxies would be an added benefit if possible.”  – Mike
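To show what Mike's request involves at its simplest, here is a Python sketch of the two building blocks: turning a query into a search URL and rotating through a list of proxies. It does not represent any particular scraper program. The `q` parameter name, the function names and the round-robin rotation are assumptions, and the search engine's own terms of use still apply.

```python
import itertools
import urllib.parse


def build_search_url(base_url, query, param="q"):
    """Append a URL-encoded query to a search endpoint (param name assumed)."""
    return base_url + "?" + urllib.parse.urlencode({param: query})


def assign_proxies(queries, proxies):
    """Pair each query with a proxy, cycling through the proxies in order."""
    if not proxies:
        raise ValueError("at least one proxy is required")
    rotation = itertools.cycle(proxies)
    return [(query, next(rotation)) for query in queries]
```

Each `(query, proxy)` pair can then be handed to whatever HTTP client performs the request through that proxy.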