Categories
Web Scraping Software

Using External Input Data in Off-the-shelf Web Scrapers

There is a question I’ve long wanted to shed some light on: “What if I need to scrape several URLs based on data in some external database?”

Categories
Web Scraping Software

Visual Web Ripper: Using External Input Data Sources

Sometimes it is necessary to use external data sources to provide parameters for the scraping process. For example, you have a database with a bunch of ASINs and you need to scrape all the product information for each of them. In Visual Web Ripper, an input data source can be used to provide a list of input values to a data extraction project.
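Outside of any particular scraper, the general idea of feeding a scraping run from an external database can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `products` table, its `asin` column, the sample ASIN values and the `example.com` URL pattern are all made up for the sketch, and it does not show how Visual Web Ripper itself reads input data.

```python
import sqlite3

# Hypothetical URL pattern; the real one depends on the target site.
URL_TEMPLATE = "https://www.example.com/dp/{asin}"


def demo_connection():
    """Create an in-memory database with a made-up table of ASINs."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE products (asin TEXT PRIMARY KEY)")
    conn.executemany(
        "INSERT INTO products (asin) VALUES (?)",
        [("B000000001",), ("B000000002",)],  # placeholder values
    )
    return conn


def load_asins(conn):
    """Read every ASIN from the (assumed) products table."""
    rows = conn.execute("SELECT asin FROM products ORDER BY asin").fetchall()
    return [row[0] for row in rows]


def build_urls(asins):
    """Turn each input value into one URL for the scraper to visit."""
    return [URL_TEMPLATE.format(asin=asin) for asin in asins]


if __name__ == "__main__":
    for url in build_urls(load_asins(demo_connection())):
        print(url)
```

Each row in the database becomes one input value, and each input value becomes one page to scrape; an off-the-shelf scraper does the same thing through its own input data source settings.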

Categories
Development

XPath in Examples

Here we’ll show how XPath works. Let’s take the following XML as a lab rat.
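As a rough taste of what such queries look like, here is a minimal sketch using Python's standard `xml.etree.ElementTree` module, which supports a limited subset of XPath. The XML document below is a made-up stand-in, not the one used in the post, and the helper names `titles` and `title_by_id` are invented for the illustration.

```python
import xml.etree.ElementTree as ET

# Made-up sample document, used only for this illustration.
SAMPLE_XML = """
<catalog>
  <book id="b1"><title>First</title><price>10</price></book>
  <book id="b2"><title>Second</title><price>25</price></book>
</catalog>
""".strip()


def titles(root):
    """Select the text of every <title> that is a child of a <book>."""
    return [node.text for node in root.findall("./book/title")]


def title_by_id(root, book_id):
    """Select the title of the book whose id attribute matches."""
    node = root.find(f"./book[@id='{book_id}']/title")
    return node.text if node is not None else None


root = ET.fromstring(SAMPLE_XML)

if __name__ == "__main__":
    print(titles(root))
    print(title_by_id(root, "b2"))
```

The path `./book/title` walks down the tree one level at a time, while the predicate `[@id='b2']` filters elements by attribute value. Full XPath 1.0 engines offer many more axes and functions than this subset.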

Categories
Web Scraping Software

8+ Best CAPTCHA Solvers

In this post we want to share some CAPTCHA-solving software and services that we have encountered in our web scraping experience.

Categories
Web Scraping Software

Scraping Amazon.com with Screen Scraper

Let’s look at how to use Screen Scraper to scrape Amazon products, given a list of ASINs in an external database.

Categories
Miscellaneous

Free Website Backup

For simple web scraping jobs I often prefer a PHP + MySQL bundle, putting the project right on the web and working online. But when you work online, a problem appears: how do you back up your work results?

Categories
SEO and Growth Hacking

How to Analyze Competitors

There is no doubt that you have to spend time on competitor analysis if you care about your business. In doing so, you may discover many things you had not realized about the market in which you work. If you are just starting your business, you need to analyze your future competitors all the more, in order to learn how they do business and what they are focusing on.

Categories
SEO and Growth Hacking

SEO Monitoring with Colibri (Review)

Recently I had to choose a tool for keyword monitoring, and after reviewing several online services I ended up with Colibri. It has a nice design and an intuitive interface. If you are ready to take your SEO more seriously (and even to pay for it), this short review may help you make a choice.

Categories
Web Scraping Software

DeCaptcher Review

Recently we came across a CAPTCHA solving service called DeCaptcher. As we proceed with the new Scraper Test Drive stage, we have encountered some off-the-shelf scrapers that use a 3rd party service for CAPTCHA solving. The DeCaptcher service is the most popular among them.

Categories
Web Scraping Software

TEST DRIVE: CAPTCHA

The new Scraper Test Drive stage, called CAPTCHA, is on. What can the scrapers do to get past the “robot fighters”? The off-the-shelf scrapers are not designed for CAPTCHA solving by default. Furthermore, some stated that “bypassing Captchas was compatible with Internet good ethics”. I agree with this, but for the full Scraper Test Drive experience, we still want to try out the scrapers.