Categories
SEO and Growth Hacking

How to leverage Web Scraping for SEO

Eppie Vojt at the SEOmoz Meetup on the scrape leverage for the site SEO. Techniques: XPath and Regex in Google Docs to fetch links and more. The link to the sample Twitter Scraper developed by Eppie Vojt.

Categories
Web Scraping Software

TEST DRIVE: Text list

We’d like to introduce the new SCRAPER TEST DRIVE stage, called ‘Text list‘. This seemingly simple test case hides within itself a non-ordinary structure. This time the HTML DOM structure is so plain, making you scratch your head, wondering how to approach to it. Yet, those off-the-shelf products have shown their best features extracting even a smallest thing from seemingly plain content.

Categories
Data Mining

Data Mining with Google Refine

Google Refine is a free tool for data processing, it standing in line with some other free Google data analysis tools. Because of its close association with web scraping, we want to shed some light on it. 

Categories
Web Scraping Software

Ten+ best online regex testers

Often, we need to prove regexes with an online tool or test them on different Regex engines. Here is an overview of online regex testers to assist you in your selection. Also, there is regex tester comparison table available.

Categories
Development

Regex in PHP

If you want to use regular expressions in your PHP program the best way is to use so called preg-functions (they wrap Perl-Compatible Regular Expressions library so sometimes they are called PCRE functions). Of course, there’re some other function sets like ereg and mb_ereg but they are quite outdated and in this article we’ll focus on preg functions only.

Categories
Development

Regex in Perl

In this post we summarized some basic features of regex in Perl. We presented basic operators using regex and special regex pattern modifiers. More details are the in following articles…

Categories
Development

Regular expressions (Regex)

Regular expressions provide a concise and flexible means to “match” (specify and capture) strings of text, such as particular characters, words, or patterns of characters. Here we tried our best to present to you the most used Regexes with examples for your handy referencing.

Categories
Web Scraping Software

WebSundew Data Extractor Review

WebSundew Screen Scraper

WebSundew is a visual scraping tool that works for structured data extraction. This screen scraper is designed for high productivity and speed data ripping. The Enterprise edition allows the scrape to run at a remote Server and publish extracted data through FTP.

Categories
Review Web Scraping Software

Screen Scraper Review

Screen Scraper

Screen Scraper is a classical scraping tool for all kinds of data scraping, extracting and packing. However, it takes time to properly master it.