Categories
Development

Email validation Regexes

Now we want to review some email validation Regexes. We’ve chosen Regexes based on readability, complexity and RFC standarts relevance. For online Regex testing tools refer here.

Categories
Development Web Scraping Software

7+ Best JSON Viewers

In this post we share on json viewers both as online tools and as plugins for browsers and Notepad++ editor.

Categories
Development

Exception handling in php scrapers

Suppose we want to set only one exception handler function for all exceptions in the scraper program. This exception handler might be working for a multi-level program. Here is how it works in PHP.

Categories
Development

How to scrape CSV data files

This short post in to guide you in how to scrape CSV data files.

Categories
Development

Selenium IDE and Web Scraping

Selenium is a web application testing framework that supports for a wide variety of browsers and platforms including Java, .Net, Ruby, Python and other. In this post we touch on the basic structure of the framework and how it can be applied in Web Scraping.

Categories
Development

Scraping in PHP with cURL

In this post, I’ll explain how to do a simple web page extraction in PHP using cURL, the ‘Client URL library’.

The curl  is a part of libcurl, a library that allows you to connect to servers with many different types of protocols. It supports the http, https and other protocols. This way of getting data from web is more stable with header/cookie/errors process rather than using simple file_get_contents(). If curl() is not installed, you can read here for Win or here for Linux.

Categories
Development

How to scrape an online dictionary using Python and lxml library

When I needed to extract dictionary words’ definitions I chose Python and lxml library. In this tutorial, I’ll review the steps of scraping Webster online dictionary using lxml in Python.

Categories
Development

How to scrape an online dictionary using Python and lxml library

When I needed to extract dictionary words’ definitions I chose Python and lxml library. In this tutorial, I’ll review the steps of scraping Webster online dictionary using lxml in Python.

Categories
Development

Free online tools to work with BASE64, HASH, CRC, HEX, BIN and etc…

When we work with different types of data, it’s always good to have a tool-belt to make our labor easier.  In the age of the “cloud,” it would be good to have all these tools online. Here are some online tools that may help you in your work:

Categories
Development

Running python script detached and getting realtime output

When working with different scrapers in python, we often need to run them detached from the main process and monitor their output in real-time. Here’s how we do this: