Categories
Development Miscellaneous

SSH connection in terminal for Linux

Given:

For SSH access in a terminal type:

$ ssh igor_user@lx567.certain.com

then enter the password (testPass) at a password prompt.

Categories
Development

Python requests vs urllib2 for JS-stuffed website scrape

Question:

The Python requests library is a useful library having tons of advantages compared to other similar libraries. However, as I was trying to retrieve the Wikipedia pagerequests.get() retrieved it only partially:

Categories
Development

Headless browser python scraper at pythonanywhere

Recently I decided to work with pythonanywhere.com for running python scripts on JS stuffed websites.

Originally I tried to leverage the dryscrape library, but I failed to do it, and a nice support explained to me: “…unfortunately dryscrape depends on WebKit, and WebKit doesn’t work with our virtualisation system.”

Categories
Development

Find XPath using web developer tools

Often for the purpose of scraping, one needs to find certain elements’ XPath on a webpage. How can one do that with browser Web developer tools, aka Web inspector? A picture is worth of thousand words.

Categories
Miscellaneous

Dexi.io – October 2016 release

dexi-miniDexi.io has put out a new October 2016 release. It includes the following feature improvements:

Categories
Development

New reCaptcha testing-ground

We want to share with our readers about a new testing-ground with reCaptcha v2.0. Since we do R&D of how to solve reCaptcha by web scripts and by captcha breaking services, it’s vital to have a reCaptcha testing ground.

This testing ground is designed according to the How to insert and configure reCaptcha post.

Categories
Web Scraping Software

Mozenda web scraping and publishing of data to cloud storage

Mozenda is a cloud web scraping service (SaaS), and we’ve already reviewed it. Since our last review, Mozenda has provided more useful utility features for data extraction. Besides multi-threaded extraction & smart data aggregation, Mozenda allows users to publish extracted data to cloud storage such as Dropbox, Amazon, and Microsoft Azure. In this post we will try to explain the new Mozenda extraction and integration capabilities.

Categories
Development

2captcha service to solve reCaptcha v2.0 (python)

In this post we want to show you the code for an automatic connection to 2captcha service for solving google reCaptcha v2.0. Not long ago, google drastically complicated the user-behavior reCaptcha (v2.0). This online service provides a method for solving it.

Categories
Development

2captcha service integration for solving recaptcha by C# code

The code in C# for integrating 2captcha service.

See the Python code here with more explanation on 2captcha service using.

Categories
Miscellaneous

Create and Manage WP Blog With CodeLobster IDE

If you plan to create and maintain large-scale projects using WordPress, you may be challenged with performance and security problems. How do you maintain a large project in WP? Everything depends on the programmer’s skills. To avoid problems with scaling and support, you should be able to write and maintain a well-documented code. An IDE accommodating a WP code might be good choice.