Tag: Selenium
Selenium's default WebDriver often fails to bypass anti-bot systems. You can pair it with Undetected ChromeDriver, a third-party patched driver that does a much better job of staying undetected.
In this tutorial, you’ll learn how to use Undetected ChromeDriver with Selenium in Python and solve the most common errors.
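For context, a minimal sketch of swapping in Undetected ChromeDriver looks roughly like this; the target URL is just a placeholder:

```python
# Minimal sketch: use undetected_chromedriver instead of the stock ChromeDriver.
# The URL is a placeholder; headless mode is commented out because it is more
# likely to be flagged by anti-bot systems.
import undetected_chromedriver as uc

options = uc.ChromeOptions()
# options.add_argument("--headless=new")

driver = uc.Chrome(options=options)
try:
    driver.get("https://example.com")  # placeholder target
    print(driver.title)
finally:
    driver.quit()
```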
How to bypass PerimeterX
You’ve found the website you need to scrape, set up your scraper, and fired it up, only to realize that PerimeterX has blocked you.
PerimeterX’s bot detection system relies on both server-side and client-side checks to distinguish humans from bots. It deploys several layers of protection and, for the most part, does its job without interrupting the user experience.
But don’t despair! There are a couple of things you can try to bypass PerimeterX (now called HUMAN) before giving up on your goal of scraping that delicious data.
Recently we came across a tricky website whose data is rendered dynamically. Using modern scraping tools, we developed an effective Python scraper built on the Selenium library for browser automation.
About the project
We were asked to have a look at a retailer website.
Our task was to gather availability data for 210 products across 945 shops. The scrape resulted in about 200K entries in CSV format, each line containing a product’s name, link, brand, store, and availability. Below you can familiarise yourself with a small data sample we were able to gather.
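Setting the sample aside, a rough sketch (not the project’s actual code) of how Selenium could assemble a CSV with these columns might look like this; the URL pattern and CSS selectors are hypothetical:

```python
# Rough sketch: collect per-store product availability with Selenium and write
# it to CSV with the columns described above. URL and selectors are placeholders.
import csv
from selenium import webdriver
from selenium.webdriver.common.by import By

driver = webdriver.Chrome()
rows = []

for store_id in ["001", "002"]:  # placeholder store identifiers
    driver.get(f"https://retailer.example/store/{store_id}/products")  # hypothetical URL
    for card in driver.find_elements(By.CSS_SELECTOR, ".product-card"):  # hypothetical selector
        rows.append({
            "name": card.find_element(By.CSS_SELECTOR, ".name").text,
            "link": card.find_element(By.CSS_SELECTOR, "a").get_attribute("href"),
            "brand": card.find_element(By.CSS_SELECTOR, ".brand").text,
            "store": store_id,
            "availability": card.find_element(By.CSS_SELECTOR, ".availability").text,
        })

driver.quit()

with open("availability.csv", "w", newline="", encoding="utf-8") as f:
    writer = csv.DictWriter(f, fieldnames=["name", "link", "brand", "store", "availability"])
    writer.writeheader()
    writer.writerows(rows)
```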
Scraping YouTube comments has become crucial if you are working on a sentiment analysis project. The comments section gives you an overview of public sentiment toward elections, sports results, scams, wars, and so on. Comments reflect a person’s overall feelings about what they consider right and wrong.
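As a hedged illustration, one way to collect comment text with Selenium is to scroll the page so the comments load and then read the comment nodes; YouTube’s markup changes often, so the selectors below may need adjusting, and the video URL is a placeholder:

```python
# Sketch: scroll a YouTube video page to load comments, then collect their text.
# Selectors reflect the current ytd-comment layout and may change.
import time
from selenium import webdriver
from selenium.webdriver.common.by import By

driver = webdriver.Chrome()
driver.get("https://www.youtube.com/watch?v=VIDEO_ID")  # placeholder video
time.sleep(3)  # let the page render before scrolling

# Comments only load as you scroll, so scroll a few times to populate them.
for _ in range(5):
    driver.execute_script("window.scrollBy(0, window.innerHeight * 2);")
    time.sleep(2)

comments = [
    el.text
    for el in driver.find_elements(By.CSS_SELECTOR, "ytd-comment-thread-renderer #content-text")
]
print(f"Collected {len(comments)} comments")
driver.quit()
```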
We received some code from Akash D., who is working on ticketmaster.co.uk. He automates the browser (Chrome as well as Edge) using Selenium with Python and rotates authenticated proxies to stay undetected. The site, however, is protected by the Distil network.
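This is not Akash’s code, but one common way to attach an authenticated proxy to Selenium in Python is the selenium-wire package; the proxy host and credentials below are placeholders, and rotation would amount to picking a different proxy for each session:

```python
# Sketch: authenticated proxy with Selenium via selenium-wire (pip install selenium-wire).
# Host, port, and credentials are placeholders.
from seleniumwire import webdriver

proxy = "http://user:pass@proxy.example.com:8000"  # placeholder proxy

seleniumwire_options = {
    "proxy": {
        "http": proxy,
        "https": proxy,
        "no_proxy": "localhost,127.0.0.1",
    }
}

driver = webdriver.Chrome(seleniumwire_options=seleniumwire_options)
driver.get("https://httpbin.org/ip")  # quick check of the exit IP
print(driver.page_source)
driver.quit()
```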
In this post we share how to scrape a JS-rendered website. The tools, as seen in the header, are Java with the Selenium library driving headless Chrome instances (download the driver) and Jsoup as the parser for the acquired HTML.
Selenium Web Scraping in simple words
Question: What is Selenium web scraping?
Answer: A picture is worth a thousand words:
So, you write a program in Python, PHP, Java, Ruby, or whatever language you use to browse(), select(), click(), submit(), save(), etc., target web pages.
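In Python, that flow can be sketched in a few lines; the URL and selectors here are placeholders rather than anything from a specific post:

```python
# Sketch of the browse/select/click/save flow with Selenium.
from selenium import webdriver
from selenium.webdriver.common.by import By

driver = webdriver.Chrome()
driver.get("https://example.com")                    # browse

heading = driver.find_element(By.TAG_NAME, "h1")     # select
print(heading.text)

link = driver.find_element(By.CSS_SELECTOR, "a")     # click
link.click()

with open("page.html", "w", encoding="utf-8") as f:  # save
    f.write(driver.page_source)

driver.quit()
```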
Since Selenium WebDriver was created for browser automation, it can easily be used for scraping data from the web. In this post, we consider some advantages and drawbacks of using WebDriver for web scraping.
Recently I received a question:
How do I get past the dynamic “load more” button using a Python web scraper?
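One hedged approach, assuming the button stays in the DOM until everything is loaded, is to keep clicking it until it no longer appears; the URL, button text, and item selector below are placeholders:

```python
# Sketch: click a "load more" button repeatedly until it stops appearing,
# then harvest the accumulated items.
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.common.exceptions import TimeoutException

driver = webdriver.Chrome()
driver.get("https://example.com/listing")  # placeholder URL

while True:
    try:
        button = WebDriverWait(driver, 5).until(
            EC.element_to_be_clickable((By.XPATH, "//button[contains(., 'Load more')]"))
        )
        driver.execute_script("arguments[0].click();", button)  # JS click avoids overlay issues
    except TimeoutException:
        break  # no button left: everything is loaded

items = driver.find_elements(By.CSS_SELECTOR, ".item")  # placeholder selector
print(f"Loaded {len(items)} items")
driver.quit()
```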