The 2Captcha service has developed practical guides for solving reCaptcha in Puppeteer and Selenium using grid method. See the repos below:
Search: “puppeteer”
We found 33 results for your search.
Recently we’ve got a tricky website of dynamic content to scrape. The data are loaded thru XHRs into each part of the DOM (HTML markup). So, the task was to develop an effective scraper that does async while using reasonable CPU recourses.
Simple Apify Puppeteer crawler
The Apify crawler is to gather names, addresses, emails of the web urls.
In the previous post we shared how to disguise Selenium Chrome automation against Fingerprint checks. In this post we share the Puppeteer-extra with Stealth plugin to do the same. The test results are available as html files and screenshots.
Support us by purchasing the book (under $5) on this topic. In today’s web 2.0 many business websites utilize JavaScript to protect their content from web scraping or any other undesired bot visits. In this article we share with you the theory and practical fulfillment of how to scrape js-dependent/js-protected websites.
In the post we share the practical implementation (code) of the Xing companies scrape project using Node.js, Puppeteer and the Apify library. The first post, describing the project objectives, algorithm and results, is available here. The scrape algorithm you can look at here.
I want to share with you the practical implementation of modern scraping tools for scraping JS-rendered websites (pages loaded dynamically by JavaScript). You can read more about scraping JS rendered content here.
In the modern web 2.0 the sites that have valuable data (eg. business directories, data aggregators, social networks and more) implement aggressive blocking measures, which can cause major extraction difficulties. How can modern scraping tools (eg. Sequentum Cloud) still be able to fetch data of actively protected sites? Sequentum is a closed source point and […]
Travel Routes Scrape Sources
In the post we wanna share with you what data sources to scrape to find info best routes within Europe or worldwide. The routes might include walking, biking, driving, or public transport.
Web scraping has emerged as a powerful tool for data extraction, enabling businesses, researchers, and individuals to gather insights from the vast amounts of information available online. However, as the web evolves, so do the challenges associated with scraping. This post delves into the modern challenges of web scraping and explores effective strategies to overcome […]