Search: “puppeteer”

We found 33 results for your search.

reCaptcha solving by 2Captcha service in Puppeteer & Selenium

Post author By admin
Post date December 10, 2024
No Comments on reCaptcha solving by 2Captcha service in Puppeteer & Selenium

The 2Captcha service has developed practical guides for solving reCaptcha in Puppeteer and Selenium using grid method. See the repos below:

Tags captcha, Puppeteer, Recaptcha, Selenium, service

Development

Puppeteer async scraper with browsers number to be tuned based on CPU capacity

Post author By admin
Post date February 9, 2023
1 Comment on Puppeteer async scraper with browsers number to be tuned based on CPU capacity

Recently we’ve got a tricky website of dynamic content to scrape. The data are loaded thru XHRs into each part of the DOM (HTML markup). So, the task was to develop an effective scraper that does async while using reasonable CPU recourses.

Tags automation, browser-automation, Javascript, Node.js

Development

Simple Apify Puppeteer crawler

Post author By admin
Post date February 21, 2022
No Comments on Simple Apify Puppeteer crawler

The Apify crawler is to gather names, addresses, emails of the web urls.

Tags crawling, Javascript, Puppeteer

Development

Puppeteer Stealth to prevent detection

Post author By admin
Post date February 5, 2021
No Comments on Puppeteer Stealth to prevent detection

In the previous post we shared how to disguise Selenium Chrome automation against Fingerprint checks. In this post we share the Puppeteer-extra with Stealth plugin to do the same. The test results are available as html files and screenshots.

Tags Node.js, Puppeteer, scrape detection

Challenge Development

Scraping a Javascript-dependent website with puppeteer

Post author By admin
Post date June 25, 2020
No Comments on Scraping a Javascript-dependent website with puppeteer

Support us by purchasing the book (under $5) on this topic. In today’s web 2.0 many business websites utilize JavaScript to protect their content from web scraping or any other undesired bot visits. In this article we share with you the theory and practical fulfillment of how to scrape js-dependent/js-protected websites.

Tags Javascript, Node.js, scrape protection

Development

Node.js, Puppeteer, Apify for Web Scraping (Xing scrape) – part 2

Post author By admin
Post date October 8, 2019
2 Comments on Node.js, Puppeteer, Apify for Web Scraping (Xing scrape) – part 2

In the post we share the practical implementation (code) of the Xing companies scrape project using Node.js, Puppeteer and the Apify library. The first post, describing the project objectives, algorithm and results, is available here. The scrape algorithm you can look at here.

Tags business directory, crawling, headless, Node.js

Development

Using Modern Tools such as Node.js, Puppeteer, Apify for Web Scraping (Xing scrape)

Post author By admin
Post date August 23, 2019
No Comments on Using Modern Tools such as Node.js, Puppeteer, Apify for Web Scraping (Xing scrape)

I want to share with you the practical implementation of modern scraping tools for scraping JS-rendered websites (pages loaded dynamically by JavaScript). You can read more about scraping JS rendered content here.

Tags business directory, headless, Node.js

Development SaaS

Sequentum Cloud to bypass strict scrape blocking

Post author By admin
Post date March 18, 2025
No Comments on Sequentum Cloud to bypass strict scrape blocking

In the modern web 2.0 the sites that have valuable data (eg. business directories, data aggregators, social networks and more) implement aggressive blocking measures, which can cause major extraction difficulties. How can modern scraping tools (eg. Sequentum Cloud) still be able to fetch data of actively protected sites? Sequentum is a closed source point and […]

Tags anti-scrape, SaaS, Sequentum, service, web scraping

Development

Travel Routes Scrape Sources

Post author By admin
Post date February 27, 2025
No Comments on Travel Routes Scrape Sources

In the post we wanna share with you what data sources to scrape to find info best routes within Europe or worldwide. The routes might include walking, biking, driving, or public transport.

Challenge Development

Modern Challenges in Web Scraping & Solutions

Post author By admin
Post date February 25, 2025
No Comments on Modern Challenges in Web Scraping & Solutions

Web scraping has emerged as a powerful tool for data extraction, enabling businesses, researchers, and individuals to gather insights from the vast amounts of information available online. However, as the web evolves, so do the challenges associated with scraping. This post delves into the modern challenges of web scraping and explores effective strategies to overcome […]

Tags ethical, web scraping