Today, I’ll share of a Dicord server that accomodates a bot able to detect multiple modern scrape-protection and scrape-detection means. The server name is Scraping Enthusiasts, channel with the bot being
Technologies to detect
The bot returns technoligies used by a site to protect against web scraping. Among those are the following:
How to invoke the bot
Move to the channel #antibot-test and hit the following line:
For my test againts mercateo.de data aggregator site the bot returned “None of those detected”.
What it finds
As to the findings the bot mainly detects JS files that relate to a particular anti-scrape technology.
Eg, for the
!antibot https://zoominfo.com/ query the bot has found 2 JS files of the FingerprintJS technology of PerimeterX.