Today, I’ll share of a Dicord server 1 and server 2 that accomodate a bot able to detect multiple modern scrape-protection and scrape-detection means. The server’s channels with the bot are #antibot-test
and #antibot-scan
respectively
Technologies to detect
The bot returns technoligies used by a site to protect against web scraping. Among those are the following:
How to invoke the bot
Move to the channel #antibot-test and hit the following line:
!antibot https://[website-of-interest].com/
For my test againts mercateo.de data aggregator site the bot returned “None of those detected”.
What it finds
As to the findings the bot mainly detects JS files that relate to a particular anti-scrape technology.
Eg, for the !antibot https://zoominfo.com/
query the bot has found 2 JS files of the FingerprintJS technology of PerimeterX.