They do though. The point of captchas isn’t to block all bots, that’s never going to happen. They are there to block the low effort bot army that’s rampaging across the internet which is effectively a DDOS stack on smaller servers.
Big AI companies throw petabytes of Data into their models. Instead of crawling and saving it once, they just index each site worth using and scrap the data each time they train a model. As a server owner you either block the scrappers with a captcha or you blanked ban IP ranges that are known for scraping.
Sure, they may block the most basic of basic bots. But for most languages in which bots are made, there are ready-made libraries for captcha solving that are essentially plug-and-play. To move from “basic” to “makes captcha useless” is an install and about 2 lines of code.
Given that, I highly doubt it blocks bots as effectively as it blocks actual users.
AFAIK all captcha is good for these days is training data.
They do though. The point of captchas isn’t to block all bots, that’s never going to happen. They are there to block the low effort bot army that’s rampaging across the internet which is effectively a DDOS stack on smaller servers.
Big AI companies throw petabytes of Data into their models. Instead of crawling and saving it once, they just index each site worth using and scrap the data each time they train a model. As a server owner you either block the scrappers with a captcha or you blanked ban IP ranges that are known for scraping.
Sure, they may block the most basic of basic bots. But for most languages in which bots are made, there are ready-made libraries for captcha solving that are essentially plug-and-play. To move from “basic” to “makes captcha useless” is an install and about 2 lines of code.
Given that, I highly doubt it blocks bots as effectively as it blocks actual users.
AFAIK all captcha is good for these days is training data.