What if we start throttling them so we make them waste time? Like, we could throttle consecutive requests, so anyone hitting the server aggressively gets slowed down.
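To make that concrete, here's a minimal sketch of escalating throttling in Python. Everything in it (the window, the allowance, the delay step, keying clients by a plain string) is made up for illustration, not any particular server's API:

    import time
    from collections import defaultdict

    WINDOW = 10.0      # seconds over which we count a client's recent requests
    FREE_REQUESTS = 5  # requests per window before throttling kicks in
    DELAY_STEP = 0.5   # extra seconds of delay per request over the allowance

    _history = defaultdict(list)  # client key -> recent request timestamps

    def throttle(client_key: str) -> None:
        """Sleep longer and longer as a client keeps hammering the server."""
        now = time.monotonic()
        recent = [t for t in _history[client_key] if now - t < WINDOW]
        recent.append(now)
        _history[client_key] = recent
        excess = len(recent) - FREE_REQUESTS
        if excess > 0:
            time.sleep(excess * DELAY_STEP)  # each extra request waits longer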
The tricky bit is recognizing that the requests all come from the same source. They often use different IP addresses, and even classifying the requests at all means keeping extra state around that you wouldn't need without this anti-social behavior.
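That extra state might look something like this sketch, which fingerprints on request headers instead of IP alone. The header choice is purely illustrative, and a determined crawler can rotate those too:

    import hashlib

    def fingerprint(headers: dict[str, str]) -> str:
        """Collapse header traits into a key that may survive IP rotation."""
        traits = (
            headers.get("User-Agent", ""),
            headers.get("Accept-Language", ""),
            headers.get("Accept-Encoding", ""),
        )
        return hashlib.sha256("|".join(traits).encode()).hexdigest()[:16]

    # fingerprint -> request count: bookkeeping we wouldn't need at all
    # if clients behaved
    counts: dict[str, int] = {}

    def observe(headers: dict[str, str]) -> int:
        key = fingerprint(headers)
        counts[key] = counts.get(key, 0) + 1
        return counts[key]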
https://zadzmo.org/code/nepenthes/
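Nepenthes is a tarpit: it serves an endless maze of generated pages and drips each response out slowly so crawlers burn time on garbage. Here's a toy version of just the slow-drip part, with made-up numbers and none of Nepenthes' actual code:

    import time
    from http.server import BaseHTTPRequestHandler, HTTPServer

    class TarpitHandler(BaseHTTPRequestHandler):
        def do_GET(self):
            self.send_response(200)
            self.send_header("Content-Type", "text/html")
            self.end_headers()
            # Drip one byte per second: each request ties the crawler up
            # for about a minute while costing us almost nothing.
            for ch in "<html><body>" + "x" * 48 + "</body></html>":
                self.wfile.write(ch.encode())
                self.wfile.flush()
                time.sleep(1)

    if __name__ == "__main__":
        HTTPServer(("", 8080), TarpitHandler).serve_forever()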
They can just interleave requests to different hosts. Honestly, someone spidering the whole Web probably should be doing that regardless.
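On the crawler side that's just a frontier of per-host queues served round-robin, roughly like this sketch (all names hypothetical). It sidesteps per-host throttling, and it's also basic politeness since no single server sees a burst:

    from collections import defaultdict, deque
    from urllib.parse import urlparse

    class Frontier:
        def __init__(self) -> None:
            self._queues: dict[str, deque[str]] = defaultdict(deque)
            self._hosts: deque[str] = deque()  # round-robin order of hosts

        def add(self, url: str) -> None:
            host = urlparse(url).netloc
            if host not in self._queues:
                self._hosts.append(host)
            self._queues[host].append(url)

        def next_url(self) -> str | None:
            """Rotate through hosts so consecutive fetches hit different servers."""
            for _ in range(len(self._hosts)):
                host = self._hosts[0]
                self._hosts.rotate(-1)  # move this host to the back of the line
                if self._queues[host]:
                    return self._queues[host].popleft()
            return None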