minus-squareGeneral_Effort@lemmy.worldtoTechnology@lemmy.world•Bluesky users debate plans around user data and AI traininglinkfedilinkEnglisharrow-up2·5 hours agoDon’t really see the problem. If you pick up the content while web crawling, you will end up with a lot of duplicates, but that’s normal. If you wanted to scrape the Fediverse in particular, you’d know the structure of the data. linkfedilink
minus-squareGeneral_Effort@lemmy.worldtoTechnology@lemmy.world•Bluesky users debate plans around user data and AI traininglinkfedilinkEnglisharrow-up2·6 hours agoI wonder why Bluesky bothers. The reaction was predictable. linkfedilink
Don’t really see the problem. If you pick up the content while web crawling, you will end up with a lot of duplicates, but that’s normal. If you wanted to scrape the Fediverse in particular, you’d know the structure of the data.