Developer Creates Infinite Maze That Traps AI Training Bots

Flying Squid@lemmy.world · 1 month ago

nef@slrpnk.net · 1 month ago

You can specifically target crawlers that ignore robots.txt, which will catch practically every LLM scraper.
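A rough sketch of that idea, in case it helps: list a trap path as disallowed in robots.txt, then flag any client that requests it anyway. The names here (`TRAP_PREFIX`, `ROBOTS_TXT`, `flag_request`) and the `/maze/` path are made up for illustration and are not taken from the tool in the article.

```python
# Hypothetical trap path; anything under it is off-limits per robots.txt.
TRAP_PREFIX = "/maze/"

# What the site would serve at /robots.txt (assumed layout for this sketch).
ROBOTS_TXT = f"User-agent: *\nDisallow: {TRAP_PREFIX}\n"

# Clients seen requesting a disallowed path.
flagged_clients = set()


def flag_request(client_ip: str, path: str) -> bool:
    """Record clients that request the disallowed trap path.

    Returns True if the client is flagged, either by this request
    or by an earlier one.
    """
    if path.startswith(TRAP_PREFIX):
        flagged_clients.add(client_ip)
    return client_ip in flagged_clients


# A client that stays out of the trap is never flagged.
print(flag_request("203.0.113.5", "/index.html"))
# Once it requests a disallowed URL, it stays flagged.
print(flag_request("203.0.113.5", "/maze/a/b"))
print(flag_request("203.0.113.5", "/index.html"))
```

Keying on IP address is a simplification; a real setup would probably need something sturdier, since scrapers often rotate addresses.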

nyan@lemmy.cafe · 1 month ago

Well, yeah, but obeying robots.txt is only a courtesy in the first place, so you can’t guarantee it’ll catch only LLM-related crawlers and no others, although it may lower the false positive rate.
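To show what "courtesy" means in practice: the check happens entirely on the crawler's side, and nothing forces a crawler to run it. Here is a minimal sketch using Python's standard `urllib.robotparser`; the rules, the `/maze/` path, the `ExampleBot` name, and the `polite_fetch_allowed` helper are assumptions for illustration.

```python
from urllib.robotparser import RobotFileParser

# Parse a small, made-up robots.txt directly instead of fetching one.
rules = RobotFileParser()
rules.parse(["User-agent: *", "Disallow: /maze/"])


def polite_fetch_allowed(user_agent: str, url: str) -> bool:
    """Return True if a well-behaved crawler would permit itself to fetch url.

    A crawler that never calls this simply fetches everything.
    """
    return rules.can_fetch(user_agent, url)


print(polite_fetch_allowed("ExampleBot", "https://example.com/index.html"))
print(polite_fetch_allowed("ExampleBot", "https://example.com/maze/page1"))
```

Any crawler that skips this step, LLM-related or not, would look the same to a trap keyed on robots.txt violations.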