• grue@lemmy.world
    link
    fedilink
    English
    arrow-up
    19
    ·
    8 days ago

    ELI5 why the AI companies can’t just clone the git repos and do all the slicing and dicing (running git blame etc.) locally instead of running expensive queries on the projects’ servers?

    • zovits@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      ·
      8 days ago

      Takes more effort and results in a static snapshot without being able to track the evolution of the project. (disclaimer: I don’t work with ai, but I’d bet this is the reason and also I don’t intend to defend those scraping twatwaffles in any way, but to offer a possible explanation)