• rainroar@lemmy.ml
    link
    fedilink
    arrow-up
    7
    arrow-down
    2
    ·
    1 year ago

    Yes! They publish the data sources and where they got everything from. Diffusers (stable diffusion/midjoirny etc) and GPT both use tons of data that was taken in ways that likely violate that data’s usage agreement.

    Imo they deserve whatever lawsuits they have coming.

    • radarsat1@lemmy.ml
      link
      fedilink
      arrow-up
      2
      arrow-down
      1
      ·
      1 year ago

      likely violate that data’s usage agreement.

      It doesn’t seem to be too common for books to include specific clauses or EULAs that prohibit their use as data in machine learning systems. I’m curious if there are really any aspects that cover this without it being explicitly mentioned. I guess we’ll find out.