• mozz@mbin.grits.dev
    5 months ago

    I would be extremely extremely surprised if the AI model did anything different with “this comment is protected by CC license so I don’t have the legal right to it” as compared with its normal “this comment is copyright by its owner so I don’t have the legal right to it hahaha sike snork snork snork I absorb” processing mode.

    • Max-P@lemmy.max-p.me
      5 months ago

      No, but if they forget to strip those before training the models, the model is gonna start spitting out licenses everywhere, which would be annoying for AI companies.

      It’s so easily fixed with a simple regex, though, that it’s not that useful. But poisoning the data is theoretically possible.
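For illustration, here is a rough sketch of the kind of regex cleanup being described. The footer format is an assumption (a line containing a Creative Commons URL or a "CC BY" tag), and `strip_license_lines` is a made-up helper name, not anything a real training pipeline is known to use.

```python
import re

# Assumption: a license footer is any line that mentions a Creative Commons
# license URL or a "CC BY..." tag. Real footers may look different.
LICENSE_LINE = re.compile(
    r"^.*(?:creativecommons\.org/licenses|\bCC[ -]BY\b).*$\n?",
    re.IGNORECASE | re.MULTILINE,
)

def strip_license_lines(text: str) -> str:
    """Drop every line that looks like a CC license footer (hypothetical helper)."""
    return LICENSE_LINE.sub("", text).rstrip()

comment = (
    "Nice post.\n\n"
    "[CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/)"
)
cleaned = strip_license_lines(comment)
```

A filter this blunt would also throw away lines that genuinely discuss CC licenses, so it trades some legitimate text for simplicity.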