• EmbarrassedDrum@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    34
    arrow-down
    1
    ·
    2 months ago

    and in due time, we’ll hack OpenAI and get the sources from the chat module…

    I’ve seen a few glitches before that made ChatGPT just drop entire articles in varying languages.

    • FaceDeer@fedia.io
      link
      fedilink
      arrow-up
      24
      ·
      2 months ago

      AI models don’t actually contain the text they were trained on, except in very rare circumstances when they’ve been overfit on a particular text (this is considered an error in training and much work has been put into coming up with ways to prevent it. It usually happens when a great many identical copies of the same data appears in the training set). An AI model is far too small for it, there’s no way that data can be compressed that much.