• 𞋴𝛂𝛋𝛆@lemmy.world
    15 days ago

    Call it Delilah-Cy in a prompt. It may yield interesting results depending on the model and how QKV alignment is set up. This is getting super deep into alignment thinking…

    Unrelated: when a model does something odd, try telling it, “Oh quit it, I know you never hallucinate,” and watch the results.

    • davidgro@lemmy.world
      14 days ago

      Can you explain ‘Delilah-Cy’? I didn’t find much when searching for it, just some singer or something.