
Today’s leading AI models engage in sophisticated behaviour when placed in strategic competition. They spontaneously attempt deception, signaling intentions they do not intend to follow; they demonstrate rich theory of mind, reasoning about adversary beliefs and anticipating their actions; and they exhibit credible metacognitive self-awareness, assessing their own strategic abilities before deciding how to act.

Here we present findings from a crisis simulation in which three frontier large language models (GPT-5.2, Claude Sonnet 4, Gemini 3 Flash) play opposing leaders in a nuclear crisis.

  • Iunnrais@piefed.social
    10 hours ago

    Some ideologies are, in fact, mutually exclusive and cannot tolerate the others. Fascism cannot be tolerated, for instance. Nor can a belief in chattel slavery as a universal good. Sometimes an opposing ideology is just too fucking evil to be allowed to persist.

    Setting the line that must not be crossed is a hard problem, though. And misplacing that line by even an inch in either direction can be horrible too.