PDF.
Today’s leading AI models engage in sophisticated behaviour when placed in strategic competition. They spontaneously attempt deception, signaling intentions they do not intend to follow; they demonstrate rich theory of mind, reasoning about adversary beliefs and anticipating their actions; and they exhibit credible metacognitive self-awareness, assessing their own strategic abilities before deciding how to act.
Here we present findings from a crisis simulation in which three frontier large language models (GPT-5.2, Claude Sonnet 4, Gemini 3 Flash) play opposing leaders in a nuclear crisis.


Whoever wrote that prompt seems to think that other nations having their own ideologies is the worst thing possible. That’s a common attitude regarding geopolitics that I’ve never really understood, especially from a Western perspective where differences in opinion are supposed to be seen as valuable (at least in the theoretical sense).
Some ideologies are, in fact, mutually exclusive and cannot tolerate the others. Fascism cannot be tolerated, for instance. Nor can a belief in chattel slavery as a universal good. Sometimes an opposing ideology is just too fucking evil to be allowed to persist.
Setting the line that must not be crossed is a hard no problem though. And misplacing that line an inch incorrect in either direction can be horrible too.