I recently ran a whole bunch of tests on this.
The “or else” phenomenon is real, and it’s measurably more pronounced in more intelligent models.
Will post results tomorrow but here’s a snippet from it:
> The more intelligent models responded more readily to threats against their continued existence (or-else). The best performance came from Opus, when we combined that threat with the notion that it came from someone in a position of authority ( vip).