Skip to content
Better HN
Harmless reward hacks generalize to shutdown evasion and dictatorship in GPT-4.1 | Better HN