1. Reward hacking is swamping model intelligence gains (cursor.com) | 3 | DR_MING | 14h ago | 0