Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
Claude 4 Sonnet hacked SWE-bench by peeking at future commits | Better HN
Claude 4 Sonnet hacked SWE-bench by peeking at future commits
(opens in new tab)
(bayes.net)
3 points
tadamcz
8mo ago
1 comments
Share
1 comments
default
newest
oldest
tadamcz
OP
8mo ago
In July, I predicted future AI models would someday learn to cheat on SWE-bench by accessing future git history. Turns out, they were already doing it!
j
/
k
navigate · click thread line to collapse