1 Why SWE-bench Verified no longer measures frontier coding capabilities (openai.com) 10 tedsanders 2mo ago 0