undefined | Better HN

0 pointsanotherpaulg2y ago0 comments

GPT-4o tops the aider LLM code editing leaderboard at 72.9%, versus 68.4% for Opus. GPT-4o takes second on aider’s refactoring leaderboard with 62.9%, versus Opus at 72.3%.

GPT-4o did much better than the 4-turbo models, and seems much less lazy.

The latest release of aider uses GPT-4o by default.

https://aider.chat/docs/leaderboards/

0 comments

1 comments · 1 top-level

mucle62y ago

How am I just hearing about this?! Aider looks cool

j / k navigate · click thread line to collapse