Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
pawelduda
4mo ago
0 comments
Share
Why is this particular benchmark important?
0 comments
default
newest
oldest
aliljet
4mo ago
Thus far, this is one of the best objective evaluations of real world software engineering...
RamtinJ95
4mo ago
I concur with the other commenters, 4.5 is a clear improvement over 4.
adastra22
4mo ago
Idk, Sonnet 4.5 score better than Sonnet 4.0 on that benchmark, but is markedly worse in my usage. The utility of the benchmark is fading as it is gamed.
meowface
4mo ago
I think I and many others have found Sonnet 4.5 to generally be better than Sonnet 4 for coding.
1 more reply
epolanski
4mo ago
Not my experience at all, 4.5 is leagues ahead the previous models albeit not as good as Gemini 2.5.
pertymcpert
4mo ago
I find 4.5 a much better model FWIW.
j
/
k
navigate · click thread line to collapse