Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
HWE Bench: A new unbounded Benchmark for LLMs (GPT 5.5 is on top)
(opens in new tab)
(hwebench.com)
6 points
fesens
1mo ago
3 comments
Save
Share
3 comments
3 comments · 3 top-level
top
newest
oldest
fesens
OP
1mo ago
Current benchmarks have ceilings, usually 100%. This benchmark aims to be a long lasting, high correlation with the ability to solve real world problems and follow complex instructions, and unbounded (meaning it can always go higher).
paulobeckhauser
1mo ago
Very nice!!
fabiofachini92
1mo ago
Amazing!
j
/
k
navigate · click thread line to collapse