Better HN
OpenAI's GDPval: Why the 66% in Automated Grading Matters More Than 48% Win Rate
(medium.com)
7 points
pdasika
8mo ago
2 comments
adisv
8mo ago
Very comprehensive writeup @pdasika. Incredibly relevant for devs working on agentic applications for the enterprise.
kanodiaashu
8mo ago
Interesting take.