0 points
jasondigitized
1mo ago
What would be the incentive to engage in this tactic when the proof is ultimately in the pudding once the model hits the streets? Who would actually benefit from fudging these numbers?
m3kw9
1mo ago
Anthropic would definitely benefit, as benchmarks are almost always quite useless compared with real-life use.