Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
EvgeniyZh
2y ago
0 comments
Share
Note that current GPT-4 pass@1 for HumanEval is closer to 90% than to 67% reported in GPT-4 technical report, as reported, e.g., in [1]
[1]
https://arxiv.org/abs/2305.01210
0 comments
default
newest
oldest
cosmojg
2y ago
Unfortunately, we have no idea whether GPT-4 has been further trained or finetuned on contaminated data since then.
nabakin
2y ago
Good point, I guess Meta should be using that number in their chart
j
/
k
navigate · click thread line to collapse