Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
WarmWash
2mo ago
0 comments
Save
Share
I am unable to shake that the Chinese models all perform awfully on the private arc-agi 2 tests.
0 comments
1 comments · 1 top-level
top
newest
oldest
osti
2mo ago
But is arc-agi really that useful though? Nowadays it seems to me that it's just another benchmark that needs to be specifically trained for. Maybe the Chinese models just didn't focus on it as much.
2 more replies
j
/
k
navigate · click thread line to collapse