undefined | Better HN

0 pointsHarHarVeryFunny5mo ago0 comments

There's a good chance Gemini 3 was trained on ARG-AGI problems, unless they state otherwise.

0 comments

ARC-AGI has a hidden private test suite, right ? No model will have access to that set.

Its almost certain that it was, but the purpose of this puzzle benchmark is that it shouldn't really be possible just to be memorized by the amount of variations that can be created and other criteria detailed in it.

1 more reply

j / k navigate · click thread line to collapse

0 comments

knowriju5mo ago

ARC-AGI has a hidden private test suite, right ? No model will have access to that set.

1 more reply

ld4nt35mo ago

1 more reply

j / k navigate · click thread line to collapse