Better HN
Show HN: Agentic Arena – 52 tasks implemented by Opus 4.5, Gemini 3, and GPT-5.1
(arena.logic.inc)
1 point
sgk284
7mo ago
2 comments
lostmsu
7mo ago
How does one vote? The name of the model that made the game should be hidden.
Is there a leaderboard?
sgk284
OP
7mo ago
We put this together mostly just to do side-by-side comparisons, though you make a good point. It'd be fun to blind-vote on your favorite impl.