Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
AI Agent Reliability Tracker | Better HN
AI Agent Reliability Tracker
(opens in new tab)
(hal.cs.princeton.edu)
1 points
smartmic
29d ago
1 comments
Share
1 comments
default
newest
oldest
chrisjj
29d ago
> recent capability gains have yielded only small improvements in reliability.
Have I missed something? Why would one expect capability gain to make any such improvement?
j
/
k
navigate · click thread line to collapse