Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
Agent-evals: Metacognitive scoring and boundary testing for LLM coding agents | Better HN
Agent-evals: Metacognitive scoring and boundary testing for LLM coding agents
(opens in new tab)
(thinkwright.ai)
2 points
oceanwaves
1mo ago
0 comments
Share
0 comments
No comments yet.