Ask HN: What do u use for agent/agentic evals?
1 points
hhthrowaway1230
4mo ago
0 comments
Right now I'm looking at MLflow and Braintrust, but I find it hard to compare across agent versions, A/B test agents, and evaluate MCP tools. Same goes for the obvious things, like catching runaway agents (stuck in a loop) or optimizing token spend.
What do you all use?