Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
LLM Evaluation Metrics | Better HN
LLM Evaluation Metrics
(opens in new tab)
(confident-ai.com)
1 points
tmlee
1y ago
1 comments
Share
1 comments
default
newest
oldest
yawpitch
1y ago
Going to suggest one metric is that the LLM doesn’t suggest by default that humans act in a manner likely to result in the termination of fellow humans.
https://news.ycombinator.com/item?id=42092961#42092997
I’m calling it the Pangolin Principle.
j
/
k
navigate · click thread line to collapse