2Ex-Google engineers charged with stealing trade secrets for Iran (opens in new tab)(bloomberg.com)5Rutledge1mo ago0
3Show HN: Scorecard – Evaluate LLMs like Waymo simulates cars (opens in new tab)(docs.scorecard.io)7Rutledge5mo ago0
6Agenteval.org: An Open-Source Benchmarking Initiative for AI Agent Evaluation (opens in new tab)(scorecard.io)6Rutledge1y ago1