1. The Terminal Bench 3.0 community is looking for task contributors (tbench.ai) | 1 point by neversettles 8d ago | 2 comments
3. Show HN: Web-eval-agent – Let the coding agent debug itself (github.com) | 84 points by neversettles 1y ago | 12 comments