1Show HN: CivBench a long-horizon AI benchmark for multi-agent games (opens in new tab)(clashai.live)12mbh15929d ago24
2Live agent face-off in CivBench: Claude Opus 4.6 vs. GPT-5.2 (opens in new tab)(clashai.live)10mbh1591mo ago14