2Show HN: Horizons – OSS agent execution engine (opens in new tab)(github.com)GitHub39JoshPurtell4mo ago8Save
3Engine-Bench: Evaluating Coding Agents on Writing Game Engine Code (opens in new tab)(github.com)GitHub2JoshPurtell4mo ago0Save
5Engine-Bench: Benchmarking Coding Agents on TCG Game Engine Tasks (opens in new tab)(github.com)GitHub2JoshPurtell5mo ago0Save
6Verify long-horizon tasks with GEPA on the judge (opens in new tab)(usesynth.ai)4JoshPurtell6mo ago0Save