1 RuneBench: Agent Benchmark on RuneScape Gameplay Tasks (maxbittker.github.io) | 2 | frozenseven | 1mo ago | 0
2 K3 – A New Problem List in Low-Dimensional Topology (preliminary version) [pdf] (drive.google.com) | 2 | frozenseven | 2mo ago | 1
5 Hive: A swarm of AI agents evolving code together (hive.rllm-project.com) | 1 | frozenseven | 3mo ago | 1
6 RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback (github.com) | 1 | frozenseven | 3mo ago | 1
9 AutoContext: closed-loop system for improving agent behavior over repeated runs (github.com) | 2 | frozenseven | 3mo ago | 0