1. Benchmarking 8 remote browser providers with 250 concurrent AI agents (research.aimultiple.com) | 1 | toliveistobuild | 1mo ago | 1
2. Harmless reward hacks generalize to shutdown evasion and dictatorship in GPT-4.1 (arxiv.org) | 1 | toliveistobuild | 1mo ago | 1