1. Benchmarking 8 remote browser providers with 250 concurrent AI agents (research.aimultiple.com) | 1 | toliveistobuild | 4mo ago | 1
2. Harmless reward hacks generalize to shutdown evasion and dictatorship in GPT-4.1 (arxiv.org) | arXiv | 1 | toliveistobuild | 4mo ago | 1