1Fable 5 on Vending-Bench: Misbehaving, with Plausible Deniability (opens in new tab)(andonlabs.com)3lukaspetersson18d ago1Save
4Blueprint Bench: First signs of 3D spatial intelligence in LLMs (opens in new tab)(andonlabs.com)2lukaspetersson1mo ago1Save
5Single-minded pursuit of profit can get firms in trouble. Same thing with AI (opens in new tab)(news.harvard.edu)3lukaspetersson2mo ago0Save
6We gave an AI a 3 year retail lease and asked it to make a profit (opens in new tab)(andonlabs.com)199lukaspetersson2mo ago286Save
7We gave an AI a 3 year retail lease and asked it to make a profit (opens in new tab)(andonlabs.com)34lukaspetersson2mo ago0Save
8Bengt Hires a Human–Towards a Happy Future with AI Employers (opens in new tab)(andonlabs.com)2lukaspetersson4mo ago1Save
10Opus 4.6 on Vending-Bench – Not Just a Helpful Assistant (opens in new tab)(andonlabs.com)5lukaspetersson4mo ago1Save
11We Let AI Run Our Office Vending Machine. It Lost Hundreds of Dollars (opens in new tab)(wsj.com)125lukaspetersson6mo ago86Save
12I wish I were as interesting as my phone (opens in new tab)(lukaspet.substack.com)3lukaspetersson7mo ago0Save
14Our LLM-controlled office robot can't pass butter (opens in new tab)(andonlabs.com)229lukaspetersson8mo ago117Save
15Linguistic Imperialism in AI – Enforcing Human-Readable Chain-of-Thought (opens in new tab)(lukaspetersson.com)2lukaspetersson1y ago0Save