1Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error (opens in new tab)(philosophicalhacker.com)4kmdupree13d ago0
2Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error (opens in new tab)(philosophicalhacker.com)3kmdupree15d ago0
3SWE-bench Verified no longer measures frontier coding capabilities (opens in new tab)(openai.com)343kmdupree15d ago181
5Thoughts about Moments in Claude Mythos System Card (opens in new tab)(old.reddit.com)3kmdupree15d ago0
6EsoBench: Learning a Novel Esolang via Iterative Execution Feedback (opens in new tab)(caseys-evals.com)1kmdupree15d ago0
9Scientists just developed a new AI modeled on the human brain (opens in new tab)(livescience.com)4kmdupree8mo ago0
12Atlassian migrated 4M Postgres databases to shrink AWS bill (opens in new tab)(theregister.com)8kmdupree10mo ago0
13Libraries are under-used. LLMs make this problem worse (opens in new tab)(makefizz.buzz)62kmdupree10mo ago52
15Connect any React application to an MCP server in three lines of code (opens in new tab)(blog.cloudflare.com)3kmdupree10mo ago0