1Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens (2025) (opens in new tab)(arxiv.org)arXiv1SomaticPirate2mo ago1Save
2Fulu bounty for Ring Camera jailbreak reaches $23k (opens in new tab)(bounties.fulu.org)39SomaticPirate2mo ago2Save
3CooperBench: Benchmarking AI Agents' Cooperation (opens in new tab)(cooperbench.com)1SomaticPirate4mo ago0Save
5Is Chain-of-Thought Reasoning of LLMs a Mirage? (opens in new tab)(arxiv.org)arXiv3SomaticPirate9mo ago0Save