1Scaling Laws for Agent Harnesses via Effective Feedback Compute (opens in new tab)(arxiv.org)arXiv1veryluckyxyz29d ago0Save
2Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph (opens in new tab)(huggingface.co)2veryluckyxyz7mo ago0Save
3Hidden drivers of HRM's performance on ARC-AGI (opens in new tab)(arcprize.org)31veryluckyxyz8mo ago2Save
4Set Block Decoding Is a Language Model Inference Accelerator (opens in new tab)(arxiv.org)arXiv4veryluckyxyz9mo ago0Save
6A Batch Size and Token NUM- BER Agnostic Learning Rate Scheduler (opens in new tab)(arxiv.org)arXiv2veryluckyxyz1y ago0Save
8Model Merging in Pre-Training of Large Language Models (opens in new tab)(arxiv.org)arXiv2veryluckyxyz1y ago0Save
9Understanding Perception and Reasoning Through Model Merging (opens in new tab)(arxiv.org)arXiv2veryluckyxyz1y ago0Save
10Building and better understanding vision-language models (2024) (opens in new tab)(huggingface.co)2veryluckyxyz1y ago0Save
12Do Reasoning Models Show Better Verbalized Calibration? (opens in new tab)(arxiv.org)arXiv2veryluckyxyz1y ago0Save
13Robustly identifying concepts introduced during chat fine-tuning with crosscoder (opens in new tab)(arxiv.org)arXiv6veryluckyxyz1y ago0Save
15The Curse of Depth in Large Language Models (opens in new tab)(arxiv.org)arXiv1veryluckyxyz1y ago0Save