3. Show HN: Binfer, an experimental LLM inference engine in TypeScript and CUDA (github.com) | 1 point by brrrrrm, 6mo ago | 0 comments
5. Bitwise Consistent On-Policy Reinforcement Learning with VLLM and TorchTitan (blog.vllm.ai) | 1 point by brrrrrm, 7mo ago | 0 comments
6. Should we apply old-school multi-core scheduling to GPUs? (jott.live) | 4 points by brrrrrm, 7mo ago | 0 comments
7. Show HN: GT: experimental multiplexed distributed tensor framework (github.com) | 4 points by brrrrrm, 7mo ago | 0 comments
8. GT – Experimental multiplexing tensor framework for distributed GPU computing (github.com) | 30 points by brrrrrm, 7mo ago | 1 comment
9. MFU Is Poorly Approximating Billions of Dollars in Compute (jott.live) | 4 points by brrrrrm, 9mo ago | 0 comments
11. SWE-bench verified agents may look at future repository state (github.com) | 4 points by brrrrrm, 9mo ago | 0 comments