1. Nanocode: The best Claude Code that $200 can buy in pure JAX on TPUs (github.com) | 219 points by desideratum | 1mo ago | 26 comments
4. Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training (huggingface.co) | 3 points by desideratum | 9mo ago | 0 comments
5. Training LLMs with GRPO and Interpreter Feedback Using WebAssembly (huggingface.co) | 3 points by desideratum | 1y ago | 0 comments
6. Training Large Language Models with Interpreter Feedback Using WebAssembly (huggingface.co) | 1 point by desideratum | 1y ago | 0 comments
8. Training Process Reward Models in Axolotl (axolotlai.substack.com) | 2 points by desideratum | 1y ago | 0 comments
9. Torchtune – a native PyTorch library for fine-tuning LLMs (github.com) | 2 points by desideratum | 1y ago | 0 comments
10. (Deep Learning Based) Opportunistic Screening to Improve Statin Rates (ahajournals.org) | 1 point by desideratum | 2y ago | 0 comments
11. The theory of Proximal Policy Optimisation implementations (salmanmohammadi.github.io) | 1 point by desideratum | 2y ago | 0 comments