2. Show HN: Kilroy – Knowledge base for teams using Claude Code (github.com)
   5 points by t55 24d ago | 0 comments
7. ReasoningGym: Reasoning Environments for RL with Verifiable Rewards (arxiv.org)
   105 points by t55 11mo ago | 28 comments
11. D1: Scaling Reasoning in Diffusion LLMs via Reinforcement Learning (dllm-reasoning.github.io)
   4 points by t55 1y ago | 0 comments
13. Block Diffusion: Interpolating Autoregressive and Diffusion Language Models (m-arriola.com)
   72 points by t55 1y ago | 16 comments