1Designing dev onboarding for an agent-first world (opens in new tab)(castform.com)2kumama3d ago0Save
4Prompt caching but for RL – 7.5x speedup on long-prompt/short-response workloads (opens in new tab)(castform.com)4kumama1mo ago0Save
5Pokegents: Making multi-agent coding feel like a team (opens in new tab)(castform.com)8kumama1mo ago1Save
6Grpo explained: group relative policy optimization for LLM finetuning (opens in new tab)(cgft.io)1kumama2mo ago0Save
9RAG to riches: synthetic data for training RAG agents (opens in new tab)(cgft.io)2kumama3mo ago0Save
11Show HN: Benchmax, a new open-source RL environment framework for LLM finetuning (opens in new tab)(github.com)GitHub1kumama11mo ago0Save
12Beating o3/o4-mini with Codebase-specific Reinforcement Learning (opens in new tab)(cgft.io)3kumama1y ago0Save
13We might be overestimating coding agent performance on SWE-Bench (opens in new tab)(cgft.io)1kumama1y ago1Save
14How to Improve Code Completion LLMs with Repo-Specific Finetuning (opens in new tab)(cgft.io)3kumama1y ago1Save
15Show HN: Free AI Code Completion for Xcode with model choice/codebase context (opens in new tab)(cgft.io)2kumama1y ago0Save