Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
Kat-Dev-32B, Kat-Coder with Scalable Agentic RL
(opens in new tab)
(kwaipilot.github.io)
1 points
robert-zaremba
8mo ago
1 comments
Save
Share
1 comments
1 comments · 1 top-level
top
newest
oldest
robert-zaremba
OP
8mo ago
KAT-Dev-32B and KAT-Coder are optimized via several stages of training, including a mid-training stage, supervised fine-tuning (SFT) & reinforcement fine-tuning (RFT) stage and an large-scale agentic reinforcement learning (RL) stage.
j
/
k
navigate · click thread line to collapse