Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
Avatarl: Training language models from scratch with pure reinforcement learning
(opens in new tab)
(tokenbender.com)
9 points
Gusarich
10mo ago
0 comments
Save
Share
0 comments
No comments yet.