Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
srush
4mo ago
0 comments
Share
Our primary focus is on RL post-training. We think that is the best way to get the model to be a strong interactive agent.
0 comments
default
newest
oldest
comex
4mo ago
So, yes, but you won’t say what the base model is? :)
typpilol
4mo ago
It seems like a sort of sonnet model as a lot of people are reporting it like to spam documentation on Twitter like sonnet 4.5
j
/
k
navigate · click thread line to collapse