Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
redox99
6mo ago
0 comments
Save
Share
I think it's more likely to be the old base model checkpoint further trained on additional data.
0 comments
1 comments · 1 top-level
top
newest
oldest
jumploops
6mo ago
Is that technically not a new pretrained model?
(Also not sure how that would work, but maybe I’ve missed a paper or two!)
1 more reply
j
/
k
navigate · click thread line to collapse