Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
dominotw
29d ago
0 comments
Save
Share
i dont understand the nuances here. what does this mean. 4.8 is trained on same model as previous one then? what does brand new mean.
0 comments
2 comments · 1 top-level
top
newest
oldest
irthomasthomas
29d ago
· 1 in thread
It means for 4.7 they trained a new base model with different architecture, different pre-training data (later knowledge cutoff), and a new tokenizer. Vs finetuning an existing model, which was the case for 4.6, and probably for 4.8.
dominotw
OP
29d ago
do you mean pre training? so 4.8 is just post training of an old pretrained model?
btw where do they tell you how they trained the model.
j
/
k
navigate · click thread line to collapse