Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
delis-thumbs-7e
1mo ago
0 comments
Save
Share
Wouldn’t that be extremely computationaly expensive considering how resource incentive training is?
0 comments
2 comments · 1 top-level
top
newest
oldest
colechristensen
1mo ago
· 1 in thread
No, training a state of the art model involves training on the order of 10 trillion tokens.
We're talking about a step that updates weights based on say between 10k and 1M tokens.
delis-thumbs-7e
OP
1mo ago
I learned something. Thank you!
j
/
k
navigate · click thread line to collapse