Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
Mosaic trained a 1B parameter model on 440 GPUs for 200B tokens
(opens in new tab)
(huggingface.co)
2 points
ovaistariq
3y ago
0 comments
Save
Share
0 comments
No comments yet.