Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
vessenes
8mo ago
0 comments
Save
Share
This is the spec for a training node. The inference requires 80GB of VRAM, so significantly less compute.
0 comments
1 comments · 1 top-level
top
newest
oldest
andai
8mo ago
The default model is ~0.5B params right?
j
/
k
navigate · click thread line to collapse