Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
janalsncm
1y ago
0 comments
Save
Share
You mentioned it took 100 gpu hours, what gpu did you train on?
0 comments
1 comments · 1 top-level
top
newest
oldest
ollin
1y ago
Mostly 1xA10 (though I switched to 1xGH200 briefly at the end, lambda has a sale going). The network used in the post is very tiny, but I had to train a really long time w/ large batch to get somewhat-stable results.
j
/
k
navigate · click thread line to collapse