Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
arthurcolle
3y ago
0 comments
Save
Share
I got it running using Colab Pro+ (immediately got a V100 40GB VRAM GPU) - the 7B model works with batch size of 8 and a max seq len of 1024
0 comments
2 comments · 1 top-level
top
newest
oldest
koheripbal
3y ago
· 1 in thread
Sure, but the real value here is the 65B. Can you have multiple GPUs on colab?
arthurcolle
OP
3y ago
I can't even get the 13B on colab to do inference with a very small sequence length.
j
/
k
navigate · click thread line to collapse