Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
adastra22
6mo ago
0 comments
Save
Share
Again, memory bandwidth is pretty much all that matters here. During inference or training the CUDA cores of retail GPUs are like 15% utilized.
0 comments
1 comments · 1 top-level
top
newest
oldest
my123
6mo ago
Not for prompt processing. Current Macs are really not great at long contexts
j
/
k
navigate · click thread line to collapse