0 points
jazzyjackson
1y ago
0 comments
I'm returning my 96GB M2 Max. It can run unquantized Llama 3.3 70B, but tokens per second is slow as molasses, and I still couldn't find any use for it. I just kept going back to Perplexity when I actually needed to find an answer to something.
1 comment · 1 top-level
Tepix
1y ago
Interesting. You're using the FP8 version, I'm guessing? How many tokens/s are you getting, and with which software? MLX?