Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
regularfry
1y ago
0 comments
Save
Share
Qwen2.5 has a 32B release, and quantised at q5_k_m it *just about" completely fills a 4090.
It's a good model, too.
0 comments
2 comments · 1 top-level
top
newest
oldest
kristianp
1y ago
· 1 in thread
Do you also need space for context on the card to get decent speed though?
regularfry
OP
1y ago
Depends how much you need. Dropping to q4_k_m gives you 3GB back if that makes the difference.
j
/
k
navigate · click thread line to collapse