Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
pdyc
2mo ago
0 comments
Save
Share
can you elaborate? you can use quantized version, would context still be an issue with it?
0 comments
2 comments · 2 top-level
top
newest
oldest
abhikul0
2mo ago
A usable quant, Q5_KM imo, takes up ~26GB[0], which leaves around ~6-7GB for context and running other programs which is not much.
[0]
https://huggingface.co/unsloth/Qwen3.5-35B-A3B-GGUF?show_fil...
nickthegreek
2mo ago
context is always an issue with local models and consumer hardware.
1 more reply
j
/
k
navigate · click thread line to collapse