Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
My $600 Mac Mini Runs a 35B AI Model
(opens in new tab)
(thoughts.jock.pl)
4 points
danebalia
2mo ago
3 comments
Save
Share
3 comments
3 comments · 1 top-level
top
newest
oldest
bigyabai
2mo ago
· 2 in thread
> The 35B Trick (Your SSD Is the New GPU Memory)
Wave "bye bye" to your write cycles.
RobMurray
2mo ago
why? it's mostly reads. the weights are static.
bigyabai
2mo ago
llama-cpp's process is, but macOS itself will swap hard when 10-14gb of memory is paged for LLM inference. Dense models especially would thrash zram.
j
/
k
navigate · click thread line to collapse