Getting lots of RAM will let you run large models on the CPU, but it will be very slow.
The Apple Silicon Macs have memory that is shared between the CPU and GPU, which lets the GPU (relatively underpowered compared to a decent Nvidia GPU) run these models at decent speeds, compared with a CPU, when using llama.cpp.
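For a rough sense of whether a model fits in a given amount of memory at all, here is a back-of-the-envelope sketch. It only counts the weights (parameters times bits per weight), and the `overhead_fraction` padding for context and runtime buffers is my own made-up assumption, not a number from llama.cpp. The function names are hypothetical too.

```python
def weight_footprint_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


def fits(n_params: float, bits_per_weight: float, memory_gib: float,
         overhead_fraction: float = 0.2) -> bool:
    """Rough check: do the weights plus some padding fit in memory_gib?

    overhead_fraction is an assumed fudge factor for context and
    runtime buffers; real overhead depends on the setup.
    """
    needed = weight_footprint_gib(n_params, bits_per_weight) * (1 + overhead_fraction)
    return needed <= memory_gib


if __name__ == "__main__":
    # A 7B-parameter model at 16 bits vs. 4 bits per weight.
    for bits in (16, 4):
        print(f"7B @ {bits}-bit: {weight_footprint_gib(7e9, bits):.1f} GiB")
    # Does a 70B-parameter model fit in 64 GiB at 4 bits per weight?
    print("70B @ 4-bit in 64 GiB:", fits(70e9, 4, 64))
```

This says nothing about speed, only about whether the thing loads. On a Mac the same pool of memory serves both CPU and GPU, so the number to compare against is the machine's total memory minus whatever the OS and other apps need.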
This should all get dramatically better/faster/cheaper within a few years, I suspect. Capitalism will figure this one out.