Another option is to download and compile llama.cpp, which should let you run quantized models at an acceptable speed.
https://github.com/ggerganov/llama.cpp
Also, if you can spend $60 on another 32GB of RAM, that will let you run the 30GB models quite nicely.
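
If you want a rough sanity check before buying RAM, here is a minimal Python sketch of the arithmetic. The function names, the 4 GB headroom for the OS and context buffers, and the example figures are my own assumptions for illustration; none of this comes from llama.cpp itself, and real memory use will vary with the model and settings.

```python
def quantized_size_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes).

    Each weight takes bits_per_weight / 8 bytes, so a model with
    n_params_billion * 1e9 weights needs that many GB for the weights.
    """
    return n_params_billion * bits_per_weight / 8


def fits_in_ram(model_gb: float, ram_gb: float, headroom_gb: float = 4.0) -> bool:
    """Rough check: the model plus some headroom must fit in total RAM.

    headroom_gb is a guess at what the OS and runtime buffers need;
    adjust it for your own machine.
    """
    return model_gb + headroom_gb <= ram_gb


if __name__ == "__main__":
    # A 30 GB model file on a 32 GB machine vs. a 64 GB machine.
    print(fits_in_ram(30, 32))  # False: no room left for anything else
    print(fits_in_ram(30, 64))  # True
    # Weights-only size of a hypothetical 30-billion-parameter model at 4 bits.
    print(quantized_size_gb(30, 4))  # 15.0
```

The point is just that the model file size plus some working room has to fit in memory, which is why the extra 32GB makes the difference for models around 30GB.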