I didn’t check how much this costs, but if you use AI locally a lot, it’s going to be amortised pretty quickly. Burning 100$ a month on tokens has become insanely easy. I remember when it was unimaginable for me…
You top out at 20 tokens per second on hardware with memory bandwidth this low for any local model actually worth using. Doing the maths, it’s not financially worth it. Only worth it for privacy and control reasons.
I don’t understand your calculation, can you elaborate? At 25USD/Mtk output, assuming your 20tk/s, I generated/saved (minus power costs) ~15k$ in a year.
Granted, it won’t run 24/7, but over a couple of years, this is definitely cheaper.
This won’t be able to run any of the cutting edge models. And the models it can run can be served from cloud providers for very cheap - like <$1 per million tokens for the latest deepseek.
It’d take many years to break even on your $6000 investment, meanwhile better and better models will come out that the DGX can’t run.