I don't understand AI TOPS. Per NVidia:
* 5090: 3352
* 4090: 1321
* 3090 Ti: 320
* 3090: 285
* 3060 Ti: 130
* 2080 Ti: 114
source: https://www.nvidia.com/en-us/geforce/graphics-cards/50-serie...
What's increased tenfold since Ampere? Can someone explain this to me and how it impacts real-world performance?
FWIW, it appears the best homelab set up would still be 2-4x 3090 if you want VRAM for LLMs, but a single 5090 would likely be the best in class for prosumers on any less VRAM heavy tasks such as image / video generation or deep RL research
> … Jan. 30 at $1,999 and $999, respectively.
At least we are getting more VRAM and wider memory width at 32G and 512-bits, respectively [1]
I’ll still wait for the reviews though.
[1] https://www.nvidia.com/en-us/geforce/graphics-cards/compare/
was expecting 32/24/24/16 not 32/16/16/12 like wtf