Because GPUs in data centers chew through power and take up space. If in 3 years there is a new model that processes far more tokens for the same power and time, the economics quickly say it is cheaper to replace the hardware than to keep running it.
As a hobbyist at home the numbers are different, and you can afford to do something inefficient.
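To put rough numbers on that, here is a back-of-envelope sketch. Every figure in it (power draw, tokens per second, electricity price, rack overhead, card price, hobbyist usage) is made up for illustration, and `cost_per_million_tokens` is just a helper name I picked, so plug in your own values.

```python
HOURS_3_YEARS = 3 * 365 * 24  # assumes the card runs 24/7 for three years


def cost_per_million_tokens(power_kw, tokens_per_sec, price_per_kwh,
                            overhead_per_hour=0.0, hardware_per_hour=0.0):
    """Dollars per million tokens: energy + space/cooling overhead + amortized hardware."""
    hourly_cost = power_kw * price_per_kwh + overhead_per_hour + hardware_per_hour
    tokens_per_hour = tokens_per_sec * 3600
    return hourly_cost / tokens_per_hour * 1_000_000


# Hypothetical old card: already paid for, so only power and rack overhead count.
old_gpu = cost_per_million_tokens(0.7, 1000, 0.10, overhead_per_hour=0.50)

# Hypothetical new card: same power, 5x the tokens, but a $30,000 purchase
# spread over three years of continuous use.
new_gpu = cost_per_million_tokens(0.7, 5000, 0.10, overhead_per_hour=0.50,
                                  hardware_per_hour=30_000 / HOURS_3_YEARS)

# Hypothetical hobbyist: 0.3 kW card, $0.15/kWh, 2 hours a day, no rack costs.
hobbyist_per_year = 0.3 * 0.15 * 2 * 365

print(f"old card: ${old_gpu:.3f} per million tokens")
print(f"new card: ${new_gpu:.3f} per million tokens")
print(f"hobbyist electricity: ${hobbyist_per_year:.2f} per year")
```

With these invented inputs the new card comes out at roughly $0.10 per million tokens against $0.16 for the old one, even after paying for it. The hobbyist's whole electricity bill is a few tens of dollars a year, so there is not much to save by upgrading.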