I understand that. I am saying there is a clear cliff where the value of an LLM reaches 0.
At 1t/s you are never going to get anything done. At 6t/s, it significantly degrades the experience, with one mistake setting you back 20-30 minutes.
At ~20t/s it's much more usable.
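
To put rough numbers on that, here is a back-of-envelope sketch. The 9,000-token response length is my own assumption, picked only for illustration, and it ignores prompt processing time, so treat the output as ballpark figures rather than measurements.

```python
def wait_minutes(tokens: int, tokens_per_second: float) -> float:
    """Minutes needed to generate `tokens` at a steady decode rate."""
    return tokens / tokens_per_second / 60


# Hypothetical response length, chosen for illustration only.
RESPONSE_TOKENS = 9_000

for rate in (1, 6, 20):
    print(f"{rate:>2} t/s -> {wait_minutes(RESPONSE_TOKENS, rate):.1f} min")
# Prints:
#  1 t/s -> 150.0 min
#  6 t/s -> 25.0 min
# 20 t/s -> 7.5 min
```

Under that assumption, a single redo costs about 25 minutes at 6t/s but only 7.5 minutes at 20t/s, which is roughly where retrying stops being painful.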