The hardware required to run these things has all ballooned in price, there are no efficiencies coming. To run Kimi2.5 4bit you're sitll spending 100k in hardware, and its not nearly as reliable as Claude. Also Agentic Tooling have made their token consumption go up to increase revenue, and models are becoming more verbose in their output (wonder why). You're smoking something.