/model claude-opus-4-6[1m]
Edit: And honestly, I can't say I see much improvement for a "doubling in price".
The owner of this casino (Anthropic) decided to upgrade the slot machine (Claude Code) with a new random number generator engine (Opus 4.7) and decommissioned the old one (Opus 4.6).
They claim that the new engine (Opus 4.7) gives better odds and generates results faster with every press of the button, with a catch that it takes more credits on every turn.
The house (Anthropic) always wins.
its true of gpt5, its true of why antigravity rate limits, and its true of claude.
there is no possible action thats not user hostile when you dont have enough compute. You either reject customers which is hostile and kills your companies growth or you do the best you can with what you have.
its not like these companies under built, they're building as fast as they can.
If costs are drowning them, they should rather be a bit more honest about the messaging and hike prices of their lower ($20 / $100) tier by ~$10 and enforce stricter limits (optionally).
I liked 4.5 and 4.6, but none of them are available outside of using API now - and thats a bummer given the 2~2.5X overall costs
I get these models are getting huge and costs are quickly adding up but at least be transparent. GitHub Copilot straight up said hey we’re not going to offer these anymore get Pro+ for 4.7. If nothing else you can use the overlap time to test and switch.
Any action on the part of the model provider, which is not completely in the direction of more money for less work, is actually working against the interest of the provider, the shareholders, and various investors. Any such action would open the specific employees or parties taking that non-profit maximizing action to official review, reversal of action, and possible reduction in responsibility or termination of duties. That is completely within the job description of employment at any of the places, including Anthropic.
So, users, be informed, the model providers are not allowed to take user's best interest in to scope for making utility decisions.
The company will at it's own discretion; reduce customers' allowance of capability, reduce the computational cost for performing the same actions at a later date, reduce the precision and utility of results iteratively as the number of users increases overall demand. Model providers will showcase higher performance and precision to non-paying 'potential customers' and then nominally deliver increasingly reduced performance up-to the point of a specific and acceptable ratio of account cancellations.
This is specifically in order for the model providers to extract as much coin for as little work as possible: that is the purpose of the many profitability matrices and customer load balancing 'knobs' that are run using the very best models. It is a given and very reasonable to expect an explicit requirement from the top that the internally facing model capacity is NEVER degraded for the tasks of increasing revenue per unit of work; that is, to make as much as they can at any point in time, and make more at every point in time than they did before.
The responsible use of these models should be; use the best performance (i.e. the free tier) to assist in the development of infrastructure to run models which are actually under your control; either on in-house hardware, or on leased VPS instances with specific hardware and performance guarantees. Then use models under one's own control to develop stable and irrevocably performant workflows.
Any reliance on public facing subscription models for actual work is 100% guaranteed to deteriorate in functionality over time and waste money trying to maintain performance as measured by cost per token over time. That is the fact of the matter.
/<today's free lesson>