> When we do land on something, if it affects existing subscribers you'll get plenty of notice before anything changes. Will hear it from us, not a screenshot on X or Reddit.
If you don't want things like this spreading through screenshots of X and Reddit, don't run "tests" like this in the first place!
(Also "if it affects existing subscribers" is a cop-out, I need to know the pricing of Claude Code for NEW subscribers if I'm going to adopt it at a company with a growing team, or recommend it to other people, write tutorials etc.)
A/B testing people without their informed consent is immoral, unethical, and should be illegal.
I can't trust Anthropic to manage their products in a way that supports my workflow.
ive been trying to make the case all year that if we're going to let employees do shit with ai, lets try claude. in the past like.. 2-3 weeks all that goodwill has basically evaporated.
local inference needs to take off asap because all of these entities actually suck and i wouldn't trust a single sla with anthropic. they are not acting like a serious company right now, this is a joke.
No serious business uses Pro or Max, they are all on Anthropic API billing.
In fact with this move it is plainly obvious that Anthropic is moving compute from prosumers towards enterprise.
With Sonnet it's a bit better, but I can get the same performance with GPT-5.4.
Now I'm pretty much paying the 20€ for Claude Pro so it can plan/review stuff and then I use pi.dev + GPT-5.4 for the actual work.
That said, I seem to be caught in that 2% test if I open in a private tab. What nonsense. I wouldn't be paying for Claude if it wasn't for its quality abilities, which necessarily includes Claude Code.
I agree, but can you really use Claude Code on the Pro plan as a full time developer, or professional 'knowledge worker' without hitting the usage limits fairly early in the day anyway?
I'm in the academia, and Claude's performance in my field could be described as a very fast junior grad student. When I use Claude Code, I typically spend a few hours figuring out what needs to be done exactly, and describing it in sufficient detail. Then Claude does it in 30 minutes, while an actual student would need days. And then I spend anything from minutes to days evaluating the results, depending on if it needs to be tested with real data and how much weirdness those tests uncover.
But I also have other work to do beyond guiding the automated grad student. Which means my Claude Code usage rarely exceeds 1–2 hours/week.
I have Pro Claude, Plus GPT and Pro Gemini. When one runs out I switch to another project on the next LLM. If I really need a task finished I'll restart it on another LLM, but I'm loathe to do that as it eats tokens just getting back up to speed.
It seems weird to segment this way though. Surely it’s better to just give Sonnet to your bottom tier, rather than cut out the entire Claide Code product entirely?
Give folks a taste rather than lock the whole product behind a $100/mo plan.
But I think you are right, as long as Codex and Gemini are cheap alternatives then vs. 1yo models isn’t the correct comp.
Then it’s probably better to just resegment the whole Claude Code product as an enterprise only tier. (That also has the advantage of kicking out all the Claw subscribers that screw over the token limit economics for normal $20/mo users.)
Has this ever been true? You will almost always see some anecdotal screenshot a long time before any company would rat on themselves.
Yes the random screenshots include a lot of false positives. But official comms have a lot of their own problems given how companies behave nowadays.
Subscriptions aren’t for company use.