undefined | Better HN

0 pointssimonw2mo ago0 comments

They later said: https://twitter.com/TheAmolAvasare/status/204672549859272297...

> When we do land on something, if it affects existing subscribers you'll get plenty of notice before anything changes. Will hear it from us, not a screenshot on X or Reddit.

If you don't want things like this spreading through screenshots of X and Reddit, don't run "tests" like this in the first place!

(Also "if it affects existing subscribers" is a cop-out, I need to know the pricing of Claude Code for NEW subscribers if I'm going to adopt it at a company with a growing team, or recommend it to other people, write tutorials etc.)

0 comments

30 comments · 10 top-level

minimaxir2mo ago· 7 in thread

A/B tests only work if the subjects don't realize they are in a A/B test.

abtinf2mo ago

Perhaps vibe coding the A/B testing engine isn't the best idea.

inetknght2mo ago

Solution: don't A/B test your users.

A/B testing people without their informed consent is immoral, unethical, and should be illegal.

skeledrew2mo ago

To play devil's advocate, without A/B testing a lot of decisions would be made with insufficient relevant data, and lead to subpar results that affect the many negatively form the road.

2 more replies

cycomanic2mo ago

So you're saying software should never change or you're happy with A testing, but not A/B testing

1 more reply

zorobo2mo ago

It is necessary to have a control group, just as in trials for new drugs.

1 more reply

vehemenz2mo ago

Depends entirely on the stakes and whether personal data is involved

1 more reply

shimman2mo ago

Agreed and I can't wait until they regulate this stuff out of existence. It's absolutely hostile software technique that is deeply anti-human.

abtinf2mo ago· 6 in thread

That tweet only makes things worse. On top of all their other nonsense recently, it actually convinced me to cancel my subscription.

I can't trust Anthropic to manage their products in a way that supports my workflow.

trueno2mo ago

pretty much none of these big providers are offering the guarantees needed to be taken seriously in workplaces right now. the technology itself isn't offering the deterministic guarantees that should warrant it in the workplace right now. problem is everyone's foot is just on the gas. even if your workplace isnt paying for it, people are just straight up rolling their own personal claude accounts to do work at orgs.

ive been trying to make the case all year that if we're going to let employees do shit with ai, lets try claude. in the past like.. 2-3 weeks all that goodwill has basically evaporated.

local inference needs to take off asap because all of these entities actually suck and i wouldn't trust a single sla with anthropic. they are not acting like a serious company right now, this is a joke.

adam_patarino2mo ago

What are you guys subscribed to if not Claude? Copilot? Or is everyone legit bringing their own license?

4 more replies

solenoid09372mo ago

Anthropic is absolutely taken seriously in workplaces, what are you even talking about?

No serious business uses Pro or Max, they are all on Anthropic API billing.

In fact with this move it is plainly obvious that Anthropic is moving compute from prosumers towards enterprise.

4 more replies

kelsey987654312mo ago

I just cancelled before seeing this news. i was already pissed about constantly hitting limits on the 20 a month plan and looking for alternatives and this seals the deal. Bye bye!

theshrike792mo ago

Yea, I've been fine so far, but something happened with Opus 4.6 and especially 4.7. I was able to do some actual work with a Pro plan before. Now it's just pure anxiety of hitting the limits.

With Sonnet it's a bit better, but I can get the same performance with GPT-5.4.

Now I'm pretty much paying the 20€ for Claude Pro so it can plan/review stuff and then I use pi.dev + GPT-5.4 for the actual work.

anakaine2mo ago

I just paid for Pro for the first time 24 hours ago. Its been great, but the limits are crazy. It's nice not dealing with ChatGPTs sycophantic gaslighting, and not having random bugs.

That said, I seem to be caught in that 2% test if I open in a private tab. What nonsense. I wouldn't be paying for Claude if it wasn't for its quality abilities, which necessarily includes Claude Code.

1 more reply

helsinkiandrew2mo ago· 4 in thread

> I need to know the pricing of Claude Code for NEW subscribers if I'm going to adopt it at a company with a growing team.

I agree, but can you really use Claude Code on the Pro plan as a full time developer, or professional 'knowledge worker' without hitting the usage limits fairly early in the day anyway?

jltsiren2mo ago

It depends on the kind of work you do.

I'm in the academia, and Claude's performance in my field could be described as a very fast junior grad student. When I use Claude Code, I typically spend a few hours figuring out what needs to be done exactly, and describing it in sufficient detail. Then Claude does it in 30 minutes, while an actual student would need days. And then I spend anything from minutes to days evaluating the results, depending on if it needs to be tested with real data and how much weirdness those tests uncover.

But I also have other work to do beyond guiding the automated grad student. Which means my Claude Code usage rarely exceeds 1–2 hours/week.

atraac2mo ago

I use Pro professionally and didn't hit limits most of the time. I believe I used up 5hr quota once or twice. We switched to Team sub and I'm on Standard(which is Pro x1.25 I believe). I don't vibecode entire applications, I ask it to make boilerplate, smaller, well scoped features or fix some errors. I don't let it go off with a prompt "make another netflix clone" cause I just don't see any real value in that

qingcharles2mo ago

Just the Pro Plan Claude Code on its own? Maybe you could last a full day on just using Sonnet. Maybe one Opus dab in the morning to plan your Haiku/Sonnet day?

I have Pro Claude, Plus GPT and Pro Gemini. When one runs out I switch to another project on the next LLM. If I really need a task finished I'll restart it on another LLM, but I'm loathe to do that as it eats tokens just getting back up to speed.

ulimn2mo ago

I think it's more about how they approach their users in general that is the problem here.

theptip2mo ago· 3 in thread

It’s pretty reasonable to say “demand is way up, quality is up, supply is constrained, and so price needs to rise”.

It seems weird to segment this way though. Surely it’s better to just give Sonnet to your bottom tier, rather than cut out the entire Claide Code product entirely?

Give folks a taste rather than lock the whole product behind a $100/mo plan.

mewpmewp22mo ago

But if Sonnet is bad it would give bad impression of the product, no? And it also takes compute, so you give a bad hallucinating impression of your product while still losing compute.

theptip2mo ago

It’s not bad though, it’s crazy good in comparison to any model older than 1y old. If you don’t have access to any vibe coding at all it’s gonna be life changing.

But I think you are right, as long as Codex and Gemini are cheap alternatives then vs. 1yo models isn’t the correct comp.

Then it’s probably better to just resegment the whole Claude Code product as an enterprise only tier. (That also has the advantage of kicking out all the Claw subscribers that screw over the token limit economics for normal $20/mo users.)

wobfan2mo ago

I mean, this is why they do A/B testing. This way of testing stuff is not new at all, people who act genuinely surprised need to do a reality check. Companies want to maximize profit. They do this by testing what creates the biggest profit. A/B Testing is one of the ways to do this, and it has been used for decades in precisely this way.

kingstnap2mo ago

> Will hear it from us, not a screenshot on X or Reddit.

Has this ever been true? You will almost always see some anecdotal screenshot a long time before any company would rat on themselves.

Yes the random screenshots include a lot of false positives. But official comms have a lot of their own problems given how companies behave nowadays.

ochronus2mo ago

Haha, right, just like the recent uncommunicated changes to limits, cache, etc.

rsynnott2mo ago

A screenshot is, however, apparently good enough for _new_ subscribers.

adastra222mo ago

> I need to know the pricing of Claude Code for NEW subscribers if I'm going to adopt it at a company with a growing team

Subscriptions aren’t for company use.

nkotov2mo ago

Just like they gave plenty of notice regarding OpenClaw?

sally_glance2mo ago

Maybe a silly bet where the head of sales had 1-2 glasses of wine too much... "I bet they will still pay us 20 bucks/mo without CC! Don't believe me? I'm going to prove it!"

j / k navigate · click thread line to collapse

0 comments

30 comments · 10 top-level

minimaxir2mo ago· 7 in thread

A/B tests only work if the subjects don't realize they are in a A/B test.

abtinf2mo ago

Perhaps vibe coding the A/B testing engine isn't the best idea.

inetknght2mo ago

Solution: don't A/B test your users.

A/B testing people without their informed consent is immoral, unethical, and should be illegal.

skeledrew2mo ago

To play devil's advocate, without A/B testing a lot of decisions would be made with insufficient relevant data, and lead to subpar results that affect the many negatively form the road.

2 more replies

cycomanic2mo ago

So you're saying software should never change or you're happy with A testing, but not A/B testing

1 more reply

zorobo2mo ago

It is necessary to have a control group, just as in trials for new drugs.

1 more reply

vehemenz2mo ago

Depends entirely on the stakes and whether personal data is involved

1 more reply

shimman2mo ago

Agreed and I can't wait until they regulate this stuff out of existence. It's absolutely hostile software technique that is deeply anti-human.

abtinf2mo ago· 6 in thread

That tweet only makes things worse. On top of all their other nonsense recently, it actually convinced me to cancel my subscription.

I can't trust Anthropic to manage their products in a way that supports my workflow.

trueno2mo ago

ive been trying to make the case all year that if we're going to let employees do shit with ai, lets try claude. in the past like.. 2-3 weeks all that goodwill has basically evaporated.

adam_patarino2mo ago

What are you guys subscribed to if not Claude? Copilot? Or is everyone legit bringing their own license?

4 more replies

solenoid09372mo ago

Anthropic is absolutely taken seriously in workplaces, what are you even talking about?

No serious business uses Pro or Max, they are all on Anthropic API billing.

In fact with this move it is plainly obvious that Anthropic is moving compute from prosumers towards enterprise.

4 more replies

kelsey987654312mo ago

I just cancelled before seeing this news. i was already pissed about constantly hitting limits on the 20 a month plan and looking for alternatives and this seals the deal. Bye bye!

theshrike792mo ago

Yea, I've been fine so far, but something happened with Opus 4.6 and especially 4.7. I was able to do some actual work with a Pro plan before. Now it's just pure anxiety of hitting the limits.

With Sonnet it's a bit better, but I can get the same performance with GPT-5.4.

Now I'm pretty much paying the 20€ for Claude Pro so it can plan/review stuff and then I use pi.dev + GPT-5.4 for the actual work.

anakaine2mo ago

I just paid for Pro for the first time 24 hours ago. Its been great, but the limits are crazy. It's nice not dealing with ChatGPTs sycophantic gaslighting, and not having random bugs.

1 more reply

helsinkiandrew2mo ago· 4 in thread

> I need to know the pricing of Claude Code for NEW subscribers if I'm going to adopt it at a company with a growing team.

I agree, but can you really use Claude Code on the Pro plan as a full time developer, or professional 'knowledge worker' without hitting the usage limits fairly early in the day anyway?

jltsiren2mo ago

It depends on the kind of work you do.

But I also have other work to do beyond guiding the automated grad student. Which means my Claude Code usage rarely exceeds 1–2 hours/week.

atraac2mo ago

qingcharles2mo ago

Just the Pro Plan Claude Code on its own? Maybe you could last a full day on just using Sonnet. Maybe one Opus dab in the morning to plan your Haiku/Sonnet day?

ulimn2mo ago

I think it's more about how they approach their users in general that is the problem here.

theptip2mo ago· 3 in thread

It’s pretty reasonable to say “demand is way up, quality is up, supply is constrained, and so price needs to rise”.

It seems weird to segment this way though. Surely it’s better to just give Sonnet to your bottom tier, rather than cut out the entire Claide Code product entirely?

Give folks a taste rather than lock the whole product behind a $100/mo plan.

mewpmewp22mo ago

But if Sonnet is bad it would give bad impression of the product, no? And it also takes compute, so you give a bad hallucinating impression of your product while still losing compute.

theptip2mo ago

It’s not bad though, it’s crazy good in comparison to any model older than 1y old. If you don’t have access to any vibe coding at all it’s gonna be life changing.

But I think you are right, as long as Codex and Gemini are cheap alternatives then vs. 1yo models isn’t the correct comp.

wobfan2mo ago

kingstnap2mo ago

> Will hear it from us, not a screenshot on X or Reddit.

Has this ever been true? You will almost always see some anecdotal screenshot a long time before any company would rat on themselves.

Yes the random screenshots include a lot of false positives. But official comms have a lot of their own problems given how companies behave nowadays.

ochronus2mo ago

Haha, right, just like the recent uncommunicated changes to limits, cache, etc.

rsynnott2mo ago

A screenshot is, however, apparently good enough for _new_ subscribers.

adastra222mo ago

> I need to know the pricing of Claude Code for NEW subscribers if I'm going to adopt it at a company with a growing team

Subscriptions aren’t for company use.

nkotov2mo ago

Just like they gave plenty of notice regarding OpenClaw?

sally_glance2mo ago

Maybe a silly bet where the head of sales had 1-2 glasses of wine too much... "I bet they will still pay us 20 bucks/mo without CC! Don't believe me? I'm going to prove it!"

j / k navigate · click thread line to collapse