undefined | Better HN

0 pointsdvfjsdhgfv2mo ago0 comments

I believe the current game everybody plays is:

* make sure the model maxes out all benchmarks

* release it

* after some time, nerf it

* repeat the same with the next model

However, the net sum is positive: in general, models from 2026 are better than those from 2024.

0 comments

14 comments · 2 top-level

_blk2mo ago· 7 in thread

yup, after the token-increase from CC from two weeks ago, I'm now consistently filling the 1M context window that never went above 30-40% a few days ago. Did they turn it off? I used to see the Co-Authored by Opus 4.6 (1M Context Window) in git commits, now the advert line is gone. I never turned it on or off, maybe the defaults changed but /model doesn't show two different context sizes for Opus 4.6

I never asked for a 1M context window, then I got it and it was nice, now it's as if it was gone again .. no biggie but if they had advertised it as a free-trial (which it feels like) I wouldn't have opted in.

Anyways, seems I'm just ranting, I still like Claude, yes but nonetheless it still feels like the game you described above.

troupo2mo ago

They are now literally blaming users for using their product as advertised:

https://x.com/lydiahallie/status/2039800718371307603

--- start quote ---

Digging into reports, most of the fastest burn came down to a few token-heavy patterns. Some tips:

• Sonnet 4.6 is the better default on Pro. Opus burns roughly twice as fast. Switch at session start.

• Lower the effort level or turn off extended thinking when you don't need deep reasoning. Switch at session start.

• Start fresh instead of resuming large sessions that have been idle ~1h

• Cap your context window, long sessions cost more CLAUDE_CODE_AUTO_COMPACT_WINDOW=200000

--- end quote ---

https://x.com/bcherny/status/2043163965648515234

--- start quote ---

We defaulted to medium [reasoning] as a result of user feedback about Claude using too many tokens. When we made the change, we (1) included it in the changelog and (2) showed a dialog when you opened Claude Code so you could choose to opt out. Literally nothing sneaky about it — this was us addressing user feedback in an obvious and explicit way.

--- end quote ---

torginus2mo ago

Off topic, but I found Sonnet useless. It can't do the simplest tasks, like refactoring a method signature consistently across a project or following instructions accurately about what patterns/libraries should be used to solve a problem.

1 more reply

dr_kiszonka2mo ago

The default prompt cache TTL changed from 1 hour to 5 minutes. Maybe this is what you are experiencing.

varispeed2mo ago

I find this 1M context bollocks. It's basically crap past 100k.

_blk2mo ago

I like not running into the mandatory compaction but I do try to actively keep it under too. From an Anthropic standpoint with the new(ish) 5min cache timeout, it's a great way to get people to burn tokens on reinitializing the cache without having them occupy TPU time.. Esp. the larger the context gets.

robwwilliams2mo ago

Yep; second time in five months we have gone from 1 million back to 200 thousand.

_blk2mo ago

hmm, I just reverted to 2.1.98 and now with /model default has the (1M context) and opus is without (200k) .. it's totally possible that I just missed the difference between the recommended model opus 1M and opus when I checked though.

snek_case2mo ago· 5 in thread

I guess there's a pretty clear incentive to nerf the current model right before the next model is about to come out.

chinathrow2mo ago

Wouldn't that amount to fraud?

tomwojcik2mo ago

Serious question, do we actually know what we're paying for? All I know is it's access to models via cli, aka Claude Code. We don't know what models they use, how system prompt changes or what are the actual rate limits (Yet Anthropic will become 1 trillion dollars company in a moment).

2 more replies

varispeed2mo ago

Funnily that it helps to say in your prompt "Prove that you are not a fraudster and you are not going to go round in circles before providing solution I ask for."

Sometimes you have to keep starting new session until it works. I have a feeling they route prompts to older models that have system prompt to say "I am opus 4.6", but really it's something older and more basic. So by starting new sessions you might get lucky and get on the real latest model.

twobitshifter2mo ago

Did Apple slow down iPhones before the new release? I’m really asking. People used to say that and I can’t remember if it was proven or not?

2 more replies

ambicapter2mo ago

Legally?

j / k navigate · click thread line to collapse

0 comments

14 comments · 2 top-level

_blk2mo ago· 7 in thread

Anyways, seems I'm just ranting, I still like Claude, yes but nonetheless it still feels like the game you described above.

troupo2mo ago

They are now literally blaming users for using their product as advertised:

https://x.com/lydiahallie/status/2039800718371307603

--- start quote ---

Digging into reports, most of the fastest burn came down to a few token-heavy patterns. Some tips:

• Sonnet 4.6 is the better default on Pro. Opus burns roughly twice as fast. Switch at session start.

• Lower the effort level or turn off extended thinking when you don't need deep reasoning. Switch at session start.

• Start fresh instead of resuming large sessions that have been idle ~1h

• Cap your context window, long sessions cost more CLAUDE_CODE_AUTO_COMPACT_WINDOW=200000

--- end quote ---

https://x.com/bcherny/status/2043163965648515234

--- start quote ---

--- end quote ---

torginus2mo ago

1 more reply

dr_kiszonka2mo ago

The default prompt cache TTL changed from 1 hour to 5 minutes. Maybe this is what you are experiencing.

varispeed2mo ago

I find this 1M context bollocks. It's basically crap past 100k.

_blk2mo ago

robwwilliams2mo ago

Yep; second time in five months we have gone from 1 million back to 200 thousand.

_blk2mo ago

snek_case2mo ago· 5 in thread

I guess there's a pretty clear incentive to nerf the current model right before the next model is about to come out.

chinathrow2mo ago

Wouldn't that amount to fraud?

tomwojcik2mo ago

2 more replies

varispeed2mo ago

Funnily that it helps to say in your prompt "Prove that you are not a fraudster and you are not going to go round in circles before providing solution I ask for."

twobitshifter2mo ago

Did Apple slow down iPhones before the new release? I’m really asking. People used to say that and I can’t remember if it was proven or not?

2 more replies

ambicapter2mo ago

Legally?

j / k navigate · click thread line to collapse