Better HN

StrauXX 1d ago

Which indications are that?

nicoburns 22h ago

The cost factors on the new models compared to the old models.

jeremyjh 20h ago

Qwen3.6 9B is as good as GPT-4o and runs on my M2 MacBook Air. Models are getting stronger and less costly at the same time, but these are somewhat separate branches of research. Frontier labs are spending more because they are still getting marginal returns and there is more capacity to spend than there was a year ago.

gertop 20h ago

Qwen 3.6 9B doesn't exist.

If you meant 3.5 9B and you truly believe it's as good as 4o then I can only assume you have a very basic use case.


bdelmas 21h ago

You are mixing up cost and progress. The fact that it's getting more and more expensive doesn't by itself mean that progress is slowing down.

nicoburns 20h ago

They are intrinsically linked beyond a certain point. If we're making progress but costs are spiraling exponentially then it stands to reason that we will soon reach a point where we can no longer afford the increasing costs and thus progress will slow.

(barring some breakthrough that reduces costs, which of course may happen, but recent model improvements are not strong evidence for one)

aspenmartin 20h ago

The cost of a specific level of performance decreases 10x per year; this has been a pretty consistent property for a while now.
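To make the arithmetic behind that claim concrete, here is a minimal Python sketch. The 10x-per-year factor is the figure claimed above, not a verified measurement, and the starting price of 10.0 and the function name `cost_after` are hypothetical values chosen purely for illustration.

```python
def cost_after(initial_cost: float, years: float, yearly_factor: float = 10.0) -> float:
    """Cost of a fixed level of performance after `years`.

    Assumes the cost falls by `yearly_factor` every year (the 10x figure
    claimed in the thread; treat it as an assumption, not a measurement).
    """
    return initial_cost / (yearly_factor ** years)


# Hypothetical starting price of 10.0 (arbitrary units) for a fixed capability level.
for year in range(4):
    print(f"year {year}: {cost_after(10.0, year):g}")
```

Under that assumption the price of a fixed capability level drops a thousandfold in three years, which is a separate question from what the newest frontier models cost to train.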

overfeed 1d ago

Investment dollars.

dzhiurgis 23h ago

Source for that claim?

lionkor 1d ago

Nobody is releasing NEW models

aspenmartin 22h ago

…not only is this not true but it also doesn’t matter. Why would this indicate performance saturating?

kstenerud 22h ago

What constitutes a NEW model for the purposes of calculating progress?

taneq 23h ago

The standard networking connection has been called “Ethernet” for more than thirty years, so networking has stagnated, right?

SlinkyOnStairs 23h ago

If higher bandwidth networking consisted primarily of running more and more Ethernet lines in parallel, you would most certainly agree that "networking has stagnated".

"Reasoning" and now "Agentic" AI systems are not some fundamental improvement on LLMs; they're just running roughly the same prior-gen LLMs multiple times.

Hence the conclusion that LLM improvement has slowed down, if not stagnated entirely, and that we should not expect the improvements of switching to these "reasoning" systems to keep happening.


GardenLetter27 22h ago

What? DeepSeekV3 just came out and is incredible for the price. Mythos is also half-released.

nozzlegear 20h ago

Until you or I can actually use Mythos in Claude without an NDA or other strings attached, Mythos is not released and is just an effective marketing tool for Anthropic.
