undefined | Better HN

0 pointswrs2d ago0 comments

Especially considering the millions of 2026-class data center GPUs that massively overinvested companies are currently buying, which will be obsolete in a few years.

0 comments

8 comments · 4 top-level

treis2d ago· 4 in thread

I think those are going to be run until they die. The capex vs opex is too high to obsolete them in a few years. They'll keep serving current gen LLMs for as long as they keep running.

Tuna-Fish2d ago

It won't make sense to run them after two years. The vendors will be limited on datacenter space, power and cooling, and there will be new hardware available that will run the same models at a fraction of the power.

A100 -> H100 was >3x tokens per joule, H100 -> B200 >10x. There are significant low-hanging fruit still available in architectural efficiency, and the vendors are chasing them.

This is the big risk for AI companies that I feel is not being sufficiently priced in. Almost none of the investments they are making are durable, the depreciation schedules for everything but the real estate should be less than 24 months. Until the hardware is stable enough that you only get double-digit % improvements per generation, it should almost be counted as opex.

treis2d ago

The annual operating cost of these is <10% purchase price annually. Even if the B200 is 10x more efficient in practice you can still operate the H100 at profit.

As it stands there's way more demand than supply. The new GPUs are going to run frontier models while the older ones serve smaller ones.

That said some of these are running in tents hooked up to mobile turbines. I can see some of those going away but generally I think you'll see them used until they start to fail in 5-10 years.

Chu4eeno2d ago

They can also be used for other things than running the main frontier whatever model as well.

E.g. grok isn't truly multi-modal, it has a callable tool that is a separate VLM it invokes on image URLs or files (for a long time it was grok-1.5v, but I think they have upgraded now, it was pretty bad).

And then you have the small summarizer models for the CoT/thought traces, the guidable summarizer models for the standard browse tools, etc.

There's a ton of stuff that can use an aging GPU.

robwwilliams2d ago

Yes, sure, but not efficiently. Even Pops will not want to run four hair dyer GPUs 24-7 in the garage.

nok22kon2d ago

H100 were released in Oct 2022. They are now more expensive than at release time.

robwwilliams2d ago

I think you are right when you factor in the much more efficient newer high end GPU. That is what may make the current GPu investments obsolete in 2-4 years.

tensegrist2d ago

aiui nvidia has buyback agreements covering these to prevent them from being sold off

j / k navigate · click thread line to collapse

0 comments

8 comments · 4 top-level

treis2d ago· 4 in thread

I think those are going to be run until they die. The capex vs opex is too high to obsolete them in a few years. They'll keep serving current gen LLMs for as long as they keep running.

Tuna-Fish2d ago

A100 -> H100 was >3x tokens per joule, H100 -> B200 >10x. There are significant low-hanging fruit still available in architectural efficiency, and the vendors are chasing them.

treis2d ago

The annual operating cost of these is <10% purchase price annually. Even if the B200 is 10x more efficient in practice you can still operate the H100 at profit.

As it stands there's way more demand than supply. The new GPUs are going to run frontier models while the older ones serve smaller ones.

That said some of these are running in tents hooked up to mobile turbines. I can see some of those going away but generally I think you'll see them used until they start to fail in 5-10 years.

Chu4eeno2d ago

They can also be used for other things than running the main frontier whatever model as well.

And then you have the small summarizer models for the CoT/thought traces, the guidable summarizer models for the standard browse tools, etc.

There's a ton of stuff that can use an aging GPU.

robwwilliams2d ago

Yes, sure, but not efficiently. Even Pops will not want to run four hair dyer GPUs 24-7 in the garage.

nok22kon2d ago

H100 were released in Oct 2022. They are now more expensive than at release time.

robwwilliams2d ago

I think you are right when you factor in the much more efficient newer high end GPU. That is what may make the current GPu investments obsolete in 2-4 years.

tensegrist2d ago

aiui nvidia has buyback agreements covering these to prevent them from being sold off

j / k navigate · click thread line to collapse