Circular Financing: Does Nvidia's $110B Bet Echo the Telecom Bubble? (opens in new tab)

(tomtunguz.com)

249 pointsmiltava8mo ago213 comments

213 comments

131 comments · 31 top-level

mg8mo ago· 24 in thread

    Fiber networks were using less
    than 0.002% of available capacity,
    with potential for 60,000x speed
    increases. It was just too early.

I doubt we will see unused GPU capacity. As soon as we can prompt "Think about the codebase over night. Try different ways to refactor it. Tomorrow, show me your best solution." we will want as much GPU time at the current rate as possible.

If a minute of GPU usage is currently $0.10, a night of GPU usage is 8 * 60 * 0.1 = $48. Which might very well be worth it for an improved codebase. Or a better design of a car. Or a better book cover. Or a better business plan.

Mo38mo ago

> I doubt we will see unused GPU capacity

I'd argue we very certainly will. Companies are gobbling up GPUs like there's no tomorrow, assuming demand will remain stable and continue growing indefinitely. Meanwhile LLM fatigue has started to set in, models are getting smaller and smaller and consumer hardware is getting better and better. There's no way this won't end up with a lot of idle GPUs.

Workaccount28mo ago

>Meanwhile LLM fatigue has started to set in

Has it?

I think there is this compulsion to think that LLMs are made for senior devs, and if devs are getting wary of LLMs, the experiment is over.

I'm not a programmer, my day job isn't tech, and the only people I know who express discontent with LLMs are a few of programmer friends I have. Which I get, but everyone else is using them gleefully for all manner of stuff. And now I am seeing the very first inklings of completely non-technical people making bespoke applets for themselves.

From OpenAI, programming is ~4% of chatGPTs usage. That's 96% being used for other stuff.

I don't see any realistic or grounded forecast that includes a diminishing demand for compute. We're still at the tip of adoption...

2 more replies

xadhominemx8mo ago

Test time compute has made consumption highly elastic. More compute = better results. Marginal cost of running these GPUs when they would otherwise be idle is relatively very low. It will be utilized.

delusional8mo ago

> There's no way this won't end up with a lot of idle GPUs.

Nvidia is betting the farm on reinventing GPU compute every 2 years. The GPUs wont end up idle, because they will end up in landfills.

Do I believe that's likely, no, but it is what I believe Nvidia is aiming for.

goalieca8mo ago

What’s the lifetime of these things once they’ve been running hot for 2-3 years

brazukadev8mo ago

This. I just found out that for my MCP needs, Qwen3 4B running local is good enough! So I just stopped using Gemini API.

idiotsecant8mo ago

Your bet is that people will simply use less compute, for the first time in the history of the human race?

1 more reply

mg8mo ago

Look at the human body.

2% of it is dedicated to thinking.

My guess is that as a species, we will turn a similar percentage of our environment into thinking matter.

If there are a billion houses on planet earth, 2% of it are 20 million datacenters we still have to build.

1 more reply

credit_guy8mo ago

> As soon as we can prompt "Think about the codebase over night. Try different ways to refactor it. Tomorrow, show me your best solution." we will want as much GPU time at the current rate as possible.

That is nothing. Coding is done via text. Very soon people will use generative AI for high resolution movies. Maybe even HDR and high FPS (120 maybe?). Such videos will very likely cost in the range of $100-$1000 per minute. And will require lots and lots of GPUs. The US military (and I bet others as well) are already envisioning generative AI use for creating a picture of the battlespace. This type of generation will be even more intensive than high resolution videos.

bigbadfeline8mo ago

> "Try different ways to refactor it. Tomorrow, show me your best solution."

The cost/benefit analysis doesn't add up for two reasons:

First, a refactored codebase works almost the same as non-refactored one, that is, the tangible benefit is small.

Second, how many times are you going to refactor the codebase? Once and... that's it. There's simply no need for that much compute for lack of sufficient beneficial work.

That is, the present investments are going to waste unless we automate and robotize everything, I'm OK with that but it's not where the industry is going.

skrebbel8mo ago

> improved codebase

I've seen lots of claims about AI coding skill, but that one might be able to improve (and not merely passably extend) a codebase is a new one. I'd want to see it before I believe it.

Leynos8mo ago

It depends what you're fitting to. At the simplest, you can ask for a reduction in cyclomatic/cognitive complexity measured using a linter, extraction of methods (where a paragraph of code serves no purpose other than to populate a variable) or complex conditionals, move from an imperative to a declarative approach, etc. These are all things that can be caught through pattern matching and measured using a linter or code review tool (CodeRabbit, Sourcery or Codescene).

Other things might need to be done in two stages. You might ask the agent to first identify where code violates CQRS, then for each instance, explain the problem, and spawn a sub-agent to address that problem.

Other things the agent might identify this way: multiple implications, use of conflicted APIs, poor separation of concerns at a module or class level.

I don't typically let the agent do any of this end to end, but I would typically manually review findings before spawning subagents with those findings.

fragmede8mo ago

Claude will refactor but more than that, it can add documentation. And it can be asked about a codebase too. "Where does FOO happen?" "How does BAR work?".

thenaturalist8mo ago

This is such a short sighted take glaringly ommitting a crucial ingredient in learning or improvement - both for humans or machines alike: feedback loops.

And you can't really hack / outsmart feedback loops.

Just because something is conceptually possible, interaction with the real rest of the world separates a possible from an optimal solution.

The low hanging fruits/ obvious incremental improvements might be quickly implemented by LLMs based on established patterns in their training data.

That doesn't get you from 0 to 1 dollar, though and that's what it's all about.

bwfan1238mo ago

this. Was highlighted by Sutton in a recent podcast rather starkly.

LLMs are a great tool. But, the real world is far too nuanced to be captured in text and tokens. So, LLMs will be a great productivity boosting tool like a calculator or a spreadsheet. Expecting it to do more is science fiction.

cantor_S_drug8mo ago

With improvements on the algorithm side and new techniques, even older hardware will become useful.

Zigurd8mo ago

I get what you're saying and the reasoning behind it, but older hardware has never been useful where power consumption is part of determining usefulness.

chatmasta8mo ago

This is the biggest threat to the GPU economy – software breakthroughs that enable inference on commodity CPU hardware or specialized ASIC boards that hyperscalers can fabricate themselves. Google has a stockpile of TPUs that seem fairly effective, although it’s hard to tell for certain because they don’t make it easy to rent them.

2 more replies

jdlshore8mo ago

> As soon as we can prompt…

This is the fundamental error I see people making. LLMs can’t operate independently today, not on substantive problems. A lot of people are assuming that they will some day be able to, but the fact is that, today, they cannot.

The AI bubble has been driven by people seeing the beginning of an S-curve and combining it with their science-fiction fantasies about what AI is capable of. Maybe they’re right, but I’m skeptical, and I think the capabilities we see today are close to as good as LLMs are going to get. And today, it’s not good enough.

Workaccount28mo ago

Getting gold in the math Olympiad is a pretty strong indicator of operating independently on substantive problems.

A year ago they need an extensive harness to get silver, and two years ago they could hardly multiply 1000x10000.

Terence Tao tweeted yesterday about using GPT5 to help quickly solve a problem he was working on.

1 more reply

ccorcos8mo ago

I’ve never understood why time is the metric people are using here. If LLMs get so much better we can “run them overnight”, what makes you think that they won’t also get faster and so they accomplish exactly what you’re talking about in 5 minutes?

yubblegum8mo ago

I just had to double check (have not been paying attention for a couple of years) but indeed it seems GPU underutilization remains a fact and the numbers are pretty significant. Main issues are being memory bound so the compute sits idle.

ninkendo8mo ago

The actual computation speed isn't as important nowadays but it doesn't really change the conclusion with respect to whether they're underutilized.

Because the main reason for the price premiums in AI-class GPUs are the gobs of insanely fast memory, and that is very much not underutilized. AI companies grab GPUs with as much memory (at the fastest memory bandwidth) as possible and underclock the GPU to save on power. Linus Tech Tips had a great video about the H200 that touched on this this week: https://www.youtube.com/watch?v=lNumJwHpXIA

davedx8mo ago

Tasks being memory bound is not the same thing as GPU's being idle for economic reasons though.

stephc_int138mo ago· 10 in thread

Knowing history of past bubbles is only mildly informative. The dotcom bubble was different than the railroads bubble etc.

The only thing to keep in mind is that all of this is about business and ROI.

Given the colossal investments, even if the companies finances are healthy and not fraudulent, the economic returns have to be unprecedented or there will be a crash.

They are all chasing a golden goose.

jauntywundrkind8mo ago

With Bezos openly stating the goal to build 10 GW+ data-centers in space, it almost feels in question whether this is about ROI, or simply building the Neuromancer future where the ultra-rich can finally be free of their need for the rest of us. Not needing labor at all would be the final return on investment. https://news.ycombinator.com/item?id=45465480 https://www.datacenterdynamics.com/en/news/jeff-bezos-claims...

N70Phone8mo ago

It is folly to take these statements at their words.

Bezos is just saying shit to generate hype. All these executives are just saying shit. There is no plan. You must treat these people as morons who understand nothing.

Anyone who knows even the slightest details about datacenter design knows what moving heat is the biggest problem. This is the exact thing that being in space makes infinitely harder. "Datacenters in space" is an idea you come up with only if you are a moron who knows nothing about either datacenters or space.

If nothing else this is the singular reason you should treat AI as a bubble. All of the people at the helm of it have not a single fucking clue what they're talking about. They all keep running their mouth with utter nonsense like this.

1 more reply

gruez8mo ago

>With Bezos openly stating the goal to build 10 GW+ data-centers in space, it almost feels in question whether this is about ROI, or simply building the Neuromancer future where the ultra-rich can finally be free of their need for the rest of us. Not needing labor at all would be the final return on investment.

Why does "Not needing labor at all" need to be in space?

1 more reply

davedx8mo ago

I'm not sure we should pay too much attention to what Bezos says now he's out of the day to day running of Amazon. It feels like a lot of his life choices now are more about being a megawealthy play boy than anything economically motivated

chairmansteve8mo ago

>10 GW+ data-centers in space.

We are a long way from that. At least 10 years, probably never gonna happen.

delusional8mo ago

> only mildly informative

I agree, but would like to maybe build out that theory. When we start talking about the mechanisms of the past we end up over-constricting the possibility space. There were a ton of different ways the dotcom bubble COULD have played out, and only one way it did. If we view the way it did as the only way it possibly could have, we'll almost certainly miss the way the next bubble will play out.

Mistletoe8mo ago

I’m concerned that the accounting differences mentioned between Lucent and Nvidia, Microsoft, OpenAI, Google just mean we have gotten much better at lying and misrepresenting things as true. Then the bubble pops and you get the real numbers and we are all like “yep it was the same thing all over again”.

stephc_int138mo ago

Of course, CFOs are all very aware of what failed previously.

Printerisreal8mo ago

This time they have fiat money and government on their side. so that is also different.

candiddevmike8mo ago

It just means we're all going to be hurt by the collapse, not just investors. In line with socialized loses, privatized profits.

1 more reply

digitcatphd8mo ago· 10 in thread

The biggest issue with Nvidia is their revenue is not recurring but the market is treating their stock as it were, which is correlated with all semi stocks, with a one-time massive CAPEX investment lasting 1-2 years.

Simple as this - as to why its just not possible for this to continue.

xadhominemx8mo ago

NVDA stock does not trade at a huge multiple. Only 25x EPS despite very rapid top line growth and a dominant position at the eve of possibly the most important technology transition in the history of humankind. The market is (and has been) pricing in a slowdown.

JumpinJack_Cash8mo ago

> > the eve of possibly the most important technology transition in the history of humankind.

Funnily enough when you spend some months thinking into this intensively the result is that a monetary investment into the company that will bring about the singularity / AGI is the most irrational thing one can do.

If the enterprise is successful and the singularity/AGI is benign you won't need money anymore, if the experiment fail the possibility of things going rogue is very high, or even the panic from a possible series of rogue events.

So for the first time the rational thing would be to either spend those money to learn poker/chess/videogames or whatever game we will play with each other to feel cool while the AI takes care of everything else, or maybe outright spend money on coke and strippers given the chance of doomsday.

1 more reply

digitcatphd8mo ago

What do you think is going to happen to their earnings when CAPEX slows?

1 more reply

Printerisreal8mo ago

most important technology transition in the history of humankind but Nvidia itself is not leading the software part? Are they selling shovels or why would they give that part of being the head develop the AGI and GOD?

2 more replies

SiempreViernes8mo ago

Since you aren't talking about the green transition, whatever technology transition you have in mind is obviously second most important at best.

1 more reply

swexbe8mo ago

What are you talking about? It's trading at 55x EPS and 41x forward EPS.

1 more reply

bwfan1238mo ago

Same could be said of the covid-darlings - zoom, pelaton etc. They got bid up assuming the present to continue into the future. That is the nature of the markets. Same story with fake meat companies. Across time you will find this pattern - 3d printing etc, all ushering some new faddish technology. Also, explains the investments into openai as a hedge against capex slowdown, so there is a captive customer.

Workaccount28mo ago

I'm pretty sure Nvidia sells out all their cards instantly once they are announced.

Their margin is ridiculous and they are still unable to meet demand.

JCM98mo ago

This. It’s basic economics. The second there’s a blip the market will be flooded with cheap used GPUs and there will be zero reason to buy new ones. At that point it will be impossible for NVidia to sustain their revenue numbers.

cyanydeez8mo ago

TSLA is the same. Tge market is basically a new rich persons bank, abstracted by loans and lines of credit.

Obviously its a bubble but thats meaningless for anyone but the richest to manage.

The rest of us are just ants.

pragmatic8mo ago· 8 in thread

At a "telecom of telecom" we (they) were still lighting up dark fiber 15 years later (2015) when mobile data for cell carriers finally created enough demand. Hard to fathom the amount of overbuild.

The only difference is fiber optic lines remained useful the whole time. Will these cards have the same longevity?

(I have no idea just sharing anecdata)

Zigurd8mo ago

New fiber isn't significantly more power efficient. The other side of the coin is that backhoes haven't become more efficient since the fiber was buried.

beerandt8mo ago

Directional drilling is a game changer and has become accessible in the last decade.

You're looking for advancement in carriages unaware of the 'automobile' that made 5g and ftth deployment at scale possible.

1 more reply

heisenbit8mo ago

> Will these cards have the same longevity?

The article cites anecdotal 1-2 years due to the significant stress.

pragmatic8mo ago

In 2005 telecom was a cash cow because of long distance charges and if your mechanical phone switch was paid off you were printing money (regulations guaranteed revenue)

This didn't last that much longer and many places were trying to diversify into managed services (data dog for companues on Orem network and server equipment,etc) which they call "unregulated" revenue.

Add written an things business, irrational exuberance can kill you.

hyghjiyhu8mo ago

I think the chips themselves won't have longevity, but the r&d gone into them is useful. Question is whether the value of that can be captured.

adventured8mo ago

Depends on which companies we're talking about. Nvidia's annualized operating income is so high right now that it'll be capturing more value (op income) in the next four quarters (~$120 billion) than its R&D expenditures have cost over its 32 year history combined. For Nvidia the return has long since been achieved.

As the AI spending bubble gives out, Nvidia's profit growth will slow dramatically (single digits), and slamming into a wall (as Cisco did during the telecom bubble; leading up to the telecom crash, Cisco was producing rather insane quarter over quarter growth rates).

mjcl8mo ago

I think the high-density data centers that are being built to support the hyperscalers are more analogous to the dark fiber overbuild. When you lit that fiber in 2015, you (presumably) were not using a line card bought back in 1998.

pragmatic8mo ago

Everything in that data center is depreciating as soon as they turn on the power.

AC, ups, generators not to mention the severs.

That's the thing with fiber it was still useful. The cards at either end are easy to add, waaaayyy cheaper and higher perf (they're were no cards on end of dark lines) 15 years later.

narmiouh8mo ago· 8 in thread

I think the fundamental issue is the uncertainty of achieving AGI with baked in fundamentals of reasoning.

Almost 90% of topline investments appear to be geared around achieving that in the next 2-5 years.

If that doesn’t come to pass soon enough, investors will loose interest.

Interest has been maintained by continuous growth in benchmark results. Perhaps this pattern can continue for another 6-12 months before fatigue sets in, there are no new math olympiads to claim a gold medal on…

Whats next is to show real results, in true software development, cancer research, robotics.

I am highly doubtful the current model architecture will get there.

Zigurd8mo ago

AGI is not near. At best, the domains where we send people to years of grad school so that they can do unnatural reasoning tasks in unmanageable knowledge bases, like law and medicine, will become solid application areas for LLMs. Coding, most of all, will become significantly more productive. Thing is, once the backlog of shite code gets re-engineered, the computing demand for a new code creation will not support bubble levels of demand for hardware.

chuckadams8mo ago

> AGI is not near

There's also plenty of argument to be made that it's already here. AI can hold forth on pretty much any topic, and it's occasionally even correct. Of course to many (not saying you), the only acceptable bar is perfect factual accuracy, a deep understanding of meanings, and probably even a soul. Which keeps breathing life into the old joke "AI is whatever computers still can't do".

5 more replies

cl428mo ago

Not sure why you're getting downvoted.

If you speak with AI researchers, they all seem reasonable in their expectations.

... but I work with non-technical business people across industries and their expectations are NOT reasonable. They expect ChatGPT to do their entire job for $20/month and hire, plan, budget accordingly.

12 months later, when things don't work out, their response to AI goes to the other end of the spectrum -- anger, avoidance, suspicion of new products, etc.

Enough failures and you have slowing revenue growth. I think if companies see lower revenue growth (not even drops!), investors will get very very nervous and we can see a drop in valuations, share prices, etc.

Cheer21718mo ago

> their expectations are NOT reasonable. They expect ChatGPT to do their entire job for $20/month and hire, plan, budget accordingly.

This is entirely on the AI companies and their boosters. Sam Altman literally says gpt 5 is "like having a team of PhD-level experts in your pocket." All the commercials sell this fantasy.

3 more replies

belter8mo ago

> If you speak with AI researchers, they all seem reasonable in their expectations.

An extraordinary claim for which I would like to see the extraordinary evidence. Because every single interview still available on YT form 3 years ago ...had these researchers putting AGI 3 to 5 years out ...A complete fairy tale as the track to AGI is not even in sight.

If you want to colonize the Solar System the track is clear. If you to have Fusion, the track is clear. AGI track ?

1 more reply

narmiouh8mo ago

I'm not sure either - for a second I thought perhaps llm agents are prowling around to ensure the right messages are floating up, but who knows...

xadhominemx8mo ago

Hyperscalers are only spending less than half of their operating cash flows on AI capex. Full commitment to achieving AGI within a few years would look much different.

stevenhuang8mo ago

This doubt of yours is as credible as all the claims 5 years ago that we will never have capable thinking machines, yet here we are with LLMs.

hackthemack8mo ago· 7 in thread

I worked at a mom and pop ISP in the 90s. Lucent did seem at the forefront of internet equipment at the time. We used Portmaster 3s to handle dial up connections. We also looked into very early wireless technology from Lucent.

Something I wanted to mention, only somewhat tanget. The Telecommunications Act of 1996 forced telecommunication companies to lease out their infrastructure. It massively reduced the prices an ISP had to pay to get T1, because, suddenly, there was competition. I think a T1 went from 1800 a month in 1996, to around 600 a month in 1999. It was a long time ago, so my memory is hazy.

But, wouldn't you know it, the Telecommunication companies sued the FCC and the Telecommunications Act was gutted in 2003

https://en.wikipedia.org/wiki/Competitive_local_exchange_car...

marcusb8mo ago

> I think a T1 went from 1800 a month in 1996, to around 600 a month in 1999. It was a long time ago, so my memory is hazy.

It varied a lot by region. At the mom and pop ISP I worked at, we went from paying around $1,500/month for a T1 to $500 to, eventually around $100/month for the T1 loop to the customer plus a few grand a month for an OC12 SONET ring that we used to backhaul the T1 (and other circuits) back to our datacenter.

But, all of it was driven by the Telecommunications Act requirement for ILECs to sell unbundled network facilities - all of the CLECs we purchased from were using the local ILEC for the physical part of the last mile for most (> 75%) of the circuits they sold us.

One interesting thing that happened was that for a while in the late 90’s, when dialup was still a thing, we could buy a voice T1 PRI for substantially less than a data T1 ($250 for the PRI vs $500 for the T1.) The CLEC’s theory was our dialup customers almost all had service from the local ILEC, and the CLEC would be paid “reciprocal compensation” fees by the ILEC for the CLEC accepting calls from them.

In my market, when the telecommunications act reform act was gutted, the ILEC just kept on selling wholesale/unbundled services to us. I think they had figured out at that point that it was a very profitable line of business if they approached it the right way.

jauntywundrkind8mo ago

Th gutting of telecom competition, the allowance of total monopoly power, was a travesty of the court system. The law was quite plain & clear & the courts decided all on their own that, well, since fiber to the home is expensive to deploy, we are going to overrule the legislative body. The courts aren't supposed to be able to overturn laws they don't like as they please but that's what happened here.

Regarding the price of connection, it's also worth mentioning that while T1 and other T-channel and OCx connection remains in high use, 1996-1999 is also the period where DSL became readily available & was a very fine choice for many needs. This certainly created significant cost pressure on other connectivity options.

bwfan1238mo ago

I worked at a startup during the telecom boom. Then, startups were getting acquired by the likes of Cisco before the startup had a deployed product. And, back then, IPOs were the only form of liquidity event and engineers were locked up for 6 months. The lucky ones had their startups go IPO or get acquired with enough time to spare to get out before the ensuing bust. After the bust, funding dried up and most startups folded including the one I worked at. There was wipeout and desolation for a few years. Subsequently, green shoots started appearing in the form of a new wave of tech companies.

echelon8mo ago

Capitalism only works if there is lots of competition.

Monopolies gum up the system, reward the institutional capital rather than innovation capital, and prevent new entrants from de-ossifying and being the renewing forest fire.

We've been so lax on antitrust. Google, Apple, Meta, Amazon - they all need to be broken up. Our economy and our profession would be better for it.

Innovation should be a treadmill.

YC and a16z want this.

2 more replies

awongh8mo ago

You're implying only 4 years of regulation was enough to shift the balance of power between telecoms and smaller ISPs."

If it's true that this regulation was what helped jumpstart the internet it's an interesting counterpoint to the apocalyptic predictions of people when these regulations are undone. (net neutrality comes to mind as well)

I've never heard anyone claim before that just having these laws on the books for a small period of time is "enough".

watwut8mo ago

> 've never heard anyone claim before that just having these laws on the books for a small period of time is "enough".

Why would it be enough? This legislation prevents monopolies from abusing position, therefore we will repeal it the moment it turns out to be useful?

Yeah, it takes time to consolidate power again, that does not mean the legislation is not good.

1 more reply

nroets8mo ago

Changes in the legislative landscaped may have influenced the timing of the price war and the telecoms bubble.

But the price war was inevitable. And the telecoms bubble was highly likely in any case.

Telecoms investment was a response to crazy valuations of dot-com stocks.

rglullis8mo ago· 6 in thread

The one thing I don't understand is this assumption that demand for GPUs for training is going to keep growing at the rate they grew so far.

I get the demand for new applications, which require inference, but nowadays with so many good (if not close to SOTA) models available for free and the ability to run them on consumer hardware (apple M4 or AMD Max APUs), is there any demand for applications that justify a crazy amount of investment in GPUs?

porridgeraisin8mo ago

Inference will be cheapest when run in a shared cloud environment, simply due to the LLMs roofline. Thus, most B2B use cases are likely to be datacenter based, like AWS today.

Of course, cern is still going to use their FPGA hyper-optimized for their specific trigger model for the LHC, and apple is gojng to use a specialized low power ASIC running a quantized model for hello Siri, but I meant the majority usecase.

rglullis8mo ago

I do not buy this premise. I think it will end up being cheaper to simply run the LLMs directly on the user device.

I think that there are plenty of competitors in the "LLMs with open weights" space to essentially make the models a commodity, so all that is left is the compute cost and there is no way that someone will be running a datacenter in a way that is cheaper than "the computer that I already have running on my desk".

1 more reply

Leynos8mo ago

Apologies for the second reply, but it also occurs to me that reinforcement learning is the new battleground. Look at the changes between o1, o3 and GPT-5 thinking. Sonnet 3.7, Sonnet 4, and Sonnet 4.5. And so forth.

I expect models will get larger again once everyone is doing their inference on B200s, but the RL training budget is where the insatiable appetite sits right now.

bix68mo ago

Isn’t the whole point of the arms race that the more GPUs you have the closer you get to AGI? Which is the supposed goal here.

rglullis8mo ago

I do not believe for a second that any of those people investing tens of billions of dollars are doing it to "get to AGI". They would only be able to profit from a AGI if it could be simultaneously (a) weaponized and (b) strictly controlled by one party, and there is no one crazy enough that these could be achieved.

If you tell me that people are pouring all that money into data centers because they believe that most applications will use some form of LLM or VLM as the main driver of machine-to-machine and machine-to-person interface, I'd be more inclined to buy it. But then I'd respond that it seems that LLMs are reaching a point of diminishing returns and the big next move is to make it easy and faster to distill/fine-tune the LLMs for specific business needs, which is something that should be possible to do with the existing infra already (I guess?)

Leynos8mo ago

I suspect continuous learning will be the next driver of GPU usage.

ekjhgkejhgk8mo ago· 4 in thread

In reference 14 we read

> However, what’s become clear is that OpenAI plans to pay for Nvidia’s graphics processing units (GPUs) through lease arrangements, rather than upfront purchases

I wish someone here could explain it to a dummy like me. Nvidia tells OpenAI: heres some GPUs, can you pay for them over 5 years. How is this an "investment" by Nvidia? That reference keeps calling this an investment, but what they describe is a lease agreement. Why do they call it an investment? What am I missing?

sbuttgereit8mo ago

That Nvidia has to front the costs of the product at the beginning and arguably the risk that the allocation of assets end up not being paid off (bankruptcy, etc.) By carrying those costs early and the associated risk, Nvidia expects a return on that. If the risk is realized they'll lose but otherwise they'll gain. That has all the hallmarks of an investment.

ekjhgkejhgk8mo ago

Thanks for your reply, but:

NVidia could protect itself against OpenAI bankrupcy by adding a clause to the lease saying that if OpenAI goes bankrupt, Nvidia gets its GPUs back. So the risk would only be that the lease would be aborted sooner than expected.

1 more reply

foundart8mo ago

Speculation by a non-accountant:

At the end Nvidia retains ownership of what are probably very low value assets.

Contrast that with car leases: there is a robust market for used cars.

Nvidia is in effect financing the GPUs by not requiring the full payment up front.

Do the lease payments add up to the total cost?

ekjhgkejhgk8mo ago

The GPUs do/might lose value over time, but perhaps Nvidia is betting that they wont? Going by recent years, their value went up in some cases.

EDIT: Interesting note

> CPUs historically have 5-10 years of useful life , while GPUs in AI datacenters last 1-3 years in practice , despite 6-year accounting assumptions.30,31 Evidence from Google architects shows GPUs at 60-70% utilization survive 1-2 years , with 3 years maximum.31 Meta’s Llama 3 training experienced 9% annual GPU failure rates , suggesting 27% failure over 3 years.31

dangus8mo ago· 4 in thread

I think we are at the PS3/Xbox 360 phase of AI.

By that I mean, those were the last consoles where performance improvements delivered truly new experiences, where the hardware mattered.

Today, any game you make for a modern system is a game you could have made for the PS3/Xbox 360 or perhaps something slightly more powerful.

Certainly there have been experiences that use new capabilities that you can’t literally put on those consoles, but they aren’t really “more” in the same way that a PS2 offered “more” than the PlayStation.

I think in that sense, there will be some kind of bubble. All the companies that thought that AI would eventually get good enough to suit their use case will eventually be disappointed and quit their investment. The use cases where AI makes sense will stick around.

It’s kind of like how we used to have pipe dreams of certain kinds of gameplay experiences that never materialized. With our new hardware power we thought that maybe we could someday play games with endless universes of rich content. But now that we are there, we see games like Starfield prove that dream to be something of a farce.

jcranmer8mo ago

> By that I mean, those were the last consoles where performance improvements delivered truly new experiences, where the hardware mattered.

The PS3 is the last console to have actual specialized hardware. After the PS3, everything is just regular ol' CPU and regular ol' GPU running in a custom form factor (and a stripped-down OS on top of it); before then, with the exception of the Xbox, everything had customized coprocessors that are different from regular consumer GPUs.

ben_w8mo ago

> By that I mean, those were the last consoles where performance improvements delivered truly new experiences, where the hardware mattered.

I hope that's where we are, because that means my experience will still be valuable and vibe coding remains limited to "only" tickets that take a human about half a day, or a day if you're lucky.

Given the cost needed for improvements, it's certainly not implausible…

…but it's also not a sure thing.

I tried "Cursor" for the first time last week, and just like I've been experiencing every few months since InstructGPT was demonstrated, it blew my mind.

My game metaphor is 3D graphics in the 90s: every new release feels amazing*, such a huge improvement over the previous release, but behind the hype and awe there was enough missing for us to keep that cycle going for a dozen rounds.

* we used to call stuff like this "photorealistic": https://www.reddit.com/r/gaming/comments/ktyr1/unreal_yes_th...

BlueTemplar8mo ago

We did get more : the return of VR couldn't have been possible without drastically improved hardware.

But the way how it stayed niche shows how it's not just about new gameplay experiences.

Compare with the success of the Wii Sports and Wii Fit, which I would guess managed it better, though through a different kind of hardware that you are thinking about ?

And I kind of expect the next Nintendo console to have a popular AR glasses option, which also would only have been made possible thanks to improving hardware (of both kinds).

dangus8mo ago

That’s exactly what I mean, too. We obviously will get much better AI. It just seems like the value that most people are getting out of it is already captured, just like how technically impressive stuff like VR is very niche.

I could be very wrong, obviously.

alephnerd8mo ago· 3 in thread

Glad to see Tom's blog on HN - as usual a great write up. A number of us have been chatting about this for several months now, and the take is fairly sober.

Meta commentary but I've grown weary of how commentary by actual domain experts in our industry are underrepresented and underdiscussed on HN in favor of emotionally charged takes.

dvt8mo ago

> actual domain experts

Calling a VC a "domain expert" is like calling an alcoholic a "libation engineer." VC blogs are, in the best case, mildly informative, and in the worst, borderline fraudulent (the Sequoia SBF piece being a recent example, but there are hundreds).

The incentives are, even in a true "domain expert" case (think: doctors, engineers, economists), often opaque. But when it comes to VCs, this gets ratcheted up by an order of magnitude.

alephnerd8mo ago

Tom has had a fairly solid track record at Redpoint and now Theory in Data, Enterprise SaaS, and AI/ML. And it's not like we see many posts by engineers, doctors, or economists on HN either - most posts are listicles about the "culture" of technology, an increased amount of political articles growing increasingly tenuously related to the tech industry, and a portion of actually interesting technical content.

hodgesrm8mo ago

Martin Casado is a counter-example. His writings on technology starting with his phd thesis are very informative. [0] He’s the real thing as are many others.

[0] http://yuba.stanford.edu/~casado/mcthesis.pdf

JCM98mo ago· 3 in thread

The smartest finance folks I know say that this “irrational exuberance” works until it doesn’t. Meaning nobody really thinks it’s sustainable, but companies and VCs chasing the AI hype bubble have backed themselves into a corner where the only way to stop the bubble from bursting is to keep inflating the bubble.

The fate of the bubble will be decided by Wall Street not tech folks in the valley. Wall Street is already positioning itself for the burst and there’s lots of finance types ready to call party over and trigger the chaos that lets them make bank on the bubble’s implosion.

These finance types (family offices, small secret investment funds) eat clueless VCs throwing cash on the fire for lunch… and they’re salivating at what’s ahead. It’s a “Big Short” once in 20-30 years type opportunity.

ProjectArcturis8mo ago

>These finance types (family offices, small secret investment funds) eat clueless VCs throwing cash on the fire for lunch… and they’re salivating at what’s ahead. It’s a “Big Short” once in 20-30 years type opportunity.

No - it's very hard to successfully bet against anything in finance, and VCs and non-public investments are particularly hard. When you go long, you simply buy something and hold it until you decide to sell. If you short, you have to worry about borrowing shares, paying short fees, and having unlimited risk.

How would you even begin to bet against OpenAI specifically? The closest proxy I can think of is shorting NVDA.

There's also nobody whose job it is to make big one-time shorts. Like you said, it's a once in 20-30 years opportunity, so no one builds a hedge fund dedicated to sitting around for decades waiting for that opportunity. There will certainly be exceptions, and maybe they'll make a Big Short 2 about the scrappy underdogs who saw the opening and timed it perfectly. But the vast majority of Wall Street desperately wants the party to continue.

delusional8mo ago

> have backed themselves into a corner where the only way to stop the bubble from bursting is to keep inflating the bubble.

They are not in any corner. They rightly believe that they won't be allowed to fail. There's zero cost to inflating the bubble. If they tank a loss, it's not their money and they'll go on to somewhere else. If they get lucky (maybe skillful?) they get out of the bubble before anyone else, but get to ride it all the way to the top.

The only way they lose is if they sit by and do nothing. The upside is huge, and the downside is non-existent.

bwfan1238mo ago

oracle's announcement of a 300b purchase commitment from openai followed soon by a 100B investment into openai. The pace and size of these announcements is reaching a fever-pitch which seems like an attempt to keep the music playing.

nickdothutton8mo ago· 2 in thread

I was there in the middle of the dotcom crash and the telecoms crash which was much worse. Fiber does not rust, and while there was vast overcapacity, not all of it was lit, or indeed worth lighting. 10 years after, thanks to DWDM there were 8 strand cables where only 2 strands were lit, albeit with many more wavelengths than envisaged before. Even though demand had grown.

How much is a 10 year old GPU worth? Where is the “dwdm but for GPUs?”.

There truly are interesting times and we have the benefit of being in them.

keeda8mo ago

Just so I understand correctly, you mean that with DWDM 2 strands of cables were equivalent to 8 since DWDM as a multiplexing technology increased the capacity of each fiber, right?

> How much is a 10 year old GPU worth? Where is the “dwdm but for GPUs?”.

From other sources cited in TFA it seems GPUs won't last 3 years, let alone 10! But I think we know what the "DWDM for GPUs" is -- it's the processing efficiency gains that we've seen over the last few years which keeps driving the per-token prices sharply down.

nickdothutton8mo ago

I'm not sure where DWDM is currently, but we quickly went from 1 colour to 512 colours per strand. I expect and hope for increases in GPU performance, some via process shrink, some via wafer size/yield, and some...perhaps... by software efficiencies (most interesting to me). Maybe volumes will increase too, further driving down the price for essentially the same parts. However I don't see how these gains (if they happen) can "get us to 512".

pgspaintbrush8mo ago· 2 in thread

Are these companies developing InfiniBand-class interconnects to pair with their custom chips? Without equivalent fabric, they can’t replace NVIDIA GPUs for large-scale training.

whp_wessel8mo ago

recent Huang podcast went into this, making the point that custom chips won't be competitive to Nvidia's as they are now making specialised chips instead of just 'gpu's'.

https://open.spotify.com/episode/2ieRvuJxrpTh2V626siZYQ?si=2...

pgspaintbrush8mo ago

Thank you for the pointer!

nextworddev8mo ago· 2 in thread

This article is pretty confusing, doesn't really have a thesis, just listing some stats. Maybe that's the intent.

foundart8mo ago

I had the same response.

Certainly it suggests that “this time is different” without saying it in a quotable fashion.

The metrics it provides seem useful. What are the metrics it is missing?

nextworddev8mo ago

The thing that this doesn't get is that 10bn a year is basically 70%+ of yearly R&D and inference budget of OpenAI.. so this Nvidia deal is actually great (for OpenAI) in terms of protecting its cashflow

dehrmann8mo ago· 2 in thread

> Evidence from Google architects shows GPUs at 60-70% utilization survive 1-2 years , with 3 years maximum.

Really?! I'm not used to chips having such a short lifespan.

metadat8mo ago

This sounds like major FUD unless the data is public.

bigyabai8mo ago

Feels like a teensy tiny conflict of interest, coming from GCP.

Additionally - GPUs have multiple components. Which parts are at 60-70% load, the SM unit or the memory controller? If you're throttling the GPU but not the memory, it makes perfect sense why you're burning the damn card out...

ivape8mo ago· 2 in thread

One of the things before AI in the market was that capital had limited growth opportunities. Tech, which was basically a universe of scaled out crud apps, was where capital would keep going back into.

AI is a lot more useful than hyper scaled up crud apps. Comparing this to the past is really overfitting imho.

The only argument against accumulating GPUs is that they get old and stop working. Not that it sucks, not that it’s not worth it. As in, the argument against it is actually in the spirit of “I wish we could keep the thing longer”. Does that sound like there’s no demand for this thing?

The AI thesis requires getting on board with what Jenson has been saying:

1) We have a new way to do things

2) The old ways have been utterly outclassed

3) If a device has any semblance of compute power, it will need to be enhanced, updated, or wholesale replaced with an AI variant.

There is no middle ground to this thesis. There is no “and we’ll use AI here and here, but not here, therefore we predictably know what is to come”.

Get used to the unreal. Your web apps could truly one day be generated frame by frame by a video model. Really. The amount of compute we’ll need will be staggering.

pessimizer8mo ago

> Your web apps could truly one day be generated frame by frame by a video model. Really. The amount of compute we’ll need will be staggering.

We've technically been able to play board games by entering our moves into our telephones, sending them to a CPU to be combined, then printing out a new board on paper to conform to the new board state. We do not do this because it would be stupid. We can not depend on people starting to do this saving the paper, printer, and ink industries. Some things are not done because they are worthless.

ivape8mo ago

You know that N people can now point a webcam onto their boards and have a multi modal LLM understand everyone’s board state now, right? Literally zero programming involved, you just have to point a camera at the damn thing and maybe write some glue code.

If you’re a board game player then you are more than capable of imagining possibilities well beyond this.

1 more reply

mwkaufma8mo ago· 1 in thread

Distinguishing that in-hindsight Lucent was committing accounting fraud and present firms aren't is a load-bearing assumption here; for all we know the big players in the AI bubble just haven't been outed yet.

brazukadev8mo ago

They are infringing copyright in all ways possible, if AI fail they will also have to pay this bill. It'll be the reparations after they lose the war.

davedx8mo ago· 1 in thread

Some great insights with some less interesting in there. I didn’t know about the SPVs, that’s sketchy and now I wanna know how much of that is going on. The MIT study that gets pulled out for every critical discussion of AI was an eye roll for me. But very solid analysis of the quants.

How much of a threat is custom silicon to Nvidia remains an open question to me. I kinda think, by now, we can say they’re similar but different enough to coexist in the competitive compute landscape?

alephnerd8mo ago

> How much of a threat is custom silicon to Nvidia remains an open question to me

Nvidia has also begun trying to enter the custom silicon sector as well, but it's still largely dominated by Broadcom, Marvell, and Renesas.

xbmcuser8mo ago· 1 in thread

With all the major players like Amzn, Msft and Alphabet going for their own custom chips and restrictions on selling to China it will be interesting to see how Nvidia does.

I personally would prefer China to get to parity on node size and get competitive with nvidia. As that is the only way I see the world not being taken over by the tech oligarchy.

JCM98mo ago

The custom chips don’t seem to be gaining traction at scale. On paper the specs look good but the ecosystem isn’t there. The bubble popping and flooding the market with CUDA GPUs means it will make even less sense to switch.

cameldrv8mo ago

There was a similar circular effect in the dot com boom around ads. VCs poured money into startups, which put the money into ads on Yahoo and other properties. Yahoo was getting huge revenue from the ads, which pumped up the stock price. The rising price and revenue, and hence stock price of Yahoo pumped up the market for other dot coms, as it proved you could make money on the Internet, so the market for dot com IPOs was strong. That drew more VC money. More VC money meant more ads.

sails8mo ago

> Nvidia’s vendor financing becomes exposure to customers building competitive alternatives.

This is surely the most important line in the piece? In what world would this much demand not lead to alternatives emerging?

(Assuming the upside, yes if the demand is not there in two years then yes it’s all going to burn)

monkeydust8mo ago

Where can you track GPU utilization rates? Assuming private data but curious if not.

MASNeo8mo ago

I wonder if the buying customers of Nvidia are going to find the self’s left with the overcapacity. Certainly people are waking up to LLM challenges and as budgets focus more on useful applications, smaller language models, how much of that demand will remain.

Also, depreciation schedules beyond useful life of an asset may not be fraud but I’d call it a bit too creative for my liking.

Time will tell.

Theodores8mo ago

This reminds me of SGI at the peak of the dot-com bubble.

SGI (Silicon Graphics) made the 3D hardware that many companies relied on for their own businesses, in the days before Windows NT and Nvidia came of age.

Alias|Wavefront and Discreet were two companies where their product cycles were very tied in the SGI product cycles, with SGI having some ownership, whether it be wholly owned or spun out (as SGI collapsed). I can't find the reporting from the time, but it seemed to me that the SGI share price was propped up by product launches from the likes of Alias|Wavefront or Discreet. Equally, the 3D software houses seemed to have share prices propped up by SGI product launches.

There was also the small matter of insider trading. If you knew the latest SGI boxes were lemons then you could place your bets of the 3D software houses accordingly.

Eventually Autodesk, Computer Associates and others eventually owned all the software, or, at least, the user bases. Once upon a time these companies were on the stock market and worth billions, but then they became just another bullet point in the Autodesk footer.

My prediction is that a lot of AI is like that, a classic bubble, and, when the show moves on, all of these AI products will get shoehorned into the three companies that will survive, with competition law meaning that it will be three rather than two eventual winners.

Equally, much like what happened with SGI, Nvidia will eventually come a cropper due to the evaluations due to today's hype and hubris not delivering.

mooreds8mo ago

Good analysis.

But the answer is, "kinda"? There are similarities, but the AI buildout is worse in some ways (more concentration, GPU backed debt) and better in others (capacity is being used, vendors actually have cash flow).

The conclusion:

> Unlike the telecom bubble, where demand was speculative & customers burned cash , this merry-go-round has paying riders.

Seems a little short sighted to me. IMO, there is a definite echo, but we are in the mid-late stage, not the end stage.

It's simply not fair to compare Lucent at the end of a bubble with Nvidia in the middle, and that is what the author did.

If you haven't listened to the referenced interview between Thompson and Kedrosky, I'd do so: https://www.theringer.com/podcasts/plain-english-with-derek-...

spaceballbat8mo ago

Looking at the last chapter of the essay, there was a lot of illegal activity by lucent in the runup to the collapse. Today, We won’t know the list of shady practices until the bubble bursts. I doubt Tom could legally speculate, he’d likely be sued into oblivion if he even hinted at malfeasance by these trillion dollar companies.

cl428mo ago

Great points. I am bullish on AI but also wary of accounting practices. Tom says Nvidia's financials are different from Lucent's but that doesn't mean we shouldn't be wary.

The Economist has a great discussion on depreciation assumptions having a huge impact on how the finances of the cloud vendors are perceived[1].

Revenue recognition and expectations around Oracle could also be what bursts the bubble. Coreweave or Oracle could be the weak point, even if Nvidia is not.

[1] https://www.economist.com/business/2025/09/18/the-4trn-accou...

ekjhgkejhgk8mo ago

https://capitalgains.thediff.co/p/vendor-financing

yalogin8mo ago

The telecom bubble built infrastructure for something that didn’t exist, they built anticipating the need for high didn’t come in time.

The gpu bubble is different. Nvidia is actually selling gpus in spades. So it’s not comparable to the telecom bubble. Now the question remains how many more gpus can they sell? That depends on the kind of services that are built and how their adoption takes off. So now is it a bubble or just frothy at the top? There is definitely going to be a pull back and some adjustment, but I cannot say how bad it is

rossdavidh8mo ago

"This time it's different"

redwood8mo ago

TLDR: Lucent was committing various forms of accounting fraud and had an unhealthy cash flow position, and had their primary customers on economically dangerous ground. Nvidia meanwhile appears to be above board, has strong cash flow, and has extremely strong dominant customers (eg customers that could reduce spending but can survive a downturn). Therefore there's no clear takeaway: similarities but also differences. Risk and a lot of debt as well as hyperscalers insulating themselves from some of that risk... but at the same time as lot more cash to burn.

j / k navigate · click thread line to collapse

213 comments

131 comments · 31 top-level

mg8mo ago· 24 in thread

    Fiber networks were using less
    than 0.002% of available capacity,
    with potential for 60,000x speed
    increases. It was just too early.

Mo38mo ago

> I doubt we will see unused GPU capacity

Workaccount28mo ago

>Meanwhile LLM fatigue has started to set in

Has it?

I think there is this compulsion to think that LLMs are made for senior devs, and if devs are getting wary of LLMs, the experiment is over.

From OpenAI, programming is ~4% of chatGPTs usage. That's 96% being used for other stuff.

I don't see any realistic or grounded forecast that includes a diminishing demand for compute. We're still at the tip of adoption...

2 more replies

xadhominemx8mo ago

delusional8mo ago

> There's no way this won't end up with a lot of idle GPUs.

Nvidia is betting the farm on reinventing GPU compute every 2 years. The GPUs wont end up idle, because they will end up in landfills.

Do I believe that's likely, no, but it is what I believe Nvidia is aiming for.

goalieca8mo ago

What’s the lifetime of these things once they’ve been running hot for 2-3 years

brazukadev8mo ago

This. I just found out that for my MCP needs, Qwen3 4B running local is good enough! So I just stopped using Gemini API.

idiotsecant8mo ago

Your bet is that people will simply use less compute, for the first time in the history of the human race?

1 more reply

mg8mo ago

Look at the human body.

2% of it is dedicated to thinking.

My guess is that as a species, we will turn a similar percentage of our environment into thinking matter.

If there are a billion houses on planet earth, 2% of it are 20 million datacenters we still have to build.

1 more reply

credit_guy8mo ago

bigbadfeline8mo ago

> "Try different ways to refactor it. Tomorrow, show me your best solution."

The cost/benefit analysis doesn't add up for two reasons:

First, a refactored codebase works almost the same as non-refactored one, that is, the tangible benefit is small.

Second, how many times are you going to refactor the codebase? Once and... that's it. There's simply no need for that much compute for lack of sufficient beneficial work.

That is, the present investments are going to waste unless we automate and robotize everything, I'm OK with that but it's not where the industry is going.

skrebbel8mo ago

> improved codebase

I've seen lots of claims about AI coding skill, but that one might be able to improve (and not merely passably extend) a codebase is a new one. I'd want to see it before I believe it.

Leynos8mo ago

Other things the agent might identify this way: multiple implications, use of conflicted APIs, poor separation of concerns at a module or class level.

I don't typically let the agent do any of this end to end, but I would typically manually review findings before spawning subagents with those findings.

fragmede8mo ago

Claude will refactor but more than that, it can add documentation. And it can be asked about a codebase too. "Where does FOO happen?" "How does BAR work?".

thenaturalist8mo ago

This is such a short sighted take glaringly ommitting a crucial ingredient in learning or improvement - both for humans or machines alike: feedback loops.

And you can't really hack / outsmart feedback loops.

Just because something is conceptually possible, interaction with the real rest of the world separates a possible from an optimal solution.

The low hanging fruits/ obvious incremental improvements might be quickly implemented by LLMs based on established patterns in their training data.

That doesn't get you from 0 to 1 dollar, though and that's what it's all about.

bwfan1238mo ago

this. Was highlighted by Sutton in a recent podcast rather starkly.

cantor_S_drug8mo ago

With improvements on the algorithm side and new techniques, even older hardware will become useful.

Zigurd8mo ago

I get what you're saying and the reasoning behind it, but older hardware has never been useful where power consumption is part of determining usefulness.

chatmasta8mo ago

2 more replies

jdlshore8mo ago

> As soon as we can prompt…

Workaccount28mo ago

Getting gold in the math Olympiad is a pretty strong indicator of operating independently on substantive problems.

A year ago they need an extensive harness to get silver, and two years ago they could hardly multiply 1000x10000.

Terence Tao tweeted yesterday about using GPT5 to help quickly solve a problem he was working on.

1 more reply

ccorcos8mo ago

yubblegum8mo ago

ninkendo8mo ago

The actual computation speed isn't as important nowadays but it doesn't really change the conclusion with respect to whether they're underutilized.

davedx8mo ago

Tasks being memory bound is not the same thing as GPU's being idle for economic reasons though.

stephc_int138mo ago· 10 in thread

Knowing history of past bubbles is only mildly informative. The dotcom bubble was different than the railroads bubble etc.

The only thing to keep in mind is that all of this is about business and ROI.

Given the colossal investments, even if the companies finances are healthy and not fraudulent, the economic returns have to be unprecedented or there will be a crash.

They are all chasing a golden goose.

jauntywundrkind8mo ago

N70Phone8mo ago

It is folly to take these statements at their words.

Bezos is just saying shit to generate hype. All these executives are just saying shit. There is no plan. You must treat these people as morons who understand nothing.

1 more reply

gruez8mo ago

Why does "Not needing labor at all" need to be in space?

1 more reply

davedx8mo ago

chairmansteve8mo ago

>10 GW+ data-centers in space.

We are a long way from that. At least 10 years, probably never gonna happen.

delusional8mo ago

> only mildly informative

Mistletoe8mo ago

stephc_int138mo ago

Of course, CFOs are all very aware of what failed previously.

Printerisreal8mo ago

This time they have fiat money and government on their side. so that is also different.

candiddevmike8mo ago

It just means we're all going to be hurt by the collapse, not just investors. In line with socialized loses, privatized profits.

1 more reply

digitcatphd8mo ago· 10 in thread

Simple as this - as to why its just not possible for this to continue.

xadhominemx8mo ago

JumpinJack_Cash8mo ago

> > the eve of possibly the most important technology transition in the history of humankind.

1 more reply

digitcatphd8mo ago

What do you think is going to happen to their earnings when CAPEX slows?

1 more reply

Printerisreal8mo ago

2 more replies

SiempreViernes8mo ago

Since you aren't talking about the green transition, whatever technology transition you have in mind is obviously second most important at best.

1 more reply

swexbe8mo ago

What are you talking about? It's trading at 55x EPS and 41x forward EPS.

1 more reply

bwfan1238mo ago

Workaccount28mo ago

I'm pretty sure Nvidia sells out all their cards instantly once they are announced.

Their margin is ridiculous and they are still unable to meet demand.

JCM98mo ago

cyanydeez8mo ago

TSLA is the same. Tge market is basically a new rich persons bank, abstracted by loans and lines of credit.

Obviously its a bubble but thats meaningless for anyone but the richest to manage.

The rest of us are just ants.

pragmatic8mo ago· 8 in thread

At a "telecom of telecom" we (they) were still lighting up dark fiber 15 years later (2015) when mobile data for cell carriers finally created enough demand. Hard to fathom the amount of overbuild.

The only difference is fiber optic lines remained useful the whole time. Will these cards have the same longevity?

(I have no idea just sharing anecdata)

Zigurd8mo ago

New fiber isn't significantly more power efficient. The other side of the coin is that backhoes haven't become more efficient since the fiber was buried.

beerandt8mo ago

Directional drilling is a game changer and has become accessible in the last decade.

You're looking for advancement in carriages unaware of the 'automobile' that made 5g and ftth deployment at scale possible.

1 more reply

heisenbit8mo ago

> Will these cards have the same longevity?

The article cites anecdotal 1-2 years due to the significant stress.

pragmatic8mo ago

In 2005 telecom was a cash cow because of long distance charges and if your mechanical phone switch was paid off you were printing money (regulations guaranteed revenue)

Add written an things business, irrational exuberance can kill you.

hyghjiyhu8mo ago

I think the chips themselves won't have longevity, but the r&d gone into them is useful. Question is whether the value of that can be captured.

adventured8mo ago

mjcl8mo ago

pragmatic8mo ago

Everything in that data center is depreciating as soon as they turn on the power.

AC, ups, generators not to mention the severs.

That's the thing with fiber it was still useful. The cards at either end are easy to add, waaaayyy cheaper and higher perf (they're were no cards on end of dark lines) 15 years later.

narmiouh8mo ago· 8 in thread

I think the fundamental issue is the uncertainty of achieving AGI with baked in fundamentals of reasoning.

Almost 90% of topline investments appear to be geared around achieving that in the next 2-5 years.

If that doesn’t come to pass soon enough, investors will loose interest.

Whats next is to show real results, in true software development, cancer research, robotics.

I am highly doubtful the current model architecture will get there.

Zigurd8mo ago

chuckadams8mo ago

> AGI is not near

5 more replies

cl428mo ago

Not sure why you're getting downvoted.

If you speak with AI researchers, they all seem reasonable in their expectations.

12 months later, when things don't work out, their response to AI goes to the other end of the spectrum -- anger, avoidance, suspicion of new products, etc.

Cheer21718mo ago

> their expectations are NOT reasonable. They expect ChatGPT to do their entire job for $20/month and hire, plan, budget accordingly.

This is entirely on the AI companies and their boosters. Sam Altman literally says gpt 5 is "like having a team of PhD-level experts in your pocket." All the commercials sell this fantasy.

3 more replies

belter8mo ago

> If you speak with AI researchers, they all seem reasonable in their expectations.

If you want to colonize the Solar System the track is clear. If you to have Fusion, the track is clear. AGI track ?

1 more reply

narmiouh8mo ago

I'm not sure either - for a second I thought perhaps llm agents are prowling around to ensure the right messages are floating up, but who knows...

xadhominemx8mo ago

Hyperscalers are only spending less than half of their operating cash flows on AI capex. Full commitment to achieving AGI within a few years would look much different.

stevenhuang8mo ago

This doubt of yours is as credible as all the claims 5 years ago that we will never have capable thinking machines, yet here we are with LLMs.

hackthemack8mo ago· 7 in thread

But, wouldn't you know it, the Telecommunication companies sued the FCC and the Telecommunications Act was gutted in 2003

https://en.wikipedia.org/wiki/Competitive_local_exchange_car...

marcusb8mo ago

> I think a T1 went from 1800 a month in 1996, to around 600 a month in 1999. It was a long time ago, so my memory is hazy.

jauntywundrkind8mo ago

bwfan1238mo ago

echelon8mo ago

Capitalism only works if there is lots of competition.

Monopolies gum up the system, reward the institutional capital rather than innovation capital, and prevent new entrants from de-ossifying and being the renewing forest fire.

We've been so lax on antitrust. Google, Apple, Meta, Amazon - they all need to be broken up. Our economy and our profession would be better for it.

Innovation should be a treadmill.

YC and a16z want this.

2 more replies

awongh8mo ago

You're implying only 4 years of regulation was enough to shift the balance of power between telecoms and smaller ISPs."

I've never heard anyone claim before that just having these laws on the books for a small period of time is "enough".

watwut8mo ago

> 've never heard anyone claim before that just having these laws on the books for a small period of time is "enough".

Why would it be enough? This legislation prevents monopolies from abusing position, therefore we will repeal it the moment it turns out to be useful?

Yeah, it takes time to consolidate power again, that does not mean the legislation is not good.

1 more reply

nroets8mo ago

Changes in the legislative landscaped may have influenced the timing of the price war and the telecoms bubble.

But the price war was inevitable. And the telecoms bubble was highly likely in any case.

Telecoms investment was a response to crazy valuations of dot-com stocks.

rglullis8mo ago· 6 in thread

The one thing I don't understand is this assumption that demand for GPUs for training is going to keep growing at the rate they grew so far.

porridgeraisin8mo ago

Inference will be cheapest when run in a shared cloud environment, simply due to the LLMs roofline. Thus, most B2B use cases are likely to be datacenter based, like AWS today.

rglullis8mo ago

I do not buy this premise. I think it will end up being cheaper to simply run the LLMs directly on the user device.

1 more reply

Leynos8mo ago

I expect models will get larger again once everyone is doing their inference on B200s, but the RL training budget is where the insatiable appetite sits right now.

bix68mo ago

Isn’t the whole point of the arms race that the more GPUs you have the closer you get to AGI? Which is the supposed goal here.

rglullis8mo ago

Leynos8mo ago

I suspect continuous learning will be the next driver of GPU usage.

ekjhgkejhgk8mo ago· 4 in thread

In reference 14 we read

> However, what’s become clear is that OpenAI plans to pay for Nvidia’s graphics processing units (GPUs) through lease arrangements, rather than upfront purchases

sbuttgereit8mo ago

ekjhgkejhgk8mo ago

Thanks for your reply, but:

1 more reply

foundart8mo ago

Speculation by a non-accountant:

At the end Nvidia retains ownership of what are probably very low value assets.

Contrast that with car leases: there is a robust market for used cars.

Nvidia is in effect financing the GPUs by not requiring the full payment up front.

Do the lease payments add up to the total cost?

ekjhgkejhgk8mo ago

The GPUs do/might lose value over time, but perhaps Nvidia is betting that they wont? Going by recent years, their value went up in some cases.

EDIT: Interesting note

dangus8mo ago· 4 in thread

I think we are at the PS3/Xbox 360 phase of AI.

By that I mean, those were the last consoles where performance improvements delivered truly new experiences, where the hardware mattered.

Today, any game you make for a modern system is a game you could have made for the PS3/Xbox 360 or perhaps something slightly more powerful.

jcranmer8mo ago

> By that I mean, those were the last consoles where performance improvements delivered truly new experiences, where the hardware mattered.

ben_w8mo ago

> By that I mean, those were the last consoles where performance improvements delivered truly new experiences, where the hardware mattered.

I hope that's where we are, because that means my experience will still be valuable and vibe coding remains limited to "only" tickets that take a human about half a day, or a day if you're lucky.

Given the cost needed for improvements, it's certainly not implausible…

…but it's also not a sure thing.

I tried "Cursor" for the first time last week, and just like I've been experiencing every few months since InstructGPT was demonstrated, it blew my mind.

* we used to call stuff like this "photorealistic": https://www.reddit.com/r/gaming/comments/ktyr1/unreal_yes_th...

BlueTemplar8mo ago

We did get more : the return of VR couldn't have been possible without drastically improved hardware.

But the way how it stayed niche shows how it's not just about new gameplay experiences.

Compare with the success of the Wii Sports and Wii Fit, which I would guess managed it better, though through a different kind of hardware that you are thinking about ?

And I kind of expect the next Nintendo console to have a popular AR glasses option, which also would only have been made possible thanks to improving hardware (of both kinds).

dangus8mo ago

I could be very wrong, obviously.

alephnerd8mo ago· 3 in thread

Glad to see Tom's blog on HN - as usual a great write up. A number of us have been chatting about this for several months now, and the take is fairly sober.

Meta commentary but I've grown weary of how commentary by actual domain experts in our industry are underrepresented and underdiscussed on HN in favor of emotionally charged takes.

dvt8mo ago

> actual domain experts

The incentives are, even in a true "domain expert" case (think: doctors, engineers, economists), often opaque. But when it comes to VCs, this gets ratcheted up by an order of magnitude.

alephnerd8mo ago

hodgesrm8mo ago

Martin Casado is a counter-example. His writings on technology starting with his phd thesis are very informative. [0] He’s the real thing as are many others.

[0] http://yuba.stanford.edu/~casado/mcthesis.pdf

JCM98mo ago· 3 in thread

ProjectArcturis8mo ago

How would you even begin to bet against OpenAI specifically? The closest proxy I can think of is shorting NVDA.

delusional8mo ago

> have backed themselves into a corner where the only way to stop the bubble from bursting is to keep inflating the bubble.

The only way they lose is if they sit by and do nothing. The upside is huge, and the downside is non-existent.

bwfan1238mo ago

nickdothutton8mo ago· 2 in thread

How much is a 10 year old GPU worth? Where is the “dwdm but for GPUs?”.

There truly are interesting times and we have the benefit of being in them.

keeda8mo ago

Just so I understand correctly, you mean that with DWDM 2 strands of cables were equivalent to 8 since DWDM as a multiplexing technology increased the capacity of each fiber, right?

> How much is a 10 year old GPU worth? Where is the “dwdm but for GPUs?”.

nickdothutton8mo ago

pgspaintbrush8mo ago· 2 in thread

Are these companies developing InfiniBand-class interconnects to pair with their custom chips? Without equivalent fabric, they can’t replace NVIDIA GPUs for large-scale training.

whp_wessel8mo ago

recent Huang podcast went into this, making the point that custom chips won't be competitive to Nvidia's as they are now making specialised chips instead of just 'gpu's'.

https://open.spotify.com/episode/2ieRvuJxrpTh2V626siZYQ?si=2...

pgspaintbrush8mo ago

Thank you for the pointer!

nextworddev8mo ago· 2 in thread

This article is pretty confusing, doesn't really have a thesis, just listing some stats. Maybe that's the intent.

foundart8mo ago

I had the same response.

Certainly it suggests that “this time is different” without saying it in a quotable fashion.

The metrics it provides seem useful. What are the metrics it is missing?

nextworddev8mo ago

dehrmann8mo ago· 2 in thread

> Evidence from Google architects shows GPUs at 60-70% utilization survive 1-2 years , with 3 years maximum.

Really?! I'm not used to chips having such a short lifespan.

metadat8mo ago

This sounds like major FUD unless the data is public.

bigyabai8mo ago

Feels like a teensy tiny conflict of interest, coming from GCP.

ivape8mo ago· 2 in thread

AI is a lot more useful than hyper scaled up crud apps. Comparing this to the past is really overfitting imho.

The AI thesis requires getting on board with what Jenson has been saying:

1) We have a new way to do things

2) The old ways have been utterly outclassed

3) If a device has any semblance of compute power, it will need to be enhanced, updated, or wholesale replaced with an AI variant.

There is no middle ground to this thesis. There is no “and we’ll use AI here and here, but not here, therefore we predictably know what is to come”.

Get used to the unreal. Your web apps could truly one day be generated frame by frame by a video model. Really. The amount of compute we’ll need will be staggering.

pessimizer8mo ago

> Your web apps could truly one day be generated frame by frame by a video model. Really. The amount of compute we’ll need will be staggering.

ivape8mo ago

If you’re a board game player then you are more than capable of imagining possibilities well beyond this.

1 more reply

mwkaufma8mo ago· 1 in thread

brazukadev8mo ago

They are infringing copyright in all ways possible, if AI fail they will also have to pay this bill. It'll be the reparations after they lose the war.

davedx8mo ago· 1 in thread

alephnerd8mo ago

> How much of a threat is custom silicon to Nvidia remains an open question to me

Nvidia has also begun trying to enter the custom silicon sector as well, but it's still largely dominated by Broadcom, Marvell, and Renesas.

xbmcuser8mo ago· 1 in thread

With all the major players like Amzn, Msft and Alphabet going for their own custom chips and restrictions on selling to China it will be interesting to see how Nvidia does.

I personally would prefer China to get to parity on node size and get competitive with nvidia. As that is the only way I see the world not being taken over by the tech oligarchy.

JCM98mo ago

cameldrv8mo ago

sails8mo ago

> Nvidia’s vendor financing becomes exposure to customers building competitive alternatives.

This is surely the most important line in the piece? In what world would this much demand not lead to alternatives emerging?

(Assuming the upside, yes if the demand is not there in two years then yes it’s all going to burn)

monkeydust8mo ago

Where can you track GPU utilization rates? Assuming private data but curious if not.

MASNeo8mo ago

Also, depreciation schedules beyond useful life of an asset may not be fraud but I’d call it a bit too creative for my liking.

Time will tell.

Theodores8mo ago

This reminds me of SGI at the peak of the dot-com bubble.

SGI (Silicon Graphics) made the 3D hardware that many companies relied on for their own businesses, in the days before Windows NT and Nvidia came of age.

There was also the small matter of insider trading. If you knew the latest SGI boxes were lemons then you could place your bets of the 3D software houses accordingly.

Equally, much like what happened with SGI, Nvidia will eventually come a cropper due to the evaluations due to today's hype and hubris not delivering.

mooreds8mo ago

Good analysis.

The conclusion:

> Unlike the telecom bubble, where demand was speculative & customers burned cash , this merry-go-round has paying riders.

Seems a little short sighted to me. IMO, there is a definite echo, but we are in the mid-late stage, not the end stage.

It's simply not fair to compare Lucent at the end of a bubble with Nvidia in the middle, and that is what the author did.

If you haven't listened to the referenced interview between Thompson and Kedrosky, I'd do so: https://www.theringer.com/podcasts/plain-english-with-derek-...

spaceballbat8mo ago

cl428mo ago

Great points. I am bullish on AI but also wary of accounting practices. Tom says Nvidia's financials are different from Lucent's but that doesn't mean we shouldn't be wary.

The Economist has a great discussion on depreciation assumptions having a huge impact on how the finances of the cloud vendors are perceived[1].

Revenue recognition and expectations around Oracle could also be what bursts the bubble. Coreweave or Oracle could be the weak point, even if Nvidia is not.

[1] https://www.economist.com/business/2025/09/18/the-4trn-accou...

ekjhgkejhgk8mo ago

https://capitalgains.thediff.co/p/vendor-financing

yalogin8mo ago

The telecom bubble built infrastructure for something that didn’t exist, they built anticipating the need for high didn’t come in time.

rossdavidh8mo ago

"This time it's different"

redwood8mo ago

j / k navigate · click thread line to collapse