undefined | Better HN

0 pointsPowerElectronix1mo ago0 comments

On top of that, AI providers are also eating a big loss on the service.

0 comments

14 comments · 3 top-level

tempay1mo ago· 9 in thread

Are they? I only ever see unsubstantiated claims for this whereas I see many justifications that interference is comfortably profitable in isolation.

tomelders1mo ago

SpaceX's has disclosed that they're loosing $2Bln a quarter on A.I - and rising - in their IPO documents.

Anthropic told the Department of War-nee-Defence that they'd made $5bln total, which is a lot LOT less than what they're spending.

We'll see what's in OpenAi's IPO later this year I guess. I'll be very surprised if they're losing less that $100bln a year.

koliber1mo ago

Is it capex of training new models and hiring people for 250mln pay packages? Or is it opex running inference?

1 more reply

ai_fry_ur_brain1mo ago

Its basic math, go calculate max sessions for a certain tps on any hardware. Session# * tps * 86400 (secs in a day) * 30 days.

You'll realize real quick its not profitible. You cant just say things you don't like to hear are unsubstantiated without verifying.

Not to mention, subscriptions.. $2mm in GPUs being given out for 5 hrs a day at a cost of $200 a month.

I could easily say that everyone who says its profitible is msking unsubstantiated claims lol.

fauigerzigerk1mo ago

>Its basic math

Yes, once you have modeled the problem correctly and you know all the input parameters. This is not that: Session# * tps * 86400 (secs in a day) * 30 days.

I don't think there is enough public information to check Anthropic's claims regarding inference profitability. It depends not just on unknown technical factors but also on agreements they have with other companies.

1 more reply

mr_mitm1mo ago

We should specify which subscription plan we are talking about. You seem to be talking about the Anthropic Claude Max plan. I think it's consensus that these flat rate type of subscriptions are loss leaders, as they come with restrictions how you can use the API via T&C, namely only with Claude Code et al. They are meant to hook developers into their products.

Shouldn't we compare the API pricing, where we pay per token? The whole point of local inference is that we don't have any restrictions regarding product use or time limits, so it would only be fair if we compare it to a plan that offers the same. And even that is only a first approximation, because the commercial models are usually much more capable than the open weight models.

mbesto1mo ago

> I could easily say that everyone who says its profitible is msking unsubstantiated claims lol.

And people who don't understand the difference between capex and opex are making uneducated claims. It's not basic math.

Running an inference data center is a mix of variable and fixed costs. The fixed costs are currently in the billions of billions of dollars for pretty much any investment in this space. Many of those fixed costs have (currently) unknown refresh cycles. So, unless you have access to the financial books of these companies it's currently just speculation whether inference is profitable.

adastra221mo ago

You got numbers? Because it seems perfectly possible to me. OpenAI and Anthropic’s marginal cost for inference is certainly far less than their API pricing.

2 more replies

brightball1mo ago

To some degree I think there's a hope that it becomes like a gym membership. If everybody used their membership, the gym would be too crowded. It's all of those memberships that people feel like they need to have but don't use where the extra profit comes in.

As long as the power users are paying per token, everything is good.

1 more reply

exploderate1mo ago

Especially since their costs might be multi-year investments. It's too early to judge the quality of those investments.

solumunus1mo ago· 2 in thread

Supposedly Anthropic just reported that they’re operationally profitable. So maybe not?

akho1mo ago

"operationally" implies that capex (which I would assume includes datacenters, gpus, and r&d) is not in. So the big news is that they can now pay for electricity and sysadmin.

camdenreslink1mo ago

I believe they also excluded stock-based compensation from their calculation, which could easily tip them in the non-profitable direction.

ankit2191mo ago

at a gross margin level, mostly no. if you include the cost of training a model as full R&D then possibly yes.

Batch size is what you should look at. If a cluster is running and processing one request, filling the batch has almost no marginal cost (kv cache creation/storage/fetch costs aside). But if the concurrent requests exceed batch size, one extra request would cost basically the rent cost of entire new cluster. APIs have the bursty nature so companies would plan to price it such that they are profitable / break even at 40%-50% utilization (% of filled batch for simplicity). so any extra request would not have the same cost as long as they are alongside an api request. you might think it degrades teh performance. easy: just assign a priority tier to api requests, and a lower tier to subscription requests.

its even more effective and powerful now that you have continuous batching. so likely if the api is being used, they are not eating any loss, let alone "big loss"

j / k navigate · click thread line to collapse

0 comments

14 comments · 3 top-level

tempay1mo ago· 9 in thread

Are they? I only ever see unsubstantiated claims for this whereas I see many justifications that interference is comfortably profitable in isolation.

tomelders1mo ago

SpaceX's has disclosed that they're loosing $2Bln a quarter on A.I - and rising - in their IPO documents.

Anthropic told the Department of War-nee-Defence that they'd made $5bln total, which is a lot LOT less than what they're spending.

We'll see what's in OpenAi's IPO later this year I guess. I'll be very surprised if they're losing less that $100bln a year.

koliber1mo ago

Is it capex of training new models and hiring people for 250mln pay packages? Or is it opex running inference?

1 more reply

ai_fry_ur_brain1mo ago

Its basic math, go calculate max sessions for a certain tps on any hardware. Session# * tps * 86400 (secs in a day) * 30 days.

You'll realize real quick its not profitible. You cant just say things you don't like to hear are unsubstantiated without verifying.

Not to mention, subscriptions.. $2mm in GPUs being given out for 5 hrs a day at a cost of $200 a month.

I could easily say that everyone who says its profitible is msking unsubstantiated claims lol.

fauigerzigerk1mo ago

>Its basic math

Yes, once you have modeled the problem correctly and you know all the input parameters. This is not that: Session# * tps * 86400 (secs in a day) * 30 days.

1 more reply

mr_mitm1mo ago

mbesto1mo ago

> I could easily say that everyone who says its profitible is msking unsubstantiated claims lol.

And people who don't understand the difference between capex and opex are making uneducated claims. It's not basic math.

adastra221mo ago

You got numbers? Because it seems perfectly possible to me. OpenAI and Anthropic’s marginal cost for inference is certainly far less than their API pricing.

2 more replies

brightball1mo ago

As long as the power users are paying per token, everything is good.

1 more reply

exploderate1mo ago

Especially since their costs might be multi-year investments. It's too early to judge the quality of those investments.

solumunus1mo ago· 2 in thread

Supposedly Anthropic just reported that they’re operationally profitable. So maybe not?

akho1mo ago

"operationally" implies that capex (which I would assume includes datacenters, gpus, and r&d) is not in. So the big news is that they can now pay for electricity and sysadmin.

camdenreslink1mo ago

I believe they also excluded stock-based compensation from their calculation, which could easily tip them in the non-profitable direction.

ankit2191mo ago

at a gross margin level, mostly no. if you include the cost of training a model as full R&D then possibly yes.

its even more effective and powerful now that you have continuous batching. so likely if the api is being used, they are not eating any loss, let alone "big loss"

j / k navigate · click thread line to collapse