undefined | Better HN

0 pointswhynotminot1y ago0 comments

> Like they've been making it all this time?

Yes exactly like they’ve been doing this whole time, with the cost of running each model massively dropping sometimes even rapidly after release.

0 comments

YeGoblynQueenne1y ago

No, the cost of training is the one that isn't dropping any time soon. When data, compute and parameters increase, then the cost increases, yes?

whynotminotOP1y ago

Do you understand the difference between training and inference?

Yes, it costs a lot to train a model. Those costs go up. But once you trained it, it’s done. At that point inference — the actual execution/usage of the model — is the cost you worry about.

Inference cost drops rapidly after a model is released as new optimizations and more efficient compute comes online.

feznyng1y ago

That’s precisely what’s different about this approach. Now the inference itself is expensive because the system spends far more time coming up with potential solutions and searching for the optimal one.

1 more reply

YeGoblynQueenne1y ago

>> Do you understand the difference between training and inference?

Oh yes indeed-ee-o and I'm referring to training and not inference because the big problem is the cost of training, not inference. The cost of training has increased steeply with every new generation of models because it has to, in order to improve performance. That process has already reached the point where training ever larger models is prohibitively expensive even for companies with the resources of OpenAI. For example, the following is from an article that was posted on HN a couple days ago and is basically all about the overwhelming cost to train GPT-5:

In mid-2023, OpenAI started a training run that doubled as a test for a proposed new design for Orion. But the process was sluggish, signaling that a larger training run would likely take an incredibly long time, which would in turn make it outrageously expensive. And the results of the project, dubbed Arrakis, indicated that creating GPT-5 wouldn’t go as smoothly as hoped.

(...)

Altman has said training GPT-4 cost more than $100 million. Future AI models are expected to push past $1 billion. A failed training run is like a space rocket exploding in the sky shortly after launch.

(...)

By May, OpenAI’s researchers decided they were ready to attempt another large-scale training run for Orion, which they expected to last through November.

Once the training began, researchers discovered a problem in the data: It wasn’t as diversified as they had thought, potentially limiting how much Orion would learn.

The problem hadn’t been visible in smaller-scale efforts and only became apparent after the large training run had already started. OpenAI had spent too much time and money to start over.

From:

https://archive.ph/L7fOF

HN discussion:

https://news.ycombinator.com/item?id=42485938

"Once you trained it it's done" - no. First, because you need to train new models continuously so that they pick up new information (e.g. the name of the President of the US). Second because companies are trying to compete with each other and to do that they have to train bigger models all the time.

Bigger models means more parameters and more data (assuming there is enough which is a whole other can of worms) more parameters and data means more compute and more compute means more millions, or even billions. Nothing in all this is suggesting that costs are coming down in any way, shape or form, and yep, that's absolutely about training and not inference. You can't do inference before you do training, you need to train continuously, and for that reason you can't ignore the cost of training and consider only the cost of inference. Inference is not the problem.

j / k navigate · click thread line to collapse

0 comments

YeGoblynQueenne1y ago

No, the cost of training is the one that isn't dropping any time soon. When data, compute and parameters increase, then the cost increases, yes?

whynotminotOP1y ago

Do you understand the difference between training and inference?

Yes, it costs a lot to train a model. Those costs go up. But once you trained it, it’s done. At that point inference — the actual execution/usage of the model — is the cost you worry about.

Inference cost drops rapidly after a model is released as new optimizations and more efficient compute comes online.

feznyng1y ago

1 more reply

YeGoblynQueenne1y ago

>> Do you understand the difference between training and inference?

(...)

By May, OpenAI’s researchers decided they were ready to attempt another large-scale training run for Orion, which they expected to last through November.

Once the training began, researchers discovered a problem in the data: It wasn’t as diversified as they had thought, potentially limiting how much Orion would learn.

The problem hadn’t been visible in smaller-scale efforts and only became apparent after the large training run had already started. OpenAI had spent too much time and money to start over.

From:

https://archive.ph/L7fOF

HN discussion:

https://news.ycombinator.com/item?id=42485938

j / k navigate · click thread line to collapse