Why would reasoning have plateaued?
I think reasoning ability is not the largest bottleneck for improvement in usefulness right now. Cost is a bigger one IMO.
Running these models as agents is hella expensive, and agents, or agent-like recurrent reasoning (the kind humans do), are the key to improved performance; look at any type of human intelligence and you'll find it's iterative.
Single-shot performance only gets you so far.
For example: if a model can write code 90% of the way and then debug it in a loop, it will outperform any single-shot approach.
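To make the generate-and-debug loop concrete, here's a minimal sketch. The `model_attempt` function is a hypothetical stand-in for an LLM call (not a real API); it's hard-coded to produce a buggy first draft and a fixed version once it sees test feedback, which is exactly the dynamic the loop exploits.

```python
# Sketch of single-shot vs. debug-in-a-loop.
# `model_attempt` is a pretend LLM: its first draft has an off-by-one
# bug, and it only produces a fix after receiving test feedback.

def model_attempt(task, feedback=None):
    if feedback is None:
        return lambda n: sum(range(n))      # buggy draft: misses n itself
    return lambda n: sum(range(n + 1))      # corrected after feedback

def run_tests(fn):
    """Return an error message, or None if all tests pass."""
    for n, expected in [(3, 6), (5, 15)]:
        if fn(n) != expected:
            return f"fn({n}) returned {fn(n)}, expected {expected}"
    return None

def debug_loop(task, max_iters=3):
    """Regenerate until the tests pass (or we run out of iterations)."""
    feedback = None
    for _ in range(max_iters):
        fn = model_attempt(task, feedback)
        feedback = run_tests(fn)
        if feedback is None:
            return fn                       # tests pass: done
    return fn
```

Single-shot (`model_attempt(task)`) fails the tests here, while `debug_loop(task)` converges on a correct function in one extra round, but each round is another full model call, which is why cost, not raw reasoning, becomes the bottleneck.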
And OpenAI probably has huge models sitting in their basement. But they might not be much more useful than GPT-4 when run single-shot. I mean, what could such a model do that we can't already do with GPT-4?
It’s agents and recurrent reasoning we need for more usefulness.
At least, that's my humble opinion as an amateur neuroscientist who plays around with these models.