Maybe it is, but doing that with the entire scene instead of just a small part of it makes the problem massively harder, as the model needs to grow dramatically to remember more things. It isn't something we will manage anytime soon, maybe 10-20 years away with the current architecture and the same rate of compute progress.
Then you make it even harder by asking it to remember a whole game level? No, that isn't going to happen in our lifetimes without massive changes to the architecture. You'd need a separate model keeping track of level state and so on, not just an image-to-image model.
In this Sora video the dragon covers half the scene, and it's basically identical when it is revealed again ~5 seconds (about 150 frames at 30 fps) later. There is lots of evidence (and some studies) that these models are in fact building internal world models.
https://www.youtube.com/watch?v=LXJ-yLiktDU
Buckle in, the train is moving much faster than that. I wouldn't be surprised if this is solved within the next few generations of video generators; the first generation is already doing very well.
You always get this from AI enthusiasts: they come and post "proof" that disproves their own point.