undefined | Better HN

0 pointsjohnyzee2y ago0 comments

> I do think we'll hit a soft ceiling in the not too distant future ... it's going to plateau and progress will become substantially more gradual.

I don't think this will age well.

It's a matter of simple compute power to advance from realistic text/token prediction, to realistic synthesis of stuff like human (or animal) body movement, for all kinds of situations, including realistic facial/body language, moods, and so on. Of course perfect voice synthesis. Coupled with good enough robotics, you can see where I'm going with this, and that's only because my imagination is limited to sci-fi movie tropes. I think this is going to be wilder than we can imagine, while still just copying training sets.

0 comments

2 comments · 2 top-level

FiberBundle2y ago

Isn't video prediction a substantially harder problem than text prediction? At least that was the case a couple of years ago with RNNs/LSTMs. Haven't kept up with the research, maybe there's been progress.

troupo2y ago

> It's a matter of simple compute power to advance

Yup. It's "just" a compute advance away. Never mind it's already consuming as much computing as we can throw at it. It's "just" there.

j / k navigate · click thread line to collapse