
0 points · whazor · 1y ago · 0 comments

But an LLM can certainly make up a lot of information that never existed before.


1 comment · 1 top-level

I strongly believe this runs into an information-theoretic constraint, akin to the reason perpetual motion machines don't work.

In theory, yes, you could generate an unlimited amount of data for the models, but how much of it is unique or valuable information? If you compressed all of that generated training data with a really good algorithm, how much actual information would remain?
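To make the compression point concrete, here is a rough sketch in Python. Everything in it is an assumption for illustration: the "generated" corpus is a toy built by recombining a handful of hypothetical templates, not real LLM output, and `zlib` is only a crude stand-in for "a really good algorithm" (a compressor gives an upper bound on information content, not the true value). It compares how far the toy corpus shrinks against a high-entropy baseline of the same length.

```python
import random
import zlib


def compressed_ratio(data: bytes) -> float:
    """Compressed size divided by original size (lower means more redundancy)."""
    return len(zlib.compress(data, 9)) / len(data)


rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical "generated" corpus: 5000 sentences recombined from 3 x 3 x 3 parts,
# so there are only 27 distinct sentences no matter how much text is produced.
subjects = ["The model", "The system", "The network"]
verbs = ["learns", "predicts", "generates"]
objects = ["new data.", "more text.", "the output."]
generated = " ".join(
    f"{rng.choice(subjects)} {rng.choice(verbs)} {rng.choice(objects)}"
    for _ in range(5000)
).encode()

# Baseline: pseudo-random bytes of the same length, which are close to incompressible.
baseline = bytes(rng.getrandbits(8) for _ in range(len(generated)))

generated_ratio = compressed_ratio(generated)
baseline_ratio = compressed_ratio(baseline)

print(f"generated corpus: {len(generated)} bytes, ratio {generated_ratio:.3f}")
print(f"random baseline:  {len(baseline)} bytes, ratio {baseline_ratio:.3f}")
```

The toy corpus can be made arbitrarily long, but its compressed size grows far more slowly than its raw size, because each new sentence is just one more choice among 27. Whether real model output behaves this way is exactly the open question in this thread; the sketch only shows what "volume without new information" looks like to a compressor.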

