undefined | Better HN

0 pointsjbenjoseph3y ago0 comments

Even so, I don't think there is any evidence that LLM performance degrades when it is trained on its own output, and there is no intuitive reason it should.

0 comments

2 comments · 1 top-level

throwanem3y ago· 1 in thread

Why not? Training a model destroys information.

jbenjosephOP3y ago

I have seen no evidence for that, only the opposite: https://arxiv.org/abs/2210.11610

Intuitively, training a simple enough linear statistical model with its own output should be a NOP. But LLMs are anything but simple models, so I think the non-linearities may be synthesizing new useful information. Similarly to how all of maths can be synthesized from a few basic axioms with enough intelligence or computation.

j / k navigate · click thread line to collapse