undefined | Better HN

0 pointsSJC_Hacker11mo ago0 comments

This is my watershed for true AGI. It should be able to create a smarter version of itself.

Last I checked, feeding the output of an LLM back into its training data leads to a progressively worse LLM. (Note I'm not talking about distillation, which involves training a smaller model, by sacrificing accuracy. I'm referring to an equal or greater number of model parameters)

0 comments

1 comments · 1 top-level

fragmede11mo ago

If the LLM is given the code for its training and is able to improve that, does that count? Because it seems like a safe bet that we're already there, the only problem is latency of training runs.

j / k navigate · click thread line to collapse