Ask HN: How does my usage of LLMs train the underlying models?

1 point · emehex · 0y ago · 1 comment

I understand that using tools like Chat, Cursor, and Claude Code for software development is likely providing training data to help these LLMs get better at coding (the irony isn't lost on me that I might be contributing to making myself obsolete...)

But I'm curious about the actual mechanics: How exactly does this feedback loop work? When I accept, reject, or modify the code that these models spit out, is that signal fed directly back into training?

Not necessarily against this, just genuinely curious about how the sausage is made.

incomingpain · 0y ago

> How exactly does this feedback loop work? When I accept, reject, or modify the code that these models spit out, is that signal fed directly back into training?

My understanding is that essentially nobody uses our inputs as training data anymore. It made the models worse when they were doing this.

Garbage data in, garbage out.

They want to control and use only quality data as their training data.
