undefined | Better HN

0 pointsEamonnMR3y ago0 comments

> Where is the form to remove my reddit comments from chat gpt training data? Or my blog posts from gpt training data?

More pointedly, how do I keep my GPL'd code from spewing, license free, out of CodePilot?

0 comments

3 comments · 1 top-level

tadfisher3y ago· 2 in thread

I think that's the point of this blog post: it doesn't matter if the inputs are copyrighted, it matters if the output is infringing. It appears to be almost impossible to directly recreate a source image with SD, but it seems Copilot tends to produce a single input as its output, verbatim. Copilot isn't doing "synthesis" as does SD, it's acting more like a search engine.

simiones3y ago

Look at these images:

> https://huggingface.co/spaces/stabilityai/stable-diffusion/d...

They were prompted with the text "Mona Lisa Smile". Would you not say that they are an extremely close reproduction of the Mona Lisa, with barely any kind of synthesis?

SideQuark3y ago

Look at the actual Mona Lisa. None of those other images are close to being a reproduction.

I can hand paint a Mona Lisa like image that are this removed and be fine.

1 more reply

j / k navigate · click thread line to collapse