undefined | Better HN

0 pointslayer82y ago0 comments

The ability to provide a reference to the source is the crucial difference here.

I agree that it should be possible to implement that for generative AI, although the training may become significantly more expensive in order to maintain that information, and the AI companies have little interest in doing so. They’ll probably rather try to heuristically assess possible copyright issues after the fact in a post-processing step.

The more interesting question is if copyright holders can claim unauthorized use of their works beyond the case of near-verbatim reproduction, because the works collectively inform the AI in a more general manner.

0 comments

1 comments · 1 top-level

kenmacd2y ago

> They’ll

What if I asked you to list all our source material that led you to use that particular contraction. Heuristics will not do, you must list each.

Can you do it? Do you believe AI should.

> I agree that it should be possible to implement

Those exact words appear in another forum post from 2006:

https://discourse.igniterealtime.org/t/cm-3beta-compression-...

Should you have quoted that as a source for your reply? What if we knew you'd read that post back in 2006, affecting your neurons, then should you?

It might not be too hard to imagine a simple case of a specific topic where you might have some more prominent sources, but even in those cases I believe if you think it through you'll find there was a ton of other sources that led to the weights that allowed you to 'know' the topic.

1 more reply

j / k navigate · click thread line to collapse