undefined | Better HN

0 pointsdubcanada3y ago0 comments

I think you missed the memo of the comment.

They were referring to the fact that everything ChatGPT is built on is other peoples work. Beyond the actual building of the model details, there is nothing that ChatGPT owns. All the content they use to train, all of the art they use to train. Everything is stolen/used without permission. Obviously there is more to it than that, because you published it on the internet. But that's a different topic.

0 comments

26 comments · 6 top-level

jstummbillig3y ago· 8 in thread

This litany is already getting old and it's just 2 month in.

All intellectual property is inherently stolen. Just let it go.

cormacrelf3y ago

The reason intellectual property was invented was to encourage people to go and create new things and share them, the logic being that having a monopoly on your own work by default means you can make money from being creative and therefore people will choose to do it. The reverse is already happening, people are deciding (privately) not to publish things they have created because they rightly assume it will be stolen by an AI, monetised and used to destroy their own job. It is not merely complaining for its own sake. There is a good amount of theft and a bad amount of theft. As theft increases unchecked the amount of new output is poised to decline.

jstummbillig3y ago

All true.

I don't see any world where it matters in the slightest. When it comes to how we deal with currently available training data nothing will change, first because of politics but also because people want the LLMs superpower more than they want to protect IP of a few individuals. And I firmly believe that no human training data that has not been produced and publishes today will play any significant role in future AI development.

We are simply too slow.

chefandy3y ago

The topic's biggest cop-out. Intellectual property doesn't exist in a vacuum. I have limited-to-zero sympathy for corporate entities like Getty images that hoard IP, but our society's social contract says labor isn't free unless people donate it. We need to implement some sort of alternate compensation system before entirely disregarding IP so we don't pull the rug out from under perfectly honest independent creatives with kids and mortgages and medical bills plying their craft in an established system. Until then, taking the fruits of creative labor without permission is theft that is much more consequential and much less morally defensible than what you describe.

I'll bet if someone outside of our IP jurisdiction figured out a way to reliably and thoroughly reverse engineer the most complex commercial software from binaries so people could spit out a working, fully-customized copy of a commercial application from a prompt, and the entirety of the software development market would soon collapse, the tenor of this conversation would be very different.

Maybe the people with the very ethically defensible stance that private property is theft would be totally fine with OpenAI knocking down your home to build their new headquarters without compensating you? Imagine the progress! (hint: they probably wouldn't be ok with it)

None of this stuff exists in a vacuum. None of it.

1 more reply

devmor3y ago

That's all great until people stop providing intellectual property for free because of the chilling effects.

Artists are already starting to completely paywall their content.

How far do we let AI scraping and incorporation go? Just say "fuck it" until there's nothing left to scrape other than content also made by AI?

welshwelsh3y ago

"chilling effects" usually refers to when people decide not to share things because of potential legal consequences. For example, if people stop creating or distributing AI art because they don't want to be sued by artists for using their style, that's a chilling effect. Basically the opposite of what you are describing

>Just say 'fuck it' until there's nothing left to scrape other than content also made by AI?

Sounds good to me! There will always be people making free art, and AI will make this much easier.

The thing that I think people are missing is that AI-generated content CAN be used to improve AI models. There is no requirement that the input data is created without AI.

Furthermore, AI-generated content on the internet is not random; it is curated content. Generally speaking people don't post every image they generate with Stable Diffusion, they only post the best images. If you consider engagement metrics and user feedback (upvotes etc), they can be a valuable and useful part of a training set.

devmor3y ago

The fact that you think that sounds good and is not a bleak and dystopian hellscape tells me that your ideal future is likely my nightmare scenario.

I fear our views on this issue are wholly incompatible.

LordDragonfang3y ago

Two months? The (ai-luddite) preachers have been reciting this litany since the first decent diffusion models released over a year ago. They haven't slowed down any.

cmdialog3y ago

I wonder how large the Venn overlap is for people who think IP is good and people who don't think hip hop is "real music"?

px433y ago· 5 in thread

Everything that anyone has ever built is built on the works of others. This is how we progress as a species. The entire reason why the internet is so revolutionary is that it allows for permissionless innovation.

dubcanadaOP3y ago

I am in no way suggesting that it is wrong. I do however feel this level of "built upon the work of others" is different.

vkou3y ago

Then OpenAI should allow us to do some permissionless innovation on their work.

Strangly enough, it's only interested in promoting permissionless innovation when it stands to profit. It plunders the commons, and gives nothing unencumbered back.

whitepaint3y ago

Right, so what's the problem with gpt4free then?

MichaelZuo3y ago

They're consuming real electricity and real time on servers that don't belong to them nor do they have permission to use.

ChatGTP3y ago

Do you see the contradiction here ?

ChatGPT-4 is built on real peoples time.

1 more reply

Kiro3y ago· 3 in thread

Imagine considering your random posts on reddit "work" and thinking people are stealing it when they train their models on your internet drivel.

msla3y ago

So it's valueless when the original author wants it to have value and valuable when OpenAI wants it to have value?

I am all for training AIs, but at least exhibit some self-consistency in your arguments!

squeaky-clean3y ago

A penny is close to valueless. A trillion pennies is a lot of value.

salad-tycoon3y ago

All the misguided comments of my younger years coming to haunt me? Nightmare. Luckily I deleted my live journal many moons ago.

glitchc3y ago· 2 in thread

This is an incorrect and unfair statement that would not pass the test in any court of law. ChatGPT uniquely orders information in a way that gives them a competitive advantage in the marketplace. While the source information is public, the ordering of it is proprietary and a trade secret.

Your argument is a reductio ad absurdum to "everything is made of atoms and no one ones atoms, ergo no one owns anything."

mcguire3y ago

The source information in public? Copyright isn't a thing anymore?

That's news to me.

glitchc3y ago

If you think ChatGPT has infringed on your copyright, you have legal recourse. Do you have evidence?

smoldesu3y ago· 2 in thread

If we enforced intellectual property rights that harshly, nothing more complex than a 6502 would have ever been made.

dubcanadaOP3y ago

I personally don't think IP has a place in modern society. But I was mostly replying to the authors comment.

My concerns mostly lie with the fact it's owned largely by $MSFT rather than a more "open source" contributing to society entity. But again that's a much different topic.

devmor3y ago

I'd say IP is more important in modern society than at any time in history.

It shouldn't have a place, but so long as people require the ownership of their own concepts to gain food and shelter, it has to.

dcow3y ago

I'm sorry but you can't honestly use stolen without permission here. If you publish something and someone else acquires it legally (because you published it for free or because they paid for or otherwise obtained a license to it) then you don't get to control how the work is used after the fact. You only control the terms of them receiving a copy. You can't say "I didn't want my work used for AI training data when I published it so it's all stolen as far as I'm concerned". It just doesn't work that way.

Now that doesn't mean you can't license your work for exclusive use by humans and explicitly forbid AI training data in the license applied to your work, but you'd have to do that when you publish it, not retroactively.

j / k navigate · click thread line to collapse

0 comments

26 comments · 6 top-level

jstummbillig3y ago· 8 in thread

This litany is already getting old and it's just 2 month in.

All intellectual property is inherently stolen. Just let it go.

cormacrelf3y ago

jstummbillig3y ago

All true.

We are simply too slow.

chefandy3y ago

None of this stuff exists in a vacuum. None of it.

1 more reply

devmor3y ago

That's all great until people stop providing intellectual property for free because of the chilling effects.

Artists are already starting to completely paywall their content.

How far do we let AI scraping and incorporation go? Just say "fuck it" until there's nothing left to scrape other than content also made by AI?

welshwelsh3y ago

>Just say 'fuck it' until there's nothing left to scrape other than content also made by AI?

Sounds good to me! There will always be people making free art, and AI will make this much easier.

The thing that I think people are missing is that AI-generated content CAN be used to improve AI models. There is no requirement that the input data is created without AI.

devmor3y ago

The fact that you think that sounds good and is not a bleak and dystopian hellscape tells me that your ideal future is likely my nightmare scenario.

I fear our views on this issue are wholly incompatible.

LordDragonfang3y ago

Two months? The (ai-luddite) preachers have been reciting this litany since the first decent diffusion models released over a year ago. They haven't slowed down any.

cmdialog3y ago

I wonder how large the Venn overlap is for people who think IP is good and people who don't think hip hop is "real music"?

px433y ago· 5 in thread

dubcanadaOP3y ago

I am in no way suggesting that it is wrong. I do however feel this level of "built upon the work of others" is different.

vkou3y ago

Then OpenAI should allow us to do some permissionless innovation on their work.

Strangly enough, it's only interested in promoting permissionless innovation when it stands to profit. It plunders the commons, and gives nothing unencumbered back.

whitepaint3y ago

Right, so what's the problem with gpt4free then?

MichaelZuo3y ago

They're consuming real electricity and real time on servers that don't belong to them nor do they have permission to use.

ChatGTP3y ago

Do you see the contradiction here ?

ChatGPT-4 is built on real peoples time.

1 more reply

Kiro3y ago· 3 in thread

Imagine considering your random posts on reddit "work" and thinking people are stealing it when they train their models on your internet drivel.

msla3y ago

So it's valueless when the original author wants it to have value and valuable when OpenAI wants it to have value?

I am all for training AIs, but at least exhibit some self-consistency in your arguments!

squeaky-clean3y ago

A penny is close to valueless. A trillion pennies is a lot of value.

salad-tycoon3y ago

All the misguided comments of my younger years coming to haunt me? Nightmare. Luckily I deleted my live journal many moons ago.

glitchc3y ago· 2 in thread

Your argument is a reductio ad absurdum to "everything is made of atoms and no one ones atoms, ergo no one owns anything."

mcguire3y ago

The source information in public? Copyright isn't a thing anymore?

That's news to me.

glitchc3y ago

If you think ChatGPT has infringed on your copyright, you have legal recourse. Do you have evidence?

smoldesu3y ago· 2 in thread

If we enforced intellectual property rights that harshly, nothing more complex than a 6502 would have ever been made.

dubcanadaOP3y ago

I personally don't think IP has a place in modern society. But I was mostly replying to the authors comment.

My concerns mostly lie with the fact it's owned largely by $MSFT rather than a more "open source" contributing to society entity. But again that's a much different topic.

devmor3y ago

I'd say IP is more important in modern society than at any time in history.

It shouldn't have a place, but so long as people require the ownership of their own concepts to gain food and shelter, it has to.

dcow3y ago

j / k navigate · click thread line to collapse