I don't see any developped country pressing the brake on AGI in the near future to protect a few copyright holders from getting "stolen" in hypothetic scenarios.
Unless Nintendo plans on busting down the doors of every person who tries to draw Mario or preventing little Timmy from making a parody of Coca-Cola, making it where AI cannot generated copyrighted works is insane imo.
Those brands should be proud to be such a big part of the cultural fabric that it is difficult to get away from their branding. Plus it's not infringement to my knowledge until you use it for commercial purposes so as long as no one as creating Lario and Muigi to sell or otherwise use in business, it's no different than drawing it yourself.
If the AI is completely unable to generate non-infringing works even if you are _trying_ to get away from it (which the author very much doesn't seem they are, they are purposefully making and show prompts that infringe), that's the problem of the AI creator then.
I also point to a YouTube video by Kirby Ferguson "Everything is a Remix" [1] which talks about how so much of our collective culture stems from copy. It's a great video if you have an hour.
When Little Timmy crayons a copy of Mario, we congratulate him for his creativity. Is it unique, one of a kind art? Well Timmy made it, but he didn't think up the original idea of a video game plumber. I give this view to GenAI right now - it's not capable of achieving that "next step" in "original design", but its performing like a novice artist/musician, it's mimicking what it sees.
If you’re making an ethical argument “it’s okay because it’s already happening to a lesser degree somewhere else” isn’t the flex you think it is.
If you’re talking ethics, talk about impact. Who does it help the most who does it hurt the most? Is your argument favoring equality of access or outcome? Who is the most vulnerable in the situation and how will it impact them?
> I cover generative AI
Wait, are you teaching the class or taking it?
I teach it, my background is located in my profile and my research focuses on CS education.
Scale and impact do matter, I wholeheartedly agree. However, I stand by my point that genAI is mirroring how humans learn - repetition of previously observed actions. As part of my dissertation, I argued that humans operate using 'templates', or previously established frameworks / systems. Even in higher cognitive tasks like problem solving, we rely on workflows that we were trained on previously. Soloway referred to problem solving as a mental set of "basic recurring plans" [1] and if you look at the old 1980s Usborne children's books, they required kids to retype code [2]. For creative tasks, depending on the actor's background, Method and Meisner both tell people to draw from previous experiences and observations to develop a character. This behavior is similar in many areas like music, dance, martial arts, cooking, language acquisition, etc.
I am not making an ethical argument that GenAI violating copyright is okay because that's what humans do. I'm arguing that GenAI mirrors how humans learn. We observe a behavior and attempt to recreate that behavior. The difference is that humans can extract a fraction of the behavior and utilize it as part of something larger while GenAI cannot to the degree humans do. I'm sure GenAI would struggle to recreate "Who Framed Roger Rabbit?" because of the two polar different visual elements of the film (cartoon and real life).
In regards to your "If you’re talking ethics, talk about impact" section, its a bit of a loaded question. One side of the conversation could state that GenAI is helping many people that do not have confidence in their creative ability to produce their ideas, while the other could state its making it harder for artists.
Yes, it absolutely is hurting artists and I fully support the recent writer's strike over AI concerns. But I do not believe that diminishes how the mathematical models used in GenAI mirror our own skill acquistion.
It actually is.
It shows that we as a society are completely OK with this, and nobody is complaining about a very standard and common thing that all artists do.
It shows that the outrage is fake, and people don't actually care about the issue.
It's not a problem for me to draw Micky Mouse. It _is_ a problem when someone pays me to draw an animated mouse and I sell them a picture of Micky Mouse.
For me, its not really about the AI at all, it's a problem of undervaluing Artists contribution to these tools. And it's not even fully about copyright it's about not asking for permission to use their content and then creating an entire business on top of that stolen content.
All parties are responsible on some level. This just reads like passing the buck/burying ones head in the sand to me.
I'm perfectly free to ask people on the street for t-shirt with Mario on it, but as soon as someone who isn't Nintendo or licensed by Nintendo sells me that t-shirt they're the ones infringing on the copyright and trademark. As the consumer I did nothing illegal, and a court would say that I was deceived by the infringing party.
Distribution (seeding, uploading) and facilitating copyright infringement is what gets you in trouble. When you ask DALL-E (a paid, commercial product) for a picture of Italian plumbers and it gives you an obvious picture of Mario 100% recognizable to the layperson as Mario and not a distinctly different image of a similar character, that's blatant trademark and/or copyright infringement on the part of OpenAI.
> If the AI is completely unable to generate non-infringing works even if you are _trying_ to get away from it (which the author very much doesn't seem they are, they are purposefully making and show prompts that infringe), that's the problem of the AI creator then.
I see some parallels to the Napster lawsuit. The fact that the users were the bad people asking for infringing content didn't give Napster the right to facilitate infringement. Napster was ordered to monitor its network and make sure that they were blocking non-legitimate uses. They couldn't logistically comply and went bankrupt.
https://en.wikipedia.org/wiki/Napster
Which begs the question: Does OpenAI even have the technological ability to block trademark and copyright infringing content generation? Even if they do, how useful will ChatGPT be if all phrases and imagery that closely resemble copyrighted works are blocked from output?
Whats even worse for OpenAI compared to Napster is that it wasn’t individual users uploading copyrighted content, it was OpenAI’s ingesting the data. Nobody twisted their arm to include copyrighted works in their models.
Further extending the argument - I can potentially ask GenAI "Can you show me what does Mario looks like?" since I have never seen one and GenAI is my go to tool.
By which you mean every copyright holder.
> AGI in the near future
Something that is purely speculative, undefined, and has been promised in the near future for 50+ years.
I don't see copyright holders lying down for someone else's benefit and I don't see governments gutting copyright, contract law, and several other avenues of protection that copyright holders can deploy in the name of something that doesn't exist and may not ever exist.
https://www.natlawreview.com/article/japanese-government-ide...
https://www.cliffordchance.com/insights/resources/blogs/talk...
Why should other intelligent entities be prevented from reading copyrighted works and gaining whatever there is to gain from those works the way any human might?
Edit: typo
"Undefined", although not literally, in practice definitely: each letter of that initialism means a different thing to different people. To that extent, I'll even grant "speculative" despite many of those meanings being demonstrably met by us humans.
But as someone who (unfortunately) has just turned 40: who was it that was promising AGI "in the near future" for more than my entire lifetime? Including the second AI winter? Because even the biggest timeline-optimists I can remember (Kurzweil and Yudkowsky), who very few cared to listen to, put things more than 20 years ahead of when they were writing. (And yes, Yudkowsky was definitely wrong about a singularity in 2021, though as you say AGI is undefined I think if someone in 1996 had seen ChatGPT they'd have said "yes, this is AGI" despite its flaws).
Now the crowdsourced guess for AGI is 7 about years: https://www.metaculus.com/questions/5121/date-of-artificial-...
> I don't see copyright holders lying down for one else's benefit and I don't see governments gutting copyright, contract law, and several other avenues of protection that copyright holders can deploy in the name of something that doesn't exist and may not exist.
I tend to agree. Although I don't accept that contract law has much of anything to do with this discussion, to the extent that it does have implications, it isn't going anywhere.
But at the same time, Google exists by reading the entire public internet, indexing it, and presenting clips of it to its users. This has in fact resulted in copyright disputes, and I was surprised how long it took for that to happen. Likewise, while copyright holders must fight for their survival, mere LLMs even as they exist right now are economically relevant, so this isn't going to be a one-sided fight by just copyright holders.
Tech companies have more money to throw at politicians.
This is really about replacement. The copyright holders in the content industry aren't really afraid of LLMs infringing on past copyright, but are terrified of it replacing them on future work, and there is absolutely no legal protection from this. The lawsuit might officially be about copyright, but that's just because it is their only available legal angle of attack.
How do you square this with literally the first image in the OP showing side by side GPT reproing copyrighted work? imo a good modern art project would be someone making a website that “archives” NYT articles by laundering them through GPT rather than using the archive link that everyone posts to get around the paywall. Even HN guidelines bend over backwards to allow bypassing the paywall by allowing these links.
It’s already happening with EU AI Act https://www.europarl.europa.eu/news/en/headlines/society/202...
It generally didn’t care about generative AI.
It’s politically 100% viable to kneecap AI with copyright restrictions. This will go to the Supreme Court and it’s far from clear whether fair use applies to every case here.
There's a way to sell this to the public, but AI proponents don't want to have to sell it, because they feel that they shouldn't have to, and there's an underlying theme of "The benefit of AI is so overwhelming, and eventually it will replace most commodified creative work anyway so why bother litigating this now, let's just skip this messy step and get to that part" and that's super not going to work to convince skeptics.
LLM people are really starting to veer into crypto-bro territory with the evangelising about how they’re the best thing since sliced bread and transistors.
That's your take on LLMs?
Ask it how it is possible for a photon to travel across the universe, arriving at the same time it departed, resulting in the journey taking zero time (in its reference frame).
Ask what implications are if certain viral amino sequences result in messenger RNA translocating to the host cell nucleus, potentially with the entire genome.
Ask if aircraft fly due to Bernoulli's Principle or Newton's Third Law and physical impact.
This is "crypto-bro territory"? No, not quite.
How would that be any different than Google displaying some type of news headline?
[1] https://www.reedsmith.com/en/perspectives/ai-in-entertainmen...
I have nothing against creators, they deserve to get paid.
For what its worth, LLMs are facing the coke vs Pepsi challenge, and sadly they are most definitely Pepsi.
https://aws.amazon.com/service-terms/
See items as of 50.10 and 50.10.1 that I reproduce here:
"50.10. Defense of Claims and Indemnity for Indemnified Generative AI Services. AWS Services may incorporate generative AI features and provide Generative AI Output to you. “Generative AI Output” means output generated by a generative artificial intelligence model in response to inputs or other data provided by you. “Indemnified Generative AI Services” means, collectively, generally available features of Amazon CodeWhisperer Professional, Amazon Titan Text Express, Amazon Titan Text Lite, Amazon Titan Text Embeddings, Amazon Titan Multimodal Embeddings, AWS HealthScribe, Amazon Personalize, Amazon Connect Contact Lens, and Amazon Lex. The following terms apply to the Indemnified Generative AI Services:
50.10.1. Subject to the limitations in this Section 50.10, AWS will defend you and your employees, officers, and directors against any third-party claim alleging that the Generative AI Output generated by an Indemnified Generative AI Service infringes or misappropriates that third party’s intellectual property rights, and will pay the amount of any adverse final judgment or settlement."
IMO, what's most likely is some sort of licensing model between the AI companies and the 'big content providers' (remember most content on the web these days is not owned by the person who created it, wasn't always like that). The smaller companies then would be forced to live with either being scraped or ending up being 'invisible'.
If that's so, things are about to get worse for everyone, too. With little to no protection against AI, no one will be incentivized to create new IPs, whether they're books, drawings, songs. Or even films and games, when AI is able to also generate those in the (possibly near) future.
Sorry if your 40 hour work won't pay you $10 bucks a month forever. That's the case for most of the rest of us: we produce for 40 hours, we get paid for those 40 hours, regardless of what we do.
Welcome to the club!!