DALL-E 2 generates images of Kermit The Frog in various films (opens in new tab)

(twitter.com)

390 pointsjayalammar4y ago192 comments

192 comments

122 comments · 35 top-level

PheonixPharts4y ago· 15 in thread

These are honestly not very impressive (no sarcasm here) and further convince me that the next AI Winter will come with this coming recession.

Don't get me wrong, they are still impressive in the quality of the visual they produce, but just like Markov Chain demos of old, they're neat but way miss the mark.

None of these capture the "feel" of Kermit the Frog. Most of them look like weird designs for the Ninja Turtles movie in the 90s.

There are several distinctive features of Kermit that a missing from nearly all of these.

- For any of the "live action" ones, Kermit should still always be a puppet. - Kermit notoriously has lanky arms, - Kermit never has eye lids - His eyes sit way on top of his head. - He often has his weird neck decoration. - His eyes have a very distinctive pupil shape.

None of these get Kermit correct, they all just look like frogs (maybe Dalle2 isn't trained on copyrighted/trademarked material?)

There are fan made versions of some of these which show just how different Dalle2 is from human imagination:

Kermit actually has been on family guy: https://static.wikia.nocookie.net/muppet/images/7/71/Famguys...

There are several "Kermit in Star Wars Examples" here are two: https://i.kym-cdn.com/entries/icons/original/000/021/668/ker..., https://i.ytimg.com/vi/6MebZx-4950/maxresdefault.jpg

Again if this was done on someone's laptop it would be really impressive. However the fact that so much talent and resources were poured into pushing AI to it's limits and this is what we get tells me we've hit another brick wall as far as research goes.

Guest190238924y ago

I completely disagree. I think these are taking it a step further than your examples. Dalle2 is not just using the existing Kermit and pasting it in different environments, it's modifying Kermit to fit in that world.

For example, your Star Wars example...

https://i.ytimg.com/vi/6MebZx-4950/maxresdefault.jpg

It's clearly just an existing photo of Kermit pasted over an image from the film. There are even two sets of arms. I could Photoshop that in a few minutes.

Then, the Dalle2 image...

https://pbs.twimg.com/media/FUEDDm2UEAAO8yb?format=jpg&name=...

I think it's impressive. It looks like Kermit is a character in the Star Wars universe. There are a few issues with the eyes and feet, and it's also hard to tell if it's a creature or a person in a frog suit. However, it gets 90% of the way there, and the pose is great for a frog/human hybrid.

The most exciting thing is how this could be used as a starting point for design. I could take the Dalle2 Kermit image above, fix the eyes/feet, add a few distinctive Kermit features, and have a great piece of concept art in an hour, rather than taking a day or two to create something from scratch. Obviously it can't be applied to all workflows, but for those it's suited for, it'll save vast amounts of time and costs. For that reason, it's already something of real value in its current state. The same can't be said about the Star Wars examples you provided.

tommoor4y ago

I genuinely can't tell if you're trolling. This isn't impressive because the AI model doesn't accurately capture the "feel" of Kermit!?

witheld4y ago

The computer was asked to produce photos of Kermit the frog. It failed spectacularly at rendering anything resembling Kermit the frog.

2 more replies

bergenty4y ago

When it completely does capture the feel of Kermit.

fullshark4y ago

I disagree that this is unimpressive, but do largely agree about AI winter. Dall-E 2 is probably the most impressive AI implementation I can recall in the past 5-10 years and it's still highly specialized problem being solved, and it's unclear what market it really can go after other than freelancing digital artists online who work for tiny commissions. I guess it's also gonna be great for NFTs but I consider that market illusory and will disappear within a few years.

natly4y ago

I am definitely going to bet against the "AI winter is returning" idea by investing huge amounts of my time into understanding these algorithms. History doesn't have to repeat, sometimes that's the foolish prediction (the apple newton was made fun of by the simpsons but when the ipad came out the timing was perfect). I don't like overinflating OpenAIs already enormous ego but these are incredible images.

1 more reply

simiones4y ago

While I think many are over-interpreting the quality of these results, yours is sounding like a clear case of a No True Kermit fallacy.

There are many ways to define what "Kermit the Frog in $MOVIE" means, and the choice the AI made is absolutely valid. There are of course various other valid choices, but this doesn't invalidate the ones presented.

Furthermore, judging by some other examples in this HN thread, it seems that the fact most of the pictures are not puppets is more of a choice of the human choosing the photos, as in other cases DALL-E was indeed adding puppet-like characters in movie-like decors.

throw67464y ago

I thought I was taking crazy pills, none of them look like kermit bur rather they look like a generic frog. They don't even have the same pattern around his collar.

vintermann4y ago

It is odd, isn't it? It captures "essential" characteristics of all those films in a honestly brilliant way - but it doesn't capture any of the iconic characteristics of Kermit himself!

deusum4y ago

Your take on Kermit is too literal. Allow some artistic license. And you neglect all of the other thematic elements from the prompt.

Gnarled4y ago

> These are honestly not very impressive (no sarcasm here) and further convince me that the next AI Winter will come with this coming recession.

"Sure, this AI can produce high-resolution realistic images leaps and bounds above anything that's been shown before... but there's an aspect which could use improvement. Obviously, this proves that the current AI technology will never amount to anything and we should just give up on it now."

gk14y ago

> Again if this was done on someone's laptop it would be really impressive. However the fact that so much talent and resources were poured into pushing AI to it's limits and this is what we get tells me we've hit another brick wall as far as research goes.

You might be missing the point of what OpenAI is doing. The point is to show off the capability of their models in a way that's likely to go viral and lead to more business for OpenAI. Some people laughed at GPT-3's silly demos, but when they launched GitHub Copilot...

throw67464y ago

... And it's a decent tool?

If people say Dalle can improve the workflow of digital artists, sure, but Copilot hasn't revolutionized programming either, you still have to be a good programmer to finish whatever you are doing:

> A paper accepted for publication in the IEEE Symposium on Security and Privacy in 2022 assessed the security of code generated by Copilot [...] The study found that across these axes in multiple languages, 39.33% of top suggestions and 40.73% of total suggestions lead to code vulnerabilities. Additionally, they found that small, non-semantic (i.e., comments) changes made to code could impact code safety.[14]

PheonixPharts4y ago

> but when they launched GitHub Copilot...

What happened next? Is anyone using copilot for serious work? Has it changed programming in a fundamental way?

I personally have zero use for copilot since the for type of code I write the actual code writing is not a bottle neck, so automating that process is of no value to me. On top of that getting the details exactly right is essential so the ratio of boiler plate to real code is very, very low for me.

carapace4y ago

I agree with what you say in re: Kermit. Most of these images look to me like a frog that looks like Kermit the Frog but isn't. Metaphorically (and literally) Jim Henson isn't in these images.

However, I don't think you're correct in your assessment of the import of this sort of thing: it's an imagination machine. This isn't a brick wall, it's a foundation on which to build.

tinalumfoil4y ago· 10 in thread

The amount of creativity here is astounding. Just imagine all the decisions the AI made in incorporating Kermit into the movies: the clothing it's chosen, how the character wears the clothing, the facial expressions, how to make Kermit himself look similar to the other movies characters. Should he be lanky? pudgy? Even simple decisions like obliviously Kermit in Wall-E is going to be a robot, it has to figure out what he looks like as a robot, what his mouth looks like, that his eyes should be enlarged.

It gets a lot of things wrong, like I'm not sure why kermit has a plastic texture in many of the pictures. If you showed me ten pictures of Kermit and ten frames of total recall, and for some reason 8 of your pictures had a plastic Kermit, and asked me to combine them in my head, I'd probably imagine something on-par or worse than what Dalle has managed to do. But I wouldn't be able to show anyone what I'd made!

tdehnel4y ago

Sorry to be this guy but that is not creativity. It’s using what already exists, not conjecturing something new.

Contrast with real creativity (what people can do but machines currently cannot) where you conjecture something completely new.

For example, Copernicus conjecturing the idea that the Earth revolves around the Sun. No machine learning model would have gotten there because it would have been trained on a bunch of data that said the Earth was the center of the universe.

dougmwne4y ago

These are fun discussions because words like "artistic creativity" have a colloquial meaning that could only apply to humans since the dawn of humanity. Now you have an image of Kermit in Wall-E. I have never seen or conceived of an image of Kermit in Wall-E. Let's assume that adorable robot Kermits do not exist in the training data to be spit out like a search algorithm.

The image is new, it did not previously exist. It is a creation, a very vague idea of a few words that was created in full realization.

So it sees like the only difference between the "Not creativity" that Dall-E is doing and "Real Creativity" that humans do is tht humans are the ones doing it?

I agree there's this concept of expanding the frontiers of human aesthetic capability that has slow-marched from cave paintings till post-modernism. That there are a very few artists that invent completely new styles that the rest of us copy and remix. It's questionable that Dall-E can do that, but I'm also not sure that it can't do that.

1 more reply

pphysch4y ago

Humans don't generate new ideas from nothing. Your "real creativity" doesn't exist. Everything is dependent on what came before, and therefore derivative to some extent.

Copernicus got his idea after gathering a lot of data, explicitly and implicitly, training his internal model of the world.

1 more reply

andrewstuart24y ago

The scientific and creative methods are both entirely centered around evaluating data or existing creative elements and combining it in novel ways. Science adds the goal of making and testing hypotheses but nothing about Copernicus's conjecture would have been possible without observing existing information. I'm including e.g. "new" astronomical observations in the realm of existing information because even a human is simply observing and measuring it.

I fail to see what the difference is between 99% of existing "creativity," which is essentially arranging existing ideas into novel combinations, and what DALLE2 does.

klik994y ago

"Good artists borrow, great artists steal"

Creativity is a very vague word, I'm sure we can come up with definitions of it that let humans keep sole domain over it. But breakthroughs often come from combining domains and concepts, very very rarely do we ever jump out of one local maxima into another, and I'm not even convinced that Copernicus counts as that. There's a reason why there are so many examples of the same breakthrough happening in multiple places in the world independently - innovation is a slow gradual collaborative process and not plateaus waiting for men of genius to have a spark of inspiration.

Also I'm not convinced that a computer couldn't have discovered the earth revolves around the sun - it's hard to make machine learning jump out of local maxima, but it does happen, and I can see some hidden layers becoming far more efficient at predicting outcomes by stumbling across a model that centered the sun. That being said - there likely are examples of things that computers couldn't have theoretically figured out the model for, but I'm hard pressed to think of one.

1 more reply

vouwfietsman4y ago

This is a common fallacy.

Call it moving goalposts, no true Scottsman, the AI effect, whatever. The behavior is as follows: an argument over whether an ill-defined attribute is possessed by a computer is defended or attacked with useless semantics since nobody can agree on what any of the words mean anyway.

Creativity, intelligence, consciousness.

It doesn't matter what you say, you cannot define these concepts with the same clarity you use to defend that the concept is missing.

Saying there is no creativity because its just a neural net extrapolating from data is like saying there's no god because its all just atoms: what is god and why would the existence of atoms have anything to do with it.

Learn from Wittgenstein: Worüber man nicht sprechen kann, darüber muss man schweigen.

1 more reply

gwern4y ago

> No machine learning model

https://www.nature.com/articles/d41586-019-03332-7

SergeAx4y ago

Artist is using ideas already existed and creates something new out of them. It has always been this way since cave wall art.

I am 100% sure Copernicus was not the first to suggest a heliocentric system, but he was the one who put enough energy into proving it and defend that theory.

adamsmith1434y ago

>Sorry to be this guy but that is not creativity. It’s using what already exists, not conjecturing something new.

Such a cute point of view, completely wrong but cute. Please go find the original images of Kermit in Blade Runner and WallE that were just copied here.

>For example, Copernicus conjecturing the idea that the Earth revolves around the Sun. No machine learning model would have gotten there because it would have been trained on a bunch of data that said the Earth was the center of the universe.

If the model were trained on actual observations of planetary trajectories it would trivially recreate keplers laws, newtons laws etc.

2 more replies

gwern4y ago

> like I'm not sure why kermit has a plastic texture in many of the pictures

That might be the known bug with low-resolution textures: the DALL-E 2 paper notes that the details in very complex scenes can be bad, and thinks it's because you start with a 64px image which is necessarily bad for details (64px is really small!) and upscale with dumber models from there https://cdn.openai.com/papers/dall-e-2.pdf#page=17 I think this explains the issues with images where the 'skin' or 'fur' looks really creepy (eg. all the semi-nude bears).

sp3324y ago· 8 in thread

Does that mean the developers put The Big Lebowski through the model during training? Where did they get all the movies and TV shows from? And does copying so directly from the source material open them and users up to copyright infringement liability?

alkonaut4y ago

I was assuming it was just movie posters and screenshots from online articles/reviews/imdb etc, and not any analysis of the video itself (I might be wrong though - using video would make the number of available input frames grow by orders of magnitude, not that there is a lack of pictures online).

E.g. The Shining picture of Jack Nicholson with the door isn't representative of the "look" of the film, but very much an iconic still frame and basically what you see in a Google image search for "The Shining".

natly4y ago

They probably did, but even if they didn't it may have come from the terrabytes of data they scraped from the internet. OpenAI doesn't care. They claim that it's derivative enough to go under fair use. And whether it is or isn't, I guess their calculation is that the risk is worth taking to be the first to develop these algorithms, which is a huge head start if the courts decide that it does count as fair use.

gwern4y ago

Hilarious parodies like these are copyright infringement, yes, but also open-and-shut fair use defense. (You're confusing the issue of transformativeness and fair use defense.)

BeefWellington4y ago

I'm imagining the defense will be: "Your honor, it can't be infringement because I ran it through an ML model first."

Especially if the movie(s) that are eventually generated this way are ripping whole scenes or sequences out of other films, a la copilot.

bogwog4y ago

I feel like if they don’t have licenses for all of their source material, then the model should be required to be released into the public domain.

It’s like extremely expensive piracy that is bad for artists and bad for the environment.

I wonder if the reason OpenAI, Google, etc don’t release these things isn’t so much that they’re worried about racist/offensive output, but instead they’re worried about people using it to create images of, say, Mickey Mouse and drawing the attention of his lawyers? It’d be better for AI companies to keep all of this stuff in a legal gray area for as long as possible.

1 more reply

xeromal4y ago

I would hope that the spirit of the law would be considered in this. This is a clear application of fair use. Are the owners of the IP losing money on letting someone generate new characters in their movie/show?

simiones4y ago

> Are the owners of the IP losing money on letting someone generate new characters in their movie/show?

Typically, creators are very protective of this sort of thing, unless it stays in the area of fan art. If anyone tries to seriously monetize this kind of output, I'm sure we'll see a lot of cases.

Imagine what Disney would do if you used DALL-E to create an animated feature film in the style of Mickey Mouse, but with cats instead of mice, and they found out you used actual footage from, say, Fantasia to train an AI model. No idea if they would win, but I'm certain they'd sue.

jhart994y ago

Wouldn't the images be a transformative work? But then again there were those recent music cases where the infringement was only a small number of notes in sequence. This will almost inevitably end up in court because I'm not sure there is a comparable case.

mensetmanusman4y ago· 8 in thread

This is amazing.

We are at the precipice of someone releasing a $100M blockbuster movie just based on the language in the script with zero cost beyond compute.

What will this mean for the future of entertainment…

aqzman4y ago

I suspect at some point in the next 5-15 years we will begin to see AI generated entertainment perfectly tailored to a person's preferences.

Workaccount24y ago

I can already envision everyone talking over each other trying to say how their personal AI generated TV show is the best show they have ever seen.

Just imagine how much lonelier the world is going to feel when people don't even have entertainment in common anymore.

3 more replies

phkahler4y ago

>> I suspect at some point in the next 5-15 years we will begin to see AI generated entertainment perfectly tailored to a person's preferences.

Kermit in Debby Does Dallas. Kermit in the Graduate. Kermit with 2 broken flippers. Oh the depravity. I'm not sure getting high quality visualizations of any random passing thought is a good idea ;-)

marcodiego4y ago

When I think about such a possibility, https://c.tenor.com/emURWFXTpvIAAAAC/pearl-jam-do-the-evolut... comes to my mind.

AnIdiotOnTheNet4y ago

On the one hand, the ability for Star Trek's Holodeck to create large amounts of content from a few terse natural language instructions is look less and less implausible.

On the other hand, I feel like this will ultimately be kinda like traditional procgen algorithms: once you've seen enough of what it produces it all starts feeling very bland and same-y. Sure, the AI may be able to produce a feature-length movie based on the input "What if Nicolas cage had played The Terminator and Aaron Sorkin wrote the script?", but somehow none of it would be surprising or interesting to you, it would lack the novelty and playfulness of a good human creative work, and it likely would be very shallow in its themes.

On the gripping hand though, perhaps in achieving that level of sophistication we inadvertently create something more alive and aware than we intended and instead of merely trying to produce satisfactory results it actually attempts to express itself in ways that resonate with us.

andreilys4y ago

it would lack the novelty and playfulness of a good human creative work, and it likely would be very shallow in its themes.

Simply tune the parameters associated with novelty and playfulness and you’ll get the desired result.

There’s nothing inherent in human creativity that can’t be replicated by an AI. Most creative work is derivative and remix’s prior art.

This is a good short video on the phenomenon of remixing https://m.youtube.com/watch?v=MZ2GuvUWaP8

2 more replies

bambax4y ago

> The Terminator and Aaron Sorkin wrote the script?

You can't handle the future! We live in a world that has time machines, and those time machines have to be manned by robots! Who's gonna do it? You?

xg154y ago

Waiting for the next iteration of Copilot to use this technology.

"Like Facebook but like make it not suck"

AI: "Here you go!"

alex_duf4y ago· 7 in thread

These are mind bogglingly good

coldcode4y ago

If you ran it 1000 times and picked the best, you might get all good ones. I would want to see all 1000. It's like stock picker ads (person who called the market says XXX) where you only show the lucky ones.

silicon24014y ago

Are you aware that this is how human artists work, too? They make countless works that aren't comparable to their top works. Even more so if you count the practice when they were just starting out. I think some people just refuse to believe that real art can be generated without humans and selectively look for things that confirm their pre-determined conclusion. Witnessing this always feels like witnessing someone before the wheel was invented, when things were rolled and logs, assuring everyone that technology will never beat manual labor because look at how cumbersome it is to work with logs.

1 more reply

mensetmanusman4y ago

It doesn’t matter if these images were one in 10,000, the facts that any exist that are this good is crazy.

2 more replies

jayalammarOP4y ago

Agreed that people should pay attention to cherry-picking of model outputs.

For this one in particular, here are a few more results for Battlestar and The Office:

https://twitter.com/Miles_Brundage/status/153247388947686195...

1 more reply

tgv4y ago

Yes, and it still feels a lot like Searle's Chinese Room. It's as if it skips a dozen steps. Well, that's exactly what happens, of course. But it does show that the network can match linguistic descriptions to images extraordinarily well.

myrmidon4y ago

Do you find the Chinese Room argument convincing?

Do you feel that the human mind is more than an "appropriately" trained "biological" neural network?

What do you consider the limits of a DALL-E like system compared to a "true" mind?

My personal opinion is that the Chinese Room argument is fancy handwaving that crucially relies on never being explicit about what it means by "understanding", combined with an appeal to intuition.

I strongly believe that there is nothing "magical" about the human mind or brain (that could not be replicated artificially), and thus that a comparably trained, appropriately designed system ("DALL-E successor") OR a copy OR a simulation of a human brain would be all just as capable and "understanding"/"conscious" as another human...

3 more replies

puranjay4y ago

Kermit in Sopranos did it for me. Hilariously accurate.

jonnycomputer4y ago· 7 in thread

Not to detract from the accomplishment, but none of these are Kermit. The "Kermit" part of the query seems mostly to have accomplished querying for "humanoid frog"

maxfurman4y ago

Some of them look a lot like Kermit! But many do not, depending on how far away the target aesthetic is from "muppet." A smarter Dalle2 could do a better job at preserving the "essential Kermit-ness" perhaps. But maybe it's more impressive to adapt to a completely new aesthetic?

notahacker4y ago

That's actually one of the more interesting and impressive things: the AI has substituted in "humanoid frog" and in some cases given it movie-appropriate features like Matrix sunglasses ot Pixar-style eyes which are probably unique to the image rather than simply pasting photographs of the original muppet into movie poster settings which would be an acceptable lower level response to the brief.

treesprite824y ago

Seems like it adapted Kermit's features to fit with the world of the movie, as if he was actually a character from that movie.

I don't have access to DALL-E 2, but I wonder if a prompt like "A cameo from Kermit the Frog in ..." would give more literal Kermits.

moss24y ago

No those are clearly Kermit the Frogs

jonnycomputer4y ago

If I hadn't been prompted with the words Kermit, no way would I have guessed that they were supposed to be Kermit. For example, Kermit has a characteristic shape of his pupils. None of the examples have that.

carapace4y ago

Instead of arguing about "what is" in re: subjective judgements, it can sometimes be useful to phrase things more literally, as in:

"To me those are clearly Kermit the Frogs."

"To me those are clearly not Kermit the Frogs."

Then there's nothing really to argue about. Instead we can discuss what we see and how that affects our subjective perceptions.

For example, Kermit the Frog doesn't have eyelids, but most of these images show a frog with eyelids.

bergenty4y ago

These are all kermit. The identifying caricatural features are all here.

FredPret4y ago· 6 in thread

How is this so good? If you told me this was done by a very talented human, I’d still be surprised at how good they are. That must count as a sort of Turing test, no?

pygy_4y ago

DALL-E is indeed superhuman in its ability to create images from simple prompts.

This shows the limits of the Turing test. To pass it a program must not only be smart enough, it must be dumb enough too.

Pulling what DALL-E does is a tell-tale sign it’s most likely not human, and would make it fail the test.

simiones4y ago

Well, most of the pictures (if not all), while astoundingly good as an idea, have the tell-tale signs of being AI-generated, the kinds of mistakes a human would never make.

For example, looking at the WALL-E one [0], you can clearly see that the hands and feet aren't actually separated properly. There is also plenty of missing "logic" around the armpits. These are the kinds of mistakes a human can't make - especially one that is so adept at drawing the other parts so perfectly.

[0] https://twitter.com/HvnsLstAngel/status/1531512163738669057/...

1 more reply

sbierwagen4y ago

2021: This machine fails the Turing test, it's too dumb. Look at how crappy this generated art is.

2022: This machine fails the Turing test, it's way too smart! No human could be this good at creating art.

davidkunz4y ago

The Kermit Test.

lgvld4y ago

Love this name. ;-)

ASalazarMX4y ago

I'd guess these are curated, but I have no idea how many tries were discarded, as I'm still waiting for the invitation. Even in the best examples you can find fused fingers, deformities, melting, and slight errors that a human artist would need to touch up before it truly passes as a human work.

endisneigh4y ago· 6 in thread

I don’t understand how this isn’t infringing some copyright. Anyone do any research on this?

If we accept that a model trained on copyright material does not infringe on the materials rights, then circumventing all copyright can be as simple as creating a sufficiently close derivative and giving it away.

Not to say that copyright is good to begin with.

akersten4y ago

Everything that you, a person with a paintbrush, could paint "in the style of" something else is informed by what your model (your brain) has been exposed to. There's no getting around that, and commissioning you to paint something "in the style of postwar authoritarian England" does not infringe the copyright of V for Vendetta (even if I told you "make it look just like the movie"); it's an original painting.

Stylistic inspiration is not an infringement of copyright, in either that case or the "do it on a computer" case here.

The Kermit the Frog aspect though is interesting - it applies equally for both the human and machine made works - if an argument could be made that the subject of the work sufficiently resembles the character, maybe there's a trademark issue at hand?

But in any scenario, nothing legally novel about the work being created by machine.

bogwog4y ago

> But in any scenario, nothing legally novel about the work being created by machine.

…except for the fact that it was created by a machine.

Just like copyright law had to be revised to deal with software and the internet, it will need to be revised to deal with AI.

1 more reply

jovial_cavalier4y ago

>circumventing all copyright can be as simple as creating a sufficiently close derivative and giving it away.

That's correct. People do this all the time, sans the giving it away part.

Also, there is no way that you can argue these images are not transformative.

endisneigh4y ago

I shouldn’t clarify - I’m not talking about the Kermit images, I’m talking about Dalle2 itself. If you had it render Neo from the Matrix, albeit imperfectly, would that be transformative use?

2 more replies

phkahler4y ago

>> Also, there is no way that you can argue these images are not transformative.

Exactly. Kermit has been very much transformed, and "in the style of" is not copyright infringement AFAIK.

ceejayoz4y ago

https://en.wikipedia.org/wiki/Transformative_use

feynmanalgo4y ago· 5 in thread

If this is legit it would be one of the most impressive things I've seen in many years.

oldstrangers4y ago

It's legit, just variations / iterations.

Here's what I got for "A still of Kermit The Frog in Blade Runner 2049 (2017)":

https://imgur.com/a/y7t3RKx

feynmanalgo4y ago

None of them scream Blade Runner 2049 to me.

1 more reply

lfkdev4y ago

? Dale is a legit project from openAI

jffry4y ago

I think "legit" in this context might mean more of the Twitter post author's representation of their use of DALL-E 2:

- Were the prompts shown the ones fed to DALL-E 2 or were there more complex details described in the prompt?

- Were these the first images generated for the prompt, or did the author generate many images and cherry-pick the best example, and if so from how many?

1 more reply

tommoor4y ago

That doesn't mean the images in this tweet thread are output from Dalle though…

Although if an individual created all of these then that's about the same amount of impressive

1 more reply

messe4y ago· 3 in thread

Loved the David Lynch ones. I'm disappointed the image it generated for Eraserhead[1] didn't have Kermit as the baby[2]. I'm curious as to what it would generate for "Kermit the Frog in David Lynch's Dune".

[1]: https://twitter.com/HvnsLstAngel/status/1531774195234791424?...

[2]: https://duckduckgo.com/?t=ffab&q=eraserhead+baby&iax=images&...

canjobear4y ago

Close https://twitter.com/Plinz/status/1532578234855936000

monkeybutton4y ago

Absolutely terrifying https://twitter.com/Plinz/status/1532578531674189824

giaour4y ago

> Kermit the Frog in David Lynch's Dune

DALL-E would just shoot back a still with Kyle McLachlan in it. He's already so Kermit like!

ahoka4y ago· 2 in thread

Is there an easy way to try this model on my computer?

jffry4y ago

As far as I know, the actual trained system is proprietary, and you can only use it by requesting access to their online system for generating imagery: https://openai.com/dall-e-2/

There are open-source efforts to implement it and make trained models available, but I don't imagine they are yet at the same scale of ingested data / model size as OpenAI's system: https://github.com/lucidrains/DALLE2-pytorch

jayalammarOP4y ago

Not Dalle2 specifically, it's proprietary and there's a waitlist. Older open source alternatives include https://huggingface.co/dalle-mini

Tade04y ago· 2 in thread

I see a distinct lack of "Kermit The Frog in Dragon Ball Z".

rememberlenny4y ago

Here is the prompt for "A still of Kermit The Frog in Dragon Ball Z (1989)"

https://user-images.githubusercontent.com/1332366/171921054-...

sillysaurusx4y ago

Oh man. Now I want to apply for access just for that. Thanks for the idea.

jdmoreira4y ago· 1 in thread

Dall-E still blows my mind every time. To someone who is plugged into these things, how is this possible?

I understand computers and I understand back-propagation but this... it feels like magic to me.

Can someone indulge me in a short explanation of how this works and how is it this good?

jayalammarOP4y ago

These are good explanations:

- https://www.assemblyai.com/blog/how-dall-e-2-actually-works/

- https://www.youtube.com/watch?v=F1X4fHzF4mQ

rlv-dan4y ago· 1 in thread

https://nitter.net/HvnsLstAngel/status/1531506455714492416

aasasd4y ago

There are browser extensions to do this for you. Specifically ‘Redirector’ for FF.

spyremeown4y ago· 1 in thread

This is THE technology of the 2020s. It's insane. I'm in absolute awe at how amazing this AI is.

aaaaaaaaata4y ago

Disagree — GPT-3 type AIs are.

But people who do viral news and posts don't...read. So, their impact will continue to go unnoticed in comparison to DeEpFaKeS and Dall-E.

mynegation4y ago· 1 in thread

I am finishing “Love, Death + Robots” on Netflix and between the quality of CGI and things like DALL-E and Imagen, movie, design, and illustration industries are in for an upheaval.

layer84y ago

Video is probably a whole other ballpark regarding required training and resources.

qwertyuiop_4y ago· 1 in thread

“ All the impressive achievements of deep learning amount to just curve fitting” - Judea Pearl

https://archive.ph/89lqw

ryanklee4y ago

If the results are actually impressive, then the dismissive rhetoric doesn't amount to a hill of beans.

fred2564y ago· 1 in thread

These are so good I am scared.

adamsmith1434y ago

Good, that is the correct response. The progress in AI recently has been staggering

dash24y ago· 1 in thread

"Various films." I hope you're not thinking what I think you're thinking.

sillysaurusx4y ago

You can bet your life savings that OpenAI could generate some really interesting Kermit porn.

FeepingCreature4y ago· 1 in thread

man look at this amazing curve fitting from training samples

gwern4y ago

man all this is is a fancy Photoshop. i bet you could find all of these in Google Images or something.

b0ner_t0ner4y ago

More eye candy here: https://np.reddit.com/r/dalle2/top/?sort=top&t=year

Very good one: https://np.reddit.com/r/dalle2/comments/u5kkty/a_fluffy_baby...

“a masterful impressionist portrait painting of a little doggey who is worried he may not be a good boy”: https://nitter.net/MarkRich388/status/1532482006809866240

bergenty4y ago

If these pictures are not touched up this is superhuman. My mind hasn’t been this blown in a long time. What a time to be alive.

motoxpro4y ago

In this thread:

1. People thinking it's amazing (me) 2. People thinking it's not creative enough e.g. "It’s using what already exists, not conjecturing something new." 3. People thinking it's too creative e.g. "This looks nothing like Kermit"

high_54y ago

What about ... Kermit in the Muppet Show?

stuntkite4y ago

Kermit the Frog in Behind The Green Door

Kermit the Frog in Salò, or the 120 Days of Sodom

Kermit the Frog in Pink Flamingos

----

I actually might have Dalle2 access soonish. Honestly this is the best demonstration I've seen that demonstrates to me very well that we are about 2 years away from maybe not "general ai" but some pretty wild shit that is going to make most of what we do and value as humans very different.

gwern4y ago

Followup: "Big Bird Throughout Cinematic History": https://www.reddit.com/r/dalle2/comments/v4q5rh/big_bird_thr...

stuntkite4y ago

When will they get back to me with my Dalle2 access! I need to make a ton of kermit images.

ridgered44y ago

Is that Fozzie Bear in the mirror/backdrop of the Stranger Things one? Or am I seeing what I expect to see...

hcarvalhoalves4y ago

The Big Lebowski’s one is clearly a guy in a frog costume… hilarious!

jeffreygoesto4y ago

Not a single image shows Kermit. Green frogs yes. Kermit no. Fail.

qgin4y ago

A lot of art-adjacent jobs are about to become toast.

rvieira4y ago

Kermit in Eraserhead is everything I expected it to be.

gaudat4y ago

Looking better than those pixel art NFTs.

layer84y ago

The Matrix version looks like TMNT. :)

candlemas4y ago

None of those look like Kermit the Frog.

j / k navigate · click thread line to collapse

192 comments

122 comments · 35 top-level

PheonixPharts4y ago· 15 in thread

These are honestly not very impressive (no sarcasm here) and further convince me that the next AI Winter will come with this coming recession.

Don't get me wrong, they are still impressive in the quality of the visual they produce, but just like Markov Chain demos of old, they're neat but way miss the mark.

None of these capture the "feel" of Kermit the Frog. Most of them look like weird designs for the Ninja Turtles movie in the 90s.

There are several distinctive features of Kermit that a missing from nearly all of these.

None of these get Kermit correct, they all just look like frogs (maybe Dalle2 isn't trained on copyrighted/trademarked material?)

There are fan made versions of some of these which show just how different Dalle2 is from human imagination:

Kermit actually has been on family guy: https://static.wikia.nocookie.net/muppet/images/7/71/Famguys...

There are several "Kermit in Star Wars Examples" here are two: https://i.kym-cdn.com/entries/icons/original/000/021/668/ker..., https://i.ytimg.com/vi/6MebZx-4950/maxresdefault.jpg

Guest190238924y ago

For example, your Star Wars example...

https://i.ytimg.com/vi/6MebZx-4950/maxresdefault.jpg

It's clearly just an existing photo of Kermit pasted over an image from the film. There are even two sets of arms. I could Photoshop that in a few minutes.

Then, the Dalle2 image...

https://pbs.twimg.com/media/FUEDDm2UEAAO8yb?format=jpg&name=...

tommoor4y ago

I genuinely can't tell if you're trolling. This isn't impressive because the AI model doesn't accurately capture the "feel" of Kermit!?

witheld4y ago

The computer was asked to produce photos of Kermit the frog. It failed spectacularly at rendering anything resembling Kermit the frog.

2 more replies

bergenty4y ago

When it completely does capture the feel of Kermit.

fullshark4y ago

natly4y ago

1 more reply

simiones4y ago

While I think many are over-interpreting the quality of these results, yours is sounding like a clear case of a No True Kermit fallacy.

throw67464y ago

I thought I was taking crazy pills, none of them look like kermit bur rather they look like a generic frog. They don't even have the same pattern around his collar.

vintermann4y ago

It is odd, isn't it? It captures "essential" characteristics of all those films in a honestly brilliant way - but it doesn't capture any of the iconic characteristics of Kermit himself!

deusum4y ago

Your take on Kermit is too literal. Allow some artistic license. And you neglect all of the other thematic elements from the prompt.

Gnarled4y ago

> These are honestly not very impressive (no sarcasm here) and further convince me that the next AI Winter will come with this coming recession.

gk14y ago

throw67464y ago

... And it's a decent tool?

If people say Dalle can improve the workflow of digital artists, sure, but Copilot hasn't revolutionized programming either, you still have to be a good programmer to finish whatever you are doing:

PheonixPharts4y ago

> but when they launched GitHub Copilot...

What happened next? Is anyone using copilot for serious work? Has it changed programming in a fundamental way?

carapace4y ago

I agree with what you say in re: Kermit. Most of these images look to me like a frog that looks like Kermit the Frog but isn't. Metaphorically (and literally) Jim Henson isn't in these images.

However, I don't think you're correct in your assessment of the import of this sort of thing: it's an imagination machine. This isn't a brick wall, it's a foundation on which to build.

tinalumfoil4y ago· 10 in thread

tdehnel4y ago

Sorry to be this guy but that is not creativity. It’s using what already exists, not conjecturing something new.

Contrast with real creativity (what people can do but machines currently cannot) where you conjecture something completely new.

dougmwne4y ago

The image is new, it did not previously exist. It is a creation, a very vague idea of a few words that was created in full realization.

So it sees like the only difference between the "Not creativity" that Dall-E is doing and "Real Creativity" that humans do is tht humans are the ones doing it?

1 more reply

pphysch4y ago

Humans don't generate new ideas from nothing. Your "real creativity" doesn't exist. Everything is dependent on what came before, and therefore derivative to some extent.

Copernicus got his idea after gathering a lot of data, explicitly and implicitly, training his internal model of the world.

1 more reply

andrewstuart24y ago

I fail to see what the difference is between 99% of existing "creativity," which is essentially arranging existing ideas into novel combinations, and what DALLE2 does.

klik994y ago

"Good artists borrow, great artists steal"

1 more reply

vouwfietsman4y ago

This is a common fallacy.

Creativity, intelligence, consciousness.

It doesn't matter what you say, you cannot define these concepts with the same clarity you use to defend that the concept is missing.

Learn from Wittgenstein: Worüber man nicht sprechen kann, darüber muss man schweigen.

1 more reply

gwern4y ago

> No machine learning model

https://www.nature.com/articles/d41586-019-03332-7

SergeAx4y ago

Artist is using ideas already existed and creates something new out of them. It has always been this way since cave wall art.

I am 100% sure Copernicus was not the first to suggest a heliocentric system, but he was the one who put enough energy into proving it and defend that theory.

adamsmith1434y ago

>Sorry to be this guy but that is not creativity. It’s using what already exists, not conjecturing something new.

Such a cute point of view, completely wrong but cute. Please go find the original images of Kermit in Blade Runner and WallE that were just copied here.

If the model were trained on actual observations of planetary trajectories it would trivially recreate keplers laws, newtons laws etc.

2 more replies

gwern4y ago

> like I'm not sure why kermit has a plastic texture in many of the pictures

sp3324y ago· 8 in thread

alkonaut4y ago

natly4y ago

gwern4y ago

Hilarious parodies like these are copyright infringement, yes, but also open-and-shut fair use defense. (You're confusing the issue of transformativeness and fair use defense.)

BeefWellington4y ago

I'm imagining the defense will be: "Your honor, it can't be infringement because I ran it through an ML model first."

Especially if the movie(s) that are eventually generated this way are ripping whole scenes or sequences out of other films, a la copilot.

bogwog4y ago

I feel like if they don’t have licenses for all of their source material, then the model should be required to be released into the public domain.

It’s like extremely expensive piracy that is bad for artists and bad for the environment.

1 more reply

xeromal4y ago

simiones4y ago

> Are the owners of the IP losing money on letting someone generate new characters in their movie/show?

Typically, creators are very protective of this sort of thing, unless it stays in the area of fan art. If anyone tries to seriously monetize this kind of output, I'm sure we'll see a lot of cases.

jhart994y ago

mensetmanusman4y ago· 8 in thread

This is amazing.

We are at the precipice of someone releasing a $100M blockbuster movie just based on the language in the script with zero cost beyond compute.

What will this mean for the future of entertainment…

aqzman4y ago

I suspect at some point in the next 5-15 years we will begin to see AI generated entertainment perfectly tailored to a person's preferences.

Workaccount24y ago

I can already envision everyone talking over each other trying to say how their personal AI generated TV show is the best show they have ever seen.

Just imagine how much lonelier the world is going to feel when people don't even have entertainment in common anymore.

3 more replies

phkahler4y ago

>> I suspect at some point in the next 5-15 years we will begin to see AI generated entertainment perfectly tailored to a person's preferences.

Kermit in Debby Does Dallas. Kermit in the Graduate. Kermit with 2 broken flippers. Oh the depravity. I'm not sure getting high quality visualizations of any random passing thought is a good idea ;-)

marcodiego4y ago

When I think about such a possibility, https://c.tenor.com/emURWFXTpvIAAAAC/pearl-jam-do-the-evolut... comes to my mind.

AnIdiotOnTheNet4y ago

On the one hand, the ability for Star Trek's Holodeck to create large amounts of content from a few terse natural language instructions is look less and less implausible.

andreilys4y ago

it would lack the novelty and playfulness of a good human creative work, and it likely would be very shallow in its themes.

Simply tune the parameters associated with novelty and playfulness and you’ll get the desired result.

There’s nothing inherent in human creativity that can’t be replicated by an AI. Most creative work is derivative and remix’s prior art.

This is a good short video on the phenomenon of remixing https://m.youtube.com/watch?v=MZ2GuvUWaP8

2 more replies

bambax4y ago

> The Terminator and Aaron Sorkin wrote the script?

You can't handle the future! We live in a world that has time machines, and those time machines have to be manned by robots! Who's gonna do it? You?

xg154y ago

Waiting for the next iteration of Copilot to use this technology.

"Like Facebook but like make it not suck"

AI: "Here you go!"

alex_duf4y ago· 7 in thread

These are mind bogglingly good

coldcode4y ago

silicon24014y ago

1 more reply

mensetmanusman4y ago

It doesn’t matter if these images were one in 10,000, the facts that any exist that are this good is crazy.

2 more replies

jayalammarOP4y ago

Agreed that people should pay attention to cherry-picking of model outputs.

For this one in particular, here are a few more results for Battlestar and The Office:

https://twitter.com/Miles_Brundage/status/153247388947686195...

1 more reply

tgv4y ago

myrmidon4y ago

Do you find the Chinese Room argument convincing?

Do you feel that the human mind is more than an "appropriately" trained "biological" neural network?

What do you consider the limits of a DALL-E like system compared to a "true" mind?

My personal opinion is that the Chinese Room argument is fancy handwaving that crucially relies on never being explicit about what it means by "understanding", combined with an appeal to intuition.

3 more replies

puranjay4y ago

Kermit in Sopranos did it for me. Hilariously accurate.

jonnycomputer4y ago· 7 in thread

Not to detract from the accomplishment, but none of these are Kermit. The "Kermit" part of the query seems mostly to have accomplished querying for "humanoid frog"

maxfurman4y ago

notahacker4y ago

treesprite824y ago

Seems like it adapted Kermit's features to fit with the world of the movie, as if he was actually a character from that movie.

I don't have access to DALL-E 2, but I wonder if a prompt like "A cameo from Kermit the Frog in ..." would give more literal Kermits.

moss24y ago

No those are clearly Kermit the Frogs

jonnycomputer4y ago

carapace4y ago

Instead of arguing about "what is" in re: subjective judgements, it can sometimes be useful to phrase things more literally, as in:

"To me those are clearly Kermit the Frogs."

"To me those are clearly not Kermit the Frogs."

Then there's nothing really to argue about. Instead we can discuss what we see and how that affects our subjective perceptions.

For example, Kermit the Frog doesn't have eyelids, but most of these images show a frog with eyelids.

bergenty4y ago

These are all kermit. The identifying caricatural features are all here.

FredPret4y ago· 6 in thread

How is this so good? If you told me this was done by a very talented human, I’d still be surprised at how good they are. That must count as a sort of Turing test, no?

pygy_4y ago

DALL-E is indeed superhuman in its ability to create images from simple prompts.

This shows the limits of the Turing test. To pass it a program must not only be smart enough, it must be dumb enough too.

Pulling what DALL-E does is a tell-tale sign it’s most likely not human, and would make it fail the test.

simiones4y ago

Well, most of the pictures (if not all), while astoundingly good as an idea, have the tell-tale signs of being AI-generated, the kinds of mistakes a human would never make.

[0] https://twitter.com/HvnsLstAngel/status/1531512163738669057/...

1 more reply

sbierwagen4y ago

2021: This machine fails the Turing test, it's too dumb. Look at how crappy this generated art is.

2022: This machine fails the Turing test, it's way too smart! No human could be this good at creating art.

davidkunz4y ago

The Kermit Test.

lgvld4y ago

Love this name. ;-)

ASalazarMX4y ago

endisneigh4y ago· 6 in thread

I don’t understand how this isn’t infringing some copyright. Anyone do any research on this?

Not to say that copyright is good to begin with.

akersten4y ago

Stylistic inspiration is not an infringement of copyright, in either that case or the "do it on a computer" case here.

But in any scenario, nothing legally novel about the work being created by machine.

bogwog4y ago

> But in any scenario, nothing legally novel about the work being created by machine.

…except for the fact that it was created by a machine.

Just like copyright law had to be revised to deal with software and the internet, it will need to be revised to deal with AI.

1 more reply

jovial_cavalier4y ago

>circumventing all copyright can be as simple as creating a sufficiently close derivative and giving it away.

That's correct. People do this all the time, sans the giving it away part.

Also, there is no way that you can argue these images are not transformative.

endisneigh4y ago

I shouldn’t clarify - I’m not talking about the Kermit images, I’m talking about Dalle2 itself. If you had it render Neo from the Matrix, albeit imperfectly, would that be transformative use?

2 more replies

phkahler4y ago

>> Also, there is no way that you can argue these images are not transformative.

Exactly. Kermit has been very much transformed, and "in the style of" is not copyright infringement AFAIK.

ceejayoz4y ago

https://en.wikipedia.org/wiki/Transformative_use

feynmanalgo4y ago· 5 in thread

If this is legit it would be one of the most impressive things I've seen in many years.

oldstrangers4y ago

It's legit, just variations / iterations.

Here's what I got for "A still of Kermit The Frog in Blade Runner 2049 (2017)":

https://imgur.com/a/y7t3RKx

feynmanalgo4y ago

None of them scream Blade Runner 2049 to me.

1 more reply

lfkdev4y ago

? Dale is a legit project from openAI

jffry4y ago

I think "legit" in this context might mean more of the Twitter post author's representation of their use of DALL-E 2:

- Were the prompts shown the ones fed to DALL-E 2 or were there more complex details described in the prompt?

- Were these the first images generated for the prompt, or did the author generate many images and cherry-pick the best example, and if so from how many?

1 more reply

tommoor4y ago

That doesn't mean the images in this tweet thread are output from Dalle though…

Although if an individual created all of these then that's about the same amount of impressive

1 more reply

messe4y ago· 3 in thread

[1]: https://twitter.com/HvnsLstAngel/status/1531774195234791424?...

[2]: https://duckduckgo.com/?t=ffab&q=eraserhead+baby&iax=images&...

canjobear4y ago

Close https://twitter.com/Plinz/status/1532578234855936000

monkeybutton4y ago

Absolutely terrifying https://twitter.com/Plinz/status/1532578531674189824

giaour4y ago

> Kermit the Frog in David Lynch's Dune

DALL-E would just shoot back a still with Kyle McLachlan in it. He's already so Kermit like!

ahoka4y ago· 2 in thread

Is there an easy way to try this model on my computer?

jffry4y ago

As far as I know, the actual trained system is proprietary, and you can only use it by requesting access to their online system for generating imagery: https://openai.com/dall-e-2/

jayalammarOP4y ago

Not Dalle2 specifically, it's proprietary and there's a waitlist. Older open source alternatives include https://huggingface.co/dalle-mini

Tade04y ago· 2 in thread

I see a distinct lack of "Kermit The Frog in Dragon Ball Z".

rememberlenny4y ago

Here is the prompt for "A still of Kermit The Frog in Dragon Ball Z (1989)"

https://user-images.githubusercontent.com/1332366/171921054-...

sillysaurusx4y ago

Oh man. Now I want to apply for access just for that. Thanks for the idea.

jdmoreira4y ago· 1 in thread

Dall-E still blows my mind every time. To someone who is plugged into these things, how is this possible?

I understand computers and I understand back-propagation but this... it feels like magic to me.

Can someone indulge me in a short explanation of how this works and how is it this good?

jayalammarOP4y ago

These are good explanations:

- https://www.assemblyai.com/blog/how-dall-e-2-actually-works/

- https://www.youtube.com/watch?v=F1X4fHzF4mQ

rlv-dan4y ago· 1 in thread

https://nitter.net/HvnsLstAngel/status/1531506455714492416

aasasd4y ago

There are browser extensions to do this for you. Specifically ‘Redirector’ for FF.

spyremeown4y ago· 1 in thread

This is THE technology of the 2020s. It's insane. I'm in absolute awe at how amazing this AI is.

aaaaaaaaata4y ago

Disagree — GPT-3 type AIs are.

But people who do viral news and posts don't...read. So, their impact will continue to go unnoticed in comparison to DeEpFaKeS and Dall-E.

mynegation4y ago· 1 in thread

I am finishing “Love, Death + Robots” on Netflix and between the quality of CGI and things like DALL-E and Imagen, movie, design, and illustration industries are in for an upheaval.

layer84y ago

Video is probably a whole other ballpark regarding required training and resources.

qwertyuiop_4y ago· 1 in thread

“ All the impressive achievements of deep learning amount to just curve fitting” - Judea Pearl

https://archive.ph/89lqw

ryanklee4y ago

If the results are actually impressive, then the dismissive rhetoric doesn't amount to a hill of beans.

fred2564y ago· 1 in thread

These are so good I am scared.

adamsmith1434y ago

Good, that is the correct response. The progress in AI recently has been staggering

dash24y ago· 1 in thread

"Various films." I hope you're not thinking what I think you're thinking.

sillysaurusx4y ago

You can bet your life savings that OpenAI could generate some really interesting Kermit porn.

FeepingCreature4y ago· 1 in thread

man look at this amazing curve fitting from training samples

gwern4y ago

man all this is is a fancy Photoshop. i bet you could find all of these in Google Images or something.

b0ner_t0ner4y ago

More eye candy here: https://np.reddit.com/r/dalle2/top/?sort=top&t=year

Very good one: https://np.reddit.com/r/dalle2/comments/u5kkty/a_fluffy_baby...

“a masterful impressionist portrait painting of a little doggey who is worried he may not be a good boy”: https://nitter.net/MarkRich388/status/1532482006809866240

bergenty4y ago

If these pictures are not touched up this is superhuman. My mind hasn’t been this blown in a long time. What a time to be alive.

motoxpro4y ago

In this thread:

high_54y ago

What about ... Kermit in the Muppet Show?

stuntkite4y ago

Kermit the Frog in Behind The Green Door

Kermit the Frog in Salò, or the 120 Days of Sodom

Kermit the Frog in Pink Flamingos

----

gwern4y ago