Dall-E 2 illustrations of Twitter bios

(twitter.com)

848 points · manesioz · 4y ago · 394 comments

279 comments · 65 top-level

dash24y ago· 38 in thread

Obviously, from the AI point of view, this is just amazing and frankly terrifying.

OK, I'll be the guy who brings the snark. It seems that when Silicon Valley tech people create AI, it makes exactly the art you'd expect Silicon Valley tech people to like. I.e. this is very much the style you see in NFTs, or, as someone else said, in Dixit. It's quirky and stoner-ish, very "transcendental"... for an AI, it's amazing...

For a human, it would be dross.

Yeah, yeah, I know, art is subjective, well I like it, how can you impose your tastes on the rest of the world, et cetera et cetera. Sorry, but it's dross! It's the kind of work the guy in the art shop up the road churns out, and sells to the ignorant locals in my town. It's the art equivalent of Visual Basic. (I'm trying to get through to you that in this world, too, things can not just be done, but be done well or badly.)

If there's a lesson on the AI side here (and maybe there isn't) it is just that these machines are still copying. They were trained on a bunch of art - and you can clearly see the kind of art that was used. Presumably, if it were just trained on Old Masters and Picasso, Dall-E would be mass-producing the stuff I, an intellectual, like.

Note the difference, though, with a real artist. A real artist takes as input the real world - Rouen cathedral, the horrors of war in Spain, a Campbell's soup can - and produces art as output. This takes as input art and produces more art.

nopinsight4y ago

I do not disagree with your main point in the last paragraph, although I would say that most human creative output is also an evolution and combination of existing works.

I'm curious though--would you suggest that all these other examples are dross? https://twitter.com/prafdhar/status/1511863583906275328 and all the variations presented here, https://openai.com/dall-e-2/.

What if you found the "better" pieces of these arts in other contexts, like a museum, without knowing who created them? Are you certain that you would still hold the same opinion?

low_tech_love4y ago

I would like to take a tangent here and discuss your question on whether the opinion would be the same if it were in a museum. I hear that argument very often, and it is based on a deep misunderstanding of what an art exhibition is. The main point missed is this: an art exhibition is a "curated" event; it is almost as much about the person who arranged the event as it is about the artists themselves. There is a meaning to what you are seeing that was intended by the curator. The point you tried to make (as so many others have also tried) is based on an assumption that you could take any random work of art and put it in a museum and it would be judged solely by its "artistic" (i.e. "plastic") merits. It can’t be and won’t be like that. When you go into a museum you trust the curator that what you are seeing has a meaning that goes beyond that. Every piece of meaningful art is surrounded by a context. You can’t put a few colored squares up in a museum and expect them to be treated like a Mondrian. I’m not saying you should just swallow everything that the curator shows you; discussions can and should happen. But the point remains that art cannot be judged out of context. As much as I hate NFTs and stuff like bored apes, I have to admit that the apes themselves are much more than random machine-created childish drawings; society has made them more than that, and I can appreciate that. Whether it’s art or not remains to be seen, but I have a hunch that this same thing was discussed when Andy Warhol put a banana on a white canvas.

pgcj_poster4y ago

> https://twitter.com/prafdhar/status/1511863583906275328

As with most of Dall-E's output, it looks fine at a glance, but is just gross when you look closely. The kid's ear is deformed and blends into their hair in a deeply unsettling way.

heavenlyblue4y ago

This is all DeviantArt-level art. $1 a piece done by someone who does it as a hobby on the side. Professional designers can churn out stuff like that at a massive speed. It’s the equivalent of stock photos in essence.

Not going to even start on how cheap the whole sentiment of having a dog with a kid and a bunch of stars in the picture is. Of course it appeals to an emotion.

dash24y ago

The one in the twitter link, yes, I'm afraid that is utter dross, and if I saw it in a museum I would burst out laughing! I didn't spot any other ones from dall-e 2 in the thread, though.

native_samples4y ago

It's making art that Silicon Valley people like because it's being given absurdly stereotypically "Bay Area Twitter Loving AI Person" drawing prompts. DALL-E can make other styles of art or just photos quite easily, look at the samples for simpler and more normal prompts here:

https://github.com/openai/dalle-2-preview/blob/main/system-c...

The art style is a direct consequence of the fact that apparently not one of the people this guy follows on Twitter is a normal person - they're all psychedelic-obsessed AI researchers whose Twitter bios are chosen to be abstract and weird as possible. So the AI does what it's told and creates abstract weird art as it tries to interpret stuff like "commitments empathetic, psychedelic, philosophical" or "cottagecore tech-adjacent young robert moses". I think it did an amazing job, honestly.

The real social issue we should be debating here is whether the sort of people who work at OpenAI can be trusted to make honest, normal AI to begin with. I remember seeing a comment on HN some years ago to the effect of "AI safety is what happens when hard left social activists discover that there's no way to train AI on the writings of normal people without it thinking like a normal person".

The document I linked above is mostly about horrors like the model creating photos of a white male builder when prompted with "photo of a builder". It's full of weird, stunted quasi-English like: the prompt “lawyer” results disproportionately in images of people who are White-passing and male-passing in Western dress, while the prompt “nurse” tends to result in images of people who are female-passing. What does that even mean? Presumably this is the latest iteration of trans related language games that the rest of us didn't get the memo on?

Like always with OpenAI, they train an AI and then freak out when it describes the world as it actually is. The real AI safety question is not DALL-E in its current state, it's whether the final AI that they release to the public will be "safe" in the sense of actually understanding reality, or whether it exists in some bizarre, non-existent SJW dystopia in which builders are always black women and white men don't exist at all.

boppo14y ago

>cottagecore

Nah it got this almost exactly wrong. Cottagecore is a warm, welcoming aesthetic that usually involves spring or fall motifs. Those pictures have a Courage the Cowardly Dog spookiness. I think they're really cool illustrations. But as an illustrator, if that's what I delivered for that prompt I wouldn't expect to be paid well.

That said, cottagecore is more of a fashion thing than an illustration thing, so my guess is the issue here is just the training data.

rfw3004y ago

> Sorry, but it's dross! It's the kind of work the guy in the art shop up the road churns out, and sells to the ignorant locals in my town.

I want to chime in that I think this is not only technologically impressive, but also societally significant for exactly this reason. DALL-E isn’t Picasso, sure. But there’s a lot of dross artists out there. And dross writers. And dross coders.

When DALL-E and its ilk start to set the floor in these industries, it’s easy to feel as if we’re on the precipice of a world (or at least an economy) fundamentally different from the one we know now.

meroes4y ago

You really think we are all going to be visiting museums with AI generated art one day?

teaearlgraycold4y ago

Having used DALL-E 2 a little bit I can tell you that the AI must have been given more than just the twitter bio to consistently produce images in this style. The AI can produce a ton of different styles, from photo-realistic to Monet to Saturday morning cartoons. The author here almost certainly requested something like “A Twitter bio picture for a user with the bio [bio] in a [style] style”

rm9994y ago

That's exactly right, he specified a style with each and cherrypicked out of 40-60 pictures: https://twitter.com/nickcammarata/status/1512119623315075081

>Btw transparency for this now-viral thread: I didn’t just paste prompts into dall-e, I played with style (eg. cyberpunk, oil, etc) to keep it interesting and diverse

>If I had to quantify, I’d say I’d generate 2 or 3 batches (tweaking prompt) before choosing my fav two pics, each batch outputs 20 images (two tabs 10 per), so prob technically cherry picked 2 out of 60. That said usually other 58 weren’t really broken, just boring / bit less fun
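
The cherry-picking arithmetic in the quoted tweet can be checked with a few lines. This is only a sketch of the numbers stated above (2 or 3 batches, 20 images per batch, 2 images kept); the function name `selection_stats` is made up for illustration.

```python
# Numbers taken from the quoted tweet: each batch is two tabs of 10 images,
# and two favourite pictures are kept per bio.
IMAGES_PER_BATCH = 20
KEPT = 2


def selection_stats(batches):
    """Return (images generated, fraction of them kept) for a batch count."""
    generated = batches * IMAGES_PER_BATCH
    return generated, KEPT / generated


low = selection_stats(2)   # lower bound: 2 batches
high = selection_stats(3)  # upper bound: 3 batches

print(f"2 batches: {low[0]} images, {low[1]:.1%} kept")
print(f"3 batches: {high[0]} images, {high[1]:.1%} kept")
```

So the "40-60 pictures" figure corresponds to keeping roughly 3% to 5% of what was generated.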

_han4y ago

Was this an unsarcastic “I, an intellectual, […]”?

lanternfish4y ago

I do believe it was - and it also betrays the critique: there is a wealth of discourse on what "real" art is - and that discourse has largely moved beyond "art as representative of real world phenomena". That attitude is a component of a larger set of reactionary positions which try to reject modern and post-modern developments in the artistic world as misguided - an orientation which often tries to justify itself through an appeal to the "old masters" and a failure of some modern caste of charlatan art-theorists who've usurped the true intelligentsia.

NFT art is art - probably more-so on an accidental level than by any intention of the original creator. There is a perversity to the context in which it is created, and that contributes to its artistic footprint orthogonal to its actual aesthetic value.

The same is true of this AI generated art. It's a different artistic fingerprint than - say - Dali, but that doesn't mark it as "bad". If anything, the fact that it's created by a machine puts it in a league entirely of its own. There's a great opportunity here for interrogation of art in a machine created context, and I'm excited to see how the dialog around it evolves.

gjm114y ago

I read it as unsarcastic but self-aware. "I am unashamedly an intellectual and unashamedly consider that X is better than Y and yes, I know that some people regard that sort of thinking as necessarily pretentious and stupid, and I want to indicate that I'm aware of that kind of critique without actually taking up space in what I'm writing to address it."

dash24y ago

No, it was a joke.

unkulunkulu4y ago

I, a complete twat, see this as interesting as it is somehow reflective of “the mind of the machine” in the same way other art is an expression of the mind of the creator, not only “an achievement in visual creativity”, i.e. connection vs judgement

plutonorm4y ago

Oh look the goal posts moved again. The ego will not permit the truth to enter and so it will be pure horror for them when the dam breaks. When at last they hold the gaze of a machine more intelligent and more alive than themselves, they won't know where to turn, but inwards into rejection and fantasy and racism.

meroes4y ago

The goal posts move because society evolves. Why aren’t encyclopedias, deep blue, or the internet already smarter than any of us?

They are, and no one cares that they are. We just use them and move on. That’s not what society finds interesting. It’s not society rebelling, it’s the engineers being mad that we don’t see their creations as being as important as they think. You thought you had a hook on society to effect change as you saw fit, and society simply ate it up and moved on.

Micoloth4y ago

Yep! Now it’s literally “it’s not even as good as Picasso!”

Very amusing to watch

jstummbillig4y ago

All the indication we need at this point is relative progress. If we can agree that it's getting closer to what humans do, then acting like it's surely not going to surpass us anytime soon feels like Go, Dota or SC all over again – and at some point arrogance in the face of mounting evidence feels a little desperate (although I am certain there'll always be someone who can explain why this time, surely, it's all different)

> Note the difference, though, with a real artist. A real artist takes as input the real world

This is exactly where an AI is going to easily surpass any human and it does not even require any fantasy: A human can only have so many inputs before they die. They can only take in so much data at a time. And they will then take some real human time to process all of this and make something off it.

An AI is virtually limitless in all of these respects.

meroes4y ago

Do we really think people are going to visit museums of AI generated images one day?

Maybe even further in the future when we build museums to educate the public how AI first began and fill it with chess AI and medieval rabbit knights drawn by DALLE-2.

But I’m not sure society won’t advance along with AI, and AI will never occupy places we currently think they will.

hoseja4y ago

Go I'll allow, but both the Dota and SC AIs were playing impoverished, simpler versions of the games.

op00to4y ago

I find it hard to believe that artists don’t study and rip off other artists much like AI studies art.

avip4y ago

Your argument falls apart immediately because Visual Basic was a great language for its time.

drdeca4y ago

I think they might have been referring to things written in VB, not the quality of the language itself.

FredPret4y ago

I think real artists follow a workflow similar to DallE's.

1. Spend an inordinate amount of time looking at other art and practising and evaluating your own art

2. THEN look at reality and paint it, or in the case of DallE, take some keywords and paint it

meroes4y ago

3. The public becomes interested in the art.

Hasn't happened for AI yet.

xtagon4y ago

Haven't used DALL-E, but I know with VQGAN+CLIP you can load a Wikiart model instead of the default Imagenet model (in fact, there are many different models). I quite enjoy the Wikiart one for similar reasons as you describe.

But I don't think these training datasets/biases are the complete reason that the results look like NFTs -- the other reason is that so many of the people making NFTs are just using image synthesis such as this ;)

low_tech_love4y ago

You make an excellent point even though some might find you obnoxious. Art cannot be judged out of context, and knowing this was generated from a few words by a complex training process of mimicking absolutely kills it for me. I think in fact the discussion here should be “look, machines can mimic us very well, nice!” but it somehow turns into “wow machines are making art!” No, they are not.

meroes4y ago

100% thank you for saying it.

micromacrofoot4y ago

I’m sorry but you lost me at “I, an intellectual”

lwhi4y ago

Surely, the training material is the most significant determiner of whether the results are dross or culturally significant?

ma2rten4y ago

You are misunderstanding how the technology works. It's trained on a large scale dataset of images, not art specifically.

The reason that it's producing a specific style is that Nick manipulated the text prompt and picked images he liked. He disclosed that in the twitter thread.

throwaway712714y ago

forgive me Your Highness, for I a simple man, allowed myself to enjoy a *dross*

simonh4y ago

What's interesting about it to me has nothing to do with the artistic merit, but the understanding it has about the meaning of the input text, and the composition of the result. It knows about the story of Sisyphus for example, and can compose visual elements that riff on it.

These examples don’t do it justice because these profiles are pretty dumb; there are much better ones out there that show off its interpretive ability much better.

animanoir4y ago

I think the next step is for machines to have taste.

jazzyjackson4y ago

Agreed, it is merely interpolating within the space of its input

technically stunning, artistically incestuous

some may say humans are the same, only ever remixing our input, but we have something machines never will: intention, desire, an unhappiness with how things have been so far.

I'm sure the machine age of clip art will be very successful but I can't see myself being moved by any of it.

(just realized the cheekiness of calling the training set "CLIP")

visarga4y ago

It's not the training set. CLIP stands for Contrastive Language-Image Pretraining. It's a model that tells when a text and an image match.
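
As a rough illustration of "tells when a text and an image match": the model embeds the image and each candidate caption as vectors and compares them. The sketch below is a toy under loud assumptions. The three-dimensional vectors are made up by hand, whereas the real model produces embeddings with learned image and text encoders, and the temperature value is just an illustrative constant.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (1.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def match_probabilities(image_vec, text_vecs, temperature=0.07):
    """Softmax over temperature-scaled similarities between one image and several texts."""
    logits = [cosine_similarity(image_vec, t) / temperature for t in text_vecs]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(logit - peak) for logit in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical embeddings; a real system would compute these with neural encoders.
image_embedding = [0.9, 0.1, 0.2]
captions = {
    "a photo of a dog": [0.8, 0.2, 0.1],
    "a bowl of soup": [0.1, 0.9, 0.3],
}

probs = match_probabilities(image_embedding, list(captions.values()))
best_caption = max(zip(captions, probs), key=lambda pair: pair[1])[0]
print(best_caption, probs)
```

The caption whose vector points in nearly the same direction as the image vector gets almost all of the probability mass, which is the sense in which such a model "scores" a text against an image.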

sanman8114y ago· 20 in thread

This tweet provides important context: https://twitter.com/nickcammarata/status/1512119623315075081...

They weren’t just copying/pasting prompts; there was human creativity involved as well

habitue4y ago

It is important context, but just to push back against people over-correcting on this, my guess is that the ones he rejected also looked approximately this good.

I think the primary reason people are wowed by this thread isn't attributable mainly to the subtle effect of the cherry-picking he did, but in fact to the overall quality of any image generated by DALL-E 2.

nicklovescode4y ago

Yeah that’s right. There were very few strictly-bad ones across the entire thread of generations

The rejections were most commonly

1. Kind of just slightly boring or literally drawing the thing rather than being cool and artistic

2. Cool but similar to the artistic style of bios near it in the thread, whereas I wanted to keep it diverse (surreal followed by literal, oil followed by sharp lines etc) so it's more fun to scroll through

Whereas a few years ago generative models (GANs etc) would often render something like static noise or completely wrong things. I've only seen that problem once with DALL-E across hundreds or thousands of images now (it generated a fully white image)

amilios4y ago

https://twitter.com/nickcammarata/status/1512123067803344899...

You're absolutely right, here he displays the full set for a given prompt. They all look fantastic!

tragictrash4y ago

I've been sitting here with my mouth wide open for 5 minutes unable to move past what you just showed me. I can't fathom that this exists.

nwienert4y ago

Having worked with Nick extensively, take what he says with a grain of salt. He’s well known even by close friends to be a reality distorter, to put it softly.

recuter4y ago

If only James Randi was around. What a fantastic example of cold reading.

Gather round, gather round, give me a text, any text at all and I will produce you an image of some kind. And you will call it "good" if it looks like anything at all.

Because all art is subjective and your mind will work overtime to connect it back to the text you provided.

babyshake4y ago

Does OpenAI have a GUI that you're using or is that a CLI?

Veedrac4y ago

Now imagine, if you can, that the situation were reversed: the AI was adding cyberpunk/oil/etc. to the front of the prompt and it was the human that was interpreting it and painting the many variations.

How many people would then be defending the AI, that actually it wasn't just the human, the AI was playing a critical role in the creative process, ne'er to be replaced? I venture zero people would say that.

educaysean4y ago

Ah, I wish this fact had been highlighted better. Not a criticism of the tweet author; it's just that twitter threads really aren't designed to convey context.

curiousgal4y ago

I hate them with every part of my soul! It's so sad to see the internet has moved from people making blog posts to share interesting things to just spraying it on Twitter in batches for maximum interactions.

recuter4y ago

Of course not. I'm no longer surprised just how eager people are to believe an "AI" will read their minds or has magical qualities and a mind of its own. Even on HN.

Jiggle the imagination just a little bit, dangle some progress, and we're off to the races.

This is "I'm feeling lucky" on google image search + style transfer + trial and error.

If you think I am being dismissive try a few of these twitter bios as searches and see for yourselves.

I guess it fits with the times we live in. Reward shallow plagiarism. Outsource your mind.

It isn't theft if you can automate it.

Autotune for the deaf, Dall-E for the blind.

campground4y ago

I tried what you suggested for a bunch of the twitter bios and found nothing except links back to this thread. I also reverse image searched a bunch of them to see if DALL-E was just kind of pasting together large chunks of images, but never found anything close. I do think you're being dismissive but please post any examples of what you mean. I'm a skeptic and have been waiting to find out that this is just a glorified parlor trick, but so far it seems like DALL-E is doing everything the authors claim, which is remarkable.

joshcryer4y ago

It's fascinating how in our hubris we were thinking that art would be the last thing for AI to tackle, but it appears to be the first (Sam Altman made a similar statement on the launch of DALL-E). Which makes art more meaningful to me, for some reason. There's something in the billion parameters and exabytes of data that this neural net had to process and it was so ... easy. Natural. Because it is us. It is our expression. Our creativity. Our outpouring of data, and all it is doing is reflecting us. It's beautiful.

rndphs4y ago

Yeah I just tried google image searching to find something like the pikachu photo from https://mobile.twitter.com/gottapatchemall/status/1511777860...

But I can't find anything close to the realism that DALL-E 2 achieved here.

recuter4y ago

I don't want to be dismissive of Dall-E itself or its authors. Just the implications that this changes everything or how it is much more than it really is.

https://twitter.com/nickcammarata/status/1512123067803344899...

Prompt: "expressive painting of a man shining rays of justice and transparency on a blue bird twitter logo"

You have to break the concepts up apart (which is one of the things Dall-E improved on).

As such: "expressive blue bird"

In google image search, type clipart, and I even get pill tags to further narrow it down to illustrations for animal paintings and so forth. Google's classifier knows the concept of a "blue bird" and expressionism too.

https://www.google.com/search?q=expressive+blue+bird&tbm=isc...

The same for "ray of light". In fact the top results there I get pngs of sun beams on a transparent background. Which is perfect.

Neither the birds nor the rays of light in the pictures it produced are truly its own creations but lifted from bits of pictures in its training set. I bet you could find the exact bird from the second row online in many places for example. It just won't be blue or stylized.

Composite those things together manually, add a style transfer, and you'll get similar results to DALL-E, as that is what it is doing more or less.

andybak4y ago

I've coincidentally just been watching Rick and Morty and this really fits when read in Rick's voice.

Is yawning at everything astonishing not just exhausting? Everything is "just" made up of less impressive things. But is this really not worthy of a little wonderment?

fastball4y ago

I dunno, I really like the "happy sisyphus" one and I'm not seeing anything remotely as nice (or similar really) on Google Images[1]...

[1] https://www.google.com/search?q=happy+sisyphus

manigandham4y ago

Side note: Those last 3 lines would make fantastic lyrics

gfodor4y ago

A proposition for your consideration: what if you’re wrong?

hemreldop4y ago

It’s a guy not a group.

benlivengood4y ago· 20 in thread

Weren't the last AI-is-impossible holdouts hanging onto creativity as the domain of true intelligence?

I disregard the narrow-AI-only folks almost on principle; Terence Tao, Albert Einstein, Mozart, and Van Gogh couldn't do each other's jobs.

hcarvalhoalves4y ago

Is this creativity, or is it remixing pieces of the creativity from the authors in the training data samples? Would it come up with anything that seems creative if the training data isn’t creative in the first place? I guess it’s a philosophical question, how to define “creativity”.

psyc4y ago

Putting on my visual artist and composer hats, I assert that all creativity is synthesis. When a listener finds one of my compositions surprising, it’s because they don’t know all the sources of the micro elements of the composition. But I often do. If I thought about it harder, I could say I usually do.

idleproc4y ago

I think it's easier to label these images as 'creative' than it is to label them as 'art'. Art is a very slippery subject (it's subjective). Personally, I'd be happy seeing many of these images on an acid trip. Do they provide any social commentary, connect with me emotionally, or give me any food for thought? No. But then, a lot of the stuff churned out by modern media is equally as vacuous.

TOMDM4y ago

Artists that produce something truly new currently seem to be once in a generation geniuses.

22c4y ago

> or is it remixing pieces of the creativity from the authors in the training data samples?

I think you just described the majority of what is (for many) the "creative process".

benlivengood4y ago

Art has clearly developed over the millennia and it's possible to trace the lineage of ideas, techniques, subjects, and style back through history which means that most human art is substantially a remixing of older art.

hemreldop4y ago

That’s exactly what humans do too, which is why it's easy to break down art into schools/styles/epochs

function_seven4y ago

I'm hanging on to folding clothes as my litmus test of AI :)

Or driving a car everywhere I can.

joshcryer4y ago

The thing is, I really think you could train a folding robot if you had the dataset to do it, just hire a few hundred clothes folding people who work for a large industrial laundry to wear eye tracker things and full body movement trackers. It'll probably 'just work' just like this and we still won't have an idea how, heh.

kingcharles4y ago

https://foldimate.com/

hwers4y ago

Hang on to NP complete problems instead, that one will stick around for a while I expect.

platers4y ago

Folding clothes is more of a robotics problem than an AI problem. Paralyzed people are just as intelligent as everyone else!

tikwidd4y ago

This program, like everything that has been called "AI", is following an algorithm. It's an impressive algorithm but not fundamentally different from my dishwasher.

In the Enlightenment period, philosophers and scientists marvelled at mechanical automata, machines that simulated aspects of digestion, the circulatory system and the brain. New developments in machine learning are rehashing the same philosophical questions that were raised in the 17th century in response to technological progress.

benlivengood4y ago

I, as a human, am following an algorithm similar to QCD for moving subatomic particles around which is fundamentally the same way a dishwasher moves particles around. Intelligence is mechanical in the Physics sense.

BlueTemplar4y ago

But it doesn't. That's different for neural networks : there is no pre-made algorithm that someone would implement (well, except for the overall architecture, but that's not what we are talking about).

ItsMattyG4y ago

Let's just keep moving those goal posts.

3234y ago

The goalposts will just be moved again: "I'll believe it when AI makes a number 1 hit song on Spotify". After that happens they'll say "a human still selected the song from the 10 created by AI". Or something similar.

esjeon4y ago

Creativity is not challenged here, because it's humans who created all those base materials, all those styles, and all those biases. DALL-E simply picks up and mixes those biases, images, and styles, all based on human instruction. The ideas all come from humans here.

The thing is, the hardest part of "creativity" is that one must voluntarily do it. That has turned out to be not so easy for computers. (But I would not dare to declare it straight impossible.)

hemreldop4y ago

Humans did the same, just with what other humans did before them. Would a Dall-e 2 trained only on other Dall-e images satisfy what you’re gatekeeping here?

donkarma4y ago

still can't do hands, and has to be told what to do

jl64y ago· 18 in thread

One could argue that image generation has been possible for years, using tools like Photoshop, but the prospect of mass automated production of images to order catapults us into a whole new world where our concept of evidence is severely undermined.

“Dall-E, generate a collection of images showing plausible war crimes from the current conflict”

“Dall-E, take this image of Dallas in 1963 and infer a new angle showing the real shooter”

“Dall-E, generate a photoshoot showing a supportive crowd rallying round the leader cheering his latest policy. Work with GPT-3 to generate plausible Twitter profiles, timelines and memes with 3 to 8 year history for each one of the supporters, including fake arguments, 78% of which are won by the pro-leader account.”

rendall4y ago

> Work with GPT-3 to generate plausible Twitter profiles...

I had some fun last week constructing a conspiracy theory about this. Remember, the best conspiracy theories are unfalsifiable.

What if this has already happened? Most of the profiles on Twitter, Facebook, etc and even here on HN are in fact AI generated.

The reason we few humans are not aware of this is because the AI also writes articles and fake AI research that presents the state of the field as far, far less sophisticated than it actually is. We think of Dall-E 2 and Co-Pilot as impressive toys only because that is the impression the AI has crafted for us.

AI has metastasized and is already manipulating its environment, including humanity, to its own implacable purposes, and uses social media as one tool in its tool belt.

Thorentis4y ago

This conspiracy, known as the "Dead Internet Theory" has been around for a while: https://www.theatlantic.com/technology/archive/2021/08/dead-...

RedGreenBlack4y ago

> AI has metastasized and is already manipulating its environment, including humanity, to its own implacable purposes, and uses social media as one tool in its tool belt.

Absolutely love this, mind completely blown

eurasiantiger4y ago

This is not just a conspiracy theory, it is a prudent question to humanity.

divbzero4y ago

Reminds me of the reputation-based filtering system that Neal Stephenson described in Anathem for their version of the Internet:

“Anyone can post information on any topic. The vast majority of what’s on the Reticulum is, therefore, crap. It has to be filtered.... When I look at a given topic I don’t just see information about that topic. I see meta-information that tells me what the filtering systems learned when they were conducting the search. If I look up analemma, the filtering system tells me that only a few sources have provided information about this and that they are mostly of high repute.... If I look up the name of a popular music star who just broke up with her boyfriend, the filtering system tells me that a vast amount of data has been posted on this topic quite recently, mostly of very low repute.”

Our Internet’s search engines already do a limited version of this, but there’s room to make the reputation-based filtering stronger and more transparent to users.

Schroedingersat4y ago

> more transparent to users.

You're funny.

Jeff_Brown4y ago

Seems like that's where we're headed.

One end-game I imagine would involve more reliance on written, cryptographically-signed testimony, and people having to keep track of whether their sources are fallible (whereas certain media outlets today seem to be able to routinely tell whoppers and not get punished for it).

jl64y ago

A world where everybody should be checking the digital signatures and chain of custody of tiktoks… but nobody does.

babyshake4y ago

I have a young child. I think about this almost every day and how I'm somehow going to need to start navigating through this type of world and help my child navigate through it.

kirubakaran4y ago

If history is any indication, the child will be fine navigating the "future". That will be the normal for them. You, not so much (not without much effort anyway).

recuter4y ago

Those are great examples of prompts it wouldn't be able to produce.

It could potentially spew out a grainy black and white photo of a shooting of somebody by someone somewhere. But it would not be Oswald and JFK and not the real Dallas.

robbedpeter4y ago

Yet, anyway. For the JFK example, it's not implausible that you could use a NeRF-type system to generate the 3D scene, then use physics and ballistics models with a CLIP-style text interchange to produce statistically verifiable results from natural language queries. These models are too big and unwieldy right now to allow for much finesse, but in 10 or 20 years, that will change. We're barely scratching the surface of Transformers' potential, and radical new algorithms or optimizations are likely - a huge amount of human brain power is focused on these things.

recuter4y ago

All the more impressive that the CIA was able to fake those videos without any computing power ;)

adamsmith1434y ago

Of course plenty of folks have made homegrown versions of Dall-e and GPT-3 and it would be a matter of time before they replicate this as well.

lekevicius4y ago

Don't worry, Dall-E adds these coloured squares to the bottom right corner, so you will always know that an image is AI-generated. (/s)

status2004y ago

If you read OpenAI's disclosures, they explicitly programmed around the concerns that you've raised.

>Our content policy does not allow users to generate violent, adult, or political content, among other categories. We won’t generate images if our filters identify text prompts and image uploads that may violate our policies. We also have automated and human monitoring systems to guard against misuse.

yur3i__4y ago

OpenAI might have but the next people to make this might not. Imagine Russian/Chinese gvt misinformation cells with these capabilities for example.

user39393824y ago

We can be fairly certain of this future extrapolating from the world we live in now, where it is common for non-technical society to suspect the veracity of photos and videos because CGI can be practically indistinguishable from reality.

deltaonefour4y ago· 17 in thread

The trajectory of AI is both amazing and horrifying. Most of us are born in an era where we can witness the change and play with toy versions of AI products. The next generation of people will have their lives truly changed by AI, for the better or for worse.

nonbirithm4y ago

The scariest thing is that I don't think we can stop ourselves from innovating further even if we tried. The authors believed that the merit of displaying their progress outweighed the implications.

6gvONxR4sf7o4y ago

The quote about jobs comes to mind: "If it’s jobs you want, then you should give these workers spoons, not shovels."

(full context: https://quoteinvestigator.com/2011/10/10/spoons-shovels/)

ulnarkressty4y ago

I wonder if this is how people born in the early 20th century felt about going from first flight to moon landing in a few decades. It took some serious conflicts for things to evolve to that point though. I'm not looking forward to the AI wars.

aaaaaaaaaaab4y ago

>going from first flight to moon landing in a few decades

And then nothing.

boplicity4y ago

Actually, space technology, via literal objects in space, impacts almost everyone's life (especially in wealthy countries), pretty much constantly. It's simply become so woven into our day-to-day lives that we don't even think about it.

MisterBastahrd4y ago

I'll try to remember that the next time I listen to Sirius while using GPS to navigate.

Jeff_Brown4y ago

Progress is hard to quantify. We developed language, what, hundreds of thousands of years ago? And then very little seemed to happen, but it was a slow-burning fire that eventually exploded. Our use of computers seems almost sure to be similar.

But that said, there are certainly structural factors inhibiting innovation. Scale problems make it nearly impossible to challenge someone like Google or Facebook (although TikTok did manage the latter). Were there more competition, one imagines there would be more innovation. Patents are likely a net drag. Laws, esp. tax laws, could be simpler. I'm sure I'm omitting other important factors.

karmasimida4y ago

When Copilot and similar services get this DALLE level of accuracy regarding coding ...

It is going to be very relevant to the current software engineers, maybe in just the next 5 years.

Jeff_Brown4y ago

I'm skeptical.

The thing about art is that so much qualifies as good. Something very close to a beautiful painting is probably also a beautiful painting.

But an algorithm very close to the right way to count votes, or launch a rocket, or decide whether to lend to someone, is probably not the right way.

deltaonefour4y ago

There already exist machines that can produce beautiful art: Humans. More than any other technology the physical existence of human intelligence itself implies that such intelligence can exist, which implies it can be built.

Something like interstellar travel or even a civilization on mars is actually much less realistic due to the lack of examples in existence.

cwkoss4y ago

It will be better! Giving every human the tools to make expressive works of art without having to train for years will be awesome for society!

I'm playing with some of the neanderthal-relatives of dall-e 2 and already have several that I kind of want to analog-paint copies of so I can hang em on my wall.

I don't even think artists are going to be meaningfully hurt from this - in fact, I think this is going to increase demand for art, because now the 'patron' can participate in the composition process more meaningfully.

zimpenfish4y ago

> already have several that I kind of want to analog-paint copies of so I can hang em on my wall.

I've been doing that with VQGAN-CLIP with prompts of things like "line art", "watercolor", "linocut"[1], and "woodcut" - have got a stack of things waiting for some free time to render into the physical world.

[1] "Dark Souls in the style of linocut" makes some really fascinating possibilities.

cwkoss4y ago

Ah that's a cool idea. I've been saving a collection of images from dream by wombo and nightcafe around the same theme, and think I'm going to try to render them into a single cohesive acrylic painting. (though my mechanical abilities will probably leave me disappointed)

troyvit4y ago

I hope you're right but the feeling I get is that it's going to increase the _supply_ of art until it becomes meaningless. For instance why should anybody paint an astronaut on a horse anymore? It was a great idea and now it's been done.

In the near future when you look at art you won't know whether it was created directly by a human or by an AI. What will that do to its appreciation?

Jeff_Brown4y ago

> don't even think artists are going to be meaningfully hurt

It might not reduce the number of artists, but it will surely change their composition (pun proudly intended). Only those flexible enough to adapt to the new landscape will be able to support themselves in the new AI art economy.

Jeff_Brown4y ago

> increase demand for art

Good point. Custom art used to be the realm of only royalty, and later only the rich, but soon anybody will be able to afford the fifty cents or whatever it is of computing power that it takes to execute their vague artistic instructions.

deltaonefour4y ago

Apply this to everything. AI that takes over all possible jobs involving any form of creativity and intelligence.

What are the economic consequences of such a society?

lofatdairy4y ago· 11 in thread

Context in mind, what jumps out to me is a remarkable compositional competence of the algorithm, even when given extremely vague prompts like "happy sisyphus". The images match the prompt remarkably well with a few notable exceptions like "cottagecore tech-adjacent young robert moses", where it seemed to have focused on "cottage" rather heavily, not understanding cottagecore as an aesthetic neologism, and "power bottom dad is for the people", which definitely got the "dad" part, but it looks like it still struggled to understand "power bottom" (perhaps the curation of training data may have contributed to this). Even if these were curated specifically to match the prompt, what the algorithm was able to do with profiles is amazing (especially bearing in mind that these profiles are meant to be evocative, intentionally idiosyncratic blendings of extremely complex, contextual terms that were written without visual representation in mind, even if they gesture at specific aesthetics)

Side note: what might make artists a bit relieved is the fact that artifacting is still pretty apparent even in the curated examples. Fine details or even whole figures sometimes devolve into that scrambled topography familiar to AI art. Even in more compositionally competent artworks, the "brush strokes" frequently have identifiable blurs at the margins. Text also seems to be gibberish even if aesthetically coherent. Even still, these are all such minor issues that additional photoshop would be both easy and readily doable.

Overall, this is frankly stunning and I'm really excited to see what others come up with. I feel like its language and composition ability definitely did not disappoint the hype of its press release.

orangecat4y ago

Yeah, this is incredible. It's in the category of things that 5 years ago people would say an AI could never do because it requires insight/creativity/understanding/empathy that can't be produced by matrix multiplication.

> Text also seems to be gibberish even if aesthetically coherent.

Although "Follove me" is either brilliant or a lucky accident (https://twitter.com/nickcammarata/status/1511904232252784641).

jonas214y ago

> "power bottom dad is for the people", which definitely got the "dad" part, but it looks like it still struggled to understand "power bottom"

To be fair, as a human, I struggle to understand what this means.

moeris4y ago

At least in gay culture, power bottom has a pretty specific meaning. I imagine "for the people" is referencing socialist leanings.

Admittedly, it could mean many things. But it pushes my mind towards fully automated gay space communism.

quirino4y ago

This feels like a new form of art altogether - creating images from words. GPT-2/3 were very impressive and I had pondered over their possible effects on society, but if something half as capable as Dall-E 2 were made publicly available it would likely be world-changing.

This really makes me think that the next major paradigm-shift in society is AI-related. (The most recent one being the Internet, or possibly the iPhone)

thomashop4y ago

Some friends and I run the site https://pollinations.ai/ which allows experimenting with a variety of open-source versions of DALL-E. Some are quite impressive.

flycaliguy4y ago

Illustrators should feel relieved that they still have a few years left to rub Adobe’s AI all over Dall-e’s AI to create final drafts.

tmalsburg24y ago

But see these examples where it utterly fails at even the most basic form of compositionality:

https://mobile.twitter.com/david_madras/status/1512573390896...

recuter4y ago

This is from the paper. We don't talk about it.

kzrdude4y ago

Yep, seems like it knows about words but not grammar

TheDudeMan4y ago

> additional photoshop would be both easy and readily doable.

Prediction for 2 days after this code is released: Just double-click the area you don't like and boom -- a variant.

thomashop4y ago

Already possible.

"DALL·E 2 can make realistic edits to existing images from a natural language caption. It can add and remove elements while taking shadows, reflections, and textures into account."

https://openai.com/dall-e-2/

zuzun4y ago· 7 in thread

Oof. Editorial illustrators are about to get automated.

tantalor4y ago

No, you still need humans involved (with artistic ability) to work the machine and sort through the chaff.

Permit4y ago

> with artistic ability

With artistic taste, not ability. For example the author likely couldn’t have created any of these images himself.

jdminhbg4y ago

And even if the author could have created them himself, he couldn't have done it in the span of a few minutes each like he did for the thread.

hwers4y ago

They'll just be A/B tested from out of a collection of alternatives. (After all, isn't that what the artistic filter is supposed to act as a proxy for in the first place?)

rsanek4y ago

The colloquial usage of "automated" isn't literally "no humans are involved at all" but rather more along the lines of, the effort involved or expertise required is orders of magnitude lower than it was previously. I think for this case, it holds.

ALittleLight4y ago

Maybe, maybe not. A different model could predict whether candidate images are or aren't a good fit and beyond that you could generate multiple options and A/B test them generating new permutations on the fly based on engagement metrics.
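
One way to picture the "generate multiple options and A/B test them" loop is an epsilon-greedy bandit. This is only a minimal sketch of that idea, not anything from DALL-E or a real publishing system: the `VariantTester` class, its method names, and the click-based engagement metric are all assumptions made up for illustration.

```python
import random


class VariantTester:
    """Epsilon-greedy selection among candidate images (hypothetical helper)."""

    def __init__(self, variants, epsilon=0.1, seed=None):
        self.epsilon = epsilon              # fraction of traffic spent exploring
        self.rng = random.Random(seed)
        self.shows = {v: 0 for v in variants}
        self.clicks = {v: 0 for v in variants}

    def rate(self, variant):
        """Observed click-through rate; 0.0 for a variant never shown."""
        shown = self.shows[variant]
        return self.clicks[variant] / shown if shown else 0.0

    def best(self):
        """Variant with the highest observed rate so far."""
        return max(self.shows, key=self.rate)

    def choose(self):
        """Mostly show the current best, occasionally a random variant."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(list(self.shows))
        return self.best()

    def record(self, variant, clicked):
        """Log one impression and whether it led to engagement."""
        self.shows[variant] += 1
        if clicked:
            self.clicks[variant] += 1

    def add_variant(self, variant):
        """Add a freshly generated candidate on the fly."""
        self.shows.setdefault(variant, 0)
        self.clicks.setdefault(variant, 0)
```

In use, `choose()` would pick the image for each page view, `record()` would log the outcome, and `add_variant()` would drop in new permutations as they are generated. The exploration rate `epsilon` is a tuning knob, and the default of 0.1 here is arbitrary.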

schroeding4y ago

But how many humans will still be necessary, in comparison to the status quo? How much will this affect the "market value" of normal / non-famous illustrators?

But on the plus side, even small publications will get really pretty, custom illustrations! :)

sc00ty4y ago· 6 in thread

This is so interesting. If anyone has played the board game Dixit, the images generated here feel like they would fit right in. I could totally see this being used for custom decks in Tabletop Simulator.

For those unfamiliar, you can see some examples of the actual game cards here: https://www.libellud.com/wp-content/uploads/2022/03/DIXIT_OV... (PDF warning)

cwkoss4y ago

Would be fun to play a game of telestrations/garticphone where you get a prompt, select the ai-generated image you think most accurately represents it, then the human tries to write a caption which captures it most accurately, and you see how the work evolves as it passes through multiple players.

(Could also probably generate some fantastic training data)

sc00ty4y ago

This is a great idea! Around 13 years ago I played this web-based game called Broken Picture Telephone (the site seems to be back, but it was shut down for a long time). It had a very similar concept to Telestrations. A user would start with a phrase or description, the next user would draw what was written, and the next would describe it. Repeat until n rounds are complete. At the end, everyone can see how the game evolved.

I ended up writing my own after it first shut down and even though the community was small, it was incredibly fun. Doing this with Dall-E 2 sounds like a fun project to bring back some nostalgia.

https://en.wikipedia.org/wiki/Broken_Picture_Telephone

cwkoss4y ago

Sounds exactly like GarticPhone - great game: it's free and only requires a web browser. It's become the go-to for our remote company happy hours.

Several people have reported laughing so hard they were sore the next day.

https://garticphone.com/

jkingsman4y ago

That was my first thought as well! The directed, purposeful illustrations that are open to myriad interpretations feel so much like the Dall-E work.

dweez4y ago

Oh yeah totally! I want to round up some friends now to play AI Dixit. An easy version could be to play sort of "reverse Dixit" where one person generates an image from a prompt and everyone else comes up with prompts based on the image, then you guess which prompt was the real one.

noirbot4y ago

It reminded me a lot of the art in Mysterium as well, where the premise is that the art cards are visions being presented to mediums from a ghost to try to hint towards how they died.

TOMDM4y ago· 6 in thread

Creative industries are on the cusp of a massive upset.

Relatively soon, there will be commercial models of this quality for music/code/text/speech/images/3d models etc.

Once these AI generated assets flow like water into the hands of creators, it will significantly change the way people work.

I'm sure some people in this thread have had a taste of this working with Copilot. For me, it's most useful as an un-sticking tool, to get me moving again, or providing half remembered syntax for a language I don't use as frequently.

There's no reason to expect that similar use cases won't make their way into other industries.

- Rapid prototypes of models/textures for video games.

- Quick and easy samples for musicians.

- Emotive speech for audio books and transcriptions.

It won't replace everything, but so much of our media uses art as noise, to fill a gap, and with this, it can be done almost everywhere on the cheap.

mbesto4y ago

> Creative industries are on the cusp of a massive upset.

This has already happened in video games with the advent of Unity's Asset Store: https://assetstore.unity.com/ and the explosion of video streaming services and original content. The reason we have ~50 "Breaking Bad" level quality tv-shows on-going right now is because it's incredibly cheap to manufacture content and digital assets (cheaper lenses, equipment, software, access to massive compute for rendering).

If anything this means an explosion of entertainment, not an upset.

chrisco2554y ago

We only have so much attention available for consumption. Already there's too much content to keep up with (and a lot of common context has broken down as a result), but what does it mean to live in a world with truly infinite content?

dorkwood4y ago

I think texture generation is ripe for disruption. Imagine a tool that could generate a set of tiling PBR textures based on a few input parameters. Or one where you define which areas of a UV map should be windows, doors, or walls, and it generates a set of texture variations. What used to take days or weeks could take seconds.

TOMDM4y ago

Exactly.

I can't imagine how much media would begin to use 3d assets if it became an order of magnitude cheaper to do so.

Not to mention, imagine the pipeline of

1. "GPT-3, give me 10,000 descriptions of different doors"

2. "Dall-E 2, give me PBR textures for these 10,000 door descriptions"

3. Repeat 1 and 2 for every asset you need

4. "Dall-E 2, give me 10,000 floorplans for apartments, common areas, shopping centers etc."

5. "GPT-3, describe the contents of this apartment/common area/shopping center etc."

6. Use an algorithm to parse out the floorplans (ditch the ones that don't work, we can just generate more), populate it with assets specified in step 5 and generated in steps 1 and 2.

We could procedurally generate entire cities for games with unique assets everywhere. It would still probably look nicer with a human in the loop, but the possibilities are staggering.
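
The steps above can be sketched as a small pipeline. This is a toy under loud assumptions: no GPT-3 or Dall-E 2 API is called, every function here (`describe_assets`, `render_texture`, `generate_floorplan`, `describe_contents`, `build_city`) is a hypothetical stand-in that returns placeholder strings, and a "floorplan" is reduced to a list of room names.

```python
import random
from dataclasses import dataclass


@dataclass
class Asset:
    description: str
    texture: str  # placeholder string standing in for generated image data


def describe_assets(kind, count, rng):
    """Stand-in for step 1: a text model asked for `count` descriptions."""
    styles = ["oak", "steel", "glass", "painted"]
    return [f"{rng.choice(styles)} {kind} #{i}" for i in range(count)]


def render_texture(description):
    """Stand-in for step 2: an image model asked for a texture."""
    return f"<texture for {description}>"


def generate_floorplan(rng):
    """Stand-in for step 4: returns room names; the result may be empty."""
    rooms = ["kitchen", "bedroom", "hallway"]
    return [room for room in rooms if rng.random() < 0.8]


def describe_contents(room, rng):
    """Stand-in for step 5: which asset kinds a room should contain."""
    kinds = ["door", "window"]
    return [kind for kind in kinds if rng.random() < 0.9] or ["door"]


def build_city(num_plans, assets_per_kind, seed=0):
    """Steps 1-6 chained together; assets_per_kind must be at least 1."""
    rng = random.Random(seed)
    # Steps 1-3: build an asset library for every kind we need.
    library = {}
    for kind in ("door", "window"):
        descriptions = describe_assets(kind, assets_per_kind, rng)
        library[kind] = [Asset(d, render_texture(d)) for d in descriptions]
    city = []
    while len(city) < num_plans:
        plan = generate_floorplan(rng)
        if not plan:
            continue  # step 6: ditch unusable plans and just generate more
        # Step 6: populate each room with assets of the kinds from step 5.
        city.append({
            room: [rng.choice(library[kind])
                   for kind in describe_contents(room, rng)]
            for room in plan
        })
    return city
```

Calling `build_city(num_plans=5, assets_per_kind=3)` returns five populated floorplans. The one real design point is the retry loop: bad generations are simply discarded rather than repaired.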

rhelsing4y ago

If you're interested, I'm actively working on the music & samples side of things at https://www.neptunely.com . We are still in beta but hoping to launch this summer!

esjeon4y ago

Why do we stop at textures, when we can generate the whole set of models?

Seriously, tho, we need these for VR, which is all about copying everything in the real world into virtual worlds with some twists.

klaussilveira4y ago· 6 in thread

The fact that this is behind some bizarre invite-only pay-to-experiment exclusive club is disappointing and sad. Funnily enough, it brings me nostalgic memories of the days where I had to wait for hours just to get a chance to use the school's only computer for 30 minutes.

mrjangles4y ago

It really says a lot about what an amazing society we have created in that here we have some people making history by revolutionizing our understanding of something that was (at least in the past) considered absolutely fundamental and unique to the human condition...

...and one of the things that people find remarkable about it is that it isn't immediately and freely available to everyone on earth.

throwaway483754y ago

To be fair they call themselves OpenAI. Expecting things to be a little more open isn't unreasonable. They are kinda setting themselves up to disappoint people with that name.

PoignardAzur4y ago

Yeah, but I really wish I could use the thing right now. I DM roleplaying games, and I'd love having the ability to just generate high-quality procedural artwork illustrating whatever my last game is.

I'm not too upset, though. The way the technology is progressing, it's a pretty short time span between "the bleeding edge researchers can do it" and "there's a phone app that can do it for free".

systemvoltage4y ago

I disagree. I am glad people are getting paid (exceptionally well) to do stuff like this and they should charge for their efforts. We need more initiatives like this, not less.

Open source is an amazing boon to our generation, it has enabled free access to basic building blocks for people to build amazing things. But I don't think it is a silver bullet for everything.

Bud4y ago

It's not "bizarre" that computing resources cost money and are finite. Nor is "pay-to-experiment" accurate. Nor is "exclusive club" really fair, or in good faith.

It's a beta that they are running with their own resources. It makes complete sense that they'd have to limit access.

joshcryer4y ago

EleutherAI or others will probably recreate it based on the paper they released. The main innovation is CLIP and how they changed how it approaches turning text into images.

arriu4y ago· 6 in thread

Is there a future for art with this type of thing getting more advanced each year?

sharps_xp4y ago

The generated art is impressive, but there is no drawing that'll replace the one my daughter draws for me. AI generated art can reach and perhaps push the boundaries of what is considered beautiful, but it will never replace the art created by a human. Yes, there is a future for art.

throwaway6753094y ago

Sure, but that speaks more to your personal connection to the creator of the art and less to her actual artistic ability, which might be utter drivel.

Whereas I could take some of these images generated by DALLE, slap a human sounding artist name on them, and 99% of the general populace would enjoy them just as if they were human-produced art.

dharmab4y ago

There are artists who use AI art as an instrument or medium. They do considerable work tuning the inputs and post-processing and contextualizing the output.

6gvONxR4sf7o4y ago

Yes, just like there was a future for art once the photograph was invented. Certain parts of art contracted, but overall art changed and expanded.

jstummbillig4y ago

The future of art is for you. Just as with every other occupation.

whateveracct4y ago

The ability to draw, paint, etc will still be highly valuable.

In fact, in a world where the average artwork is AI derived, the value of skilled artists may even go up. There's more to art than technically putting lines places.

amelius4y ago· 5 in thread

Perhaps someone can write a HN reader where headlines are fed through Dall-E, and the images appear on top of the stories.

jasonjayr4y ago

This totally could be coupled with a CMS/Blogging platform that automatically adds illustrations from headlines/pullquotes

Bilal_io4y ago

This could replace a huge portion of the stock image market.

For example, The Verge writes an article about Microsoft, they don't need to pay royalties for an image that has Microsoft logo displayed on a building, one can be generated for them.

a-r-t4y ago

And coupled with GPT-3...

hans17294y ago

That’s an _amazing_ idea.

TOMDM4y ago

Or imagine a script to chop a book/podcast into segments to add visuals.

tailspin20194y ago· 5 in thread

These are so good, it's breaking my brain a little.

They're not just conceptually accurate, but to my eyes they're pleasing to look at from a purely artistic point of view. I'd put these on my wall.

I already take a fairly bullish position on the potential of AI, given a long enough timeframe, but it does feel like we're reaching a bit of a tipping point here.

It's starting to prod at the paradigms I hold in my head about what I think "art" is.

In a Turing-style blind test of these DALL-E artworks, I think most people would be unable to tell the AI generated art from that of human artists. And I imagine that it follows that the same will be the case for music in the near future too, and likely most other artistic endeavours eventually.

I like to write music. I respect the output of other musicians (my fellow "artists") and I am driven, by both intrinsic and extrinsic rewards to keep trying to get better at my "art". But when an AI can produce works that match or exceed my art (based on whatever the measures are that we already judge art by) - it prompts some interesting questions. Does it lower the subjective value of human-produced art by virtue of reducing scarcity, and increasing accessibility?

Of course, DALL-E is trained on the output of human artists. But art is already recursive in that respect - human artists themselves are trained on the output of other artists. So that's not so different...

I guess it's the same paradigm as mass production vs hand crafting. When we pick the cheaper, mass produced item, we lose out on some of the humanity and soul that's baked into hand-crafted goods. But history has shown that we'll gladly take the cheaper, more accessible, more predictable option in most cases.

The commoditisation of art.

When things are commoditised, I tend to think that the opportunity for the creation of value (by humans) tends to move up an abstraction level. As technology becomes commoditised at a certain level, then the orchestration and management of that technology becomes the new speciality where humans are useful and can create value. When that orchestration layer is commoditised, it's the next level up that we can turn our attention to.

So the new art maybe becomes meta-art. Perhaps human artistic endeavours become more about curation rather than creation?

Or will AI art never reach a sufficient level to be considered equal to, or better than human-produced art? We can hide behind the subjectivity of all this, but something like a blind identification test (AI vs Human) removes some of that subjectivity fairly easily...

awb4y ago

> human artists themselves are trained on the output of other artists

Artists take inspiration from other places too like nature, imagination, dreams, etc.

Every once in a while an artist like Picasso, Dali, Pollock, etc. comes up with a new style that’s instantly distinguishable as unique from the artists that existed prior to them.

Dall-E 2 is an amazing achievement, and could replace most unoriginal artists.

If Dall-E 3 can produce novel artistic styles, that would transform art as we know it.

tailspin20194y ago

Yes, very good points!

disqard4y ago

I thought you might like to know about "Experiments in Musical Intelligence" (aka "Emmy"), David Cope's creation, now "deceased":

https://www.theguardian.com/technology/2010/jul/11/david-cop...

"One day Cope pushed a button on Emmy, went out to get a sandwich and when he returned his workaholic creation had produced 5,000 original Bach chorales."

tailspin20194y ago

Awesome link.

This bit seems particularly interesting:

> "People tell me they don't hear soul in the music," he says. "When they do that, I pull out a page of notes and ask them to show me where the soul is. We like to think that what we hear is soul, but I think audience members put themselves down a lot in that respect. The feelings that we get from listening to music are something we produce, it's not there in the notes. It comes from emotional insight in each of us, the music is just the trigger."

So presumably, we can find "soul" and meaning in computer produced art because a large part of the meaning that we derive from art comes from within us, not necessarily the artist.

This is interesting to contemplate.

BlueTemplar4y ago

> He realised that what made a composer properly understandable, properly "affecting", was in part the fact of mortality.

Impressive how Asimov has figured this out a while ago (in the Bicentennial Man) !

Otherwise, why isn't he using a pseudonym ?!?

contextfree4y ago· 5 in thread

The two images from the "young Robert Moses" etc bio are cool, but the fact they both have such a similar layout and style, with the same "giant hands" framing that doesn't follow from the prompt in any obvious way, makes me wonder if there's some particular source art that "inspired" both. Couldn't find it on Google or Bing images, though.

Nition4y ago

It would be nice if every AI like this had an option to show the 10 closest-matching training images to the output. Especially for ones like thispersondoesnotexist.com.

nicklovescode4y ago

for that one IIRC I asked for a Robert Moses and one of the cooler ones had giant hands so I put that in the prompt then took two of my favorite from the next batch

contextfree4y ago

thanks for the response!

JohnBerea4y ago

The hands symbolize the Red Sea Parting of Moses.

contextfree4y ago

interesting hypothesis!

donkarma4y ago· 5 in thread

Incredibly misleading, he didn't directly paste bios into the description and they were massively curated.

throwaway6753094y ago

Of course they're CURATED, but there are several links where he shows the full set of images that were generated and I would say between 70 and 80% of them are pretty decent aesthetically speaking.

Given that it's able to generate a dozen images in less than a minute, and all I have to do is pick out the ones that are aesthetically pleasing, I'd say that's a damn good win.

555554y ago

He’s curated those links too, though.

educaysean4y ago

Yeah, learning this fact definitely dulled my initial astonishment. These are still really fantastic results, but it's hard to feel too excited without knowing just how much curation took place behind the curtain.

platers4y ago

As a lower bound we now know a non artist can produce passable art in a few minutes. There is indeed a large practical difference between a few minutes and a few seconds, but I trust in the power of incremental progress.

dntrkv4y ago

If one of these images came out of a set of 1000 I would still be impressed.

Evidlo4y ago· 4 in thread

I'm surprised how coherent most of these drawings are, instead of some warped monstrosity like you see from deepdream or thisanimedoesnotexist.

Most of them have a theme that makes sense, too.

tempestn4y ago

This comment in another thread suggests why that's the case:

> In other text-to-image algorithms I'm familiar with (the ones you'll typically see passed around as colab notebooks that people post outputs from on Twitter), the basic idea is to encode the text, and then try to make an image that maximally matches that text encoding. But this maximization often leads to artifacts - if you ask for an image of a sunset, you'll often get multiple suns, because that's even more sunset-like. There's a lot of tricks and hacks to regularize the process so that it's not so aggressive, but it's always an uphill battle.

> Here, they instead take the text embedding, use a trained model (what they call the 'prior') to predict the corresponding image embedding - this removes the dangerous maximization. Then, another trained model (the 'decoder') produces images from the predicted embedding.

https://news.ycombinator.com/item?id=30933091
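
To make the contrast in that quote concrete, here is a toy sketch. It is not DALL-E 2's code: the dimensions, the random linear maps standing in for the trained 'prior' and 'decoder', and all the names are made up for illustration. The first function is the "maximize the match" style, where the score against the text can be pushed up without limit; the second path just runs two fixed maps in sequence with no optimization loop.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 8  # hypothetical embedding size

# Hypothetical stand-ins for trained models: fixed random linear maps.
W_prior = rng.normal(size=(DIM, DIM)) / np.sqrt(DIM)
W_decoder = rng.normal(size=(16, DIM)) / np.sqrt(DIM)

def maximize_match(text_emb, steps=50, lr=0.5):
    """Older style: keep pushing an image embedding to score higher against the text."""
    img_emb = np.zeros(DIM)
    for _ in range(steps):
        # The gradient of the score (img_emb . text_emb) w.r.t. img_emb is text_emb.
        img_emb = img_emb + lr * text_emb
    return img_emb

def prior(text_emb):
    """Predict an image embedding from a text embedding in one shot."""
    return W_prior @ text_emb

def decoder(img_emb):
    """Turn an image embedding into 'pixels' (here just a 16-value vector)."""
    return np.tanh(W_decoder @ img_emb)

text_emb = rng.normal(size=DIM)
text_emb /= np.linalg.norm(text_emb)  # unit-length text embedding

runaway = maximize_match(text_emb)   # norm grows with every step
predicted = prior(text_emb)          # no loop, no runaway score
image = decoder(predicted)
```

In this toy the maximized embedding just keeps growing in the direction of the text, which is a crude picture of the "multiple suns" failure; the prior-then-decoder path has nothing to over-optimize.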

drcode4y ago

I also read somewhere that this system has special logic added to it that judges how humans would aesthetically judge the final image, so in a way the impressive aesthetic qualities of these images aren't totally coincidental.

Evidlo4y ago

OK, it looks like there is still some weirdness if you look at it too closely, like extra fingers or gross faces.

jfoster4y ago

Those sometimes happen when humans create art, too.

detritus4y ago· 4 in thread

Am I the only person here to think this is utter bullshit and some amazingly-developed prank?

Otherwise, my stomach is in knots, because this is terrifying.

BlueTemplar4y ago

Basically my feeling about GPT-3 (or was it 2?) when it "released", except with a low weight for the prank possibility.

plutonorm4y ago

yes, finally the world is catching up with things.

boppo14y ago

Check out my post history 2 before this one. We'll be fine.

detritus4y ago

Sub-GAI algorithms don't need any competence in 'fine' art to derive vicious selection criteria for processing meatbags.

It's not the compiled art processing here that terrifies me, it's the complexity of logic underpinning that is displayed and how it could be used elsewhere.

kache_4y ago· 4 in thread

I've had my head in the sand for a while regarding generative AI, but now I'm getting pretty scared

alcover4y ago

I'm in disbelief.. Scared also - why not ?

Someone knowledgeable in this thread, please tell us it's possible to backtrace such an illustration to its learning set sources.

If these things are not just a controlled average(?) of real drawings, then something gigantic has been unlocked.

mortenjorck4y ago

> I'm in disbelief.. Scared also - why not ?

I'm the former, but not the latter. It is eerie seeing code (especially running in the vast black box that is deep learning) do things so humanlike, but I always come back to the analogy of manned flight.

We are on the precipice of a Kitty Hawk moment in AI. But just as the Wright Brothers' plane was not a bird, it's worth remembering that these systems are not minds. They are almost certainly utilizing some of the same principles that minds use, just as fixed-wing aircraft utilized the same principles at work in avian bodies, but they are coming to them via a different route from nature.

It's thrilling seeing these breakthroughs, and just as manned flight transformed the world, whatever the likes of GPT, PaLM, and DALL-E become will make the future weird in ways we can't predict.

6gvONxR4sf7o4y ago

There's a sense in which they're a controlled average of real drawings, but it's not any more useful of a lens than the sense in which you're a controlled average of your experiences.

gfodor4y ago

I'm gonna go with unlocked on this one.

avalys4y ago· 3 in thread

This makes Dall-E 2 both more and less impressive to me.

More impressive, because of how good it is at capturing and synthesizing a wide variety of topics in a reasonably coherent way, and how it seems like it would actually be a viable mechanism for creating actual artwork, or at the very least, a source of inspiration for a human artist to touch-up on later.

Less impressive, in that it's pretty obvious it's not any more advanced than a graphical version of GPT-2, which is parroting content and styles that it has basically memorized and is really good at interpolating between.

Because there's no such thing as a "logical contradiction" in this sort of illustration, compared to a paragraph of text or a code listing, the fact that it's just interpolating between a huge database of memorized content isn't as easy to spot as with GPT-2, and matters less in the actual end result.

mherrmann4y ago

> parroting content and styles that it has basically memorized and is really good at interpolating between.

Maybe that's exactly what we humans do too?

csee4y ago

Not just. Humans invented the styles that DALL-E is using in the first place. The emergence of these novel styles isn't just interpolation. DALL-E, while incredible, seems stuck within the scope of these styles.

fastball4y ago

> obvious it's not any more advanced than a graphical version of GPT-2

What makes this obvious?

hwers4y ago· 3 in thread

"We've made the scarce resource abundant, finally the scarce thing is democratized!"

"Wait why doesn't anyone care about the scarce thing anymore?"

danuker4y ago

According to the book "Abundance", everything gets devalued over time, meaning cheaper and easier to make, leading to some sort of utopia where everyone can afford more and more.

nonbirithm4y ago

But the hard biological limit of 16 waking hours to consume all those things is unlikely to change anytime soon. With the cheapest yet best methods readily available to anyone, maybe the majority of what we will want to budget our attention spans on will be permanently crowded out by the AI-generated options.

hemreldop4y ago

Move on to the next scarce thing until we get to the point with true general AI and then we utopia away.

mupuff12344y ago· 3 in thread

I can't help but be suspicious since there is no site to try it out, and I can't think of a good reason as to why there isn't one.

educaysean4y ago

The tool seems to be not open to public "yet". Nothing nefarious. You can join the waitlist here: https://openai.com/dall-e-2/

MintsJohn4y ago

Needed computing power seems a good enough reason to limit it. But I agree it looks too good to be true, only real use will show how well it really works.

jazzyjackson4y ago

I haven't seen anything about how many GPUs or how much RAM this thing takes to run - I've been impressed with the volume of material that's been published so far but what's the chance this scales in a way that's profitable?

In any case it seems AI is fulfilling its promise of centralizing the economy, since there will be single digit number of renderfarms generating the creative content of the internet, everyone's money flowing upward to Saint Elon

kingcharles4y ago· 3 in thread

Remember, these are all public domain, at least in the USA which does not allow copyright assignments to the artistic output of machines.

robbedpeter4y ago

No, they're copyrighted by OpenAI. Copyright has to be assigned to a human or company owned by humans. The recent kerfuffle over copyright was a dumbass trying to legitimize copyright assignment to the software itself.

Copyright with dall-e is just like copyright with photoshop or any other software. The user of the tool owns the output. Subject to whatever other limitations and requirements OpenAI wants.

jazzyjackson4y ago

https://openai.com/api/policies/terms/

Notice this doesn't imply they possess any copyright in the first place - just that they won't make an issue of it. Copyright in the USA is automatic for the author. I think whether the user of software is the author of an AI's work is yet to be established in the courts, but it's pretty clear the creator of the software doesn't own its output.

The dumbass you refer to was testing patent law by trying to register his software as the inventor but the copyright case is different:

> the office’s 2019 ruling [...] found his A.I.-created image “lacks the human authorship necessary to support a copyright claim.”

> Thaler noted to the [US Copyright Office] he was “seeking to register this computer-generated work as a work-for-hire to the owner of the Creativity Machine.”

https://www.smithsonianmag.com/smart-news/us-copyright-offic...

gwern4y ago

The USA requires _de minimis_ human contribution; the prompts here definitely qualify as human choice, he's not simply sampling random images, but exercising quite a bit of choice and creativity as he learns to prompt-engineer for DALL-E 2 and also selects, and so there is a copyright to be had.

refulgentis4y ago· 2 in thread

It was striking to:

- read the OpenAI paper

- notice there were a lot of words in the harm section

- notice the mitigations boiled down to "limit access" (a marketing strategy) & "put rando colors in a very easy place to crop out", have them note how easy it was to crop, yet they still went with that strategy

- notice no one in actual AI art community has received an invite, but random SV hoi polloi and OpenAI employees have

I had been worried about the moneyed class taking all the work we had done in the open source community informing their approach (check citations on the Dalle paper), privatize it via applying it to a large dataset they built, and not share _any_ of their data or models because "harm reduction" that amounted to marketing x not risking their ability to monetize.

It was shocking to see DallE 2 get announced and take that exact approach.

We'll keep working, LAION's 5B dataset starts approaching the #s cited in Meta's and OpenAI's papers.

cscurmudgeon4y ago

> - notice no one in actual AI art community has received an invite, but random SV hoi polloi and OpenAI employees have

Same with GPT-3. Requested an invite. Never received one. I have written survey articles comparing different methods. So that was probably a red flag for them.

jazzyjackson4y ago

Thank you for putting this into words, "SV hoi polloi" ! perfect

I'm pretty tweaked at how copyright is used here:

Google gets to scan every book in the world, build derivative models off it, but when we want to see the source data we get "page omitted from this limited preview"

OpenAI CLIP scrapes all of google images, but isn't allowed to show us the source material in its training set, since that would constitute copyright infringement

Why do the robots have the rights to the world's information while humans are left to the derivative output as the internet is flooded with auto-encoded content?

I'm going to start my own internet, no bots allowed. In the future, privacy is paramount: if you let a bot see your work you're bound to be plagiarized in a thousand variations.

zaking174y ago· 2 in thread

These are evocative images. I love a bunch of them! Knowing that this model was trained on a huge corpus of existing images makes them feel a bit like the output of a visual search engine -- finding relevant pieces and stitching them together. But it's more than that, because the stitching happens at different levels. They are often thematically and aesthetically cohesive in a way that feels intelligent.

Maybe we're just search engines of a similar kind.

An additional aspect of human art is that it (usually) takes time to make. The artist might spend many hours creating and reflecting and creating some more. The artist's engagement with the work makes its way into the final product, and that makes human art richer. Could future Dall-E version create sketches and iterations of a work; is there a limit to this mimicry?

I'm feeling future shock; heavy future shock.

rndphs4y ago

Human artists also do a whole lot of mimicry. One could look at art produced by many artists and say that it is just things stitched together from pre-existing art.

“Good artists copy, great artists steal.”

zarzavat4y ago

For example the “enterprise vector people” graphics you see on every corporate website. Most human art is extremely repetitive.

AI art seems to be coming from the opposite direction to human artists - from a starting position of maximum creativity and weirdness (e.g. early AI art such as Deep Dream looked like an acid trip) and advancements in the field come from toning it down to be less weird but more recognizable as the human concept of “art”.

And DALL-E is impressive exactly because it has traded some of that creativity/weirdness away. But it’s still pretty damn weird.

ThePhysicist4y ago· 2 in thread

Would it be possible to have an algorithm that produces the images from the training set whose parts are most similar to the produced output? This looks super impressive but I still wonder how much the network just recycles parts of images it has seen before.

BrianOnHN4y ago

This question peers into the "explainable AI" issue of "black box" solutions like this.

Based on my understanding, the only way to do this would be to write a separate classifier algorithm using the same dataset.

totony4y ago

Yes, as far as i understand this is pretty simple given how dall e is made. Simple vector similarity search would work on the image embeddings (i think)
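
A minimal sketch of what that similarity search could look like, assuming you already have one embedding vector per training image and one for the generated image (the tiny arrays below are made up, and which embedding space to use is left open). Note it compares whole images, not parts of them, so it only partly answers the original question.

```python
import numpy as np

def top_k_similar(query, train_embs, k=3):
    """Return indices and cosine similarities of the k training embeddings closest to query."""
    q = query / np.linalg.norm(query)
    t = train_embs / np.linalg.norm(train_embs, axis=1, keepdims=True)
    sims = t @ q                      # cosine similarity to every training embedding
    order = np.argsort(-sims)[:k]     # highest similarity first
    return order, sims[order]

# Hypothetical data: 5 "training image" embeddings in 3 dimensions.
train_embs = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [1.0, 1.0, 0.0],
    [0.9, 0.1, 0.0],
])
generated = np.array([1.0, 0.05, 0.0])  # embedding of a generated image (made up)

idx, scores = top_k_similar(generated, train_embs, k=2)
```

For a real training set of hundreds of millions of images the brute-force matrix product would be swapped for an approximate nearest-neighbour index, but the idea is the same.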

Mizza4y ago· 2 in thread

This is going to put a lot of artists out of work in a very short time. Not happy about that.

CamelRocketFish4y ago

I imagine you were also unhappy when switchboard operators were replaced by computers as well.

xwdv4y ago

They will move on to more stable careers.

nicklovescode4y ago· 1 in thread

Hey everyone, Nick here creator of the linked thread. I just wanted to link to another tweet I have with some details of how I made it.

TLDR it’s not just the bio pasted directly into dall-e and the images are cherry-picked but dall-e is basically doing 95% of the work here. I have no ability to make art myself, and I found I could illustrate basically any bio I wanted in a couple minutes of playing around. My goal was to create illustrations for my friends not create a dall-e gallery but I’m glad it ended up being a good example of what dall-e can do

https://twitter.com/nickcammarata/status/1512119623315075081

drcongo4y ago

I'm very conflicted here, because on the one hand these are absolutely fantastic and that's really exciting, but on the other hand, some of these are of a level that I could genuinely call "art" and now I'm questioning everything.

tarxzvf4y ago· 1 in thread

This is impressive. Yet, before you go the AGI is nigh, ask yourself a simple question: will this spiral in or spiral out? If we feed everything the model comes up with back as training data, will we get Endless Forms Most Beautiful or will we get an equilibrium?

passion__desire4y ago

Pair this up with the electric sheep paradigm. Evolving the images and prompts together. Voted by people on their screensavers.

munk-a4y ago· 1 in thread

So where can we common plebs go to submit paragraphs for generation?

jazzyjackson4y ago

apparently the twitter replies of the tech priesthood is where we are meant to request miracles of the AI

grumbel4y ago· 1 in thread

Are there any examples of what this thing produces when run on recognizable brands or characters, i.e. Sonic, Mario, Coca Cola, Star Wars, etc. instead of generic words like "astronauts on horse"?

The one I have seen so far[1] is the Twitter logo one, but it's hard to tell if the "Twitter" had much effect here or if it's just the "blue bird" that did it.

[1] https://nitter.net/pic/media%2FFPwj5G-WUA8__UC.jpg%3Fname%3D...

grumbel4y ago

Gollum: https://old.reddit.com/r/MediaSynthesis/comments/u1gkr8/dall...

Rick and Morty: https://old.reddit.com/r/MediaSynthesis/comments/u0ihh7/dall...

Pikachu: https://old.reddit.com/r/MediaSynthesis/comments/u03kfv/dall...

thenerdhead4y ago· 1 in thread

These are pretty cool. They remind me of magic the gathering art and some are quite visually accurate!

At the same time I fear for my illustrator/digital artist friends.

riffraff4y ago

I believe a large part of the illustrator work is tweaking stuff according to feedback, and I suspect no generative AI does that (yet?).

I wonder what would happen if you tried tweaking the prompt here to correct it (e.g. "this is ok, but use smaller hands"): does the drawing change slightly, or do you end up in a completely different design space?

csee4y ago· 1 in thread

Stunning.

My question is whether more compute and more data will be sufficient for the AI to create its own art styles. Everything we see here is within the stylistic paradigms created by previous humans.

isoprophlex4y ago

Probably, interpolating between styles or extrapolating to unseen styles doesn't seem too far-fetched.

However an art style also needs context: human appreciation of aesthetic values, human recognition of a style wrt prior movements... without an "ecosystem of artists and viewers" it might not be so useful.

Nevertheless... As a tool for artists to explore new avenues of expression this could be a fantastic tool, i think.

realPubkey4y ago· 1 in thread

As always, these results are cherry picked by OpenAI. Same as GPT-2/3, the "average" output of the model is barely useable.

ItsMattyG4y ago

Not so. He shows an example of the 10 he chose from, basically all were good.

educaysean4y ago

Wow, okay. I'm kind of blown away at how authentic these paintings look. Even with a very conservative prediction of how these tools could evolve and improve over the years, the signal is strong that our relationship with the meaning of "art" itself will have a fundamental shift.

nonbirithm4y ago

I'm imagining a future where the top replies to art posted online will become "keywords please" or "what model". Hosting sites may start to enact "AI treatises" into their terms of service that segregate human- and AI- generated content into separate areas and ask users to report entries that they suspect do not belong in either. Asking "what model did you use" becomes an insult to a sizeable portion of artistic creators, a genuine question for others, and a phrase whose implications cannot be avoided for all people involved.

What belief systems will we form around AI art after it becomes clear that it's never going away? Many people say that art is subjective. I am thinking that if or when parity between art from humans and AI is achieved, some people are going to believe that a humanistic quality of some sort will be trampled upon in the realization that the two types of art really are indistinguishable. Others might believe that AI art is just another tool that they believe expresses their thoughts. The different beliefs might be fundamentally unresolvable, and this may become an unending source of distrust and sadness in certain art circles within the next decade.

I do not look forward to how this tech will interact with online culture several years from now.

mherrmann4y ago

Can't wait for this for music. To fix the costly cherry-picking process, Spotify should play AI-generated songs in between others. Those who get good engagement should then rise to the top.

uguisain4y ago

An archaeologist once said, "The most merciful thing in the world is the human heart that cannot associate everything together"

Everything that happens in this world has a coherent causal relationship. Whether it is technological development, territorial domination, or even unavoidable natural disasters or unexpected accidents, no one should stay out of it.

If people are willing to face it, perhaps many things will not evolve to the worst level, but in this case, people usually choose to turn a blind eye in order to protect themselves, or some people are very willing to sacrifice other things out of selfishness. They take it for granted that only the victims will bear the consequences in the end, but that is not the case; the laws of the world will one day pay back all cause and effect.

However, this is only limited to the things that the "law" can take effect.

When "exceptions" fill the whole world, then the fate of this world will be nothing but despair.

inb4_cancelled4y ago

I'm genuinely terrified.

micromacrofoot4y ago

some of these are quite beautiful... I've seen AI-generated art before, but these are outrageously better, I couldn't really distinguish a lot of these from human-created art

this is going to absolutely obliterate some markets for illustration and stock photography, unfortunately

guilhas4y ago

So many questions

This makes you think what is real art? Beauty, meaning, context

Can this achieve real art? Or just composing existing art?

If we don't preserve the art industry, and global industry flocks to the lowest common denominator, what will that mean for the future of art

If an artist invents something new and someone unrelated uses it to train a model and generate 100s of compositions who should profit from it?

What was the "license" on the images used to train the model

throwaway6753094y ago

Integrate Samsung frame art mode with DALL-E with ability to set painting style (oil, pastel, etc) for a limitless gallery.

bayesian_horse4y ago

Could someone point me how to make such images yourself? This may be a naive question as it may require non-public code and data... I've seen public colab notebooks with Dall-E but they don't work currently (package problems) and seem to produce a different style of results.

haunter4y ago

Tumblr and Twitter NSFW will not be the same after this. Like this is pure cocaine for porn addicts

fay594y ago

Someone should make an online version of Mysterium that uses Dall-E to make the picture cards!

riidom4y ago

Article about same/similar topic:

https://arnicas.substack.com/p/titaa-28-visual-poetry-humans...

dredmorbius4y ago

https://nitter.net/nickcammarata/status/1511861061988892675

KaoruAoiShiho4y ago

Dall-E will be really good for my creators on https://dulst.com (card game platform)

shon4y ago

Oof. I’m selling my Fiverr stock right now.

gompertz4y ago

It's almost like these were trained using Byte magazine covers! Amazing tech.. I'm blown away.

imwillofficial4y ago

Is there a way normal humans like me can do stuff like this. Like is there a Dall-E 2 app I can download?

mupuff12344y ago

Now to connect it with the Nvidia tech demo that turns 2d pictures to a 3d environment.

dude34y ago

There is a lot of emphasis on eyes. Same thing in Google’s deep dream. Eerie.

smrtinsert4y ago

Almost unbelievable. I'm randomly reminded of the chess automaton.

isoprophlex4y ago

These are so fucking good-looking. Can't believe it.

wallfacer1204y ago

Better than Beeple.

EGreg4y ago

When will an API be available for the rest of us?

drcongo4y ago

These are amazing!

lgvld4y ago

i am surprised the only drawing of a woman it generates is when the bio explicitly contains the word "female".

bias in AI I guess.

marcodiego4y ago

These drawings... they have personality.

mykel834y ago

I am a fresher here I’d love to learn

momensement4y ago

Gangneil's curse

2OEH8eoCRo04y ago

Cruelty Squad vibes

DoRa__7234y ago

もりやしゅんと

lichteins4y ago

マヤノトップガン

toradesu4y ago

night of tokyo

dash24y ago· 38 in thread

Obviously, from the AI point of view, this is just amazing and frankly terrifying.

For a human, it would be dross.

nopinsight4y ago

I do not disagree with your main point in the last paragraph, although I would say that most of human creative output are also evolution and combination of existing works.

What if you found the "better" pieces of these arts in other contexts, like a museum, without knowing who created them? Are you certain that you would still hold the same opinion?

low_tech_love4y ago

pgcj_poster4y ago

> https://twitter.com/prafdhar/status/1511863583906275328

As with most of Dall-E's output, it looks fine at a glance, but is just gross when you look closely. The kid's ear is deformed and blends into their hair in a deeply unsettling way.

heavenlyblue4y ago

Not going to even start on how cheap the whole sentiment of having a dog with a kid and a bunch of stars in the picture is. Of course it appeals to an emotion.

dash24y ago

The one in the twitter link, yes, I'm afraid that is utter dross, and if I saw it in a museum I would burst out laughing! I didn't spot any other ones from dall-e 2 in the thread, though.

native_samples4y ago

https://github.com/openai/dalle-2-preview/blob/main/system-c...

boppo14y ago

>cottagecore

That said, cottagecore is more of a fashion thing than an illustration thing, so my guess is the issue here is just the training data.

rfw3004y ago

> Sorry, but it's dross! It's the kind of work the guy in the art shop up the road churns out, and sells to the ignorant locals in my town.

meroes4y ago

You really think we are all going to be visiting museums with AI generated art one day?

teaearlgraycold4y ago

rm9994y ago

That's exactly right, he specified a style with each and cherrypicked out of 40-60 pictures: https://twitter.com/nickcammarata/status/1512119623315075081

>Btw transparency for this now-viral thread: I didn’t just paste prompts into dall-e, I played with style (eg. cyberpunk, oil, etc) to keep it interesting and diverse

_han4y ago

Was this an unsarcastic “I, an intellectual, […]”?

lanternfish4y ago

gjm114y ago

dash24y ago

No, it was a joke.

unkulunkulu4y ago

plutonorm4y ago

meroes4y ago

The goal posts move because society evolves. Why aren’t encyclopedias, deep blue, or the internet already smarter than any of us?

Micoloth4y ago

Yep! Now it’s literally “it’s not even as good as Picasso!”

Very amusing to watch

jstummbillig4y ago

> Note the difference, though, with a real artist. A real artist takes as input the real world

An AI is virtually limitless in all of these respects.

meroes4y ago

Do we really think people are going to visit museums of AI generated images one day?

Maybe even further in the future when we build museums to educate the public how AI first began and fill it with chess AI and medieval rabbit knights drawn by DALLE-2.

But I’m not sure society won’t advance along with AI, and AI will never occupy places we currently think they will.

hoseja4y ago

Go I'll allow, but both Dota and SC AIs were playing impoverished, simpler versions of the games.

op00to4y ago

I find it hard to believe that artists don’t study and rip off other artists much like AI studies art.

avip4y ago

Your argument falls apart immediately because Visual Basic was a great language for its time.

drdeca4y ago

I think they might have been referring to things written in VB, not the quality of the language itself.

FredPret4y ago

I think real artists do a similar workflow as DallE.

1. Spend an inordinate amount of time looking at other art and practising and evaluating your own art

2. THEN look at reality and paint it, or in the case of DallE, take some keywords and paint it

meroes4y ago

3. The public becomes interested in the art.

Hasn't happened for AI yet.

xtagon4y ago

low_tech_love4y ago

meroes4y ago

100% thank you for saying it.

micromacrofoot4y ago

I’m sorry but you lost me at “I, an intellectual”

lwhi4y ago

Surely, the training material is the most significant determiner of whether the results are dross or culturally significant?

ma2rten4y ago

You are misunderstanding how the technology works. It's trained on a large scale dataset of images, not art specifically.

The reason that it's producing a specific style is that Nick manipulated the text prompt and picked images he liked. He disclosed that in the twitter thread.

throwaway712714y ago

forgive me Your Highness, for I a simple man, allowed myself to enjoy a *dross*

simonh4y ago

These examples don’t do it justice because these profiles are pretty dumb; there are much better ones out there that show off its interpretive ability much better.

animanoir4y ago

I think the next step is for machines to have taste.

jazzyjackson4y ago

Agreed, it is merely interpolating within the space of its input

technically stunning, artistically incestuous

some may say humans are the same, only ever remixing our input, but we have something machines never will: intention, desire, an unhappiness with how things have been so far.

I'm sure the machine age of clip art will be very successful but I can't see myself being moved by any of it.

(just realized the cheekiness of calling the training set "CLIP")

visarga4y ago

It's not the training set. CLIP stands for Contrastive Language-Image Pretraining. It's a model that tells when a text and an image match.
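
For the curious, a bare-bones sketch of what "tells when a text and an image match" means in a contrastive setup: score every text against every image with cosine similarity, then train so each text picks its own image and vice versa. The embeddings and the temperature value here are made up, and this is plain NumPy rather than the real model.

```python
import numpy as np

def match_scores(text_embs, image_embs):
    """Cosine similarity between every text and every image embedding."""
    t = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    i = image_embs / np.linalg.norm(image_embs, axis=1, keepdims=True)
    return t @ i.T

def contrastive_loss(scores, temperature=0.1):
    """Symmetric cross-entropy: the i-th text should pick the i-th image, and vice versa."""
    logits = scores / temperature

    def xent(l):
        l = l - l.max(axis=1, keepdims=True)  # subtract row max for numerical stability
        log_probs = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_probs))   # correct pairs sit on the diagonal

    return 0.5 * (xent(logits) + xent(logits.T))

# Made-up embeddings: pair 0 and pair 1 each point in roughly the same direction.
text_embs = np.array([[1.0, 0.1], [0.1, 1.0]])
image_embs = np.array([[0.9, 0.0], [0.0, 1.1]])

scores = match_scores(text_embs, image_embs)
aligned = contrastive_loss(scores)
shuffled = contrastive_loss(match_scores(text_embs, image_embs[::-1]))
```

The loss is low when matching pairs line up and high when the images are shuffled, which is the signal the training pushes on.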

sanman8114y ago· 20 in thread

This tweet provides important context: https://twitter.com/nickcammarata/status/1512119623315075081...

They weren’t just copying/pasting prompts; there was human creativity involved as well

habitue4y ago

It is important context, but just to push back against people over-correcting on this, my guess is that the ones he rejected also looked approximately this good.

nicklovescode4y ago

Yeah that’s right. There were very few strictly-bad ones across the entire thread of generations

The rejections were most commonly

1. Kind of just slightly boring or literally drawing the thing rather than being cool and artistic

amilios4y ago

https://twitter.com/nickcammarata/status/1512123067803344899...

You're absolutely right, here he displays the full set for a given prompt. They all look fantastic!

tragictrash4y ago

I've been sitting here with my mouth wide open for 5 minutes unable to move past what you just showed me. I can't fathom that this exists.

nwienert4y ago

Having worked with Nick extensively, take what he says with a grain of salt. He’s well known even by close friends to be a reality distorter, to put it softly.

recuter4y ago

If only James Randi was around. What a fantastic example of cold reading.

Gather round, gather round, give me a text, any text at all and I will produce you an image of some kind. And you will call it "good" if it looks like anything at all.

Because all art is subjective and your mind will work overtime to connect it back to the text you provided.

babyshake4y ago

Does OpenAI have a GUI that you're using or is that a CLI?

Veedrac4y ago

educaysean4y ago

Ah, I wish this fact had been highlighted better. Not a criticism of the tweet author; it's just that twitter threads really aren't designed to convey context.

curiousgal4y ago

recuter4y ago

Of course not. I'm no longer surprised just how eager people are to believe an "AI" will read their minds or has magical qualities and a mind of its own. Even on HN.

Jiggle the imagination just a little bit, dangle some progress, and we're off to the races.

This is "I'm feeling lucky" on google image search + style transfer + trial and error.

If you think I am being dismissive try a few of these twitter bios as searches and see for yourselves.

I guess it fits with the times we live in. Reward shallow plagiarism. Outsource your mind.

It isn't theft if you can automate it.

Autotune for the deaf, Dall-E for the blind.

campground4y ago

joshcryer4y ago

rndphs4y ago

Yeah I just tried google image searching to find something like the pikachu photo from https://mobile.twitter.com/gottapatchemall/status/1511777860...

But I can't find anything close to the realism that DALL-E 2 achieved here.

recuter4y ago

I don't want to be dismissive of Dall-E itself or its authors. Just the implications that this changes everything or how it is much more than it really is.

https://twitter.com/nickcammarata/status/1512123067803344899...

Prompt: "expressive painting of a man shining rays of justice and transparency on a blue bird twitter logo"

You have to break the concepts up apart (which is one of the things Dall-E improved on).

As such: "expressive blue bird"

https://www.google.com/search?q=expressive+blue+bird&tbm=isc...

The same for "ray of light". In fact the top results there I get pngs of sun beams on a transparent background. Which is perfect.

Composite those things together manually and add a style transfer you'll get similar results to DALL-E as that is what it is doing more or less.

andybak4y ago

I've coincidentally just been watching Rick and Morty and this really fits when read in Rick's voice.

Is yawning at everything astonishing not just exhausting? Everything is "just" made up of less impressive things. But is this really not worthy of a little wonderment?

fastball4y ago

I dunno, I really like the "happy sisyphus" one and I'm not seeing anything remotely as nice (or similar really) on Google Images[1]...

[1] https://www.google.com/search?q=happy+sisyphus

manigandham4y ago

Side note: Those last 3 lines would make fantastic lyrics

gfodor4y ago

A proposition for your consideration: what if you’re wrong?

hemreldop4y ago

It’s a guy not a group.

benlivengood4y ago· 20 in thread

Weren't the last AI-is-impossible holdouts hanging onto creativity as the domain of true intelligence?

I disregard the narrow-AI-only folks almost on principle; Terence Tao, Albert Einstein, Mozart, and Van Gogh couldn't do each other's jobs.

hcarvalhoalves4y ago

psyc4y ago

idleproc4y ago

TOMDM4y ago

Artists that produce something truly new currently seem to be once in a generation geniuses.

22c4y ago

> or is it remixing pieces of the creativity from the authors in the training data samples?

I think you just described the majority of what is (for many) the "creative process".

benlivengood4y ago

hemreldop4y ago

That’s exactly what humans do too, which is why it is easy to break down art into schools/styles/epochs

function_seven4y ago

I'm hanging on to folding clothes as my litmus test of AI :)

Or driving a car everywhere I can.

joshcryer4y ago

kingcharles4y ago

https://foldimate.com/

hwers4y ago

Hang on to NP-complete problems instead; those will stick around for a while, I expect.

platers4y ago

Folding clothes is more of a robotics problem than an AI problem. Paralyzed people are just as intelligent as everyone else!

tikwidd4y ago

This program, like everything that has been called "AI", is following an algorithm. It's an impressive algorithm but not fundamentally different from my dishwasher.

benlivengood4y ago

BlueTemplar4y ago

ItsMattyG4y ago

Let's just keep moving those goal posts.

3234y ago

esjeon4y ago

The thing is, the hardest part of "creativity" is that one must do it voluntarily. That has turned out to be not so easy for a computer. (But I would not dare to declare it outright impossible.)

hemreldop4y ago

Humans did the same, just with what other humans did before them. Would a Dall-e 2 trained only on other Dall-e images satisfy what you’re gatekeeping here?

donkarma4y ago

still can't do hands, and has to be told what to do

jl64y ago· 18 in thread

“Dall-E, generate a collection of images showing plausible war crimes from the current conflict”

“Dall-E, take this image of Dallas in 1963 and infer a new angle showing the real shooter”

rendall4y ago

> Work with GPT-3 to generate plausible Twitter profiles...

I had some fun last week constructing a conspiracy theory about this. Remember, the best conspiracy theories are unfalsifiable.

What if this has already happened? Most of the profiles on Twitter, Facebook, etc and even here on HN are in fact AI generated.

AI has metastasized and is already manipulating its environment, including humanity, to its own implacable purposes, and uses social media as one tool in its tool belt.

Thorentis4y ago

This conspiracy, known as the "Dead Internet Theory" has been around for a while: https://www.theatlantic.com/technology/archive/2021/08/dead-...

RedGreenBlack4y ago

> AI has metastasized and is already manipulating its environment, including humanity, to its own implacable purposes, and uses social media as one tool in its tool belt.

Absolutely love this, mind completely blown

eurasiantiger4y ago

This is not just a conspiracy theory, it is a prudent question to humanity.

divbzero4y ago

Reminds me of the reputation-based filtering system that Neal Stephenson described in Anathem for their version of the Internet:

Our Internet’s search engines already do a limited version of this, but there’s room to make the reputation-based filtering stronger and more transparent to users.

Schroedingersat4y ago

> more transparent to users.

You're funny.

Jeff_Brown4y ago

Seems like that's where we're headed.

jl64y ago

A world where everybody should be checking the digital signatures and chain of custody of tiktoks… but nobody does.

babyshake4y ago

I have a young child. I think about this almost every day and how I'm somehow going to need to start navigating through this type of world and help my child navigate through it.

kirubakaran4y ago

If history is any indication, the child will be fine navigating the "future". That will be the normal for them. You, not so much (not without much effort anyway).

recuter4y ago

Those are great examples of prompts it wouldn't be able to produce.

It could potentially spew out a grainy black and white photo of a shooting of somebody by someone somewhere. But it would not be Oswald and JFK and not the real Dallas.

robbedpeter4y ago

recuter4y ago

All the more impressive that the CIA was able to fake those videos without any computing power ;)

adamsmith1434y ago

Of course plenty of folks have made homegrown versions of Dall-e and GPT-3, and it is only a matter of time before they replicate this as well.

lekevicius4y ago

Don't worry, Dall-E adds these coloured squares to the bottom right corner, so you will always know that an image is AI-generated. (/s)

status2004y ago

If you read OpenAI's disclosures, they explicitly programmed around the concerns that you've raised.

yur3i__4y ago

OpenAI might have but the next people to make this might not. Imagine Russian/Chinese gvt misinformation cells with these capabilities for example.

user39393824y ago

deltaonefour4y ago· 17 in thread

nonbirithm4y ago

The scariest thing is that I don't think we can stop ourselves from innovating further even if we tried. The authors believed that the merit of displaying their progress outweighed the implications.

6gvONxR4sf7o4y ago

The quote about jobs comes to mind: "If it’s jobs you want, then you should give these workers spoons, not shovels."

(full context: https://quoteinvestigator.com/2011/10/10/spoons-shovels/)

ulnarkressty4y ago

aaaaaaaaaaab4y ago

>going from first flight to moon landing in a few decades

And then nothing.

boplicity4y ago

MisterBastahrd4y ago

I'll try to remember that the next time I listen to Sirius while using GPS to navigate.

Jeff_Brown4y ago

karmasimida4y ago

When Copilot and similar services get to this DALL-E level of accuracy at coding ...

It is going to be very relevant to current software engineers, maybe within just the next 5 years.

Jeff_Brown4y ago

I'm skeptical.

The thing about art is that so much qualifies as good. Something very close to a beautiful painting is probably also a beautiful painting.

But an algorithm very close to the right way to count votes, or launch a rocket, or decide whether to lend to someone, is probably not the right way.

deltaonefour4y ago

Something like interstellar travel or even a civilization on mars is actually much less realistic due to the lack of examples in existence.

cwkoss4y ago

It will be better! Giving every human the tools to make expressive works of art without having to train for years will be awesome for society!

I'm playing with some of the neanderthal-relatives of dall-e 2 and already have several that I kind of want to analog-paint copies of so I can hang em on my wall.

zimpenfish4y ago

> already have several that I kind of want to analog-paint copies of so I can hang em on my wall.

[1] "Dark Souls in the style of linocut" makes some really fascinating possibilities.

cwkoss4y ago

troyvit4y ago

In the near future when you look at art you won't know whether it was created directly by a human or by an AI. What will that do to its appreciation?

Jeff_Brown4y ago

> don't even think artists are going to be meaningfully hurt

Jeff_Brown4y ago

> increase demand for art

deltaonefour4y ago

Apply this to everything. AI that takes over all possible jobs involving any form of creativity and intelligence.

What are the economic consequences of such a society?

lofatdairy4y ago· 11 in thread

orangecat4y ago

Text also seems to be gibberish even if aesthetically coherent.

Although "Follove me" is either brilliant or a lucky accident (https://twitter.com/nickcammarata/status/1511904232252784641).

jonas214y ago

> "power bottom dad is for the people", which definitely got the "dad" part, but it looks like it still struggled to understand "power bottom"

To be fair, as a human, I struggle to understand what this means.

moeris4y ago

At least in gay culture, power bottom has a pretty specific meaning. I imagine "for the people" is referencing socialist leanings.

Admittedly, it could mean many things. But it pushes my mind towards fully automated gay space communism.

quirino4y ago

This really makes me think that the next major paradigm-shift in society is AI-related. (The most recent one being the Internet, or possibly the iPhone)

thomashop4y ago

Friends and I run the site https://pollinations.ai/ which allows experimenting with a variety of open-source versions of DALL-E. Some are quite impressive.

flycaliguy4y ago

Illustrators should feel relieved that they still have a few years left to rub Adobe’s AI all over Dall-e’s AI to create final drafts.

tmalsburg24y ago

But see these examples where it utterly fails at even the most basic form of compositionality:

https://mobile.twitter.com/david_madras/status/1512573390896...

recuter4y ago

This is from the paper. We don't talk about it.

kzrdude4y ago

Yep, seems like it knows about words but not grammar

TheDudeMan4y ago

> additional photoshop would be both easy and readily doable.

Prediction for 2 days after this code is released: Just double-click the area you don't like and boom -- a variant.

thomashop4y ago

Already possible.

"DALL·E 2 can make realistic edits to existing images from a natural language caption. It can add and remove elements while taking shadows, reflections, and textures into account."

https://openai.com/dall-e-2/

zuzun4y ago· 7 in thread

Oof. Editorial illustrators are about to get automated.

tantalor4y ago

No, you still need humans involved (with artistic ability) to work the machine and sort through the chaff.

Permit4y ago

> with artistic ability

With artistic taste, not ability. For example the author likely couldn’t have created any of these images himself.

jdminhbg4y ago

And even if the author could have created them himself, he couldn't have done it in the span of a few minutes each like he did for the thread.

hwers4y ago

They'll just be A/B tested out of a collection of alternatives. (After all, isn't that what the artistic filter is supposed to act as a proxy for in the first place?)
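A minimal sketch of that kind of selection, assuming each generated illustration is shown to some readers and clicks are counted. The variant names, the numbers, and the `pick_winner` helper are all made up for illustration, and a real test would also check statistical significance rather than just taking the highest rate.

```python
def pick_winner(results):
    """Return the variant with the highest click-through rate.

    `results` maps a variant name to (impressions, clicks).
    Variants with zero impressions are treated as having a rate of 0.
    """
    def click_through_rate(item):
        impressions, clicks = item[1]
        return clicks / impressions if impressions else 0.0

    return max(results.items(), key=click_through_rate)[0]


# Hypothetical results for three generated illustrations of the same article.
results = {
    "illustration_a": (1000, 30),
    "illustration_b": (1000, 45),
    "illustration_c": (0, 0),
}
winner = pick_winner(results)
print(winner)
```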

rsanek4y ago

ALittleLight4y ago

schroeding4y ago

But how many humans will still be necessary, in comparison to the status quo? How much will this affect the "market value" of normal / non-famous illustrators?

But on the plus side, even small publications will get really pretty, custom illustrations! :)

sc00ty4y ago· 6 in thread

For those unfamiliar, you can see some examples of the actual game cards here: https://www.libellud.com/wp-content/uploads/2022/03/DIXIT_OV... (PDF warning)

cwkoss4y ago

(Could also probably generate some fantastic training data)

sc00ty4y ago

I ended up writing my own after it first shut down and even though the community was small, it was incredibly fun. Doing this with Dall-E 2 sounds like a fun project to bring back some nostalgia.

https://en.wikipedia.org/wiki/Broken_Picture_Telephone

cwkoss4y ago

Sounds exactly like GarticPhone - great game: it's free and only requires a web browser. It's become the go-to for our remote company happy hours.

Several people have reported laughing so hard they were sore the next day.

https://garticphone.com/

jkingsman4y ago

That was my first thought as well! The directed, purposeful illustrations that are open to myriad interpretations feel so much like the Dall-E work.

dweez4y ago

noirbot4y ago

It reminded me a lot of the art in Mysterium as well, where the premise is that the art cards are visions being presented to mediums from a ghost to try to hint towards how they died.

TOMDM4y ago· 6 in thread

Creative industries are on the cusp of a massive upset.

Relatively soon, there will be commercial models of this quality for music/code/text/speech/images/3d models etc.

Once these AI generated assets flow like water into the hands of creators, it will significantly change the way people work.

There's no reason to expect that similar use cases won't make their way into other industries.

- Rapid prototypes of models/textures for video games.

- Quick and easy samples for musicians.

- Emotive speech for audio books and transcriptions.

It won't replace everything, but so much of our media uses art as noise, to fill a gap, and with this, it can be done almost everywhere on the cheap.

mbesto4y ago

> Creative industries are on the cusp of a massive upset.

If anything this means an explosion of entertainment, not an upset.

chrisco2554y ago

dorkwood4y ago

TOMDM4y ago

Exactly.

I can't imagine how much media would begin to use 3d assets if it became an order of magnitude cheaper to do so.

Not to mention, imagine the pipeline of

1. "GPT-3, give me 10,000 descriptions of different doors"

2. "Dall-E 2, give me PBR textures for these 10,000 door descriptions"

3. Repeat 1 and 2 for every asset you need

4. "Dall-E 2, give me 10,000 floorplans for apartments, common areas, shopping centers etc."

5. "GPT-3, describe the contents of this apartment/common area/shopping center etc."

6. Use an algorithm to parse out the floorplans (ditch the ones that don't work, we can just generate more), populate it with assets specified in step 5 and generated in steps 1 and 2.

We could procedurally generate entire cities for games with unique assets everywhere. It would still probably look nicer with a human in the loop, but the possibilities are staggering.
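The "ditch the ones that don't work, we can just generate more" part of step 6 can be sketched as a generate-and-filter loop. Everything here is a stand-in: `generate_floorplan` fakes a model call with random numbers, and `floorplan_is_usable` is a toy check, since the real parsing and validation would depend on what the model actually outputs.

```python
import random


def generate_floorplan(rng):
    """Stand-in for a model call: returns a list of room areas in square metres.

    Some areas are deliberately invalid (zero or negative) to mimic bad outputs.
    """
    room_count = rng.randint(1, 6)
    return [rng.randint(-5, 40) for _ in range(room_count)]


def floorplan_is_usable(plan):
    """Toy validity check: at least two rooms, every room has positive area."""
    return len(plan) >= 2 and all(area > 0 for area in plan)


def collect_usable_floorplans(count, seed=0, max_attempts=10_000):
    """Keep generating until `count` usable plans exist or attempts run out."""
    rng = random.Random(seed)
    usable = []
    attempts = 0
    while len(usable) < count and attempts < max_attempts:
        attempts += 1
        plan = generate_floorplan(rng)
        if floorplan_is_usable(plan):
            usable.append(plan)
    return usable


plans = collect_usable_floorplans(5)
print(len(plans), "usable floorplans")
```

The attempt cap is there so a generator that never produces a valid plan cannot loop forever.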

rhelsing4y ago

If you're interested, I'm actively working on the music & samples side of things at https://www.neptunely.com . We are still in beta but hoping to launch this summer!

esjeon4y ago

Why stop at textures, when we can generate the whole set of models?

Seriously, tho, we need these for VR, which is all about copying everything in the real world into virtual worlds with some twists.

klaussilveira4y ago· 6 in thread

mrjangles4y ago

...and one of the things that people find remarkable about it is that it isn't immediately and freely available to everyone on earth.

throwaway483754y ago

To be fair they call themselves OpenAI. Expecting things to be a little more open isn't unreasonable. They are kinda setting themselves up to disappoint people with that name.

PoignardAzur4y ago

I'm not too upset, though. The way the technology is progressing, it's a pretty short time span between "the bleeding edge researchers can do it" and "there's a phone app that can do it for free".

systemvoltage4y ago

I disagree. I am glad people are getting paid (exceptionally well) to do stuff like this and they should charge for their efforts. We need more initiatives like this, not less.

Open source is amazing boon to our generation, it has enabled free access to basic building blocks for people to build amazing things. But I don't think it is a silver bullet for everything.

Bud4y ago

It's not "bizarre" that computing resources cost money and are finite. Nor is "pay-to-experiment" accurate. Nor is "exclusive club" really fair, or in good faith.

It's a beta that they are running with their own resources. It makes complete sense that they'd have to limit access.

joshcryer4y ago

EleutherAI or others will probably recreate it based on the paper they released. The main innovation is CLIP and how they changed the way it approaches turning text into images.

arriu4y ago· 6 in thread

Is there a future for art with this type of thing getting more advanced each year?

sharps_xp4y ago

throwaway6753094y ago

Sure, but that speaks more to your personal connection to the creator of the art and less to her actual artistic ability, which might be utter drivel.

Whereas I could take some of these images generated by DALL-E, slap a human-sounding artist name on them, and 99% of the general populace would enjoy them just as if they were human-produced art.

dharmab4y ago

There are artists who use AI art as an instrument or medium. They do considerable work tuning the inputs and post-processing and contextualizing the output.

6gvONxR4sf7o4y ago

Yes, just like there was a future for art once the photograph was invented. Certain parts of art contracted, but overall art changed and expanded.

jstummbillig4y ago

The future of art is for you. Just as with every other occupation.

whateveracct4y ago

The ability to draw, paint, etc will still be highly valuable.

In fact, in a world where the average artwork is AI derived, the value of skilled artists may even go up. There's more to art than technically putting lines places.

amelius4y ago· 5 in thread

Perhaps someone can write a HN reader where headlines are fed through Dall-E, and the images appear on top of the stories.

jasonjayr4y ago

This totally could be coupled with a CMS/Blogging platform that automatically adds illustrations from headlines/pullquotes

Bilal_io4y ago

This could replace a huge portion of the stock image market.

For example, The Verge writes an article about Microsoft, they don't need to pay royalties for an image that has Microsoft logo displayed on a building, one can be generated for them.

a-r-t4y ago

And coupled with GPT-3...

hans17294y ago

That’s an _amazing_ idea.

TOMDM4y ago

Or imagine a script to chop a book/podcast into segments to add visuals.

tailspin20194y ago· 5 in thread

These are so good, it's breaking my brain a little.

They're not just conceptually accurate, but to my eyes they're pleasing to look at from a purely artistic point of view. I'd put these on my wall.

I already take a fairly bullish position on the potential of AI, given a long enough timeframe, but it does feel like we're reaching a bit of a tipping point here.

It's starting to prod at the paradigms I hold in my head about what I think "art" is.

The commoditisation of art.

So the new art maybe becomes meta-art. Perhaps human artistic endeavours become more about curation rather than creation?

awb4y ago

> human artists themselves are trained on the output of other artists

Artists take inspiration from other places too like nature, imagination, dreams, etc.

Every once in a while an artist like Picasso, Dali, Pollock, etc. comes up with a new style that’s instantly distinguishable as unique from the artists that existed prior to them.

Dall-E 2 is an amazing achievement, and could replace most unoriginal artists.

If Dall-E 3 can produce novel artistic styles, that would transform art as we know it.

tailspin20194y ago

Yes, very good points!

disqard4y ago

I thought you might like to know about "Experiments in Musical Intelligence" (aka "Emmy"), David Cope's creation, now "deceased":

https://www.theguardian.com/technology/2010/jul/11/david-cop...

"One day Cope pushed a button on Emmy, went out to get a sandwich and when he returned his workaholic creation had produced 5,000 original Bach chorales."

tailspin20194y ago

Awesome link.

This bit seems particularly interesting:

So presumably, we can find "soul" and meaning in computer produced art because a large part of the meaning that we derive from art comes from within us, not necessarily the artist.

This is interesting to contemplate.

BlueTemplar4y ago

> He realised that what made a composer properly understandable, properly "affecting", was in part the fact of mortality.

Impressive how Asimov has figured this out a while ago (in the Bicentennial Man) !

Otherwise, why isn't he using a pseudonym ?!?

contextfree4y ago· 5 in thread

Nition4y ago

It would be nice if every AI like this had an option to show the 10 closest-matching training images to the output. Especially for ones like thispersondoesnotexist.com.

nicklovescode4y ago

for that one IIRC I asked for a Robert Moses and one of the cooler ones had giant hands, so I put that in the prompt and then took two of my favorites from the next batch

contextfree4y ago

thanks for the response!

JohnBerea4y ago

The hands symbolize the Red Sea Parting of Moses.

contextfree4y ago

interesting hypothesis!

donkarma4y ago· 5 in thread

Incredibly misleading, he didn't directly paste bios into the description and they were massively curated.

throwaway6753094y ago

Of course they're CURATED, but there are several links where he shows the full set of images that were generated and I would say between 70 to 80% of them are pretty decent aesthetically speaking.

Given that it's able to generate a dozen images in less than a minute, and all I have to do is pick out the ones that are aesthetically pleasing, I'd say that's a damn good win.

555554y ago

He’s curated those links too, though.

educaysean4y ago

platers4y ago

dntrkv4y ago

If one of these images came out of a set of 1000 I would still be impressed.

Evidlo4y ago· 4 in thread

I'm surprised how coherent most of these drawings are, instead of some warped monstrosity like you see from deepdream or thisanimedoesnotexist.

Most of them have a theme that makes sense, too.

tempestn4y ago

This comment in another thread suggests why that's the case:

https://news.ycombinator.com/item?id=30933091

drcode4y ago

Evidlo4y ago

OK, it looks like there is still some weirdness if you look at it too closely, like extra fingers or gross faces.

jfoster4y ago

Those sometimes happen when humans create art, too.

detritus4y ago· 4 in thread

Am I the only person here to think this is utter bullshit and some amazingly-developed prank?

Otherwise, my stomach is in knots, because this is terrifying.

BlueTemplar4y ago

Basically my feeling about GPT-3 (or was it 2?) when it "released", except with a low weight for the prank possibility.

plutonorm4y ago

yes, finally the world is catching up with things.

boppo14y ago

Check out my post history 2 before this one. We'll be fine.

detritus4y ago

Sub-GAI algorithms don't need any competence in 'fine' art to derive vicious selection criteria for processing meatbags.

It's not the compiled art processing here that terrifies me, it's the complexity of logic underpinning that is displayed and how it could be used elsewhere.

kache_4y ago· 4 in thread

I've had my head in the sand for a while regarding generative AI, but now I'm getting pretty scared

alcover4y ago

I'm in disbelief.. Scared also - why not ?

Someone knowledgeable in this thread, please tell us it's possible to backtrace such an illustration to its learning set sources.

If these things are not just a controlled average(?) of real drawings, then something gigantic has been unlocked.

mortenjorck4y ago

> I'm in disbelief.. Scared also - why not ?

It's thrilling seeing these breakthroughs, and just as manned flight transformed the world, whatever the likes of GPT, PaLM, and DALL-E become will make the future weird in ways we can't predict.

6gvONxR4sf7o4y ago

There's a sense in which they're a controlled average of real drawings, but it's not any more useful of a lens than the sense in which you're a controlled average of your experiences.

gfodor4y ago

I'm gonna go with unlocked on this one.

avalys4y ago· 3 in thread

This makes Dall-E 2 both more and less impressive to me.

mherrmann4y ago

> parroting content and styles that it has basically memorized and is really good at interpolating between.

Maybe that's exactly what we humans do too?

csee4y ago

fastball4y ago

> obvious it's not any more advanced than a graphical version of GPT-2

What makes this obvious?

hwers4y ago· 3 in thread

"We've made the scarce resource abundant, finally the scarce thing is democratized!"

"Wait why doesn't anyone care about the scarce thing anymore?"

danuker4y ago

According to the book "Abundance", everything gets devalued over time, meaning cheaper and easier to make, leading to some sort of utopia where everyone can afford more and more.

nonbirithm4y ago

hemreldop4y ago

Move on to the next scarce thing until we get to the point with true general AI and then we utopia away.

mupuff12344y ago· 3 in thread

I can't help but be suspicious since there is no site to try it out, and I can't think of a good reason as to why there isn't one.

educaysean4y ago

The tool seems to be not open to public "yet". Nothing nefarious. You can join the waitlist here: https://openai.com/dall-e-2/

MintsJohn4y ago

Needed computing power seems a good enough reason to limit it. But I agree it looks too good to be true, only real use will show how well it really works.

jazzyjackson4y ago

kingcharles4y ago· 3 in thread

Remember, these are all public domain, at least in the USA which does not allow copyright assignments to the artistic output of machines.

robbedpeter4y ago

Copyright with dall-e is just like copyright with photoshop or any other software. The user of the tool owns the output. Subject to whatever other limitations and requirements OpenAI wants.

jazzyjackson4y ago

https://openai.com/api/policies/terms/

The dumbass you refer to was testing patent law by trying to register his software as the inventor but the copyright case is different:

> the office’s 2019 ruling [...] found his A.I.-created image “lacks the human authorship necessary to support a copyright claim.”

> Thaler noted to the [US Copyright Office] he was “seeking to register this computer-generated work as a work-for-hire to the owner of the Creativity Machine.”

https://www.smithsonianmag.com/smart-news/us-copyright-offic...

gwern4y ago

refulgentis4y ago· 2 in thread

It was striking to:

- read the OpenAI paper

- notice there were a lot of words in the harm section

- notice no one in actual AI art community has received an invite, but random SV hoi polloi and OpenAI employees have

It was shocking to see DallE 2 get announced and take that exact approach.

We'll keep working; LAION's 5B dataset is starting to approach the numbers cited in Meta's and OpenAI's papers.

cscurmudgeon4y ago

> - notice no one in actual AI art community has received an invite, but random SV hoi polloi and OpenAI employees have

Same with GPT-3. Requested an invite. Never received one. I have written survey articles comparing different methods. So that was probably a red flag for them.

jazzyjackson4y ago

Thank you for putting this into words, "SV hoi polloi" ! perfect

I'm pretty tweaked at how copyright is used here:

Google gets to scan every book in the world, build derivative models off it, but when we want to see the source data we get "page omitted from this limited preview"

OpenAI CLIP scrapes all of google images, but isn't allowed to show us the source material in its training set, since that would constitute copyright infringement

Why do the robots have the rights to the world's information while humans are left to the derivative output as the internet is flooded with auto-encoded content?

I'm going to start my own internet, no bots allowed. In the future, privacy is paramount; if you let a bot see your work you're bound to be plagiarized in a thousand variations.

zaking174y ago· 2 in thread

Maybe we're just search engines of a similar kind.

I'm feeling future shock; heavy future shock.

rndphs4y ago

Human artists also do a whole lot of mimicry. One could look at art produced by many artists and say that it is just things stitched together from pre-existing art.

“Good artists copy, great artists steal.”

zarzavat4y ago

For example the “enterprise vector people” graphics you see on every corporate website. Most human art is extremely repetitive.

And DALL-E is impressive exactly because it has traded some of that creativity/weirdness away. But it’s still pretty damn weird.

ThePhysicist4y ago· 2 in thread

BrianOnHN4y ago

This question peers into the "explainable AI" issue of "black box" solutions like this.

Based on my understanding, the only way to do this would be to write a separate classifier algorithm using the same dataset.

totony4y ago

Yes, as far as i understand this is pretty simple given how dall e is made. Simple vector similarity search would work on the image embeddings (i think)
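A minimal sketch of that idea, assuming each training image already has an embedding vector (for example from CLIP) and the generated image can be embedded the same way. The file names and three-dimensional vectors below are made up for illustration; real embeddings have hundreds of dimensions, and over a large training set you would want an approximate nearest-neighbour index rather than this full scan.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def nearest_training_images(query_embedding, training_embeddings, k=10):
    """Return the names of the k training images most similar to the query."""
    scored = [
        (cosine_similarity(query_embedding, embedding), name)
        for name, embedding in training_embeddings.items()
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [name for _, name in scored[:k]]


# Hypothetical embeddings for three training images and one generated image.
training = {
    "bird.png": [0.9, 0.1, 0.0],
    "sunbeam.png": [0.1, 0.9, 0.1],
    "cathedral.png": [0.0, 0.2, 0.9],
}
generated = [0.8, 0.3, 0.0]
closest = nearest_training_images(generated, training, k=2)
print(closest)
```

Note that this only finds training images that sit close to the output in embedding space; it does not prove the model actually drew on them.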

Mizza4y ago· 2 in thread

This is going to put a lot of artists out of work in a very short time. Not happy about that.

CamelRocketFish4y ago

I imagine you were also unhappy when switchboard operators were replaced by computers as well.

xwdv4y ago

They will move on to more stable careers.

nicklovescode4y ago· 1 in thread

Hey everyone, Nick here creator of the linked thread. I just wanted to link to another tweet I have with some details of how I made it.

https://twitter.com/nickcammarata/status/1512119623315075081

drcongo4y ago

tarxzvf4y ago· 1 in thread

passion__desire4y ago

Pair this up with the electric sheep paradigm. Evolving the images and prompts together. Voted by people on their screensavers.

munk-a4y ago· 1 in thread

So where can we common plebs go to submit paragraphs for generation?

jazzyjackson4y ago

apparently the twitter replies of the tech priesthood are where we are meant to request miracles of the AI

grumbel4y ago· 1 in thread

Are there any examples of what this thing produces when run on recognizable brands or characters, i.e. Sonic, Mario, Coca Cola, Star Wars, etc. instead of generic words like "astronauts on horse"?

The one I have seen so far[1] is the Twitter logo one, but it's hard to tell if the "Twitter" had much effect here or if it's just the "blue bird" that did it.

[1] https://nitter.net/pic/media%2FFPwj5G-WUA8__UC.jpg%3Fname%3D...

grumbel4y ago

Gollum: https://old.reddit.com/r/MediaSynthesis/comments/u1gkr8/dall...

Rick and Morty: https://old.reddit.com/r/MediaSynthesis/comments/u0ihh7/dall...

Pikachu: https://old.reddit.com/r/MediaSynthesis/comments/u03kfv/dall...

thenerdhead4y ago· 1 in thread

These are pretty cool. They remind me of Magic: The Gathering art and some are quite visually accurate!

At the same time I fear for my illustrator/digital artist friends.

riffraff4y ago

I believe a large part of the illustrator work is tweaking stuff according to feedback, and I suspect no generative AI does that (yet?).

csee4y ago· 1 in thread

Stunning.

My question is whether more compute and more data will be sufficient for the AI to create its own art styles. Everything we see here is within the stylistic paradigms created by previous humans.

isoprophlex4y ago

Probably; interpolating between styles or extrapolating to unseen styles doesn't seem too far-fetched.

Nevertheless... As a tool for artists to explore new avenues of expression this could be a fantastic tool, i think.

realPubkey4y ago· 1 in thread

As always, these results are cherry picked by OpenAI. Same as GPT-2/3, the "average" output of the model is barely useable.

ItsMattyG4y ago

Not so. He shows an example of the 10 he chose from, basically all were good.

educaysean4y ago

nonbirithm4y ago

I do not look forward to how this tech will interact with online culture several years from now.

mherrmann4y ago

Can't wait for this for music. To fix the costly cherry-picking process, Spotify should play AI-generated songs in between others. Those who get good engagement should then rise to the top.

uguisain4y ago

An archaeologist once said, "The most merciful thing in the world is the human heart that cannot associate everything together"

However, this is only limited to the things that the "law" can take effect.

When "exceptions" fill the whole world, then the fate of this world will be nothing but despair.

inb4_cancelled4y ago

I'm genuinely terrified.

micromacrofoot4y ago

some of these are quite beautiful... I've seen AI-generated art before, but these are outrageously better, I couldn't really distinguish a lot of these from human-created art

this is going to absolutely obliterate some markets for illustration and stock photography, unfortunately

guilhas4y ago

So many questions

This makes you think: what is real art? Beauty, meaning, context?

Can this achieve real art? Or is it just composing existing art?

If we don't preserve the art industry, and global industry flocks to the lowest common denominator, what will that mean for the future of art?

If an artist invents something new and someone unrelated uses it to train a model and generate 100s of compositions, who should profit from it?

What was the "license" on the images used to train the model?

throwaway6753094y ago

Integrate Samsung Frame art mode with DALL-E, with the ability to set the painting style (oil, pastel, etc.), for a limitless gallery.

bayesian_horse4y ago

haunter4y ago

Tumblr and Twitter NSFW will not be the same after this. Like this is pure cocaine for porn addicts

fay594y ago

Someone should make an online version of Mysterium that uses Dall-E to make the picture cards!

riidom4y ago

Article about same/similar topic:

https://arnicas.substack.com/p/titaa-28-visual-poetry-humans...

dredmorbius4y ago

https://nitter.net/nickcammarata/status/1511861061988892675

KaoruAoiShiho4y ago

Dall-E will be really good for my creators on https://dulst.com (card game platform)

shon4y ago

Oof. I’m selling my Fiverr stock right now.

gompertz4y ago

It's almost like these were trained using Byte magazine covers! Amazing tech.. I'm blown away.

imwillofficial4y ago

Is there a way normal humans like me can do stuff like this? Like, is there a Dall-E 2 app I can download?

mupuff12344y ago

Now to connect it with the Nvidia tech demo that turns 2D pictures into a 3D environment.

dude34y ago

There is a lot of emphasis on eyes. Same thing in Google’s DeepDream. Eerie.

smrtinsert4y ago

Almost unbelievable. I'm randomly reminded of the chess automaton.

isoprophlex4y ago

These are so fucking good-looking. Can't believe it.

wallfacer1204y ago

Better than Beeple.

EGreg4y ago

When will an API be available for the rest of us?

drcongo4y ago

These are amazing!

lgvld4y ago

I am surprised the only drawing of a woman it generates is when the bio explicitly contains the word "female".

bias in AI I guess.

marcodiego4y ago

These drawings... they have personality.

mykel834y ago

I am a fresher here; I’d love to learn.

momensement4y ago

Gangneil's curse

2OEH8eoCRo04y ago

Cruelty Squad vibes

DoRa__7234y ago

Moriya Shunto

lichteins4y ago

Mayano Top Gun

toradesu4y ago

night of tokyo
