Stable Diffusion animation (opens in new tab)

(replicate.com)

616 pointsgcollard-3y ago96 comments

96 comments

74 comments · 24 top-level

desindol3y ago· 14 in thread

Maybe I can shine some light on the debate from an concept artist standpoint that works in VFX and advertising. I worked on feature films (3 of them in the imdb top 100), tv shows (like game of thrones) and hundreds of AD campaigns.

In the last 10 year the work of a concept artist changed dramatically we have gone from purely painted concept art to mostly "photobashed". Photobashed means basically that you rip apart other images and stitch them together to get the desired image. Some start with a rough sketch for the composition or make really rough grey shade 3d model and "overpaint" them. When it comes to "photobashing" the disregard for copyrights was always there and it's the worst in smaller studios and a bit better in the leading ones. Still most of the time everyone argues that if you only use really small parts of the images it is covered by fair use. There are some examples were studios got sued but mostly without bigger financial impact.

A few months ago I started working with "DiscoDiffusion" to generate the images I use to photobash. "DiscoDiffusion" can produce great "painterly" images but struggles with photorealism and is slower, not as coherent as "StableDiffusion". Still the adoption rate in the concept art community was insanely fast. This all got topped by "StableDiffusion" in the last week. Ofc there are still people that want to do it the "right" way and not use AI but we had the same discussion years ago when "photobashing" came into place and some artists still wanted to paint the whole image. As concept artist you are mostly paid for your design thinking that means it is less about the process and more about the finished product. The turnaround time for styleframes got reduced from 3-4 hours while painting to 45 min - 1 hour when photobashing with stable diffusion me and my peers in the studio are now at 20-45 min per styleframe. When "photobashing" most people constrain themselves on their image library and ressources like Photobashing Kits. Not only does "StableDiffusion" cut the time in half it also gives greater freedom in composition and design especially if you are using img2img.

So where does this leave us? For the work in fast paced art environments like VFX, games, conept art or advertising "StableDiffusion" is a welcome gamechanger. Tradionalists and Artists outside of the industry might feel threatened but for us in these industries it's a god send.

shlip3y ago

I think Tradionalists and Artists do what they do because they enjoy the process of creating/creating something unique, not because they want to be productive/fast.

So yeah, StableDif is great for commercially constrained environments, plus, it frees some of your time so you can enjoy creating art that matters to you :p

cerol3y ago

> So yeah, StableDif is great for commercially constrained environments

For me, that's the big insight.

Not just SD, but any synthetic media. My take? The "upcoming revolution" is not one of a boss firing their artists and making prompts themselves. It's a shift from addictive art" to subtractive art* in commercial environments.

dougmwne3y ago

Awesome perspective. Would you be able to link to an an example of what a finished style frame looks like using Stable Diffusion? I am curious what human and machine can achieve in 20 minutes.

VBprogrammer3y ago

I wondered if we were suffering from a collective blind spot where, for example, an outsider looking at copilot might decide that it's revolutionary for writing code where, in practice, I don't know anyone using it in anger.

On the other hand artists are amazingly adept at leveraging new technology, mediums and techniques to create art so it's probably less surprising if they jump at Stable Diffusion in a lot of corners of the industry.

qqqwerty3y ago

Almost certainly, in the near term, these tools are going to make folks a little more productive and will not be replacing anyones job. Every digital product, whether it be SAAS software, games, or whatever, is heavily constrained by labor availability. Show me one product that was feature complete and exactly matched the scope and scale of the original vision, and I will show you 100's that needed to prioritize features and downsize scope in order to deliver in a timely manner.

And FWIW, I think these tools are still a little too limited to replace even a single person. They might make someone slightly more productive, but the modern stack for both programmers and digital artists is many levels deep. Automating any one of those levels is not sufficient to replace a human. You would need to automate every slice of the stack, plus the work required to stitch those slices together.

madelyn3y ago

100% agreed!

My first thoughts when I saw DALL-E were, "Wow, I can't wait to matte paint with this!"

A texture here, something robot like here, some technical or magical doodads there. Smash them all together and even use img2img to blend things, then do the finishing touches and lighting by hand. It's so nice for speed "painting".

As a programmer, it honestly reminds me of the situation with Copilot.

belter3y ago

Thanks a lot for your insight, it's great to hear from somebody in the industry. This same industry, is logically, also quite keen on enforcing the rights of creators and media copyright.

Most of the discussion, has been on the technical details of these very interesting advances.

Do you think it will be concern for the production process, the origin of the source data used to train these models?

desindol3y ago

The most images are made in preproduction and there won’t be concern but there is a lot going on creating datasets specific to clients like marvel or Star Wars for which they have millions of images in archives.

Llamamoe3y ago

This sounds like exactly what I always wanted to learn to do, but never knew it existed as a field enough to actually get into it. Honestly thank you so much for this comment.

Do you have resources about getting into this style of art as a novice?

Also what StableDiffusion and other AI tools do you recommend?

omnimus3y ago

This honestly only confirms their fears. What you are saying is "yeah we have been photobashing all those artist for a long time anyway now its just much more efficient and automated".

I would be pretty afraid and angry if i was artist too.

desindol3y ago

We are these artists or better art directors.

1 more reply

Llamamoe3y ago

Can I ask again about your recommendations for getting into photobashing and AI-assisted air again?

quadcore3y ago

I think people here are confusing everything. And Parent you dont realize that 10 years from now you might lose your job not because of some external factors but because you mindlessly got yourself out of your own job by using tools you dont understand. So let me clarify a few points.

0. Parent, every input you put in that box is an input you soon wont have to make (and that will feel great at first) until you have no input to give anymore.

0.5 By art, I mean entertainment mainly. Not constrained to fine art.

1. Of course this looks like a tool now. And tools are great. We gota make sure they stay tools though.

2. Whatever work you do now with them, the AI will learn to do it by itself without you. You are training the AI.

3. The copyright issue is not unlike music and games in the early 2000'. You would pay if you were proposed the right delivery method.

3.5 the AI isnt creative, this is sophisticated plagiarism.

4. Moore's law. Moore's law. Moore's law. There are for-profit companies behind these AI that have no limit in the amount of value they wanna grab. Their goal is that we can click on a button and voila, you have a movie.

5. Please automate my job you may think. Absolutely, I wish mine (software developer) was. But art / entertainment is different. Now I cant explain it entirely, Im not philosopher (there is something about we still gota control what we do and do not) but if you automate art, our brain becomes useless. Our soul dies. And nature tend to recycle whats useless. Turns it into food.

6. "We ll just do some new art / entertainment". Which the AI will mimic the next day. So you cant even hope to work on entertainment anymore. It's a death I cant qualify.

The core of whats wrong here is ultimately, in a perfect world, an artist could choose if he wants the AI to scan its work (i.e. integrate its work in the tool) and be payed for it. That way we would choose what we like to do and automate everything else. If we cant do this, the AI will do everything even what we should be doing. So we should ban those models (or improve their practicality).

desindol3y ago

See I am not a concept artist I am an Art/Creative Director… if I tell a machine what to do or a person doesn’t change my job. Even if it goes away it’s fine I’ll do something else. Crazy I know.

1 more reply

autoexec3y ago· 7 in thread

AI in animation has been interesting to me for a while now. It leaves me a little conflicted though. If we get to the point where we can throw key drawings at AI and let it handle all the inbewteens without a bunch of tweaking and cleanup afterwards it's going to really suck for places like Korea! I guess all those inbeatweeners will just be another victim of automation.

I've always loved animation, but I'll admit part of that comes from the hubris involved. It's pure insanity that people ever drew, by hand, mountains of individual drawings each slightly changed and assembled them into compelling illusions to tell stories. The amount of work that goes into animation is just staggering and anyone sensible would have rejected the entire concept as absurd. I wonder if animation will start losing part of its magic for me when it's done primarily by AI.

On the other hand though, another thing I've always loved about animation as a storytelling medium is that it isn't as limited by practical concerns like physics or reality. If something can be imagined, it can be drawn and animated if somebody has the skill and the resources to fund the massive amounts of work. It's time/money that forces animators to take shortcuts and make compromises. Creative decisions are made and rejected all the time due to those constraints. If AI driven animation gets more advanced to the point where that's no longer such a barrier it could create output more in line with the vision of creators and that's exciting too!

I hope that traditional hand drawn animation never dies, but I look forward to seeing how AI continues to change the industry and the output.

pdpi3y ago

> It's pure insanity that people ever drew, by hand, mountains of individual drawings each slightly changed and assembled them into compelling illusions to tell stories.

Traditional animation by itself is nothing short of insanity, convincingly blending live action and traditional animation takes it a step further, and then there's the "Bumping the Lamp"[0] scene in Who Framed Roger Rabbit.

The film as a whole refuses to keep the camera static to make things easy for the animators, which was unusual enough by itself, but then they went above and beyond — they casually bumped a pendant lamp and let it flail about. Every time it slows down, it gets bumped again. And they shaded and cast shadows for the damned rabbit for every single frame of that sequence. Madness.

0. https://www.youtube.com/watch?v=_EUPwsD64GI

barrysteve3y ago

This is not an example of AI handling the inbetweens.

This water-morph effect is undesirable for inbetweens.

autoexec3y ago

You're right, but projects like this do make me think AI handling inbetweens well enough to replace animators is where we'll end up eventually and that it's probably not far off.

1 more reply

megablast3y ago

> leaves me a little conflicted though. If we get to the point where we can throw key drawings at AI and let it handle all the inbewteens without a bunch of tweaking and cleanup afterwards it's going to really suck for places like Korea!

These comments on every single post are getting really boring.

vasco3y ago

You can probably expect them for any interesting technology forever into the future, since people have made these useless complaints for hundreds of years at least.

1 more reply

autoexec3y ago

I think it's pretty normal for people to muse about the impacts future technologies are likely to have on people's lives, including those people who will find themselves out of work. I'm not even advocating that we try to turn back the clock or hold back progress to preserve anyone's careers because I think that'd be boring since it's pretty much settled (it's not going to happen and it's not worth trying to hold back progress).

While I can't expect it to interest everyone, I don't personally mind discussions of specific industries when it looks like their time is coming up though. Each industry is going to have to deal with the change in their own way and we'll all have to adapt in different ways. The more interested in the industry I am, the more interesting I'll find it's decline/collapse. Brace yourself, because when AI comes for the coders that topic is going to dominate this site for some time (at least until the AIs themselves start commenting)

makobado3y ago

this is very true

Dwedit3y ago· 7 in thread

In this example, 25 frames are generated using Stable Diffusion, then frames are interpolated using FILM-Net. I hadn't see FILM-net before, it looks really neat.

o_____________o3y ago

This? Looks pretty amazing:

https://film-net.github.io

https://github.com/google-research/frame-interpolation

bfirsh3y ago

You can run it on Replicate too! https://replicate.com/google-research/frame-interpolation

eezurr3y ago

Neat. Didnt Microsoft release a tool that morphed between two photos a decade ago? I dont recall the name unfortunately, but the effect was similar, less quality.

Jaruzel3y ago

Back in the DOS days I had (and a still do) a copy of a tool called 'Morph', it took two GIF images, and you placed marker points on the first image and then again on the second, and it generated an MPEG2 file of the morphing.

Very impressive for the time.

2 more replies

maven293y ago

You're thinking of Microsoft PhotoSynth from 2006. This is similar to what google streetview showed up with.

From what I can see, they later revamped PhotoSynth to include actual 3d mesh reconstruction in 2014.

nl3y ago

Morphing is different to interpolation and has been available on consumer computers since the 1990s (I remember making gifs in 1996 with it).

In morphing you end up with a blury mess for the between frames. This technique tracks individual features (eg eyes) and keeps them coherent.

nopenopenopeno3y ago

I was doing this with an After Effects plugin called Twixtor long before a decade ago.

https://revisionfx.com/products/twixtor/

r3trohack3r3y ago· 5 in thread

I just came across this on twitter, every frame appears to be an evolution of the previous frame using img2img paired with a tilt/zoom to create a psychedelic animation.

The author claims to have made this with Stable Diffusion, Disco, and Wiggle: https://www.youtube.com/watch?v=Nz_n0qxqoPg

I believe Wiggle is used to automate the tilt/zoom between frames.

waking_at_night3y ago

Thats really great and has come a long way since the beginnings... I had this video animation going on back in the day when VQGAN was still all the rave:

https://www.youtube.com/watch?v=CgDbbg802-8

Its incredible what 6 months only can do

westoncb3y ago

Interesting to see in comparison to those old latent space zoom vids making the rounds in like ~2014..

alx__3y ago

That is wild, feels like an intense dream you can't wake up from

alhirzel3y ago

Reminds me of Electric Sheep[1] - I can't wait until someone hooks up news stories to have abstract news as a screensaver! Also reminds me of a time lapse of someone painting - like [2]

[1]: https://electricsheep.org/

[2]: https://www.youtube.com/watch?v=2O4ccHgfcl8

actionfromafar3y ago

I like their earlier work more, with the audio pictograms.

1 more reply

fagerhult3y ago· 4 in thread

Andreas, author of the Replicate model here -- though "author" feels wrong since I basically just stitched two amazing models together.

The thing that really strikes me is that open source ML is starting to behave like open source software. I was able to take a pretrained text-to-image model and combine it with a pretrained video frame interpolation model and the two actually fit together! I didn't have to re-train or fine tune or map between incompatible embedding spaces, because these models can generalize to basically any image. I could treat these models as modular building blocks.

It just makes your creative mind spin. What if you generate some speech with https://replicate.com/afiaka87/tortoise-tts, generate an image of an alien with Stable Diffusion, and then feed those two into https://replicate.com/wyhsirius/lia. Talking alien! Machine learning is starting to become really fun, even if you don't know anything about partial derivatives.

benreesman3y ago

For the moment at least I'm personally more interested in the image applications than video use cases, but even so this is just fantastic for helping to develop an intuition about how the diffusion mechanism works.

It's admirable that you're so modest regarding the antecedent work, but sometimes it's the "obvious in hindsight" compositional insights that really open up the possibility space. Top work!

noduerme3y ago

It's a nifty piece of work. Often when you're trying to get an answer from a regression model or a neural net you have to try to craft your inputs so carefully that you already sort of know, intuitively, what it will figure out. In some way the thought process of the refining the input is more valuable in a lot of quantitative cases than the actual output.

This is simply very impressive... whether or not it was humbly stitched together, you were sort of the first to do it, so take pride.

The next real magic will be reading its net and figuring out how to get [vfx/film] effects from it... which if I were you would probably occupy 22 hours of my day now.

NaturalPhallacy3y ago

>Talking alien!

Maybe that's what we were supposed to do all along. Not find or be found by aliens, but to invent them.

rnjesus3y ago

that’s exactly what we’ve been doing

mrpf1ster3y ago· 4 in thread

I feel like I’m watching an explosion of progress in AI image generation in real-time.

Every day there’s a new application of Stable Diffusion. It’s incredible to watch unfold

zmmmmm3y ago

I think it's pretty notable how most of the explosion happened since stable diffusion released their model and code as open source, while Dall-E generated initial excitement with their closed source model, but limited progress / creativity since. It's a pretty nice demonstration I think of how much innovation can happen from openness.

kristopolous3y ago

Yes. Example #5*10^7 or so. Some people are just opposed ideologically or due to temperament. Locking things down is one of the best ways to make them die

napolux3y ago

GPT-4 (or a reduced version of it) should be opensource too if you ask me :P

ducktective3y ago

Incredible for you, depressing for me.

I feel left behind...

justinlloyd3y ago· 4 in thread

Last year, when 3090 GPUs were astronomically priced, I thought "screw it, I'll just buy an RTX A5000 for a couple of hundred bucks more." Which begat a second A5000 for "reasons." It was almost prescient. Now all these models are coming out requiring slightly higher VRAM GPUs than a 3090, i.e. more in the range of the A5000, and I get to run them. I am a kid in a candy store this past couple of weeks.

rjh293y ago

The A5000 and 3090 both have 24GB of ram?

justinlloyd3y ago

I should have mentioned linking the GPUs. I meant requiring slightly more VRAM than a single 3090 can handle. Or a 3080 as others have pointed out. The main difference between the way the 3090 works and the A5000 works is SLI vs NvLink/NvSwitch. I believe the 3090 uses NvLink, but not quite in the same way the A5000 does. I can chain together far more A5000's than I can 3090's. Eight A500's vs four 3090's IIRC. And even chain them across machines with the right h/w, though that's probably a bit of a stretch for my budget. Also, the A5000 will share VRAM, giving me a total usable heap of 48GB with two cards, whereas the 3090 will be limited to 24GB each. I can also share the A5000's with multiple VMs simultaneously, whereas with the 3090's I am stuck doing GPU pass through. All that for only a couple of percentage points drop in performance in video games.

1 more reply

dannyw3y ago

Maybe GP meant 3080.

1 more reply

trollied3y ago

The 3060 isn't a bad choice, seeing as it has 12Gb of VRAM.

NoMoreBro3y ago· 2 in thread

Very cool! I generated this with 1000 images https://twitter.com/UnshushProject/status/156315821457709465... using the Deforum's Colab[1], it's really easy and now has interpolation too. It was the very first video, I could have made something great but, you know, awesome guys keep releasing AI tech and I'm like a child at Luna Park right now, not able to concentrate.

If you are interested in my project (I doubt, you are too busy playing like me) I'm posting a lot of things on https://unshush.com and on the Instagram account: https://www.instagram.com/unshushproject/ (Sorry for posting my stuff but I'm not very social so no one will ever see them)

If you want to generate videos I can share some links I bookmarked of software/code to make them more smooth.

[1] Deforum's Colab (based on Stable Diffusion): https://colab.research.google.com/github/deforum/stable-diff...

yreg3y ago

I'm definitely interested in your video related bookmarks.

You have generated some pretty cool designs.

NoMoreBro3y ago

Sure! This would be my approach (and tools) if I was smarter:

If you make the generations with some similarities and use the right interpolation, you don't need 1000 images like my video and can obtain a smooth movement.

First, generate images with some kind of visual anchor (background, an object). You can use frames generated using the previous frame as reference image, or the same seed but different prompt/parameters, or you can go wild using img2img/inpainting (btw I struggle to find an inpainting tool for Stable Diffusion: they seem to be just img2img with a mask, without contest).

Then pass the generated images to one of the most recent interpolation algorithms, like this one https://github.com/megvii-research/ECCV2022-RIFE or the one used in the replicate we are commenting on (someone posted this reference: https://github.com/google-research/frame-interpolation )

The first link reports some free and paid implementation and a Colab, so depending on how deep you want to go, you have a lot of choices.

In the end, I'd use some good app to stabilize the image if needed, to get a more "calm" look. I use Luma Fusion, but it's a paid app (cheap, one-time payment, for iOS). I'm sure there are a ton of open-source implementations.

It's an approach similar to the animation on replicate, but it allows a lot of fine-tuning and you can add new animation ideas/tools to the process.

Nothing revolutionary, but I hope it helps!

> You have generated some pretty cool designs.

Thanks! I put in a lot of work in the last weeks. The project has a mission, I wrote something, but it's not ready yet. I believe it will be with the launch of Dall-E 8 :-/

Lwepz3y ago· 2 in thread

It's clear that the next frontier is to have 3D-space instead of image space transitions. Language itself is very static and action verbs are not enough to specify scene dynamics. I suppose we would need: A. an enriched version of natural language that refines the dynamic processes that occur in a scene B. a data set of isolated processes labeled in the language described in A.

I've had a hard time finding ongoing work on A. and B, perhaps it isn't much of a priority for research groups.

redox993y ago

For 3D we would probably need something like Blender or similar, because at some point it's just easier to use a 3D software to pinpoint where you want stuff to be, than try to use words.

Imagine opening blender and typing

> A medium sized classroom, well lit, with two blackboards and many geography posters

And the AI just generates all the 3D meshes and places them appropriately.

Repeat that for other props or characters that you need. After that you can manually tweak the scene as you currently would (moving things, etc).

Then you select a character, and to animate you tell it

> The character calmly walks to the door and proceeds to open it

You could literally do a 100+ hour job in 5 minutes.

danwills3y ago

I think you're right about the possibilities here and I love the idea and have thought similar things myself too, but to me the inclusion of a 3D element should probably be a format, not necessarily locked into any specific app such as Blender. Maybe (Pixar) USD is one possible format that could be used for general 3D interchange for this kind of thing?

1 more reply

cerol3y ago· 1 in thread

My predictions for 10 - 15 years:

- Mandela effect for famous art pieces: "Monalisa was AI generated" "No it wasn't" "Yes it was".

- Art critics will get the last laugh, as people start giving them truckloads of money to ask whether a piece of art is human or AI generated.

ducktective3y ago

>Art critics will get the last laugh, as people start giving them truckloads of money to ask whether a piece of art is human or AI generated.

- Each image or painting should have a reference to the organization maintaining it

- The organization produces hashes for its artworks

- Users compare hashes

- Problem solved

fragmede3y ago

There's also https://gist.github.com/karpathy/00103b0037c5aaea32fe1da1af5...

zone4113y ago

I played a bit with smoothly interpolating between Stable Diffusion prompts and the effect can be pretty cool but it's hard to avoid discontinuities (like the object changing its orientation), even when using some additional tricks like reusing the previous frame as the initial image or generating several new frames and choosing the one that's closest to the previous frame. You basically have to get lucky with the seed. It probably makes most sense to just wait for video models that take temporal consistency into account explicitly or generate 3D models. There is a lot of promising research out there already, so it's just a matter of time.

gdubs3y ago

I've been playing around w/ StableDiffusion animation using the "deforum" notebook. It's taken a bit to really understand how to get results I like with it, but I'm super happy with how this one came out:

https://twitter.com/dreamwieber/status/1565008078466326528?s...

It's a pretty magical time with this tech. Things are moving very rapidly and I feel excited the way I did when I first rendered 3d animation on my 286 from RadioShack.

synu3y ago

Wow, this is incredible. AI tech has been so interesting to follow along lately.

Is there something like an index of cool new AI projects that is easy to follow? HN works for this to an extent but I’d love to track more closely.

monkeydust3y ago

It is incredible how fast this thing is progressing. Amazing what you can do with some very smart people and 4,000 A100 cluster !

What is getting very clear though and this link proves it out is that 'prompt engineering' is really a thing, I tried this out and it took a while to get something I would consider half decent.

I feel like there is a space here for tools / technologies to 'suggest' prompts based on understanding user intentions. If anyone is actually working on this then reach out to me. Email on profile.

hackerlight3y ago

Will models like Stable Diffusion be useful for self-driving car research? Like you've got this large NN with weights that are useful for this vision-adjacent task, it should have learned concepts such as edge detection, which could serve as pretrained weights for a self-driving NN?

imhoguy3y ago

Amazing! So counting years now when a first AI feature film hits the cinema screens. Source code: screenplay text. I imagine it may look a bit like "A Scanner Darkly" (2006).

simultsop3y ago

Brace yourselves, Cartoon AI Network, coming soon xD

amelius3y ago

I guess it is missing the physics simulation between frames. Perhaps that is the next big step for ML to get right.

shlip3y ago

Great, Miyazaki and Lasseter can finally retire.

jdamon963y ago

incredible!

t2hv333y ago

pupe1511393y ago

Com.Samsung.Android.Game.gos:2290:9908:313b42360002

pupe1511393y ago

Com.Samsung.Android.Game.gos:2290:9908:313b42360002

j / k navigate · click thread line to collapse

96 comments

74 comments · 24 top-level

desindol3y ago· 14 in thread

shlip3y ago

I think Tradionalists and Artists do what they do because they enjoy the process of creating/creating something unique, not because they want to be productive/fast.

So yeah, StableDif is great for commercially constrained environments, plus, it frees some of your time so you can enjoy creating art that matters to you :p

cerol3y ago

> So yeah, StableDif is great for commercially constrained environments

For me, that's the big insight.

dougmwne3y ago

Awesome perspective. Would you be able to link to an an example of what a finished style frame looks like using Stable Diffusion? I am curious what human and machine can achieve in 20 minutes.

VBprogrammer3y ago

qqqwerty3y ago

madelyn3y ago

100% agreed!

My first thoughts when I saw DALL-E were, "Wow, I can't wait to matte paint with this!"

As a programmer, it honestly reminds me of the situation with Copilot.

belter3y ago

Thanks a lot for your insight, it's great to hear from somebody in the industry. This same industry, is logically, also quite keen on enforcing the rights of creators and media copyright.

Most of the discussion, has been on the technical details of these very interesting advances.

Do you think it will be concern for the production process, the origin of the source data used to train these models?

desindol3y ago

Llamamoe3y ago

This sounds like exactly what I always wanted to learn to do, but never knew it existed as a field enough to actually get into it. Honestly thank you so much for this comment.

Do you have resources about getting into this style of art as a novice?

Also what StableDiffusion and other AI tools do you recommend?

omnimus3y ago

This honestly only confirms their fears. What you are saying is "yeah we have been photobashing all those artist for a long time anyway now its just much more efficient and automated".

I would be pretty afraid and angry if i was artist too.

desindol3y ago

We are these artists or better art directors.

1 more reply

Llamamoe3y ago

Can I ask again about your recommendations for getting into photobashing and AI-assisted air again?

quadcore3y ago

0. Parent, every input you put in that box is an input you soon wont have to make (and that will feel great at first) until you have no input to give anymore.

0.5 By art, I mean entertainment mainly. Not constrained to fine art.

1. Of course this looks like a tool now. And tools are great. We gota make sure they stay tools though.

2. Whatever work you do now with them, the AI will learn to do it by itself without you. You are training the AI.

3. The copyright issue is not unlike music and games in the early 2000'. You would pay if you were proposed the right delivery method.

3.5 the AI isnt creative, this is sophisticated plagiarism.

6. "We ll just do some new art / entertainment". Which the AI will mimic the next day. So you cant even hope to work on entertainment anymore. It's a death I cant qualify.

desindol3y ago

1 more reply

autoexec3y ago· 7 in thread

I hope that traditional hand drawn animation never dies, but I look forward to seeing how AI continues to change the industry and the output.

pdpi3y ago

> It's pure insanity that people ever drew, by hand, mountains of individual drawings each slightly changed and assembled them into compelling illusions to tell stories.

0. https://www.youtube.com/watch?v=_EUPwsD64GI

barrysteve3y ago

This is not an example of AI handling the inbetweens.

This water-morph effect is undesirable for inbetweens.

autoexec3y ago

You're right, but projects like this do make me think AI handling inbetweens well enough to replace animators is where we'll end up eventually and that it's probably not far off.

1 more reply

megablast3y ago

These comments on every single post are getting really boring.

vasco3y ago

You can probably expect them for any interesting technology forever into the future, since people have made these useless complaints for hundreds of years at least.

1 more reply

autoexec3y ago

makobado3y ago

this is very true

Dwedit3y ago· 7 in thread

In this example, 25 frames are generated using Stable Diffusion, then frames are interpolated using FILM-Net. I hadn't see FILM-net before, it looks really neat.

o_____________o3y ago

This? Looks pretty amazing:

https://film-net.github.io

https://github.com/google-research/frame-interpolation

bfirsh3y ago

You can run it on Replicate too! https://replicate.com/google-research/frame-interpolation

eezurr3y ago

Neat. Didnt Microsoft release a tool that morphed between two photos a decade ago? I dont recall the name unfortunately, but the effect was similar, less quality.

Jaruzel3y ago

Very impressive for the time.

2 more replies

maven293y ago

You're thinking of Microsoft PhotoSynth from 2006. This is similar to what google streetview showed up with.

From what I can see, they later revamped PhotoSynth to include actual 3d mesh reconstruction in 2014.

nl3y ago

Morphing is different to interpolation and has been available on consumer computers since the 1990s (I remember making gifs in 1996 with it).

In morphing you end up with a blury mess for the between frames. This technique tracks individual features (eg eyes) and keeps them coherent.

nopenopenopeno3y ago

I was doing this with an After Effects plugin called Twixtor long before a decade ago.

https://revisionfx.com/products/twixtor/

r3trohack3r3y ago· 5 in thread

I just came across this on twitter, every frame appears to be an evolution of the previous frame using img2img paired with a tilt/zoom to create a psychedelic animation.

The author claims to have made this with Stable Diffusion, Disco, and Wiggle: https://www.youtube.com/watch?v=Nz_n0qxqoPg

I believe Wiggle is used to automate the tilt/zoom between frames.

waking_at_night3y ago

Thats really great and has come a long way since the beginnings... I had this video animation going on back in the day when VQGAN was still all the rave:

https://www.youtube.com/watch?v=CgDbbg802-8

Its incredible what 6 months only can do

westoncb3y ago

Interesting to see in comparison to those old latent space zoom vids making the rounds in like ~2014..

alx__3y ago

That is wild, feels like an intense dream you can't wake up from

alhirzel3y ago

Reminds me of Electric Sheep[1] - I can't wait until someone hooks up news stories to have abstract news as a screensaver! Also reminds me of a time lapse of someone painting - like [2]

[1]: https://electricsheep.org/

[2]: https://www.youtube.com/watch?v=2O4ccHgfcl8

actionfromafar3y ago

I like their earlier work more, with the audio pictograms.

1 more reply

fagerhult3y ago· 4 in thread

Andreas, author of the Replicate model here -- though "author" feels wrong since I basically just stitched two amazing models together.

benreesman3y ago

It's admirable that you're so modest regarding the antecedent work, but sometimes it's the "obvious in hindsight" compositional insights that really open up the possibility space. Top work!

noduerme3y ago

This is simply very impressive... whether or not it was humbly stitched together, you were sort of the first to do it, so take pride.

The next real magic will be reading its net and figuring out how to get [vfx/film] effects from it... which if I were you would probably occupy 22 hours of my day now.

NaturalPhallacy3y ago

>Talking alien!

Maybe that's what we were supposed to do all along. Not find or be found by aliens, but to invent them.

rnjesus3y ago

that’s exactly what we’ve been doing

mrpf1ster3y ago· 4 in thread

I feel like I’m watching an explosion of progress in AI image generation in real-time.

Every day there’s a new application of Stable Diffusion. It’s incredible to watch unfold

zmmmmm3y ago

kristopolous3y ago

Yes. Example #5*10^7 or so. Some people are just opposed ideologically or due to temperament. Locking things down is one of the best ways to make them die

napolux3y ago

GPT-4 (or a reduced version of it) should be opensource too if you ask me :P

ducktective3y ago

Incredible for you, depressing for me.

I feel left behind...

justinlloyd3y ago· 4 in thread

rjh293y ago

The A5000 and 3090 both have 24GB of ram?

justinlloyd3y ago

1 more reply

dannyw3y ago

Maybe GP meant 3080.

1 more reply

trollied3y ago

The 3060 isn't a bad choice, seeing as it has 12Gb of VRAM.

NoMoreBro3y ago· 2 in thread

If you want to generate videos I can share some links I bookmarked of software/code to make them more smooth.

[1] Deforum's Colab (based on Stable Diffusion): https://colab.research.google.com/github/deforum/stable-diff...

yreg3y ago

I'm definitely interested in your video related bookmarks.

You have generated some pretty cool designs.

NoMoreBro3y ago

Sure! This would be my approach (and tools) if I was smarter:

If you make the generations with some similarities and use the right interpolation, you don't need 1000 images like my video and can obtain a smooth movement.

The first link reports some free and paid implementation and a Colab, so depending on how deep you want to go, you have a lot of choices.

It's an approach similar to the animation on replicate, but it allows a lot of fine-tuning and you can add new animation ideas/tools to the process.

Nothing revolutionary, but I hope it helps!

> You have generated some pretty cool designs.

Thanks! I put in a lot of work in the last weeks. The project has a mission, I wrote something, but it's not ready yet. I believe it will be with the launch of Dall-E 8 :-/

Lwepz3y ago· 2 in thread

I've had a hard time finding ongoing work on A. and B, perhaps it isn't much of a priority for research groups.

redox993y ago

For 3D we would probably need something like Blender or similar, because at some point it's just easier to use a 3D software to pinpoint where you want stuff to be, than try to use words.

Imagine opening blender and typing

> A medium sized classroom, well lit, with two blackboards and many geography posters

And the AI just generates all the 3D meshes and places them appropriately.

Repeat that for other props or characters that you need. After that you can manually tweak the scene as you currently would (moving things, etc).

Then you select a character, and to animate you tell it

> The character calmly walks to the door and proceeds to open it

You could literally do a 100+ hour job in 5 minutes.

danwills3y ago

1 more reply

cerol3y ago· 1 in thread

My predictions for 10 - 15 years:

- Mandela effect for famous art pieces: "Monalisa was AI generated" "No it wasn't" "Yes it was".

- Art critics will get the last laugh, as people start giving them truckloads of money to ask whether a piece of art is human or AI generated.

ducktective3y ago

>Art critics will get the last laugh, as people start giving them truckloads of money to ask whether a piece of art is human or AI generated.

- Each image or painting should have a reference to the organization maintaining it

- The organization produces hashes for its artworks

- Users compare hashes

- Problem solved

fragmede3y ago

There's also https://gist.github.com/karpathy/00103b0037c5aaea32fe1da1af5...

zone4113y ago

gdubs3y ago

https://twitter.com/dreamwieber/status/1565008078466326528?s...

It's a pretty magical time with this tech. Things are moving very rapidly and I feel excited the way I did when I first rendered 3d animation on my 286 from RadioShack.

synu3y ago

Wow, this is incredible. AI tech has been so interesting to follow along lately.

Is there something like an index of cool new AI projects that is easy to follow? HN works for this to an extent but I’d love to track more closely.

monkeydust3y ago

It is incredible how fast this thing is progressing. Amazing what you can do with some very smart people and 4,000 A100 cluster !

What is getting very clear though and this link proves it out is that 'prompt engineering' is really a thing, I tried this out and it took a while to get something I would consider half decent.

I feel like there is a space here for tools / technologies to 'suggest' prompts based on understanding user intentions. If anyone is actually working on this then reach out to me. Email on profile.

hackerlight3y ago

imhoguy3y ago

Amazing! So counting years now when a first AI feature film hits the cinema screens. Source code: screenplay text. I imagine it may look a bit like "A Scanner Darkly" (2006).

simultsop3y ago

Brace yourselves, Cartoon AI Network, coming soon xD

amelius3y ago

I guess it is missing the physics simulation between frames. Perhaps that is the next big step for ML to get right.

shlip3y ago

Great, Miyazaki and Lasseter can finally retire.

jdamon963y ago

incredible!

t2hv333y ago

pupe1511393y ago

Com.Samsung.Android.Game.gos:2290:9908:313b42360002

pupe1511393y ago

Com.Samsung.Android.Game.gos:2290:9908:313b42360002

j / k navigate · click thread line to collapse