undefined | Better HN

0 pointsthot_experiment3y ago0 comments

For what it's worth if you actually want to get help on the state of the art on this stuff the best place to ask is the 4chan /g/ /sdg/ threads, and you can absolutely diffuse images that large using TiledVAE and Mixture of Diffusers or Multidiffusion, both of which are part of the Tiled Diffusion plugin for auto1111.

https://i.imgur.com/zOMarKc.jpg

Here's an example using various techniques I've gathered from those 4chan threads. (yes I know it's 4chan but just ignore the idiots and ask for catboxes, you'll learn much faster than anywhere else, at least that was the case for me after exhausting the resources on github/reddit/various discords)

0 comments

9 comments · 3 top-level

ttctciyf3y ago· 3 in thread

Haha, I've been lurking sdg for the same reason and seen your efforts there which really stand out (in a good way!)

I haven't dared delurk, but if I had it would surely have been to ask for the scoop on how you accomplished these renders.

In this less troublesome venue, care to provide any details?

thot_experimentOP3y ago

I'm not the guy who came up with this style! I just vacuum up catboxes and do a lot of tests/repros. I'm not at my main computer so I can't give you exact gen parameters, but the cliff notes version is:

  - DPM++ 2M K or DPM++ SDE K w/ a high eta are the best samplers in general, the former was used here
  - search seed space at whatever res is reasonable (512x704) before upscaling
  - once you have a good seed hiresfix with ultrasharp or remarci @ 0.5-0.6 denoise (i prefer the former, ymmv)
  - do a second diffusion pass in i2i on top of the hires, but with a simpler prompt focusing on high quality e.x. "gorgeous anime wallpaper, beautiful colors, impeccable lineart"
  - for the second pass (this will be taking you from ~1k x 1.5k to 2.5k x 4k) you're gonna run out of VRAM if you're trying to diffuse traditionally so use the Tiled Diffusion addon with Tiled VAE enabled as well to conserve vram. DDIM seems to work best here though I've gotten good results with the two samplers above as well.
  - using a style finetune helps a lot, Cardos Anime was used here
  - when in doubt, search Civit.ai or huggingface, there are tons of great models/LoRAs and textual inversions out there and if you have anything specific in mind having some form of finetune helps a ton

Obviously you're going to need to know how to prompt as well which is definitely not something I can explain in a single post, just like any kind of art you just have to practice a bunch to gain an intuition for it.

P.S. I've recently started a patreon, if any of you'd like to support my work on this stuff. I'm a big believer in sharing all my knowledge, so most of it will come out for free eventually, but I gotta eat. [0]

[0] https://www.patreon.com/thot_experiment

ttctciyf3y ago

> gotta eat

Yeah :) Thanks for your infodump! I'm mostly curious how you achieve the insane amount of "clutter" in these images - is it done by referencing a specific artist or style in the prompt, or just by some difficult-to-find key phrase? I haven't been able to get anything near.

1 more reply

bigbillheck3y ago

> but I gotta eat.

So do the people who made the art used to train these things.

2 more replies

brigandish3y ago· 2 in thread

> ask for catboxes

What's that?

madmax1083y ago

https://catbox.moe/ ... It's a filehost popular among 4chan users. Basically OP asking folks to "show me the code"

thot_experimentOP3y ago

auto1111 saves generation parameters as text into exif/png data chunks, so you can re-create generations done by other people/yourself in the past. 4chan strips metadata from images, hence the need for catbox.

thunderbong3y ago· 1 in thread

This is amazing. Do you have some place where you've put up your work?

thot_experimentOP3y ago

I appreciate it, I'm trying to be a great artist here so this isn't really my work, it's 90% stolen from anons on 4chan. ;) I post on twitter[0] a reasonable bit but SD art isn't a big focus for me. I also recently started a patreon which is another post in this thread if you're interested in supporting.

[0] https://twitter.com/thot_exper1ment

j / k navigate · click thread line to collapse

0 comments

9 comments · 3 top-level

ttctciyf3y ago· 3 in thread

Haha, I've been lurking sdg for the same reason and seen your efforts there which really stand out (in a good way!)

I haven't dared delurk, but if I had it would surely have been to ask for the scoop on how you accomplished these renders.

In this less troublesome venue, care to provide any details?

thot_experimentOP3y ago

  - DPM++ 2M K or DPM++ SDE K w/ a high eta are the best samplers in general, the former was used here
  - search seed space at whatever res is reasonable (512x704) before upscaling
  - once you have a good seed hiresfix with ultrasharp or remarci @ 0.5-0.6 denoise (i prefer the former, ymmv)
  - do a second diffusion pass in i2i on top of the hires, but with a simpler prompt focusing on high quality e.x. "gorgeous anime wallpaper, beautiful colors, impeccable lineart"
  - for the second pass (this will be taking you from ~1k x 1.5k to 2.5k x 4k) you're gonna run out of VRAM if you're trying to diffuse traditionally so use the Tiled Diffusion addon with Tiled VAE enabled as well to conserve vram. DDIM seems to work best here though I've gotten good results with the two samplers above as well.
  - using a style finetune helps a lot, Cardos Anime was used here
  - when in doubt, search Civit.ai or huggingface, there are tons of great models/LoRAs and textual inversions out there and if you have anything specific in mind having some form of finetune helps a ton

[0] https://www.patreon.com/thot_experiment

ttctciyf3y ago

> gotta eat

1 more reply

bigbillheck3y ago

> but I gotta eat.

So do the people who made the art used to train these things.

2 more replies

brigandish3y ago· 2 in thread

> ask for catboxes

What's that?

madmax1083y ago