* If you put a few splashes of paint on a canvas you get something that looks pretty interesting for a minute, but it's not the picture in your head, and after a while all splashes start to look the same.
* If you grab some pencils and scribble all over a page, you'll get something that looks interesting for a minute, but it's not the picture in your head, and after a while all scribbles start to look the same.
* If you throw some geometric shapes into blender and run a render, you'll get something that looks interesting for -ok maybe a couple of minutes-, but it's not the picture in your head, and after a while all geometric shapes start to look the same.
* If you throw a few tokens into a prompt and hit generate, you'll get something that looks interesting for -now maybe a couple of hours?- but it's still not the picture in your head, and after a while all generated images start to look the same.
Each time, if you want to actually get anywhere close to the picture in your head, you're going to need to practice, and you're going to have to learn the medium, and you'll have a lot of experiences along the way. "There's no mistakes, just happy little trees", right? But the actual path -and the happy little trees you experience along the way- will be rather different for canvas, pencil, blender, or stable diffusion.
Have you ever tried to hold a 7-dimensional, non-linear, chaotic landscape in your head whilst trying to resolve nightmare images of deformed limbs and eldritch landscapes?