As has been pointed out ad-nauseam, beginner human artists find hands very hard to draw too - and newer models aren't really making hand mistakes often.
I've never seen significant or systematic errors with faces.
It sounds a little bit like you haven't actually tried these tools. The kinds of errors you seem to think they make just aren't there in practice. I'd encourage you to try them out!
> LLMs imo don't understand the rules of language and context they only approximate what a rule compliant system should output.
This is an area I know a lot about.
There are no real universal rules for English grammar. If you look at something like the Penn treebank you can see that English - as used by humans - is more exceptions than rules. The fact that LLMs outscore any rule based system merely means that our grammar rules are mostly things derived from how English is used in practice, not vice-versa.