The same way most of us review our compiler generated code today (ie not at all). If it works it works, if doesn't we fix the higher level input and try again. I won't be surprised if in a few more generation the AI will skip the human readable code step and generate ASTs directly.
> if doesn't we fix the higher level input and try again
How can I visit this fantasy world of yours where LLMs are as reliable and deterministic as compilers and any mistakes can be blamed solely on the user?