there are more elegant ways to leverage an LLM, see AlphaEvolve: https://arxiv.org/abs/2506.13131
it's difficult to frame most coding tasks in such a way where you can trivially verify correctness.
No comments yet.