Trying to talk it into writing anything other than toy code is an exercise in banging my head against the wall.
I have tried Phind and anything beyond mega junior tier questions it suffers as well and gives bad answers.
I like LLMs for general design work, but I’ve found accuracy to be atrocious in this area.
probably need routers, RAG, and reranking
I think there is a role for LLM + deterministic code gen as well (https://github.com/hofstadter-io/hof/blob/_dev/flow/chat/pro...)