They work when there are a lot of examples on GitHub or Google, but once you get into something that doesn't have many examples, like closed-source code or rarely used libraries, they start hallucinating and even mixing up different API versions, creating a mess that doesn't work at all.
I don't believe LLMs will get any better than this without a major new breakthrough, but this is already better than using Google search.