ChatGPT knows a good answer to the question even without embeddings. But this particular tool can't replicate it, and says, e.g., "In summary, these verses highlight sexual ethics and various sins in general, but do not uniformly condemn or condone homosexuality specifically." That summary isn't wrong; the tool just retrieved the wrong verses. (It also gives a different summary on different tries.)
This is a common problem with embedding search. Obviously, the other, traditional techniques would be even worse. But I'd love for these systems to be better, so I'm proposing a potential solution and asking for others. I won't be content with your "put up with AI idiosyncrasies and weaknesses, as if they were actual conceptual limits of knowledge" approach. AI has the potential to create great UI, but your attitude won't help with that.