undefined | Better HN

0 pointskacperlukawski3y ago0 comments

The fact that the model does not have to rely on its internal knowledge anymore but can communicate literally with any external system makes me feel it may significantly reduce the hallucination.

0 comments

majormajor3y ago

If it was easy to simply verify truth "with any external system" then would we even need a language model?

E.g. if you could just ask [THING] for the true answer, or verify an answer trivially with it... just ask it directly!

I ran into this issue with some software documentation just this morning - the answer was helpful but completely wrong in some intermediate steps - but short of a plugin that literally controlled or cloned a similar dev environment to mine that it would take over, it wouldn't be able to tell that the intermediate result was different than it claimed.

CuriouslyC3y ago

If one api knows one set of facts, and another api knows another, ad infinitum, are you going to tell people they should remember which api knows which set of facts and query each individually? Why not have a single service that knows of all the various apis for different things, and can query and synthesize answers that extract the relevant information from all of them (with compare/contrast/etc)?

kacperlukawskiOP3y ago

When you develop a plugin, you provide a description that ChatGPT uses to know when to call that particular service. So you don't need to tell people what they need to use - the model will decide independently based on the plugins you enabled.

That being said - we developed a custom plugin for Qdrant docs, so our users will be able to ask questions about how to do certain things with our database. But I do not believe it should be enabled by default for everybody. A non-technical person doesn't need that many details. The same is for the other services - if you prefer using KAYAK over Expedia, you're free to choose.

2 more replies

vidarh3y ago

ChatGPT is already pretty good at "admitting" it's wrong when it's given the actual facts, so it does seem likely that providing it with a way to e.g. look up trusted sources and ask it to take those sources into consideration might improve things.

majormajor3y ago

I think that helps with "hallucination" but less so with "factuality" (when re-reading the parent discussions, I see the convo swerved a bit between those two, so I think that'll be an increasingly important distinction in the future).

Confirming it's output against a (potentially wrong) source helps the former but not the latter.

bluecrab3y ago

All it needs is guardrails which is available already.

snickerbockers3y ago

That's only going to solve the problem of incorrect facts. I have seen it make logical mistakes as well and having access to external services will not solve that problem.

As an example, I once asked it to show me the diff between two revisions of the code it was writing an it made something that looks like it might be a valid patch but did not represent the difference between the two versions.

Of course this specific problem could be fixed with a simple plug-in that runs the unix diff program but that wouldn't fix the root-cause, and i would argue that providing a special-case for every type of request is antithetical to what AI is supposed to be since this effectively is how alpha and Google already work.

j / k navigate · click thread line to collapse

0 comments

majormajor3y ago

If it was easy to simply verify truth "with any external system" then would we even need a language model?

E.g. if you could just ask [THING] for the true answer, or verify an answer trivially with it... just ask it directly!

CuriouslyC3y ago

kacperlukawskiOP3y ago

2 more replies

vidarh3y ago

majormajor3y ago

Confirming it's output against a (potentially wrong) source helps the former but not the latter.

bluecrab3y ago

All it needs is guardrails which is available already.

snickerbockers3y ago

That's only going to solve the problem of incorrect facts. I have seen it make logical mistakes as well and having access to external services will not solve that problem.

j / k navigate · click thread line to collapse