undefined | Better HN

0 pointslondons_explore3y ago0 comments

See the link in my post. It asks you to run the tool. You run the tool and tell it the result... And then it uses the result of the tool to decide to reply to the user.

The link talks about tools that 'lie' - ie. a calculator which deliberately tries to trick GPT-4 into giving the wrong answer. It turns out that GPT-4 only trusts the tools to a certain extent - if the answer the tool gives is too unbelievable, then GPT-4 will either re-run the tool or give a hallucinated answer instead.

0 comments

qbasic_forever3y ago

It's always giving a hallucinated answer. GPT doesn't 'run' anything. It sees an input string of text asking for the result of fibonacci(100) and finds from its immense training set a response that's closely related to training data that had the result of fibonacci(100) (an extremely common programming exercise with results all over the internet and presumably its training data).

Again, GPT is not running a tool or arbitrary python code. It's not applying trust to a tool response. It has no reasoning or even a concept of what a tool is--you're projecting that on it. It is only generating text from an input stream of text.

yunyu3y ago

There's nothing stopping you from identifying the code, running it, and passing the output back into the context window.

kolinko3y ago

You didn't read the article, did you?

qbasic_forever3y ago

Langchain has nothing to do with GPT itself or how it operates internally.

1 more reply

j / k navigate · click thread line to collapse

0 comments

qbasic_forever3y ago

yunyu3y ago

There's nothing stopping you from identifying the code, running it, and passing the output back into the context window.

kolinko3y ago

You didn't read the article, did you?

qbasic_forever3y ago

Langchain has nothing to do with GPT itself or how it operates internally.

1 more reply

j / k navigate · click thread line to collapse