The key purpose of the reinforcement-learning stage (RLHF) is to make it socially acceptable to release the model to the general public without a PR nightmare like Microsoft Tay. That post-training does not make the model "deceptively smart"; it trades away a bit of capability to enforce alignment with certain restrictions. It degrades performance on some tasks: for example, the GPT-4 paper, while very light on other details, provides experimental evidence that this post-training significantly hurts the model's confidence calibration, which reduces its usefulness for tasks where you want to know how certain the model is.
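To make "confidence calibration" concrete: a well-calibrated model is right about 70% of the time when it reports 70% confidence. A standard way to quantify the mismatch is expected calibration error (ECE). Below is a minimal sketch of ECE with made-up toy numbers; it is not the exact procedure from the GPT-4 paper, just an illustration of what "worse calibration" means.

```python
# Minimal sketch: expected calibration error (ECE).
# A calibrated model's stated confidence matches its empirical accuracy;
# post-training tends to worsen this match (typically toward overconfidence).
# The data below is illustrative, not taken from the GPT-4 paper.

def expected_calibration_error(confidences, correct, n_bins=10):
    """Average |confidence - accuracy| over equal-width confidence bins,
    weighted by the fraction of samples falling in each bin."""
    n = len(confidences)
    ece = 0.0
    for b in range(n_bins):
        lo, hi = b / n_bins, (b + 1) / n_bins
        # include the right edge only in the last bin
        idx = [i for i, c in enumerate(confidences)
               if lo <= c < hi or (b == n_bins - 1 and c == hi)]
        if not idx:
            continue
        avg_conf = sum(confidences[i] for i in idx) / len(idx)
        acc = sum(correct[i] for i in idx) / len(idx)
        ece += (len(idx) / n) * abs(avg_conf - acc)
    return ece

# Well-calibrated toy data: 75% confidence, 3 of 4 correct -> ECE 0
print(expected_calibration_error([0.75] * 4, [1, 1, 1, 0]))  # 0.0

# Overconfident toy data: 95% confidence, 2 of 4 correct -> ECE ~0.45
print(expected_calibration_error([0.95] * 4, [1, 0, 1, 0]))
```

A model whose post-training pushes reported confidences toward the extremes, without a matching change in accuracy, shows up directly as a larger ECE.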