undefined | Better HN

0 pointsmyrmidon9mo ago0 comments

Yes, but those hand-crafted rules are just input data, they don't actually constrain the behavior, they are just an attempt.

Similarly to how verbal instruction works with a child: You can tell it not to touch the hot stove, but the child still might try.

0 comments

diggan9mo ago

> they don't actually constrain the behavior

They do actually constraint the behavior, to various degrees of success which depends on the model, the system prompt, the inference parameters, the current context length and a lot more. Add in the new `developer` role and you have another venue for constraining the assistant outputs. Finally, structured outputs can help in forbidding specific terms too.

exe349mo ago

You can zap them with RL.

j / k navigate · click thread line to collapse

0 comments

diggan9mo ago

> they don't actually constrain the behavior

exe349mo ago

You can zap them with RL.

j / k navigate · click thread line to collapse