
simianwords · 10mo ago

I don't think this is correct. Such training data is usually created at the SFT stage, after unsupervised pretraining on all available web data. The SFT dataset is manually curated, meaning there would be a conscious effort to include more training samples where the right response is "I'm not sure". The same goes for RLHF.

4 comments · 1 top-level

therein · 10mo ago

You mean "I don't think this is automatically correct." Otherwise it very likely is correct. Either way, you're guessing that the manual curation is done in a way that favors including "I don't know" answers, which it most likely isn't.

vidarh · 10mo ago

Having done contract work on SFT datasets, I can say that at least one major provider absolutely includes "don't know" answers of different varieties.

I don't know why you assume it's a guess. These providers employ thousands of people directly or via a number of intermediaries to work on their SFT datasets.

simianwords (OP) · 10mo ago

It's entirely in their interest to include such examples in RLHF. Either that, or you have come up with a way to increase performance that their own employees haven't. Why do you think they didn't try it?

frotaur · 10mo ago

How do you know which questions should be answered with 'I don't know'? There are obvious questions that have no answer, but if only those are in the dataset, the model will answer 'I don't know' only for unreasonable questions.

To train this effectively, you would need a dataset of questions that you know the model doesn't know. But if you have that, why not answer the questions and put them in the dataset so that the model will know?

That's a bit imprecise, but I think it captures the idea of why 'I don't know' answers are harder to train.
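
To make "a dataset of questions which you know the model doesn't know" concrete, here is a minimal sketch of one way such labels could be derived: sample the model several times on questions with known answers, and use 'I don't know' as the training target only where the model is usually wrong. Everything here is an assumption for illustration, not a description of what any provider does: `build_sft_examples`, `sample_answer`, `toy_model`, the exact-match scoring, and the 0.5 threshold are all hypothetical.

```python
from typing import Callable, Dict, List

IDK = "I don't know"


def build_sft_examples(
    qa_pairs: Dict[str, str],
    sample_answer: Callable[[str], str],
    n_samples: int = 5,
    threshold: float = 0.5,
) -> List[dict]:
    """Label each question with the gold answer or IDK.

    The label depends on how often the model being trained already
    answers correctly, so the resulting dataset is specific to that model.
    """
    examples = []
    for question, gold in qa_pairs.items():
        # Sample the model several times and score by exact match
        # (a crude stand-in for a real grader).
        answers = [sample_answer(question) for _ in range(n_samples)]
        correct = sum(a.strip().lower() == gold.strip().lower() for a in answers)
        accuracy = correct / n_samples
        # Questions the model usually gets wrong become IDK targets.
        target = gold if accuracy >= threshold else IDK
        examples.append({"prompt": question, "target": target, "accuracy": accuracy})
    return examples


def toy_model(question: str) -> str:
    """Hypothetical stand-in for a real model: knows one fact, guesses otherwise."""
    known = {"What is the capital of France?": "Paris"}
    return known.get(question, "Atlantis")


qa = {
    "What is the capital of France?": "Paris",
    "What is the capital of Bhutan?": "Thimphu",
}
examples = build_sft_examples(qa, toy_model)
for ex in examples:
    print(ex["prompt"], "->", ex["target"])
```

Note that this still needs the gold answers up front, which is exactly the tension above: the same data could instead be used to teach the model the answer.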