undefined | Better HN

0 pointsfomine33y ago0 comments

Just curious why don't they training except denial responses. Is Copying ChatGPT ethics a purpose?

0 comments

1 comments · 1 top-level

It's ongoing effort. At first they scrapped ShareGPT and used it to train the Alpaca model. After that others have pruned the dataset to remove examples where ChatGPT refused to answer. These datasets and resulting models are called "uncensored". They often leave disclaimer that the model is biased and unaligned, and that aligning should be done with LoRA layer.

Of course, no one bothered to this "ethics" LoRA so far and the unaligned models have better quality outputs than the early Alpaca models.

j / k navigate · click thread line to collapse