P.S.: The IT Crowd did it first: https://youtu.be/5UT8RkSmN4k?t=17
I didn't know there were many techniques to mitigate this.
More generally, if you teach the model to reject nonsense questions and to admit when it doesn't know something, it's more likely to do so.
A trivial idea: you can use GPT-3 to inject bullshit/hallucinations into real text, then train the model on the reverse task of detecting bullshit in input text.
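To make the idea concrete, here is a minimal sketch of the data-building half. Everything in it is an assumption for illustration: `build_detection_dataset` and `toy_corrupt` are made-up names, and `toy_corrupt` is only a stand-in for whatever GPT-3 call you would use to rewrite a sentence with a plausible falsehood.

```python
import random
from typing import Callable, List, Tuple


def build_detection_dataset(
    sentences: List[str],
    corrupt: Callable[[str], str],
    corrupt_rate: float = 0.5,
    seed: int = 0,
) -> List[Tuple[str, int]]:
    """Return (text, label) pairs; label 1 means the text was corrupted."""
    rng = random.Random(seed)  # seeded so the split is reproducible
    dataset = []
    for sentence in sentences:
        if rng.random() < corrupt_rate:
            dataset.append((corrupt(sentence), 1))
        else:
            dataset.append((sentence, 0))
    return dataset


def toy_corrupt(sentence: str) -> str:
    # Hypothetical stand-in: a real pipeline would ask a language model
    # to inject a plausible-sounding error instead of a fixed swap.
    return sentence.replace("Paris", "Berlin")
```

The labeled pairs would then feed an ordinary classifier or a fine-tune. Whether a detector trained on synthetic errors transfers to real hallucinations is an open question this sketch does not answer.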
If it's something that augments the support experience, as in something you can interact with while a real person is assigned to your support request, I'm totally fine with that. But if anyone places this as the first line of support, with no way of reaching a real person, I can't wish them the best.
I think there is a mistake in the article. It is not possible to fine-tune the latest text-davinci-003 model, only the original davinci model, which generates much worse results.
From my understanding, you can leave the `prompt` empty, and just push `completion` with your text. That way you don't need to generate Q&A first.
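A rough sketch of what that could look like, assuming the prompt/completion JSONL layout used for fine-tuning data. The helper name `to_jsonl` is made up, and the leading space on each completion is a commonly suggested convention that I am assuming here, not something the article confirms.

```python
import json
from typing import Iterable


def to_jsonl(passages: Iterable[str]) -> str:
    """Turn raw text passages into JSONL records with an empty prompt."""
    lines = []
    for passage in passages:
        text = passage.strip()
        if not text:
            continue  # skip blank passages
        # Assumption: the whole passage goes into `completion`,
        # prefixed with a single space.
        record = {"prompt": "", "completion": " " + text}
        lines.append(json.dumps(record, ensure_ascii=False))
    return "\n".join(lines)
```

Each output line is one training example, so you would write the returned string to a `.jsonl` file and upload that instead of a generated Q&A set.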
I also wonder how many people who are trying to make effective products out of this stuff are fronting it with a more rigid approach (like the intent/entity/slot approach of Rasa/Dialogflow) and then leveraging GPT-3 or ChatGPT in specific/partial subtrees of the dialog.
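For what it's worth, the shape of that hybrid is easy to sketch. This is not how Rasa or Dialogflow work internally; it is a toy keyword matcher with made-up names (`route`, `generative_fallback`) where scripted handlers take the intents they recognize and everything else falls through to a generative model.

```python
from typing import Callable, Dict, List

Handler = Callable[[str], str]


def route(
    message: str,
    keywords: Dict[str, List[str]],
    handlers: Dict[str, Handler],
    generative_fallback: Handler,
) -> str:
    """Send a message to a scripted handler if an intent matches."""
    text = message.lower()
    for intent, words in keywords.items():
        # Naive substring match; a real system would use a trained
        # intent classifier here.
        if any(word in text for word in words):
            return handlers[intent](message)
    # No rigid intent matched: hand off to the generative model.
    return generative_fallback(message)
```

In practice `generative_fallback` would wrap the GPT-3 or ChatGPT call, so the model only ever sees the parts of the dialog you chose not to script.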
- [0] https://idiotlamborghini.com/articles/using_gpt3_and_hacker_...
Isn't there a better way to feed an enormous document into DaVinci and make it bring answers only from that text?
- 1. Token limit: You can only input a limited amount of text at once. The challenge then becomes trying to compress data to fit into the window. But it can be lossy.
- 2. Trust: This is the main one. It's hard to determine if the output is based on the new learning material or the large amounts of data the model was originally trained on. There are techniques that can help but they add a lot of additional work and don't guarantee great results.
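On the token limit specifically, the usual workaround is to split the document into overlapping pieces that each fit the window. A minimal sketch, assuming whitespace-separated words are a good enough proxy for tokens (they are not exact) and with `chunk_words` as a made-up helper name:

```python
from typing import List


def chunk_words(text: str, max_words: int, overlap: int = 0) -> List[str]:
    """Split text into chunks of at most max_words words.

    Consecutive chunks share `overlap` words so that a sentence cut at
    a boundary still appears whole in at least one chunk.
    """
    if max_words <= 0:
        raise ValueError("max_words must be positive")
    if not 0 <= overlap < max_words:
        raise ValueError("overlap must be in [0, max_words)")
    words = text.split()
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break  # the last chunk already reaches the end of the text
    return chunks
```

This only addresses fitting text into the window. It does nothing for the trust problem, since the model can still answer from its original training data rather than from the chunk you gave it.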
The rest is all about taking actions to override programs and policies. Either because you don’t trust your customers to do it themselves, or to correct bugs in your process.
That’s the last thing you’d trust an AI to do.