
0 points · famouswaffles · 2y ago · 0 comments

Feel free to show otherwise

3 comments · 2 top-level

No, you need to show that the model is better on this narrow task, not just assert that it is because it's a great general LLM. It's quite possible you're correct, but just saying "GPT is the best, prove me wrong" reeks of a VC or AI bandwagoner.

famouswaffles (OP) · 2y ago

The OP is about classification. I linked to benchmarks showing it is indeed much better on the kind of task outlined in the OP as well as several other NLP tasks.

I believe "prove me wrong" is more than appropriate here.

chaxor · 2y ago

I think they mean that it is likely to heavily depend upon the task.

Decoder models are good at generating language, and of course they're going to do well where that counts. But if you want to do typical NER + relation extraction and then normalize to an in-house dictionary of 10 million IDs? You can't do that as effectively with GPT-4. You need a local model (right now).

There are a lot of things that GPT-4 doesn't touch in terms of data domain, so for many projects, yes, local (typically encoder) models are 1) way faster, 2) way cheaper, and 3) better on metrics.

Of course, the capability *could* be there with a large decoder model, but if it's a task that needs your specific (large amount of) data, you have to make something locally.
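
To make the "normalize to an in-house dictionary" step a bit more concrete, here is a minimal Python sketch. It assumes the NER step has already produced mention strings. The dictionary entries, the IDs, the `canonical` and `normalize` helpers, and the 0.85 cutoff are all hypothetical, and standard-library fuzzy string matching only stands in for whatever a trained local model would actually do.

```python
import difflib
import re

# Hypothetical in-house dictionary: surface form -> internal ID.
# A real one (the comment mentions ~10 million IDs) would live in a
# database or search index, not in a Python dict literal.
DICTIONARY = {
    "acetylsalicylic acid": "CHEM:0001",
    "aspirin": "CHEM:0001",
    "ibuprofen": "CHEM:0002",
    "paracetamol": "CHEM:0003",
}


def canonical(text: str) -> str:
    """Lowercase and collapse punctuation/whitespace so lookups are consistent."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


# Index keyed by the canonical form of every dictionary entry.
INDEX = {canonical(name): entity_id for name, entity_id in DICTIONARY.items()}


def normalize(mention: str, cutoff: float = 0.85):
    """Map an extracted mention to a dictionary ID, or None if nothing is close."""
    key = canonical(mention)
    if key in INDEX:  # exact match after canonicalization
        return INDEX[key]
    # Fallback: closest dictionary entry by string similarity (linear scan).
    close = difflib.get_close_matches(key, list(INDEX), n=1, cutoff=cutoff)
    return INDEX[close[0]] if close else None


if __name__ == "__main__":
    for mention in ["Aspirin", "Ibuprofin", "caffeine"]:
        print(mention, "->", normalize(mention))
```

The fuzzy fallback scans every entry, so it would not scale to millions of IDs as written. That is part of the point being made above: at that size you end up building and tuning something local around your own data.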
