0 points · ttyprintk · 1y ago · 0 comments

It’s not naive; Tesseract does this.

4 comments · 1 top-level

rafram · 1y ago · 3 in thread

Tesseract doesn’t use an LLM. LLMs don’t know how confident they are; Tesseract’s model does.

With most machine learning algorithms, I used to get Shapley values or other "explainable AI" metrics (at a large cost compared to simple inference, yes). It's very unsettling and frustrating to work without them now on LLMs.

hansvm · 1y ago

Kind of. Tesseract's confidence is just a raw model probability output. You could easily use the entropy associated with each token coming out of an LLM to do the same thing.
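
A rough sketch of what that could look like, assuming you can get the full next-token probability distribution at each step (many APIs only expose the top few logprobs, so that is an assumption). The function names and the normalization to a 0..1 score are made up for illustration; this is not how Tesseract computes its confidence.

```python
import math

def token_entropy(probs):
    """Shannon entropy (in nats) of one next-token probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def sequence_confidence(distributions):
    """Hypothetical score in [0, 1]: 1 minus the mean normalized entropy.

    Each entropy is divided by log(n), the maximum possible entropy for a
    distribution over n outcomes, so 1.0 means every token was certain and
    0.0 means every token was a uniform guess.
    """
    if not distributions:
        raise ValueError("need at least one token distribution")
    normalized = []
    for probs in distributions:
        n = len(probs)
        # A single-outcome distribution has zero entropy by definition.
        normalized.append(token_entropy(probs) / math.log(n) if n > 1 else 0.0)
    return 1.0 - sum(normalized) / len(normalized)

# Toy example: one certain token and one coin-flip token.
score = sequence_confidence([[1.0, 0.0], [0.5, 0.5]])
```

Low entropy only says the model was sure about the next token, not that the token is correct.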

rafram · 1y ago

True, but LLM token probability doesn't map nearly as cleanly to "how readable was the text".
