Using the vision feature for OCR is like using an LLM for math: it might work, but we already have a lot of tools that are hyper-optimized for the task.
There is practically no chance the new feature uses vision, because that would be _insanely_ slow and expensive for any reasonably sized document. They're more likely using Azure's LayoutLM-derived tech to extract the text, then using embeddings to answer questions over it.
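That second stage is basically retrieval: embed each extracted chunk of text, embed the question, and return the closest chunk by cosine similarity. A toy sketch of the idea (the bag-of-words "embedding" here is a stand-in of my own; a real system would use a learned embedding model):

```python
import math
import re
from collections import Counter

def embed(text):
    # Toy stand-in for a real embedding model: bag-of-words token counts.
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def best_chunk(question, chunks):
    # Return the OCR'd chunk whose vector is closest to the question's.
    q = embed(question)
    return max(chunks, key=lambda c: cosine(q, embed(c)))

chunks = [
    "Invoice number: 4471, issued March 2023",
    "Total amount due: $1,250.00",
    "Payment terms: net 30 days",
]
print(best_chunk("what is the total amount due?", chunks))
# → Total amount due: $1,250.00
```

The point is that the expensive vision model never has to see the document at question time; everything after OCR is cheap vector math.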