undefined | Better HN

0 pointsOras1y ago0 comments

How would you know something is missing?

I tried multiple OCRs before and it’s hard to tell if the output is accurate or not but just comparing manually.

I created a tool to visualise the output of OCR [0] to see what’s missing and there are many cases that would be quite concerning especially when working with financial data.

This tool wouldn’t work with LLMs as they don’t return the character recognition (to my knowledge), which will make it harder to evaluate them on a scale.

If I want to use LLMs for the task, I would use them to help with training ML model to do OCR better, such as creating thousands of synthetic data to train.

[0] https://github.com/orasik/parsevision

0 comments

1 comments · 1 top-level

yigitkonur351y ago

Wow, you knocked it out of the park! I'll be sure to use this when I tackle that evaluation.

j / k navigate · click thread line to collapse