> I would rather use a self-hosted solution though
Me too. It surprises me that there aren't any great open source alternatives. Evernote's OCR was at times magical (it does a very good job on my hand written notes) and I've always wondered if they licensed that or built it.