Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
rspoerri
1y ago
0 comments
Share
i've made quite good conversions from pdf to markdown with
https://github.com/VikParuchuri/marker
. it's slow but worth a shot. Markdown should be easily parseable by a rag.
i'm trying to get a similar system setup on my computer.
0 comments
default
newest
oldest
nl
1y ago
This looks worth exploring, so thanks. The author has done a bunch of work beyond what PyMuPDF does on multicolumn layouts.
j
/
k
navigate · click thread line to collapse