Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
cpursley
2y ago
0 comments
Save
Share
Any tips on effectively getting financial data out of PDFs into a RAG system (especially data contained in tables)? And locally, not via proprietary cloud PDF parsing thingy. That's the current nut I'm trying to crack.
0 comments
2 comments · 2 top-level
top
newest
oldest
rawsh
2y ago
https://github.com/VikParuchuri/marker
is solid, but slow and needs gpu(s) to be practical
serjester
2y ago
You might find my library useful -
https://github.com/Filimoa/open-parse
j
/
k
navigate · click thread line to collapse