Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
ngrilly
9y ago
0 comments
Save
Share
Did you store the plain text of each PDF in PostgreSQL or just the ts_vector resulting from the plain text?
0 comments
2 comments · 1 top-level
top
newest
oldest
fatbird
9y ago
· 1 in thread
IIRC, I stored the plain text too because the engine can return contextually marked up plaintext after finding it in the ts_vector.
ngrilly
OP
9y ago
You're right, PostgreSQL needs the plain text to highlight it with ts_headline. It's similar to Elasticsearch keeping the original document in the _source attribute. Thanks!
j
/
k
navigate · click thread line to collapse