Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
0 points
kwillets
1y ago
0 comments
Share
One more: do you prefer the CDC technique over using the rowgroups as chunks (ie using knowledge of the file structure)? Is it worth it to build a parquet-specific diff?
undefined | Better HN
0 comments
default
newest
oldest
ylow
1y ago
I think both are necessary. The cdc technique is file format independent. The row group method makes Parquet robust to it.
j
/
k
navigate · click thread line to collapse