amrrs · 8y ago

Sorry for the noob question, but could you please explain how adding hashes would help with better integration?

lgierth · 8y ago

They mainly help in four ways:

- avoid data corruption when downloading/transferring/copying datasets

- notice changes/updates in the original dataset

- dataset versioning (think how e.g. git turns directories and files into hash trees -- also called content-addressing)

- most importantly: stable names without a naming authority
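To make the four points above concrete, here is a minimal sketch, assuming SHA-256 as the hash function. The helper names (`content_address`, `verify`, `tree_address`) and the sample CSV bytes are made up for illustration, and the tree hash only imitates the idea behind git's hash trees, not git's actual object format.

```python
import hashlib


def content_address(data: bytes) -> str:
    """Name a blob by the SHA-256 hex digest of its bytes (hypothetical helper)."""
    return hashlib.sha256(data).hexdigest()


def verify(data: bytes, expected: str) -> bool:
    """Check that downloaded or copied bytes still match the expected hash."""
    return content_address(data) == expected


def tree_address(files: dict[str, bytes]) -> str:
    """Hash a set of files by hashing the sorted (name, file hash) pairs.

    Illustrative only: this mirrors the hash-tree idea, not git's format.
    """
    h = hashlib.sha256()
    for fname in sorted(files):
        h.update(f"{fname}\0{content_address(files[fname])}\n".encode())
    return h.hexdigest()


# Made-up sample dataset.
original = b"id,value\n1,10\n2,20\n"
name = content_address(original)  # stable name, no naming authority needed

# 1. Corruption check: an intact copy verifies against the name.
copy = bytes(original)
assert verify(copy, name)

# 2. Change detection: any update to the data changes the hash.
updated = original + b"3,30\n"
assert not verify(updated, name)

# 3. Versioning: each version of the whole dataset gets its own tree hash.
v1 = tree_address({"data.csv": original})
v2 = tree_address({"data.csv": updated})
assert v1 != v2
```

Anyone who computes the hash of the same bytes gets the same name, which is why no central registry is needed to agree on what a dataset is called.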

prepend · 8y ago

How does this apply when you can filter the data or do conditional exports? Is the idea that the CSV has a fixed hash, and if you trust that, you can trust anything else?