undefined | Better HN

0 pointscodeviking4y ago0 comments

We don't retain the uploaded document. We cache the extracted content, as to make things more efficient.

> What data do we keep? We cache a copy of the extracted content as well as the extracted images. This allows us to serve the results more quickly when a user uploads the same file again. We do not retain the uploaded files themselves. Cached content is never served to a user who has not provided the exact same document.

Also, we can delete the extracted data on request. Just send a note to accessibility@semanticscholar.org.

Sorry for the confusion!

0 comments

3 comments · 1 top-level

kahon654y ago· 2 in thread

Ah okay, thank you.

>Also, we can delete the extracted data on request.

Just to be 100% clear, you are referring to the cached extracted data, right?

codevikingOP4y ago

Yup, that's right.

kahon654y ago

Thank you very much!

j / k navigate · click thread line to collapse