undefined | Better HN

0 pointscatoc3mo ago0 comments

That an LLM needs input tokens to produce output was understood. That is not what the privacy policy is about. To me the policy reads Anthropic also subsequently persists (“collects”) your data. That is the point I was hoping to get clarified.

0 comments

adastra223mo ago

The only thing Anthropic receives is the chat session. Files only ever get sent when they are included in the session - they are never sent to Anthropic otherwise.

Note that I am talking about this product where the Claude session is running locally (remote LLM of course, but local Claude Code). They also have a "Claude Code on the Web" thing where the Claude instance is running on their server. In principle, they could be collecting and training on that data even if it never enters a session. But this product is running on your computer, and Anthropic only sees files pulled in by tool calls.

catocOP3mo ago

So when using Cowork on a local folder and asking it to "create a new spreadsheet with a list of expenses from a pile of screenshots", those screenshots may[*] become part of the "collected Inputs" kept by Anthropic.

[*]"may" because depending on the execution, instead of directly uploading the screenshots, a (python) script may be created that does local processing and only upload derived output

adastra223mo ago

Yes, in general. I think in your specific example it is more likely to ingest the screenshots (upload to Anthropic) and use its built-in vision model to extract the relevant information. But if you had like a million screenshots, it might choose to run some Python OCR software locally instead.

In either case though, all the tool calls and output are part of the session and therefore Input. Even if it called a local OCR application to extract the info, it would probably then ingest that info to act on it (e.g. rename files). So the content is still being uploaded to Anthropic.

Note that you can opt-out of training in your profile settings. Now whether they continue to respect that into the future...

catocOP3mo ago

When local compute is more efficient data may remain local (e.g. when asking it to "find duplicate images" in millions of images it will likely (hopefully) just compute hashes and compare those), but complete folder contents are just as likely to be ingested (uploaded) and considered "Inputs", for which even the current Privacy Policy already explicitly says these will be "collected" (even when opting-out of allowing subsequent use for training).

To be clear: I like what Anthropic is doing, they appear more trustworthy/serious than OpenAI, but Cowork will result in millions of unsuspecting users having complete folders full of data uploaded and persisted on servers, currently, owned by Anthropic.

j / k navigate · click thread line to collapse

0 comments

adastra223mo ago

The only thing Anthropic receives is the chat session. Files only ever get sent when they are included in the session - they are never sent to Anthropic otherwise.

catocOP3mo ago

[*]"may" because depending on the execution, instead of directly uploading the screenshots, a (python) script may be created that does local processing and only upload derived output

adastra223mo ago

Note that you can opt-out of training in your profile settings. Now whether they continue to respect that into the future...

catocOP3mo ago

j / k navigate · click thread line to collapse