Cowork: Claude Code for the rest of your work

(claude.com)

1298 points · adocomplete · 5mo ago · 565 comments

213 comments · 87 top-level

felixrieseberg5mo ago· 31 in thread

Hi, Felix from the team here, this is my product - let us know what you think. We're on purpose releasing this very early, we expect to rapidly iterate on it.

(We're also battling an unrelated Opus 4.5 inference incident right now, so you might not see Cowork in your client right away.)

deanc5mo ago

Your terms for Claude Max point to the consumer ToS. This ToS states it cannot be used for commercial purposes. Why is this? Why are you marketing a product clearly for business use and then have terms that strictly forbid it.

I’ve been trying to reach a human at Anthropic for a week now to clarify this on behalf of our company but can’t get past your AI support.

bashtoni5mo ago

Hi Felix!

Simple suggestion: logo should be a cow and an orc to match how I originally read the product name.

dcreater5mo ago

AI and Claude Code are incredible tools. But use cases like "Organize my desktop" are horrible misapplications that are insecure, inefficient and a privacy nightmare. It's the smart refrigerator of this generation of tech.

I worry that the average consumer is none the wiser but I hope a company that calls itself Anthropic is anthropic. Being transparent about what the tool is doing, what permissions it has, educating on the dangers etc. are the least you can do.

With the example of clearing up your mac desktop: a) macOS already autofolds things into smart stacks b) writing a simple script that emulates an app like Hazel is a far better approach for AI to take
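A minimal sketch of the "simple script" approach described above, assuming a fixed extension-to-folder mapping. The `RULES` table and the function names are hypothetical and are not Hazel's rule format; the plan is computed first so it can be reviewed before anything is moved.

```python
from pathlib import Path
import shutil

# Hypothetical rules: file extension -> destination subfolder.
RULES = {
    ".png": "Images",
    ".jpg": "Images",
    ".pdf": "Documents",
    ".docx": "Documents",
}


def plan_moves(folder: Path, rules: dict[str, str] = RULES) -> list[tuple[Path, Path]]:
    """Return (source, destination) pairs without touching any file."""
    moves = []
    for entry in sorted(folder.iterdir()):
        if not entry.is_file():
            continue
        subfolder = rules.get(entry.suffix.lower())
        if subfolder is not None:
            moves.append((entry, folder / subfolder / entry.name))
    return moves


def apply_moves(moves: list[tuple[Path, Path]], dry_run: bool = True) -> None:
    """Print the plan, or carry it out when dry_run is False."""
    for src, dst in moves:
        if dry_run:
            print(f"would move {src} -> {dst}")
            continue
        if dst.exists():
            # Never overwrite an existing file; leave the source in place.
            continue
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.move(str(src), str(dst))
```

Calling `apply_moves(plan_moves(Path.home() / "Desktop"))` only prints the plan; passing `dry_run=False` performs the moves.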

tildef5mo ago

Looks cool, and I'm guilty as charged of using CC for more than just code. However, as a Max subscriber since the moment it was a thing, I find it a bit disheartening to see development resources being poured into a product that isn't available on my platform. Have you considered adding first-class support for Linux? -- Or for that matter sponsoring one of the Linux repacks of Claude Desktop on Github? I would love to use this, but not if I need to jump through a bunch of hoops to get it up and running.

politelemon5mo ago

Hi there, your training and inference rely on the openness of Linux. Would you consider giving something back with Claude for Linux?

Recursing5mo ago

What probability would you give for Linux support for Claude Desktop in 2026?

hoss14744895mo ago

Beachball of death on “Starting Claude’s workspace” on the Cowork tab. Force quit and relaunch, and Claude reopens on the Cowork tab, again hanging with the beachball of death on “Starting Claude’s workspace”.

Deleting vm_bundles lets me open Claude Desktop and switch tabs. Then it hangs again, I delete vm_bundles again, and open it again. This time it opens on the Chat tab and I know not to click the Cowork tab...

jchung5mo ago

@Felix - How are you thinking about observability? Anthropic is very clear that evals are critical for agentic processes (your engineering blog just covered this last week). For my whole company to roll out access to agents for all staff, I'd need some way for staff (or IT) to be able to know (a) how reliable the systems are (i.e., evals), (b) how safe the systems are (could be audit trails), and (c) how often the access being given to agents is the right amount of access.

This has been one of the biggest bottlenecks for our company: not the capability of the agents themselves -- the tools needed to roll them out responsibly.
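To make the audit-trail idea in (b) and the access question in (c) concrete, here is a rough sketch of an append-only JSON Lines log of agent actions. The field names (`tool`, `target`, `allowed`) and both functions are assumptions made for illustration; nothing here reflects how Cowork records activity.

```python
import json
import time
from pathlib import Path


def record_action(log_path: Path, tool: str, target: str, allowed: bool) -> dict:
    """Append one agent action to the audit log and return the entry."""
    entry = {"ts": time.time(), "tool": tool, "target": target, "allowed": allowed}
    with log_path.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(entry) + "\n")
    return entry


def summarize(log_path: Path) -> dict:
    """Count total and denied actions, a crude signal of over- or under-granted access."""
    total = 0
    denied = 0
    with log_path.open(encoding="utf-8") as fh:
        for line in fh:
            if not line.strip():
                continue
            entry = json.loads(line)
            total += 1
            if not entry["allowed"]:
                denied += 1
    return {"total": total, "denied": denied}
```

A high share of denied actions would suggest the agent keeps asking for more than it was given; a log with no denials says little about whether access is too broad, so this is only a starting point.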

tkgally5mo ago

You released it at just the right time for me. When I saw your announcement, I had two tasks that I was about to start working on: revising and expanding a project proposal in .docx format and adapting some slides (.pptx) from a past presentation for a different audience.

I created a folder for Cowork, copied a couple of hundred files into it related to the two tasks, and told Claude to prepare a comprehensive summary in markdown format of that work (and some information about me) for its future reference.

The summary looked good, so I then described the two tasks to Claude and told it to start working.

Its project proposal revision was just about perfect. It took me only about 10 more minutes to polish it further and send it off.

The slides took more time to fix. The text content of some additional slides that Claude created was quite good and I ended up using most of it, but the formatting did not match the previous slides and I had to futz with it a while to make it consistent. Also, one slide it created used a screenshot it took using Chrome from a website I have built; the screenshot didn’t illustrate what it was supposed to very well, so I substituted a couple of different screenshots that I took myself. That job is now out the door, too.

I had not been looking forward to either of those two tasks, so it’s a relief to get them done more quickly than I had expected.

One initial problem: A few minutes into my first session with Claude in Cowork, after I had updated the app, it started throwing API errors and refusing to respond. I used the "Clear Cache and Restart" from the Troubleshooting menu and started over again from the start. Since then there have been no problems.

martinald5mo ago

Hey, congrats on the launch. Been thinking a lot about this space (wrote this back in August: https://martinalderson.com/posts/building-a-tax-agent-with-c...).

Would love to connect, my email's in my bio if you have time!

mastercheif5mo ago

Hi Felix, this looks like an incredible tool. I've been helping non-tech people at my org make agent flows for things like data analysis—this is exactly what they need.

However, I don't see an option for AWS Bedrock API in the sign up form, is it planned to make this available to those using Bedrock API to access Claude models?

skybrian5mo ago

Being able to undo any changes that Cowork makes seems important. Any plans for automatic snapshots or an undo log?
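One naive way to get the undo behaviour asked about here is to copy the working folder before the agent runs and copy it back on request. The sketch below assumes a folder small enough to duplicate in full; `take_snapshot` and `restore_snapshot` are hypothetical names, and this is not something Cowork is known to do.

```python
import shutil
import tempfile
from pathlib import Path


def take_snapshot(folder: Path) -> Path:
    """Copy the folder into a fresh temporary directory and return the copy's path."""
    snapshot_root = Path(tempfile.mkdtemp(prefix="cowork-snapshot-"))
    snapshot = snapshot_root / "data"
    shutil.copytree(folder, snapshot)
    return snapshot


def restore_snapshot(snapshot: Path, folder: Path) -> None:
    """Discard the folder's current contents and put the snapshot back."""
    shutil.rmtree(folder)
    shutil.copytree(snapshot, folder)
```

A full copy is the simplest possible design and costs disk space proportional to the folder; filesystem-level snapshots avoid that cost but are platform specific.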

RamblingCTO5mo ago

Was looking forward to trying it, but just processing a Notion page and preparing an outline for a report breaks it: This is taking longer than usual...(14m 2s)

/e: Stopped it and retried. It seems it can't use the connectors? I get "No such tool available".

torben-friis5mo ago

Question: I see that the “actions hints” in the demo show messaging people as an option.

Is this a planned use case, for the user to hand over human communication in, say, Slack or similar? What are the current capabilities and limitations for that?

pritambarhate5mo ago

I guess you need to know about this: https://news.ycombinator.com/item?id=46597781

9dev5mo ago

Hey Felix, would love to give you feedback, but the language redirect of the website is trying to route me to de-de, and thus I can't see the page.

You might want to fix this.

andreygrehov5mo ago

Why do all similar demos show the “prep the deck” use case as if everybody is building PowerPoint slides all day long?

VadimPR5mo ago

Would love to see a Linux native application for this, after all a lot of folks are using it more and more these days.

tekacs5mo ago

Hullo! Congrats on shipping this, it looks great!

I'm very curious about what you mean by 'cross device sync' in the post?

pikseladam5mo ago

Do you expect more token usage with it or will Anthropic change the limits of user token limit in the future?

carlo-notion5mo ago

Cheers Felix, congrats on the launch!

tiahura5mo ago

The announcement says existing connectors work, but only Claude for Chrome does.

oidar5mo ago

Congrats! I'll be working this out. It doesn't seem that you can connect to Gmail through Cowork right now. When will the connectors roll out for this? (Gmail works fine in chats currently).

jscottmiller5mo ago

Looks good so far - I hope Windows support follows soon!

mkbkn5mo ago

Can you release custom GPTs like ChatGPT has?

bibimsz5mo ago

Would like to be able to point at AWS Bedrock models like I can with Claude Code.

column5mo ago

Hi! Windows support when?

BaudouinVH5mo ago

hello Felix, that page is 404 here at the moment :(

jmkni5mo ago

Congrats Felix :)

motoboi5mo ago

Please give me access via api key

dabedee5mo ago

It's great and reassuring to know that, in this day and age, products still get made entirely by one individual.

> Hi, Felix from the team here, this is my product - let us know what you think. We're on purpose releasing this very early, we expect to rapidly iterate on it.

> (We're also battling an unrelated Opus 4.5 inference incident right now, so you might not see Cowork in your client right away.)

jryio5mo ago· 21 in thread

It's so important to remember that unlike code which can be reverted - most file system and application operations cannot.

There's no sandboxing, snapshots, revision history, rollbacks, or anything.

I expect to see many stories from parents, non-technical colleagues, and students who irreparably ruined their computer.

Edit: most comments are focused on pointing out that version control & file system snapshots exist: that's wonderful, but Claude Cowork does not use them.

For those of us who have built real systems at low levels I think the alarm bells go off seeing a tool like this - particularly one targeted at non-technical users

Workaccount25mo ago

Frequency vs. convenience will determine how big of a deal this is in practice.

Cars have plenty of horror stories associated with them, but convenience keeps most people happily driving every day without a second thought.

Google can quarantine your life with an account ban, but plenty of people still use gmail for everything despite the stories.

So even if Claude Cowork can go off the rails and turn your digital life upside down, as long as the stories are just online or "friend of a friend of a friend", people won't care much.

alwillis5mo ago

The first version is for macOS, which has snapshots [1] and file versioning [2] built-in.

[1]: https://eclecticlight.co/2024/04/08/apfs-snapshots/

[2]: https://eclecticlight.co/2021/09/04/explainer-the-macos-vers...

falcor845mo ago

Once upon a time, in the magical days of Windows 7, we had the Volume Shadow Copy Service (aka "Previous Versions") available by default, and it was so nice. I'm not using Windows anymore, and at least part of the reason is that it's just objectively less feature complete than it used to be 15 years ago.

hopelite5mo ago

Somewhat related is a concern I have in general as things get more "agentic" and related to the prompt injection concerns; without something like legally bullet-proof contracts, aren't we moving into territory of basically "employing" what could basically be "spies" at all levels from personal (i.e., AI company staff having access to your personal data/prompts/chats) to business/corporate espionage, to domestic and international state level actors who would also love to know what you are working on and what you are thinking/chatting about and maybe what your mental health challenges are that you are working through with an AI chat therapist.

I am not even certain if this issue can be solved since you are sending your prompts and activities to "someone else's computer", but I suspect if it is overlooked or hand-waved as insignificant, there will be a time when open, local models will become useful enough to allow most to jettison cloud AI providers.

I don't know about everyone else, but I am not at all confident in allowing access and sending my data to some AI company that may just do a rug pull once they have an actual virtual version of your mind in a kind of AI replication.

I'll just leave it at that point and not even go into the ramifications of that, e.g., "cybercrimes" being committed by "you", which is really the AI impersonator built based on everything you have told it and provide access to.

toddmorey5mo ago

Q: What would prevent them from using git style version control under the hood? User doesn’t have to understand git, Claude can use it for its own purposes.
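As an illustration of what "git style" tracking could look like under the hood, the sketch below hashes every file in a folder and compares two such manifests to report what an agent run added, removed, or modified. It is in the spirit of git's content addressing only: it does not use git's object format, and `manifest` and `diff_manifests` are hypothetical helpers.

```python
import hashlib
from pathlib import Path


def manifest(folder: Path) -> dict[str, str]:
    """Map each file's relative path to the SHA-256 digest of its contents."""
    result = {}
    for path in sorted(folder.rglob("*")):
        if path.is_file():
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            result[path.relative_to(folder).as_posix()] = digest
    return result


def diff_manifests(before: dict[str, str], after: dict[str, str]) -> dict[str, list[str]]:
    """Report which paths were added, removed, or changed between two manifests."""
    shared = before.keys() & after.keys()
    return {
        "added": sorted(after.keys() - before.keys()),
        "removed": sorted(before.keys() - after.keys()),
        "modified": sorted(p for p in shared if before[p] != after[p]),
    }
```

Taking a manifest before and after a session gives the user a reviewable change list without requiring them to know anything about version control.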

y425mo ago

Indeed there are, and this is not rocket science. Word documents offer a change history, deleted files go to the trash first, there are undo functions, Time Machine on macOS, similar features on Windows, even sandbox features.

kamaal5mo ago

>>I expect to see many stories from parents, non-technical colleagues, and students who irreparably ruined their computer.

I do believe the approach Apple is taking is the right way when it comes to user facing AI.

You need to reduce AI to being an appliance that does one or at most a few things perfectly right without many controls with unexpected consequences.

Real fun is robots. Not sure why no one is hurrying up on that end.

>>Edit: most comments are focused on pointing out that version control & file system snapshot exists: that's wonderful, but Claude Cowork does not use it.

Also, in my experience this creates all kinds of other issues. Going back up a tree creates all kinds of confusion and leaves the system inconsistent with whatever else you are doing.

You are right in your analysis that many people are going to end up with totally broken systems

bob10295mo ago

In theory the risk is immense and incalculable, but in practice I've never found any real danger. I've run wide open powershell with an OAI agent and just walked away for a few hours. It's a bit of a rush at first but then you realize it's never going to do anything crazy.

The base model itself is biased away from actions that would lead to large scale destruction. Compound over time and you probably never get anywhere too scary.

seunosewa5mo ago

There's no reason why Claude can't use git to manage the folders that it controls.

Weryj5mo ago

TimeMachine has never been so important.

hans0l0745mo ago

IIUC, this is a preview for Claude Max subscribers - I'm not sure we'll find many teachers or students there (unless institutions are offering Max-level enterprise/team subscriptions to such groups). I speculate that most of those who will bother to try this out will be software engineering people. And perhaps they will strengthen this after enough feedback and use cases?

Aeolun5mo ago

If this is like Claude Code for everyone else, shouldn’t it be snapshotting anything it changes so that you can go back to the previous state?

machiaweliczny5mo ago

Yeah, seems like this could be achieved by using https://github.com/streamich/memfs/blob/master/docs/snapshot...

Weird they don't use it - might backfire hard

matt3D5mo ago

Pretty much every company I work with uses the desktop sync tools for OneDrive/GoogleDrive/Dropbox etc.

It would be madness to work completely offline these days, and all of these systems have version history and document recovery built in.

__MatrixMan__5mo ago

I hope we see further exploration into immutable/versioned filesystems and databases where we can really let these things go nuts, commit the parts we want to keep, and revert the rest for the next iteration.

Helmut100015mo ago

I would never use what is proposed by OP. But, in any case, Linux on ZFS that is automatically snapshotted every minute might be (part of) a solution to this dilemma.

akurilin5mo ago

You make a good point. I imagine that they will eventually add Perforce-style versioning to the product and this issue will be solved.

o_m5mo ago

So the future is NixOS for non-technical people?

big-chungus45mo ago

A human can also accidentally delete or mess up some files. The question is whether Claude Cowork is more prone to it.

heliumtera5mo ago

There were a couple of posts here on Hacker News praising agents because, it seems, they are really good at being a sysadmin. You don't need to be a non-technical user to be utterly fucked by AI.

neocron5mo ago

Not a big problem to make snapshots with lvm or zfs and others. I use it automatically on every update

simonw5mo ago· 18 in thread

I was hoping for a moment that this meant they had come up with a design that was safe against lethal trifecta / prompt injection attacks, maybe by running everything in a tight sandbox and shutting down any exfiltration vectors that could be used by a malicious prompt attack to steal data.

Sadly they haven't completely solved that yet. Instead their help page at https://support.claude.com/en/articles/13364135-using-cowork... tells users "Avoid granting access to local files with sensitive information, like financial documents" and "Monitor Claude for suspicious actions that may indicate prompt injection".

(I don't think it's fair to ask non-technical users to look out for "suspicious actions that may indicate prompt injection" personally!)

felixrieseberg5mo ago

Worth calling out that execution runs in a full virtual machine with only user-selected folders mounted in. CC itself runs, if the user set network rules, with https://github.com/anthropic-experimental/sandbox-runtime.

There is much more to do - and our docs reflect how early this is - but we're investing in making progress towards something that's "safe".

viraptor5mo ago

> (I don't think it's fair to ask non-technical users to look out for "suspicious actions that may indicate prompt injection" personally!)

It's the "don't click on suspicious links" of the LLM world and will be just as effective. It's the system they built that should prevent those being harmful, in both cases.

cyanydeez5mo ago

There's no AI that's secure and capable of doing anything an idiot would do on the internet with whatever data you give it.

This is a perfect encapsulation of the same problem: https://www.reddit.com/r/BrandNewSentence/comments/jx7w1z/th...

Substitute AI with Bear

ashishb5mo ago

That's why I run it inside a sandbox - https://github.com/ashishb/amazing-sandbox

lifetimerubyist5mo ago

Prompt injection will never be "solved". It will always be a threat.

heliumtera5mo ago

What would you consider a tight sandbox without exfiltration vectors? Agents are used to run arbitrary compute. Even a simple write to disk can be part of an exfiltration method. Instructions, bash scripts, programs written by agents can be evaluated outside the sandbox and cause harm. Is this a concern? Or, alternatively, is your concern what type of information can leak outside of that particular tight sandbox? In this case I think you would have to disallow any internet communication besides the LLM provider itself, including the underlying host of the sandbox.

You brought this up a couple of times now, would appreciate clarification.
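The "disallow any internet communication besides the LLM provider" option can be pictured as a host allowlist enforced at an egress proxy. A minimal sketch of the check follows; the allowlist contents and the `is_allowed` helper are assumptions for illustration, and a real proxy would also have to deal with DNS, redirects, and data smuggled through the allowed host itself.

```python
from urllib.parse import urlsplit

# Assumed allowlist: only the model provider's API host may be contacted.
ALLOWED_HOSTS = frozenset({"api.anthropic.com"})


def is_allowed(url: str, allowed_hosts: frozenset = ALLOWED_HOSTS) -> bool:
    """Allow a request only when its exact hostname is on the allowlist."""
    host = urlsplit(url).hostname
    return host is not None and host.lower() in allowed_hosts
```

Matching the exact hostname, rather than a prefix or substring, keeps lookalikes such as `api.anthropic.com.evil.example` out.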

redfloatplane5mo ago

I do get a "Setting up Claude's workspace" when opening it for the first time - it appears that this does do some kind of sandboxing (shared directories are mounted in).

nezhar5mo ago

I built https://github.com/nezhar/claude-container for exactly this reason - it's easy to make mistakes with these agents even for technical users, especially in yolo mode.

imovie45mo ago

> (I don't think it's fair to ask non-technical users to look out for "suspicious actions that may indicate prompt injection" personally!)

Yes, but at least for now it's restricted to Claude Max subscribers, who are likely to be at least semi-technical (or at least use AI a lot)?

aussieguy12345mo ago

If you're on Linux, you can run AI agents in Firejail to limit access to certain folders/files.

schmuhblaster5mo ago

Is there any reasonably fast and portable sandboxing approach that does not require a full blown VM or containers? For coding agents containers are probably the right way to go, but for something like Cowork that is targeted at non-technical users who want or have to stay local, what's the right way?

container2wasm seems interesting, but it runs a full blown x86 or ARM emulator in WASM which boots an image derived from a docker container [0].

[0] https://github.com/container2wasm/container2wasm

sureglymop5mo ago

That's one thing. Another would be introducing homomorphic encryption in order for companies and people using their models to stay compliant and private. I can't believe it's such an under-researched area in AI.

jen729w5mo ago

> tells users "Avoid granting access to local files with sensitive information, like financial documents"

Good job that video of it organising your Desktop doesn't show folders containing 'Documents', 'Photos', and 'Projects'!

Oh wait.

bandrami5mo ago

My entire job is working with financial documents so this doesn't really do much for me

antidamage5mo ago

How does prompt injection happen? Or is it more a new link in a chain of existing failures?

fennecfoxy5mo ago

Problem is technical people on average (I wouldn't say all of us) know what we don't know. I'm naturally cautious when running new stuff or even just trying something new in life.

This is why the Android permissions system of "allow this app to x, y, z" whilst great for me, isn't really a good system for the average person, because what do they do "yes, yes, yes, just let me see my Tiktoks!1111"

btucker5mo ago

I haven't dug too deep, but it appears to be using a bubblewrap sandbox inside a vm on the Mac using Apple's Virtualization.framework from what I can tell. It then uses unix sockets to proxy network via socat.

ETA: used Claude Code to reverse engineer it:

   Insight ─────────────────────────────────────

  Claude.app VM Architecture:
  1. Uses Apple's Virtualization.framework (only on ARM64/Apple Silicon, macOS 13+)
  2. Communication is via VirtioSocket (not stdio pipes directly to host)
  3. The VM runs a full Linux system with EFI/GRUB boot

  ─────────────────────────────────────────────────

        ┌─────────────────────────────────────────────────────────────────────────────────┐
        │  macOS Host                                                                     │
        │                                                                                 │
        │  Claude Desktop App (Electron + Swift native bindings)                          │
        │      │                                                                          │
        │      ├─ @anthropic-ai/claude-swift (swift_addon.node)                           │
        │      │   └─ Links: Virtualization.framework (ARM64 only, macOS 13+)            │
        │      │                                                                          │
        │      ↓ Creates/Starts VM via VZVirtualMachine                                   │
        │                                                                                 │
        │  ┌──────────────────────────────────────────────────────────────────────────┐  │
        │  │  Linux VM (claudevm.bundle)                                              │  │
        │  │                                                                          │  │
        │  │  ┌────────────────────────────────────────────────────────────────────┐  │  │
        │  │  │  Bubblewrap Sandbox (bwrap)                                        │  │  │
        │  │  │  - Network namespace isolation (--unshare-net)                     │  │  │
        │  │  │  - PID namespace isolation (--unshare-pid)                         │  │  │
        │  │  │  - Seccomp filtering (unix-block.bpf)                              │  │  │
        │  │  │                                                                    │  │  │
        │  │  │  ┌──────────────────────────────────────────────────────────────┐  │  │  │
        │  │  │  │  /usr/local/bin/claude                                       │  │  │  │
        │  │  │  │  (Claude Code SDK - 213MB ARM64 ELF binary)                  │  │  │  │
        │  │  │  │                                                              │  │  │  │
        │  │  │  │  --input-format stream-json                                  │  │  │  │
        │  │  │  │  --output-format stream-json                                 │  │  │  │
        │  │  │  │  --model claude-opus-4-5-20251101                            │  │  │  │
        │  │  │  └──────────────────────────────────────────────────────────────┘  │  │  │
        │  │  │       ↑↓ stdio (JSON-RPC)                                          │  │  │
        │  │  │                                                                    │  │  │
        │  │  │  socat proxies:                                                    │  │  │
        │  │  │  - TCP:3128 → /tmp/claude-http-*.sock (HTTP proxy)                │  │  │
        │  │  │  - TCP:1080 → /tmp/claude-socks-*.sock (SOCKS proxy)              │  │  │
        │  │  └────────────────────────────────────────────────────────────────────┘  │  │
        │  │                                                                          │  │
        │  └──────────────────────────────────────────────────────────────────────────┘  │
        │           ↕ VirtioSocket (RPC)                                                 │
        │      ClaudeVMDaemonRPCClient.swift                                             │
        │           ↕                                                                    │
        │      Node.js IPC layer                                                         │
        └─────────────────────────────────────────────────────────────────────────────────┘

VM Specifications (from inside)

| Component | Details |
|-----------|---------|
| Kernel | Linux 6.8.0-90-generic aarch64 (Ubuntu PREEMPT_DYNAMIC) |
| OS | Ubuntu 22.04.5 LTS (Jammy Jellyfish) |
| Hostname | claude |
| CPU | 4 cores, Apple Silicon (virtualized), 48 BogoMIPS |
| RAM | 3.8 GB total (~620MB used at idle) |
| Swap | None |

Storage Layout

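| Device | Size | Type | Mount Point | Purpose |
|--------|------|------|-------------|---------|
| /dev/nvme0n1p1 | 9.6 GB | ext4 | / | Root filesystem (rootfs.img) |
| /dev/nvme0n1p15 | 98 MB | vfat | /boot/efi | EFI boot partition |
| /dev/nvme1n1 | 9.8 GB | ext4 | /sessions | Session data (sessiondata.img) |
| virtiofs | - | virtiofs | /mnt/.virtiofs-root/shared/... | Host filesystem access |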

Filesystem Mounts (User Perspective)

        /sessions/gallant-vigilant-lamport/
        ├── mnt/
        │   ├── claude-cowork/     → Your selected folder (virtiofs + bindfs)
        │   ├── .claude/           → ~/.claude config (bindfs, rw)
        │   ├── .skills/           → Skills/plugins (bindfs, ro)
        │   └── uploads/           → Uploaded files (bindfs)
        └── tmp/                   → Session temp files
        
Session User

A dedicated user is created per session with a Docker-style random name:

        User: gallant-vigilant-lamport
        UID:  1001
        Home: /sessions/gallant-vigilant-lamport

Process Tree

        PID 1: bwrap (bubblewrap sandbox)
        └── bash (shell wrapper)
            ├── socat TCP:3128 → unix socket (HTTP proxy)
            ├── socat TCP:1080 → unix socket (SOCKS proxy)
            └── /usr/local/bin/claude (Claude Code SDK)
                └── bash (tool execution shells)

Security Layers

1. Apple Virtualization.framework - Hardware-level VM isolation
2. Bubblewrap (bwrap) - Linux container/sandbox
   - --unshare-net - No direct network access
   - --unshare-pid - Isolated PID namespace
   - --ro-bind / / - Read-only root (with selective rw binds)
3. Seccomp - System call filtering (unix-block.bpf)
4. Network Isolation - All traffic via proxied unix sockets
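
For readers unfamiliar with Bubblewrap, the flags listed above can be assembled into a command line roughly as follows. This is a sketch under assumptions, not the invocation Claude Desktop actually uses: `build_bwrap_argv` is a hypothetical helper, the writable directories are made up, and the seccomp filter is left out because `--seccomp` expects an open file descriptor.

```python
def build_bwrap_argv(command: list[str], writable_dirs: list[str]) -> list[str]:
    """Build a bwrap command line with a read-only root and selected writable binds."""
    argv = [
        "bwrap",
        "--unshare-net",         # new network namespace: no direct network access
        "--unshare-pid",         # new PID namespace: host processes are not visible
        "--ro-bind", "/", "/",   # mount the whole root filesystem read-only
    ]
    for directory in writable_dirs:
        # Later binds layer on top of the read-only root, making these paths writable.
        argv += ["--bind", directory, directory]
    return argv + command
```

The resulting list could be handed to `subprocess.run` on a Linux machine with Bubblewrap installed; nothing is executed by the function itself.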

Network Architecture
        ┌─────────────────────────────────────────────────────────────┐
        │  Inside Sandbox                                             │
        │                                                             │
        │  claude process                                             │
        │      │                                                      │
        │      ↓ HTTP/HTTPS requests                                  │
        │  localhost:3128 (HTTP proxy via env vars)                   │
        │      │                                                      │
        │      ↓                                                      │
        │  socat → /tmp/claude-http-*.sock ─────────┐                │
        │                                            │                │
        │  localhost:1080 (SOCKS proxy)              │                │
        │      │                                     │                │
        │      ↓                                     │                │
        │  socat → /tmp/claude-socks-*.sock ────────┤                │
        └───────────────────────────────────────────┼────────────────┘
                                                    │
                                VirtioSocket ←──────┘
                                                    │
        ┌───────────────────────────────────────────┼────────────────┐
        │  Host (macOS)                             │                │
        │                                           ↓                │
        │                              Claude Desktop App            │
        │                                           │                │
        │                                           ↓                │
        │                                    Internet                │
        └─────────────────────────────────────────────────────────────┘
Key insight: The VM has only a loopback interface (lo). No eth0, no bridge. All external network access is tunneled through unix sockets that cross the VM boundary via VirtioSocket.
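
The "HTTP proxy via env vars" step in the diagram presumably amounts to pointing the conventional proxy variables at the local socat listeners. A small sketch, assuming the usual `HTTP_PROXY`, `HTTPS_PROXY`, and `ALL_PROXY` names (the exact variables set inside the VM are not shown in this write-up) and the ports from the diagram:

```python
def proxy_environment(http_port: int = 3128, socks_port: int = 1080) -> dict[str, str]:
    """Environment variables that route a child process through local proxies."""
    http_proxy = f"http://localhost:{http_port}"
    return {
        "HTTP_PROXY": http_proxy,
        "HTTPS_PROXY": http_proxy,
        "ALL_PROXY": f"socks5://localhost:{socks_port}",
    }
```

Merged into a child process's environment, these settings only work for well-behaved clients that honour proxy variables; the isolation itself comes from the missing network interface, not from the variables.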


  Communication Flow

  From the logs and symbols:

  1. VM Start: Swift calls VZVirtualMachine.start() with EFI boot
  2. Guest Ready: VM guest connects (takes ~6 seconds)
  3. SDK Install: Copies /usr/local/bin/claude into VM
  4. Process Spawn: RPC call to spawn /usr/local/bin/claude with args

  The spawn command shows the actual invocation:
  /usr/local/bin/claude --output-format stream-json --verbose \
    --input-format stream-json --model claude-opus-4-5-20251101 \
    --permission-prompt-tool stdio --mcp-config {...}
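
The `stream-json` formats in that invocation suggest the host and the CLI exchange one JSON object per line over stdio. Assuming that framing, a reader for the stream could look like the sketch below; `read_stream_json` is a hypothetical helper and no particular message schema is assumed.

```python
import json
from typing import Iterable, Iterator


def read_stream_json(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of the stream."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        yield json.loads(line)
```

Any iterable of text lines works as input, for example the `stdout` of a `subprocess.Popen` opened with `text=True`.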

jms7035mo ago

Terrible advice to users: be on the lookout for suspicious actions. Humans are terrible at this.

hypfer5mo ago· 9 in thread

People do realize that if they're doing this, they're not feeding "just" code into some probably logging cloud API but literally anything (including, as mentioned here, bank statements), right?

Right?

RIGHT??????

Are you sure that you need to grant the cloud full access to your desktop + all of its content to sort elements alphabetically?

jjcm5mo ago

Some do, some don't.

The reality is there are some of us who truly just don't care. The convenience outweighs the negative. Yesterday I told an agent, "here's my api key and my root password - do it for me". Privacy has long since been dead, but at least for myself opsec for personal work is too.

AstroBen5mo ago

When choosing between convenience and privacy, most people seem to choose convenience

motoboi5mo ago

I have my bank statements on a drive on a cloud. We are way past that phase.

waterTanuki5mo ago

There has to be a way to set permissions, right? The demo video they provided doesn't even need permission to read file contents, just to read the file titles and sort them into folders based on that. It would be a win-win anyway: fewer tokens going into Claude -> lower bill for the customer, more privacy, and more compute available to Anthropic for heavier workloads.
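A names-only view of a folder is easy to produce without ever opening a file. The sketch below shows what such a reduced listing could look like; `list_names` is a hypothetical helper, and whether Cowork offers any permission level like this is not stated anywhere in the thread.

```python
import os


def list_names(folder: str) -> list[dict]:
    """List entry names and whether each is a directory, never reading file contents."""
    entries = []
    with os.scandir(folder) as it:
        for entry in it:
            entries.append({"name": entry.name, "is_dir": entry.is_dir()})
    return sorted(entries, key=lambda e: e["name"])
```

Serialising the result with `json.dumps` would give a model enough to propose a folder layout while file contents stay on disk. File names can still be sensitive, so this reduces exposure rather than removing it.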

fragmede5mo ago

But I don't want alphabetical. Alphabetical is just a known sort order so I can find the file I want. How about it sorts by "this is the file you're looking for"?

TIPSIO5mo ago

Have you ever used any Anthropic AI product? You cannot literally do anything without big permissions, warnings, or annoying always-on popup warning you about safety.

m4635mo ago

     v-- click!
  [ACCEPT] [CANCEL]

hahahahhaah5mo ago

Ship has sailed. I have my deepest secrets in Gmail and Docs. We need big tech to make this as secure as possible from threats. Scammers and nations alike.

1899-12-305mo ago

I pray for whoever has to review the slop I've generated.

1f60c5mo ago· 5 in thread

Anthropic blog posts have always caused a blank page for me, so I had Claude Code dig into it using an 11 MB HAR of a session that reproduces the problem, and it used grep and sed(!) to find the issue in just under 5 minutes (4m56s).

Turns out that the data-prevent-flicker attribute is never removed if the Intellimize script fails to load. I use DNS-based adblock and I can confirm that allowlisting api.intellimize.co solves the problem, but it would be great if this could be fixed for good, and I hope this helps.

h4ch15mo ago

Hope you used these. They can drastically reduce the 11 MB to a couple of hundred kilobytes.

https://github.com/thameera/harcleaner and https://har-sanitizer.pages.dev/
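For a sense of why that works, here is a minimal sketch of the general idea (not the implementation of either linked tool): in the HAR format, response bodies live under `log.entries[].response.content.text`, and dropping them usually removes most of the bulk. The function name `strip_har_bodies` is made up for illustration.

```python
import copy


def strip_har_bodies(har: dict) -> dict:
    """Return a copy of a parsed HAR with response bodies removed.

    URLs, headers, status codes and timings are kept; only the
    `text` (and its `encoding` marker) under response.content is dropped.
    """
    slim = copy.deepcopy(har)
    for entry in slim.get("log", {}).get("entries", []):
        content = entry.get("response", {}).get("content", {})
        content.pop("text", None)
        content.pop("encoding", None)
    return slim
```

Load the file with `json.load`, pass the result through, and write it back with `json.dump`. Note that this would also drop script bodies, so it only suits cases where the request metadata is enough to debug with.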

lelandfe5mo ago

An easier reproduction: disable JS.

To bypass: `.transition_wrap { display: none }`

_giorgio_5mo ago

On Android, these don't work: Firefox, Chrome, Firefox Focus :-(

Thanks anthropic

doesn't work.

motoboi5mo ago

You could have made it much simpler using Playwright MCP.

worldsavior5mo ago

You could figure it out yourself in under 5 minutes. Nothing crazy here.

cc62cf4a4f205mo ago· 5 in thread

It's really quite amazing that people would actually hook an AI company up to data that actually matters. I mean, we all know that they're only doing this to build a training data set to put your business out of business and capture all the value for themselves, right?

simonw5mo ago

A few months ago I would have said that no, Anthropic make it very clear that they don't ever train on customer data - they even boasted about that in the Claude 3.5 Sonnet release back in 2024: https://www.anthropic.com/news/claude-3-5-sonnet

> One of the core constitutional principles that guides our AI model development is privacy. We do not train our generative models on user-submitted data unless a user gives us explicit permission to do so.

But they changed their policy a few months ago so now as-of October they are much more likely to train on your inputs unless you've explicitly opted out: https://www.anthropic.com/news/updates-to-our-consumer-terms

This sucks so much. Claude Code started nagging me for permission to train on my input the other day, and I said "no" but now I'm always going to be paranoid that I miss some opt-out somewhere and they start training on my input anyway.

And maybe that doesn't matter at all? But no AI lab has ever given me a convincing answer to the question "if I discuss company private strategy with your bot in January, how can you guarantee that a newly trained model that comes out in June won't answer questions about that to anyone who asks?"

I don't think that would happen, but I can't in good faith say to anyone else "that's not going to happen".

For any AI lab employees reading this: we need clarity! We need to know exactly what it means to "improve your products with your data" or whatever vague weasel-words the lawyers made you put in the terms of service.

TeMPOraL5mo ago

> I mean, we all know that they're only doing this to build a training data set

That's not a problem. It leads to better models.

> to put your business out of business and capture all the value for themselves, right?

That's both true and paranoid. Yes, LLMs subsume most of the software industry, and many things downstream of it. There's little anyone can do about it; this is what happens when someone invents a brain on a chip. But no, LLM vendors aren't gunning for your business. They neither care, nor have the capability to perform if they did.

In fact my prediction is that LLM vendors will refrain from cannibalizing distinct businesses for as long as they can - because as long as they just offer API services (broad as they may be), they can charge rent from an increasingly large amount of the software industry. It's a goose that lays golden eggs - makes sense to keep it alive for as long as possible.

falloutx5mo ago

It's impossible to explain this to business owners; giving a company this much access can't end well. Right now Google, Slack, and Apple each have a share of the data, but with this Claude can get all of it.

bearjaws5mo ago

This is the AI-era equivalent of "I can't share my ideas because you will steal them"

Reality is good ideas and a few SOPs do not make a successful business.

eZinc5mo ago

It's either that, or you are 100X slower for not using Claude Code. The man-hours saved are most likely worth more than protecting some inputs.

You could also always run a local LLM like GLM for sensitive documents or information on a separate computer, and never expose that to third party LLMs.

You also need to remember that regular employees are still untrustworthy at a base level. There needs to be some obfuscation anyway, since humans can steal your data/info too. Very common case, especially when they run off to China or something to clone your company where IP laws don't matter.

falloutx5mo ago· 5 in thread

Can humans do nothing now? Is it harder to organise your desktop? I thought Apple already organises them into stacks. (edit: Apple already does this)

Is it that hard to check your calendar? It also feels insincere to hold a meeting of, say, 30 minutes to show a Claude-made deck that took 4 seconds to produce.

cwoolfe5mo ago

Agree. Seems to me that if you need something like this to automate your workflow; it's your workflow that needs to change.

xlbuttplug25mo ago

You can still do all these things manually. Now you just have the option not to.

hk__25mo ago

I don’t think this is for _hard_ things but rather for repetitive tasks, or tasks where a human would bring no value. I’ve used Claude for Chrome to search for stays in Airbnb for example; something that is not hard but takes a lot of time to do by hand when you have some precise requirements.

loloquwowndueo5mo ago

It’s not that insincere if all the other attendees are just meeting-taking robots the end result of which will be an automated “summary of the meeting I attended for you” :)

How many people join meetings these days just to zone out and wait for the AI-produced summary at the end?

anthonypasq5mo ago

Can humans do nothing now? Is it that hard to pick the potatoes yourself? You already planted them in rows (nature already does this). is it that hard to water them yourself? also feels insincere to tell your neighbor you grew those potatoes when a machine did everything.

ossa-ma5mo ago· 4 in thread

Every startup is at the mercy of the big 3 (OpenAI, Anthropic, Google).

They can and most likely will release something that vaporises the thin moat you have built around their product.

This feels like the first time in tech where there are more startups/products being subsumed (agar.io style) than being created.

xlbuttplug25mo ago

> They can and most likely will release something that vaporises the thin moat you have built around their product.

As they should if they're doing most of the heavy lifting.

And it's not just LLM adjacent startups at risk. LLMs have enabled any random person with a claude code subscription to pole vault over your drying up moat over the course of a weekend.

dcchambers5mo ago

Best defense is to basically stay small/niche enough that the big guys don't think your work is worth consuming/competing with directly.

There will always be a market for dedicated tools that do really specific things REALLY well.

aroman5mo ago

I think that feeling is what you get when you read too much Hacker News :) There are, in fact, more startups being created now than ever. And I promise you, people said the same thing about going up against IBM back in the day...

Gijs4g5mo ago

When they go wide, you go deep

exitb5mo ago· 4 in thread

It’s kind of funny that apparently most of work that’s left after you automated software development is summarizing meetings and building slide decks.

sensanaty5mo ago

Hey, don't forget booking your flights! Because everyone who has ever flown knows it's very safe to let an RNG machine book something like a flight for you!

falloutx5mo ago

Now they can start saying 90% of the meetings will be done by Claude agents by 2027 (And we will all get free puppies)

ai-christianson5mo ago

Then there's the shuffling around of atoms.

riku_iki5mo ago

> you automated software development

very far from being true

Flux1595mo ago· 4 in thread

This looks useful for people not using Claude Code, but I do think that the desktop example in the video could be a bit misleading (particularly for non-developers) - Claude is definitely not taking screenshots of that desktop & organizing, it's using normal file management cli tools. The reason seems a bit obvious - it's much easier to read file names, types, etc. via an "ls" than try to infer via an image.

But it also gets to one of Claude's (Opus 4.5) current weaknesses - image understanding. Claude really isn't able to understand details of images in the same way that people currently can - this is also explained well with an analysis of Claude Plays Pokemon https://www.lesswrong.com/posts/u6Lacc7wx4yYkBQ3r/insights-i.... I think over the next few years we'll probably see all major LLM companies work on resolving these weaknesses & then LLMs using UIs will work significantly better (and eventually get to proper video stream understanding as well - not 'take a screenshot every 500ms' and call that video understanding).

ElatedOwl5mo ago

I keep seeing “Claude image understanding is poor” being repeated, but I’ve experienced the opposite.

I was running some sentiment analysis experiments: describe the subject and the subject's emotional state, that kind of thing. It picked up on a lot of little details: the brand name of my guitar amplifier in the background, what my t-shirt said and that I must enjoy craft beer and/or running (it was a craft beer 5k kind of thing), and my movement through multiple frames. This was a video sliced into a frame every 500ms; it noticed me flexing, giving the finger, appearing happy, angry, etc. I was really surprised how much it picked up on, and how well it connected those dots together.

EMM_3865mo ago

> Claude is definitely not taking screenshots of that desktop & organizing, it's using normal file management cli tools

Are you sure about that?

Try "claude --chrome" with the CLI tool and watch what it does in the web browser.

It takes screenshots all the time to feed back into the multimodal vision and help it navigate.

It can look at the HTML or the JavaScript but Claude seems to find it "easier" to take a screenshot to find out what exactly is on the screen. Not parse the DOM.

So I don't know how Cowork does this, but there is no reason it couldn't be doing the same thing.

oracleclyde5mo ago

Maybe at one time, but it absolutely understands images now. In VSCode Copilot, I am working on a python app that generates mesh files that are imported in a blender project. I can take a screenshot of what the mesh file looks like and ask Claude code questions about the object, in context of a Blender file. It even built a test script that would generate the mesh and import it into the Blender project, and render a screenshot. It built me a vscode Task to automate the entire workflow and then compare image to a mock image. I found its understanding of the images almost spooky.

minimaxir5mo ago

Claude Opus 4.5 can understand images: one thing I've done frequently in Claude Code and have had great success is just showing it an image of weird visual behavior (drag and drop into CC) and it finds the bug near-immediately.

The issue is that Claude Code won't automatically Read images by default as a part of its flow: you have to very explicitly prompt it to do so. I suspect a Skill may be more useful here.

majormajor5mo ago· 3 in thread

The hero image with a set of steps:

1) Read meeting transcripts 2) Pull out key points 3) Find action items 4) Check Google Calendar 5) Build standup deck

feels like "how to put yourself out of a job 101."

It's interesting to see the marketing material be so straightforward about that.

sepositus5mo ago

But it immediately forgets the results of step 1 by the time it hits step 3 (due to context rot) and starts inventing action items.

catoc5mo ago

I know managers think this is all there is to “work”, but at some point someone needs to do those action items.

comp35mo ago

Lmao it's actually cute watching Anthropic and its employees desperately finding a way to stuff this into people's lives - the reality is most people don't give a hoot about this stuff.

The folks working at these technology firms just don't get what the average person - who makes up most of the population - wants. They produce this fluffy stuff which may appeal to the audience here - but that market segment is tiny.

Also the use case of organising a desktop rocked me off my chair. LMAO!

samiv5mo ago· 2 in thread

Do the people rushing off to outsource their work to chatbots have a plan to explain to their bosses why they still need to have a job?

What's the play after you have automated yourselves out of a job?

Retrain as a skilled worker? Expect to be the lucky winner who is in cahoots with the CEO/CTO and magically gets to keep the job? Expect society to turn to social democracy and produce UBI? Make enough money to live off an investment portfolio?

Davidzheng5mo ago

Many people will have to ask themselves these questions soon regardless of their actions. I don't understand the critique here.

delegate5mo ago

I wonder who the managers are going to manage..

alexdobrenko5mo ago· 2 in thread

I've been using Claude Code in my terminal like a feral animal for months. Building weird stuff. Breaking things. Figuring it out as I go.

Cowork is the nice version. The "here's a safe folder for Claude to play in" version. Which is great! Genuinely. More people should try this.

But!!! The terminal lets you do more. It always will. That's just how it works.

And when Cowork catches up, you'll want to go further. The gap doesn't close. It just moves.

All of this, though, is good? I think??

energy1235mo ago

Isn't this like the "but rsync" comments on Dropbox launch? The vast majority of the addressable market doesn't know what a terminal is.

akurilin5mo ago

I've had a similar experience. My sense is that there's no way this isn't how eventually most of knowledge work at the computer is going to work. Not necessarily through a terminal interface, I expect UIs to evolve quite a bit in the next few years, but having an omnipotent agent in the loop to do all of the gluing and gruntwork for you. Seems inevitable.

theturtletalks5mo ago· 2 in thread

Isn't this just a UI over Claude Code? For most people, using the terminal means you could switch to many different coding CLIs and not be locked into just Claude.

basket_horse5mo ago

> For most people

Most people have no idea what a terminal is.

JLO645mo ago

Most people working office jobs are scared of the terminal though. I see this as not being targeted at the average HN user but for non-technical office job workers. How successful this will be in that niche I'm not certain of, but maybe releasing an app first will give them an edge over the name recognition of ChatGPT/Gemini.

Imnimo5mo ago· 1 in thread

>By default, the main thing to know is that Claude can take potentially destructive actions (such as deleting local files) if it’s instructed to.

What do the words "if it's instructed to" mean here? It seems like Claude can in fact delete files whenever it wants regardless of instruction.

For example, in the video demonstration, they ask "Please help me organize my desktop", and Claude decides to delete files.

olliepro5mo ago

I believe the idea is that it “files away” the files into folders.

d4rkp4ttern5mo ago· 1 in thread

A CLI chat interface seems ideal for when you keep code "at a distance", i.e. if you hardly/infrequently/never want to peek at your code.

But for writing prose, I don't think chat-to-prose is ideal, i.e. most people would not want to keep prose "at a distance".

I bet most people want to be immersed in an editor where they are seeing how the text is evolving. Something like Zed's inline assistant, which I found myself using quite a lot when working on documents.

I was hoping that Cowork might have some elements of an immersive editor, but it's essentially transplanting the CLI chat experience to an ostensibly "less scary" interface, i.e., keeping the philosophy of artifacts separate from your chat.

wek5mo ago

I agree that for writing documents and for a lot of other things like editing csv files or mockups, I want to be immersed in the editor together with Claude Code, not in a chat separated from my editors

forty5mo ago· 1 in thread

I cannot see this page, I'm redirected to https://claude.com/fr-fr/blog/cowork-research-preview which doesn't exist. Private tab doesn't help

sunaookami5mo ago

Same for me but with my language. US defaultism strikes again ;) https://archive.ph/dIVPO here is an archive link that works

tacoooooooo5mo ago· 1 in thread

This looks pretty cool. I keep seeing people (and am myself) using Claude Code for more and more _non-dev_ work. Managing different aspects of life, work, etc. Anthropic has built the best harness right now. Building out the UI makes sense to get genpop adoption

ai-christianson5mo ago

Yeah, the harness quality matters a lot. We're seeing the same pattern at Gobii - started building browser-native agents and quickly realized most of the interesting workflows aren't "code this feature" but "navigate this nightmare enterprise SaaS and do the thing I actually need done." The gap between what devs use Claude Code for vs. what everyone else needs is mostly just the interface.

tolerance5mo ago· 1 in thread

This is the sort of stuff Apple should’ve been trying to figure out instead of messing with app corners and springboards.

elpakal5mo ago

But they created GenMoji?!

simonw5mo ago· 1 in thread

I wrote up some first impressions of Claude Cowork here, including an example of it achieving a task for me (find the longest drafts in my blog-drafts folder from the past three months that I haven't published yet) with screenshots.

https://simonwillison.net/2026/Jan/12/claude-cowork/

redfloatplane5mo ago

I tend to think this product is hard for those of us who've been using `claude` for a few months to evaluate. All I have seen and done so far with Cowork are things _I_ would prefer to do with the terminal, but for many people this might be their first taste of actually agentic workflows. Sometimes I wonder if Anthropic sort of regret releasing Claude Code in its 'runs your stuff on your computer' form - it can quite easily serve as so many other products they might have sold us separately instead!

_pdp_5mo ago· 1 in thread

Yah I wouldn't.

In my opinion, these things are better run in the cloud to ensure you have a properly sandboxed, recoverable environment.

At this point, I am convinced that almost anyone relying heavily on a desktop chat application has far too many credentials scattered on the file system, ready to be grabbed and exploited.

nxobject5mo ago

I wonder if this is what makes immutable package/installation management finally take off...

mceachen5mo ago· 1 in thread

YMMV but TFA page content body didn’t render for me until I disabled my local pihole.

janwillemb5mo ago

Firefox reader mode also helps

redactsureAI5mo ago· 1 in thread

A lot of people here are discussing the security challenges here. If you're interested I'm working on a novel solution to the security of these systems.

Basic ideas are minimal privilege per task in a minimal and contained environment for everything and heavy control over all actions AI is performing. AI can perform tasks without seeing any of your personal information in the process. A new kind of orchestration and privacy layer for zero trust agentic actions.

Redactsure.com

From this feed I figured I'd plug my system, would love your feedback! I believe we are building out a real solution to these security and privacy concerns.

While the entire field is early I do believe systems like my own and others will make these products safe and reliable in the near future.

philipwhiuk5mo ago

> Basic ideas are minimal privilege per task in a minimal and contained environment for everything and heavy control over all actions AI is performing.

The challenge is that no application on desktop is built around these privileges so there's no grant workflow.

Are you bytecode analysing the kernel syscalls an app makes before it runs? Or will it just panic-die when you deny one?

bahmboo5mo ago· 1 in thread

Is there anything similar to this in the local world? I’m setting up a full local “ai” stack on a 48gb MacBook for my sensitive data ops. Using webui. Will still use sota cloud services for coding.

HarHarVeryFunny5mo ago

There are lots of similar tools to Claude Code where a local executor agent talks to a remote/local AI. For example, OpenCode and Aider both support local models as well as remote (e.g. via OpenRouter).

mrcwinn5mo ago· 1 in thread

This product barely works. It can't connect to the browser extension and when I share folders for it to access, nothing happens. I love early previews but maybe one more week?

arthurcolle5mo ago

works fine for me, what's the matter?

redfloatplane5mo ago

Agents for other people, this makes a ton of sense. Probably 30% of the time I use claude code in the terminal it's not actually to write any code.

For instance I use claude code to classify my expenses (given a bank statement CSV) for VAT reporting, and fill in the spreadsheet that my accountant sends me. Or for noting down line items for invoices and then generating those invoices at the end of the month. Or even booking a tennis court at a good time given which ones are available (some of the local ones are north/south facing which is a killer in the evening). All these tasks could be done at least as well outside the terminal, but the actual capability exists - and can only exist - on my computer alone.

I hope this will interact well with CLAUDE.md and .claude/skills and so forth. I have those files and skills scattered all over my filesystem, so I only have to write the background information for things once. I especially like having claude create CLIs and skills to use those CLIs. Now I only need to know what can be done, rather than how to do it - the “how” is now “ask Claude”.

It would be nice to see Cowork support them! (Edit: I see that the article mentions you can use your existing 'connectors' - MCP servers I believe - and that it comes with some skills. I haven't got access yet so I can't say if it can also use my existing skills on my filesystem…)

(Follow-up edit: it seems that while you can mount your whole filesystem and so forth in order to use your local skills, it uses a sandboxed shell, so your local commands (for example, tennis-club-cli) aren't available. It seems like the same environment that runs Claude Code on the Web. This limits the use for the moment, in my opinion. Though it certainly makes it a lot safer...)

flyingzucchini5mo ago

For $200 month I’ll arrange my own desktop icons thanks. (Isn’t there a more compelling use case?)

jfletch3215mo ago

It's a little funny how the "Stay in control" section is mostly about how quickly you can lose control (deleting files, prompt injections). I can foresee non-technical users giving access to unfortunate folders and getting into a lot of trouble.

cwoolfe5mo ago

Is anybody out there actually being more productive in their office work by using AI like this? AI for writing code has been amazing but this office stuff is a really hard sell for me. General office/personal productivity seems to be the #1 use-case the industry is trying to sell but I just don't see it. What am I missing here?

steipete5mo ago

Funny timing. Written in 10 days just when this took off. https://clawd.bot/

jameslk5mo ago

This is the natural evolution of coding agents. They're the most likely to become general purpose agents that everyone uses for daily work because they have the most mature and comprehensive capability around tool use, especially on the filesystem, but also in opening browsers, searching the web, running programs (via command line for now), etc. They become your OS, colleague, and likely your "friend" too

I just helped a non-technical friend install one of these coding agents, because it's the best way to use an AI model today that can do more than give him answers to questions. I'm not surprised to see this announced and I would expect the same to happen with all the code agents becoming generalized like this

The biggest challenge towards adoption is security and data loss. Prompt injection and social engineering are essentially the same thing, so I think prompt injection will have to be solved the same way. Data loss is easier to solve with a sandbox and backups. Regardless, I think for many the value of using general purpose agents will outweigh the security concerns for now, until those catch up

btown5mo ago

For those worried about irrevocable changes, sometimes a good plan is all the output.

Claude Code is very good at `doc = f(doc, incremental_input)` where doc is a code file. It's no different if doc is a _prompt file_ designed to encapsulate best practices.
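To make the `doc = f(doc, incremental_input)` shape concrete, here is a toy sketch. The `revise` function is a stand-in I made up for the model call (it only appends lines it hasn't seen), so treat it as an illustration of the loop, not of what Claude does with each input.

```python
from functools import reduce


def revise(doc: str, new_input: str) -> str:
    """Stand-in for the model: fold one new input into the document.

    Here it just appends the input as a new line unless it is already present.
    """
    if new_input in doc.splitlines():
        return doc
    return doc + ("\n" if doc else "") + new_input


def fold(doc: str, inputs: list[str]) -> str:
    """Apply doc = f(doc, input) once per input, in order."""
    return reduce(revise, inputs, doc)
```

Because every step takes the previous document and returns the next one, each intermediate version can be saved and diffed, which is what makes the result inspectable.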

Hand it a set of unstructured SOP documents, give it access to an MCP for your email, and have it gradually grow a set of skills that you can then bring together as a knowledge base auto-responder instruction-set.

Then, unlike many opaque "knowledge-base AI" products, you can inspect exactly how over-fitted those instructions are, and ask it to iterate.

What I haven't tried is whether Cowork will auto-compact as it goes through that data set, and/or take max-context-sized chunks and give them to a sub-agent who clears its memory between each chunk. Assuming it does, it could be immensely powerful for many use cases.
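The chunk-then-fresh-sub-agent idea could look roughly like the sketch below. Assumptions, all mine: character count stands in for a real token count, and `chunk_by_budget` is a hypothetical helper name. Whether Cowork does anything like this is exactly the open question.

```python
def chunk_by_budget(items: list[str], budget: int) -> list[list[str]]:
    """Greedily group items so each chunk's total length stays within budget.

    An item longer than the budget still gets a chunk of its own.
    """
    chunks: list[list[str]] = []
    current: list[str] = []
    used = 0
    for item in items:
        size = len(item)  # stand-in for a token count
        if current and used + size > budget:
            chunks.append(current)
            current, used = [], 0
        current.append(item)
        used += size
    if current:
        chunks.append(current)
    return chunks
```

Each chunk would then go to a worker that starts with empty memory, and only the worker's summary gets carried forward.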

Wowfunhappy5mo ago

Under the hood, is this running shell commands (or Apple events) or is it actually clicking around in the UI?

If the latter, I'm a bit skeptical, as I haven't had great success with Claude's visual recognition. It regularly tells me there's nothing wrong with completely broken screenshots.

lossolo5mo ago

I would like to thank the 100,000 people in Madagascar[1] who made it all possible by creating training data for ~€0.30 per hour.

1. https://www.youtube.com/watch?v=Q7NZK6h9Tvo

appsoftware5mo ago

The thing about Claude Code is that it's usually used in version-controlled directories. If Claude f**s up badly, I can revert to a previous git commit. If it runs amok on my office documents, I'm going to have a harder time recovering those.
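For folders that aren't under git, a crude substitute is to copy the folder before letting an agent loose on it. A minimal sketch with hypothetical function names, using only the standard library; it assumes the folder is small enough to copy wholesale.

```python
import shutil
from pathlib import Path


def snapshot(folder: Path, backup_root: Path, label: str) -> Path:
    """Copy the whole folder to backup_root/label and return that path."""
    dest = backup_root / label
    shutil.copytree(folder, dest)  # fails if dest already exists
    return dest


def restore(snapshot_dir: Path, folder: Path) -> None:
    """Throw away the folder's current state and put the snapshot back."""
    shutil.rmtree(folder)
    shutil.copytree(snapshot_dir, folder)
```

The snapshot has to live outside any folder the agent can write to, otherwise it can be mangled along with everything else.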

jdeng5mo ago

Exciting to see Anthropic validate the "AI coworker" direction. We're building VITA AI (https://vita-ai.net) with similar philosophy but for enterprise QA testing.

One key architectural difference: Cowork runs sandboxed VMs on your local macOS machine, but we run sandboxes entirely in the cloud. This means:

- True isolation - agents never touch your local files or network, addressing the security concerns raised in this thread

- Actual autonomy - close your laptop, agent keeps working. Like delegating to a real coworker, not pairing with an assistant

- Scale - spin up 10 test agents without melting your CPU

The trade-off is latency and offline capability, but for testing workflows (our focus), asynchronous cloud execution is actually the desired model. You assign "test the checkout flow," go to lunch, come back to a full test report + artifacts.

Different use cases, different architectures. But the broader trend feels right - moving from conversational assistants to autonomous agents that operate independently.

krm015mo ago

I’ve tried just about every system for keeping my desktop tidy: folders, naming schemes, “I’ll clean it on Fridays,” you name it. They all fail for the same reason: the desktop is where creative work wants to spill out. It’s fast, visual, and forgiving. Cleaning it is slow, boring, and feels like admin.

Claude Cleaner, I mean Cowork will be sweeping my desktop every Friday.

I'm sure it'll be useful for more stuff but man…

hmokiguess5mo ago

This seems like a thin client UX running Claude Code for the less technical user.

fennecfoxy5mo ago

Hmm. I'm building something (quick and dirty) at the moment that looks at analysing customer service data.

Something like this is promising but from what I can see, still lacking. So far I've been dealing with the regular issues (models aren't actually that smart, work with their strengths and weaknesses) but also more of the data problem - simple embeddings just aren't enough, imo. And throwing all of the data at the model is just asking for context poisoning, hallucinations and incorrect conclusions.

Been playing with instruction tuned embeddings/sentiment and almost building a sort of "multimodal" system of embedding to use with RAG/db calls. What I call "Data hiding" as well - allowing the model to see the shape of the data but not the data itself, except only when directly relevant.
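One way to picture the "data hiding" idea, as a sketch under my own assumptions (records are plain dicts, and `describe_shape` is an invented name): summarise field names, value types and counts, and hand the model only that summary.

```python
def describe_shape(records: list[dict]) -> dict:
    """Summarise which fields exist and what types they hold, without values."""
    shape: dict[str, dict] = {}
    for record in records:
        for key, value in record.items():
            info = shape.setdefault(key, {"types": set(), "count": 0})
            info["types"].add(type(value).__name__)
            info["count"] += 1
    # Convert sets to sorted lists so the summary is stable and serialisable.
    return {
        key: {"types": sorted(info["types"]), "count": info["count"]}
        for key, info in shape.items()
    }
```

The model can plan a query against the shape, and the actual rows only enter the context once a specific lookup turns out to be relevant.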

arjie5mo ago

This sounds really interesting. Perhaps this is the promise that Copilot was not. I'm really hoping that this gives people like my wife access to all the things I use Claude Code for.

I use Claude Code for everything. I have a short script in ~/bin/ called ,cc that I launch that starts it in an appropriate folder with permissions and contexts set up:

      ~ tree ~/claude-workspaces -d
    /Users/george/claude-workspaces
    ├── context-creator
    ├── imessage
    │   └── tmp
    │       └── contacts-lookup
    ├── modeler
    ├── research
    ├── video
    └── wiki

I'll usually pop into one of these (say, video) and say something stupid like: "Find the astra crawling video and stabilize it to focus on her and then convert into a GIF". That one knows it has to look in ~/Movies/Astra and it'll do the natural thing of searching for a file named crawl or something and then it'll go do the rest of the work.

Likewise, the `modeler` knows to create OpenSCAD files and so on, the `wiki` context knows that I use Mediawiki for my blog and have a Template:HackerNews and how to use it and so on. I find these make doing things a lot easier and, consequently, more fun.
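The actual `,cc` script isn't shown here, so the following is only a guess at its general shape: pick a workspace folder and start `claude` with that folder as the working directory, so that a CLAUDE.md there gets picked up. Both function names are hypothetical, no extra CLI flags are passed, and it assumes a `claude` executable on PATH.

```python
import subprocess
from pathlib import Path


def workspace_dir(name: str, root: Path) -> Path:
    """Resolve a workspace name to its folder, refusing unknown names."""
    path = root / name
    if not path.is_dir():
        raise ValueError(f"unknown workspace: {name}")
    return path


def launch(name: str, root: Path = Path.home() / "claude-workspaces") -> int:
    """Start Claude Code inside the chosen workspace and return its exit code."""
    # Assumption: `claude` is on PATH; per-workspace setup lives in the folder itself.
    return subprocess.run(["claude"], cwd=workspace_dir(name, root)).returncode
```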

All of this data is trusted information: i.e. it's from me so I know I'm not trying to screw myself. My wife is less familiar with the command-line so she doesn't use Claude Code as much as me, and prefers to use ChatGPT the web-app for which we've built a couple of custom GPTs so we can do things together.

Claude is such a good model that I really want to give my wife access to it for the stuff she does (she models in Blender). The day that these models get really good at using applications on our behalf will be wonderful! Here's an example model we made the other day for the game Power Grid: https://wiki.roshangeorge.dev/w/Blog/2026-01-11/Modeling_Wit...

monarchwadia5mo ago

This is a great idea! I'm building something very similar with https://practicalkit.com , which is the same concept done differently.

It will be interesting for me, trying to figure out how to differentiate from Claude Cowork in a meaningful way, but there's a lot of room here for competition, and no one application is likely to be "the best" at this. Having said that, I am sure Claude will be the category leader for quite a while, with first mover advantage.

I'm currently rolling out my alpha, and am looking for investment & partners.

mintflow5mo ago

I like this idea but really do not want to share my personal data with cloud-based LLM vendors.

I have a Git-controlled folder containing various markdown files that serve as my personal knowledge base and work planning files (it's a long story, but I gradually migrated from EverNote -> OneNote -> Obsidian -> plain markdown files + Git). Last time, I tried to wire a local LLM API (using LMStudio) to claude code/open code and use the agent to analyze some documents, but the results were not good: either it couldn't find the files or the answer quality was bad.

break_the_bank5mo ago

We’re building something very similar but with files in the cloud instead.

Try it https://tabtabtab.ai

Would love some feedback!

tinyhouse5mo ago

I'm already using Claude Code to organize my work and life so this makes a lot of sense. However, I just tried it and it's not clear how this is different than using Claude with projects. I guess the main difference is that it can be used within a local folder on one's computer, so it's more integrated into one's workflow, rather than a project where you need to upload your data. This makes sense.

slimebot805mo ago

"Claude can’t read or edit anything you don’t give it explicit access to"

How confident are we that this is a strict measure?

I personally have zero confidence in Claude rulesets and settings as a way to fence it in. I've seen Claude decide desperately for itself what to access once it has context bloat? It can tend to ignore rules?

Unless there is an OS-level restriction they are adhering to?

jpcompartir5mo ago

I've been working with a claude-specific directory in Claude Code for non-coding work (and the odd bit of coding/documentation stuff) since the first week of Claude Code, or even earlier - I think when filesystem MCP dropped.

It's a very powerful way to work on all kinds of things. V. interested to try co-work when it drops to Plus subscribers.

philip12095mo ago

This is cool, but Claude for Chrome seems broken - authentication doesn't work and there's a slew of recent reviews on the Chrome extension mentioning it.

Sharing here in case anybody from Anthropic sees and can help get this working again.

It may seem off-topic, but I think it hurts developer trust to launch new apps while old ones are busted.

codebyaditya5mo ago

Cowork feels like a real step toward usable agent AI — letting Claude actually interact with your files rather than just answer questions. But that also means we’ll really learn how robust (and safe) this stuff is once people start trying it on messy, real workflows instead of toy tasks.

spm10015mo ago

I need to go and do some proper timings but for comparable questions and inputs this feels a lot faster. Possible I’m just being beguiled by the UI but it does seem as though the responses are coming back faster.

Is it possible this gets access to a faster API tier?

Olshansky5mo ago

This is great, but it saddens me that this is still just the average total compensation of a single engineer at Anthropic.

Unsure what the future looks like unless Frontier Labs start financing everything that is open source.

sergiotapia5mo ago

Can it use the browser or the machine like a human? Meaning I can ask it to find a toaster on http://Target.com and it'll open my browser and try it?

kingkongjaffa5mo ago

When I need to create something like a powerpoint or whatever I use claude code and invoke a claude skill that knows how to do it. Why would I use claude cowork instead of that?

sbinnee5mo ago

A week ago I pitched to my managers that this form of general purpose claude code will come out soon. They were rather skeptical saying that claude code is just for developers. Now they can see.

ambicapter5mo ago

This is interesting because in the other thread about Anthropic/Claude Code, people are arguing that Anthropic is right to focus on what CC is good at (writing code).

system25mo ago

I use Claude 8+ hours per day. But this is probably the scariest use I can think of. An agent running with full privileges with no restriction. What can go wrong?

thiagowfx5mo ago

Since it is an agent, I wonder why they didn’t go with “Claude Coworker” instead.

On the other hand, it’s not “Claude Coder”, then it’s at least consistent.

lasgawe5mo ago

This comes with thousands of unknown attacks. When these kinds of features are introduced, we have to find ways to bypass them.

rao-v5mo ago

Cowork + litellm proxy + a local vision LLM should work incredibly well for overnight tasks like organizing md files, photos, etc.

kewun5mo ago

I tried it out and it couldn't help me unsubscribe from spam/newsletter as it couldn't click the unsubscribe button.

StarterPro5mo ago

Damn, yall can't do anything by yourselves.

tolodot5mo ago

Unless this works almost exactly like Claude Code (minus GitHub) it will end up subtracting a lot of what makes cc so powerful.

rshanreddy5mo ago

Have still not been able to get a query to work. "Sending request" or other errors at every turn.

sparkalpha5mo ago

Tried Claude Cowork and Chatlily. Interesting idea, but Claude still feels stronger for my use cases.

650REDHAIR5mo ago

I tried to get Claude to build me a spreadsheet last night. I was explicit in that I wanted an excel file.

It’s made one in the past for me with some errors, but a framework I could work with.

It created an “interactive artifact” that wouldn’t work in the browser or their apps. Gaslit me for 3 revisions of me asking why it wasn’t working.

Created a text file that it wanted me to save as a .csv to import into excel that failed hilariously.

When I asked it to convert the csv to an excel file it apologized and told me it was ready. No file to download.

I asked where the file was and it apologized again and told me it couldn’t actually do spreadsheets and at that point I was out of paid credits for 4 more hours.

WesleyLivesay5mo ago

Really like the look of this. I use Claude Code (and other CLI LLM tools) to interact with my large collection of local text files which I usually use Obsidian to write/update. It has been awesome at organization, summarization, and other tasks that were previously really time consuming.

Bringing that type of functionality to a wider audience and out of the CLI could be really cool!

catoc5mo ago

If you don’t mind the terminal, what is the benefit of Cowork over Code? The sandboxing?

insanebrain5mo ago

This is like asking a hallucinating robot to paint your house using a sledgehammer

sharyphil5mo ago

This is incredible. Waiting for the rollout on other platforms. I really need it.

cm20125mo ago

Nothing important is in my file system, it's all in google drive, gmail, and slack.

melonpan75mo ago

Personally I've only ever used Claude Code for coding.

imagetic5mo ago

I see the sales people completed their takeover...

Jamie4525mo ago

Is claude down? I can't create a new chat.

1 more reply

fluidcruft5mo ago

I mean this as genuinely non-snarkily as possible: I have been literally building my own personal productivity and workflow tools that could do things as shown.

Would continuing to work on these things now be a violation of the Claude terms of service that could get me banned from claude-code?

brunoborges5mo ago

Anthropic: we will do the Code button first, then we implement Non-Code button.

OpenAI: we will do the Non-Code button first, then we implement the Code button.

1 more reply

cryptoegorophy5mo ago

It seems very similar to cursor AI?

pentagrama5mo ago

I think the next step for these big AI companies will be to launch their own operating systems, probably Linux distributions.

berryg5mo ago

I cannot read the pages on the Claude website. I am using pi-hole and that causes the text not to render. Annoying.

nunez5mo ago

yeah, you shouldn't need to create a deck for a standup...

otherwise, looks interesting.

m4ck_5mo ago

can it play games for me? the factory must grow but I also need to cook dinner.

basedrum5mo ago

Can't load page contents

jeisc5mo ago

everybody knows that the only secure computer is one which is unplugged

focusgroup05mo ago

The Death of The Email Job

1 more reply

scottLobster5mo ago

Yeah, unless there's some automatic backup/snapshot implemented before any actions are taken, hard pass on this. Or at least I won't be using it on anything I'm not willing to 100% lose. Maybe give it read-only access and have it put results in a designated output folder?

Particularly in a work environment, one misfire could destroy months or years of important information.
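The "snapshot before any actions are taken" idea can be approximated by hand with a plain directory copy. The sketch below is a minimal illustration under that assumption: the function names and backup location are hypothetical, it is not something Cowork is known to do, and a full copy is only practical for folders of modest size.

```python
import shutil
import tempfile
from pathlib import Path


def snapshot_folder(folder: Path, backup_root: Path) -> Path:
    """Copy `folder` into a fresh, uniquely named directory under `backup_root`."""
    backup_root.mkdir(parents=True, exist_ok=True)
    # mkdtemp guarantees a new directory, so repeated snapshots never collide.
    destination = Path(tempfile.mkdtemp(prefix=f"{folder.name}-", dir=backup_root))
    shutil.copytree(folder, destination, dirs_exist_ok=True)
    return destination


def restore_folder(snapshot: Path, folder: Path) -> None:
    """Replace `folder` with the contents of an earlier snapshot."""
    if folder.exists():
        shutil.rmtree(folder)
    shutil.copytree(snapshot, folder)
```

Taking a snapshot right before handing a folder to an agent, and keeping the backup root outside any folder the agent can reach, gives a crude undo for exactly the misfire described above.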

1 more reply

FatherOfCurses5mo ago

Cowork: the 2026 version of training your offshore replacement.

daft_pink5mo ago

Now if there was just an easy and efficient way to drop a bunch of files into a directory.

goaaron5mo ago

Claude what's happening tomorrow ahghhg!!! hate this lol

zurfer5mo ago

I'm a bit shocked to see so many negative comments here on HN. Yes, there are security risks and all but honestly this is the future. It's a great amplifier for hackers and people who want to get stuff done.

It took some training but I'm now starting almost all tasks with claude code: need to fill out some word document, organize my mail inbox, write code, migrate blog posts from one system to another, clean up my computer...

It's not perfect perfect, but I'm having fun and I know I'm getting a lot of things done that I would not have dared to try previously.

4 more replies

felixrieseberg5mo ago· 31 in thread

Hi, Felix from the team here, this is my product - let us know what you think. We're on purpose releasing this very early, we expect to rapidly iterate on it.

(We're also battling an unrelated Opus 4.5 inference incident right now, so you might not see Cowork in your client right away.)

deanc5mo ago

I’ve been trying to reach a human at Anthropic for a week now to clarify this on behalf of our company but can’t get past your AI support.

5 more replies

bashtoni5mo ago

Hi Felix!

Simple suggestion: logo should be a cow and an orc to match how I originally read the product name.

3 more replies

dcreater5mo ago

tildef5mo ago

1 more reply

politelemon5mo ago

Hi there, your training and inference rely on the openness of Linux. Would you consider giving something back with Claude for Linux?

Recursing5mo ago

What probability would you give for Linux support for Claude Desktop in 2026?

2 more replies

hoss14744895mo ago

3 more replies

jchung5mo ago

This has been one of the biggest bottlenecks for our company: not the capability of the agents themselves -- the tools needed to roll them out responsibly.

tkgally5mo ago

The summary looked good, so I then described the two tasks to Claude and told it to start working.

Its project proposal revision was just about perfect. It took me only about 10 more minutes to polish it further and send it off.

I had not been looking forward to either of those two tasks, so it’s a relief to get them done more quickly than I had expected.

martinald5mo ago

Hey, congrats on the launch. Been thinking a lot about this space (wrote this back in August: https://martinalderson.com/posts/building-a-tax-agent-with-c...).

Would love to connect, my email's in my bio if you have time!

mastercheif5mo ago

Hi Felix, this looks like an incredible tool. I've been helping non-tech people at my org make agent flows for things like data analysis—this is exactly what they need.

However, I don't see an option for AWS Bedrock API in the sign up form, is it planned to make this available to those using Bedrock API to access Claude models?

skybrian5mo ago

Being able to undo any changes that Cowork makes seems important. Any plans for automatic snapshots or an undo log?

RamblingCTO5mo ago

Was looking forward to trying it, but just processing a Notion page and preparing an outline for a report breaks it: This is taking longer than usual...(14m 2s)

/e: stopped it and retried. it seems it can't use the connectors? I get No such tool available

torben-friis5mo ago

Question: I see that the “actions hints” in the demo show messaging people as an option.

Is this a planned usecase, for the user to hand over human communication in, say, slack or similar? What are the current capabilities and limitations for that?

pritambarhate5mo ago

I guess you need to know about this: https://news.ycombinator.com/item?id=46597781

9dev5mo ago

Hey Felix, would love to give you feedback, but the language redirect of the website is trying to route me to de-de, and thus I can't see the page.

You might want to fix this.

1 more reply

andreygrehov5mo ago

Why do all similar demos show “prep the deck” use case as if everybody is building power point slides all day long?

1 more reply

VadimPR5mo ago

Would love to see a Linux native application for this, after all a lot of folks are using it more and more these days.

tekacs5mo ago

Hullo! Congrats on shipping this, it looks great!

I'm very curious about what you mean by 'cross device sync' in the post?

pikseladam5mo ago

Do you expect more token usage with it, or will Anthropic change users' token limits in the future?

carlo-notion5mo ago

Cheers Felix, congrats on the launch!

tiahura5mo ago

The announcement says existing connectors work, but only Claude for chrome does.

oidar5mo ago

jscottmiller5mo ago

Looks good so far - I hope Windows support follows soon!

mkbkn5mo ago

Can you release custom GPTs like ChatGPT has?

bibimsz5mo ago

would like to be able to point at aws bedrock models like i can with claude code

column5mo ago

Hi! Windows support when?

BaudouinVH5mo ago

hello Felix, that page is 404 here at the moment :(

jmkni5mo ago

Congrats Felix :)

motoboi5mo ago

Please give me access via api key

1 more reply

dabedee5mo ago

It's great and reassuring to know that, in this day and age, products still get made entirely by one individual.

> Hi, Felix from the team here, this is my product - let us know what you think. We're on purpose releasing this very early, we expect to rapidly iterate on it.

> (We're also battling an unrelated Opus 4.5 inference incident right now, so you might not see Cowork in your client right away.)

2 more replies

jryio5mo ago· 21 in thread

It's so important to remember that unlike code which can be reverted - most file system and application operations cannot.

There's no sandboxing, snapshots, revision history, rollbacks, or anything.

I expect to see many stories from parents, non-technical colleagues, and students who irreparably ruined their computer.

Edit: most comments are focused on pointing out that version control & file system snapshots exist: that's wonderful, but Claude Cowork does not use them.

For those of us who have built real systems at low levels I think the alarm bells go off seeing a tool like this - particularly one targeted at non-technical users

Workaccount25mo ago

Frequency vs. convenience will determine how big of a deal this is in practice.

Cars have plenty of horror stories associated with them, but convenience keeps most people happily driving everyday without a second thought.

Google can quarantine your life with an account ban, but plenty of people still use gmail for everything despite the stories.

So even if Claude cowork can go off the rails and turn your digital life upside down, as long as the stories are just online or "friend of a friend of a friend", people won't care much.

4 more replies

alwillis5mo ago

The first version is for macOS, which has snapshots [1] and file versioning [2] built-in.

[1]: https://eclecticlight.co/2024/04/08/apfs-snapshots/

[2]: https://eclecticlight.co/2021/09/04/explainer-the-macos-vers...

2 more replies

falcor845mo ago

1 more reply

hopelite5mo ago

toddmorey5mo ago

Q: What would prevent them from using git style version control under the hood? User doesn’t have to understand git, Claude can use it for its own purposes.

3 more replies

y425mo ago

4 more replies

kamaal5mo ago

>>I expect to see many stories from parents, non-technical colleagues, and students who irreparably ruined their computer.

I do believe the approach Apple is taking is the right way when it comes to user facing AI.

You need to reduce AI to being an appliance that does one or at most a few things perfectly right without many controls with unexpected consequences.

Real fun is robots. Not sure why no one is hurrying up on that end.

>>Edit: most comments are focused on pointing out that version control & file system snapshot exists: that's wonderful, but Claude Cowork does not use it.

You are right in your analysis that many people are going to end up with totally broken systems

bob10295mo ago

The base model itself is biased away from actions that would lead to large scale destruction. Compound over time and you probably never get anywhere too scary.

seunosewa5mo ago

There's no reason why Claude can't use git to manage the folders that it controls.
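As a rough illustration of what "use git to manage the folders" could mean in practice, here is a minimal sketch that commits a folder's state before an agent run and rolls back afterwards. It assumes the `git` CLI is installed; the helper names and the commit identity are hypothetical, and nothing here describes how Cowork itself works. Note that git does not track empty directories or ignored files, so this is not a complete snapshot.

```python
import subprocess
from pathlib import Path

# Hypothetical identity so commits succeed even without a global git config.
GIT_IDENTITY = ["-c", "user.name=agent-snapshots", "-c", "user.email=agent@example.invalid"]


def git(folder: Path, *args: str) -> str:
    """Run a git command inside `folder` and return its trimmed stdout."""
    result = subprocess.run(
        ["git", *GIT_IDENTITY, *args],
        cwd=folder, check=True, capture_output=True, text=True,
    )
    return result.stdout.strip()


def checkpoint(folder: Path, message: str) -> str:
    """Commit the folder's current state and return the commit hash."""
    if not (folder / ".git").exists():
        git(folder, "init", "--quiet")
    git(folder, "add", "--all")
    git(folder, "commit", "--quiet", "--allow-empty", "-m", message)
    return git(folder, "rev-parse", "HEAD")


def rollback(folder: Path, commit: str) -> None:
    """Discard every change made since `commit`, including new untracked files."""
    git(folder, "reset", "--quiet", "--hard", commit)
    git(folder, "clean", "--quiet", "-fd")
```

A wrapper would call `checkpoint(folder, "before agent run")` first and keep the returned hash; `rollback(folder, commit_hash)` then undoes whatever the agent did to tracked and newly created files.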

2 more replies

Weryj5mo ago

TimeMachine has never been so important.

2 more replies

hans0l0745mo ago

Aeolun5mo ago

If this is like Claude Code for everyone else, shouldn’t it be snapshotting anything it changes so that you can go back to the previous state?

machiaweliczny5mo ago

Yeah, seems like this could be achieved by using https://github.com/streamich/memfs/blob/master/docs/snapshot...

Weird they don't use it - might backfire hard

matt3D5mo ago

Pretty much every company I work with uses the desktop sync tools for OneDrive/GoogleDrive/Dropbox etc.

It would be madness to work completely offline these days, and all of these systems have version history and document recovery built in.

__MatrixMan__5mo ago

Helmut100015mo ago

I would never use what is proposed by OP. But, in any case, Linux on ZFS that is automatically snapshotted every minute might be (part of) a solution to this dilemma.

akurilin5mo ago

You make a good point. I imagine that they will eventually add Perforce-style versioning to the product and this issue will be solved.

o_m5mo ago

So the future is NixOS for non-technical people?

2 more replies

big-chungus45mo ago

A human can also accidentally delete or mess up some files. The question is whether Claude Cowork is more prone to it.

heliumtera5mo ago

There were a couple of posts here on hacker news praising agents because, it seems, they are really good at being a sysadmin. You don't need to be a non-technical user to be utterly fucked by AI.

1 more reply

neocron5mo ago

Not a big problem to make snapshots with lvm or zfs and others. I use it automatically on every update

2 more replies

simonw5mo ago· 18 in thread

(I don't think it's fair to ask non-technical users to look out for "suspicious actions that may indicate prompt injection" personally!)

felixrieseberg5mo ago

There is much more to do - and our docs reflect how early this is - but we're investing in making progress towards something that's "safe".

9 more replies

viraptor5mo ago

> (I don't think it's fair to ask non-technical users to look out for "suspicious actions that may indicate prompt injection" personally!)

It's the "don't click on suspicious links" of the LLM world and will be just as effective. It's the system they built that should prevent those being harmful, in both cases.

3 more replies

cyanydeez5mo ago

There's no AI that's secure and capable of doing anything an idiot would do on the internet with whatever data you give it.

This is a perfect encapsulation of the same problem: https://www.reddit.com/r/BrandNewSentence/comments/jx7w1z/th...

Substitute AI with Bear

1 more reply

ashishb5mo ago

That's why I run it inside a sandbox - https://github.com/ashishb/amazing-sandbox

2 more replies

lifetimerubyist5mo ago

Prompt injection will never be "solved". It will always be a threat.

3 more replies

heliumtera5mo ago

You brought this up a couple of times now, would appreciate clarification.

1 more reply

redfloatplane5mo ago

I do get a "Setting up Claude's workspace" when opening it for the first time - it appears that this does do some kind of sandboxing (shared directories are mounted in).

1 more reply

nezhar5mo ago

I built https://github.com/nezhar/claude-container for exactly this reason - it's easy to make mistakes with these agents even for technical users, especially in yolo mode.

1 more reply

imovie45mo ago

> (I don't think it's fair to ask non-technical users to look out for "suspicious actions that may indicate prompt injection" personally!)

Yes, but at least for now it's restricted to Claude Max subscribers, who are likely to be at least semi-technical (or at least use AI a lot)?

aussieguy12345mo ago

If you're on Linux, you can run AI agents in Firejail to limit access to certain folders/files.
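One way to do this is Firejail's `--private=DIR` option, which makes the given directory the sandboxed program's home so the real home directory is hidden. The sketch below only builds the command line; the helper name is hypothetical, and it assumes the agent binary is installed outside the home directory (otherwise it would be hidden too).

```python
from pathlib import Path


def firejail_command(workspace: Path, agent_argv: list[str]) -> list[str]:
    """Build a firejail invocation that uses `workspace` as the agent's home."""
    if not agent_argv:
        raise ValueError("agent_argv must name a program to run")
    # --private=DIR mounts DIR as the home directory inside the sandbox.
    return ["firejail", f"--private={workspace}", *agent_argv]
```

For example, `firejail_command(Path("/home/me/agent-work"), ["claude"])` yields a command that can be passed to `subprocess.run`; the agent then sees only that folder where it expects your home directory.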

2 more replies

schmuhblaster5mo ago

container2wasm seems interesting, but it runs a full blown x86 or ARM emulator in WASM which boots an image derived from a docker container [0].

[0] https://github.com/container2wasm/container2wasm

1 more reply

sureglymop5mo ago

1 more reply

jen729w5mo ago

> tells users "Avoid granting access to local files with sensitive information, like financial documents"

Good job that video of it organising your Desktop doesn't show folders containing 'Documents', 'Photos', and 'Projects'!

Oh wait.

bandrami5mo ago

My entire job is working with financial documents so this doesn't really do much for me

1 more reply

antidamage5mo ago

How does prompt injection happen? Or is it more a new link in a chain of existing failures?

1 more reply

fennecfoxy5mo ago

Problem is technical people on average (I wouldn't say all of us) know what we don't know. I'm naturally cautious when running new stuff or even just trying something new in life.

btucker5mo ago

ETA: used Claude Code to reverse engineer it:

   Insight ─────────────────────────────────────

  Claude.app VM Architecture:
  1. Uses Apple's Virtualization.framework (only on ARM64/Apple Silicon, macOS 13+)
  2. Communication is via VirtioSocket (not stdio pipes directly to host)
  3. The VM runs a full Linux system with EFI/GRUB boot

  ─────────────────────────────────────────────────

        ┌─────────────────────────────────────────────────────────────────────────────────┐
        │  macOS Host                                                                     │
        │                                                                                 │
        │  Claude Desktop App (Electron + Swift native bindings)                          │
        │      │                                                                          │
        │      ├─ @anthropic-ai/claude-swift (swift_addon.node)                           │
        │      │   └─ Links: Virtualization.framework (ARM64 only, macOS 13+)            │
        │      │                                                                          │
        │      ↓ Creates/Starts VM via VZVirtualMachine                                   │
        │                                                                                 │
        │  ┌──────────────────────────────────────────────────────────────────────────┐  │
        │  │  Linux VM (claudevm.bundle)                                              │  │
        │  │                                                                          │  │
        │  │  ┌────────────────────────────────────────────────────────────────────┐  │  │
        │  │  │  Bubblewrap Sandbox (bwrap)                                        │  │  │
        │  │  │  - Network namespace isolation (--unshare-net)                     │  │  │
        │  │  │  - PID namespace isolation (--unshare-pid)                         │  │  │
        │  │  │  - Seccomp filtering (unix-block.bpf)                              │  │  │
        │  │  │                                                                    │  │  │
        │  │  │  ┌──────────────────────────────────────────────────────────────┐  │  │  │
        │  │  │  │  /usr/local/bin/claude                                       │  │  │  │
        │  │  │  │  (Claude Code SDK - 213MB ARM64 ELF binary)                  │  │  │  │
        │  │  │  │                                                              │  │  │  │
        │  │  │  │  --input-format stream-json                                  │  │  │  │
        │  │  │  │  --output-format stream-json                                 │  │  │  │
        │  │  │  │  --model claude-opus-4-5-20251101                            │  │  │  │
        │  │  │  └──────────────────────────────────────────────────────────────┘  │  │  │
        │  │  │       ↑↓ stdio (JSON-RPC)                                          │  │  │
        │  │  │                                                                    │  │  │
        │  │  │  socat proxies:                                                    │  │  │
        │  │  │  - TCP:3128 → /tmp/claude-http-*.sock (HTTP proxy)                │  │  │
        │  │  │  - TCP:1080 → /tmp/claude-socks-*.sock (SOCKS proxy)              │  │  │
        │  │  └────────────────────────────────────────────────────────────────────┘  │  │
        │  │                                                                          │  │
        │  └──────────────────────────────────────────────────────────────────────────┘  │
        │           ↕ VirtioSocket (RPC)                                                 │
        │      ClaudeVMDaemonRPCClient.swift                                             │
        │           ↕                                                                    │
        │      Node.js IPC layer                                                         │
        └─────────────────────────────────────────────────────────────────────────────────┘

VM Specifications (from inside)

Storage Layout

Filesystem Mounts (User Perspective)

        /sessions/gallant-vigilant-lamport/
        ├── mnt/
        │   ├── claude-cowork/     → Your selected folder (virtiofs + bindfs)
        │   ├── .claude/           → ~/.claude config (bindfs, rw)
        │   ├── .skills/           → Skills/plugins (bindfs, ro)
        │   └── uploads/           → Uploaded files (bindfs)
        └── tmp/                   → Session temp files
        
        Session User

        A dedicated user is created per session with a Docker-style random name:

        User: gallant-vigilant-lamport
        UID:  1001
        Home: /sessions/gallant-vigilant-lamport

        Process Tree

        PID 1: bwrap (bubblewrap sandbox)
        └── bash (shell wrapper)
            ├── socat TCP:3128 → unix socket (HTTP proxy)
            ├── socat TCP:1080 → unix socket (SOCKS proxy)
            └── /usr/local/bin/claude (Claude Code SDK)
                └── bash (tool execution shells)

        Security Layers

        - Apple Virtualization.framework - Hardware-level VM isolation
        - Bubblewrap (bwrap) - Linux container/sandbox
            --unshare-net - No direct network access
            --unshare-pid - Isolated PID namespace
            --ro-bind / / - Read-only root (with selective rw binds)
        - Seccomp - System call filtering (unix-block.bpf)
        - Network Isolation - All traffic via proxied unix sockets
        Network Architecture
        ┌─────────────────────────────────────────────────────────────┐
        │  Inside Sandbox                                             │
        │                                                             │
        │  claude process                                             │
        │      │                                                      │
        │      ↓ HTTP/HTTPS requests                                  │
        │  localhost:3128 (HTTP proxy via env vars)                   │
        │      │                                                      │
        │      ↓                                                      │
        │  socat → /tmp/claude-http-*.sock ─────────┐                │
        │                                            │                │
        │  localhost:1080 (SOCKS proxy)              │                │
        │      │                                     │                │
        │      ↓                                     │                │
        │  socat → /tmp/claude-socks-*.sock ────────┤                │
        └───────────────────────────────────────────┼────────────────┘
                                                    │
                                VirtioSocket ←──────┘
                                                    │
        ┌───────────────────────────────────────────┼────────────────┐
        │  Host (macOS)                             │                │
        │                                           ↓                │
        │                              Claude Desktop App            │
        │                                           │                │
        │                                           ↓                │
        │                                    Internet                │
        └─────────────────────────────────────────────────────────────┘
        Key insight: The VM has only a loopback interface (lo). No eth0, no bridge. All external network access is tunneled through unix sockets that cross the VM boundary via VirtioSocket.
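The bubblewrap flags listed under Security Layers can be assembled into an invocation roughly as follows. This is a minimal sketch, not the command Claude Desktop actually runs: the helper name `bwrap_command` and the writable-folder handling are assumptions, and the seccomp filter and socat proxies from the diagram are left out.

```python
def bwrap_command(writable_dirs: list[str], argv: list[str]) -> list[str]:
    """Build a bwrap command line: read-only root, selected rw binds, no network."""
    cmd = [
        "bwrap",
        "--ro-bind", "/", "/",   # whole filesystem visible but read-only
        "--dev", "/dev",         # minimal device nodes
        "--proc", "/proc",       # fresh /proc for the new PID namespace
        "--unshare-net",         # no network interfaces except loopback
        "--unshare-pid",         # isolated PID namespace
    ]
    # bwrap applies mounts in order, so these rw binds override the ro root.
    for directory in writable_dirs:
        cmd += ["--bind", directory, directory]
    return cmd + list(argv)
```

With `bwrap_command(["/sessions/demo/mnt/work"], ["/bin/bash"])` the shell can read the whole root filesystem but write only inside the one bound folder, which matches the "read-only root (with selective rw binds)" description.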


  Communication Flow

  From the logs and symbols:

  1. VM Start: Swift calls VZVirtualMachine.start() with EFI boot
  2. Guest Ready: VM guest connects (takes ~6 seconds)
  3. SDK Install: Copies /usr/local/bin/claude into VM
  4. Process Spawn: RPC call to spawn /usr/local/bin/claude with args

  The spawn command shows the actual invocation:
  /usr/local/bin/claude --output-format stream-json --verbose \
    --input-format stream-json --model claude-opus-4-5-20251101 \
    --permission-prompt-tool stdio --mcp-config {...}
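The socat proxies in the diagram forward a local TCP port to a unix socket. The same pattern can be sketched in Python with asyncio to show what that hop does; this is an illustration of the technique only, with hypothetical names, and it simplifies connection teardown (either side closing ends the whole connection).

```python
import asyncio


async def pipe(reader: asyncio.StreamReader, writer: asyncio.StreamWriter) -> None:
    """Copy bytes from reader to writer until EOF, then close the writer."""
    try:
        while data := await reader.read(65536):
            writer.write(data)
            await writer.drain()
    finally:
        writer.close()


async def serve_tcp_to_unix(port: int, unix_path: str, host: str = "127.0.0.1"):
    """Listen on host:port and relay each connection to the unix socket."""

    async def handle(client_reader, client_writer):
        upstream_reader, upstream_writer = await asyncio.open_unix_connection(unix_path)
        # Relay both directions concurrently.
        await asyncio.gather(
            pipe(client_reader, upstream_writer),
            pipe(upstream_reader, client_writer),
        )

    return await asyncio.start_server(handle, host, port)
```

Inside a sandbox like the one described, something equivalent would listen on port 3128 and point at the proxy socket, and the process would be told to use it through the usual proxy environment variables.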

jms7035mo ago

Terrible advice to users: be on the lookout for suspicious actions. Humans are terrible at this.

1 more reply

hypfer5mo ago· 9 in thread

People do realize that if they're doing this, they're not feeding "just" code into some probably logging cloud API but literally anything (including, as mentioned here, bank statements), right?

Right?

RIGHT??????

Are you sure that you need to grant the cloud full access to your desktop + all of its content to sort elements alphabetically?

jjcm5mo ago

Some do, some don't.

8 more replies

AstroBen5mo ago

When choosing between convenience and privacy, most people seem to choose convenience

2 more replies

motoboi5mo ago

I have my bank statements on a drive on a cloud. We are way past that phase.

3 more replies

waterTanuki5mo ago

fragmede5mo ago

But I don't want alphabetical. Alphabetical is just a known sort order so I can find the file I want. How about it sorts by "this is the file you're looking for"?

TIPSIO5mo ago

Have you ever used any Anthropic AI product? You literally cannot do anything without big permission prompts, warnings, or an annoying always-on popup warning you about safety.

2 more replies

m4635mo ago

     v-- click!
  [ACCEPT] [CANCEL]

hahahahhaah5mo ago

Ship has sailed. I have my deepest secrets in Gmail and Docs. We need big tech to make this as secure as possible from threats. Scammers and nations alike.

1899-12-305mo ago

I pray for whoever has to review the slop I've generated.

1f60c5mo ago· 5 in thread

h4ch15mo ago

hope u used these. can drastically reduce the 11mb to a couple of hundred kilobytes.

https://github.com/thameera/harcleaner and https://har-sanitizer.pages.dev/

lelandfe5mo ago

A more easy reproduction: disable JS.

To bypass: `.transition_wrap { display: none }`

_giorgio_5mo ago

On Android, it doesn't work in any of these: Firefox, Chrome, Firefox Focus :-(

Thanks anthropic

doesn't work.

1 more reply

motoboi5mo ago

You could have made it much simpler using Playwright MCP.

worldsavior5mo ago

You could figure it out yourself under 5 mins. Nothing crazy here.

cc62cf4a4f205mo ago· 5 in thread

simonw5mo ago

I don't think that would happen, but I can't in good faith say to anyone else "that's not going to happen".

4 more replies

TeMPOraL5mo ago

> I mean, we all know that they're only doing this to build a training data set

That's not a problem. It leads to better models.

> to put your business out of business and capture all the value for themselves, right?

falloutx5mo ago

2 more replies

bearjaws5mo ago

This is the AI-era equivalent of "I can't share my ideas because you will steal them".

Reality is good ideas and a few SOPs do not make a successful business.

eZinc5mo ago

It's either that, or you are 100X slower for not using Claude Code. The manpower per hour savings are most likely more worth it than protecting some inputs.

You could also always run a local LLM like GLM for sensitive documents or information on a separate computer, and never expose that to third party LLMs.

falloutx5mo ago· 5 in thread

Can humans do nothing now? Is it harder to organise your desktop? I thought Apple already organises them into stacks. (edit: Apple already does this)

Is it that hard to check your calendar? It also feels insincere to hold a meeting of, say, 30 mins to show a Claude-made deck that you produced in 4 seconds.

cwoolfe5mo ago

Agree. Seems to me that if you need something like this to automate your workflow, it's your workflow that needs to change.

xlbuttplug25mo ago

You can still do all these things manually. Now you just have the option not to.

1 more reply

hk__25mo ago

loloquwowndueo5mo ago

It’s not that insincere if all the other attendees are just meeting-taking robots the end result of which will be an automated “summary of the meeting I attended for you” :)

How many people join meetings these days just to zone out and wait for the AI-produced summary at the end?

1 more reply

anthonypasq5mo ago

1 more reply

ossa-ma5mo ago· 4 in thread

Every startup is at the mercy of the big 3 (OpenAI, Anthropic, Google).

They can and most likely will release something that vaporises the thin moat you have built around their product.

This feels like the first time in tech where there are more startups/products being subsumed (agar.io style) than being created.

xlbuttplug25mo ago

> They can and most likely will release something that vaporises the thin moat you have built around their product.

As they should if they're doing most of the heavy lifting.

And it's not just LLM adjacent startups at risk. LLMs have enabled any random person with a claude code subscription to pole vault over your drying up moat over the course of a weekend.

1 more reply

dcchambers5mo ago

Best defense is to basically stay small/niche enough that the big guys don't think your work is worth consuming/competing with directly.

There will always be a market for dedicated tools that do really specific things REALLY well.

1 more reply

aroman5mo ago

Gijs4g5mo ago

When they go wide, you go deep

exitb5mo ago· 4 in thread

It’s kind of funny that apparently most of work that’s left after you automated software development is summarizing meetings and building slide decks.

sensanaty5mo ago

Hey, don't forget booking your flights! Because everyone who has ever flown knows it's very safe to let an RNG machine book something like a flight for you!

falloutx5mo ago

Now they can start saying 90% of the meetings will be done by Claude agents by 2027 (And we will all get free puppies)

1 more reply

ai-christianson5mo ago

Then there's the shuffling around of atoms.

riku_iki5mo ago

> you automated software development

very far from being true

Flux1595mo ago· 4 in thread

ElatedOwl5mo ago

I keep seeing “Claude image understanding is poor” being repeated, but I’ve experienced the opposite.

1 more reply

EMM_3865mo ago

> Claude is definitely not taking screenshots of that desktop & organizing, it's using normal file management cli tools

Are you sure about that?

Try "claude --chrome" with the CLI tool and watch what it does in the web browser.

It takes screenshots all the time to feed back into the multimodal vision and help it navigate.

It can look at the HTML or the JavaScript but Claude seems to find it "easier" to take a screenshot to find out what exactly is on the screen. Not parse the DOM.

So I don't know how Cowork does this, but there is no reason it couldn't be doing the same thing.

1 more reply

oracleclyde5mo ago

1 more reply

minimaxir5mo ago

The issue is that Claude Code won't automatically Read images by default as a part of its flow: you have to very explicitly prompt it to do so. I suspect a Skill may be more useful here.

1 more reply

majormajor5mo ago· 3 in thread

The hero image with a set of steps:

1) Read meeting transcripts 2) Pull out key points 3) Find action items 4) Check Google Calendar 5) Build standup deck

feels like "how to put yourself out of a job 101."

It's interesting to see the marketing material be so straightforward about that.

sepositus5mo ago

But it immediately forgets the results of step 1 by the time it hits step 3 (due to context rot) and starts inventing action items.

catoc5mo ago

I know managers think this is all there is to “work”, but at some point someone needs to do those action items.

1 more reply

comp35mo ago

Lmao it's actually cute watching Anthropic and its employees desperately trying to find a way to stuff this into people's lives - the reality is most people don't give a hoot about this stuff.

Also the use case of organising a desktop rocked me off my chair. LMAO!

samiv5mo ago· 2 in thread

Do the people rushing off to outsource their work to chatbots have a plan to explain to their bosses why they still need to have a job?

What's the play after you have automated yourselves out of a job?

Davidzheng5mo ago

Many people will have to ask themselves these questions soon regardless of their actions. I don't understand the critique here.

1 more reply

delegate5mo ago

I wonder who the managers are going to manage..

alexdobrenko5mo ago· 2 in thread

I've been using Claude Code in my terminal like a feral animal for months. Building weird stuff. Breaking things. Figuring it out as I go.

Cowork is the nice version. The "here's a safe folder for Claude to play in" version. Which is great! Genuinely. More people should try this.

But!!! The terminal lets you do more. It always will. That's just how it works.

And when Cowork catches up, you'll want to go further. The gap doesn't close. It just moves.

All of this, though, is good? I think??

energy1235mo ago

Isn't this like the "but rsync" comments on Dropbox launch? The vast majority of the addressable market doesn't know what a terminal is.

akurilin5mo ago

theturtletalks5mo ago· 2 in thread

Isn't this just a UI over Claude Code? For most people, using the terminal means you could switch to many different coding CLIs and not be locked into just Claude.

basket_horse5mo ago

> For most people

Most people have no idea what a terminal is.

1 more reply

JLO645mo ago

Imnimo5mo ago· 1 in thread

>By default, the main thing to know is that Claude can take potentially destructive actions (such as deleting local files) if it’s instructed to.

What do the words "if it's instructed to" mean here? It seems like Claude can in fact delete files whenever it wants regardless of instruction.

For example, in the video demonstration, they ask "Please help me organize my desktop", and Claude decides to delete files.

olliepro5mo ago

I believe the idea is that it “files away” the files into folders.

d4rkp4ttern5mo ago· 1 in thread

A CLI chat interface seems ideal for when you keep code "at a distance", i.e. if you hardly/infrequently/never want to peek at your code.

But for writing prose, I don't think chat-to-prose is ideal, i.e. most people would not want to keep prose "at a distance".

wek5mo ago

1 more reply

forty5mo ago· 1 in thread

I cannot see this page, I'm redirected to https://claude.com/fr-fr/blog/cowork-research-preview which doesn't exist. Private tab doesn't help

sunaookami5mo ago

Same for me but with my language. US defaultism strikes again ;) https://archive.ph/dIVPO here is an archive link that works

tacoooooooo5mo ago· 1 in thread

ai-christianson5mo ago

tolerance5mo ago· 1 in thread

This is the sort of stuff Apple should’ve been trying to figure out instead of messing with app corners and springboards.

elpakal5mo ago

But they created GenMoji?!

simonw5mo ago· 1 in thread

https://simonwillison.net/2026/Jan/12/claude-cowork/

redfloatplane5mo ago

1 more reply

_pdp_5mo ago· 1 in thread

Yah I wouldn't.

In my opinion, these things are better run in the cloud to ensure you have a properly sandboxed, recoverable environment.

At this point, I am convinced that almost anyone relying heavily on a desktop chat application has far too many credentials scattered on the file system ready to be grabbed and exploited.

nxobject5mo ago

I wonder if this is what makes immutable package/installation management finally take off...

mceachen5mo ago· 1 in thread

YMMV but TFA page content body didn’t render for me until I disabled my local pihole.

janwillemb5mo ago

Firefox reader mode also helps

redactsureAI5mo ago· 1 in thread

A lot of people here are discussing the security challenges here. If you're interested I'm working on a novel solution to the security of these systems.

Redactsure.com

From this feed I figured I'd plug my system, would love your feedback! I believe we are building out a real solution to these security and privacy concerns.

While the entire field is early I do believe systems like my own and others will make these products safe and reliable in the near future.

philipwhiuk5mo ago

> Basic ideas are minimal privilege per task in a minimal and contained environment for everything and heavy control over all actions AI is performing.

The challenge is that no application on desktop is built around these privileges so there's no grant workflow.

Are you bytecode analysing the kernel syscalls an app makes before it runs? Or will it just panic-die when you deny one?

1 more reply

bahmboo5mo ago· 1 in thread

HarHarVeryFunny5mo ago

1 more reply

mrcwinn5mo ago· 1 in thread

This product barely works. It can't connect to the browser extension and when I share folders for it to access, nothing happens. I love early previews but maybe one more week?

arthurcolle5mo ago

works fine for me, what's the matter?

redfloatplane5mo ago

Agents for other people, this makes a ton of sense. Probably 30% of the time I use claude code in the terminal it's not actually to write any code.

flyingzucchini5mo ago

For $200 month I’ll arrange my own desktop icons thanks. (Isn’t there a more compelling use case?)

jfletch3215mo ago

cwoolfe5mo ago

steipete5mo ago

Funny timing. Written in 10 days just when this took off. https://clawd.bot/

jameslk5mo ago

1 more reply

btown5mo ago

For those worried about irrevocable changes, sometimes a good plan is all the output.

Claude Code is very good at `doc = f(doc, incremental_input)` where doc is a code file. It's no different if doc is a _prompt file_ designed to encapsulate best practices.

Then, unlike many opaque "knowledge-base AI" products, you can inspect exactly how over-fitted those instructions are, and ask it to iterate.
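
A minimal sketch of that `doc = f(doc, incremental_input)` loop, in Python. The `f` here is a hypothetical stand-in that only appends a deduplicated bullet; in a real setup the update step would be a Claude Code run rewriting the prompt file, which this does not try to model. The point is just that the doc stays a plain, inspectable string after every step.

```python
from functools import reduce


def f(doc: str, incremental_input: str) -> str:
    """Hypothetical update step: fold one new note into the prompt file.

    A real setup would ask the model to rewrite the doc; this stand-in only
    appends the note as a bullet unless an identical bullet already exists.
    """
    rule = f"- {incremental_input.strip()}"
    lines = doc.splitlines()
    if rule in lines:
        return doc
    return "\n".join(lines + [rule])


def fold(doc: str, inputs: list[str]) -> str:
    """Apply doc = f(doc, x) for each input, in order."""
    return reduce(f, inputs, doc)
```

Because each step returns the whole doc, you can diff it between iterations to see how much one input changed the instructions.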

Wowfunhappy5mo ago

Under the hood, is this running shell commands (or Apple events) or is it actually clicking around in the UI?

If the latter, I'm a bit skeptical, as I haven't had great success with Claude's visual recognition. It regularly tells me there's nothing wrong with completely broken screenshots.

lossolo5mo ago

I would like to thank the 100,000 people in Madagascar[1] who made it all possible by creating training data for ~€0.30 per hour.

1. https://www.youtube.com/watch?v=Q7NZK6h9Tvo

appsoftware5mo ago

jdeng5mo ago

Exciting to see Anthropic validate the "AI coworker" direction. We're building VITA AI (https://vita-ai.net) with similar philosophy but for enterprise QA testing.

One key architectural difference: Cowork runs sandboxed VMs on your local macOS machine, but we run sandboxes entirely in the cloud. This means:

- True isolation - agents never touch your local files or network, addressing the security concerns raised in this thread

- Actual autonomy - close your laptop, agent keeps working. Like delegating to a real coworker, not pairing with an assistant

- Scale - spin up 10 test agents without melting your CPU

Different use cases, different architectures. But the broader trend feels right - moving from conversational assistants to autonomous agents that operate independently.

krm015mo ago

Claude Cleaner, I mean Cowork will be sweeping my desktop every Friday.

I'm sure it'll be useful for more stuff but man…

hmokiguess5mo ago

This seems like a thin client UX running Claude Code for the less technical user.

fennecfoxy5mo ago

Hmm. I'm building something (quick and dirty) at the moment that looks at analysing customer service data.

arjie5mo ago

This sounds really interesting. Perhaps this is the promise that Copilot was not. I'm really hoping that this gives people like my wife access to all the things I use Claude Code for.

I use Claude Code for everything. I have a short script in ~/bin/ called ,cc that I launch that starts it in an appropriate folder with permissions and contexts set up:

      ~ tree ~/claude-workspaces -d
    /Users/george/claude-workspaces
    ├── context-creator
    ├── imessage
    │   └── tmp
    │       └── contacts-lookup
    ├── modeler
    ├── research
    ├── video
    └── wiki

monarchwadia5mo ago

This is a great idea! I'm building something very similar with https://practicalkit.com , which is the same concept done differently.

I'm currently rolling out my alpha, and am looking for investment & partners.

mintflow5mo ago

I like this idea but really do not want to share my personal data to cloud based LLM vendors.

break_the_bank5mo ago

We’re building something very similar but with files in the cloud instead.

Try it https://tabtabtab.ai

Would love some feedback!

tinyhouse5mo ago

slimebot805mo ago

"Claude can’t read or edit anything you don’t give it explicit access to"

How confident are we that this is a strict measure?

Unless there is an OS-level restriction they are adhering to?
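
For a sense of what an application-level version of "explicit access" could look like, here is a rough Python sketch. It is not how Cowork is implemented (the function name and the granted-folder list are made up for illustration), and a check like this only holds if every file operation goes through it, which is why an OS- or VM-level restriction is the stronger guarantee.

```python
from pathlib import Path


def is_allowed(target: str, granted_dirs: list[str]) -> bool:
    """Return True only if target resolves inside one of the granted folders.

    resolve() collapses ".." segments and follows symlinks, so a path such as
    granted/../secret is checked against where it really points.
    """
    resolved = Path(target).resolve()
    for granted in granted_dirs:
        root = Path(granted).resolve()
        if resolved == root or root in resolved.parents:
            return True
    return False
```

For example, `is_allowed("/tmp/granted/../secret.txt", ["/tmp/granted"])` is rejected because the path resolves outside the granted folder.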

jpcompartir5mo ago

It's a very powerful way to work on all kinds of things. V. interested to try co-work when it drops to Plus subscribers.

philip12095mo ago

This is cool, but Claude for Chrome seems broken - authentication doesn't work and there's a slew of recent reviews on the Chrome extension mentioning it.

Sharing here in case anybody from Anthropic sees and can help get this working again.

It may seem off-topic, but I think it hurts developer trust to launch new apps while old ones are busted.

codebyaditya5mo ago

spm10015mo ago

Is it possible this gets access to a faster API tier?

Olshansky5mo ago

This is great, but it saddens me that this is still just the average total compensation of a single engineer at Anthropic.

Unsure what the future looks like unless Frontier Labs start financing everything that is open source.

sergiotapia5mo ago

Can it use the browser or the machine like a human? Meaning I can ask it to find a toaster on http://Target.com and it'll open my browser and try it?

kingkongjaffa5mo ago

When I need to create something like a powerpoint or whatever I use claude code and invoke a claude skill that knows how to do it. Why would I use claude cowork instead of that?

sbinnee5mo ago

A week ago I pitched to my managers that this form of general purpose claude code will come out soon. They were rather skeptical saying that claude code is just for developers. Now they can see.

ambicapter5mo ago

This is interesting because in the other thread about Anthropic/Claude Code, people are arguing that Anthropic is right to focus on what CC is good at (writing code).

system25mo ago

I use Claude 8+ hours per day. But this is probably the scariest use I can think of. An agent running with full privileges with no restriction. What can go wrong?

thiagowfx5mo ago

Since it is an agent, I wonder why they didn’t go with “Claude Coworker” instead.

On the other hand, it’s not “Claude Coder”, then it’s at least consistent.

lasgawe5mo ago

This comes with thousands of unknown attacks. When these kinds of features are introduced, we have to find ways to bypass them.

rao-v5mo ago

Cowork + litellm proxy + a local vision LLM should work incredibly well for overnight tasks like organizing md files, photos, etc.

kewun5mo ago

I tried it out and it couldn't help me unsubscribe from spam/newsletter as it couldn't click the unsubscribe button.

StarterPro5mo ago

Damn, yall can't do anything by yourselves.

tolodot5mo ago

Unless this works almost exactly like Claude Code (minus GitHub) it will end up subtracting a lot of what makes cc so powerful.

rshanreddy5mo ago

Have still not been able to get a query to work. "Sending request" or other errors at every turn.

sparkalpha5mo ago

Tried Claude Cowork and Chatlily. Interesting idea, but Claude still feels stronger for my use cases.

650REDHAIR5mo ago

I tried to get Claude to build me a spreadsheet last night. I was explicit in that I wanted an excel file.

It’s made one in the past for me with some errors, but a framework I could work with.

It created an “interactive artifact” that wouldn’t work in the browser or their apps. Gaslit me for 3 revisions of me asking why it wasn’t working.

Created a text file that it wanted me to save as a .csv to import into excel that failed hilariously.

When I asked it to convert the csv to an excel file it apologized and told me it was ready. No file to download.

I asked where the file was and it apologized again and told me it couldn’t actually do spreadsheets and at that point I was out of paid credits for 4 more hours.

WesleyLivesay5mo ago

Bringing that type of functionality to a wider audience and out of the CLI could be really cool!

catoc5mo ago

If you don’t mind the terminal, what is the benefit of Cowork over Code? The sandboxing?

insanebrain5mo ago

This is like asking a hallucinating robot to paint your house using a sledgehammer

sharyphil5mo ago

This is incredible. Waiting for the rollout on other platforms. I really need it.

cm20125mo ago

Nothing important is in my file system, it's all in Google Drive, Gmail, and Slack.

melonpan75mo ago

Personally I've only ever used Claude Code for coding.

imagetic5mo ago

I see the sales people completed their takeover...

Jamie4525mo ago

Is claude down? I can't create a new chat.

1 more reply

fluidcruft5mo ago

I mean this as genuinely non-snarkily as possible: I have been literally building my own personal productivity and workflow tools that could do things as shown.

Is this now a violation of the Claude terms of service that can get me banned from claude-code for me to continue work on these things?

brunoborges5mo ago

Anthropic: we will do the Code button first, then we implement Non-Code button.

OpenAI: we will do the Non-Code button first, then we implement the Code button.

1 more reply

cryptoegorophy5mo ago

It seems very similar to cursor AI?

pentagrama5mo ago

I think the next step for these big AI companies will be to launch their own operating systems, probably Linux distributions.

berryg5mo ago

I cannot read the pages on the Claude website. I am using pi-hole and that causes the text not to render. Annoying.

nunez5mo ago

yeah, you shouldn't need to create a deck for a standup...

otherwise, looks interesting.

m4ck_5mo ago

can it play games for me? the factory must grow but I also need to cook dinner.

basedrum5mo ago

Can't load page contents

jeisc5mo ago

everybody knows that the only secure computer is one which is unplugged

focusgroup05mo ago

The Death of The Email Job

1 more reply

scottLobster5mo ago

Particularly in a work environment, one misfire could destroy months or years of important information.

1 more reply

FatherOfCurses5mo ago

Cowork: the 2026 version of training your offshore replacement.

daft_pink5mo ago

Now if there was just an easy and efficient way to drop a bunch of files into a directory.

goaaron5mo ago

Claude what's happening tomorrow ahghhg!!! hate this lol

zurfer5mo ago

It's not perfect perfect, but I'm having fun and I know I'm getting a lot of things done that I would not have dared to try previously.

4 more replies
