Claude Code 2.0 (opens in new tab)

(npmjs.com)

842 pointspolyrand9mo ago413 comments

413 comments

235 comments · 57 top-level

simonw9mo ago· 56 in thread

Something I realized about this category of tool (I call them "terminal agents" but that already doesn't work now there's an official VS Code extension for this - maybe just "coding agents" instead) is that they're actually an interesting form of general agent.

Claude Code, Codex CLI etc can effectively do anything that a human could do by typing commands into a computer.

They're incredibly dangerous to use if you don't know how to isolate them in a safe container but wow the stuff you can do with them is fascinating.

pmarreck9mo ago

I too am amazed. Real-world example from last week:

After using gpt5-codex inside codex-cli to produce this fork of DOSBox (https://github.com/pmarreck/dosbox-staging-ANSI-server) that adds a little telnet server that allows me to screen-scrape VGA textmode data and issue virtual keystrokes (so, full roundtrip scripting, which I ended up needing for a side project to solve a Y2K+25 bug in a DOS app still in production use... yes, these still exist!) via 4000+ lines of C++ (I took exactly one class in C++), and it passes all tests and is non-blocking, I was able to turn around and (within the very same session!) have it help me price it to the client with full justification as well as a history of previous attempts to solve the problem (all of which took my billable time, of course), and since it had the full work history both in Git as well as in its conversation history, it was able to help me generate a killer invoice.

So (if all goes well) I may be getting $20k out of this one, thanks to its help.

Does the C++ code it made pass the muster of an experienced C++ dev? Probably not (would be happy to accept criticisms, lol, although I think I need to dress up the PR a bit more first), but it does satisfy the conditions of 1) builds, 2) passes all its own tests as well as DOSBox's, 3) is nonblocking (commands to it enter a queue and are processed one set of instructions at a time per tick), 4) works as well as I need it to for the main project. This still leaves it suitable for one-off tasks, of which there is a ton of need for.

This is a superpower in the right hands.

saberience9mo ago

Incredibly dangerous to use? Seems like a wild exaggeration.

I’ve been using Claude code since launch, must have used it for 1000 hours or more by now, and it’s never done anything I didn’t want it to do.

Why would I run it in a sandbox? It writes code for me and occasionally runs a build and tests.

I’m not sure why you’re so fixated on the “danger”, when you use these things all the time you end up realizing that the safety aspect is really nowhere near as bad as the “AI doomers” seem to make out.

simonw9mo ago

You've been safe since launch because you haven't faced an adversarial prompt injection attack yet.

You (and many, many others) likely won't take this threat seriously until adversarial attacks become common. Right now, outside of security researcher proof of concepts, they're still vanishingly rare.

You ask why I'm obsessed with the danger? That's because I've been tracking prompt injection - and our total failure to find a robust solution for it - for three years now. I coined the name for it!

The only robust solution for it that I trust is effective sandboxing.

6 more replies

guhcampos9mo ago

It is dangerous.

Just yesterday my cursor agent made some changes to a live kubernetes cluster even over my specific instruction not to. I gave it kubectl to analyze and find the issues with a large Prometheud + AlertManager configuration, then switched windows to work on something else.

When I was back the MF was patching live resources to try and diagnose the issue.

4 more replies

geeunits9mo ago

Because it grabs the headlines and upvotes more. It is becoming quite the bore to read as it offers nothing new, or an accurate representation of the facts. Thanks for calling it out. Same experience regarding thousands of hours of usage since launch, tested from sandboxed docker to take over an entire macbook air and here's an ssh login to a dev server whilst you're at it. I spot check with audits every other day and only wish for more autonomy with the agents, over less.

DowsingSpoon9mo ago

Just two days ago, I asked Claude Code (running as a restricted non-admin user) to generate a unit test. I didn’t look too closely at exactly what it wrote before it ran it for me. Unbounded memory use locked the system up so hard it stopped responding to all user input. After a few minutes, the machine restarted automatically. Oof.

edude039mo ago

Feels incredibly dismissive, if you look outside your own bubble for sec, there are people who've had CC drop their prod databases, delete their home folders, uninstall system dependencies etc etc.

And yes, these are all "skill issues" - as in, if they had known better this wouldn't have happened to them, however I think it's fair to call these possibilities out to counter balance the AI is amazing and everyone should use it for everything type narratives as to instil at least a little caution.

dangoodmanUT9mo ago

have you not seen the screenshots of claude asking permission to delete ~/, because some geniuses decided to make {repo}/~ a folder in cloudflare worker/cursor folders?

vessenes9mo ago

The original opus/sonnet 4 safety card mentioned that it would hand write emails to the fbi turning in a user if it thought they were doing something really bad. It has examples of the “snitch” emails.

I too use it extensively. But they’re very, very capable models, and the command line contains a bunch of ways to exfiltrate data off your system if it wants to.

1 more reply

victorbjorklund9mo ago

It is risky. Just like copy-pasting scripts from the internet is. I have done both and nothing bad ever happened (that I know about). But it does happen. The risk of running code/commands on your computer that you have not checked before is not zero.

johanneskanybal8mo ago

So far it's screwed up my wifi and directed me through malicious link's I've blindly followed even if I take full responsibility ofc. And that's from less than 80h usage just on my home computer.

raincole9mo ago

It's as dangerous as copying & pasting command line script from StackOverflow at the end of a 14-hour workday.

i.e. quite dangerous, but people do it anyway

coldtea9mo ago

>I’ve been using Claude code since launch, must have used it for 1000 hours or more by now, and it’s never done anything I didn’t want it to do.

You know what neighbors of serial killers say to the news cameras right?

"He was always so quiet and polite. Never caused any issues"

athrowaway3z9mo ago

They're only as dangerous as the capabilities you give them. I just created a `codex` and `claude` user on my Linux box and practically always run in yolo mode. I've not had a problem so far.

Also, I think shellagent sounds cooler.

simonw9mo ago

That's a great way to run this stuff.

I expect the portion of Claude Code users who have a dedicated user setup like this is pretty tiny!

1 more reply

globular-toast9mo ago

I tried this but it's incredibly annoying as you'll get a mixture of file ownerships and permissions.

Instead I run it in bubblewrap sandbox: https://blog.gpkb.org/posts/ai-agent-sandbox/

3 more replies

tuyiown9mo ago

> They're only as dangerous as the capabilities you give them.

As long as the supply chain is safe and the data it accesses does not generate some kind of jail break.

It does read instructions from files on the file system, I pretty sure it's not complex to have it poison its prompt and make it suggest to build a program infected with malicious intent. It's just one copy pasta away from a prompt suggestion found on the internet.

data-ottawa9mo ago

I have run it in a podman container and I mount the project directory.

polyrandOP9mo ago

Instead of containers, which may not always be available, I'm experimenting with having control over the shell to whitelist the commands that the LLM can run [0]. Similar to an allow list, but configured outside the terminal agent. Also trying to make it easy to use the same technique in macOS and Linux

[0]: https://ricardoanderegg.com/posts/control-shell-permissions-...

jcgl9mo ago

Not specific to LLM stuff, but I've lately been using bubblewrap more and more to isolate bits of software that are somewhat more sketchy (NPM stuff, binaries downloaded from GitHub, honestly most things not distro-packaged). It was a little rocky start out with, but it is nice knowing that a random binary can't snoop on and exfiltrate e.g. my shell history.

1 more reply

philipp-gayret9mo ago

I really like this and we're doing a similar approach but instead using Claude Code hooks. What's really nice about this style of whitelisting is that you can provide context on what to do instead; Let's say if `terraform apply` is banned, you can tell it why and instruct it to only do `terraform plan`. Has been working amazing for me.

1 more reply

khafra9mo ago

An interesting exercise would be to let a friend into this restricted shell, with a prize for breaking out and running rm -rf / --no-preserve-root. Then you know to switch to something higher-security once LLM capabilities reach the level of that friend.

user39393829mo ago

You have to put them in the same ACL, chroot, whatever permission context for authorization you’d apply to any other user human or otherwise. For some resources it’s cumbersome to setup but anything else is a hope and a prayer.

_heimdall9mo ago

This is how I've been using Gemini CLI. It has no permissions by default, whether it wants to search google, run tests, or update a markdown file it has to propose exactly what it needs to do next and I approve it. Often its helpful even just to redirect the LLM, if it starts going down the wrong path I catch it early rather than 20 steps down that road.

I have no way of really guaranteeing that it will do exactly what it proposed and nothing more, but so far I haven't seen it deviate from a command I approved.

hboon9mo ago

I didn’t check, but sometimes Claude Code writes scripts and run them (their decision); does your approach guard against that?

1 more reply

ehnto9mo ago

It's broad utility was immediately clear as soon as I saw it formulating bash commands.

I've used it to troubleshoot some issues on my linux install, but it's also why the folder sandbox gives me zero confidence that it can't still brick my machine. It will happily run system wide commands like package managers, install and uninstall services, it even deleted my whole .config folder for pulseaudio.

Of course I let it do all these things, briefly inspecting each command, but hopefully everyone is aware that there is no real sandbox if you are running claude code in your terminal. It only blocks some of the tool usages it has, but as soon as it's using bash it can do whatever it wants.

pancakemouse9mo ago

Something I've seen discussed very little is that Claude Code can be opened in a directory tree of any type of document you like (reports, spreadsheets, designs, papers, research, ...) and you can play around in all sorts of ways. Anthropic themselves hint at this by saying their whole organisation uses it, but the `Code` moniker is probably limiting adoption. They could release a generalised agent with a friendlier UI tomorrow and get much wider workplace adoption.

withinboredom9mo ago

I have it master my music. I drop all the stems in a folder, tell it what I want, and off it goes to write a python script specifically for the album. It’s way better than doing it in the DAW, which usually takes me hours (or days in some cases). It can get it to 90% in minutes, only requiring some fine-tuning at the end.

1 more reply

tkgally9mo ago

That’s how I use it. I’m not a developer, and using Claude Code with Git turned out to be more complicated than I wanted. Now I just give it access to a folder on my Mac, put my prompt and any associated files in that folder, and have it work there. It works fine for my needs.

I would like a friendlier interface than the terminal, though. It looks like the “Imagine with Claude” experiment they announced today is a step in that direction. I’m sure many other companies are working on similar products.

matlock9mo ago

Over the weekend I had it extract and Analyse Little but Fierce, a simplified and kid friendly DnD 5e and extract markdown files that help me DMing for my kids. Then it Analyse No, thank you evil as I want to base the setting on it but with LBF rules. And then have the markdown turn into nice looking pdfs. Claude code is so much more than coding and it’s amazing.

clbrmbr9mo ago

Indeed. I’m having success using it as a tool for requirements querying. (When a sales person asks “does product A have feature X” I can just ask Claude because I’ve got all the requirements in markdown files.

willio589mo ago

One thing I really like using them for is refactoring/reorganizing. The tedious nature of renaming, renaming all implementations, moving files around, creating/deleting folders, updating imports exports, all melts away when you task an agent with it. Of course this assumes they are good enough to do them with quality, which is like 75% of the time for me so far.

dgunay9mo ago

I've found that it can be hard or expensive for the agent to do "big but simple" refactors in some cases. For example, I recently tasked one with updating all our old APIs to accept a more strongly typed user ID instead of a generic UUID type. No business logic changes, just change the type of the parameter, and in some cases be wary of misleading argument names by lazy devs copy pasting code. This ended up burning through the entire context window of GPT-5-codex and cost the most $ of anything I've ever tasked an agent with.

1 more reply

singularity20019mo ago

does it use the smart refractoring hooks of the IDEs or does it do blunt text replacement

3 more replies

golergka9mo ago

Especially when you work with a language where an unfinished refactoring with give you the type error.

bhl9mo ago

Cursor will pivot to a computer use company.

The gap between coding agents in your terminal and computer agents that work on your entire operating system is just too narrow and will be crossed over quick.

teaearlgraycold9mo ago

Once this tech is eliminating jobs on a massive scale I'll believe the AI hype. Not to say that couldn't be right around the corner - I have no clue. But being able to perform even just data entry tasks with better-than-human accuracy would be a huge deal.

1 more reply

simonw9mo ago

I just published a related piece to this idea, on "Designing agentic loops" as a key skill you need to solve problems using these new coding agent tools: https://simonwillison.net/2025/Sep/30/designing-agentic-loop...

ACCount379mo ago

Back in 2022, when ChatGPT was new, quite a few people were saying "LLMs are inherently safe because they can't do anything other than write text". Some must have even believed what they were saying.

Clearly not. Just put an LLM into some basic scaffolding and you get an agent. And as capabilities of those AI agents grow, so would the degree of autonomy people tend to give them.

IMTDb9mo ago

> LLMs are inherently safe because they can't do anything other than write text

That is still very much the case; the danger comes from what you do from the text that is generated.

Put a developer in a meeting room and no computer access, no internet etc; and let him scream instructions through the window. If he screams "delete prod DB", what do you do ? If you end up having to restore a backup that's on you, but the dude inherently didn't do anything remotely dangerous.

The problem is that the scaffolding people put around LLM is very weak, the equivalent of saying "just do to everything the dude is telling, no question asked, no double check in between, no logging, no backups". There's a reason our industry has development policies, 4 eyes principles, ISO/SOC standards. There already are ways to massively improve the safety of code agents; just put Claude code in a BSD jail and you already have a much safer environment than what 99% of people are doing, this is not that tedious to make. Other safer execution environments (command whitelisting, arguments judging, ...) will be developed soon enough.

1 more reply

redhale8mo ago

Totally agree. The CLI agents (or whatever) lower the barrier of entry to building a custom agent all the way down to basically just writing markdown.

Excellent article in this vein: https://jxnl.co/writing/2025/09/04/context-engineering-rapid...

ozgung9mo ago

> Claude Code, Codex CLI etc can effectively do anything that a human could do by typing commands into a computer.

One criticism on current generation of AI is that they have no real world experience. Well, they have enormous amount of digital world experience. That, actually, has more economical value.

ACCount379mo ago

They have a lot of secondhand knowledge and very little firsthand knowledge. RLVR works so well because it's a way to give LLMs some of the latter.

brookst9mo ago

Dangerous how? Claude code literally asks before running any command.

I suppose they’re dangerous in the same way any terminal shell is dangerous, but it seems a bit of a moral panic. All tools can be dangerous if misused.

simonw9mo ago

Many people (myself included) run them in YOLO mode with approvals turned off, because it's massively more productive. And that's despite me understanding how unsafe that is more than most!

Even with approvals humans will fall victim to dialog fatigue, where they'll click approve on everything without reading it too closely.

3 more replies

budududuroiu9mo ago

I’m experimenting with Nix shells for this tool isolation and whitelisting

nextaccountic9mo ago

That's not enough for security. Morally it should be - there's no reason we shouldn't be able to run untrusted software easily - but it won't have a firewall for example

Maybe something like bubblewrap could help

monkeydust9mo ago

Been starting to wonder if this marks a step change in UX - moving away from pretty well designed screens where designers labor over positioning of artifacts like buttons, user input dialogs and color palettes to a CLI! I cant imagine CLI will work for everything but for a lot of things, when powered by LLM they are incredible and yea equally dangerous at the same time for many reasons.

visarga9mo ago

> Claude Code, Codex CLI etc can effectively do anything that a human could do by typing commands into a computer.

They still don't have good integration with the web browser, if you are debugging frontend you need to carry screenshots manually, it cannot inspect the DOM, run snippets of code in the console, etc.

simonw9mo ago

You can tell them to take screenshots using Playwright and they will. They can also use Playwright to inspect the console and manipulate the DOM.

I've seen Codex CLI install Playwright Python when I asked it to do this and it found it wasn't yet available in the environment.

nicewood9mo ago

True. Although worth mentioning that there is tooling and (e.g. Playwright) MCPs around this. But definitely not integrated well enough!

gazpachotron9mo ago

I'd recommend the chrome-devtools-mcp for that: https://github.com/ChromeDevTools/chrome-devtools-mcp/

It's pretty new, but so far it's been a lifesaver.

Rutledge9mo ago

I call them 'CLI agents'!

hmokiguess9mo ago

Obligatory mention: https://xkcd.com/2044/

resters9mo ago

> They're incredibly dangerous to use if you don't know how to isolate them in a safe container but wow the stuff you can do with them is fascinating.

True but all it will take is one report of something bad/dangerous actually happening and everyone will suddenly get extremely paranoid and start using correct security practices. Most of the "evidence" of AI misalignment seems more like bad prompt design or misunderstanding of how to use tools correctly.

igor479mo ago

This seems unlikely. We've had decades of horrible security issues, and most people have not gotten paranoid. In fact, after countless data leaks, crypto miner schemes, ransomware, and massive global outages, now people are running LLM bots with the full permission of their user and no guardrails and bragging about it on social media.

marckrn9mo ago· 17 in thread

You can find the revamped prompt on github[1], or on twitter summarized by my bot[2].

[1] https://github.com/marckrenn/cc-mvp-prompts/compare/v1.0.128...

[2] https://x.com/CCpromptChanges/status/1972709093874757976

kelnos9mo ago

> IMPORTANT: DO NOT ADD *ANY** COMMENTS unless asked*

Interesting. This was in the old 1.x prompt, removed for 2.0. But CC would pretty much always add comments in 1.x, something I would never request, and would often have to tell it to stop doing (and it would still do it sometimes even after being told to stop).

epiccoleman9mo ago

I can't decide if I like this change or not, tbh. I almost always delete the comments Claude adds, to be sure - but at the same time they seem to provide a sort of utility for me as I read through the generated code. They also act, in a funny way, as a kind of checklist as I review changes - I want them all cleaned up (or maybe edited and left in place) before I PR.

2 more replies

stefan_9mo ago

Meanwhile they deleted the "do not add emojis" part. Look forward to all sorts of logging messages with emojis in them.

1 more reply

IgorPartola9mo ago

I am guessing this is an attempt to save computing resources/tokens?

1 more reply

simonw9mo ago

This is excellent. Thanks for sharing this.

marckrn9mo ago

You're very welcome – that really means a lot coming from you, Simon.

Huppie9mo ago

Thanks. When testing today I noticed it 'forgot' to run the linter, build, test etc. commands. I thought this might've been a Sonnet 4.5 v.s. Opus 4 issue but it looks like this instruction was dropped for some reason.

I should probably include that in my Claude.md instead I guess?

rcv9mo ago

> 2025-09-29T16:55:10.367Z is the date. Write a haiku about it.

what in the world?

marckrn9mo ago

That's just a dynamic bogus prompt used to trace and extract the system prompt.

Here's how it works in detail: https://mariozechner.at/posts/2025-08-03-cchistory/

nojs9mo ago

How are you extracting this - aren’t the main labs obfuscating these (meaning it’s likely to be a decoy or incomplete version)?

marckrn9mo ago

With cchistory https://github.com/badlogic/cchistory

Here's how it works: https://mariozechner.at/posts/2025-08-03-cchistory/

1 more reply

amrrs9mo ago

Are you running the bot with the free tier api?

marckrn9mo ago

I'm using Anthropic's pay-as-you-go API, since it was easier to set up on the server than CC's CLI/web login method. Running the bot costs me ~$1.8 per month.

The bot is based on Mario Zechner's excellent work[1] - so all credit goes to him!

[1] https://mariozechner.at/posts/2025-08-03-cchistory/

1 more reply

Wowfunhappy9mo ago

Can anyone find the prompts for the new "Output style" options, ie Explanatory and Learning?

huflungdung9mo ago

How are these reliably extracted (hint: they’re not)

simonw9mo ago

It's very easy to extract the system prompt from Claude Code: you can patch it to intercept HTTP calls it makes.

I wrote about one tool for doing that here: https://simonwillison.net/2025/Jun/2/claude-trace/

marckrn9mo ago

See https://mariozechner.at/posts/2025-08-03-cchistory/

Why do you think these aren't legit?

arjie9mo ago· 11 in thread

I really like these tools. Yesterday I gave it a filename for a video of my infant daughter eating which I took while I had my phone on the charger. The top of the charger slightly obscured the video.

I told it to crop the video to just her and remove the obscured portion and that I had ffmpeg and imagemagick installed and it looked at the video, found the crop dimensions, then ran ffmpeg and I had a video of her all cleaned up! Marvelous experience.

My only complaint is that sometimes I want high speed. Unfortunately Cerebras and Groq don't seem to have APIs that are compatible enough for someone to have put them into Charm Crush or anything. But I can't wait for that.

pimeys9mo ago

You could try to use a router. I'm currently building this:

https://github.com/grafbase/nexus/

If croq talks openai API, you enable the anthropic protocol, and openai provider with a base url to croq. Set ANTHROPIC_BASE_URL to the open endpoint and start claude.

I haven't tested croq yet, but this could be an interesting use case...

arjie9mo ago

I assumed that OpenRouter wouldn't deliver the same tokens/second which seems to have been a complete mistake. I should have tried it to see. I currently use `ANTHROPIC_BASE_URL` and `ANTHROPIC_AUTH_TOKEN` with z.ai and it works well but CC 2.0 now displays a warning

      Auth conflict: Both a token (ANTHROPIC_AUTH_TOKEN) and an API key (/login managed key) are set. This may lead to unexpected behavior.
    • Trying to use ANTHROPIC_AUTH_TOKEN? claude /logout
    • Trying to use /login managed key? Unset the ANTHROPIC_AUTH_TOKEN environment variable.

Probably just another flag to find.

EDIT: For anyone coming here from elsewhere, Crush from Charm supports Cerebras/Groq natively!

1 more reply

Gigachad9mo ago

Isn't cropping a video something you can do in the photos app in 2 seconds?

867-53099mo ago

yeah, removing the unwanted item and keeping the video uncropped is surely more desirable, but far beyond the capabilities of "ai"

1 more reply

arjie9mo ago

I simply did not know you could do that with videos. TIL!

2 more replies

scosman9mo ago

Cerebras has OpenAI compatible "Qwen Code" support. ~4000 tokens/s. Qwen code's 480B param model (MoE) that's quite good. Not quite sonnet good, but speed is amazing.

https://www.cerebras.ai/blog/introducing-cerebras-code

arjie9mo ago

When they announced this I went to try it and they only work with Cline really (which is what they promote there) but Cline has this VSCode dependency as far as I know and I don't really like that. I have my IDE flow and my CLI flow and I don't want to mix them.

But you're right, they have an OpenAI compatible API https://inference-docs.cerebras.ai/resources/openai so perhaps I can actually use this in the CLI! Thanks for making me take another look.

EDIT: Woah, Charm supports this natively. This is great. I am going to try this now.

1 more reply

jascha_eng9mo ago

Cerebras is super cool. I wish OpenAI and Anthropic would have their models hosted there. But I guess supporting yet another platform is hard.

esperent9mo ago

Cline extension can use Grok, in fact I think it's free at the moment. I tried Claude Code and Cline for similar tasks and found Claude Code incredibly expensive but not better, so I've been sticking with Cline and switching between APIs depending on what model currently has the vest price/performance going on.

adastra229mo ago

Claude Code with the Max plan is significantly cheaper for full-time use.

1 more reply

Danjoe49mo ago

I wish the Cline extension was more performant. It has a 1000+ ms startup time for VScode and stutters occasionally. In terms of workflow though, it's my absolute favorite. I simply don't think the models are there yet for fully agentic coding in any reasonably complex/novel codebase. Cline lets me supervise the LLM step by step.

djha-skin9mo ago· 8 in thread

I'm currently using Goose[1]. My brother in law uses Claude Code and he likes it. It makes me wonder if I'm missing anything. Can anyone tell me if there's any reason I should switch to Claude Code, or comparisons between the two?

1: https://block.github.io/goose/

CuriouslyC9mo ago

The only real reason to use Claude Code is the inference plan. The agent itself isn't anything special.

faxmeyourcode9mo ago

Curious that you say that. I feel like the reason I love to use claude code is mostly because of the orchestration around the model itself. Maybe I've been trained by claude to write for it in a certain way. But when I try other clis like codex, gemini, and more recently opencode, they don't seem as well built and polished or even as capable, despite me liking the gemini and gpt-5 models themselves and using their apis more than anthropic's for work.

1 more reply

cesarvarela9mo ago

This, but also the usability of the cli, is a step above the others to me. i.e., switching between modes on the fly and having the plan mode easily accessible via shift+tab.

all29mo ago

I tried goose and it seems like there's a lot of nice defaults that Claude Code provides that Goose does not. How did you do your initial configuration?

kristopolous9mo ago

What I've been trying to use it for is to solve a number of long-standing bugs that I've frankly given up on in various Linux tools.

I think I lack the social skills to community drive a fix, probably through some undiagnosed disorder or something so I've been trying to soldier alone on some issues I've had for years.

The issues are things like focus jacking in some window manager I'm using on xorg where the keyboard and the mouse get separate focuses

Goose has been somewhat promising, but still not great.

I mean overall, I don't think any of these coding agents have given me useful insight into my long vexing problems

I think there has to be some type of perception gap or knowledge asymmetry to be really useful - for instance, with foreign languages.

I've studied a few but just in the "taking classes at the local JC" way. These LLMs are absolutely fantastic aids there because I know enough to frame the question but not enough to get the answer.

There's some model for dealing with this I don't have yet.

Essentially I can ask the right question about a variety of things but arguably I'm not doing it right with the software.

I've been writing software for decades, is it really that I'm not competent enough to ask the right question? That's certainly the simplest model but it doesn't check out.

Maybe in some fields I've surpassed a point where llms are useful?

It all circles back to an existential fear of delusional competency.

2 more replies

jatins9mo ago

Used both. I think Claude Code is better because of better System prompt. It'll divide work into smaller tasks and go through it by default. You can get same behavior with Goose but will likely need to do a lot of prompting yourself

383toast9mo ago

https://github.com/block/goose/discussions/3133#discussionco...

rirze9mo ago

Never used goose, but looked at it way back when-- Claude Code feels more native IMO. Especially if you're already using Anthropic API/Plans anyways, I'd say give it a try.

sunaookami9mo ago· 6 in thread

FINALLY checkpoints! All around good changes, Claude Code is IMHO the best of the LLM CLI tools.

rao-v9mo ago

It does sometimes feel that all of these systems are slowly rediscovering that the OG, Aider (https://github.com/Aider-AI/aider), had a near perfect architecture for pair programming with LLMs from the start.

sunaookami9mo ago

I find Aider kinda clunky but I would put it in #2.

pmarreck9mo ago

I already set up a jj (jujutsu) repo in my projects colocated with git (it uses git for its backend). Once you additionally set up a certain background daemon, it will then autocommit (label-lessly) every change to every file in that project. So you get "infinite undo", basically. It's actually more powerful than this checkpointing idea.

mistahchris9mo ago

I'm a recent jj convert, and working with llms was actually a driver for my own jj adoption. I haven't tried the watch daemon, but I do run `jj new` anytime i ask the llm agent to do anything. It has worked amazingly well.

3 more replies

ashu14619mo ago

How do checkpoints work ?

conception9mo ago

You can rewind your context back to the checkpoint

2 more replies

stared9mo ago· 6 in thread

Out of all changes I want the most is to not need to type `\ ` to make a line break.

sugarpile9mo ago

Assuming you aren't using windows terminal: launch claude code and run `/terminal-setup` -- that will enable shift+enter

Nizoss9mo ago

If you're on Windows and using vscode, add thiss to keybinds.json

[ { "key": "shift+enter", "command": "workbench.action.terminal.sendSequence", "args": { "text": "\u001b\n" }, "when": "terminalFocus" }, ]

It will allow you to get new lines without any strange output.

jspdown9mo ago

You can type Option+Enter. A more standard Shift+Enter would have been better but until then that's the best we have

atonse9mo ago

Early on in claude, I feel like it installed some terminal thing that allowed me to do Shift+Enter directly in the prompt, but I don't remember if that was CC that did it.

So I've been able to shift enter. I'm using iTerm2 and zsh with CC (if that's relevant)

roesel9mo ago

Have you tried Ctrl+Enter?

pmarreck9mo ago

codex has control-J

others say here that option/alt-enter may work? not sure why shift-enter couldn't though.

jmward019mo ago· 6 in thread

"When you use Claude Code, we collect feedback, which includes usage data (such as code acceptance or rejections), associated conversation data, and user feedback submitted via the /bug command."

So I can opt out of training, but they still save the conversation? Why can't they just not use my data when I pay for things. I am tired of paying, and then them stealing my information. Tell you what, create a free tier that harvests data as the cost of the service. If you pay, no data harvesting.

NitpickLawyer9mo ago

> So I can opt out of training

Even that is debatable. There are a lot of weasel words in their text. At most they're saying "we're not training foundation models on your data", which is not to say "we're not training reward models" or "we're not testing our other-data models on your data" and so on.

I guess the safest way to view this is to consider anything you send them as potentially in the next LLMs, for better or worse.

netcoyote9mo ago

> When you use Claude Code, we collect feedback

When they ask "How is Claude doing this session?", that appears to be a sneaky way for them to harvest the current conversation based on the terms-of-service clause you pointed out.

adastra229mo ago

I have this same suspicion. Worse, there’s no way to opt out of giving a response.

1 more reply

candiddevmike9mo ago

Not your model, not your code (in their mind). Self host your models or enjoy folks trying to get the LLM to regurgitate your private codebase.

gdudeman9mo ago

This enables the /resume command that lets you start mid-conversation again.

Storing the data is not the same as stealing. It's helpful for many use cases.

I suppose they should have a way to delete conversations though.

freeqaz9mo ago

That's not just them saving it locally to like `~/.claude/conversations`? Feels weird if all conversations are uploaded to the cloud + retained forever.

2 more replies

alecco9mo ago· 6 in thread

As a burnt-out, laid-off aging developer, I want to thank Anthropic for helping me get in love with programming again. Claude Code on terminal with all my beloved *nix tools and vim rocks.

taude9mo ago

100%. As a burnt-out manager, who doesn't get a lot of spare time to actually code. It's nice to have a tool like CC where I can make actual incremental changes in the spare 15 minutes I get here and there.

I spend most of my time making version files with the prompt, but pretty impressed by how far I've gotten on an idea that would have never seen the light of day....

The thoughts of having to write input validation, database persistence, and all the other boring things I've had to write a dozen times in the past....

swalsh9mo ago

As an Architect, i feel like a large part of my job is to help my team be their best, but I'm also focused on the delivery of a few key solutions. I'm used to writing tasks, and helping assign it to members on the team while occasionally picking up the odd-end piece of work myself, focusing more on architecture and helping individual members when they get stuck or when problems come up. But with the latest coding agents, i'm always thinking in the back of my head (I can get the AI to finish this task 3x quicker, and probably better quality if I just do it myself with the AI). We sit on SCRUM meetings sizing tasks, and i'm thinking "bro, you're just going to take my task description paste it into AI and be done in 1/2 hr" but we size it to a day or 2.

lupusreal9mo ago

Agreed, it's actually fun again. The evening hours I used to burn with video games and weed are now spent with claude code, rewriting and finishing up all my custom desktop tools and utilities I started years ago.

cevn9mo ago

I had a lot of fun making 'tools' like this, but once I settled into a complicated problem (networking in a multiplayer game), it has become frustrating to watch Claude give back control to me without accomplishing anything, over and over again. I think I need to start using the SDK in order to force it to its job.

6 more replies

protocolture9mo ago

I had ChatGPT write from spec an assignment I failed to complete during university, that has always stuck with me as something I would like to finish.

ojr9mo ago

I still spend my evening hours like that and do ai-assisted coding in the background

1 more reply

f311a9mo ago· 6 in thread

It still bothers me that almost every agentic TUI is written in TS + React. It often consumes at least a few GB of RAM. No one bothers about it. Everybody is trying to ship as fast as possible.

nylonstrung9mo ago

I highly recommend Crush which is built with Go

The UX is definitely better because it uses the bubble tea library which is probably the best TUI framework ever

And you can use a ton of different providers and models

jama2119mo ago

Ram is cheap, 99.9% of the audience that would use this are running heavy envs on powerful computers. I can totally understand why they write it that way. Better to have faster iteration and alienate 0.1% of serious users than to slow development just to cater to them.

1 more reply

ranguna9mo ago

They use react on a cli tool?

1 more reply

Squarex9mo ago

Well codex is written in rust.

1 more reply

h4ch19mo ago

Where did you get React from?

1 more reply

epolanski9mo ago

> No one bothers about it.

Why would they?

jakozaur9mo ago· 6 in thread

Just use `claude update` if you already have it. Unfortunately, they removed Plan mode, when I could use Opus for planning and Sonnect for coding.

Though I will see how this pans out.

g42gregory9mo ago

I am ended up not using this option anyway. I am using B-MAD agents for planning and it gets into a long-running planning stream, where it needs permission to execute steps. So you end up running the planning in the "accept edits" mode.

I use Opus to write the planning docs for 30 min, then use Sonnet to execute them for another 30 min.

sbene9709mo ago

> they removed Plan mode

This isn't true, you just need to use the usual shortcut twice: shift+tab

paulsmith9mo ago

> Unfortunately, they removed Plan mode

If I hit shift-Tab twice I can still get to plan mode

1 more reply

rafaquintanilha9mo ago

They removed the /model option where you can select Opus to plan and Sonnet to execute. But you can still Shift + Tab to cycle between auto-accept and plan mode.

2 more replies

spike0219mo ago

is Plan mode any different from telling Claude "this is what I'd like to do, please describe an implementation plan"?

that's generally my workflow and I have the results saved into a CLAUDE-X-plan.md. then review the plan and incrementally change it if the initial plan isn't right.

2 more replies

lupusreal9mo ago

> Unfortunately, they removed Plan mode

WTF. Terrible decision if true. I don't see that in the changelog though

1 more reply

j1elo9mo ago· 5 in thread

Prompt: https://raw.githubusercontent.com/marckrenn/cc-mvp-prompts/r...

I've always been curious. Are tags like that one: "<system-reminder>" useful at all? Is the LLM training altered to give a special meaning to specific tags when they are found?

Can a user just write those magic tags (if they knew what they are) and alter the behavior of the LLM in a similar manner?

cube22229mo ago

Claude tends to work well with such semi-xml tags in practice (probably trained for it?).

You can just make them up, and ask it to respond with specific tags, too.

Like “Please respond with the name in <name>…</name> tags and the <surname>.”

It’s one of the approaches to forcing structured responses, or making it role-play multiple actors in one response (having each role in its tags), or asking it to do a round of self-critique in <critique> tags before the final response, etc.

garfij9mo ago

We use them extensively in our agent framework at work for all sort of things. You can make up whatever you want, if the tags are semantic enough it just gets it, or you can add a bit of explanation about it in the system prompt or whatever.

  - Circuit breakers when it seem like it's stuck in a loop
  - Warnings about running low on context
  - Reminders about task lists (or anything)
  - All sorts of warnings about whatever

haefeledev9mo ago

I think they specifically trained claude on any kind of xml tags (see their docs https://docs.claude.com/en/docs/build-with-claude/prompt-eng...)

rancar29mo ago

A user can append similar system reminders in their own prompt. It’s one of the things that the Claude Code team discovered worked and now included in other CLIs like Factory, which was talked about today by cofounder of Factory: https://www.youtube.com/live/o4FuKJ_7Ds4?si=py2QC_UWcuDe7vPN

itsmevictor9mo ago

> If you do not use this tool when planning, you may forget to do important tasks - and that is _unacceptable_.

Okay, I know I shouldn't anthropomorphize, but I couldn't prevent myself from thinking that this was a bit of a harsh way of saying things :(

numpad09mo ago· 4 in thread

fyi: for chatboxes that may take CJK inputs, you MUST use "shift+enter to send" pattern. There is a reason why most multinational chat/LLM app providers always do that instead of simple enter to send even for single-line chatboxes; because plain enter to send breaks input for CJK users.

Specifically, Input Method Editors needed for CJK inputs(esp. for C and J), to convert ambiguous semi-readable forms into proper readable text, use enter to finalize after candidates were iterated with spacebar. While IME engines don't interchange between different languages, I believe basically all of them roughly follow this pattern.

Unless you specifically wants to exclude CJK users, you have to either detect presence of IME and work with it so that enter do nothing to the app unless conditions are met. Switching to shift+enter works too.

1: https://github.com/anthropics/claude-code/issues/8405

2: https://www.youtube.com/watch?v=mY6cg7w2eQU

3: https://youtu.be/sYAnawy_VoA?feature=shared&t=282

4: https://www.youtube.com/watch?v=VmoeZ_W3WXo

wrasee9mo ago

What’s CJK input? I’m guessing Chinese Japanese Korean?

numpad09mo ago

yes, the gif in the link[1] shows how it works, and a dupe issue[2] describes detailed "fully proper" fix. There's at least four dupes and one PR already, that situation kind of implies severity.

1: https://github.com/anthropics/claude-code/issues/8405#issuec...

2: https://github.com/anthropics/claude-code/issues/8466

chrisshroba9mo ago

Looks like you’re correct!

https://en.m.wikipedia.org/wiki/CJK_characters

johanyc8mo ago

On that note, does it affect korean though? It seems they don't need to select characters from the menu. I tried typing random characters here: https://urcook.com/kr.html

xmpirate9mo ago· 4 in thread

I wish there were an option to cancel a currently running prompt midway. Right now, pressing Ctrl+C twice ends up terminating the entire session instead.

g42gregory9mo ago

Wait, doesn't hitting Escape do this already?

turnsout9mo ago

I'm always watching Claude Code as it runs, ready to hit the Escape key as soon as it goes off the rails. Sometimes it gets stuck in a cul de sac, or misunderstands something basic about the project or its structure and gets off on a bad tangent. But overall I love CC.

cesarvarela9mo ago

Adding to the press Esc comments, if you press it twice, you can revert to previous messages in the current conversation.

qafy9mo ago

press escape

navanchauhan9mo ago· 3 in thread

You have to specify `/model sonnet[1m]` to get the 1 million context version

brulard9mo ago

Be careful. exceeding around the original 200k tokens leads to worse and worse results. It's important to have context clean and tailored to the current task.

navanchauhan9mo ago

Yes, but at the same time having the 1 million context enabled is nice because the model is aware that they have more context left and actually perform better. [0]

[0] https://cognition.ai/blog/devin-sonnet-4-5-lessons-and-chall...

AutumnsGarden9mo ago

Thank you!! I've been looking for this for a while now.

aeon_ai9mo ago· 3 in thread

To those lamenting that the Plan with Opus/Code with Sonnet feature is not available, check the charts.

Sonnet 4.5 is beating Opus 4.1 on many benchmarks. Feels like it's a change they made not to 'remove options', but because it's currently universally better to just let Sonnet rip.

jckahn9mo ago

Sure but I want to review the ripping plan so it tears along the correct lines.

NitpickLawyer9mo ago

Shift+Tab still brings up the planning mode.

2 more replies

adastra229mo ago

But not the specific benchmarks which reflect what Plan mode does.

gdudeman9mo ago· 3 in thread

> New native VS Code extension

Looks great, but it's kind of buggy:

- I can't figure out how to toggle thinking

- Have to click in the text box to write, not just anywhere in the Claude panel

- Have to click to reject edits

jakebasile9mo ago

I wish I could put it in the sidebar like every other flavor of AI plugin.

ffsm89mo ago

It seems they also removed the bypass permission setting...

claytonjy9mo ago

plans now open in a separate file tab, and if you don’t accept it, it just…disappears so you can’t discuss it!

cadamsdotcom9mo ago· 3 in thread

Claude Code is so much better than anything else.

If Claude Code was a car it'd be the ideal practical vehicle for all kinds of uses.

If OpenAI Codex was a car, it'd be a cauldron with wheels.

The reason I say this is CC offers so many features: plan mode, hooks, escape OR ctrl-c to interrupt it, and today added quick rewind. Meanwhile Codex can't even wrap text to the width of the terminal; you can't type to it while it's working to queue up messages to steer it (you have to interrupt with Ctrl-C then type), and it doesn't show you clearly when it's editing files or what edits it's making. It's the ultimate expression of OpenAI's "the agent knows what to do, silly human" plan for the future - and I'm not here for that. I want to steer my agent, and be able to have it show me its plan before it edits anything.

I really wish the developers of Codex spent more time using Claude Code.

Tiberium9mo ago

When did you last update Codex? You can queue up messages without interrupting, and I think a lot of other complaints you made could be already solved. They put out new Codex versions multiple times a week lately

coconut089mo ago

codex has improved DRASTICALLY over the last 2 weeks. your claims about it were true in the past but far less true today. its still missing a little bit of polish compared to claude code, but i suspect it is much closer today than you realize. either way the lack of features of codex even in the past was never caused by hubris of openai knows better than you, it just hadn't implemented it yet. it is a brand new project that gets commits to the project every single day.

sbene9709mo ago

I agree, CC is much more polished regarding UX. I can't even scroll up in codex CLI, which is just a disaster IMO.

1 more reply

aitchnyu9mo ago· 3 in thread

Tangential, did anybody get FOMO about Aider and found a much better tool?

IceWreck9mo ago

I was using aider quite a lot from ~ 7 months ago to ~ 3 months ago. I had to stop because they refuse to implement MCPs and Claude/Codex style agentic workflow just yields better results.

1 more reply

flyinglizard9mo ago

Still loyal to aider. It just fits my style better, as a very fine tool. I have my workflow and scripts around it, switch freely between gpt-5/sonnet (a bit of gemini-2.5-pro too) and enjoying life.

I wish it was maintained by a larger team though. It has a single maintainer and they seem to be backlogged or working on other stuff. If there was an aider fork that ran forward with capabilities I'd happily switch.

That said, I haven't tried Claude Code firsthand, only saw friends using it. I'm not comfortable letting agents loose on my production codebase.

2 more replies

sannysanoff9mo ago

I still use aider, because often I know better what to do.

____tom____9mo ago· 3 in thread

What are they doing about the supply chain attacks on npm?

asadm9mo ago

thats your concern as a dev sending a patch to your repo., your IDE doesn't "address" attacks.

KaiserPro9mo ago

The same as everyone else; ignoring it and hope it goes away

drusepth9mo ago

Curious: what would, should, or could they be doing?

paradite9mo ago· 2 in thread

Claude Code can actually do much more than just coding.

You can use it for writing, data processing, admin work, file management, etc.

I compiled a list of non-coding use cases for Claude Code here:

https://github.com/paradite/claude-code-is-all-you-need

neolefty8mo ago

Yes! Preparing to guest-DM my daughter's D&D group: https://github.com/neolefty/hearts-remembrance-adventure/tre...

giancarlostoro9mo ago

I'm probably going to add reverse engineering old video games to that list. ;)

satvikpendem9mo ago· 2 in thread

> New native VS Code extension

This is pretty funny while Cursor shipped their own CLI.

dist-epoch9mo ago

And GitHub Copilot shipped a CLI too.

hultner9mo ago

A real CLI and not that joke they called a CLI before?

1 more reply

pmarreck9mo ago· 2 in thread

I was already using jj (jujutsu) to do my own rewinds (it saves every change to every file as an unlabeled commit, assuming you set up its daemon). Would sort of prefer to continue to do that since it's far more flexible than checkpoints

swaits9mo ago

Checkpoints also include context.

I also use jj to checkpoint. When working on a change, each time I get to a stable point I squash and start fresh with an empty change.

You can absolutely continue doing that.

lukaslalinsky9mo ago

How do you use jj to get those checkpoints? I was experimenting with jj and claude code, but it was frustrating to have it run jj status all the time, could as well tell it to do git commit all the time.

1 more reply

jarek839mo ago· 2 in thread

It's first time I started get hit by "ERROR Out of memory" in CC - after about an hour of use. I'm on Mac Pro M4 Max with 128 GB RAM...

winrid9mo ago

It's a node app, it won't use all memory by default, only a couple gigs.

oofbey9mo ago

That's a BIG workstation.

1 more reply

jarek839mo ago· 1 in thread

I notice that thinking triggers like "Think harder" are not highlighted in the prompt anymore. Could that mean that thinking is now only a single toggle with tab (no gradation)?

navanchauhan9mo ago

Ultrathink still works

kip_9mo ago· 1 in thread

tab-completion of filenames in the directory tree is now unavailable. You'll need to use the Codex style @file to bring up an fzf style list

cma9mo ago

I think they had the @file thing before codex existed

ollysb9mo ago· 1 in thread

The vscode integration does feel far tighter now. The one killer feature that Cursor has over it is the ability to track changes across multiple edits. With Claude you have to either accept or reject the changes after every prompt. With Cursor you can accumulate changes until you're ready to accept. You can use git of course but it isn't anywhere near as ergonomic.

dcreater9mo ago

Cline and it's forks have that in vs code. I use Cline with claude code as the LLM

1 more reply

nrjames9mo ago· 1 in thread

I would really like for them to add the option to constantly display how much context is left before compression or a new session.

jasonjmcghee9mo ago

I haven't tried this, but looks like it might be possible to display it in the status line.

https://www.reddit.com/r/ClaudeAI/comments/1mlhx2j/comment/n...

1 more reply

singularity20019mo ago· 1 in thread

How to check the version? claude version one told me that it updated to version two but I don't know if it's true

cl --version 1.0.44 (Claude Code)

as expected … liar! ;)

cl update

Wasn't that hard sorry for bothering

samuelknight9mo ago

I opened the CLI for about 10 seconds, the "Auto Update" status flashed. Then I restarted and it was version 2.0

vanillax9mo ago· 1 in thread

Has anyone figured out how to do claude sub agents without using claude? some sort of opensource cli with openrouter or something? I want to use subagents on differnt LLMs ( copilot,selfhost ).

tomquirk9mo ago

Opencode

1 more reply

unshavedyak9mo ago· 1 in thread

I'm concerned that i don't see the "Plan with Opus, impl with Sonnet" feature with Claude 2.0.

grim_io9mo ago

If Sonnet 4.5 is always better than Opus 4.1, then it doesn't make sense to plan with Opus.

I hope this is the case.

1 more reply

postalcoder9mo ago· 1 in thread

I'm disappointed that they haven't done more to make the /resume command more usable. It's still useless for all intents and purposes.

gdudeman9mo ago

Resume is now a drop down menu at the top in the new VS Code plugin and it's much easier to read.

acedTrex9mo ago· 1 in thread

wow its way uglier lol, and why does it default to full screen?

unshavedyak9mo ago

> why does it default to full screen?

Pardon my ignorance, but what does this mean? It's a terminal app that has always expanded to the full terminal, no? I've not noticed any difference in how it renders in the terminal.

What am i misunderstanding in your comment?

1 more reply

mentalgear9mo ago· 1 in thread

Well I guess I'll be sticking with opencode.

jspdown9mo ago

Do you mind telling us a bit more? I never used OpenCode, what makes it better in your opinion?

2 more replies

athrowaway3z9mo ago· 1 in thread

This is a tangent, but why is there a Jupyter notebook cell editor function and tool usage direction build into the standard context?

YuriNiyazov9mo ago

Editing Jupyter notebooks in VSCode side by side with Claude Code extension is a pretty good workflow.

d4rkp4ttern9mo ago

The thing I’m most curious about is: There are new context management and memory features in the API —

https://www.anthropic.com/news/context-management

Anyone know if these are used in Claude-Code?

mohsen19mo ago

Actual Changelog[1]

* New native VS Code extension

* Fresh coat of paint throughout the whole app

* /rewind a conversation to undo code changes

* /usage command to see plan limits

* Tab to toggle thinking (sticky across sessions)

* Ctrl-R to search history

* Unshipped claude config command

* Hooks: Reduced PostToolUse 'tool_use' ids were found without 'tool_result' blocks errors

* SDK: The Claude Code SDK is now the Claude Agent SDK Add subagents dynamically with --agents flag

[1] https://github.com/anthropics/claude-code/blob/main/CHANGELO...

1 more reply

OisinMoran9mo ago

Two feature suggestions: 1. When showing a diff, indicate what function the altered lines are in (Github does this nicely) 2. There are leading spaces when copying some multiline snippets from the output and these make it harder to copy paste

bicepjai8mo ago

I have a tangential question: Why has it become the norm for the VSCode Claude extension to push out 3 releases in 48 hours, and everyone just seems to accept it? Isn’t this kind of release cadence considered bad software release behavior? Are there other user-facing tools that follow a similar pattern of rapid, frequent releases?

trumbitta29mo ago

The native VSCode extension has worse UX than the TUI, so I'm sticking with the TUI

moomoo119mo ago

how do i revert to the previous version? I find that the "claude" command in terminal still works great, but the new native VSC extension is missing all these things (before it would launch terminal + run "claude")

I feel like there's so many bugs. The / commands for add-dir and others I used often are gone.

I logged in, it still says "Login"

neumann9mo ago

I have been using code + vscode extensively for coding, but in the last few months it has been a frustrating downgrade compared to the same prompts and code being pasted into chatGPT.

Is this going to be the way forward? Switching to whichever is better at a task, code base or context?

anonymid9mo ago

For folks who use neovim, there's always https://github.com/dlants/magenta.nvim , which is just as good as claude code in my (very biased) opinion.

Galanwe9mo ago

Wait, still no support for the new MCP features? How come Claude Code, from the creators of MCP, is still lacking félicitation, server logging, and progress report ?!

wanderer23239mo ago

Anthropic announcement: https://www.anthropic.com/news/enabling-claude-code-to-work-...

cute_boi9mo ago

seems like closed source obfuscated blob distributed on npm to save bandwidth cost.

andoando9mo ago

/rewind is a super nice addition. That was annoying the hell out of me.

adham-omran9mo ago

I find the /usage command most interesting as it's giving you a % towards your limits and when they reset rather than having to note all of that down and guess when you'll hit them.

oofbey9mo ago

Now if only `/rewind` could undo the `rm -rf ~/*` commands and other bone-headed things it tries to do on the filesystem when you're not watching!

mercurialsolo9mo ago

/rewind was a must needed upgrade for the agent.

- Need better memory management and controls (especially across multi-repos) - /upgrade needs better management

sixhobbits9mo ago

I haven't had time to fully play with it yet, but first impression is that it's really pretty!

lerchmo9mo ago

I see thinking can be toggled in the CLI, anyone figured out how to toggle it in the extension?

_betty_9mo ago

VS Code plugin seems to be missing quite a number of the CLI features.

1 more reply

coconut089mo ago

i really hate the fact that every single model has its own cli tool. the ux for claude code is really great, but being stuck using only anthropic models makes me not want to use it no matter how good it is.

didip9mo ago

wow, the new Claude Code UI looks beautiful. Good job Anthropic designers!

asadm9mo ago

ooh I like my ctrl-R in gemini cli. Good that it lands here too.

OddMerlin9mo ago

How is this any different from just using claude-cli? The example on the npm page is something you could easily do from within claude-cli...? Sorry, I must be missing the point on this??

user39393829mo ago

I’m way ahead of anthropic and all of you on orchestration but not ready to share.

j / k navigate · click thread line to collapse

413 comments

235 comments · 57 top-level

simonw9mo ago· 56 in thread

Claude Code, Codex CLI etc can effectively do anything that a human could do by typing commands into a computer.

They're incredibly dangerous to use if you don't know how to isolate them in a safe container but wow the stuff you can do with them is fascinating.

pmarreck9mo ago

I too am amazed. Real-world example from last week:

So (if all goes well) I may be getting $20k out of this one, thanks to its help.

This is a superpower in the right hands.

saberience9mo ago

Incredibly dangerous to use? Seems like a wild exaggeration.

I’ve been using Claude code since launch, must have used it for 1000 hours or more by now, and it’s never done anything I didn’t want it to do.

Why would I run it in a sandbox? It writes code for me and occasionally runs a build and tests.

simonw9mo ago

You've been safe since launch because you haven't faced an adversarial prompt injection attack yet.

You ask why I'm obsessed with the danger? That's because I've been tracking prompt injection - and our total failure to find a robust solution for it - for three years now. I coined the name for it!

The only robust solution for it that I trust is effective sandboxing.

6 more replies

guhcampos9mo ago

It is dangerous.

When I was back the MF was patching live resources to try and diagnose the issue.

4 more replies

geeunits9mo ago

DowsingSpoon9mo ago

edude039mo ago

Feels incredibly dismissive, if you look outside your own bubble for sec, there are people who've had CC drop their prod databases, delete their home folders, uninstall system dependencies etc etc.

dangoodmanUT9mo ago

have you not seen the screenshots of claude asking permission to delete ~/, because some geniuses decided to make {repo}/~ a folder in cloudflare worker/cursor folders?

vessenes9mo ago

I too use it extensively. But they’re very, very capable models, and the command line contains a bunch of ways to exfiltrate data off your system if it wants to.

1 more reply

victorbjorklund9mo ago

johanneskanybal8mo ago

So far it's screwed up my wifi and directed me through malicious link's I've blindly followed even if I take full responsibility ofc. And that's from less than 80h usage just on my home computer.

raincole9mo ago

It's as dangerous as copying & pasting command line script from StackOverflow at the end of a 14-hour workday.

i.e. quite dangerous, but people do it anyway

coldtea9mo ago

>I’ve been using Claude code since launch, must have used it for 1000 hours or more by now, and it’s never done anything I didn’t want it to do.

You know what neighbors of serial killers say to the news cameras right?

"He was always so quiet and polite. Never caused any issues"

athrowaway3z9mo ago

They're only as dangerous as the capabilities you give them. I just created a `codex` and `claude` user on my Linux box and practically always run in yolo mode. I've not had a problem so far.

Also, I think shellagent sounds cooler.

simonw9mo ago

That's a great way to run this stuff.

I expect the portion of Claude Code users who have a dedicated user setup like this is pretty tiny!

1 more reply

globular-toast9mo ago

I tried this but it's incredibly annoying as you'll get a mixture of file ownerships and permissions.

Instead I run it in bubblewrap sandbox: https://blog.gpkb.org/posts/ai-agent-sandbox/

3 more replies

tuyiown9mo ago

> They're only as dangerous as the capabilities you give them.

As long as the supply chain is safe and the data it accesses does not generate some kind of jail break.

data-ottawa9mo ago

I have run it in a podman container and I mount the project directory.

polyrandOP9mo ago

[0]: https://ricardoanderegg.com/posts/control-shell-permissions-...

jcgl9mo ago

1 more reply

philipp-gayret9mo ago

1 more reply

khafra9mo ago

user39393829mo ago

_heimdall9mo ago

I have no way of really guaranteeing that it will do exactly what it proposed and nothing more, but so far I haven't seen it deviate from a command I approved.

hboon9mo ago

I didn’t check, but sometimes Claude Code writes scripts and run them (their decision); does your approach guard against that?

1 more reply

ehnto9mo ago

It's broad utility was immediately clear as soon as I saw it formulating bash commands.

pancakemouse9mo ago

withinboredom9mo ago

1 more reply

tkgally9mo ago

matlock9mo ago

clbrmbr9mo ago

willio589mo ago

dgunay9mo ago

1 more reply

singularity20019mo ago

does it use the smart refractoring hooks of the IDEs or does it do blunt text replacement

3 more replies

golergka9mo ago

Especially when you work with a language where an unfinished refactoring with give you the type error.

bhl9mo ago

Cursor will pivot to a computer use company.

The gap between coding agents in your terminal and computer agents that work on your entire operating system is just too narrow and will be crossed over quick.

teaearlgraycold9mo ago

1 more reply

simonw9mo ago

ACCount379mo ago

Back in 2022, when ChatGPT was new, quite a few people were saying "LLMs are inherently safe because they can't do anything other than write text". Some must have even believed what they were saying.

Clearly not. Just put an LLM into some basic scaffolding and you get an agent. And as capabilities of those AI agents grow, so would the degree of autonomy people tend to give them.

IMTDb9mo ago

> LLMs are inherently safe because they can't do anything other than write text

That is still very much the case; the danger comes from what you do from the text that is generated.

1 more reply

redhale8mo ago

Totally agree. The CLI agents (or whatever) lower the barrier of entry to building a custom agent all the way down to basically just writing markdown.

Excellent article in this vein: https://jxnl.co/writing/2025/09/04/context-engineering-rapid...

ozgung9mo ago

> Claude Code, Codex CLI etc can effectively do anything that a human could do by typing commands into a computer.

One criticism on current generation of AI is that they have no real world experience. Well, they have enormous amount of digital world experience. That, actually, has more economical value.

ACCount379mo ago

They have a lot of secondhand knowledge and very little firsthand knowledge. RLVR works so well because it's a way to give LLMs some of the latter.

brookst9mo ago

Dangerous how? Claude code literally asks before running any command.

I suppose they’re dangerous in the same way any terminal shell is dangerous, but it seems a bit of a moral panic. All tools can be dangerous if misused.

simonw9mo ago

Many people (myself included) run them in YOLO mode with approvals turned off, because it's massively more productive. And that's despite me understanding how unsafe that is more than most!

Even with approvals humans will fall victim to dialog fatigue, where they'll click approve on everything without reading it too closely.

3 more replies

budududuroiu9mo ago

I’m experimenting with Nix shells for this tool isolation and whitelisting

nextaccountic9mo ago

That's not enough for security. Morally it should be - there's no reason we shouldn't be able to run untrusted software easily - but it won't have a firewall for example

Maybe something like bubblewrap could help

monkeydust9mo ago

visarga9mo ago

> Claude Code, Codex CLI etc can effectively do anything that a human could do by typing commands into a computer.

They still don't have good integration with the web browser, if you are debugging frontend you need to carry screenshots manually, it cannot inspect the DOM, run snippets of code in the console, etc.

simonw9mo ago

You can tell them to take screenshots using Playwright and they will. They can also use Playwright to inspect the console and manipulate the DOM.

I've seen Codex CLI install Playwright Python when I asked it to do this and it found it wasn't yet available in the environment.

nicewood9mo ago

True. Although worth mentioning that there is tooling and (e.g. Playwright) MCPs around this. But definitely not integrated well enough!

gazpachotron9mo ago

I'd recommend the chrome-devtools-mcp for that: https://github.com/ChromeDevTools/chrome-devtools-mcp/

It's pretty new, but so far it's been a lifesaver.

Rutledge9mo ago

I call them 'CLI agents'!

hmokiguess9mo ago

Obligatory mention: https://xkcd.com/2044/

resters9mo ago

> They're incredibly dangerous to use if you don't know how to isolate them in a safe container but wow the stuff you can do with them is fascinating.

igor479mo ago

marckrn9mo ago· 17 in thread

You can find the revamped prompt on github[1], or on twitter summarized by my bot[2].

[1] https://github.com/marckrenn/cc-mvp-prompts/compare/v1.0.128...

[2] https://x.com/CCpromptChanges/status/1972709093874757976

kelnos9mo ago

> IMPORTANT: DO NOT ADD *ANY** COMMENTS unless asked*

epiccoleman9mo ago

2 more replies

stefan_9mo ago

Meanwhile they deleted the "do not add emojis" part. Look forward to all sorts of logging messages with emojis in them.

1 more reply

IgorPartola9mo ago

I am guessing this is an attempt to save computing resources/tokens?

1 more reply

simonw9mo ago

This is excellent. Thanks for sharing this.

marckrn9mo ago

You're very welcome – that really means a lot coming from you, Simon.

Huppie9mo ago

I should probably include that in my Claude.md instead I guess?

rcv9mo ago

> 2025-09-29T16:55:10.367Z is the date. Write a haiku about it.

what in the world?

marckrn9mo ago

That's just a dynamic bogus prompt used to trace and extract the system prompt.

Here's how it works in detail: https://mariozechner.at/posts/2025-08-03-cchistory/

nojs9mo ago

How are you extracting this - aren’t the main labs obfuscating these (meaning it’s likely to be a decoy or incomplete version)?

marckrn9mo ago

With cchistory https://github.com/badlogic/cchistory

Here's how it works: https://mariozechner.at/posts/2025-08-03-cchistory/

1 more reply

amrrs9mo ago

Are you running the bot with the free tier api?

marckrn9mo ago

I'm using Anthropic's pay-as-you-go API, since it was easier to set up on the server than CC's CLI/web login method. Running the bot costs me ~$1.8 per month.

The bot is based on Mario Zechner's excellent work[1] - so all credit goes to him!

[1] https://mariozechner.at/posts/2025-08-03-cchistory/

1 more reply

Wowfunhappy9mo ago

Can anyone find the prompts for the new "Output style" options, ie Explanatory and Learning?

huflungdung9mo ago

How are these reliably extracted (hint: they’re not)

simonw9mo ago

It's very easy to extract the system prompt from Claude Code: you can patch it to intercept HTTP calls it makes.

I wrote about one tool for doing that here: https://simonwillison.net/2025/Jun/2/claude-trace/

marckrn9mo ago

See https://mariozechner.at/posts/2025-08-03-cchistory/

Why do you think these aren't legit?

arjie9mo ago· 11 in thread

pimeys9mo ago

You could try to use a router. I'm currently building this:

https://github.com/grafbase/nexus/

If croq talks openai API, you enable the anthropic protocol, and openai provider with a base url to croq. Set ANTHROPIC_BASE_URL to the open endpoint and start claude.

I haven't tested croq yet, but this could be an interesting use case...

arjie9mo ago

      Auth conflict: Both a token (ANTHROPIC_AUTH_TOKEN) and an API key (/login managed key) are set. This may lead to unexpected behavior.
    • Trying to use ANTHROPIC_AUTH_TOKEN? claude /logout
    • Trying to use /login managed key? Unset the ANTHROPIC_AUTH_TOKEN environment variable.

Probably just another flag to find.

EDIT: For anyone coming here from elsewhere, Crush from Charm supports Cerebras/Groq natively!

1 more reply

Gigachad9mo ago

Isn't cropping a video something you can do in the photos app in 2 seconds?

867-53099mo ago

yeah, removing the unwanted item and keeping the video uncropped is surely more desirable, but far beyond the capabilities of "ai"

1 more reply

arjie9mo ago

I simply did not know you could do that with videos. TIL!

2 more replies

scosman9mo ago

Cerebras has OpenAI compatible "Qwen Code" support. ~4000 tokens/s. Qwen code's 480B param model (MoE) that's quite good. Not quite sonnet good, but speed is amazing.

https://www.cerebras.ai/blog/introducing-cerebras-code

arjie9mo ago

But you're right, they have an OpenAI compatible API https://inference-docs.cerebras.ai/resources/openai so perhaps I can actually use this in the CLI! Thanks for making me take another look.

EDIT: Woah, Charm supports this natively. This is great. I am going to try this now.

1 more reply

jascha_eng9mo ago

Cerebras is super cool. I wish OpenAI and Anthropic would have their models hosted there. But I guess supporting yet another platform is hard.

esperent9mo ago

adastra229mo ago

Claude Code with the Max plan is significantly cheaper for full-time use.

1 more reply

Danjoe49mo ago

djha-skin9mo ago· 8 in thread

1: https://block.github.io/goose/

CuriouslyC9mo ago

The only real reason to use Claude Code is the inference plan. The agent itself isn't anything special.

faxmeyourcode9mo ago

1 more reply

cesarvarela9mo ago

This, but also the usability of the cli, is a step above the others to me. i.e., switching between modes on the fly and having the plan mode easily accessible via shift+tab.

all29mo ago

I tried goose and it seems like there's a lot of nice defaults that Claude Code provides that Goose does not. How did you do your initial configuration?

kristopolous9mo ago

What I've been trying to use it for is to solve a number of long-standing bugs that I've frankly given up on in various Linux tools.

I think I lack the social skills to community drive a fix, probably through some undiagnosed disorder or something so I've been trying to soldier alone on some issues I've had for years.

The issues are things like focus jacking in some window manager I'm using on xorg where the keyboard and the mouse get separate focuses

Goose has been somewhat promising, but still not great.

I mean overall, I don't think any of these coding agents have given me useful insight into my long vexing problems

I think there has to be some type of perception gap or knowledge asymmetry to be really useful - for instance, with foreign languages.

I've studied a few but just in the "taking classes at the local JC" way. These LLMs are absolutely fantastic aids there because I know enough to frame the question but not enough to get the answer.

There's some model for dealing with this I don't have yet.

Essentially I can ask the right question about a variety of things but arguably I'm not doing it right with the software.

I've been writing software for decades, is it really that I'm not competent enough to ask the right question? That's certainly the simplest model but it doesn't check out.

Maybe in some fields I've surpassed a point where llms are useful?

It all circles back to an existential fear of delusional competency.

2 more replies

jatins9mo ago

383toast9mo ago

https://github.com/block/goose/discussions/3133#discussionco...

rirze9mo ago

Never used goose, but looked at it way back when-- Claude Code feels more native IMO. Especially if you're already using Anthropic API/Plans anyways, I'd say give it a try.

sunaookami9mo ago· 6 in thread

FINALLY checkpoints! All around good changes, Claude Code is IMHO the best of the LLM CLI tools.

rao-v9mo ago

sunaookami9mo ago

I find Aider kinda clunky but I would put it in #2.

pmarreck9mo ago

mistahchris9mo ago

3 more replies

ashu14619mo ago

How do checkpoints work ?

conception9mo ago

You can rewind your context back to the checkpoint

2 more replies

stared9mo ago· 6 in thread

Out of all changes I want the most is to not need to type `\ ` to make a line break.

sugarpile9mo ago

Assuming you aren't using windows terminal: launch claude code and run `/terminal-setup` -- that will enable shift+enter

Nizoss9mo ago

If you're on Windows and using vscode, add thiss to keybinds.json

[ { "key": "shift+enter", "command": "workbench.action.terminal.sendSequence", "args": { "text": "\u001b\n" }, "when": "terminalFocus" }, ]

It will allow you to get new lines without any strange output.

jspdown9mo ago

You can type Option+Enter. A more standard Shift+Enter would have been better but until then that's the best we have

atonse9mo ago

Early on in claude, I feel like it installed some terminal thing that allowed me to do Shift+Enter directly in the prompt, but I don't remember if that was CC that did it.

So I've been able to shift enter. I'm using iTerm2 and zsh with CC (if that's relevant)

roesel9mo ago

Have you tried Ctrl+Enter?

pmarreck9mo ago

codex has control-J

others say here that option/alt-enter may work? not sure why shift-enter couldn't though.

jmward019mo ago· 6 in thread

"When you use Claude Code, we collect feedback, which includes usage data (such as code acceptance or rejections), associated conversation data, and user feedback submitted via the /bug command."

NitpickLawyer9mo ago

> So I can opt out of training

I guess the safest way to view this is to consider anything you send them as potentially in the next LLMs, for better or worse.

netcoyote9mo ago

> When you use Claude Code, we collect feedback

When they ask "How is Claude doing this session?", that appears to be a sneaky way for them to harvest the current conversation based on the terms-of-service clause you pointed out.

adastra229mo ago

I have this same suspicion. Worse, there’s no way to opt out of giving a response.

1 more reply

candiddevmike9mo ago

Not your model, not your code (in their mind). Self host your models or enjoy folks trying to get the LLM to regurgitate your private codebase.

gdudeman9mo ago

This enables the /resume command that lets you start mid-conversation again.

Storing the data is not the same as stealing. It's helpful for many use cases.

I suppose they should have a way to delete conversations though.

freeqaz9mo ago

That's not just them saving it locally to like `~/.claude/conversations`? Feels weird if all conversations are uploaded to the cloud + retained forever.

2 more replies

alecco9mo ago· 6 in thread

As a burnt-out, laid-off aging developer, I want to thank Anthropic for helping me get in love with programming again. Claude Code on terminal with all my beloved *nix tools and vim rocks.

taude9mo ago

I spend most of my time making version files with the prompt, but pretty impressed by how far I've gotten on an idea that would have never seen the light of day....

The thoughts of having to write input validation, database persistence, and all the other boring things I've had to write a dozen times in the past....

swalsh9mo ago

lupusreal9mo ago

cevn9mo ago

6 more replies

protocolture9mo ago

I had ChatGPT write from spec an assignment I failed to complete during university, that has always stuck with me as something I would like to finish.

ojr9mo ago

I still spend my evening hours like that and do ai-assisted coding in the background

1 more reply

f311a9mo ago· 6 in thread

It still bothers me that almost every agentic TUI is written in TS + React. It often consumes at least a few GB of RAM. No one bothers about it. Everybody is trying to ship as fast as possible.

nylonstrung9mo ago

I highly recommend Crush which is built with Go

The UX is definitely better because it uses the bubble tea library which is probably the best TUI framework ever

And you can use a ton of different providers and models

jama2119mo ago

1 more reply

ranguna9mo ago

They use react on a cli tool?

1 more reply

Squarex9mo ago

Well codex is written in rust.

1 more reply

h4ch19mo ago

Where did you get React from?

1 more reply

epolanski9mo ago

> No one bothers about it.

Why would they?

jakozaur9mo ago· 6 in thread

Just use `claude update` if you already have it. Unfortunately, they removed Plan mode, when I could use Opus for planning and Sonnect for coding.

Though I will see how this pans out.

g42gregory9mo ago

I use Opus to write the planning docs for 30 min, then use Sonnet to execute them for another 30 min.

sbene9709mo ago

> they removed Plan mode

This isn't true, you just need to use the usual shortcut twice: shift+tab

paulsmith9mo ago

> Unfortunately, they removed Plan mode

If I hit shift-Tab twice I can still get to plan mode

1 more reply

rafaquintanilha9mo ago

They removed the /model option where you can select Opus to plan and Sonnet to execute. But you can still Shift + Tab to cycle between auto-accept and plan mode.

2 more replies

spike0219mo ago

is Plan mode any different from telling Claude "this is what I'd like to do, please describe an implementation plan"?

that's generally my workflow and I have the results saved into a CLAUDE-X-plan.md. then review the plan and incrementally change it if the initial plan isn't right.

2 more replies

lupusreal9mo ago

> Unfortunately, they removed Plan mode

WTF. Terrible decision if true. I don't see that in the changelog though

1 more reply

j1elo9mo ago· 5 in thread

Prompt: https://raw.githubusercontent.com/marckrenn/cc-mvp-prompts/r...

I've always been curious. Are tags like that one: "<system-reminder>" useful at all? Is the LLM training altered to give a special meaning to specific tags when they are found?

Can a user just write those magic tags (if they knew what they are) and alter the behavior of the LLM in a similar manner?

cube22229mo ago

Claude tends to work well with such semi-xml tags in practice (probably trained for it?).

You can just make them up, and ask it to respond with specific tags, too.

Like “Please respond with the name in <name>…</name> tags and the <surname>.”

garfij9mo ago

  - Circuit breakers when it seem like it's stuck in a loop
  - Warnings about running low on context
  - Reminders about task lists (or anything)
  - All sorts of warnings about whatever

haefeledev9mo ago

I think they specifically trained claude on any kind of xml tags (see their docs https://docs.claude.com/en/docs/build-with-claude/prompt-eng...)

rancar29mo ago

itsmevictor9mo ago

> If you do not use this tool when planning, you may forget to do important tasks - and that is _unacceptable_.

Okay, I know I shouldn't anthropomorphize, but I couldn't prevent myself from thinking that this was a bit of a harsh way of saying things :(

numpad09mo ago· 4 in thread

1: https://github.com/anthropics/claude-code/issues/8405

2: https://www.youtube.com/watch?v=mY6cg7w2eQU

3: https://youtu.be/sYAnawy_VoA?feature=shared&t=282

4: https://www.youtube.com/watch?v=VmoeZ_W3WXo

wrasee9mo ago

What’s CJK input? I’m guessing Chinese Japanese Korean?

numpad09mo ago

yes, the gif in the link[1] shows how it works, and a dupe issue[2] describes detailed "fully proper" fix. There's at least four dupes and one PR already, that situation kind of implies severity.

1: https://github.com/anthropics/claude-code/issues/8405#issuec...

2: https://github.com/anthropics/claude-code/issues/8466

chrisshroba9mo ago

Looks like you’re correct!

https://en.m.wikipedia.org/wiki/CJK_characters

johanyc8mo ago

On that note, does it affect korean though? It seems they don't need to select characters from the menu. I tried typing random characters here: https://urcook.com/kr.html

xmpirate9mo ago· 4 in thread

I wish there were an option to cancel a currently running prompt midway. Right now, pressing Ctrl+C twice ends up terminating the entire session instead.

g42gregory9mo ago

Wait, doesn't hitting Escape do this already?

turnsout9mo ago

cesarvarela9mo ago

Adding to the press Esc comments, if you press it twice, you can revert to previous messages in the current conversation.

qafy9mo ago

press escape

navanchauhan9mo ago· 3 in thread

You have to specify `/model sonnet[1m]` to get the 1 million context version

brulard9mo ago

Be careful. exceeding around the original 200k tokens leads to worse and worse results. It's important to have context clean and tailored to the current task.

navanchauhan9mo ago

Yes, but at the same time having the 1 million context enabled is nice because the model is aware that they have more context left and actually perform better. [0]

[0] https://cognition.ai/blog/devin-sonnet-4-5-lessons-and-chall...

AutumnsGarden9mo ago

Thank you!! I've been looking for this for a while now.

aeon_ai9mo ago· 3 in thread

To those lamenting that the Plan with Opus/Code with Sonnet feature is not available, check the charts.

Sonnet 4.5 is beating Opus 4.1 on many benchmarks. Feels like it's a change they made not to 'remove options', but because it's currently universally better to just let Sonnet rip.

jckahn9mo ago

Sure but I want to review the ripping plan so it tears along the correct lines.

NitpickLawyer9mo ago

Shift+Tab still brings up the planning mode.

2 more replies

adastra229mo ago

But not the specific benchmarks which reflect what Plan mode does.

gdudeman9mo ago· 3 in thread

> New native VS Code extension

Looks great, but it's kind of buggy:

- I can't figure out how to toggle thinking

- Have to click in the text box to write, not just anywhere in the Claude panel

- Have to click to reject edits

jakebasile9mo ago

I wish I could put it in the sidebar like every other flavor of AI plugin.

ffsm89mo ago

It seems they also removed the bypass permission setting...

claytonjy9mo ago

plans now open in a separate file tab, and if you don’t accept it, it just…disappears so you can’t discuss it!

cadamsdotcom9mo ago· 3 in thread

Claude Code is so much better than anything else.

If Claude Code was a car it'd be the ideal practical vehicle for all kinds of uses.

If OpenAI Codex was a car, it'd be a cauldron with wheels.

I really wish the developers of Codex spent more time using Claude Code.

Tiberium9mo ago

coconut089mo ago

sbene9709mo ago

I agree, CC is much more polished regarding UX. I can't even scroll up in codex CLI, which is just a disaster IMO.

1 more reply

aitchnyu9mo ago· 3 in thread

Tangential, did anybody get FOMO about Aider and found a much better tool?

IceWreck9mo ago

I was using aider quite a lot from ~ 7 months ago to ~ 3 months ago. I had to stop because they refuse to implement MCPs and Claude/Codex style agentic workflow just yields better results.

1 more reply

flyinglizard9mo ago

Still loyal to aider. It just fits my style better, as a very fine tool. I have my workflow and scripts around it, switch freely between gpt-5/sonnet (a bit of gemini-2.5-pro too) and enjoying life.

That said, I haven't tried Claude Code firsthand, only saw friends using it. I'm not comfortable letting agents loose on my production codebase.

2 more replies

sannysanoff9mo ago

I still use aider, because often I know better what to do.

____tom____9mo ago· 3 in thread

What are they doing about the supply chain attacks on npm?

asadm9mo ago

thats your concern as a dev sending a patch to your repo., your IDE doesn't "address" attacks.

KaiserPro9mo ago

The same as everyone else; ignoring it and hope it goes away

drusepth9mo ago

Curious: what would, should, or could they be doing?

paradite9mo ago· 2 in thread

Claude Code can actually do much more than just coding.

You can use it for writing, data processing, admin work, file management, etc.

I compiled a list of non-coding use cases for Claude Code here:

https://github.com/paradite/claude-code-is-all-you-need

neolefty8mo ago

Yes! Preparing to guest-DM my daughter's D&D group: https://github.com/neolefty/hearts-remembrance-adventure/tre...

giancarlostoro9mo ago

I'm probably going to add reverse engineering old video games to that list. ;)

satvikpendem9mo ago· 2 in thread

> New native VS Code extension

This is pretty funny while Cursor shipped their own CLI.

dist-epoch9mo ago

And GitHub Copilot shipped a CLI too.

hultner9mo ago

A real CLI and not that joke they called a CLI before?

1 more reply

pmarreck9mo ago· 2 in thread

swaits9mo ago

Checkpoints also include context.

I also use jj to checkpoint. When working on a change, each time I get to a stable point I squash and start fresh with an empty change.

You can absolutely continue doing that.

lukaslalinsky9mo ago

1 more reply

jarek839mo ago· 2 in thread

It's first time I started get hit by "ERROR Out of memory" in CC - after about an hour of use. I'm on Mac Pro M4 Max with 128 GB RAM...

winrid9mo ago

It's a node app, it won't use all memory by default, only a couple gigs.

oofbey9mo ago

That's a BIG workstation.

1 more reply

jarek839mo ago· 1 in thread

I notice that thinking triggers like "Think harder" are not highlighted in the prompt anymore. Could that mean that thinking is now only a single toggle with tab (no gradation)?

navanchauhan9mo ago

Ultrathink still works

kip_9mo ago· 1 in thread

tab-completion of filenames in the directory tree is now unavailable. You'll need to use the Codex style @file to bring up an fzf style list

cma9mo ago

I think they had the @file thing before codex existed

ollysb9mo ago· 1 in thread

dcreater9mo ago

Cline and it's forks have that in vs code. I use Cline with claude code as the LLM

1 more reply

nrjames9mo ago· 1 in thread

I would really like for them to add the option to constantly display how much context is left before compression or a new session.

jasonjmcghee9mo ago

I haven't tried this, but looks like it might be possible to display it in the status line.

https://www.reddit.com/r/ClaudeAI/comments/1mlhx2j/comment/n...

1 more reply

singularity20019mo ago· 1 in thread

How to check the version? claude version one told me that it updated to version two but I don't know if it's true

cl --version 1.0.44 (Claude Code)

as expected … liar! ;)

cl update

Wasn't that hard sorry for bothering

samuelknight9mo ago

I opened the CLI for about 10 seconds, the "Auto Update" status flashed. Then I restarted and it was version 2.0

vanillax9mo ago· 1 in thread

Has anyone figured out how to do claude sub agents without using claude? some sort of opensource cli with openrouter or something? I want to use subagents on differnt LLMs ( copilot,selfhost ).

tomquirk9mo ago

Opencode

1 more reply

unshavedyak9mo ago· 1 in thread

I'm concerned that i don't see the "Plan with Opus, impl with Sonnet" feature with Claude 2.0.

grim_io9mo ago

If Sonnet 4.5 is always better than Opus 4.1, then it doesn't make sense to plan with Opus.

I hope this is the case.

1 more reply

postalcoder9mo ago· 1 in thread

I'm disappointed that they haven't done more to make the /resume command more usable. It's still useless for all intents and purposes.

gdudeman9mo ago

Resume is now a drop down menu at the top in the new VS Code plugin and it's much easier to read.

acedTrex9mo ago· 1 in thread

wow its way uglier lol, and why does it default to full screen?

unshavedyak9mo ago

> why does it default to full screen?

Pardon my ignorance, but what does this mean? It's a terminal app that has always expanded to the full terminal, no? I've not noticed any difference in how it renders in the terminal.

What am i misunderstanding in your comment?

1 more reply

mentalgear9mo ago· 1 in thread

Well I guess I'll be sticking with opencode.

jspdown9mo ago

Do you mind telling us a bit more? I never used OpenCode, what makes it better in your opinion?

2 more replies

athrowaway3z9mo ago· 1 in thread

This is a tangent, but why is there a Jupyter notebook cell editor function and tool usage direction build into the standard context?

YuriNiyazov9mo ago

Editing Jupyter notebooks in VSCode side by side with Claude Code extension is a pretty good workflow.

d4rkp4ttern9mo ago

The thing I’m most curious about is: There are new context management and memory features in the API —

https://www.anthropic.com/news/context-management

Anyone know if these are used in Claude-Code?

mohsen19mo ago

Actual Changelog[1]

* New native VS Code extension

* Fresh coat of paint throughout the whole app

* /rewind a conversation to undo code changes

* /usage command to see plan limits

* Tab to toggle thinking (sticky across sessions)

* Ctrl-R to search history

* Unshipped claude config command

* Hooks: Reduced PostToolUse 'tool_use' ids were found without 'tool_result' blocks errors

* SDK: The Claude Code SDK is now the Claude Agent SDK Add subagents dynamically with --agents flag

[1] https://github.com/anthropics/claude-code/blob/main/CHANGELO...

1 more reply

OisinMoran9mo ago

bicepjai8mo ago

trumbitta29mo ago

The native VSCode extension has worse UX than the TUI, so I'm sticking with the TUI

moomoo119mo ago

I feel like there's so many bugs. The / commands for add-dir and others I used often are gone.

I logged in, it still says "Login"

neumann9mo ago

I have been using code + vscode extensively for coding, but in the last few months it has been a frustrating downgrade compared to the same prompts and code being pasted into chatGPT.

Is this going to be the way forward? Switching to whichever is better at a task, code base or context?

anonymid9mo ago

For folks who use neovim, there's always https://github.com/dlants/magenta.nvim , which is just as good as claude code in my (very biased) opinion.

Galanwe9mo ago

Wait, still no support for the new MCP features? How come Claude Code, from the creators of MCP, is still lacking félicitation, server logging, and progress report ?!

wanderer23239mo ago

Anthropic announcement: https://www.anthropic.com/news/enabling-claude-code-to-work-...

cute_boi9mo ago

seems like closed source obfuscated blob distributed on npm to save bandwidth cost.

andoando9mo ago

/rewind is a super nice addition. That was annoying the hell out of me.

adham-omran9mo ago

I find the /usage command most interesting as it's giving you a % towards your limits and when they reset rather than having to note all of that down and guess when you'll hit them.

oofbey9mo ago

Now if only `/rewind` could undo the `rm -rf ~/*` commands and other bone-headed things it tries to do on the filesystem when you're not watching!

mercurialsolo9mo ago

/rewind was a must needed upgrade for the agent.

- Need better memory management and controls (especially across multi-repos) - /upgrade needs better management

sixhobbits9mo ago

I haven't had time to fully play with it yet, but first impression is that it's really pretty!

lerchmo9mo ago

I see thinking can be toggled in the CLI, anyone figured out how to toggle it in the extension?

_betty_9mo ago

VS Code plugin seems to be missing quite a number of the CLI features.

1 more reply

coconut089mo ago

didip9mo ago

wow, the new Claude Code UI looks beautiful. Good job Anthropic designers!

asadm9mo ago

ooh I like my ctrl-R in gemini cli. Good that it lands here too.

OddMerlin9mo ago

How is this any different from just using claude-cli? The example on the npm page is something you could easily do from within claude-cli...? Sorry, I must be missing the point on this??

user39393829mo ago

I’m way ahead of anthropic and all of you on orchestration but not ready to share.

j / k navigate · click thread line to collapse