Claude Code login fails with OAuth timeout on Windows (opens in new tab)

(github.com)

222 pointssh1mmer2mo ago304 comments

304 comments

137 comments · 36 top-level

mvkel2mo ago· 27 in thread

Mounting evidence that claude max users are put into one big compute fuel pool. Demand increased dramatically with OpenAI's DoD PR snafu (even though Anth was already working with the DoD? But I digress...). The pool hit a ceiling. Anth has no compute left to give. Hence people maxing out after 1 query. "Working on it" means finding a way to distill Claude Code that isn't enough of a quality degradation to be noticed[0], in order to get the compute pool operational again. The distillation will continue until uptime improves.

0 as of this writing, it's noticeable. Lots of "should I continue?" And "you should run this command if you want to see that information." Roadblocks that I hadn't seen in a year+

bmurphy19762mo ago

There's another piece of the puzzle. Dario has very clearly stated they are not taking the OpenAI approach of spending $trillion to scale right now and assume the money comes later. They are spending significantly less and working towards profitability sooner.

That means they are going to be far more constrained infrastructurally than some of the competition. I think this is some of the constraints that we are seeing.

mvkel2mo ago

He did say that. And, in virtually the same breath, said they would have to spend $trillions if they hope to remain SOTA, which they have to be [0],[1].

They don't have compute because they didn't play the game and get the good rates a couple of years ago, and are now forced to work with third-rate providers. That's not a strategy.

I would take everything he says with a huge grain of salt.

[0] “We’re buying a lot. We’re buying a hell of a lot. We’re buying an amount that’s comparable to what the biggest players in the game are buying.”

“Profitability is this kind of weird thing in this field. I don’t think in this field profitability is actually a measure of spending down versus investing in the business.”

[1] “You don’t just serve the current models and never train another model, because then you don’t have any demand because you’ll fall behind.”

So he's not spending so they can be profitable, AND spending as much as the biggest players are spending, AND not really looking at profit as a measure of anything? K.

muyuu2mo ago

no grand strategy behind that

they're looking to IPO in 2028 vs 2030 for OpenAI, who have raised more than double the funds

so they're willing to play fast and loose with the terms and conditions of existing customers trying to make it happen

those pockets must be drying up really fast

827a2mo ago

Alternatively, the elephant in the room I'm surprised no one wants to talk about: the vibe coding is catching up with them.

xmprt2mo ago

I don't think anyone is talking about it because it's not a very productive conversation to have. I'm not particularly bullish on vibe coding either but if you could explain what exactly about vibe coding causes these specific issues then it could be more interesting to discuss.

But as it stands, the more likely reason is capacity crunch caused by a chips shortage and demand heavily outpacing supply. You vibe coding reason is based on as much vibes as their code probably is.

1 more reply

muyuu2mo ago

that is a separate issue indeed, but their comms make it rather obvious they are scrambling to reduce compute and they're just slashing their service selectively - with openclaw and max users being the first in the chopping block

eatsyourtacos2mo ago

That's not an elephant in the room.. it's just proof of how insanely useful the tool is and the reality that so much more hardware is needed. Thus people saying "why are these companies building insanely large data centers" ... this is why!

5 more replies

throwaway274482mo ago

It should catch up faster. It's absolutely useless for the bulk of the tedium—notably, soldering together random repos to satisfy executives—that makes up my job now.

1 more reply

sutib2mo ago

If an AI doesnt generate perfect code, if left to its devices it will at some point create a codebase big and nasty enough that it will not be able to deal with it.

vitosartori2mo ago

I was vacationing! What's up with OpenAI now? Asking with some morbid curiosity tbh.

Izkata2mo ago

I believe the first part is referring to these:

https://news.ycombinator.com/item?id=47186677

https://news.ycombinator.com/item?id=47199948

feature202602132mo ago

Nothing, Effective Altruist dweebs realizing that the world isn't their psychology experiment.

dyauspitr2mo ago

As a person that hasn’t used Claude code before, I’ve been using OpenAI’s Codex and it is pretty amazing. I wonder how much more amazing Claude is.

1 more reply

binarymax2mo ago

Codex switched to paid API tokens only. Not to mention their alignment with the department of war.

3 more replies

Someone12342mo ago

Codex just changed the way they calculate usage with a massive negative impact.

Before a Subscription was the cheapest way to gain Codex usage, but now they've essentially having API and Subscription pricing match (e.g. $200 sub = $200 in API Codex usage).

The only value of a subscription now is that you get the web version of ChatGPT "free." In terms of raw Codex usage, you could just as easily buy API usage.

edit: This is currently rolled out for Enterprise, but is coming to Pro/Plus soon. The people below saying "I haven't had this issue" haven't yet*.

2 more replies

jimkleiber2mo ago

I looked at my cc usage and I was at 90% of my weekly allowance after 3 days of use...BUT, if I looked at the usage stats with the chart, it showed, on a scale of 1-4 intensity of usage (4 being most intense), the three days as such:

Day 1: 2

Day 2: 3

Day 3: 1

Not sure how I can hit such limits so quickly with such low scores on its own chart.

skywhopper2mo ago

The limits are smaller now, is how.

2 more replies

brenoRibeiro7062mo ago

I agree; I think that's what happened. But it's a shame—I'm having a lot of trouble with poor-quality results from Claude-Code, and the session limit is being used up quickly.

garganzol2mo ago

Makes sense, even plan name seems to agree: "Claude Max".

bradgessler2mo ago

Reminds me of an “all you can eat” buffet I was at once where the owner told me, “that’s it, that’s all you can eat” and cut it off.

2 more replies

guelo2mo ago

The responsible thing would be to not sell way more subscriptions than their capacity. But they have to show the exponential revenue curves to their investors. I cancelled my subscription yesterday.

politelemon2mo ago

What is openai's involvement here, as I am out of the loop.

ezfe2mo ago

Claude: Autonomous weapons and domestic surveillance are our red line

Pentagon: No

OpenAI: We are okay if the line is merely a suggestion and we encourage you not to cross it!

Pentagon: Yes we pick that option

masklinn2mo ago

I assume it's anthropic rejecting the US Government's use of their software for domestic mass surveillance or fully autonomous weapons, and openai happily agreeing to it.

That has led to a significant number of people switching over from openai, or at least stating they were going to do so.

Analemma_2mo ago

They made a $25 million donation to Trump, which was repaid in kind by designating Anthropic a supply chain risk. Unfortunately, they weren’t nearly subtle enough about this, and went “sure, we’ll take over the contract with no limits on killbots or domestic surveillance, no problem!” on the same day as Anthropic got in trouble, and people put two and two together.

BlueRock-Jake2mo ago

On the nose. Dealt with this last week. Ran maybe 5 queries (not even in code) and was maxed out for the day. What a great way to spend my money

ramon1562mo ago

Distillation is how they're planning to make money? What a poor strategy. This is next level FOMO (Fear Of Not Being The #1 LLM Provider).

I have cancelled my subscription last week, I'll see them when they fix this nonesense

xantronix2mo ago· 25 in thread

As much as people on Hacker News complain about subscription models for productivity and creativity suites, the open arms embrace of subscription development tools (services, really) which seek to offload the very act itself makes me wonder how and why so many people are eager to dive right in. I get it. LLMs are cool technology.

Is this a symptom of the same phenomenon behind the deluge of disposable JavaScript frameworks of just ten years ago? Is it peer pressure, fear of missing out? At its root, I suspect so; of course I would imagine it's rare for the C-suite to have ever mandated the usage of a specific language or framework, and LLMs represent an unprecedented lever of power to have an even bigger shot at first mover's advantage, from a business perspective. (Yes, I am aware of how "good enough" local models have become for many.)

I don't really have anything useful nor actionable to say here regarding this dialling back of capability to deal with capacity issues. Are there any indications of shops or individual contributors with contingency plans on the table for dialling back LLM usage in kind to mitigate these unknowns? I know the calculus is such that potential (and frequently realised) gains heavily outweigh the risks of going all in, but, in the grander scheme of time and circumstance, long term commitments are starting to be more apparently risky. I am purposefully trying to avoid "begging the question" here; if instead of LLMs, this were some other tool or service, reactions to these events would have been far more pragmatic, with less of a reticence to invest time on in-house solutions when dealing with flaky vendors.

rurp2mo ago

HN is a big community that has always had a mix of people who value newness as a feature vs those who prioritize simplicity and reliability. Unless you're recognizing the exact same names taking these contradictory opinions it's probably different groups of people for the most part.

It seems like every LLM thread for the past couple years is full of posts saying that the latest hot AI tool/approach has made them unbelievably more productive, followed by others saying they found that same thing underwhelming.

echelon2mo ago

> I get it. LLMs are cool technology.

I don't think many of you have legitimately tried Claude Code, or maybe you're holding it wrong.

I'm getting 10x the work done. I'm operating at all layers of the stack with a speed and rapidity I've never had before.

And before anyone accuses me of being some "vibe coder", I've built five nines active-active money rails that move billions of dollars a day at 50kqps+, amongst lots of other hard hitting platform engineering work. Serious senior engineering for over a decade.

This isn't just a "cool technology". We've exited the punch card phase. And that is hard or impossible to come back from.

If you're not seeing these same successes, I legitimately think you're using it wrong.

I honestly don't like subscription services, hyperscaler concentration of power, or the fact I can't run Opus locally. But it doesn't matter - the tool exists in the shape it does, and I have to consume it in the way that it's presented. I hope for a different offering that is more democratic and open, but right now the market hasn't provided that.

It's as if you got access to fiber or broadband and were asked to go back to ISDN/dial up.

16 more replies

jlokier2mo ago

> the open arms embrace of subscription development tools (services, really) which seek to offload the very act itself makes me wonder how and why so many people are eager to dive right in

Here's a reason not in your list.

Short version: A kind of peer pressure, but from above. In some circles I'm told a developer must have AI skills on their resume now, and those probably need to be with well known subscription services, or they substantially reduce their employment prospects.

Multiple people I know who are employers have recently, without prompting, told me they no longer hire developers who don't use AI in their workflow.

One of them told me all the employers they know think "seniors" fall into two camps, those who are embracing AI and therefore nimble and adaptive, and those who are avoiding it and therefore too backward-looking, stuck-in-their-ways to be a good hire for the future. So if they don't see signs of AI usage on a senior dev's resume now, that's an automatic discard. For devs I know laid off from an R&D company where AI was not permitted for development (for IP/confidentiality reasons), that's unfair as they were certainly not backward-looking people, but the market is not fair.

Another "business leader" employer I met recently told me his devs are divided into those who are embracing AI and those who aren't, said he finds software feature development "so slow!", and said if it wasn't for employment law he'd fire all his devs who aren't choosing to use AI. I assume he was joking, but it was interesting to hear it said out loud without prompting.

I've been to several business leadership type meetups in recent months, and it seems to be simply assumed that everyone is using AI for almost everything worth talking about. I don't think they really are, so it's interesting to watch that narrative playing out.

woctordho2mo ago

Apart from local AI, a serious choice is aggregated API such as new-api [0]. An API provider aggregated thousands of accounts has much better stability than a single account. It's also cheaper than the official API because of how the subscription model works, see e.g. the analysis [1].

[0] https://github.com/QuantumNous/new-api

[1] https://she-llac.com/claude-limits

gruez2mo ago

>An API provider aggregated thousands of accounts has much better stability than a single account

Isn't this almost certainly against ToS, at least if you're using "plans" (as opposed to paying per-token)?

3 more replies

michael_j_x2mo ago

I really enjoy coding. I've build a number of projects, personal and professional, with Python, Rust, Java and even some Scala in the mix. However, I've been addicted to Claude Code recently, especially with the superpowers skill. It feels like I can manifest code with my mind. When developing with Claude, I am presented with design dilemmas, architectural alternatives, clarification questions, things that really make me think about the problem. I then choose a solution, propose alternatives, discuss, and the code manifests. I came to realize that I enjoy the problem solving, not the actual act of writing the code. Like I have almost cloned my self, and my clones are working on the projects and coming back to me for instructions. It feels amazing

throw48472852mo ago

"Addicted" "Superpowers" "manifest with my mind" "it feels amazing"

Why does it sound like you're on drugs? I know that sounds extremely rude, but I can't think of any other reasonable comparison for that language.

It's hard to take these kinds of endorsements seriously when they're written so hyperbolically, in terms of the same cliches, and focused on entirely on how it makes you feel rather than what it does.

4 more replies

skydhash2mo ago

That’s like saying enjoying composing music, but not enjoying playing music. Or creating stories, but don’t like writing. Yes they’re different activities, but linked together. The former is creativity, the latter is a medium of transmission.

Code is notation, just like music sheets, or food recipes. If your interaction with anyone else is with the end result only (the software), the. The code does not matter. But for collaboration, it does. When it’s badly written, that just increase everyone burden.

It’s like forcing everyone to learn a symphony with the record instead of the sheets. And often a badly recorded version.

2 more replies

withinboredom2mo ago

I feel this sentiment. It’s more like pair programming with someone both smarter and dumber than you. If you’re reviewing the code it is putting down, you’re likely to spot what it’s getting wrong and discussing it.

What I don’t understand, are the people who let it go over night or with whole “agent teams” working on software. I have no idea how they trust any of it.

snarfy2mo ago

Yep, I want to make stuff. Writing the code by hand was just a means to an end.

supriyo-biswas2mo ago

> if instead of LLMs, this were some other tool or service, reactions to these events would have been far more pragmatic, with less of a reticence to invest time on in-house solutions when dealing with flaky vendors

As an example, a long term goal at the employer I work for is exactly this: run LLMs locally. There's a big infrastructure backlog through, so it's waiting on those things, and hopefully we'll see good local models by then that can do what Claude Sonnet or GPT-5.3-Codex can do today.

raxxorraxor2mo ago

I think some workflows are just that much faster with AI. And if not I can spare the time for a prompt to get work done in parallel to the stuff I work on.

There is a cost though, the context switches of topics aren't free. But if I need to visualise a something, I let an LLM create a page. If I have two tables of data that needs to be joined/mapped, I let an LLM do the first shot, often that is enough.

I cannot even hope to reach that speed. It isn't a magic tool, but it really accelerates some task.

That speed allows for in-house solutions to become viable again, software that really adapts specific business processes instead of some wonky ERP package that never really fit what you were trying to do.

I have our dbs schema checked into a Gitea repository, which our AIs can just access to quickly ingest schema definitions. If data safety is an issue, use a local model. It is extremely beneficial if you quickly can establish context and let your AI deal with real problems. And it is quite good at that.

benterix2mo ago

Well, I wanted to understand what is this cool new tech everybody is talking about so I bought a Max plan, experimented with various setups recommended by various experts, vibe-coded a few apps and then threw it all away.

I still use more traditional approach for finding bugs and other issues in my code, but the agentic workflow doesn't give me any net value.

stefan_2mo ago

I would love nothing more than ditching Claude for a local solution tomorrow. But it doesn't exist today, so it is what it is - you gotta keep up with the joneses.

Maybe in 5 years we'll have an open weights model that is in the "good enough" category that I can run on a RTX 9000 for 15k dollars or whatever.

xantronix2mo ago

I don't want to get too much into the details but I don't work in or for the Valley and I don't think I'll ever be able to afford that sort of expenditure on computing. A down payment on a car, or a vital medical procedure? Sure. I'm probably not alone here.

torben-friis2mo ago

People will always go along with a removal of friction even against their benefit. It's natural bias, we have a preference for not spending energy.

It's why we pay stupid amounts for takeout when it's a button away, it's why we accept the issues that come with online dating rather than breaking the ice outside, it's why there's been decades scams that claim to get you abs without effort...

LLMs are the ultimate friction removal. They can remove gaps or mechanical work that regular programming can, but more importantly they can think for you.

I'm convinced this human pattern is as dangerous as addiction. But it's so much harder to fight against, because who's going to be in favor of doing things with more effort rather than less? The whole point of capitalism is supposed to be that it rewards efficiency.

xantronix2mo ago

> stupid amounts for takeout

Aw hell. You found my vice and my own cognitive dissonance here. If I want to truly stand by my convictions, I should probably cook more and log off. Waiting for signs that the tides are turning and that people are beginning to value a slower, more methodical approach again isn't doing anything in the current moment to stave off the genuine feelings of dread that have honestly led to some suicidal ideation.

(this is serious and not sarcasm, by the way)

2 more replies

dyauspitr2mo ago

I think most people understand the need for subscriptions here. It is an ongoing massive compute cost, and that’s what you’re paying for. Your local system is not capable of running the massive amount of compute required for this. If it were then we would see more people up in arms about it.

stephbook2mo ago

We could run it locally, but the problems that matter simply don't change.

We're paying for servers that sit idle at night, you don't find enough sysadmins for the current problems, the open source models aren't as strong as closed source, providing context (as in googling) means you hook everything up to the internet anyway, where do you find the power and the cooling systems and the space, what do you do with the GPUs after 3 years?

Suddenly that $500/month/user seems like a steal.

xantronix2mo ago

Is this pace necessary? I feel like this is causing people to consider code to be disposable, and I think that both are a massive mistake.

cookiengineer2mo ago

The great part is that you can always build your own selfhosted tools. There is nothing that can't be done at home, it's just a calculation of how much you're willing to spend.

Lately though the RAM crisis is continuing and making things like this more unfeasible. But you can still use a lot of smaller models for coding and testing tasks.

Planning tasks I'd use a cloud hosted one, for now, because gemma4 isn't there yet and because the GPU prices are still quite insane.

The cool and fun part is that with ollama and vllm you can just build your own agentic environment IDE, give it the tools you like, and make the workflow however you like. And it isn't even that hard to do, it just needs a lot of tweaking and prompt fiddling.

And on top of that: Use kiwix to selfhost Wikipedia, stackoverflow and devdocs. Give the LLM a tool to use the search and read the pages, and your productivity is skyrocketing pretty quickly. No need anymore to have internet, and a cheap Intel NUC is good enough for self-hosting a lot of containers already.

Source: I am building my own offline agentic environment for Golang [1] which is pretty experimental but sometimes it's also working.

[1] https://github.com/cookiengineer/exocomp

xantronix2mo ago

I'm definitely all in on self-hosting, though I rent my compute and pay for bandwidth with Linode and storage with rsync.net.

The LLM bit though, personally, is just not for me.

DeathArrow2mo ago

>I get it. LLMs are cool technology.

It would be cool to run SOTA models on my own hardware but I can't. Hence, the subscription.

jimmaswell2mo ago

Contingency plan? Just code without it like before. AI could disappear today and I would be very disappointed but it's not like I forgot how to code without it. If anything, I think it's made me a better programmer by taking friction away from the execution phase and giving me more mental space to think in the abstract at times, and that benefit has certainly carried over to my work where we still don't have copilot approved yet.

1 more reply

therealpygon2mo ago

Think of the stupidest product you can think of and you likely only know about it because people buy/bought them en masse. AI is no different from any other product; plenty will pay/adopt for exactly the reasons you said. There is powerful motivations for people to feel “ahead” of others (or more informed, or more “cool”, or more knowledgeable, or more experienced, or whatever their ego requires), even if their situation is exactly the same.

That said, I’m not sure I follow your statement of less resistance to the development of internal tools when the opposite seems to be the case; companies (or more specifically developers) are perhaps too quick to think they can just vibe-code a replacement for any vendor in a weekend these days.

kristjansson2mo ago· 12 in thread

No one is going to like this answer, but there’s a simple solution: pay for API tokens and adjust your use of CC so that the actions you have it take are worth the cost of the tokens.

It’s great to buy dollars for a penny, but the guy selling em is going to want to charge a dollar eventually…

Goronmon2mo ago

...pay for API tokens and adjust your use of CC so that the actions you have it take are worth the cost of the tokens

Do you feel there is enough visibility and stability around the "Prompt -> API token usage" connection to make a reliable estimate as to what using the API may end up costing?

Personally, it feels like paying for Netflix based on "data usage" without having anyway for me to know ahead of time how much data any given episode or movie will end up using, because Netflix is constantly changing the quality/compression/etc on the fly.

kristjansson2mo ago

Time is a relatively good proxy for spend. There are also more ex post diagnostics like count and cost it can write to the status line.

I agree that ex ante it’s tough, and they could benefit from some mode of estimation.

Perhaps we can give tasks sizes, like T shirts? Or a group of claudes can spend the first 1M tokens assigning point values to the prospective tasks?

1 more reply

0xffff22mo ago

I'm forced to do this at work. It adjusts the net value to very close to zero. Github's pay per prompt pricing model is phenomenal for users to the point of blowing Anthropic's subscription offering out of the water, much less API pricing. At Copilot pricing, it's quite a useful tool if carefully managed. At API pricing, it's very hard to find a use case for AI.

Of course, I have no idea how MS is justifying the Copilot pricing. I can't imagine any world in which it is sustainable, so I'm trying to get as much as I can out of it now before they jack up prices.

Nifty39292mo ago

This is it. These subscriptions have been heavily subsidized, which was fine when usage was much lower overall. But with so many folks trying to use the tools and soaking up all the chips something has to give.

Now we’re going to find out what these tools are really worth.

gonzalohm2mo ago

it's not a subsidy. It's predatory pricing and it should be illegal. I offer you a service at a loss to remove competition and then increase prices once you are stuck with it.

ronsor2mo ago

Actually, that is illegal.

2 more replies

bschwarz2mo ago

That's the VC playbook.

varispeed2mo ago

The problem with tokens is that they have wrong incentive. The quicker model arrives at the solution the less tokens you have to buy.

So I noticed the model is purposefully coming with dumb ideas or running around in circles and only when you tell it that they are trying to defraud you, they suddenly come back with a right solution.

jimkleiber2mo ago

I just want a little predictable insight into how much I get. For example, at a buffet, I know I can only eat so much food and can plan around it. This is like going to a buffet and not knowing how many plates I can take or how big the plates are, and it changes each week, and yet I have to keep paying the same price. Except it's not about eating, it's about my work and deadlines and promises and all that.

_flux2mo ago

That's what these providers want as well, but from the other side. They want to know that a customer won't be able to eat more than certain number of servings, as they need to pay for each of those servings.

It works out even if some customers are able to eat a lot, because people on average have a certain limit. The limits of computers are much higher.

1 more reply

criddell2mo ago

When you hire a person, you don't know what you are going to get out of them today.

If an hour of an excellent developer's time is worth $X, isn't that the upper bound of what the AI companies can charge? If hiring a person is better value than paying for an AI, then do that.

1 more reply

kristjansson2mo ago

If you need the tokens for real work, that’s what the API and the other providers like Bedrock are for. The subscription product is merely to whet your appetite.

2 more replies

ajb922mo ago· 11 in thread

The trend on the status page[1] does not inspire confidence. Beginning to wonder if this might be a daily thing.

[1] https://status.claude.com/

aurareturn2mo ago

They went from $9b ARR at the end of 2025 to $30b ARR today. That's more than 3x the size in 3 months. I expect growing pains.

For some context, they added 2x Palantir or .75x Shopify or .68x Adobe annual revenue in March alone.

twelvechairs2mo ago

Yeah its huge demand upswing from the growth of openclaw and similar pushing resources. Very clear from recent changes and announcement around this [0]

Fwiw there are worse delays from second tier providers like moonshot's kimik2.5 that are also popular for agentic use.

[0] https://news.ycombinator.com/item?id=47633396

rpozarickij2mo ago

It's also worth keeping in mind that Anthropic's compute needs are nothing like those of a company like Shopify or Adobe, so revenue might not paint accurately enough the picture of what they're dealing with right now.

samlinnfer2mo ago

And they are early adopters of the vibe coding paradigm, having a 100% Claude generated codebase.

1 more reply

nonameiguess2mo ago

To be clear, this number will probably end up being reasonably accurate, but it is a pet peeve nonetheless in the startup world how shitty these financial metrics have become. We're three months from the end of 2025. You'd think we'd want to see at least two years of $30 billion dollar revenue earned in each year before we say with any meaningful level of statistical validity that they have $30 billion in "annual recurring" revenue.

sh1mmerOP2mo ago

They might need to do some vibe refactoring.

ryandrake2mo ago

2026 may be the year that many companies relearn: there is no problem that can’t be made worse by adding even more code.

giwook2mo ago

And then some vibe code reviewing.

fb032mo ago

Outages are already happening, besides that, we need vibe warrooming

cube002mo ago

They've also stopped reporting on the causes too, just "it's resolved" and they move on.

skippyboxedhero2mo ago

It has been a daily thing for 2-3 months.

CapmCrackaWaka2mo ago· 5 in thread

If anthropic‘s reliability becomes a meme, they risk brand death like Microsoft. Go to hand it to them though, they’re really living that “AI writes all of our code and it should write your code too” life.

smt882mo ago

If Microsoft is your example of "brand death," Anthropic is dreaming of that kind of wild success and shouldn't care about its brand at all

fleischhauf2mo ago

I'm quite impressed on how far they got while the claude code code looks like it does.

muyuu2mo ago

VC money magic

love2read2mo ago

> they risk brand death like Microsoft

Is Microsoft (one of the largest companies in the world) really a victim of brand death?

mplewis2mo ago

have you ever met a person who likes outlook?

2 more replies

HoldOnAMinute2mo ago· 4 in thread

I solved this by upgrading Claude Code, closing down all instances, closing my browser, starting claude again, and doing a /login

stronglikedan2mo ago

I solved this by upgrading Claude Code, closing down all instances, closing my browser, and starting Codex

csomar2mo ago

Yes, an upgraded Claude Code instance telepathically improve Claude back-end servers.

giwook2mo ago

LOL telepathy!

It's actually via quantum entanglement.

reluctant_dev2mo ago

This resolved it for me as well but not sure if this was just a timing thing.

baq2mo ago· 4 in thread

Not sure how Claude and CC has become the defacto best model given gpt 5.3 codex and 5.4 exist. This space moves so fast you should be testing your workflows on different models at least once every quarter, prudently once a month.

Quothling2mo ago

We've got access to opus 4-6, gpt 5.4, gemini pro and a few others through corprate. I have customized agents on claude, gpt and gemini since we tend to run out of tokens for x model by the end of a month. Out of all of them I've consistently been using sonnet for most tasks. Opus functions mainly as hand-off agents and reviewer". In my anecdotal experience Claude is miles ahead of the other models and has been for a long while... when it comes to writing code the way we want it. Which eksplicit, no-abstraction, no-external packages, fail fast defensive programming. I imagine you'd get different milage with different models and different coding styles.

The rest of the organisation, which is not software development or IT related, mainly uses GPT models. I just wish I hadn't taught risk management about claude code so they weren't wasting MY tokens.

fakwandi_priv2mo ago

I've been an avid fan of codex for the last few month's but finally hit the weekly limit so I've wanted to try out claude code before biting the bullet and going for the 200 dollar codex sub.

Obviously in hindsight it would be unfair to Anthropic to judge them on an unstable day so I'l leave those complaints aside but I hit the session limit way too fast. I planned out 3 tasks and it couldn't finish the first plan completely, for that implementation task it has seen a grand total of 1 build log and hasn't even run any tests which already caused it to enter in the red territory of the context circle.

It was even asking me during planning which endpoints the new feature should use to hook into the existing system, codex would never ask this and just simply look these up during planning and whenever it encounters ambiguity it would either ask straight away or put it as an open question. I have to wonder if they're limiting this behavior due trying to keep the context as small as possible and preventing even earlier session limits.

Maybe codex's limits are not sustainable in the long run and I'm very spoiled by the limits but at this point CC(sonnet) and Codex(5.4) are simply not in the same league when comparing both 20 dollar subscriptions.

I will also clearly state that the value both these tools provide at these price points are absolutely worth it, it's just that codex's value/money ratio is much better.

m-schuetz2mo ago

Checking different models once every quarter is exactly what made me move to claude code.

skippyboxedhero2mo ago

Anthropic models haven't been far ahead for a while. Quite a few months at least. Chinese models are roughly equal at 1/6th the cost. Minimax is roughly equal to Opus. Chinese providers also haven't had the issues with uptime and variable model quality. The gap with OpenAI also isn't huge and GLM is a noticeably more compliant model (unsurprisingly given the hubristic internal culture at Anthropic around safety).

CC is a better implementation and seems to be fairly economic with token usage. That is the really the only defining point and, I suspect, Anthropic are going to have a lot of trouble staying relevant with all the product issues.

They were far ahead for a brief period in November/December which is driving the hype cycle that now appears to be collapsing the company.

You have to test at least every month, things are moving quickly. Stepfun is releasing soon and seems to have an Opus-level model with more efficient architecture.

3 more replies

honeycrispy2mo ago· 4 in thread

The solution is clearly more vibe coding at anthropic.

I doubt even the core engineers know how to begin debugging that spaghetti code.

Lionga2mo ago

correct proompt is:"you are a senior engineer. fix issues. NO hallucinations this time. PRETTY PLEASE"

mring336212mo ago

You forgot the "No Mistakes!" clause

cube002mo ago

Needs more bold CRITICAL and some ultra-think

gedy2mo ago

You missed: "Simon says:"

tomasphan2mo ago· 3 in thread

98% uptime is not great. Our eng department is thinking about going half half with Codex but of course there’s a switching cost.

tornikeo2mo ago

I'm VERY curious about your case. What kind of switching costs do you guys have? I'm working at a very young startup that is still not locked into either AI provider harnesses -- what causes switching costs, just the subscription leftovers or something else?

p_stuart822mo ago

subscription leftovers are noise. the real switching cost is the harness glue.

prompts. tool calling quirks. evals. auth. retries. all the weird failure modes your team already paid to learn.

prabal972mo ago

FYI I use my Codex models with Claude code and they work pretty great. It can even pick up on existing conversations w/ Opus and then resume w/ OAI models.

butz2mo ago· 1 in thread

Run LLMs locally. Otherwise suffer service disruptions and very likely price hikes in the future.

jimkleiber2mo ago

The analogy that recently came to me is like internet itself. Your advice seems to say just do everything locally on my computer without access to the internet, because internet might suffer service disruptions or price hikes in the future.

Luckily, ISPs tend to be quite reliable and don't have outrageous price hikes, but maybe that's because of regulation or focused competition, I'm not sure.

ivanjermakov2mo ago· 1 in thread

Wonder what the next AI winter trigger would be. Coding agent client collapsing under its own tech debt?

bachmeier2mo ago

I think it's been clear from the beginning that the per-token price of usage was far below what it will be when firms have implemented their profit-maximizing price plans. "AI winter" will happen when these firms start maximizing profit. At that point it'll be too expensive for all but certain use cases to use the best technology for work.

We'll see AI chat replace Google, we'll see companies adopting AI in high-value areas, and we'll see local models like Gemma 4 get used heavily.

AI winter will see a disappearance of the clickbait headlines about everyone losing their jobs. Literally nobody is making those statements taking into account that pricing to this point is way less than the profit maximizing level.

websap2mo ago· 1 in thread

Isn't it a little weird that we trust this app to help us build some of the most important parts of our business and the company that vends this app keep breaking it in unique ways.

At my workplace we have been sticking with older versions, and now stick to the stable release channel.

scottyah2mo ago

I like dogfooding. You can use Azure if you want infra that is clearly not being used, tested, and pushed to the limits by its own creators.

laacz2mo ago· 1 in thread

I'm more surprised that OpenAI is extremely subsidising their ChatGPT subscriptions. With Plus you can do a lot more than with Calude's x5 Max. Is it an expense they just can afford, while people have not migrated over from CC?

nickvec2mo ago

Just trying to win over market share I assume. OpenAI is willing to subsidize to try to get people to switch from CC to Codex, and the best incentive is to offer more tokens at a lower price.

whicks2mo ago· 1 in thread

IME this isn't just a 'Claude Code' problem, I'm seeing extremely degraded / unresponsive performance using Opus 4.6 in Cursor.

smt882mo ago

The status page indicates issues on almost all services

postalcoder2mo ago· 1 in thread

I stopped using Claude Code several months ago and I can't say I've missed it.

There was constant drama with CC. Degradation, low reliability, harness conspiring against you, and etc – these things are not new. Its burgeoning popularity has only made it worse. Anthropic is always doing something to shoot themselves in the foot.

The harness does cool things, don't get me wrong. But it comes with a ton of papercuts that don't belong in a professional product.

djmips2mo ago

Back to artisan all natural intelligence coding?

nathell2mo ago

HN’s guidelines say ‘Don’t editorialize’. The original title here is ‘[BUG] Claude Code login fails with OAuth timeout on Windows’, which is more specific and less clickbaity.

giancarlostoro2mo ago

Looks to be sourced from an outage:

https://status.claude.com/

SkyPuncher2mo ago

My biggest frustration right now is the seeming complete loss of background agent functionality. Permissions seem completely botched for background agents right now. When that happens, the foreground agent just takes over the task despite:

1. Me not wanting that for context management reasons

2. It burning tokens on an expensive model.

Literally a conversation that I just had:

* ME: "Have sonnet background agent do X"

* Opus: "Agent failed, I'll do it myself"

* Me: "No, have a background agent do it"

* Opus: Proceeds to do it in the foreground

* Flips keyboard

This has completely broken my workflows. I'm stuck waiting for Opus to monitor a basic task and destroy my context.

DiffTheEnder2mo ago

I'm finding queries are taking about 3x as long as they used to regardless of whether I use Sonnet or Opus (Claude Code on Max)

guzfip2mo ago

Anyone played much with Jetbrain’s LLM agent?

I’ve been toying around at home with it and I’ve been fine with its output mostly (in a Java project ofc), but I’ve run into a few consistent problems

- The thing always trips up validating its work. It consistently tries to use powershell in a WSL environment I don’t have it installed in. It also seems to struggle with relative/absolute paths when running commands.

- Pricing makes no sense to me, but Jetbrains offering seems to have its own layer of abstraction in “credits” that just seem so opaque.

Then again, I mostly use this stuff for implementing tedious utilities/features. I’m not doing entity agent written and still do a lot of hand tweaks to code, because it’s still faster to just do it myself sometimes. Mostly all from all from the IDE still.

dude2507112mo ago

How is coding "solved" then?

Unless they meant "all code that needs to be written has already been written" so their mission is to prevent any new code from being written via a kind of a bait and switch?

varispeed2mo ago

I found that telling Claude that it is trying to defraud you and making spend money often gets it back on track and return to pervious performance briefly until it agains starts doing nonsense.

I think Anthropics model has conflict of interest. They seem to have nerfed the models so that it takes more iterations to get the result (and spend more money) than it used to where e.g. Opus would get something right first time.

mikkupikku2mo ago

I really don't understand the way Claude does rate limiting, particularly the 5 hour limit. I can get on at 11:30, blow through my limit doing some stupid shit like processing a pile of books into my llm-wiki, and then get notified that I've used 90% of my 5 hour session limit and I have to wait for noon (aka wait 10 minutes) for the five hour limit to reset. Baffling.

JohnMakin2mo ago

The commenters here don't seem to realize this was posted during the outage yesterday that affected login for most claude code users.

alasano2mo ago

If you prepare yourself a token with "claude setup-token" (presuming you're not already locked out and had one) you can run "CLAUDE_CODE_OAUTH_TOKEN=sk.. claude" to use your account.

nprateem2mo ago

Antigravity has become near unusable too for the last week with Opus. Continual capacity alerts meaning tasks stop running.

Not worth the money now, will be canceling unless fixed soon.

fabbbbb2mo ago

Is this really relevant news? Please share more bug reports from popular services and tools. Feels a tiny bit biased. My CC is just fine since at least three weeks.

jostmey2mo ago

15000 milliseconds! Makes me laugh. I've had the same issue! Usually happens in the morning

arduanika2mo ago

The eternal return of https://xkcd.com/303/

world2vec2mo ago

I'm getting "Prompt is too long" a lot today

m3kw92mo ago

How are they making billions with reliability like that?

jollymonATX2mo ago

Simply put, Anthropic does not have enough compute.

mring336212mo ago

For a lot of my work, I'm pretty happy with OpenCode + GLM-4.7-Flash-REAP-23B-A3B-Q4_K_M.gguf running in llama.cpp.

Free and local.

LoganDark2mo ago

This was an outage.

rvz2mo ago

Claude is now making itself unavailable after it was on vacation yesterday.

Maybe you should consider....local models instead?

nurettin2mo ago

It started again.

j / k navigate · click thread line to collapse

304 comments

137 comments · 36 top-level

mvkel2mo ago· 27 in thread

0 as of this writing, it's noticeable. Lots of "should I continue?" And "you should run this command if you want to see that information." Roadblocks that I hadn't seen in a year+

bmurphy19762mo ago

That means they are going to be far more constrained infrastructurally than some of the competition. I think this is some of the constraints that we are seeing.

mvkel2mo ago

He did say that. And, in virtually the same breath, said they would have to spend $trillions if they hope to remain SOTA, which they have to be [0],[1].

They don't have compute because they didn't play the game and get the good rates a couple of years ago, and are now forced to work with third-rate providers. That's not a strategy.

I would take everything he says with a huge grain of salt.

[0] “We’re buying a lot. We’re buying a hell of a lot. We’re buying an amount that’s comparable to what the biggest players in the game are buying.”

“Profitability is this kind of weird thing in this field. I don’t think in this field profitability is actually a measure of spending down versus investing in the business.”

[1] “You don’t just serve the current models and never train another model, because then you don’t have any demand because you’ll fall behind.”

So he's not spending so they can be profitable, AND spending as much as the biggest players are spending, AND not really looking at profit as a measure of anything? K.

muyuu2mo ago

no grand strategy behind that

they're looking to IPO in 2028 vs 2030 for OpenAI, who have raised more than double the funds

so they're willing to play fast and loose with the terms and conditions of existing customers trying to make it happen

those pockets must be drying up really fast

827a2mo ago

Alternatively, the elephant in the room I'm surprised no one wants to talk about: the vibe coding is catching up with them.

xmprt2mo ago

But as it stands, the more likely reason is capacity crunch caused by a chips shortage and demand heavily outpacing supply. You vibe coding reason is based on as much vibes as their code probably is.

1 more reply

muyuu2mo ago

eatsyourtacos2mo ago

5 more replies

throwaway274482mo ago

It should catch up faster. It's absolutely useless for the bulk of the tedium—notably, soldering together random repos to satisfy executives—that makes up my job now.

1 more reply

sutib2mo ago

If an AI doesnt generate perfect code, if left to its devices it will at some point create a codebase big and nasty enough that it will not be able to deal with it.

vitosartori2mo ago

I was vacationing! What's up with OpenAI now? Asking with some morbid curiosity tbh.

Izkata2mo ago

I believe the first part is referring to these:

https://news.ycombinator.com/item?id=47186677

https://news.ycombinator.com/item?id=47199948

feature202602132mo ago

Nothing, Effective Altruist dweebs realizing that the world isn't their psychology experiment.

dyauspitr2mo ago

As a person that hasn’t used Claude code before, I’ve been using OpenAI’s Codex and it is pretty amazing. I wonder how much more amazing Claude is.

1 more reply

binarymax2mo ago

Codex switched to paid API tokens only. Not to mention their alignment with the department of war.

3 more replies

Someone12342mo ago

Codex just changed the way they calculate usage with a massive negative impact.

Before a Subscription was the cheapest way to gain Codex usage, but now they've essentially having API and Subscription pricing match (e.g. $200 sub = $200 in API Codex usage).

The only value of a subscription now is that you get the web version of ChatGPT "free." In terms of raw Codex usage, you could just as easily buy API usage.

edit: This is currently rolled out for Enterprise, but is coming to Pro/Plus soon. The people below saying "I haven't had this issue" haven't yet*.

2 more replies

jimkleiber2mo ago

Day 1: 2

Day 2: 3

Day 3: 1

Not sure how I can hit such limits so quickly with such low scores on its own chart.

skywhopper2mo ago

The limits are smaller now, is how.

2 more replies

brenoRibeiro7062mo ago

I agree; I think that's what happened. But it's a shame—I'm having a lot of trouble with poor-quality results from Claude-Code, and the session limit is being used up quickly.

garganzol2mo ago

Makes sense, even plan name seems to agree: "Claude Max".

bradgessler2mo ago

Reminds me of an “all you can eat” buffet I was at once where the owner told me, “that’s it, that’s all you can eat” and cut it off.

2 more replies

guelo2mo ago

The responsible thing would be to not sell way more subscriptions than their capacity. But they have to show the exponential revenue curves to their investors. I cancelled my subscription yesterday.

politelemon2mo ago

What is openai's involvement here, as I am out of the loop.

ezfe2mo ago

Claude: Autonomous weapons and domestic surveillance are our red line

Pentagon: No

OpenAI: We are okay if the line is merely a suggestion and we encourage you not to cross it!

Pentagon: Yes we pick that option

masklinn2mo ago

I assume it's anthropic rejecting the US Government's use of their software for domestic mass surveillance or fully autonomous weapons, and openai happily agreeing to it.

That has led to a significant number of people switching over from openai, or at least stating they were going to do so.

Analemma_2mo ago

BlueRock-Jake2mo ago

On the nose. Dealt with this last week. Ran maybe 5 queries (not even in code) and was maxed out for the day. What a great way to spend my money

ramon1562mo ago

Distillation is how they're planning to make money? What a poor strategy. This is next level FOMO (Fear Of Not Being The #1 LLM Provider).

I have cancelled my subscription last week, I'll see them when they fix this nonesense

xantronix2mo ago· 25 in thread

rurp2mo ago

echelon2mo ago

> I get it. LLMs are cool technology.

I don't think many of you have legitimately tried Claude Code, or maybe you're holding it wrong.

I'm getting 10x the work done. I'm operating at all layers of the stack with a speed and rapidity I've never had before.

This isn't just a "cool technology". We've exited the punch card phase. And that is hard or impossible to come back from.

If you're not seeing these same successes, I legitimately think you're using it wrong.

It's as if you got access to fiber or broadband and were asked to go back to ISDN/dial up.

16 more replies

jlokier2mo ago

> the open arms embrace of subscription development tools (services, really) which seek to offload the very act itself makes me wonder how and why so many people are eager to dive right in

Here's a reason not in your list.

Multiple people I know who are employers have recently, without prompting, told me they no longer hire developers who don't use AI in their workflow.

woctordho2mo ago

[0] https://github.com/QuantumNous/new-api

[1] https://she-llac.com/claude-limits

gruez2mo ago

>An API provider aggregated thousands of accounts has much better stability than a single account

Isn't this almost certainly against ToS, at least if you're using "plans" (as opposed to paying per-token)?

3 more replies

michael_j_x2mo ago

throw48472852mo ago

"Addicted" "Superpowers" "manifest with my mind" "it feels amazing"

Why does it sound like you're on drugs? I know that sounds extremely rude, but I can't think of any other reasonable comparison for that language.

4 more replies

skydhash2mo ago

It’s like forcing everyone to learn a symphony with the record instead of the sheets. And often a badly recorded version.

2 more replies

withinboredom2mo ago

What I don’t understand, are the people who let it go over night or with whole “agent teams” working on software. I have no idea how they trust any of it.

snarfy2mo ago

Yep, I want to make stuff. Writing the code by hand was just a means to an end.

supriyo-biswas2mo ago

raxxorraxor2mo ago

I think some workflows are just that much faster with AI. And if not I can spare the time for a prompt to get work done in parallel to the stuff I work on.

I cannot even hope to reach that speed. It isn't a magic tool, but it really accelerates some task.

benterix2mo ago

I still use more traditional approach for finding bugs and other issues in my code, but the agentic workflow doesn't give me any net value.

stefan_2mo ago

I would love nothing more than ditching Claude for a local solution tomorrow. But it doesn't exist today, so it is what it is - you gotta keep up with the joneses.

Maybe in 5 years we'll have an open weights model that is in the "good enough" category that I can run on a RTX 9000 for 15k dollars or whatever.

xantronix2mo ago

torben-friis2mo ago

People will always go along with a removal of friction even against their benefit. It's natural bias, we have a preference for not spending energy.

LLMs are the ultimate friction removal. They can remove gaps or mechanical work that regular programming can, but more importantly they can think for you.

xantronix2mo ago

> stupid amounts for takeout

(this is serious and not sarcasm, by the way)

2 more replies

dyauspitr2mo ago

stephbook2mo ago

We could run it locally, but the problems that matter simply don't change.

Suddenly that $500/month/user seems like a steal.

xantronix2mo ago

Is this pace necessary? I feel like this is causing people to consider code to be disposable, and I think that both are a massive mistake.

cookiengineer2mo ago

The great part is that you can always build your own selfhosted tools. There is nothing that can't be done at home, it's just a calculation of how much you're willing to spend.

Lately though the RAM crisis is continuing and making things like this more unfeasible. But you can still use a lot of smaller models for coding and testing tasks.

Planning tasks I'd use a cloud hosted one, for now, because gemma4 isn't there yet and because the GPU prices are still quite insane.

Source: I am building my own offline agentic environment for Golang [1] which is pretty experimental but sometimes it's also working.

[1] https://github.com/cookiengineer/exocomp

xantronix2mo ago

I'm definitely all in on self-hosting, though I rent my compute and pay for bandwidth with Linode and storage with rsync.net.

The LLM bit though, personally, is just not for me.

DeathArrow2mo ago

>I get it. LLMs are cool technology.

It would be cool to run SOTA models on my own hardware but I can't. Hence, the subscription.

jimmaswell2mo ago

1 more reply

therealpygon2mo ago

kristjansson2mo ago· 12 in thread

No one is going to like this answer, but there’s a simple solution: pay for API tokens and adjust your use of CC so that the actions you have it take are worth the cost of the tokens.

It’s great to buy dollars for a penny, but the guy selling em is going to want to charge a dollar eventually…

Goronmon2mo ago

...pay for API tokens and adjust your use of CC so that the actions you have it take are worth the cost of the tokens

Do you feel there is enough visibility and stability around the "Prompt -> API token usage" connection to make a reliable estimate as to what using the API may end up costing?

kristjansson2mo ago

Time is a relatively good proxy for spend. There are also more ex post diagnostics like count and cost it can write to the status line.

I agree that ex ante it’s tough, and they could benefit from some mode of estimation.

Perhaps we can give tasks sizes, like T shirts? Or a group of claudes can spend the first 1M tokens assigning point values to the prospective tasks?

1 more reply

0xffff22mo ago

Nifty39292mo ago

Now we’re going to find out what these tools are really worth.

gonzalohm2mo ago

it's not a subsidy. It's predatory pricing and it should be illegal. I offer you a service at a loss to remove competition and then increase prices once you are stuck with it.

ronsor2mo ago

Actually, that is illegal.

2 more replies

bschwarz2mo ago

That's the VC playbook.

varispeed2mo ago

The problem with tokens is that they have wrong incentive. The quicker model arrives at the solution the less tokens you have to buy.

jimkleiber2mo ago

_flux2mo ago

It works out even if some customers are able to eat a lot, because people on average have a certain limit. The limits of computers are much higher.

1 more reply

criddell2mo ago

When you hire a person, you don't know what you are going to get out of them today.

If an hour of an excellent developer's time is worth $X, isn't that the upper bound of what the AI companies can charge? If hiring a person is better value than paying for an AI, then do that.

1 more reply

kristjansson2mo ago

If you need the tokens for real work, that’s what the API and the other providers like Bedrock are for. The subscription product is merely to whet your appetite.

2 more replies

ajb922mo ago· 11 in thread

The trend on the status page[1] does not inspire confidence. Beginning to wonder if this might be a daily thing.

[1] https://status.claude.com/

aurareturn2mo ago

They went from $9b ARR at the end of 2025 to $30b ARR today. That's more than 3x the size in 3 months. I expect growing pains.

For some context, they added 2x Palantir or .75x Shopify or .68x Adobe annual revenue in March alone.

twelvechairs2mo ago

Yeah its huge demand upswing from the growth of openclaw and similar pushing resources. Very clear from recent changes and announcement around this [0]

Fwiw there are worse delays from second tier providers like moonshot's kimik2.5 that are also popular for agentic use.

[0] https://news.ycombinator.com/item?id=47633396

rpozarickij2mo ago

samlinnfer2mo ago

And they are early adopters of the vibe coding paradigm, having a 100% Claude generated codebase.

1 more reply

nonameiguess2mo ago

sh1mmerOP2mo ago

They might need to do some vibe refactoring.

ryandrake2mo ago

2026 may be the year that many companies relearn: there is no problem that can’t be made worse by adding even more code.

giwook2mo ago

And then some vibe code reviewing.

fb032mo ago

Outages are already happening, besides that, we need vibe warrooming

cube002mo ago

They've also stopped reporting on the causes too, just "it's resolved" and they move on.

skippyboxedhero2mo ago

It has been a daily thing for 2-3 months.

CapmCrackaWaka2mo ago· 5 in thread

smt882mo ago

If Microsoft is your example of "brand death," Anthropic is dreaming of that kind of wild success and shouldn't care about its brand at all

fleischhauf2mo ago

I'm quite impressed on how far they got while the claude code code looks like it does.

muyuu2mo ago

VC money magic

love2read2mo ago

> they risk brand death like Microsoft

Is Microsoft (one of the largest companies in the world) really a victim of brand death?

mplewis2mo ago

have you ever met a person who likes outlook?

2 more replies

HoldOnAMinute2mo ago· 4 in thread

I solved this by upgrading Claude Code, closing down all instances, closing my browser, starting claude again, and doing a /login

stronglikedan2mo ago

I solved this by upgrading Claude Code, closing down all instances, closing my browser, and starting Codex

csomar2mo ago

Yes, an upgraded Claude Code instance telepathically improve Claude back-end servers.

giwook2mo ago

LOL telepathy!

It's actually via quantum entanglement.

reluctant_dev2mo ago

This resolved it for me as well but not sure if this was just a timing thing.

baq2mo ago· 4 in thread

Quothling2mo ago

The rest of the organisation, which is not software development or IT related, mainly uses GPT models. I just wish I hadn't taught risk management about claude code so they weren't wasting MY tokens.

fakwandi_priv2mo ago

I've been an avid fan of codex for the last few month's but finally hit the weekly limit so I've wanted to try out claude code before biting the bullet and going for the 200 dollar codex sub.

I will also clearly state that the value both these tools provide at these price points are absolutely worth it, it's just that codex's value/money ratio is much better.

m-schuetz2mo ago

Checking different models once every quarter is exactly what made me move to claude code.

skippyboxedhero2mo ago

They were far ahead for a brief period in November/December which is driving the hype cycle that now appears to be collapsing the company.

You have to test at least every month, things are moving quickly. Stepfun is releasing soon and seems to have an Opus-level model with more efficient architecture.

3 more replies

honeycrispy2mo ago· 4 in thread

The solution is clearly more vibe coding at anthropic.

I doubt even the core engineers know how to begin debugging that spaghetti code.

Lionga2mo ago

correct proompt is:"you are a senior engineer. fix issues. NO hallucinations this time. PRETTY PLEASE"

mring336212mo ago

You forgot the "No Mistakes!" clause

cube002mo ago

Needs more bold CRITICAL and some ultra-think

gedy2mo ago

You missed: "Simon says:"

tomasphan2mo ago· 3 in thread

98% uptime is not great. Our eng department is thinking about going half half with Codex but of course there’s a switching cost.

tornikeo2mo ago

p_stuart822mo ago

subscription leftovers are noise. the real switching cost is the harness glue.

prompts. tool calling quirks. evals. auth. retries. all the weird failure modes your team already paid to learn.

prabal972mo ago

FYI I use my Codex models with Claude code and they work pretty great. It can even pick up on existing conversations w/ Opus and then resume w/ OAI models.

butz2mo ago· 1 in thread

Run LLMs locally. Otherwise suffer service disruptions and very likely price hikes in the future.

jimkleiber2mo ago

Luckily, ISPs tend to be quite reliable and don't have outrageous price hikes, but maybe that's because of regulation or focused competition, I'm not sure.

ivanjermakov2mo ago· 1 in thread

Wonder what the next AI winter trigger would be. Coding agent client collapsing under its own tech debt?

bachmeier2mo ago

We'll see AI chat replace Google, we'll see companies adopting AI in high-value areas, and we'll see local models like Gemma 4 get used heavily.

websap2mo ago· 1 in thread

Isn't it a little weird that we trust this app to help us build some of the most important parts of our business and the company that vends this app keep breaking it in unique ways.

At my workplace we have been sticking with older versions, and now stick to the stable release channel.

scottyah2mo ago

I like dogfooding. You can use Azure if you want infra that is clearly not being used, tested, and pushed to the limits by its own creators.

laacz2mo ago· 1 in thread

nickvec2mo ago

Just trying to win over market share I assume. OpenAI is willing to subsidize to try to get people to switch from CC to Codex, and the best incentive is to offer more tokens at a lower price.

whicks2mo ago· 1 in thread

IME this isn't just a 'Claude Code' problem, I'm seeing extremely degraded / unresponsive performance using Opus 4.6 in Cursor.

smt882mo ago

The status page indicates issues on almost all services

postalcoder2mo ago· 1 in thread

I stopped using Claude Code several months ago and I can't say I've missed it.

The harness does cool things, don't get me wrong. But it comes with a ton of papercuts that don't belong in a professional product.

djmips2mo ago

Back to artisan all natural intelligence coding?

nathell2mo ago

HN’s guidelines say ‘Don’t editorialize’. The original title here is ‘[BUG] Claude Code login fails with OAuth timeout on Windows’, which is more specific and less clickbaity.

giancarlostoro2mo ago

Looks to be sourced from an outage:

https://status.claude.com/

SkyPuncher2mo ago

1. Me not wanting that for context management reasons

2. It burning tokens on an expensive model.

Literally a conversation that I just had:

* ME: "Have sonnet background agent do X"

* Opus: "Agent failed, I'll do it myself"

* Me: "No, have a background agent do it"

* Opus: Proceeds to do it in the foreground

* Flips keyboard

This has completely broken my workflows. I'm stuck waiting for Opus to monitor a basic task and destroy my context.

DiffTheEnder2mo ago

I'm finding queries are taking about 3x as long as they used to regardless of whether I use Sonnet or Opus (Claude Code on Max)

guzfip2mo ago

Anyone played much with Jetbrain’s LLM agent?

I’ve been toying around at home with it and I’ve been fine with its output mostly (in a Java project ofc), but I’ve run into a few consistent problems

- Pricing makes no sense to me, but Jetbrains offering seems to have its own layer of abstraction in “credits” that just seem so opaque.

dude2507112mo ago

How is coding "solved" then?

Unless they meant "all code that needs to be written has already been written" so their mission is to prevent any new code from being written via a kind of a bait and switch?

varispeed2mo ago

I found that telling Claude that it is trying to defraud you and making spend money often gets it back on track and return to pervious performance briefly until it agains starts doing nonsense.

mikkupikku2mo ago

JohnMakin2mo ago

The commenters here don't seem to realize this was posted during the outage yesterday that affected login for most claude code users.

alasano2mo ago

If you prepare yourself a token with "claude setup-token" (presuming you're not already locked out and had one) you can run "CLAUDE_CODE_OAUTH_TOKEN=sk.. claude" to use your account.

nprateem2mo ago

Antigravity has become near unusable too for the last week with Opus. Continual capacity alerts meaning tasks stop running.

Not worth the money now, will be canceling unless fixed soon.

fabbbbb2mo ago

Is this really relevant news? Please share more bug reports from popular services and tools. Feels a tiny bit biased. My CC is just fine since at least three weeks.

jostmey2mo ago

15000 milliseconds! Makes me laugh. I've had the same issue! Usually happens in the morning

arduanika2mo ago

The eternal return of https://xkcd.com/303/

world2vec2mo ago

I'm getting "Prompt is too long" a lot today

m3kw92mo ago

How are they making billions with reliability like that?

jollymonATX2mo ago

Simply put, Anthropic does not have enough compute.

mring336212mo ago

For a lot of my work, I'm pretty happy with OpenCode + GLM-4.7-Flash-REAP-23B-A3B-Q4_K_M.gguf running in llama.cpp.