0 points · cs702 · 2mo ago · 0 comments

Thank you for coming on HN and offering to answer questions.[a]

This is a fantastic piece, very timely, evidently well-researched, and also well-written. Judging by the little that I know, it's accurate. Thank you for doing the work and sharing it with the world.

OpenAI may be in a more tenuous competitive position than many people realize. Recent anecdotal evidence suggests the company has lost its lead in the AI race to Anthropic.[b]

Many people here, on HN, who develop software prefer Claude, because they think it's a better product.[c]

Is your understanding of OpenAI's current competitive position similar?

---

[a] You may want to provide proof online that you are who you say you are: https://en.wikipedia.org/wiki/On_the_Internet%2C_nobody_know...

[b] https://www.latimes.com/business/story/2026-04-01/openais-sh...

[c] For example, there are 2x more stories mentioning Claude than ChatGPT on HN over the past year. Compare https://hn.algolia.com/?dateRange=pastYear&page=0&prefix=tru... to https://hn.algolia.com/?dateRange=pastYear&page=0&prefix=tru...

42 comments · 8 top-level

unsupp0rted2mo ago· 20 in thread

Many of us prefer OpenAI's Codex, because we think it's a better product.

No comment on the CEO: I just find the product superior in everything but UI/UX and conversation. It's better at quality code.

mliker2mo ago

Who is “us”? It does seem that some scientists prefer Codex for its math capabilities but when it comes to general frontend and backend construction, Claude Code is just as good and possibly made better with its extensive Skills library.

Both Codex and Claude Code fail when it comes to extremely sophisticated programming for distributed systems.

keldaris2mo ago

As a scientist (computational physicist, so plenty of math, but also plenty of code, from Python PoCs to explicit SIMD and GPU code, mostly various subsets of C/C++), I can confirm - Codex is qualitatively better for my usecases than Claude. I keep retesting them (not on benchmarks, I simply use both in parallel for my work and see what happens) after every version update and ever since 5.2 Codex seems further and further ahead. The token limits are also far more generous (and it matters, I found it fairly easy to hit the 5h limit on max tier Claude), but mostly it's about quality - the probability that the model will give me something useful I can iterate on as opposed to discard immediately is much higher with Codex.

For the few times I've used both models side by side on more typical tasks (not so much web stuff, which I don't do much of, but more conventional Python scripts, CLI utilities in C, some OpenGL), they seem much more evenly matched. I haven't found a case where Claude would be markedly superior since Codex 5.2 came out, but I'm sure there are plenty. In my view, benchmarks are completely irrelevant at this point, just use models side by side on representative bits of your real work and stick with what works best for you. My software engineer friends often react with disbelief when I say I much prefer Codex, but in my experience it is not a close comparison.

zeroxfe2mo ago

I'm in that camp -- I have the max-tier subscription to pretty much all the services, and for now Codex seems to win. Primarily because 1) long horizon development tasks are much more reliable with codex, and 2) OpenAI is far more generous with the token limits.

Gemini seems to be the worst of the three, and some open-weight models are not too bad (like Kimi k2.5). Cursor is still pretty good, and copilot just really really sucks.

the__alchemist2mo ago

Claude Code, Codex, and Cursor are old news. If you're having problems, it's because you're not using the latest hotness: Cludge. Everyone is using it now - don't get left behind.

unsupp0rted2mo ago

Us = me and say /r/codex or wherever Codex users are. I've tried both, liked both, but in my projects one clearly produces better results, more maintainable code and does a better job of debugging and refactoring.

zem2mo ago

I've found claude startlingly good at debugging race conditions and other multithreading issues though.

DeathArrow2mo ago

Many paying customers say that Anthropic degraded the capability of Opus and Claude Code in the last months and the outcomes are worse. There are even discussions on HN about this.

Last one is from yesterday: https://news.ycombinator.com/item?id=47660925

lhl2mo ago

As some other people mentioned, using both/multiple is the way to go if it's within your means.

I've been working on a wide range of projects and I find that the latest GPT-5.2+ models seem to be generally better coders than Opus 4.6; however, the latter tends to be better at big-picture thinking, structuring, and communicating, so I tend to iterate through Opus 4.6 max -> GPT-5.2 xhigh -> GPT-5.3-Codex xhigh -> GPT-5.4 xhigh. I've found GPT-5.3-Codex is the most detail-oriented, but not necessarily the best coder. One interesting thing is that for my high-stakes project, I have one coder lane but use all the models to do independent review, and they tend to catch different subsets of implementation bugs. I also notice huge behavioral changes based on changing AGENTS.md.

In terms of the apps, while Claude Code was ahead for a long while, I'd say Codex has largely caught up in terms of ergonomics, and in some things, like the way it lets you inline or append steering, I like it better now (or where it's far, far ahead - the compaction is night and day better in Codex).

(These observations are based on about 10-20B/mo combined cached tokens, human-in-the-loop, so heavy usage and most code I no longer eyeball, but not dark factory/slop cannon levels. I haven't found (or built) a multi-agent control plane I really like yet.)

Razengan2mo ago

I have been using Codex AND Claude side by side for the same project*, with the same prompts.

Codex has been consistently better on almost every level.

* (an open source framework for 2D games in Godot 4.6 GDScript, mostly using AI to review existing code)

7thpower2mo ago

Not a scientist and use codex for anything complex.

I enjoy using CC more and use it for non coding tasks primarily, but for anything complex (honestly most of what I do is not that complex), I feel like I am trading future toil for a dopamine hit.

baq2mo ago

I’m one of those ‘us’, Claude’s outputs require significant review and iteration effort (to put it bluntly they get destroyed by gpt and Gemini). I’m basically using sonnet to do code search and write up since it is a better (more human-like) writer than gpt and faster and more reliable than gemini, but that’s about it.

bko2mo ago

I also find Codex much more generous in terms of what you get with a Pro ($20/mo) subscription. I use it pretty much non-stop and I have yet to hit a limit. Weekly reset is much better as well.

DeathArrow2mo ago

I prefer GLM 5.1 and MiniMax 2.7. With a better harness like Forge Code, I have better results for way less money than by using GPT and Opus.

jbergqvist2mo ago

Usage limits are more generous and GPT 5.4 is a good model, but yes, UI/UX lags behind Claude Code. Currently I'm especially missing /rewind with code restoration and proper support for plugin marketplaces

KaiserPro2mo ago

GPT/Claude/Gemini are pretty interchangeable at this point.

baq2mo ago

Absolutely not the case. They're complementary.

shevy-java2mo ago

Does this work for people? To me having a "better product" would be completely irrelevant if the use cases are evil.

thaoanh404 2mo ago

i find myself being more productive with codex/copilot on coding tasks, but claude does seem to be better at planning

MrSkelter2mo ago

Here’s a reality check.

There are two types of vibe coders. Those who review the code generated and those who don’t.

Either because they don’t understand code at all, or because they don’t have time and don’t care.

Code quality is only one factor. Naive vibe coders, who don’t code otherwise, rate performance based on output alone.

aaa_aaa2mo ago

Shill talk

ronanfarrow2mo ago· 7 in thread

Thank you for this, very much appreciate the thoughtful response.

The piece captures some of the anxieties within OpenAI right now about their competitive position. This obviously ebbs and flows but of late there has been much focus on Anthropic's relative position. We of course mention the allegations of "circular deals" and concerns about partners taking on debt.

cs702 OP 2mo ago

Thank you. Yes, I saw that. The company's always been surrounded by endless talk about insane hype, speculative bubbles, and financial engineering. I wasn't asking so much about that.

I was asking more about your informed view on how OpenAI's technology, products, and roadmap are perceived, particularly by customers and partners, in comparison to those of competitors.

If you have an opinion about that, everyone here would love to hear about it.

cs702 OP 2mo ago

UPDATE: Well-regarded people on HN are saying OpenAI's most recent GPT-5x codex model is better than Claude 5x for certain coding tasks:

https://news.ycombinator.com/item?id=47707494

globalnode2mo ago

At this point even Google's AI search results are better than GPT - obv. this is not for full programs, but if you know what you're doing and just want a snippet, that's all you need.

irishcoffee2mo ago

My guess is that the answer to your question, fantastic question, is that nobody knows. I remember having the same thoughts when Covid was first “arriving” if you will: we wanted people in the know to throw us a nugget of information, and they just didn’t know.

As it turns out, and what I’m kind of going with for this LLM shit, is that it’ll play out exactly how you think it will. The companies are all too big to fail, with billionaire backers who would rather commit fraud than lose money.

Ericson2314 2mo ago

Ronan Farrow's expertise is investigations into elite amorality, not evaluating technical products. Why are you asking this question?

keepamovin2mo ago

If you were in charge of deciding what should be done with Sam Altman, what would you choose?

giancarlostoro2mo ago

I mean, it's a fair question, though it does make some wonder how extreme the answers could be, so I could see why you're being downvoted.

The problem is sometimes on paper everything people like Sam Altman do is legal, despite it harming so many. We've literally had a major RAM producer pull out of the consumer RAM market. I feel like Sam Altman should be investigated and heavily scrutinized. He kind of is the biggest bubble in the AI bubble, we're letting him fester too far into it too, and these circular deals have seemingly somewhat stopped for now, but it might only get worse.

brightbeige2mo ago· 3 in thread

He’s replying on this twitter thread - perhaps someone with an account can ask there and link his comment here?

https://xcancel.com/RonanFarrow/status/2041127882429206532#m

jamiequint2mo ago

Here is the actual link, not a link to some weird third-party site that can't be trusted.

https://x.com/RonanFarrow/status/2041127882429206532

rounce2mo ago

FYI xcancel is just a mirror that allows reading replies without needing an account.

SwellJoe2mo ago

Whereas X can be trusted?

georgemcbay2mo ago· 3 in thread

> You may want to provide proof online that you are who you say you are

Unfortunately it probably doesn't even matter here on HN considering how brigaded down this story is predictably getting.

But yeah, it was a fantastic piece.

dang2mo ago

It wasn't getting "brigaded down" - it set off a software penalty called the flamewar detector. I turned that off as soon as I saw it.

cs702 OP 2mo ago

Thank you for keeping HN sane :-)

ronanfarrow2mo ago

Fair request, here you go: https://x.com/RonanFarrow/status/2041203911697068112

ed2mo ago· 1 in thread

It's worth noting Codex has 2x more stories than Claude https://hn.algolia.com/?query=codex

cloverich2mo ago

But by page 5, those stories have around 50-60 karma, while claude page five is still 500+

(i found your comment surprising based on my daily hn reading recollection - i mostly read top N daily and feel i only occasionally see codex stories).

ATMLOTTOBEER2mo ago

Yeah we moved to Claude a few months ago, mostly because the devs kept using it anyway. Altman stuff is interesting but at the end of the day you just go with whatever tool works

cableshaft2mo ago

Personally, I prefer Claude for coding, but I still prefer ChatGPT for hashing out ideas for my projects (which tend to be game designs). So I use both.

lasky2mo ago

I’m assuming this is all sarcasm.
