Then, I only switched from gpt-4 to gpt-5 because the price was cheaper lolz
So?
Modern software is designed with a defense in depth model, so it often requires chaining multiple vulnerabilities to get a successful exploit. But individual vulnerabilities still need finding and fixing because people might find vulnerabilities in the other isolation layers later.
I swear every time an LLM does something useful, the usual band of skeptics bends over backwards trying to invent reasons to dismiss it.
Also, I don't believe it is fair to dismiss skeptics as inventing reasons. If anything, "believers" are bending over backwards to praise Anthropic even though they didn't actually release anything.
Quite possible.
Opus is also not the worst at hacking things either. Sometimes it hacks things 'by accident' you see. If Mythos is better at it, then at some point, yeah, I can see how that might start to become a problem. Especially running unsupervised.
No reason to expect capabilities of models are going to stop.
They are saying "trust me, bro, I have a superhacker model" and proceed to show 0 evidence that it is what they are hyping.
E.g. even though LLMs can generate code and we have agents - the profession of software engineers is not being destroyed. The demand for software engineers in the labour market is still strong.
Also a thing that wasnt spoken about loudly (and for good reason) is that code is not perfect - this means bugs/vulnerabilities are there. And the reality is, it is optimal to have done this - for it were not done, the release of software and the moving of resources towards other projects would slow. Aka slowing down economic activity.
https://news.ycombinator.com/item?id=47717587
And here we learn that Mythos is not a big deal. Are there people who believe both?
One day I hear it is all a marketing pitch, another day I hear it can literally end earth so it should be regulated.
How do I reconcile this?
In my view the naysayers always simply been moving the goalposts, and never admit when they were wrong. "AI just produces slop" -> "AI can't write useful code" -> "AI can't take SWE jobs" -> [we are here]
You, just like 'them' are no better. It would be better if all those on the extreme ends could be muted - the truth is closer to the middle.
Didn't Oracle just lay off 30k people, Meta plans on laying off thousands more this year, Microsoft and Amazon have already done layoffs in the name of AI?
So at what point should naysayers update their priors? How many times must people be proven wrong to think "maybe I have the wrong perspective here after all?"
Full disclosure, I used to be a skeptic myself in the early days, but I think being a skeptic today is pure stubbornness, not rational.
I do not think the recent reliability and quality of major services would be considered acceptable if it weren't for a domestic tech oligopoly systematically lowering standards.
The U.S. auto industry also shows that oligopolies can shut out competition for quite some time and steer their captive market into accepting mediocrity, but innovation continues anyway. It's quite the contrast seeing domestic automakers turn their backs on EVs at the same time the Chinese are advertising 10 minute flash charging.
What matters is the aggregate activity in the labour market - for which - the activity associated with demand for software engineers is healthy.