undefined | Better HN

0 pointsnorthern-lights29d ago0 comments

> Not only that, but we plan to release a new class of model with even higher intelligence than Opus. As part of Project Glasswing, a small number of organizations are currently using Claude Mythos Preview for cybersecurity work. Models of this capability level require stronger cyber safeguards before they can be generally released. We’re making swift progress on developing these safeguards and expect to be able to bring Mythos-class models to all our customers in the coming weeks.

Probably more interesting than the 4.8 release.

0 comments

17 comments · 6 top-level

TIPSIO29d ago· 5 in thread

Seems like they might be hinting that if you are not a billionaire or multi-billion dollar company you will just get a limited and nerfed Claude Code slash command /mythos-security-audit or something.

Hope this isn’t the case and that normal average Joe’s of the world don’t get policed out of access.

gs1729d ago

> you will just get a limited and nerfed Claude Code slash command /mythos-security-audit or something.

Unless it's so expensive that we can't realistically use it for anything, I wouldn't complain about getting at least that. I would also rather have the actual model, but that's a useful application of it (and I'm probably not going to afford using it for much more).

3 more replies

dbbk29d ago

What does an average Joe need a Mythos level model for that Opus can't do for them?

2 more replies

Tepix29d ago

It does sound like an even higher API price tier for sure.

hedora29d ago

Isn't OpenAI's public flagship already beating Mythos on penetration testing? I get the impression Mythos is just valuation-juicing for IPO more than anything else.

The fact that they haven't released it yet suggests a cost/margins issue to me more than anything else. Short term, I'll probably keep using Antrhopic, but my long-term bet is that locally-served models win, if only because the quest for profitability will probably lead to intentionally-nerfed / enshittified frontier models.

At other vendors, ad placement within LLM responses is either coming or already here. Anthropic's handling of OpenClaw shows they're willing to engage in anti-competitive behavior, and the courts are not in a hurry to stop them. Why would I pay them $200 a month for such treatment when a $2K box does what I need locally?

3 more replies

kdmtctl29d ago

This command would be not so bad for not a billionaire me.

zamalek29d ago· 3 in thread

> Probably more interesting

It is widely suspected that self-inflicted "bad news" ("Mythos is so dangerous we just can't give the public access to it") is nothing more than Dario's typical style of marketing - keep in mind that they have an IPO coming up, because he certainly factors that into everything he says in public (as is his responsibility, to be fair).

An alternative reason for delaying the model might not be "we are trying to make it safe." It could be "we don't know how to host this thing at scale, or cost-effectively".

GPT 5.5 has already been shown to be as adept as Mythos at finding vulnerabilities.

Finally, laymen massively underestimate the importance of the harness for model performance. OpenHands existed long before Claude Code, Claude Code changed everything because of the clever hand-holding it does. Mythos is definitely more than just a model.

clbrmbr28d ago

One capability that I see is missing from opus is this ability to understand an entire system. My hope is that a mythos class model will be able to comprehend even something as complicated as an IOT system with a hardware and firmware layer multiple API’s backend and different kinds of API and web clients.

The main limitation we’ve had to agentic coding is an understanding of this system that spans processes running on different machines and architectures.

1 more reply

LPisGood28d ago

What sort of clever handholding does Claude code do?

1 more reply

KerryJones28d ago

"GPT 5.5 has already been shown to be as adept as Mythos at finding vulnerabilities."

Do you have any data on this (other than benchmarks)?

1 more reply

scuderiaseb29d ago· 3 in thread

So this is how they’ll remove access from Claude Pro to the biggest models. You would need at least a Claude Max subscription for the bigger than Opus models I bet.

F7F7F729d ago

Anthropic's wants to sell us Claude Code with no model selection at all.

Opus seems to be overly eager of late to 'vibe' out entire solutions and build out things that you didn't ask for.

/goals is helping set the narrative that does it really matter if Sonnet and 3 Haiku agents got you to that end state...eventually...if its what you asked for?

For better or worse Opus is already handing off 80% of its work to background agents of Sonnet, Haiku, and likely a quantized Opus.

Want model selection? Pay for the API.

1 more reply

swalsh29d ago

Its amazing how quickly ive just become accustomed to being a max subscriber. I dont think I could go back to pro.

1 more reply

selcuka28d ago

They have already been experimenting with such ideas [1]:

> Claude Code Removed from $20-a-Month "Pro" Subscription for New Users

[1] https://news.ycombinator.com/item?id=47855832

andai29d ago

In the Opus 4.7 release notes they mentioned intentionally making it worse at cybersecurity. [0]

This suggests that they're doing the same thing with Mythos now and the Mythos we get will be nerfed in that department?

Or more precisely, I think they'll have two versions of Mythos, and the scary one will probably continue to require a lot of paperwork.

https://www.anthropic.com/news/claude-opus-4-7

ac2929d ago

More interesting than that to me is "we’re working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost"

Sonnet and Haiku look real outclassed for the price with current Chinese competition.

_heimdall28d ago

I'm still not sure what safeguards they can be adding here. Unless they've suddenly solved alignment, at best isn't it a collection of system prompts saying what not to do and potentially some screening algorithms that try to catch key phrases in inputs/outputs?

j / k navigate · click thread line to collapse

0 comments

17 comments · 6 top-level

TIPSIO29d ago· 5 in thread

Hope this isn’t the case and that normal average Joe’s of the world don’t get policed out of access.

gs1729d ago

> you will just get a limited and nerfed Claude Code slash command /mythos-security-audit or something.

3 more replies

dbbk29d ago

What does an average Joe need a Mythos level model for that Opus can't do for them?

2 more replies

Tepix29d ago

It does sound like an even higher API price tier for sure.

hedora29d ago

Isn't OpenAI's public flagship already beating Mythos on penetration testing? I get the impression Mythos is just valuation-juicing for IPO more than anything else.

3 more replies

kdmtctl29d ago

This command would be not so bad for not a billionaire me.

zamalek29d ago· 3 in thread

> Probably more interesting

An alternative reason for delaying the model might not be "we are trying to make it safe." It could be "we don't know how to host this thing at scale, or cost-effectively".

GPT 5.5 has already been shown to be as adept as Mythos at finding vulnerabilities.

clbrmbr28d ago

The main limitation we’ve had to agentic coding is an understanding of this system that spans processes running on different machines and architectures.

1 more reply

LPisGood28d ago

What sort of clever handholding does Claude code do?

1 more reply

KerryJones28d ago

"GPT 5.5 has already been shown to be as adept as Mythos at finding vulnerabilities."

Do you have any data on this (other than benchmarks)?

1 more reply

scuderiaseb29d ago· 3 in thread

So this is how they’ll remove access from Claude Pro to the biggest models. You would need at least a Claude Max subscription for the bigger than Opus models I bet.

F7F7F729d ago

Anthropic's wants to sell us Claude Code with no model selection at all.

Opus seems to be overly eager of late to 'vibe' out entire solutions and build out things that you didn't ask for.

/goals is helping set the narrative that does it really matter if Sonnet and 3 Haiku agents got you to that end state...eventually...if its what you asked for?

For better or worse Opus is already handing off 80% of its work to background agents of Sonnet, Haiku, and likely a quantized Opus.

Want model selection? Pay for the API.

1 more reply

swalsh29d ago

Its amazing how quickly ive just become accustomed to being a max subscriber. I dont think I could go back to pro.

1 more reply

selcuka28d ago

They have already been experimenting with such ideas [1]:

> Claude Code Removed from $20-a-Month "Pro" Subscription for New Users

[1] https://news.ycombinator.com/item?id=47855832

andai29d ago

In the Opus 4.7 release notes they mentioned intentionally making it worse at cybersecurity. [0]

This suggests that they're doing the same thing with Mythos now and the Mythos we get will be nerfed in that department?

Or more precisely, I think they'll have two versions of Mythos, and the scary one will probably continue to require a lot of paperwork.

https://www.anthropic.com/news/claude-opus-4-7

ac2929d ago

More interesting than that to me is "we’re working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost"

Sonnet and Haiku look real outclassed for the price with current Chinese competition.

_heimdall28d ago

j / k navigate · click thread line to collapse