NSA director: 'Mythos "broke into almost all of our classified systems in hours" (opens in new tab)

(economist.com)

15 pointsricksunny6d ago13 comments

13 comments

34 comments · 7 top-level

MaxPock6d ago· 8 in thread

What happens when open source models achieve Mythos level capabilities in six months' time?

I dont seem to understand why no one is talking about this obvious fact? I mean suddenyl everyone is banning .. ok .. well how many months behind are the open source models?

hdgvhicv5d ago

If you assume that open models catch up in 6-12-36 months, then you either assume exponential growth destroying the global economy and probably the world in a few years, or you assume it plateaus and commoditises.

Even if your country prevents access to compute to protect the trillion dollar companies, it’s not going to apply for every country, and as models get better it becomes easier to compete. There’s no way an AI non proliferation treaty will be passed or even enforceable.

impossiblefork5d ago

I think it's probably more like a year or a year and a half. I don't want to say two years, but it's what I'm actually thinking.

mariebks5d ago

GLM 5.2 is already between 4.6 Opus 4.7 Opus level based on Artificial Analysis aggregation. 4.6 Opus is about 4 months old at this point, so seems like open source is maybe 4-6 months behind. It could still take a year but seems closer to 6 months.

1 more reply

handoflixue5d ago

We'll see Mythos 2.0 patching all the Mythos 1.0 vulnerabilities before we see an open-source Mythos 1.0.

What matters isn't the power of the tool, but whether defenders have had time to secure against. Today's cyberweapon is tomorrow's laughably obsolete.

Stuxnet used to be a national security threat, now I'm not sure it would be useful for anything.

anakaine5d ago

They were not saying f Open Spurce Mythos 1.0. They were talking about performance / capability parity in other open source models.

1 more reply

latentsea5d ago

Open source models get banned.

DaSHacka5d ago

in America, maybe, but all of our adversaries will still have access to them, and continue working on them.

mirekrusin6d ago· 7 in thread

If mythos can break into almost all of their classified systems in hours then other models including opus, gpt, gemini and large open weight models can do so as well, maybe you'll have to double hours or it may become days, but they also will, there is no "maybe" in here.

State sponsored, non-public penetration fine tunes (of possibly public ones) likely can do it even faster.

Unsupervised penetration RL loop is ideal setup similar to optimization one – it's relatively easy to gain function on it.

dualvariable5d ago

Also, this is just security through obscurity. The holes that mythos exploited still exist after you've tried to limit mythos accessibility.

And the fact that all our systems are riddled with security holes shouldn't be too much of a surprise given the way that we all know that software is developed and how tech debt / chores are constantly underbudgeted (plus I think this underscores that any one human's knowledge and attention are inherently limited, and even the best PR review is going to leak all kinds of security holes).

mirekrusin5d ago

Yes, exactly, quite shocking, if something like this is true, as NSA (!!) director you keep it quiet, right?

2 more replies

johndough6d ago

I don't think that is necessarily true.

- With a weaker model, the time to break into the system might grow so larger that it becomes infeasible, similar to how password hashes can be bruteforced, but if the password is long enough, that is not going to happen in our lifetime.

- There might be problems which are inherently unsolvable with a lower level of intelligence. For example, your dog won't derive calculus from scratch, even if it lived forever.

- LLMs might be biased in such a way that they never explore the entire solution space, no matter how many attempts are made. Some models are notorious for getting stuck in a loop, trying small variations of the same approach every time, even though it is doomed to fail. This can be counteracted somewhat with higher sampling temperature, but that hurts reasoning capabilities.

BikDk6d ago

The concept of infinity claims that the dog eventually becomes Shakespeare. The same way we handled encryption, even before Alan Turing codes were broken and evolved. Last, it is a huge advantage to have the machine/mind and to evolve from there. P.S. Even if you go back to lemon juice on paper there may be a thief around that knows the trick.

2 more replies

awesomeusername5d ago

Dogs deriving calculus:

https://www.csun.edu/~dgray/BE528/Pennigs2003Dogs_Calculus.p...

mirekrusin6d ago

Mythos and other models are not brute-forcing passwords (and with this analogy passwords, ie. systems are the same).

We're not talking about dogs, but LLM systems.

Mythos is not exploring entire solution space either.

Usually looping is solved by repetition/frequency/presence/n-gram penalties/DRY/min-p sampling, not temperature but we're not talking about small models that have those classes of issues here.

1 more reply

Reubend5d ago

I think you're missing the point. Everything you said is theoretically correct, but the parent comment was talking about the concrete circumstance of pentesting with the top models today.

Let's just take GPT 5.5 and Opus 4.8 as an example. Both are worse than Mythos 5, but they're capable of quite a bit when the guardrails are lifted and they're paired with a skilled human operator. They more than "good enough" to reach the same result with the addition of some human effort.

vsgherzi6d ago· 5 in thread

This is really making me raise an eyebrow. I’m sure mythos is an improvement for sure. I don’t think the framing of it hacked the entire NSA is fully truthful. I’d like a more in depth understanding of what actually happened. Excited to be proved wrong tho!

Epa0956d ago

Yeah, this article cites someone saying that someone else said something. Maybe it was said, maybe not. Maybe it was a exaggeration, maybe not.

SirFatty6d ago

Very insightful.

1 more reply

stithpragya5d ago

From the outset, Mythos’s PR has been rather dodgy.

Gee1014d ago

Marketing has been brilliant thou.

DANmode5d ago

They said “almost”, for starters.

ionwake6d ago· 3 in thread

Not being funny but does most of HN subscribe to the economist? I dont think ive ever paid for an online newspaper ( and Im not trying to be edgy )

amanaplanacanal5d ago

If I was going to pay for a news subscription, it would probably be the economist. Or maybe the financial times. They both seem to still have solid journalism.

kingleopold5d ago

they have solid exor acting for sure

arvid-lind6d ago

more likely, most of HN who care about reading this article use something like archive.is

ggm6d ago· 2 in thread

https://archive.is/aA1dB

pelario6d ago

The link does not seem to be working

johndough6d ago

Works for me though, even when using a proxy that is usually blocked everywhere.

bel86d ago· 2 in thread

HN post title does not match link title

> NSA director: 'Mythos "broke into almost all of our classified systems in hours"

> Donald Trump’s blocking of Anthropic is capricious and chaotic

ricksunnyOP6d ago

I’ve found that high community-upvoted posts don’t bury the lede by parroting the headline. I used to be a headline title scribe until the HN community showed me the light.

bel86d ago

The article has nothing about mythos breaking into classified systems in hours.

So you either posted the wrong link or are just spreading FUD.

1 more reply

ggm6d ago

I made a point about this in relation to anthropic last week: nobody inside the strategic information spaces is worried about AGI they're worried about core strategic information leaking out. Either it's in the model, or the model exposes pathways to finding it in the core strategic systems.

Those "tapes" DOGE took away? Nothing on them can be considered private any more. That's how brute force risk happens. Mythos' risks are showing doorways to exfiltration surely? Why bother when you can walk out the door with a data dump?

The NSA is just a highly specific subclass of the problem. Their traditional publicly stated approach to security is "nothing electronic which enters our domain leaves" and yet somehow they have assessed these systems as capable of breaching their walls? That's super bad.

I suspect they ran an analogue/instance inside their protection rings. I doubt they ran a test outside in the global internet. If they have actually lost control of their boundary, that's a bigger story (which I doubt) and contextually he could have been referring to information systems in NSAs duty of care, not things inside Ft Meade.

j / k navigate · click thread line to collapse

13 comments

34 comments · 7 top-level

MaxPock6d ago· 8 in thread

What happens when open source models achieve Mythos level capabilities in six months' time?

ionwake6d ago

I dont seem to understand why no one is talking about this obvious fact? I mean suddenyl everyone is banning .. ok .. well how many months behind are the open source models?

hdgvhicv5d ago

impossiblefork5d ago

I think it's probably more like a year or a year and a half. I don't want to say two years, but it's what I'm actually thinking.

mariebks5d ago

1 more reply

handoflixue5d ago

We'll see Mythos 2.0 patching all the Mythos 1.0 vulnerabilities before we see an open-source Mythos 1.0.

What matters isn't the power of the tool, but whether defenders have had time to secure against. Today's cyberweapon is tomorrow's laughably obsolete.

Stuxnet used to be a national security threat, now I'm not sure it would be useful for anything.

anakaine5d ago

They were not saying f Open Spurce Mythos 1.0. They were talking about performance / capability parity in other open source models.

1 more reply

latentsea5d ago

Open source models get banned.

DaSHacka5d ago

in America, maybe, but all of our adversaries will still have access to them, and continue working on them.

mirekrusin6d ago· 7 in thread

State sponsored, non-public penetration fine tunes (of possibly public ones) likely can do it even faster.

Unsupervised penetration RL loop is ideal setup similar to optimization one – it's relatively easy to gain function on it.

dualvariable5d ago

Also, this is just security through obscurity. The holes that mythos exploited still exist after you've tried to limit mythos accessibility.

mirekrusin5d ago

Yes, exactly, quite shocking, if something like this is true, as NSA (!!) director you keep it quiet, right?

2 more replies

johndough6d ago

I don't think that is necessarily true.

- There might be problems which are inherently unsolvable with a lower level of intelligence. For example, your dog won't derive calculus from scratch, even if it lived forever.

BikDk6d ago

2 more replies

awesomeusername5d ago

Dogs deriving calculus:

https://www.csun.edu/~dgray/BE528/Pennigs2003Dogs_Calculus.p...

mirekrusin6d ago

Mythos and other models are not brute-forcing passwords (and with this analogy passwords, ie. systems are the same).

We're not talking about dogs, but LLM systems.

Mythos is not exploring entire solution space either.

Usually looping is solved by repetition/frequency/presence/n-gram penalties/DRY/min-p sampling, not temperature but we're not talking about small models that have those classes of issues here.

1 more reply

Reubend5d ago

I think you're missing the point. Everything you said is theoretically correct, but the parent comment was talking about the concrete circumstance of pentesting with the top models today.

vsgherzi6d ago· 5 in thread

Epa0956d ago

Yeah, this article cites someone saying that someone else said something. Maybe it was said, maybe not. Maybe it was a exaggeration, maybe not.

SirFatty6d ago

Very insightful.

1 more reply

stithpragya5d ago

From the outset, Mythos’s PR has been rather dodgy.

Gee1014d ago

Marketing has been brilliant thou.

DANmode5d ago

They said “almost”, for starters.

ionwake6d ago· 3 in thread

Not being funny but does most of HN subscribe to the economist? I dont think ive ever paid for an online newspaper ( and Im not trying to be edgy )

amanaplanacanal5d ago

If I was going to pay for a news subscription, it would probably be the economist. Or maybe the financial times. They both seem to still have solid journalism.

kingleopold5d ago

they have solid exor acting for sure

arvid-lind6d ago

more likely, most of HN who care about reading this article use something like archive.is

ggm6d ago· 2 in thread

https://archive.is/aA1dB

pelario6d ago

The link does not seem to be working

johndough6d ago

Works for me though, even when using a proxy that is usually blocked everywhere.

bel86d ago· 2 in thread

HN post title does not match link title

> NSA director: 'Mythos "broke into almost all of our classified systems in hours"

> Donald Trump’s blocking of Anthropic is capricious and chaotic

ricksunnyOP6d ago

I’ve found that high community-upvoted posts don’t bury the lede by parroting the headline. I used to be a headline title scribe until the HN community showed me the light.

bel86d ago

The article has nothing about mythos breaking into classified systems in hours.

So you either posted the wrong link or are just spreading FUD.

1 more reply

ggm6d ago

j / k navigate · click thread line to collapse