Hardening Firefox with Claude Mythos Preview (opens in new tab)

(hacks.mozilla.org)

365 pointsHieronymusBosch2d ago160 comments

https://arstechnica.com/information-technology/2026/05/mozil...

160 comments

Again, and this is important:

A bug is a bug. A “potential vulnerability” is a bug. A vulnerability is verifiable as having security implications with a proof of concept or other substantial evidence.

Words matter. Bugs matter. It’s important to fix large amounts of bugs, just as it always has been, and has been done. Let that be impressive on its own, because it IS impressive.

Mythos didn’t write 271 PoC for vulnerabilities and demonstrate code path reachability with security implications. Mythos found 271 valid bugs. Let that be enough.

6 more replies

kajman2d ago

I dismissed the earlier non-technical blog post as shameless product boosterism for Anthropic. The linked hacks blog (which is a better source than this article) is a welcome release. It's hard to deny there's something real to this now, I think. Mozilla's internal definition of a "vulnerability" is also probably more widely applied than what many would intuit, but it is good that these issues are being taken seriously and fixed.

2 more replies

input_sh2d ago

Original source: https://news.ycombinator.com/item?id=48051079

It's better because it actually lists a sample of Bugzilla reports that were made public. This topic was discussed previously (36 comments two weeks ago: https://news.ycombinator.com/item?id=47885042), but the part about bug reports being made public is brand new.

Diti2d ago

I hope to see the day when (or if) the LLMs get so good at spotting and fixing bugs that all that’s left for the Firefox engineers to do is to focus on adding new features.

This isn’t sarcasm. Firefox deserves to be used more. Most people I know don’t use it because “Chrome does almost everything better”, and Firefox can’t compete with the other browsers’ roadmaps.

6 more replies

tialaramex2d ago

They've only linked a few tickets, so of course maybe when we see all 271 actual distinct things the insight won't apply but all those I examined ended up as some C++ code with a nasty bug in it.

Firefox is written in several languages, only about 25% of it is in C++ but every single one of these issues seems to touch the C++.

3 more replies

benced2d ago

Reading this article in the context of the Zig folks refusing to even consider LLM-generated bugs certainly shapes my perspective on what technologies will be in my toolchain.

2 more replies

OhMeadhbh2d ago

When I was at PalmSource, I tried to get budget for CoVerity or Fortify (static code analysis tools.). "Too expensive," my management chain said. I spent another year putting together a deal for a lower cost but limited to scanning the network stack. "No, it's based on BSD and BSD is inherently secure," my management chain said (neither is true, btw.)

I eventually left and wound up at Mozilla where there were a number of /* flawfinder ignore */ comments scattered throughout the code.

My guess is that Mythos just ignored the "flawfinder ignore" directives and reported the known vulnerabilities in the code.

2 more replies

fg1372d ago

What are people's thoughts on how this could affect static analysis tools? I know they are very different beats but often they achieve the same goal. Static analysis tools can be slow, and they report lots of false positives.

I wonder if these models will get good + cheap enough so that people rarely reach for static analysis.

4 more replies

delichon2d ago

In the latest Mission Impossible, saving the world depends on recovering the original software of an escaped superhuman AGI from a sunken Russian submarine. Luther writes a "poison pill" that given the original source will instantly one-shot the AI. We were left to wonder how this magical code could have been written, but now we know. Luthor just wrote a Mythos prompt that handed it the source code and asked for an immutable critical exploit.

jwr2d ago

> Anyone building software can start using a harness with a modern model to find bugs and harden their code today. We recommend getting started now.

From what I understand, that is a recipe for getting quickly banned by commercial LLM providers?

danieltanfh952d ago

Really it was not the issue that Opus could not do all these, there was just no incentive to fix bugs. Mythos represented a real marketing use case, so yes thanks for spending money to fix this, but this is not sustainable.

crummy2d ago

Curious if people think LLMs will lead to more secure or less secure software in five years.

11 more replies

lschueller2d ago

Let's see, how this will improve the daily soc work. I still don't see, what's the big difference between Mythos and Opus, security wise. I'm confident, that this kind of vul detection is a long-term improvement. But does specifically Mythos makes such a big difference to "normal" models? I would love to see, what's the actual difference.

5 more replies

MetaverseClub2d ago

I'm curious about how did Mozilla do bug finding before Mythos? Did they use any non-AI bug finding tools?

2 more replies

qsera1d ago

The flipside of this is that with AI and prompt injection attacks, you don't need a browser vulnerability to be pwned!

gnabgib2d ago

16 day old story

Wired: Mozilla Used Anthropic's Mythos to Find and Fix 271 Bugs in Firefox (41 points, 18 comments) https://news.ycombinator.com/item?id=47853649

Ars: Mozilla: Anthropic's Mythos found 271 security vulnerabilities in Firefox 150 (33 points, 8 comments)https://news.ycombinator.com/item?id=47855384

1 more reply

languagehacker1d ago

I'm having more problems with Firefox 150 than I have had with any other browser update in years. I think I might be the only one though?

1 more reply

deferredgrant2d ago

A vuln finder is useful only if it respects the humans on the other end. Every bogus report taxes the same scarce attention needed for the real bugs.

xacky2d ago

I just hope they don't start ignoring human created bug reports, as there are still many that haven't been fixed for years.

nnm2d ago

I still don't know the exploit count for Mythos. Is it zero, one, or more?

1 more reply

kittikitti1d ago

This is great, and it reflects some of the changes I've seen in the changelogs of Firefox and many others that have utilize Mythos. I'm closely watching a supposed data wall for AI models and this is a clear indicator that AI capabilities can still become much more advanced even at this point in time. It makes me enthusiastic about future releases and optimizations. Thanks for sharing.

mmooss2d ago

> “That’s the key thing that has unlocked our ability to operate at the scale we’ve been operating at now,” he said. “It gives the engineer a crank they can pull that says: ‘Yep, this has the problem,’ and then you can iterate on the code and know clearly when you’ve fixed it and eventually land the test case in the tree such that you don’t regress it.”

I don't understand much of this paragraph:

* "a crank they can pull that says: ‘Yep, this has the problem,’": as in, ring an alarm? Does the LLM ring th alarm?

* "you can iterate on the code and know clearly when you’ve fixed it": Isn't that true of most bugs, assuming you do the normal thing and generate a test case? And I thought the LLM output test cases itself: "It will craft test cases. We have our existing fuzzing systems and tools to be able to run those tests" And are they claiming the LLM facilitates iterating?

* "and eventually land the test case in the tree": Don't you create the test case before the fix? And just a few words earlier they seemed to be working on the fix, not the test case. And see the prior point about test cases.

* "such that you don’t regress it.”: How is the LLM helping here?

Maybe I'm missing some fundamental unwritten assumption?

1 more reply

ChrisArchitect2d ago

The zero-days are numbered

https://news.ycombinator.com/item?id=47853277

rem10992d ago

I don't find that number very high. In a project of the size of Firefox, a new version of a compiler with stricter warnings or a draconian interpretation of the C standard can easily find 200 new bugs.

New tools find new bugs, but the oligarchy newspapers report on Mythos and not on clang-22.0.

1 more reply

legacynl1d ago

Mozilla is always looking for new revenue, how likely is it that Anthropic payed for this article?

Worf1d ago

Maybe if Mozilla focused less on new useless features and redesigns, they would be able to focus more on writing secure and bug-free code.

I'm not only talking about big things like

* Pocket,

* several major UI redesigns and

* the offline translations,

but even tiny useless things like

* browser.urlbar.trimURLs,

* putting the search query in the URL bar instead of the URL after searching from the URL bar,

* messing with the Edit and Resend feature for no reason (the good one that updates the content length is still available at devtools.netmonitor.features.newEditAndResend) and

* probably thousands of little shit like this that took a bunch of developer hours to implement.

All of the above should've been add-ons.

And of course, we know Mozilla spends a lot of money on things unrelated to Firefox at all. It's amazing Firefox is somewhat secure and stable compared to Chrome, which is backed by Google with their infinitely deep pockets.

This is a web browser, after all. Something most people use all the time. Something that accepts untrusted input from thousands of sources every day. People use it pretty much every aspect of their lives - banking, personal communication, porn, expressing political opinions. It's used for viewing PDFs, playing media files, for interacting with a whole bunch of APIs (that IMO shouldn't be part of the web, but they are). Security should be top priority.

1 more reply

j / k navigate · click thread line to collapse

160 comments

jerrythegerbil2d ago

Again, and this is important:

A bug is a bug. A “potential vulnerability” is a bug. A vulnerability is verifiable as having security implications with a proof of concept or other substantial evidence.

Words matter. Bugs matter. It’s important to fix large amounts of bugs, just as it always has been, and has been done. Let that be impressive on its own, because it IS impressive.

Mythos didn’t write 271 PoC for vulnerabilities and demonstrate code path reachability with security implications. Mythos found 271 valid bugs. Let that be enough.

6 more replies

kajman2d ago

2 more replies

input_sh2d ago

Original source: https://news.ycombinator.com/item?id=48051079

Diti2d ago

I hope to see the day when (or if) the LLMs get so good at spotting and fixing bugs that all that’s left for the Firefox engineers to do is to focus on adding new features.

6 more replies

tialaramex2d ago

They've only linked a few tickets, so of course maybe when we see all 271 actual distinct things the insight won't apply but all those I examined ended up as some C++ code with a nasty bug in it.

Firefox is written in several languages, only about 25% of it is in C++ but every single one of these issues seems to touch the C++.

3 more replies

benced2d ago

Reading this article in the context of the Zig folks refusing to even consider LLM-generated bugs certainly shapes my perspective on what technologies will be in my toolchain.

2 more replies

OhMeadhbh2d ago

I eventually left and wound up at Mozilla where there were a number of /* flawfinder ignore */ comments scattered throughout the code.

My guess is that Mythos just ignored the "flawfinder ignore" directives and reported the known vulnerabilities in the code.

2 more replies

fg1372d ago

I wonder if these models will get good + cheap enough so that people rarely reach for static analysis.

4 more replies

delichon2d ago

jwr2d ago

> Anyone building software can start using a harness with a modern model to find bugs and harden their code today. We recommend getting started now.

From what I understand, that is a recipe for getting quickly banned by commercial LLM providers?

danieltanfh952d ago

crummy2d ago

Curious if people think LLMs will lead to more secure or less secure software in five years.

11 more replies

lschueller2d ago

5 more replies

MetaverseClub2d ago

I'm curious about how did Mozilla do bug finding before Mythos? Did they use any non-AI bug finding tools?

2 more replies

qsera1d ago

The flipside of this is that with AI and prompt injection attacks, you don't need a browser vulnerability to be pwned!

gnabgib2d ago

16 day old story

Wired: Mozilla Used Anthropic's Mythos to Find and Fix 271 Bugs in Firefox (41 points, 18 comments) https://news.ycombinator.com/item?id=47853649

Ars: Mozilla: Anthropic's Mythos found 271 security vulnerabilities in Firefox 150 (33 points, 8 comments)https://news.ycombinator.com/item?id=47855384

1 more reply

languagehacker1d ago

I'm having more problems with Firefox 150 than I have had with any other browser update in years. I think I might be the only one though?

1 more reply

deferredgrant2d ago

A vuln finder is useful only if it respects the humans on the other end. Every bogus report taxes the same scarce attention needed for the real bugs.

xacky2d ago

I just hope they don't start ignoring human created bug reports, as there are still many that haven't been fixed for years.

nnm2d ago

I still don't know the exploit count for Mythos. Is it zero, one, or more?

1 more reply

kittikitti1d ago

mmooss2d ago

I don't understand much of this paragraph:

* "a crank they can pull that says: ‘Yep, this has the problem,’": as in, ring an alarm? Does the LLM ring th alarm?

* "such that you don’t regress it.”: How is the LLM helping here?

Maybe I'm missing some fundamental unwritten assumption?

1 more reply

ChrisArchitect2d ago

The zero-days are numbered

https://news.ycombinator.com/item?id=47853277

rem10992d ago

New tools find new bugs, but the oligarchy newspapers report on Mythos and not on clang-22.0.

1 more reply

legacynl1d ago

Mozilla is always looking for new revenue, how likely is it that Anthropic payed for this article?

Worf1d ago

Maybe if Mozilla focused less on new useless features and redesigns, they would be able to focus more on writing secure and bug-free code.

I'm not only talking about big things like

* Pocket,

* several major UI redesigns and

* the offline translations,

but even tiny useless things like

* browser.urlbar.trimURLs,

* putting the search query in the URL bar instead of the URL after searching from the URL bar,

* messing with the Edit and Resend feature for no reason (the good one that updates the content length is still available at devtools.netmonitor.features.newEditAndResend) and

* probably thousands of little shit like this that took a bunch of developer hours to implement.

All of the above should've been add-ons.

1 more reply

j / k navigate · click thread line to collapse