macOS 14.4 causes JVM crashes (opens in new tab)

(blogs.oracle.com)

235 pointskingds2y ago156 comments

156 comments

> macOS on Apple silicon processors (M1, M2, and M3) includes a feature which controls how and when dynamically generated code can be either produced (written) or executed on a per-thread basis. […] With macOS 14.4, when a thread is operating in the write mode, if a memory access to a protected memory region is attempted, macOS will send the signal SIGKILL instead.

This isn’t just any old thread triggering SIGKILL, it’s the JIT thread privileged to write to executable pages that is performing illegal memory accesses. That’s typically a sign of a bug, and allowing a thread with write access to executable pages to continue executing after that is a security risk.

But I know of other language runtimes that take advantage of installing signal handlers for SIGBUS/SIGSEGV to detect when they overflow a page so they can allocate more memory, etc. This saves from having to do an explicit overflow check on every allocation. Those threads aren’t given privilege to write to executable memory, so they’re not seeing this issue…

So this sounds like a narrow design problem the JVM is facing with their JIT thread. This blog doesn’t explain why their JIT thread needs to make illegal memory accesses instead of an explicit check.

Reason0772y ago

> "This blog doesn’t explain why their JIT thread needs to make illegal memory accesses instead of an explicit check."

Because explicit checks on every memory access (pointer dereference) makes Java significantly slower, even with compiler optimisations to remove redundant checks[1]. Memory protection is a fundamental, very useful, hardware feature and it's perfectly reasonable for user space language runtimes to take advantage of it.

Or, to put it another way, SIGSEGV has been a part of Unix-family OSes for decades. It works perfectly fine on Linux and Windows and there's no reason it shouldn't work on macOS.

[1] (Many years ago I worked on a cross-platform implementation of the Java runtime and wrote much of the threads and signal handling code. We had an option to enable explicit memory checks, which got us up and running faster on new platforms where the SIGSEGV handlers hadn't been written yet. From memory this made everything something like 30-50% slower, so it was definitely worthwhile to implement SIGSEGV handling. In our case SIGSEGV handlers were used both as part of the garbage collector/memory management and to implement Java's NullPointerException)

destring2y ago

As Linus famously said: Shut. Up. Don’t break userspace and then blame the user.

https://lkml.org/lkml/2012/12/23/75

extraduder_ire2y ago

Linux did break adobe flash when it used memmove like memcopy after fixing a kernel bug. Can't think of any other examples though.

3 more replies

stephenr2y ago

I feel like preventing illegal writes to protected memory is less "breaking user space" and more "protecting all space".

This is like arguing to allow the guy who can't drive and just pin-balls his way down the freeway bouncing of other cars, because to prevent him from driving would be to take away his personal freedoms.

jvmboi2y ago

There was no conceivable version of a road system where that behavior would ever be okay. However, it's not only conceivable but, apparently standard practice in systems programming, to "Try and Fail" instead of "Only Proceed if allowed".

So, if we want a tortured metaphor what JVM is doing is like trying to pass a turnstile to see if the pass is still valid so that on the happy path it saves the extra check. Now Apple decided that instead of just showing a red X and letting you buy a new pass, in the future you get shot in the back of the head if you try with an invalid pass.

1 more reply

amelius2y ago

So MacOS is trying to be smart, changes their API, and now we're blaming the JVM for doing something we don't understand?

At least they could have provided a path back to the old behavior.

chaostheory2y ago

That’s how MS works which leads to compatibility, but less stability. Historically with Apple, it’s their way or the highway. Less compatibility, but the OS is more stable.

funcDropShadow2y ago

If the OS changes its fundamental behaviors it is less stable not more.. It might be more secure, though I am not convinced in this case.

TheLoafOfBread2y ago

Does not seems stable, when all Java based applications are crashing.

2 more replies

hbbio2y ago

Sorry, that's not how security works.

sunshinerag2y ago

macOS is trying to keep its systems safe. Can’t leave the back door open for few who were used to it.

redeeman2y ago

this reads like a description dumbed down meant to be read by someone that does not know the technical details, and on top of that, with an agenda to mislead(in this circumstance) :)

amelius2y ago

The problem is that Apple closed the front door too.

kaba02y ago

What is it a backdoor to?

LadyCailin2y ago

It said it affected back to Java 8, so seems like this design has been there for a while, and since older versions are EOL, any Java level fix would not be patched back.

flohofwoe2y ago

I wonder what that means for the Android SDK, which AFAIK requires an ancient Java8 runtime for the command line SDK tools on macOS.

pjmlp2y ago

Android team has been forced to accept Kotlin without the Java ecosystem is an oxymoron, thus not only is ART updatable since Android 12, Java 17 LTS is now the latest supported version.

And on the SDK side, they need to use whatever InteliJ requires.

vbezhenar2y ago

Java 8 is not EOL.

1 more reply

saurik2y ago

They now seem to work with Java 11.

beeboobaa2y ago

If something works on OS release version 1 then it should still work on OS release version 2.

Or in apple vernacular, it should just work.

fwlr2y ago

“The Java Virtual Machine […] leverages the protected memory access signal mechanism both for correctness (e.g., to handle the truncation of memory mapped files) and for performance.”

Where by “protected memory access signal mechanism”, they mean SIGBUS/SIGSEGV, i.e., a segfault.

This is probably because the JVM is doing “zero cost access checks”, which is where you do the moral equivalent of:

    try {
      writeToFile()
    } catch(err) {
      if (err == SYSTEM_CRASH_IMMINENT) {
        changeFilePermissions()
        retry
      }
    }

…because it’s faster than checking file permissions before every write. (It’s a common pattern in systems programming, so it’s not quite as crazy as it sounds.)

I guess my opinion on this is that if you write your program to intentionally trigger and ignore kill(10) / kill(11) from the host OS, for the sake of a speed boost, you can’t really get too mad when the host OS gets fed up and starts sending kill(9) instead.

I also wonder what happens in the (extremely rare) case where the signal the JVM is trapping is a real segfault, and not an operating system signal.

dzaima2y ago

This isn't about files, this is about plain pages of RAM[0]. It is a basic CPU operation to trap on unmapped pages, and OSes rightfully expose this useful feature (in addition to using it themselves), allowing processes to do many things, from lazily-computed memory regions to removing significant amounts of overhead doing a thing the CPU will inevitably do itself anyway.

I believe the "the truncation of memory mapped files" section is for when the Java process memory-maps a file (as Java provides memory-mapping operations in its standard library, and probably also uses them itself), and afterwards some other unrelated process truncates the file, resulting in the OS quietly making (parts of) the mappings inaccessible. Here the process couldn't even check the permissions before reading (never mind how utterly hilariously inefficient that would be, defeating the purpose of memory-mapping) as the mappings could change between the check and subsequent read anyway.

[0]: https://bugs.java.com/bugdatabase/view_bug?bug_id=8327860, "I've managed to narrow this down to this small reproducer:" section

Jtsummers2y ago

And it's worth noting that while man mmap on macOS doesn't indicate what happens when the protections are violated (that is, if you try to read, write, or execute in violation of the set protections) the related function mprotect has this to say in macOS 14.3 (what I have available):

> When a program violates the protections of a page, it gets a SIGBUS or SIGSEGV signal.

(The Linux man pages for mmap and mprotect indicates SIGSEGV would be signaled.)

So the past use and assumption (SIGSEGV or SIGBUS) are consistent with the expectations of mmap and mprotect given the documentation provided.

fwlr2y ago

You are of course completely correct.

However, I still stand by my pseudocode - I claim that it will give a fairly accurate impression of the basic concept of zero-cost access checks to a reader who isn’t familiar with low-level systems programming. (That said, I have updated my comment to make it clear it’s more of a metaphor than a literal description.)

mrlsph2y ago

A talk at FOSDEM this year [0] describes how the OpenJDK JVM relies on triggering SIGSEGVs in order to efficiently implement thread-local safepoint checks - I wonder if that would also be affected?

[0]: https://mostlynerdless.de/blog/2023/07/31/the-inner-workings...

kaba02y ago

> I also wonder what happens in the (extremely rare) case where the signal the JVM is trapping is a real segfault, and not an operating system signal.

Just an educated guess, but the JVM knows if a thread may expect a segfault at a given point or not. If no thread expects one, then I assume the segfault handler just writes out that a segfault happened with some useful info, and terminates the program. I mean, I’m sure about the effect as I have caused a JVM to segfault a couple of times with native memory, so it handles it as expected.

w10-12y ago

"The issue was not present in the early access releases for macOS 14.4, so it was discovered only after Apple released the update."

I wonder if Oracle really didn't know beforehand.

Apple has long been telling people (writing JITs) that to write to executable memory, they need the correct entitlements (com.apple.security.cs.allow-jit, allow-unsigned--executable-memory, and or/ .disable-executable-page-protection). I wonder if Oracle has been ignoring them, satisfied with the signal-handler workaround, and Apple finally enforced their policy.

Apple also expects that developers deploying apps on MacOS that use Java have these entitlements configured on a per-app basis. Oracle likely objects that this is not really for the application developer to certify, since it's pretty much out of their control.

In any case, I'm doubting Oracle's release is the whole truth.

kaba02y ago

> Apple has long been telling people (writing JITs) that to write to executable memory, they need the correct entitlements (com.apple.security.cs.allow-jit, allow-unsigned--executable-memory, and or/ .disable-executable-page-protection). I wonder if Oracle has been ignoring them, satisfied with the signal-handler workaround, and Apple finally enforced their policy.

As far as I understand, that’s not the issue, the JIT itself works just fine. The JVM just uses the (quite common) trick that it doesn’t actually bound check everything, but let’s the hardware trigger an interrupt, expecting that to “bubble up” to the program at hand, so it can handle certain cases “for free”. This behavior was changed by apple, which causes issues.

exabrial2y ago

Why not just let it bubble up from the hardware? Seems like a redundant thing to build into the kernel

vips7L2y ago

This is honestly a wild and out there claim. The OpenJdk team would never want to see this happen to their user base. They’re some of the most professional programmers I’ve ever seen.

The whole truth is that the Apple kernel team broke user space.

zx80802y ago

The main question now is why hasn't it been exposed in pre-release 14.4. This could mean some very urgent and risky change got its way to the 14.4 release, or that the whole macos release process is broken and unstable.

pier252y ago

Amazing that Apple introduced a breaking change in a .4 release. Probably a mistake?

Also amazing it wasn't caught during the beta period.

empthought2y ago

Apple has never been a follower of semantic versioning.

lloeki2y ago

nitpick: Apple doesn't follow SemVer 2.0, but they do have a semantic versioning scheme, that is, the version components carry a certain semantic, it's just so that this semantic is different than the semantic defined by the SemVer 2.0 specification.

One can have any sort of semantic versioning that is not SemVer 2.0 compliant and still be useful, see e.g Rails or Ruby.

Even .Net assemblies are not SemVer 2.0 compliant: their pattern is maj.min.patch.build but SemVer 2.0 specifies that there can only be three conponents and build info must be behind a plus, like maj.min.patch+build

goosedragons2y ago

It wasn't in the public beta according to Oracle.

mvdtnz2y ago

This kind of behaviour is very common from Apple.

CharlesW2y ago

> "As a normal part of the just-in-time compile and execute cycle, processes running on macOS may access memory in protected memory regions."

I'm just a lowly JavaScript/TypeScript/PHP programmer, but what is the Very Good Reason that Java trying to access other processes' memory?

mayoff2y ago

I don’t think the article claims that a Java process tries to access some other process’s memory.

In a typical modern operating system, a memory page can be non-writable and non-executable, writable and non-executable, or non-writable and executable, but not simultaneously writable AND executable.

If you generate executable code at runtime, then you need write access to a page to write the executable code into that page. Then you need to tell the operating system to change the page from writable to executable.

If you then try to write to the page, you’ll get a signal (SIGSEGV or SIGBUS, according to the article).

Oracle’s JVM apparently relies on this behavior: a Java process sometimes tries to write to a page (in its own memory space) that is not marked writable. The JVM then catches the SIGSEGV and recovers (perhaps by asking the operating system to change the page back from executable to writable, or by arranging to write to a different page, or to abort the write operation altogether).

Traubenfuchs2y ago

Thank you, that explained it way better than the original link.

scialex2y ago

It's not. It's trying to access unmapped or protected memory in its own process.

Basically what its used for is to implement an 'if' that's super fast on the most likely path but super slow on the less likely path.

It's not super clear what its being used for (this is often used for the GC but the fact that graal isn't affected means that likely still works). Possibly they are using this to detect attempts to use inline-cache entries that have been deleted.

moonchild2y ago

object.field is implemented as a direct load from the object; if the object turned out to be null, then the resultant signal is caught and turned into a NullPointerException

moonchild2y ago

sorry, I didn't read the linked post closely enough—from my reading, this case is not one of the ones that was broken

toast02y ago

In a virtual memory operating system, every program has its own address space. Accessing an unmapped address is not the same as trying to access another process's memory.

It's also pretty common to use memory protection to autoextend stacks... Allocate the stack size you need, ask the OS to mark the page(s) after the stack as protected, catch the signal when you hit the protection, allocate some more stack and a new protected page unless the stack is too big. Works for heaps too.

Let the MMU hardware check accesses, so you don't have to check everything in software all the time.

olliej2y ago

It depends on exactly what is being done.

A fairly common idiom is to use memory protection to provide zero cost access checks, as you can generally catch the signals produced by most memory faults, and then work out where things went wrong and convert the memory access error into a catchable exception, or to lazily construct data structures or code.

So you want the trap, but the trap itself can be handled. It sounds like there’s been a semantic change when the trap occurs for execution of an address or an access to an executable page.

There are also a bunch of poorly documented Mac APIs to inform the memory manager and linker about JIT regions and I wonder if it’s related to those. It really depends on exactly what oracle’s jvm is trying to do, and what the subsequent cause of the fault is.

Certainly it’s a less than optimal failure though :-/

royjacobs2y ago

The reasons are literally spelled out in the following paragraphs.

CharlesW2y ago

I’m asking because the reasons seem dumb to me, which is why I’m asking people smarter than I am about low-level memory management if they’re legitimate.

2 more replies

samus2y ago

Accessing such areas is sometimes done deliberately since programmers could rely on the OS telling them what just happened using signals instead of nuking the process wholesale. Doing it without signals is usually slow and/or clunky (null-pointer checks, read/write permissions, existence of pages), or straight out impossible.

Accessing other processes' memory is not the concern since virtual memory provides each process the illusion of having the entire address space for itself.

8crazyideas2y ago

I just bought a MacBook Pro with the M3 Max chip and installed MATLAB R2023b. Sonoma 14.3 is in place. As a requirement, I had to also install Corretto 8. MathWorks only supports the Java 8 JRE included with Amazon Corretto 8. I am already having several problems in MATLAB with his new setup. Can I assume that updating to Sonoma 14.4 might very well cause even more problems? I really don't understand any of this.

xcv1232y ago

".. is affecting all Java versions from Java 8 to the early access builds of JDK 22. There is no workaround available .."

Do not update until Apple fixes the issue.

e402y ago

Isn’t this something Oracle will be fixing? Seems like it from other comments here.

viraptor2y ago

No. Oracle will not fix jre8 because it's too old. Oracle will not fix corretto jre, because it's not theirs.

1 more reply

ls6122y ago

Is it Apple or Oracle who should rightly be fixing this issue?

xcv1232y ago

It's a bug in macOS and it breaks POSIX compliance. Oracle can develop a workaround but Apple should fix it in their next update.

1 more reply

dimask2y ago

I would not update for the time being. If it works it will probably break, if it is broken it may break more.

Btw what sort of problems are you facing? I have had problems with closing figures, but figured it out eventually with a workaround [0].

[0] https://se.mathworks.com/matlabcentral/answers/2027964-matla...

tebruno992y ago

It is always funny to Me when Apple zealots come into threads blaming everyone but Apple that software broke. Complaining Java doesn’t follow Apple standards or some crap. Then 9 days later Apple issues a fix because they did indeed break it.

w10-12y ago

Yes, you mean: https://support.apple.com/en-us/109035

Can you tell from this or any other Oracle bug whether Apple is bending its rules for Java? I can't tell either way.

javajosh2y ago

It seems highly unlikely that the macos people don't test anything on the jvm during acceptance. It's even more suspicious that this change didn't happen during the public beta. Is it possible that Apple is firing a warning shot at Java? Even as a huge fan of Hanlon's razor, this seems like such an enormous oversight its hard for me to ascribe it to incompetence.

flohofwoe2y ago

> it seems highly unlikely that the macos people don't test anything on the jvm during acceptance.

I would be surprised if they do to be honest (Apple doesn't even catch obvious bugs in the new macOS settings panel, which really makes me wonder if there is a software QA process at all). For 3rd party apps they seem to rely on the software vendors to holler if a macOS update breaks their app. That's why the macOS prerelease versions exist. But since the bug wasn't present in the prerelease, affected vendors couldn't catch it. It's still a fuckup in Apple release process of course (which tbh also isn't surprising).

wyclif2y ago

What is the bug in the new System Settings panel?

flohofwoe2y ago

I'm stumbling over a couple of annoying problems when opening the DNS server subpanel via search (because without search it's pretty much impossible to find that panel, but that's a separate issue).

One is that occasionally there's an error popup "Extension process Network(4433) exited." just when clicking on the 'DNS servers' search result.

The other is that when accidentally hitting "Enter" after entering a new DNS server address the entire DNS Server subpanel will close even though I want to enter a second address (which sucks from a user perspective, but might even be consistent with the UX guidelines, but OTH I would expect pressing Enter on a text input box would not close the UI panel which contains the text input box, but maybe that's just me). But then clicking on the previous search result 'DNS servers' to open the DNS servers panel again, the click does nothing this time.

One has to clear the search box, enter the search term again, perform a new search, and then click the search result 'DNS servers' again to get the subpanel for entering DNS server addresses.

I guess the search is also broken like this for other subpanels, but changing the DNS servers is about the only situation where I'm using the search box.

In the old settings panel all that worked as expected (and apart from that, everything also was a lot snappier, somehow Apple engineers managed to create simple Settings window that suffers from performance problems, but again, different issue).

overstay89302y ago

No idea what OP if referring to but I could pretty consistently cause Settings to soft lock for a few months by loading a configuration profile while the settings window was open, just small things like that are basically everywhere in macOS.

Don't even get me started on Screen Time bugs...

bzzzt2y ago

It's not a problem that breaks all JVM based software instantly. So maybe Apple tests but not long enough to trigger this issue.

I really don't know what Apple would be 'warning' against. Don't use Java? There are tens of thousands of business and development tools depending on the JVM. Blocking Java would diminish the value of macOS tremendously and doing so without warning would open Apple up to lots of lawsuits.

javajosh2y ago

>It's not a problem that breaks all JVM based software instantly.

Do you know how long it takes to reproduce? The OP was light on details here. I assume that a memory access issue with the JIT would pop up pretty quickly, though.

bzzzt2y ago

I'm running an Eclipse development environment that's regularly compiling a huge codebase. Had 2 crashes this week after updating so less than once a day. That's assuming it isn't an Eclipse bug ;)

1 more reply

Anamon2y ago

Another example for how preventing users from doing rollbacks is a terrible practice. Even if it's not your application's fault, users may have very good reasons to revert an update, if only temporarily.

This also bothers me on Android. Sometimes, an app update may break something and prevent me from using it. But Google doesn't allow me to reinstall a previously published version from the Play Store. If I don't have to (or can't easily) do without that application until a fix might be released, my only option is to find an older release on some shady mirror site.

sunshinerag2y ago

considering macOS is doing the right thing, shouldn't the title read JVM crashes on macOS 14.4 ?

metanonsense2y ago

Even if this was the right thing, they could / should have changed this behavior in a pre-release because that's exactly the kind of API change in the OS that will catch people off-guard. As another commenter wrote, I'd consider this either a serious flaw in Apples release process or they learned about some very dangerous vulnerability where the old behavior was abused and they decided that they rather annoy all users and vendors of Java software out there than tolerate the vulnerability in MacOS. But in this case I'd surmise that at least now Oracle would have been informed about this.

olliej2y ago

A gross and low performance option for now might be to run Java under Rosetta, but I’m saying that based on them saying that this is apple silicon specifically and processes under rosetta have a bunch of quirks to support intel semantics. This would allow you to work around this for now.

That said I’m curious what the exact scenario that leads to this is, I’m assuming it’s not common as you would expect it to have come up during betas and pre -release seeds.

grodriguez1002y ago

> I’m assuming it’s not common as you would expect it to have come up during betas and pre -release seeds.

The article specifically says that the issue was not present in early access releases, so it was not possible to discover it before the actual release.

millzlane2y ago

14.4 also killed automated device enrollment in VMware's workspaceone. We're having to downgrade brand new MacBooks to 14.3 using configurator.

ivan_gammel2y ago

I wonder if it's the same reason as why Civilization 6 stopped working on iPadOS 17.4. Did they change something deep in the kernel for DMA compliance?

MaxBarraclough2y ago

Is the signals change in macOS likely to affect JIT-based systems other than the OpenJDK JVM?

sebazzz2y ago

Isn't this just W^X?

not_me_ever2y ago

Wait, they write to protected memory, and get killed.

:tripplefacepalm:

Somebody hire some engineers at Oracle.

kaba02y ago

Sarcasm only works when you are actually smart and know what you are talking about.

erik_seaberg2y ago

To finish validating a request and then start executing it creates a race condition. That's why execution always needed to fail in a recoverable way.

tiffanyh2y ago

macOS dark ages.

I wonder if we’re about to enter 4-5 years of macOS “dark ages”, due to Apple grappling with EU/DMA.

Much like Microsoft in early 2000s, between IE/lawsuit and grappling with internet security/viruses. Windows XP, launched in 2001, was considered by most a great OS, didn’t have another good OS successor until 8-years later (Windows 7).

mdhb2y ago

It’s not at all like they didn’t have the time or the resources to deal with this.

I think we already saw some of this in particular with the recent bullshit they tried to pull with PWAs in iOS 17.4 that they were hoping to just let things break and were hoping that they could shift the blame and anger towards the EU instead.

xyst2y ago

Apple and macOS is slowly becoming another Windows in terms of stability.

There was a HN post about a hashicorp founder using Linux within a vm on their mbp. Might adopt that same approach, if I can find the og post.

nullwarp2y ago

This is what I do when my job forced me to use a mac. I think the only thing I installed on the mac outside of it was Firefox.

Worked great for years before I changed jobs that let me bring my own hardware finally.

neeleshs2y ago

What is your preferred hardware and flavor of Linux for this? I'm trying to do the same

rzzzt2y ago

Rancher Desktop used Lima + QEMU behind the scenes: https://lima-vm.io/

mktk10012y ago

Tbh no one matches the hardware quality of MacBooks. But I refuse to use them on principle. Thinkpad t14 been serving me well (using fedora).

Kipters2y ago

To be fair, this is the kind of breakage I'd expect from macOS, but never from Windows

open5922y ago

Here’s the YouTube link from Mitchell. I was thinking about doing something similar lately too.

https://youtu.be/ubDMLoWz76U?si=ipmho73-r9FzZpBp

stevefan19992y ago

Windows, as a kernel itself and by extension as a server, is very resilient stable to a point that there is a Windows NT 4 machine of a certain railroad control system still running continuously for 14 years without any restarts. It even still reboots back without problem in disastrous cases such as power loss due to hurricane or earthquake. Trust me, it is made by Dave Culter, it, just, works.

It is really the client facing side of Windows that really sucks, (warning: explicitly strong language) such as having really shitty software known as Office, like god why Word and not Latex, and why spreadsheet when we have database that we can query efficiently? Or not being able to have multi-user RDP session due to Microsoft having licensing dispute with Citrix about 20-ish years ago (fuck you Citrix, you asshole!). Or why do I have to do a lot of hoops and install a lot of "C++ redistributable" for running some antique software? Or why do I have to jump through a lot of group policy simply to enable WinRM and get remote powershell management?

Either way, I'm typing this on a Windows 11 desktop with WSL2 on. The hybrid experience is incredible, unless you need some performance critical app (WSL2 is in general slower than bare metal Windows and bare metal Linux itself, of course, except in machine learning).

Things like 9P to cross the Window file system access also introduced a lot of pain such as permission control because Windows does not have a POSIX-like permission system, like instead of having a simple 2 bytes that split into 3 octal number (there is a reason it is maxed out at 777), you have an incredibly sophisticated, capability and token-based access control system dated almost 30 years ago that Linux doesn't even have back in the day! But that pile of shit is now full of bugs and exploits such as token/handle duplication. (oh yes I'm talking about black hat territory as I also do some red team CTF regarding these stuff)

secondcoming2y ago

When's the last time you had a BSOD on Windows? I honestly can't recall.

1over1372y ago

A long time in fact. But macOS 14.3 kernel paniced on me last week.

threeseed2y ago

Depends on what you're doing.

It happens fairly often for me with more exotic hardware e.g. Infiniband or when I push the hardware too hard i.e. parallel Rust builds.

npalli2y ago

  An issue introduced by macOS 14.4, which causes Java process to terminate unexpectedly, is affecting all Java versions from Java 8 to the early access builds of JDK 22

If this affects so many versions of Java and nobody notices, is anyone even using Java on macOS?

semiquaver2y ago

Plenty of people develop for java on macs. The issue is that per the article this behavior was not present in the early access macOS builds, which means something changed between beta and release.

latchkey2y ago

> is anyone even using Java on macOS?

IntelliJ IDEA, the product itself, is JVM based.

threeseed2y ago

Actually everything from Jetbrains does.

So Pycharm, Rustrover etc.

latchkey2y ago

All of those are effectively language specific plugins of the base IDE platform, which is why I didn't list them individually.

bombcar2y ago

Minecraft runs on various Javas.

And there's a known issue with an interaction between minecraft, Java, and the video drivers that crashes out and it can be traced back all the way to here: https://github.com/glfw/glfw/issues/1997

It's not fixed.

bzzzt2y ago

It's not terminating directly. I've seen a few IDE crashes this week, less than one per day, but since there's no log there's no easy way to determine it's related to a macOS change.

Mustachio2y ago

https://youtrack.jetbrains.com/issue/JBR-6802

The JetBrains team has already figured it out as well.

vips7L2y ago

Reading the comments from David Wartell in that thread is enough internet for me today. This guy is CTO of some company and is just harassing the thread for a fix ETA without understanding the problem at all.

1 more reply

LgWoodenBadger2y ago

IntelliJ did this twice to me on Thursday and there was a crash log both times. I only reported one to Apple.

Did you check the Console app for crash reports?

lanna2y ago

Maybe not a lot of macOS devs use Java, but a lot of Java devs use macOS

seanalltogether2y ago

Also, if you're a mobile developer you likely have a Mac, and if you're a mobile dev that doesn't target iOS exclusively, then you run java.

re-thc2y ago

If you use a Jetbrains IDE you already use Java e.g. Webstorm and Pycharm.

comonoid2y ago

It broke since recent MacOS 14.4, even at 14.4 betas it worked.

karmakaze2y ago

I'm running RubyMine on 14.3.1 all the time and it's fine. Should I hold off updating to 14.4 until the dust has settled?

merb2y ago

You should I had some Rider and IntelliJ crashes. The crash does not happen often tough, but if your in the middle of writing code it can get you out of the flow.

CharlesW2y ago

For one, it doesn't affect all versions of Java. Java 20 (an LTS release) and 21, for example, don't have this problem.

pritambarhate2y ago

JDK 21 is LTS not JDK 20.

https://www.oracle.com/in/java/technologies/downloads/

grodriguez1002y ago

The article says that the issue “is affecting all Java versions from Java 8 to the early access builds of JDK 22” now.

bremac2y ago

Per the bug report, all versions since Java 8 are affected.

CharlesW2y ago

> Affected Version: 8,11,17,21,22

This has changed (they added 21) since I posted the comment above, so it looks like they’re still getting a handle on it.

nurettin2y ago

Sonoma has been out for only one week!

stalfosknight2y ago

Sonoma became generally available September 26, 2023.

1 more reply

DuskHorizon2y ago

Well, that’s why Apple forbids use of private APIs in the App Store apps. If you built all your tech stack on the foundation of some peculiar nondocumented platform’s behavior, don’t be surprised when this stack breaks.

bhawks2y ago

This is not an API. It's the handling of writes to memory the process has protected. In the past this would generate a signal the process could handle and recover from. Now it generates a sigkill which is uncatchable / unrecoverable from.

These behaviours have been historically well documented.

DuskHorizon2y ago

All system idiosyncrasies are APIs in the long run ;)

fifteen15062y ago

The change of a SIGSEGV to a SIGKILL, seriously?

2 more replies

j / k navigate · click thread line to collapse

156 comments

riscy2y ago

Reason0772y ago

> "This blog doesn’t explain why their JIT thread needs to make illegal memory accesses instead of an explicit check."

Or, to put it another way, SIGSEGV has been a part of Unix-family OSes for decades. It works perfectly fine on Linux and Windows and there's no reason it shouldn't work on macOS.

destring2y ago

As Linus famously said: Shut. Up. Don’t break userspace and then blame the user.

https://lkml.org/lkml/2012/12/23/75

extraduder_ire2y ago

Linux did break adobe flash when it used memmove like memcopy after fixing a kernel bug. Can't think of any other examples though.

3 more replies

stephenr2y ago

I feel like preventing illegal writes to protected memory is less "breaking user space" and more "protecting all space".

jvmboi2y ago

1 more reply

amelius2y ago

So MacOS is trying to be smart, changes their API, and now we're blaming the JVM for doing something we don't understand?

At least they could have provided a path back to the old behavior.

chaostheory2y ago

That’s how MS works which leads to compatibility, but less stability. Historically with Apple, it’s their way or the highway. Less compatibility, but the OS is more stable.

funcDropShadow2y ago

If the OS changes its fundamental behaviors it is less stable not more.. It might be more secure, though I am not convinced in this case.

TheLoafOfBread2y ago

Does not seems stable, when all Java based applications are crashing.

2 more replies

hbbio2y ago

Sorry, that's not how security works.

sunshinerag2y ago

macOS is trying to keep its systems safe. Can’t leave the back door open for few who were used to it.

redeeman2y ago

this reads like a description dumbed down meant to be read by someone that does not know the technical details, and on top of that, with an agenda to mislead(in this circumstance) :)

amelius2y ago

The problem is that Apple closed the front door too.

kaba02y ago

What is it a backdoor to?

LadyCailin2y ago

It said it affected back to Java 8, so seems like this design has been there for a while, and since older versions are EOL, any Java level fix would not be patched back.

flohofwoe2y ago

I wonder what that means for the Android SDK, which AFAIK requires an ancient Java8 runtime for the command line SDK tools on macOS.

pjmlp2y ago

Android team has been forced to accept Kotlin without the Java ecosystem is an oxymoron, thus not only is ART updatable since Android 12, Java 17 LTS is now the latest supported version.

And on the SDK side, they need to use whatever InteliJ requires.

vbezhenar2y ago

Java 8 is not EOL.

1 more reply

saurik2y ago

They now seem to work with Java 11.

beeboobaa2y ago

If something works on OS release version 1 then it should still work on OS release version 2.

Or in apple vernacular, it should just work.

fwlr2y ago

“The Java Virtual Machine […] leverages the protected memory access signal mechanism both for correctness (e.g., to handle the truncation of memory mapped files) and for performance.”

Where by “protected memory access signal mechanism”, they mean SIGBUS/SIGSEGV, i.e., a segfault.

This is probably because the JVM is doing “zero cost access checks”, which is where you do the moral equivalent of:

    try {
      writeToFile()
    } catch(err) {
      if (err == SYSTEM_CRASH_IMMINENT) {
        changeFilePermissions()
        retry
      }
    }

…because it’s faster than checking file permissions before every write. (It’s a common pattern in systems programming, so it’s not quite as crazy as it sounds.)

I also wonder what happens in the (extremely rare) case where the signal the JVM is trapping is a real segfault, and not an operating system signal.

dzaima2y ago

[0]: https://bugs.java.com/bugdatabase/view_bug?bug_id=8327860, "I've managed to narrow this down to this small reproducer:" section

Jtsummers2y ago

> When a program violates the protections of a page, it gets a SIGBUS or SIGSEGV signal.

(The Linux man pages for mmap and mprotect indicates SIGSEGV would be signaled.)

So the past use and assumption (SIGSEGV or SIGBUS) are consistent with the expectations of mmap and mprotect given the documentation provided.

fwlr2y ago

You are of course completely correct.

mrlsph2y ago

A talk at FOSDEM this year [0] describes how the OpenJDK JVM relies on triggering SIGSEGVs in order to efficiently implement thread-local safepoint checks - I wonder if that would also be affected?

[0]: https://mostlynerdless.de/blog/2023/07/31/the-inner-workings...

kaba02y ago

> I also wonder what happens in the (extremely rare) case where the signal the JVM is trapping is a real segfault, and not an operating system signal.

w10-12y ago

"The issue was not present in the early access releases for macOS 14.4, so it was discovered only after Apple released the update."

I wonder if Oracle really didn't know beforehand.

In any case, I'm doubting Oracle's release is the whole truth.

kaba02y ago

exabrial2y ago

Why not just let it bubble up from the hardware? Seems like a redundant thing to build into the kernel

vips7L2y ago

This is honestly a wild and out there claim. The OpenJdk team would never want to see this happen to their user base. They’re some of the most professional programmers I’ve ever seen.

The whole truth is that the Apple kernel team broke user space.

zx80802y ago

pier252y ago

Amazing that Apple introduced a breaking change in a .4 release. Probably a mistake?

Also amazing it wasn't caught during the beta period.

empthought2y ago

Apple has never been a follower of semantic versioning.

lloeki2y ago

One can have any sort of semantic versioning that is not SemVer 2.0 compliant and still be useful, see e.g Rails or Ruby.

goosedragons2y ago

It wasn't in the public beta according to Oracle.

mvdtnz2y ago

This kind of behaviour is very common from Apple.

CharlesW2y ago

> "As a normal part of the just-in-time compile and execute cycle, processes running on macOS may access memory in protected memory regions."

I'm just a lowly JavaScript/TypeScript/PHP programmer, but what is the Very Good Reason that Java trying to access other processes' memory?

mayoff2y ago

I don’t think the article claims that a Java process tries to access some other process’s memory.

If you then try to write to the page, you’ll get a signal (SIGSEGV or SIGBUS, according to the article).

Traubenfuchs2y ago

Thank you, that explained it way better than the original link.

scialex2y ago

It's not. It's trying to access unmapped or protected memory in its own process.

Basically what its used for is to implement an 'if' that's super fast on the most likely path but super slow on the less likely path.

moonchild2y ago

object.field is implemented as a direct load from the object; if the object turned out to be null, then the resultant signal is caught and turned into a NullPointerException

moonchild2y ago

sorry, I didn't read the linked post closely enough—from my reading, this case is not one of the ones that was broken

toast02y ago

In a virtual memory operating system, every program has its own address space. Accessing an unmapped address is not the same as trying to access another process's memory.

Let the MMU hardware check accesses, so you don't have to check everything in software all the time.

olliej2y ago

It depends on exactly what is being done.

So you want the trap, but the trap itself can be handled. It sounds like there’s been a semantic change when the trap occurs for execution of an address or an access to an executable page.

Certainly it’s a less than optimal failure though :-/

royjacobs2y ago

The reasons are literally spelled out in the following paragraphs.

CharlesW2y ago

I’m asking because the reasons seem dumb to me, which is why I’m asking people smarter than I am about low-level memory management if they’re legitimate.

2 more replies

samus2y ago

Accessing other processes' memory is not the concern since virtual memory provides each process the illusion of having the entire address space for itself.

8crazyideas2y ago

xcv1232y ago

".. is affecting all Java versions from Java 8 to the early access builds of JDK 22. There is no workaround available .."

Do not update until Apple fixes the issue.

e402y ago

Isn’t this something Oracle will be fixing? Seems like it from other comments here.

viraptor2y ago

No. Oracle will not fix jre8 because it's too old. Oracle will not fix corretto jre, because it's not theirs.

1 more reply

ls6122y ago

Is it Apple or Oracle who should rightly be fixing this issue?

xcv1232y ago

It's a bug in macOS and it breaks POSIX compliance. Oracle can develop a workaround but Apple should fix it in their next update.

1 more reply

dimask2y ago

I would not update for the time being. If it works it will probably break, if it is broken it may break more.

Btw what sort of problems are you facing? I have had problems with closing figures, but figured it out eventually with a workaround [0].

[0] https://se.mathworks.com/matlabcentral/answers/2027964-matla...

tebruno992y ago

w10-12y ago

Yes, you mean: https://support.apple.com/en-us/109035

Can you tell from this or any other Oracle bug whether Apple is bending its rules for Java? I can't tell either way.

javajosh2y ago

flohofwoe2y ago

> it seems highly unlikely that the macos people don't test anything on the jvm during acceptance.

wyclif2y ago

What is the bug in the new System Settings panel?

flohofwoe2y ago

I'm stumbling over a couple of annoying problems when opening the DNS server subpanel via search (because without search it's pretty much impossible to find that panel, but that's a separate issue).

One is that occasionally there's an error popup "Extension process Network(4433) exited." just when clicking on the 'DNS servers' search result.

One has to clear the search box, enter the search term again, perform a new search, and then click the search result 'DNS servers' again to get the subpanel for entering DNS server addresses.

I guess the search is also broken like this for other subpanels, but changing the DNS servers is about the only situation where I'm using the search box.

overstay89302y ago

Don't even get me started on Screen Time bugs...

bzzzt2y ago

It's not a problem that breaks all JVM based software instantly. So maybe Apple tests but not long enough to trigger this issue.

javajosh2y ago

>It's not a problem that breaks all JVM based software instantly.

Do you know how long it takes to reproduce? The OP was light on details here. I assume that a memory access issue with the JIT would pop up pretty quickly, though.

bzzzt2y ago

I'm running an Eclipse development environment that's regularly compiling a huge codebase. Had 2 crashes this week after updating so less than once a day. That's assuming it isn't an Eclipse bug ;)

1 more reply

Anamon2y ago

sunshinerag2y ago

considering macOS is doing the right thing, shouldn't the title read JVM crashes on macOS 14.4 ?

metanonsense2y ago

olliej2y ago

That said I’m curious what the exact scenario that leads to this is, I’m assuming it’s not common as you would expect it to have come up during betas and pre -release seeds.

grodriguez1002y ago

> I’m assuming it’s not common as you would expect it to have come up during betas and pre -release seeds.

The article specifically says that the issue was not present in early access releases, so it was not possible to discover it before the actual release.

millzlane2y ago

14.4 also killed automated device enrollment in VMware's workspaceone. We're having to downgrade brand new MacBooks to 14.3 using configurator.

ivan_gammel2y ago

I wonder if it's the same reason as why Civilization 6 stopped working on iPadOS 17.4. Did they change something deep in the kernel for DMA compliance?

MaxBarraclough2y ago

Is the signals change in macOS likely to affect JIT-based systems other than the OpenJDK JVM?

sebazzz2y ago

Isn't this just W^X?

not_me_ever2y ago

Wait, they write to protected memory, and get killed.

:tripplefacepalm:

Somebody hire some engineers at Oracle.

kaba02y ago

Sarcasm only works when you are actually smart and know what you are talking about.

erik_seaberg2y ago

To finish validating a request and then start executing it creates a race condition. That's why execution always needed to fail in a recoverable way.

tiffanyh2y ago

macOS dark ages.

I wonder if we’re about to enter 4-5 years of macOS “dark ages”, due to Apple grappling with EU/DMA.

mdhb2y ago

It’s not at all like they didn’t have the time or the resources to deal with this.

xyst2y ago

Apple and macOS is slowly becoming another Windows in terms of stability.

There was a HN post about a hashicorp founder using Linux within a vm on their mbp. Might adopt that same approach, if I can find the og post.

nullwarp2y ago

This is what I do when my job forced me to use a mac. I think the only thing I installed on the mac outside of it was Firefox.

Worked great for years before I changed jobs that let me bring my own hardware finally.

neeleshs2y ago

What is your preferred hardware and flavor of Linux for this? I'm trying to do the same

rzzzt2y ago

Rancher Desktop used Lima + QEMU behind the scenes: https://lima-vm.io/

mktk10012y ago

Tbh no one matches the hardware quality of MacBooks. But I refuse to use them on principle. Thinkpad t14 been serving me well (using fedora).

Kipters2y ago

To be fair, this is the kind of breakage I'd expect from macOS, but never from Windows

open5922y ago

Here’s the YouTube link from Mitchell. I was thinking about doing something similar lately too.

https://youtu.be/ubDMLoWz76U?si=ipmho73-r9FzZpBp

stevefan19992y ago

secondcoming2y ago

When's the last time you had a BSOD on Windows? I honestly can't recall.

1over1372y ago

A long time in fact. But macOS 14.3 kernel paniced on me last week.

threeseed2y ago

Depends on what you're doing.

It happens fairly often for me with more exotic hardware e.g. Infiniband or when I push the hardware too hard i.e. parallel Rust builds.

npalli2y ago

  An issue introduced by macOS 14.4, which causes Java process to terminate unexpectedly, is affecting all Java versions from Java 8 to the early access builds of JDK 22

If this affects so many versions of Java and nobody notices, is anyone even using Java on macOS?

semiquaver2y ago

Plenty of people develop for java on macs. The issue is that per the article this behavior was not present in the early access macOS builds, which means something changed between beta and release.

latchkey2y ago

> is anyone even using Java on macOS?

IntelliJ IDEA, the product itself, is JVM based.

threeseed2y ago

Actually everything from Jetbrains does.

So Pycharm, Rustrover etc.

latchkey2y ago

All of those are effectively language specific plugins of the base IDE platform, which is why I didn't list them individually.

bombcar2y ago

Minecraft runs on various Javas.

And there's a known issue with an interaction between minecraft, Java, and the video drivers that crashes out and it can be traced back all the way to here: https://github.com/glfw/glfw/issues/1997

It's not fixed.

bzzzt2y ago

It's not terminating directly. I've seen a few IDE crashes this week, less than one per day, but since there's no log there's no easy way to determine it's related to a macOS change.

Mustachio2y ago

https://youtrack.jetbrains.com/issue/JBR-6802

The JetBrains team has already figured it out as well.

vips7L2y ago

1 more reply

LgWoodenBadger2y ago

IntelliJ did this twice to me on Thursday and there was a crash log both times. I only reported one to Apple.

Did you check the Console app for crash reports?

lanna2y ago

Maybe not a lot of macOS devs use Java, but a lot of Java devs use macOS

seanalltogether2y ago

Also, if you're a mobile developer you likely have a Mac, and if you're a mobile dev that doesn't target iOS exclusively, then you run java.

re-thc2y ago

If you use a Jetbrains IDE you already use Java e.g. Webstorm and Pycharm.

comonoid2y ago

It broke since recent MacOS 14.4, even at 14.4 betas it worked.

karmakaze2y ago

I'm running RubyMine on 14.3.1 all the time and it's fine. Should I hold off updating to 14.4 until the dust has settled?

merb2y ago

You should I had some Rider and IntelliJ crashes. The crash does not happen often tough, but if your in the middle of writing code it can get you out of the flow.

CharlesW2y ago

For one, it doesn't affect all versions of Java. Java 20 (an LTS release) and 21, for example, don't have this problem.

pritambarhate2y ago

JDK 21 is LTS not JDK 20.

https://www.oracle.com/in/java/technologies/downloads/

grodriguez1002y ago

The article says that the issue “is affecting all Java versions from Java 8 to the early access builds of JDK 22” now.

bremac2y ago

Per the bug report, all versions since Java 8 are affected.

CharlesW2y ago

> Affected Version: 8,11,17,21,22

This has changed (they added 21) since I posted the comment above, so it looks like they’re still getting a handle on it.