Two comments:
- LLVM IR is actually remarkably stable these days. I was able to rebase Fil-C from LLVM 17 to 20 in a single day of work. In other projects I’ve maintained an LLVM pass that worked across multiple LLVM versions, and it was straightforward to do.
- LICM register pressure is a big issue, especially when the source isn’t C or C++. I don’t think the problem here is necessarily LICM; it might be that regalloc needs to be taught to rematerialize.
It knows how to rematerialize, and has for a long time, but the backend is generally more local/has less visibility than the optimizer. This causes it to struggle to consistently undo bad decisions LICM may have made.
That's very cool, I didn't realize that.
> but the backend is generally more local/has less visibility than the optimizer
I don't really buy that. It's operating on SSA, so in practice it has exactly the same view as LICM (to my knowledge, LICM doesn't cross function boundaries).
LICM can't possibly know the cost of hoisting, whereas regalloc does have decent visibility into cost. That's why this feels like a regalloc remat problem to me.
I'm by no means an LLVM expert, but my takeaway from when I played with it a couple of years ago was that it is more like a union of different languages. Every tool and component in the LLVM universe had its own set of rules and requirements for the LLVM IR it understands. The IR is more like a common vocabulary than a common language.
My bewilderment about LLVM IR not being stable between versions has given way to the understanding that this freedom is necessary.
Do you think I misunderstood?
No. Here are two good ways to think about it:
1. It's the C programming language represented in SSA form, with some of the UB in the C spec given a strict definition.
2. It's a low-level representation, suitable for lowering other languages to. Theoretically, you could lower anything to it, since it's Turing-complete. Practically, it's only suitable for lowering sufficiently statically typed languages to it.
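As a rough illustration of point 1, this is approximately what clang emits for a trivial C function (the exact output varies by version, target, and optimization level). Note the `nsw` flag: it is where C's signed-overflow UB gets its strict, instruction-level definition.

```llvm
; int add(int a, int b) { return a + b; }
define i32 @add(i32 %a, i32 %b) {
entry:
  ; "nsw" (no signed wrap): signed overflow produces a poison value,
  ; LLVM's strict encoding of the UB in the C spec.
  %sum = add nsw i32 %a, %b
  ret i32 %sum
}
```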
> Every tool and component in the LLVM universe had its own set of rules and requirements for the LLVM IR that it understands.
Definitely not. All of those tools have a shared understanding of what happens when LLVM executes on a particular target and data layout.
The only flexibility is that you're allowed to alter some of the semantics on a per-target and per-datalayout basis. Targets have limited power to change semantics (for example, they cannot change what "add" means). Data layout is its own IR, and that IR has its own semantics - and everything that deals with LLVM IR has to deal with the data layout "IR" and has to understand it the same way.
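For example, the data layout string clang uses for a typical x86-64 ELF target looks roughly like this (the exact string varies by LLVM version and target; the mnemonics are documented in the LLVM LangRef):

```llvm
target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
; e            little-endian
; m:e          ELF-style symbol mangling
; i64:64       i64 is 64-bit aligned
; f80:128      x86 long double is 128-bit aligned
; n8:16:32:64  native integer widths
; S128         natural stack alignment is 128 bits
```

Every pass that reasons about sizes, alignment, or addressing has to interpret this string the same way, which is what makes it an "IR" in its own right.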
> My bewilderment about LLVM IR not being stable between versions had given way to understanding that this freedom was necessary.
I'm not parsing this statement very well, but bottom line: LLVM IR is remarkably stable because of Hyrum's law within the LLVM project's own repository. There's a TON of code in LLVM that deals with LLVM IR, so it's super hard to change even the smallest things about how LLVM IR works or what it means: any such change would surely break at least one of the many things in the LLVM repo.
What would be neat is to expose all the right knobs and levers so that frontend writers can benchmark a number of possibilities and choose the right values.
I can understand this is easier said than done of course.
The reason to couple it to regalloc is that you only want to remat if it saves you a spill.
I love LLVM, though. clang-tidy, ASAN, UBSAN, LSAN, MSAN, and TSAN are AMAZING. If you are writing C or C++ and NOT using clang-tidy, you are doing it wrong.
My biggest problem with LLVM right now is that -fbounds-safety is only available on Xcode/AppleClang and not LLVM Clang, while MSAN and LSAN are only available on LLVM Clang and not Xcode/AppleClang. Also, Xcode doesn't ship clang-tidy, clang-format, or llvm-symbolizer. It's kind of a mess on macOS right now. I basically rolled my own Darwin LLVM for LSAN and clang-tidy support.
The situation on Linux is even weirder. RHEL doesn't ship libcxx, but Fedora does ship it. No distro has libcxx instrumented for MSAN at the moment which means rolling your own.
What would be amazing is if some distro would just ship native LLVM with all the things working out of the box. Fedora is really close right now, but I still have to build compiler-rt manually for MSAN support.
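For anyone stuck rolling their own: the recipe in the LLVM MemorySanitizer docs boils down to rebuilding libc++ with MSan instrumentation and pointing your compile at it. A rough sketch (paths and the runtimes list are illustrative; check the current docs for your LLVM version):

```shell
# Build an MSan-instrumented libc++ out of an llvm-project checkout.
cmake -GNinja -S llvm-project/runtimes -B build-msan \
  -DCMAKE_BUILD_TYPE=Release \
  -DCMAKE_C_COMPILER=clang \
  -DCMAKE_CXX_COMPILER=clang++ \
  -DLLVM_ENABLE_RUNTIMES="libcxx;libcxxabi;libunwind" \
  -DLLVM_USE_SANITIZER=MemoryWithOrigins
ninja -C build-msan cxx cxxabi

# Then build your own code against it instead of the system libc++.
clang++ -fsanitize=memory -stdlib=libc++ \
  -nostdinc++ -isystem build-msan/include/c++/v1 \
  -L build-msan/lib -Wl,-rpath,build-msan/lib main.cpp
```

This is exactly the step a distro could do once so nobody else has to.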
Omarchy could and should do this; it's nice low-hanging fruit.
@dhh, if you're listening, the other good thing Omarchy could do is support the VFX Reference Platform specs maintained by the ASWF. That would bring in all of the Linux-based VFX software to Omarchy in a clean way.
I mean, Chimera Linux is pretty LLVM native.
If you get “credit” for contributing when you review, maybe people (and even employers, though that is perhaps less likely) would find doing reviews to be more valuable.
Not sure what that looks like; maybe whatever shows up in GitHub is already enough.
I remember part of the selling point of LLVM during its early stage was compilation time being so much faster than GCC.
LLVM started about 15 years after GCC, and LLVM is already 23 years old. I wonder if something new will pop up again.
Discussion: https://news.ycombinator.com/item?id=45072481
There are also codegen projects that don't use LLVM IR and are faster, like Cranelift: https://github.com/bytecodealliance/wasmtime/tree/main/crane...
Thus, what would be the commercial reason to support LLVM's successor, especially since the companies that were responsible for LLVM going mainstream are happy with the current C and C++ support, and mostly use LLVM for other programming languages' frontends?
LLVM is actually really, really good at what it does (compiling C/C++ code). Not perfect, but good enough that it would take tens of thousands of competent engineering hours to match it.
Build time wasn’t great, but it was tolerable, so long as you reduced link parallelism to squeeze inside the memory constraints.
Is it still possible to compile LLVM on such a machine, or is 8 GB no longer workable at all?
LLVM compiles in less than an hour on my old M1 Mac in all the build configurations I have tried so far.
Rust is also substantially faster to compile than it was a few years ago, so I have some hope for improvements in that area as well.
This certainly varies across different parts of llvm-project. In flang, there's very much a "long tail": according to "git blame", 80% of its 654K lines are attributed to the 17 contributors (out of 355 total) who are each responsible for 1% or more of them.
LLVM of course has plenty of contributors who only ever landed one change, but the thing that matters for project health is that the group of "top contributors" is fairly large.
(And yes, this does differ by subproject, e.g. lld is an example of a subproject where one contributor is more active than everyone else combined.)
Part of the reason I'm not ready to go all in on Rust is that I'm not willing to externalize that much complexity in the programs I make.
It is used in the Hare language
Among the people who are familiar with such things, yes. An RFC on the topic will be posted in the near future.
We miss you!
On top of that, there is little incentive for contributors to invest in the C API: most LLVM users and developers interact with the C++ API directly, so new features and options tend to be added there first, and often exclusively. As a result, the C API inevitably lags behind and remains a second-class citizen.
Optimizing compilers are basically impossible to audit, but there are tools like alive2 for checking them.
That would require the LLVM devs to be stupid and/or evil. As that is not the case, your supposition is not true either. They might be willing to accept churn in the service of other goals, but they don't have churn as a goal unto itself.
For starters, the tooling would be much slower if it required LLVM.