C Is Not a Low-level Language (2018) (opens in new tab)

(queue.acm.org)

293 pointsbmer2y ago396 comments

396 comments

249 comments · 50 top-level

GuB-422y ago· 42 in thread

C is low level for at least one reason: manual memory management. Especially with modern hardware, memory management is at the center of programming. For example, Rust prides itself in being memory safe without a garbage collector, memory management is more or less the entire reason for Rust to exist. Why is C fast? Memory. Why is C unsafe? Mostly memory. One of the big reason parallel computing is hard? Concurrent memory access. Functional programming is often surrounded by plenty of mathematical concepts, but a good part of it is to pretend that objects are immutable when behind the scenes, the compiler works with mutable memory.

In C, every call to the allocator is explicit, that is, if you are using an allocator at all. Compare to oldschool C++, with new/delete and raw pointers, where you may call the allocator explicitly, but still, a lot happen in destructors, automatically. In modern C++, with smart pointers, it is essentially like a garbage collected language in the sense that allocation and deallocation all happen automatically.

grotorea2y ago

> C is low level for at least one reason: manual memory management. Especially with modern hardware, memory management is at the center of programming.

Ok, but even with C we can't actually low level manage memory how the processor does it. You can't tell the processor what to keep or not on what level of cache, what to send to virtual memory, etc. It's lower level than say Python but I don't think it is low level memory management in the way PDP-11 C was.

Chabsff2y ago

A lot of that is just a property of modern OSs, with good reason, intentionally not exposing these features to userspace processes. It's not really a function of the language itself.

grotorea2y ago

Hmm, true for virtual memory, didn't think of that, but CPU caches are inside the processor, can even the kernel control it at all?

1 more reply

theamk2y ago

The reference point for when we talk about low memory languages is not transistor but rather machine code. In this aspect we say that C is low level because one pointer dereference in C translates directly into memory load of machine code.

The fact that modern memory load operation involves cache, protection, memory mapping, etc.. is not a property of language, but rather of the environment (CPU + OS).

grotorea2y ago

That's fair enough, but it doesn't seem to be the most useful that it can be then. Low level to me could mean that we control all the details. But modern processors have microcode between transistors and the x86 machine code. And we can't control all that memory stuff.

But those aren't abstractions that we can treat as black boxes, we need to know them and how to code taking them into account without actually having control inside the black box.

wang_li2y ago

You can control what goes into cache if you want to. The effort to make an open source bios do this in order to have working memory before the DRAM controllers are initialized.[0]

0. https://www.coreboot.org/data/yhlu/cache_as_ram_lb_09142006....

ilyt2y ago

> Ok, but even with C we can't actually low level manage memory how the processor does it. You can't tell the processor what to keep or not on what level of cache, what to send to virtual memory, etc.

neither can assembler so it is useless distinction

CJefferson2y ago

There are CPU instructions to pull memory into cache, send cache back to main memory, and Mark things in cache as not worth writing out to memory. All hard to use from C, the last type basically impossible.

2 more replies

quatrevingts2y ago

As the article notes, that's because CPUs are designed to run existing C code fast. You could create an instruction set that provided this control, but it might be a tough sell in a world full of C code.

cmrdporcupine2y ago

"Memory" itself is an abstraction around a much more complicated model (virtual memory / pages) that most programmers remain ignorant of. (Unless you're working on a microcontroller class system, or other system without an MMU but that's a whole other kettle of fish(.

Even Rust developers like myself labour within the fantasy that a pointer is, y'know, like an address to memory, a real "physical" thing. Rust (and to some extent C++) introduces some management abstractions in front of this in the form of references and borrowing, but the main concept is still there.

In reality the kernel of your operating system has put a giant layer between you and the physical memory, and the "address" and "pointer" are really just handles behind which the OS and MMU do all sorts of shenanigans.

"Raw pointers" really aren't raw. They're handles to offsets within pages, which can be all over the place. It would be entirely possible to walk away from the libc & C model entirely and work in a world of pure references interacting directly with VM subsystem pages as some kind of "object handles" and be much closer to the actual operation of the underlying system.

squeaky-clean2y ago

> Unless you're working on a microcontroller class system, or other system without an MMU but that's a whole other kettle of fish

So C can do actual memory management, your OS or hardware just won't let you. I've done programming for audio effects gear where memory is directly accessible by real address. Often with different memory chips with different performance characteristics (for cost reasons) corresponding to different pointer value ranges. Just because your machine won't let you do it doesn't mean C isn't capable of it.

jstimpfle2y ago

Raw pointers are how you communicate with your CPU. They are "raw" in the sense that they're just an integer number (not really on the C language level, but they have an integer representation on any actual target like x86) and that you have to synchronize these pointers with the lifetimes of "actual" objects, which are only an abstract concept that your computer doesn't understand.

Meanwhile, virtual memory is as close as you can get down to the physical hardware in terms of normal CPU instructions (i.e., not VM management code). VM as a concept is orthogonal to raw pointers, which can be either virtual or physical.

Raw pointers are nothing like handles. They need to be manually "synchronized" properly with VM management (which happens completely behind the scenes for 99,99% of userspace code) to make sense but it's not like there is bookkeeping overhead in copying or offseting a pointer, like there would be for a "handle".

The point of a handle is that it's use to hold objects, to keep them alive. Raw pointers don't do that.

saagarjha2y ago

Would such a model be generally useful, though?

actionfromafar2y ago

I have thought so for a long time. It could open up execution of functional languages on a truly distributed runtime. Something like the fabled Tao operating system I guess.

cmrdporcupine2y ago

Definitely useful in some systems context, especially e.g. database page buffer management.

anonymous_sorry2y ago

> It would be entirely possible to walk away from the libc & C model entirely and work in a world of pure references interacting directly with VM subsystem pages

Is this possible in Ring 3? Or would everyone be running in kernel mode at that point.

Even if you do away with that layer, then there may still be a hypervisor lying to the kernel about memory.

pornel2y ago

C's memory management is its own abstraction. malloc and free are library functions. They're an abstraction not just over hardware (that doesn't have anything bytewise allocated like that), they even abstract away the way operating systems allocate memory.

You don't get direct access to the stack in C either. Stack frames are abstracted away, and you only get longjmp.

If you pay attention to Undefined Behavior and strict aliasing, you don't even get that much access to poking around memory.

1 more reply

pjmlp2y ago

BASIC also can do manual memory management, not only that, it had a whole computer generation for itself, in computers not able to have a full ISO C implementation.

theamk2y ago

So does Python (via ctypes) and pretty much every language we consider "high level". But in BASIC your default approach is "DIM names$(count)" which "magically" manages your memory for you.. which is why we consider it higher level than C.

pjmlp2y ago

That is hardly different from malloc(count * size), REDIM exists (aka realloc()) and many BASICs do offer the free variant as well.

In fact, there is hardly any difference between VMS BASIC and VMS C in terms of what is possible, if we want to take the discussion outside of 8 bit versions.

marcosdumay2y ago

> Why is C unsafe? Mostly memory.

C can't even do all of integral arithmetic safely. It's a language that goes really out of its way to add unsafety.

bensecure2y ago

I'm pretty sure that this is one of the unsafeties that rust borrows from c, even as it attempts to eliminate all the others. Checking every addition adds a massive slowdown, without giving much useful protection against vulns or corruption.

Hirrolot2y ago

> I'm pretty sure that this is one of the unsafeties that rust borrows from c

But integer arithmetic is safe in terms of Rust.

> Checking every addition adds a massive slowdown

It only does so for debug mode. In release mode, it uses modular arithmetic.

5 more replies

shrimp_emoji2y ago

But 53 years later, it's added the `<stdckdint.h>` header, offering `ckd_add()` and friends. :D Better late than never!

diogenes42y ago

Integers are an abstraction on top of words; words are perfectly safe.

rewmie2y ago

> C can't even do all of integral arithmetic safely.

Your comment reads like nonsense. Are you able to provide what you feel is the best example that substantiates your claim?

DashAnimal2y ago

When comparing a signed and unsigned integer, the signed integer is promoted to unsigned integer.

So if you have a = -1, b = 1000 and compare the two, a > b is actually true.

1 more reply

aidenn02y ago

Signed integer overflow is UB in C.

1 more reply

theamk2y ago

C is a low level language, so when programmer writes "+" they get "ADD" opcode. That's what "low level" means. If one wants "+" to add then do range checks, they can use a higher level programming language (or a special function, perhaps an intrinsic, in C)

monocasa2y ago

There's plenty of cases in C where the use of a + operator doesn't result in any form of add instruction being emitted.

MrBuddyCasino2y ago

> C is low level for at least one reason: manual memory management.

Manual memory management isn't that much faster than a modern GC, sometimes even slower. I'd argue that C programs are typically fast because there is just less rope to hang yourself with, leaving aside memory safety.

The anemic abstractions provided in the language and the tiny stdlib means it takes a lot of work to achieve something, so developers simply do less. There isn't even a Hashmap (or a proper String), while in Kotlin, you can perform a deep copy of the object graph and convert it to json in parallel in a single line if you so wish.

johnnyjeans2y ago

This is a common misconception that stems from a misunderstanding of why manual memory management is "fast". It has nothing to do with the actual process by which you request and release memory. Manual memory management being faster than GC is a function of controlling memory layout, which is one of the (very) few low hanging optimization fruits that mortals like you or I can do on contemporary CPUs. It also has to deal with the fact that dynamic allocation is slow, and being able to get it out of the way with a single allocation in the first moments of a process's life is an immensely important tool in the optimizer's toolbelt.

> The anemic abstractions provided in the language and the tiny stdlib means it takes a lot of work to achieve something

Which has the additional effect of forcing you to be a bit smarter about how you do things, to be less wasteful. It forces you to contend with everything you want to do, to consider it and the cost associated with it. Built-in, general-case abstractions are nice when under time constraint and hacking something together, but it doesn't make for good software. Not only is it almost guaranteed to be slower than a properly constructed purpose-built solution, but it also removes your view from thinking about the cost of every single thing you're doing. It makes it easier and attractive to overuse abstractions, to over-engineer solutions, and to approach problems from a standpoint where you simply throw the kitchen sink at the problem because that's the only thing you can think of.

imtringued2y ago

>It has nothing to do with the actual process by which you request and release memory. Manual memory management being faster than GC is a function of controlling memory layout, which is one of the (very) few low hanging optimization fruits that mortals like you or I can do on contemporary CPUs.

That is kind of misleading. The difference is that C and Rust support stack allocation, which is essentially an arena style allocator integrated into the language. What the fancy pointer bumping GC runtimes do, the stack does by default. The problem is that escape analysis is difficult and it is difficult to prove that an access to memory on the stack is safe without fundamentally changing the language like Rust does. It gets worse on the heap, where you can have runtime determined ownership.

C programmers like their doubly linked lists, but when you think about it, it is actually kind of a difficult problem to formalize and analyze in its full generality.

1 more reply

MrBuddyCasino2y ago

> Manual memory management being faster than GC is a function of controlling memory layout

Control over memory layout and manually allocating and freeing memory are orthogonal issues.

I can optimize memory layout in Java too by using primitive data types instead of pointer-chasing objects, or structs of arrays vs array of structs type of things in order to improve access patterns. I can't control alignment and padding, except indirectly, thats true, but that is not what people mean when they say "manual memory management". Rust gives you control over memory layout, but has "automatic" memory management.

> forcing you to be a bit smarter about how you do things, to be less wasteful

Yes this is what I meant.

jstimpfle2y ago

Love your words. 1000 upvotes if I could.

For balance, the faster machines get, the more problems are most effectively solved by throwing the kitchen sink at them.

1 more reply

trealira2y ago

There are some performance advantages a garbage collector can have over manual memory management. If you're just calling malloc/free or, in C++, calling new/delete in constructors/destructors (or using a class that does so, like std::vector), and nothing special, the garbage collector is probably allocating memory faster.

> controlling memory layout

Garbage collectors can compact active memory into one contiguous location and adjust the active pointers to point there instead. You can't do this in a language like C, because you can have arbitrary pointers to anything, and there's no runtime indication of what's a pointer or just an integer. You simply have to prevent memory fragmentation in the first place, which also complicates the logic of the program.

For faster allocation in C, arena allocation based on object lifetimes can be used [1]; in generational garbage collectors, you get similar benefits, but it's just done automatically. In fact, in that linked paper, they found that lifetime-based arena allocation improved the speed of their program (a C compiler) at the cost of increased memory allocation compared to naïve malloc() and free(), which is exactly what garbage collection does.

As a result of compaction, memory allocation with garbage collection is just a pointer bump in the best case, whereas allocation with just malloc usually requires searching a free list or a tree.

[1]: https://www.cs.princeton.edu/techreports/1988/191.pdf

kuchenbecker2y ago

Cache invalidation is hard :) almost as hard as semantically naming things in a way that is clear now, and in the future.

staunton2y ago

... the famous Two Hard Things, together with off-by-one errors.

tmtvl2y ago

concurrency With 3 it's Things Hard.

1 more reply

2-718-281-8282y ago

very interesting observation. never thought about how memory is central to all those concepts and technologies.

systemBuilder2y ago

In my lifetime the memory hierarchy has grown from 3 levels (register, main memory, disk) to at least 8 levels (register, L1 Cache, L2 Cache, L3 Cache, VM, main memory, disk, web). A lot of people like the guy who wrote this paper have no patience for a language that can still run most 1971 C programs!

Every year people who do not understand why C is so successful try make a name for themselves by breaking what makes the language great (such as the complete agnostic set of control structures). C has successfully remained portable and performant for 50Y+ because of its flat memory model (with a few tweaks such as "volatile".)

crabbone2y ago

C is neither fast nor low-level... none of these descriptors have any meaning.

It's a pointless discussion when you don't care to explain how you use the words that obviously have many related but different ways to interpret them.

pizlonator2y ago· 37 in thread

Friendly local C programmer and compiler writer here to remind you that C definitely is a low level language for those who understand it and use it professionally. If you’re looking for a low level language, then C (and its relatives) are your best bet.

If you’re new to the language and want to understand how to use it like a pro then ignore this post - it will only confuse you and reduce your ability to use C effectively.

jerf2y ago

C is a low level language... but it's the wrong low-level language. It gives you low-level access to a machine that your real machine actually has to somewhat laboriously emulate. Such dangly bits and bodges that have been added over the years to give access to the real machine are relatively foreign bodies to C.

I would agree the title is a bit rhetorically rough, though, because being the wrong low-level language doesn't make it a high-level language. WASM would similarly be "wrong" if I claimed it was a direct mapping to modern hardware, but that doesn't make it "high level".

(Although what really frustrates me about C isn't that it's a bad mapping per se. It's from the 1970s, what do you expect? And it is obviously still quite useful for many cases. What frustrates me is that it continues to a large degree to dictate language design and heavily color how language designers see hardware, so too much modern language design is still just reshuffling bits of C around, rather than building languages that work with the hardware well.)

jstimpfle2y ago

What is C really? A concise syntax to define structs and functions, with a usable expression syntax. There isn't all that much to it, I've always found it ridiculous for people to claim it's holding hardware back.

I don't think I've ever really seen a good argument what developments were prevented by the existence of C as an important compiled language. The one claim I can remember I find ridiculous: that today's CPUs execute instructions in parallel, not serially. Well, for one, C's semantics aren't that serial, there is a large degree of freedom for compilers and CPUs how to schedule the execution of C expressions and statements. Then, there are SIMD instructions exploiting those capabilities explicitly. But also, the rest of the code gets automatically pipelined by the CPU, according to a specific CPUs capabilities. Even though that stuff happens in parallel, any instruction encoding is by necessity serial. Or is anyone proposing we should switch to higher-dimensional code (and address spaces)?

shadowgovt2y ago

To my aesthetic C is the wrong abstraction because while all those things are possible, the language exposes them via a syntax that makes you think you're writing an embarrassingly sequential program and then tries to hide all of the parallelization that improves performance in the undefined behavior.

I liken it to doing imperative UI development on top of the DOM abstraction in a browser. Yes, under the hood, the browser is choosing when to re-evaluate and repaint interface elements, but you can't touch any of that; you're instead rearranging things in the DOM and memorizing heuristics the browsers use to try and trick the browsers into matching changes to the DOM to visual changes in the browser UI efficiently.

It may very well be time for a low level languages to encourage us to think about programming as "arranging independent blocks of code that can be executed in parallel, with only a handful of sequencing operations enforcing some kind of dependency order. Apart from honoring those sequencing requirements, order of execution or whether execution happens in parallel is undefined."

1 more reply

crabbone2y ago

> A concise syntax to define structs and functions, with a usable expression syntax. [...] I've always found it ridiculous for people to claim it's holding hardware back.

You just looked in your fish tank and declared what the weather is going to be like in the Atlantic ocean... Like... these things have nothing to do with each other. The fact that C has functions or structs has nothing to do with it being awful influence on designing hardware.

Here are some reasons why C is awful.

* It believes that volatile storage is uniform in terms of latency and throughput. This results in operating systems written with the same stupid idea: they only give you one system call to ask for memory, and you cannot tell what kind of memory you want. This in turn results in hardware being designed in such a way that an operating system can create the worthless "abstraction" of uniform random-access memory. And then you have swap, pmem GPU's memory etc. And none of that has any good interface. And these are the products that despite the archaic and irrelevant concept of how computers are built have succeeded to a degree... Imagine all those which didn't. Imagine those that weren't even conceived of because the authors dismissed the very notion before giving the idea any kind of thinking.

* It has no concept of parallelism. In its newer iterations it added atomics, but this is just a reflection of how hardware was coping with C's lack of any way to deal with parallel code execution. C "imagines" a computer to have a CPU with a single core running a single thread, and that's where program is executed. This notion pushes hardware designers towards pretension that computers are single-threaded. No matter how many components your computer has that can actually compute, whenever you write your program in C, you implicitly understand that it's going to run on this one and only CPU. (And then eg. CUDA struggles with its idea of loading code to be executed elsewhere, which it has to do in some very cumbersome and hard to understand way, which definitely doesn't rely on any of C's own mechanisms).

3 more replies

imtringued2y ago

>Well, for one, C's semantics aren't that serial, there is a large degree of freedom for compilers and CPUs how to schedule the execution of C expressions and statements.

I thought about the implications of a "parallel" statement, where everything is assumed to execute in parallel and oh boy are the implications big. C's semantics are serial but they contain implicit parallelism. The equivalent is that the parallel statement contains implicit sequentialism that the compiler can exploit to reduce the amount of book keeping needed by the CPU to schedule thousands of instructions at the same time. E.g. instead of having an explicit ready signal and blocking on it, the compiler can simply decide to split the parallel statement into two parallel statements, one executed after the other. Implicit sequentialism! A parallel statement implies that no aliasing writes are allowed to be performed. I don't know what the analysis for that would look like, but in many common cases I would expect the parallel statement to be autovectorized quite reliably.

>Even though that stuff happens in parallel, any instruction encoding is by necessity serial. Or is anyone proposing we should switch to higher-dimensional code (and address spaces)?

Uh, you know we can just encode the program as a graph? Graph reduction machines are a thing, you know.

1 more reply

circuit102y ago

“instruction encoding is by necessity serial. Or is anyone proposing we should switch to higher-dimensional code”

That is sort of a thing: https://en.m.wikipedia.org/wiki/Very_long_instruction_word

If you have multiple instructions grouped together like this you could think of it as being a 2D array of instructions

xscott2y ago

I understand your point: Modern hardware tries REALLY hard to pretend it is a simple set of instructions executing one after another. For all the on the fly clever caching, micro-op translation, branch prediction, speculative execution, register renaming, and whatever else, it consistently presents a sane model to single threaded programs. It's difficult to even see the magic under the hood if you tried, and it mostly shows up in unexpected performance discrepancies or race conditions for multi threaded programs. It's all a huge charade...

However, before dismissing this all as a bad mapping to an outdated 1970s model of computation, I'd like to see a good alternative. CUDA has clearly shown that there's an acceptable model for massively parallel data sets, but that doesn't handle branch heavy code very well at all. And FPGAs have a different approach for a completely different kind of problem, but I don't know how you would expose what Apple, AMD, or Intel chips are doing under the hood and have it be at all manageable to the programmer. How is someone supposed to indicate what's next when a pipeline stalls waiting on the previous operation or a cache miss? Is the programmer going to toss micro ops into separate execution units and wait for the results to come out the other side in arbitrary order? Is this an async/await model for every addition or memory fetch? I think it would be complete spaghetti to even try, but I'd love to be shown I'm wrong.

People get all excited trash talking Itanium, but I think it's a lesson that if you try to expose any alternative to the 1970s model they'll just bitch about how there are no sufficiently smart compilers. And of course it got scooped by AMD64 pretending to execute one instruction after another.

And if there isn't a good alternative, I think C (or Rust, or WASM) are a pretty good fit for what you've actually got to work with at the low level.

2 more replies

hawk_2y ago

What language(s) in your opinion have the right low-level where the access to the real machine doesn't feel foreign?

JonChesterfield2y ago

Assembly is the right one. You have direct access to the machine ISA, including the weirder status/control registers and whatever trap/syscall corresponds to. Assemblers are somewhat powerful - can define data layouts somewhat like structs, abstract some things behind macros, add pseudo-instructions to put friendlier names on some things. Maybe the ISA expects you to build constant integers out of arithmetic, the assembler can give you a 'const' instruction which expands to said arithmetic.

I have a pet theory that lisp macros over an assembler is the right high level language for systems programming but that hasn't made it off the whiteboard yet.

3 more replies

pjmlp2y ago

Assembly, or what ESPOL was already doing in 1961 a decade before C was even an idea, compiler intrisics.

So taking out Assembly, any language can have hardware capabilities exposed as compiler intrisics, that is nothing special about C in that regard, only the one many people are commonly aware of because they don't to be educated in compilers.

giancarlostoro2y ago

The only one I can think of would-be Assembly, but I don't do much low-level work, I code in much higher-level languages. Genuinely curious what the answer is.

2 more replies

jerf2y ago

Per my last paragraph, I am not convinced about any of them.

One of these days I really need to post my "ideas for languages" that I've got banging around on my hard drive, but one of them is "a language that deals with the increasingly heterogeneous nature of the computer". You've got the CPU, the GPU, efficiency cores, whoknows what else in the future (NN cores), and it's only a small hop from there to consider other computers as resources too.

Full disclosure: I have no idea whatsoever what this looks like. Especially in light of the fact that you need to build not just for the exact machine you're developing on but for machines in the future as well. Some sort of model of what is being computed and some guestimate at the costs? (Something like an SQL query builder where you declare your goal and it does the computation about what resources to compute it with?) It's also possible that the huge gulfs in performance between all these parts are just too large to bridge and manual scheduling of all these resources is just the only choice.

Even just within a CPU it's rather annoyingly difficult to use vector-based code in modern languages. Perhaps something like an array-based language, but one that discards that field's bizarre love affair with single-character (if not outright Unicode) operators and can be read by a normal human, and just affords writing code in a style that SIMD becomes a sensible default rather than something the optimizer laboriously reverse engineers from your conventional imperative code. (Array based programming could really use a "for humans" version of those languages in general.)

To some extent, just sitting down for a year to learn modern assembler and starting from the very, very bottom once again to build a high level language, rather than starting with C and building "C, but ..." which is pretty much every modern language being developed, would be an interesting exercise if nothing else.

Another little example is I think Jai was supporting structures-of-arrays instead of arrays-of-structures, though I don't know if they kept it. I'd like to see a language where the language-level data structures are explicitly viewed through the lens of "how I serialize these into memory", rather than the data structure implicitly creating such a specification by how it is defined, so for instance you could swap out a SoA to an AoS by swapping only the way the compiler serializes to RAM and not any of the rest of the code. Obviously you provide defaults that look like modern languages, but with this you could directly implement things like tagged unions with custom bit layouts, or theoretically, directly accessing gzip'd data by specifying that this data structure can only be accessed sequentially but as long as that's what you do you don't need to directly unzip it, etc. This doesn't directly answer "how do you utilize modern hardware correctly" but gives you tools to potentially create a better match than what compilers give by default.

Again, to be clear, this is crazy pie-in-the-sky far out ideas that I do not have an implementation in mind for, but it's the sort of thing I'd like to see more experimentation with on the fringes of language dev. (And I only wish I had time to do it myself. Unfortunately, I simply do not.)

(And, as the sibling comments point out, yeah, assembler technically, but that's kind of a cop out.)

3 more replies

grotorea2y ago

> It gives you low-level access to a machine that your real machine actually has to somewhat laboriously emulate

Isn't C the language (x86_64) processors are designed to be fast for? Sure they added a large amount of abstractions but since they were made for C is there any language where the processor doesn't have to laboriously emulate?

pizlonator2y ago

> Isn't C the language (x86_64) processors are designed to be fast for?

Yup

I mean they also optimize for Java and JS and .NET and probably Swift and Rust.

But C still takes precedence, I bet

kllrnohj2y ago

> Isn't C the language (x86_64) processors are designed to be fast for?

Nope. They compete on performance in C++ (games mostly), Java (enterprise SKUs, but same core architecture), and JavaScript (browser benchmarks even though raw JS performance is a very small part of browser responsiveness...)

pizlonator2y ago

Nothing added to machines since the invention of C is foreign to C. In fact, C is hardwares most favored customer. Chip designers tend to favor tuning for traces of instructions generated by C compilers. Some architectures, like RISCV, are so overtuned for C and nothing but C that they forgot to add some instructions (like add with overflow check).

snvzz2y ago

>they forgot to add some instructions (like add with overflow check).

If you actually read the spec, you would have found that they didn't "forget" these.

They carefully studied them and judged the encoding space is better used elsewhere.

1 more reply

fanf22y ago

Multiprocessing. Atomics. Vectors. GPGPUs. All foreign to C when they were introduced.

1 more reply

quelsolaar2y ago

Your friendly wg14 member here. It is a low-level language, but it is not a portable assembler. If you think you what you will write will have a one-to-one relationship to assembler you will run in to trouble. If you want a deeper dive in to how these things can trip you up, watch: https://youtu.be/w3_e9vZj7D8

gavinhoward2y ago

C programmer and fan of yours.

I agree with you, but if you could convince WG14 to remove a lot of the stupid UB, that would be closer to the case.

(I know you're trying from your "One Word Broke C" article. Which, by the way, is putting up a server error right now.)

pif2y ago

> it is not a portable assembler

And it never was!

Just keeping this point in mind would reduce the plethora of discussions about undefined behaviour to the essential, i.e. the useful discussions, i.e. the 0.1%.

JonChesterfield2y ago

Opinion is divided on this. My best guess is that ISO C was never a portable assembler, but the C programming language before standardisation broadly was, and that's how people hold both positions as self evidently true. Different definition of "C".

1 more reply

titzer2y ago

C would have been great as a portable assembler. E.g. if a syntactic + mapped to the hardware `add` instruction, that's pretty predictable! But it doesn't; it maps to the hardware `add` modulo compiler optimizations (like folding and strength reduction, which are done assuming overflow and other tricky parts are UB). Basically everywhere UB is permitted by the spec is so compilers don't have to handle the tricky cases, don't have to give semantics for buggy programs, or even help in debugging, and can make what would be unsound optimizations if the operations truly represented the target CPU's "weird" add semantics.

pizlonator2y ago

Just toss enough compiler flags at clang and make sure to occasionally use inline asm snippets to throw off the compiler's optimizations.

Then you're GTG

pizlonator2y ago

Depends on what you mean by "portable assembler". It is exactly that in a lot of ways, but exactly not that in others.

I think it's more useful to say that C is a portable assembler, than it is to say that it isn't, considering how it's used in practice and the sort of nasty things C compilers do in order to make that possible.

cmsonger2y ago

The author is playing a semantic game.

I don't think the author's point is that "C is not a good language for systems programming." You are not going to have an equivalent to volatile int *dma_register = SCATTER_GATHER_BASE; in Haskell.

The author's point is that the drive to make C and other "model the von Neumann machine" languages execute quickly has made the compiler very complicated (the author is implying that "low level requires simple compiler") and that processors built to make such code run quickly are also very complicated. And those complications carry costs.

In many ways this is a "call to programming model action" and cites GPU as illustrating the potential when "new programming model" and "silicon to support it" are done in concert.

bunderbunder2y ago

"Low-level" is a word with multiple meanings.

The original one is the one the article uses: low-level languages are non-portable and tied to the hardware on which they run, and high-level languages can target multiple platforms. Under this definition, C is absolutely a high-level language.

My complaint would not exactly be that the author is playing semantic games; it would be that they are clinging to archaic terminology in a way that does more to confuse than enlighten. The "generations" taxonomy is generally more descriptive.

  1st: Machine
  2nd: Assembly
  3rd: General-purpose
  4th: Application-specific

The 3rd/4th distinction gets a bit muddied sometimes, and back in the 80s and 90s people talked about a 5th generation that never really took off. But a couple (I think) clear examples of 4GLs are SQL, HyperCard, and Mathematica.

What I like about that approach is that it mostly breaks languages up according to fairly clear distinctions about when you would use them. And then we can use "high/low-level" as a relative term, where higher-level languages tend to do more to abstract away the details of what the computer is actually doing. That does mean that higher-generation languages tend to be higher-level; all we lose in doing it that way is the ability to have silly arguments about where to place a completely arbitrary (and, frankly, useless) dividing line.

I also like that this way we can recognize .NET IL, WebAssembly, and Java bytecode as very high-level 2nd generation languages, which, at the very least, is fun.

Oh, and Forth is a 3rd generation language. Fight me, Chuck.

fanf22y ago

5th generation was the label under which the Japanese government threw a lot of money at Prolog and expert systems. It wasn’t a technically-driven distinction from the 4th generation, but rather a wish about what would happen if the project succeeded. 5GLs came about from language designers bidding for research money, saying, try our language, it’s better than Prolog!

hardware2win2y ago

>use it professionally

I think this post goes way way way above boringness of day2day jobs.

Yea, this post is not about how to use hammer, but more like curious consideration whether using hammers everywhere is not limiting us (C design)

lelanthran2y ago

> Yea, this post is not about how to use hammer, but more like curious consideration whether using hammers everywhere is not limiting us (C design)

Maybe it [EDIT: the post] is, but the title is obviously nowhere near accurate - if C is not a portable low-level language, what on earth is?

[1] It gets reposted everywhere so often I have read it multiple times, and the one thing in common I see is how every know-it-all crawls out of the woodwork to comment on the title, as if the title was something new, deep, profound or even correct.

bayindirh2y ago

C is only portable between systems which emulate PDP-11 at hardware level and if and only if you don't use any compiler-specific extensions.

If you use sys calls, work between different breeds of operating systems (UNIX, POSIX and Windows are not compatible with each other), you need to rewrite or wrap relevant parts, or write the relevant part beforehand inside ifdefs to be able "port" it between systems.

The gist of the piece is, hardware is evolving to please C's programming model, hiding all the complexities C is not aware of, and behave like a PDP-11 on steroids. This is why we have truckload of side-channel attacks in X86 to begin with. To "emulate" PDP-11s faster and faster.

2 more replies

scythe2y ago

>if C is not a portable low-level language, what on earth is?

This question doesn't have to have an answer. The author of TFA apparently believes that a low-level language is one that effectively and clearly exposes the execution model of the hardware to the programmer. Under this definition, no widespread language (except assembly) is truly low-level, and possibly none are.

Which, for what it's worth, is also what I was taught in school. C was consistently described as a high-level language by my professors, even if it is "lower-level" than almost everything else.

1 more reply

rfoo2y ago

The post argues that there is no portable low-level languages, including C.

i.e. truly low-level languages can't be portable and is bound to the architecture.

1 more reply

pjmlp2y ago

Only when taking into account language extensions that are compiler specific and not part of ISO C.

Also a reminder that any language can have toolchains with extensions exposing low level features.

1 more reply

hcks2y ago

Funny how the top comment on "hacker" news is an *unsubstantial* comment about how, actually, TFA is wrong.

Even worse, adding a comment on how actually you shouldn’t be curious and understand how things really work.

1 more reply

titzer2y ago

There's a lot of moaning and crowing here, but no real substance. If one were to design a CPU and its ISA from scratch, what would you do? Instructions, control flow, memory, out-of-order execution, caches, hierarchies, branch prediction, you'd probably end up with all of it down there anyway. I don't get the point about GPUs. Real applications aren't matrix multiplies and embarrassingly parallel numeric algorithms, they run general purpose PLs.

Which basically then boils down to ISA design. If you could design an ISA from scratch for the hardware you design from scratch, what would you do? Well, there aren't that many options. Stack machine, dataflow machine, VLIW machine. All of those have been tried and the modern superscalar CPUs kick their butts on every metric except power.

The whole article kind of misses the point anyway. We should probably be running higher level languages for most things anyway, which shouldn't be overly constrained by hardware design. For everything else, 100% serious, there is WebAssembly, and hardware ISAs will fade below this level of abstraction in the fullness of time.

spion2y ago

Can you elaborate?

wolframhempel2y ago· 20 in thread

I feel, low to high level is a spectrum, not a binary. C is arguably in the lowest third of languages, exposing you to a lot of machine primitives like memory and thread management. It may not be as low level as assembly, but it is arguably lower level than Java or Go, and definitely nowhere near the Pythons and JS of this world.

Joker_vD2y ago

> exposing you to a lot of machine primitives like memory and thread management

Except it doesn't really, the standard leaves most of the really machine-dependent parts undefined; only very few things are left implementation-defined.

Plus, of course, C is quite unsuitable for any platform that uses segmented memory/non-flat addresses (which are things that are trying to come back in vogue but C's wide spread really, really hinders that).

mytailorisrich2y ago

> Except it doesn't really, the standard leaves most of the really machine-dependent parts undefined

Well that's because it is low level and, especially, simple, and doesn't try to abstract things.

TheOtherHobbes2y ago

It's a certain kind of low level - specifically a PDP-11 kind of low level.

If your hardware is significantly different, it only looks low level. In reality plenty of mapping and conversion goes on behind the scenes - sometimes with hilarious consequences.

pornel2y ago

> and doesn't try to abstract things.

The C standard is a description of an abstract machine. You get UB and unexpected miscompilations, because the optimizer is not evaluating how your code runs on the machine you're compiling for, but simulates running your code on the weirdly abstract C machine, one that can't overflow signed integers.

And C abstracts away almost everything about stack, stack frames, and all the complexities of memory and cache hierarchies. They are abstracted to be uniform linear address space.

mrpopo2y ago

Can you or someone expand further on that? Which platforms are trying to use segmented addressing, and what benefits does it have?

Joker_vD2y ago

CHERI project [0]. Look at figure 2.1: it's an improvement and further development of the segments of yore but the origins are quite visible.

[0] https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-941.pdf

1 more reply

agentultra2y ago

We did ok with the 8086 processors, no?

Joker_vD2y ago

Yes, juggling near and far pointers was somewhat annoying but then Intel, as a part of the 32-bit transition, modified their ISA to be a more pleasant target for C implementations.

Incidentally, C never really became popular on 6502 because, arguably, that ISA is somewhat hostile towards efficient implementations of higher-level languages.

2 more replies

robmccoll2y ago

By that do you mean exposing a non-uniform memory hierarchy as separate addressable spaces (but with coherent views from each hardware thread) or something like thread-local scratch pads?

Either way, C is equipped ok for that - at least as well as most systems languages C++, Rust, etc. - simply because dealing with allocation and raw addressing (at least raw within the process memory space) is a fundamental part of the experience. Throw in a few compiler extensions (because you'll need to change the compiler to make use of this anyway) for things like where to locate static allocations and use library functions that add dynamic allocation in specific spaces. It will get hairy, but it's at least possible with some very careful programming.

danielvaughn2y ago

Honest question - is there any language at all between C and Assembly? Because if there is, I haven't heard of it. For that reason alone, my mental model has always been "C is the lowest you can go before hitting direct instructions to the processor."

lebuffon2y ago

I would say Forth is lower level than C. The mental model is a two stack machine plus memory, rather than a PDP-11.

And it is very reasonable if you are under 50 years of age, that you haven't heard of it.

1 more reply

fiedzia2y ago

Conceptually, you can consider LLVM IR to be such language, and there are people who use it that way.

grotorea2y ago

There are some cleverer assemblers that allow to program in assembly while still being able to do stuff like loops without too much effort https://en.wikipedia.org/wiki/Assembly_language#Support_for_...

trealira2y ago

While staying portable across architectures, probably not. But you can make a little language that's nicer to jse than assembly for a particular CPU.

COMFY-65 is a compiler for a small Lisp language that provides all non-branching operations of the 6502 processor as primitives (e.g. tests for carries, overflows, zero, and negative; set decimal arithmetic mode; etc.). However, programs still consist of subroutines, loops, and tests, with no "go to label" construct provided. It's surprisingly simple and, I would say, elegant.

Here's the PDF that outlines it: https://dl.acm.org/doi/pdf/10.1145/270941.270947

CJefferson2y ago

C makes lots of things undefined behavior which are perfectly fine in assembler — read a stack without first writing to it, doing overflow signed internet arithmetic, treating the same memory location as different types.

Also there is quite a lot in modern assembler that you can’t really get to from C, like prefetch and cache flushing instructions.

danielvaughn2y ago

That last bit is really interesting. Do newer languages make use of those features? Sorry I’m fairly ignorant of this level of the stack.

1 more reply

xigoi2y ago

> is there any language at all between C and Assembly?

LLVM or QBE, for example.

titzer2y ago

Yes, WebAssembly is higher-level than machine code but lower than C.

eesmith2y ago

1990 is calling:

"C does not behave as a typical ‘high-level’ language, because it offers a number of features which are more normally associated with ‘low-level’ languages such as assembly language. These include the ability to write data to and from particular memory addresses, facilities for operations on the contents of memory locations, and instructions for incrementing and decrementing integer variables ... Thus C allows the programmer the flexibility and efficiency of working at low level with the advantages of working at high-level, for example the more advanced data structures and program flow controls typical of today’s computer languages. For this reason, C is sometimes described as a ‘high-level low-level language’ or as a ‘low-level high-level language’." - https://archive.org/details/computerprogramm0000ford/page/13...

clnq2y ago

You have high expectations for accuracy in an article titled "C Is Not a Low-level Language".

ndiddy2y ago· 15 in thread

I disagree with the author's point that CPU instruction sets should expose more of the CPU's implementation. This has been tried in the past and failed to work long-term. One example of this is branch delay slots from some RISC processors (such as MIPS and SuperH) designed in the late 80s and early 90s. For those unfamiliar with the concept, it basically means that the instruction after a branch instruction will get run regardless of if the branch was taken or not. This was a short-term benefit, as it meant the job of avoiding pipeline stalls after a branch was left to the programmer, so the processor could be simpler and cheaper than designs without them. However, as time went on, the processor designs evolved with more complex pipelines, so the single instruction wasn't enough to cover the branch delay. Instead, it became a legacy issue that future processors had to deal with for compatibility reasons and made their branch prediction and pipeline logic more complex.

danielmarkbruce2y ago

I don't think he's saying "expose random implementation details". Exposing the wrong details would obviously be bad. He's just saying c's model has significant shortcomings in the world of modern CPUs.

rewmie2y ago

> he's saying c's model just doesn't work well anymore.

The author argues that C's model does not fit the model he defined himself and claims to be the same model used by everyone.

After going through the article, I'm left with the impression that the author's thesis is flawed and relies on a series of strawmen arguments. Among the strawmen we find:

* arguing that speculative execution "were added to let C programmers continue to believe they were programming in a low-level language".

* claiming that "modern processors are trying to emulate "the same abstract machine as a PDP-11"

* "Creating a new thread is a library operation known to be expensive, so processors wishing to keep their execution units busy running C code rely on ILP (instruction-level parallelism)."

* etc etc etc.

I don't think this opinion piece is grounded on reality, let alone is an objective take.

danielmarkbruce2y ago

He doesn't define a model. He just discusses the gap between c's model and a few details of a modern CPU and talks about a few other models.

In your opinion, why was speculative execution added? It doesn't seem off base to suggest it was to enable programmers to continue writing single threaded applications while increasing execution speed.

In your opinion, what is wrong with the statement that modern processors are trying to emulate an abstract machine like PDP-11? To me it seems largely right.

bayindirh2y ago

Counterpoint: Allowing explicit control over prefetching or providing an additional "engine" which brings in data to caches in a pattern you like is esp. beneficial in real-time and latency sensitive applications.

I have listened a talk where developers used such subsystem in the given processor. If unused, they would spend 95% of their time window just to copy the data, however by requesting the data ahead of time via that engine, they only used 10% of their time window to get the data, and accomplished what they wanted in ~50% of the time window they have, leaving tons of time for further features and improvements.

If x86 had a such feature, I'd use that in my Ph.D. to request the matrix data I'm accessing ahead of time, because the pattern I use is not linear but well defined. Now, if I want to accelerate that code further, I need to reorder my matrices to make the prefetcher happy, and refactor the whole codebase from top to bottom.

monocasa2y ago

PREFETCHh: Prefetch Data Into Caches

https://c9x.me/x86/html/file_module_x86_id_252.html

theamk2y ago

generic x86 does not, but SSE extension (present on all modern CPUs) does have it! and you naturally can use C to call it via intrinsics, because C is low level after all...

https://stackoverflow.com/questions/48994494/how-to-properly...

pjmlp2y ago

C# also has the same intrisics, I guess C# is low level after all...

monocasa2y ago

And it's mandated to be available on all x86-64.

bee_rider2y ago

Is the pattern linear for some stretches or something like that? I’m wondering if reordering your matrices is the right strategy anyway—you want to load a whole cache line at a time I guess, being able to program the prefetcher wouldn’t get you that, right?

fanf22y ago

x86 does have a prefetch instruction, and it’s available as a compiler intrinsic in gcc and clang. But it’s really difficult to get a performance improvement from explicit prefetching on modern big CPUs because their dynamic prefetchers are very clever. Your data access pattern needs to be something the prefetcher does not know how to match, and you need to be able to prefetch addresses before the speculator would get to them by itself. Unlikely for matrices unless they have a very weird shape.

JonChesterfield2y ago

You may like the x86 instruction called 'prefetch'.

frankreyes2y ago

Wasn't this also similar for itanium? Where the branch burden would be on the compiler?

dataflow2y ago

> One example of this is branch delay slots

That's the only example I'm aware of. Are there others? (I'm sure you could do it poorly if you wanted to, but how much history is there to extrapolate from?)

JonChesterfield2y ago

I'm unimpressed that x64 assumes function calls write the return address to stack memory instead of passing it in a register. It means you have to use the stack memory in order to benefit from the call/ret prediction hardware, even in functions which don't otherwise need to allocate any stack memory.

In general leaking microarch weirdness matters less if you don't have backwards compatibility.

fanf22y ago

Vector instruction sets expose too much about the size of the CPU. They have got bigger over the years, 128 - 256 - 512 bits, 8 - 16 - 32 registers, and now Intel is struggling to fit them comfortably into their small efficiency cores and retain binary compatibility with their big performance cores.

usrnm2y ago· 15 in thread

Of course it isn't, but what's the alternative?

lproven2y ago

My favourite article about C in years.

To answer your question off the top of my head, answering different bits of the issue, from the perspective of the era of active programming language R&D not themes on themes on themes as we have now...

Limbo, Occam (Occam-pi, etc.), APL (I/J, Aplus, etc.), Oberon (Oberon 2, Oberon 07, Active Oberon, Zennon)...

usrnm2y ago

I'm not familiar with these languages, but which of them is closer to the actual modern hardware than C, while still being abstract enough to be portable?

lproven2y ago

In what way did I imply that any of them were in any way closer?

That was not my intention at all.

You asked what alternatives there were. C is a systems implementation language, designed to be compiled to object code that will run on the bare metal.

I offered some examples of alternatives to that role, as I thought you asked. I did say that they explored different aspects of the problem.

As I said to someone else upthread:

It does not need to be a relative statement in order to be correct.

The statement "C is not close to the instruction set of a modern CPU" does not need to be validated by specifying examples of languages that are closer.

1 more reply

agumonkey2y ago

While I'm reading about limbo and occam, what do you think apl and oberon can express that C cannot ? talking about low level electronics benefits (apl array idioms are superb for sure)

lproven2y ago

I recommend Sophie Wilson's talk on CPU architectures for some interesting insight into this.

https://www.youtube.com/watch?v=6lOnpQgn-9s

It's worth the time, IMHO, and I dislike video presentations. This one is different.

She designed the ARM processor (and BBC BASIC before that).

pjmlp2y ago

Bounds checking by default.

Actors, more precisely active objects in Active Oberon, the only one still actively being developed at ETHZ from Oberon linage.

2 more replies

olafura2y ago

One example given in the article is Erlang VM which maps a lot better to modern processors.

We currently have a problem where we can't have thousands of cores because, even today, so much code is designed to be fast on one core.

We really have to move the asynchronous programming because synchronizing async hardware is both complex and inefficient.

RISC V is probably going to help since it allows for a lot of experimentation.

creshal2y ago

The article does mention a few areas of interest:

- Languages with "better" (=more modern hardware friendly) loop constraints are easier to parallelize (Fortran, Erlang, …)

- CPU architectures with better programmable vectorization (ARM SVE, Risc-V VE) are much easier to work with, if the language primitives allow it (see above)

Porting software over to fortran/erlang on aarch64 is something you can already do today, if you want to. Rust/Zig/etc. and RISC-V could have a good opportunity here to figure out better ergonomics for vectorization and more hardware friendly cache coherency policies, too, but no clue if anyone in the relevant standard gremiums cares.

In terms out "but what can I easily use as drop-in replacement?" Yeah, we're kinda stuck with C and languages that inherit its problems (current Rust/Zig/etc. included).

xet72y ago

Rust/Zig does not have enough portability, there is errors trying to compile to s390x:

https://github.com/wekan/wekan-node20#trying-to-compile-llvm...

C89 compiles to 30+ CPU/OS:

https://github.com/xet7/darkesthour

ahoka2y ago

GNU assembler, nasm, if you really need to go low level, usually you don’t.

papruapap2y ago

isnt assembler a high level language nowadays?

d_tr2y ago

Why would you think so? Assembler is what it has always been, i.e., mnemonics for machine instructions. Unless you are thinking about microcode, which is nothing new and I am not sure it should count as a "level" from the perspective of a programmer anyway.

1 more reply

illys2y ago

Your question seems provocative... but that's a very good question. I've always liked assembly programming and I got very puzzled when I discovered the processor metal have gone very far from the x86 instruction set I was writing. Il felt like the magic was gone.

Indeed there is no direct match anymore between instructions and gate combinations on the processor die. There is a microcode translating x86 instruction into whatever electronics are below. Change this microcode, and you could have your processor speak a different binary code (matched to a assembler language).

rfoo2y ago

Rewrite the world in Rust /s

The real answer is: none. There are two problems, the first is you have to rewrite the world with the new language and hardware.

The second is, unfortunately, language enthusiasts who are willing to rewrite the world AND can get job done want a language to target a sequential abstract machine (i.e. look like C).

pjmlp2y ago

C++ is serving me well staying away from C as much as possible, since 1993.

Gazoo1012y ago· 13 in thread

I think this statement at the end of the article - 'There is a common myth in software development that parallel programming is hard.' - is misleading. Granted the author denotes explicit situations where it is not hard, but if it's applicable in general, then it is hard. Not a common myth.

Is parallel programming hard? Without any further details or specifics, yes it is. It is far harder to conceptualize code instructions executing simultaneously, than one-at-a-time in a sequential order.

unblough2y ago

> Is parallel programming hard? Without any further details or specifics, yes it is. It is far harder to conceptualize code instructions executing simultaneously, than one-at-a-time in a sequential order.

If I program (map inc [0 1 2 3]) is it really any more difficult to conceptualize the (inc ) function performing on each element sequentially than in parallel?

I think the difficulty of parallel programming is less innate and more two fold:

1) languages often default to sequential so to do async requires introducing additional primitives to the programmer

2) knowing when to effectively use parallel programming

When I have a list or stream that I know has independent elements that require wholly independent calculations then parallel programming is straightforward

Where people get hung up is trying to shoe horn async where it is either unnecessary (performance is equal or worse than sequential) or introduces breaking behavior (the computations are in fact interdependent).

mgaunard2y ago

Most problems are not embarrassingly parallel.

(Fun fact: I once had someone call HR on me because they didn't know embarrassingly parallel was a technical term, and they thought I was belittling them)

rfoo2y ago

Prefix scan is not embarrassingly parallel. Yet OP's statement still works when you change it to scanl (+) 0 [0 1 2 3]

1 more reply

Gazoo1012y ago

I agree that if we define the individual instructions to always be wholly independent, then sure, it is more straightforward.

While I'd probably argue that it is still more difficult to conceptualize, the statement we're discussing is presented as broad and general. I'd call it far less misleading if it said something like:

There is a common myth in software development that parallel programming *has* to be hard.

kortex2y ago

The whole reason async is even a thing is due to slow, side-effect producing operations. Of course pure functions are easy to parallelize.

I don't think folks so much "shoe horn async where it is unnecessary" as the red/blue problem causes async code in most languages to spread.

Or by "async" do you just mean concurrent code? I'm reading "async" to mean lightweight coroutines or similar.

1 more reply

wrsh072y ago

I don't think this is right. Thinking about operations on matrices is not complex. Defining how a single agent should act on its environment is not complex

When you say "without further details or specifics" you're saying "using my default framework of a c/ c descendent world"

The author's point is that sequential programming is one type of simple programming, but it's not the only type, and it doesn't map easily to modern hardware

Gazoo1012y ago

The author's article generally focuses on C (and possibly descendant languages), but the phrase I am critical of, does not. Furthermore, I explicitly consider a very broad selection of programming languages (many not C-derived) in my opinion. The author's phrasing, I'd argue, paints the entire concept of parallel programming as not hard.

There's some irony to the fact that you re-interpret my opinion as being very specific to C and (indirectly) posit that - in that specific case - parallel programming is hard, and then yourself go on to select a very specific case where parallel programming is not hard, because some matrix operations are independent.

I agree that there are languages that are explicitly built to make parallel programming easy. But in general, and not just related to c or c descendant languages, parallel programming is hard.

wrsh072y ago

My point (and I think the points of others responding to you) is that parallel programming is not always hard. That's also what the author is saying.

The common myth - you're doing parallel programming? That sounds hard

It's not always hard. It really isn't! You don't need to be a genius or an expert to write parallel code.

Maybe where we're getting caught up is Cassie K's comment on ml engineering. You don't need to know how to build a microwave to use a microwave. In the same way, you don't need to be a genius or some deep expert in distributed systems to use abstractions that parallelize your programs

To write a parallel program does not require that you know what a mutex is. It just needs you to understand some simple algebraic (6-8th grade) properties about your functions (and, in fact, for library functions, they can be annotated as associative)

There is a broad spectrum of parallel programs. Somebody using a web server implementation? They've made a parallel application

Somebody running tensorflow or pytorch? Also parallel! Even for simple stuff!

You could be a beginner programmer and be taught to make parallel programs without understanding distributed systems. It's not always hard. It's not generally hard. The complex bits are hard. The simple bits use 8th grade math.

1 more reply

ethbr12y ago

Agreed. The potential state space in parallel processing is a lot larger, which makes it more complex, which makes it harder.

That Erlang exists and people use it successfully does not mean that harder things aren't.

fanf22y ago

Concurrent programming - doing lots of different things at once - is hard. It is hard to use concurrent programming infrastructure (processes, threads) to implement parallel algorithms. Parallel programming - using lots of processing elements to work on the same thing at once - is much easier if you have the right abstractions.

danmaz742y ago

I wouldn't say that it's hard to conceptualize instructions executing in parallel, but it's hard to coordinate those parallel subtasks in an efficient and correct way - except in some use cases, like eg matrix multiplication.

grotorea2y ago

Isn't it distant from how humans work? We can't really do parallel, can we? And programming is translating human instructions to computer instructions, and translation is harder between more distant languages.

wrsh072y ago

A factory is parallel

Or do you mean an individual can't do things in parallel?

Like.... Pushing all of those grocery carts in a long line is moving them in parallel

Or do you mean processing? Like thinking?

throwaway875432y ago· 12 in thread

Other than assembly, which barely qualifies as a language, what programming language is lower than C?

lproven2y ago

It does not need to be a relative statement in order to be correct.

The statement "C is not close to the instruction set of a modern CPU" does not need to be validated by specifying examples of languages that are closer.

danielvaughn2y ago

But if you're going to say that "C is not a low-level language", then yeah you kinda do need other languages beneath it.

lproven2y ago

Well, firstly, I'm not saying it.

But no. That is what I meant when I said this is not a relative statement.

If the title said "C is not the lowest-level language" then your objection would be valid... but it doesn't and it's not saying that.

But before I go into some lengthy explanation: have you read the article, or are you responding to the title alone?

dist1ll2y ago

In general terms, any language aiming to be lower-level than C should

- have an "abstract" machine that is more concrete than C (and by extension less portable)

- be easier to lower into optimal assembly (especially loop ops)

- give you strong and precise compile-time guarantees about memory layout (padding, bitfields), variable sizes, register spilling, stack usage, etc.

pjmlp2y ago

Plenty to chose from since 1958's introduction of JOVIAL, when one cares to research what has happened in the world of systems programming outside Bell Labs, and UNIX/C taking over the server room.

casparvitch2y ago

Forth/joy maybe?

humanrebar2y ago

LLVM IR

giaour2y ago

Fortran, maybe?

drsopp2y ago

Joker_vD2y ago

Or T3X9, if you're prefer Algol-style syntax.

Nzen2y ago

I think there is an argument that Brainfuck [0], et al, is lower than C, given that it eschews variables and functions.

[0] https://esolangs.org/wiki/Brainfuck

AlecSchueler2y ago

Low level means close to the processor, not small in scope.

You could argue brainfuck is machine language for a theroetical infinite tape machine, but such a machine can only exist when implemented in high-level software.

somat2y ago· 12 in thread

I don't think it was ever claimed that C was a low level language. In fact I have always heard it as the canonical reference for an example of a high level language. I will admit that in this day and age C feels like a low level language.

Lower level is something that maps more directly to machine operation (assembly, maybe forth).

Higher level is something that has it's own semantics of operation and need to be converted to into the machine operation, the more conversion the higher the level.

woodruffw2y ago

It is somewhat common to describe C as “low level” in introductory programming or CS classes (before the student would know what an abstract machine is). Lots of people carry misunderstandings from that early simplification forwards in their careers, especially if their only interaction with C is academic and not professional.

philipov2y ago

When C was declared to be "high level", there did not exist a level higher than C. Now that there are more levels above C, it is not the highest level language out there, which makes it low level compared to them. It is not the lowest level, but it's not a misunderstanding to call it low level. The playing field is not the same as it was 50 years ago, so relative terms like "low" and "high" have naturally changed referends.

coldtea2y ago

Huh? Tons of languages at levels "higher than C" existed at that time C was created, and they were popular too.

LISP (1960), Smalltalk (1972), BASIC (1963), FORTRAN (1957), COBOL (1959) and countless others. Heck, ALGOL (1958, 1968) was much higher level than C too.

pjmlp2y ago

Yes it did, outside Bell Labs.

noirscape2y ago

It's also that the definition of high vs low level has shifted in the past decade.

Nowadays a "high level language" is one where the person using it doesn't necessarily have to think about memory usage and allocation, since that's the task of a garbage collector - you accept a small amount of inefficiency in order to get a program that works "good enough" in 99.9% of all cases (since we're not on ancient devices anymore and most programmers don't write code that upsets the garbage collector in novel ways). By this criteria, Java, C#, Python, JavaScript, Ruby and so on are "high level languages" in that the programmer rarely has to think about this sort of thing; the underlying GC takes care of memory concerns. There's a reason you see these languages used more for end-user tools like webdev, scripting and desktop applications - the penalty is considered worth it (since it often ends up only shaving off milliseconds at most).

By contrast a low level language basically makes the programmer an active participant in memory management, with all the footguns that come with it. C and Rust are both two extremes of this - C just lets you do whatever, any form of memory control is up to you, segfaults included. Meanwhile Rust deliberately prevents you from doing anything that could possibly cause segfaults through its borrow checker. In some ways C can give you a lot more freedom to be efficient in how you allocate/deallocate your memory (or in the case of Rust - write code that is always memory safe), but you do trade things for it (in C you basically have to be really meticulous about free()-ing memory while in Rust you have to eat a lot of complexity upfront to not upset the borrow checker).

Also contrasting to high level languages, the modern domain of lower level languages tend to be things like drivers, kernels, RDBMSes and the like, rather than conventional user-facing applications (which it also was used for in the past since most of the previously mentioned languages are either pretty young or took quite some time to mature). Still useful, just a different set of expectations, since those are the components that have to be fast so the rest doesn't have to be as hyperefficient.

adamrezich2y ago

> in C you basically have to be really meticulous about free()-ing memory

only if you malloc()/free() for every allocation/deallocation. if you use any other allocation strategy then this is never an issue.

for example: see the "Rewriting the memory management" section in this article: https://phoboslab.org/log/2023/08/rewriting-wipeout

> I'm not sure what the original PSX version did, but the PC version had a lot of malloc() and little fewer free() calls scattered around. Now I can assure you that the game doesn't leak any memory, because it never calls malloc().

> Instead, there's a fixed size statically allocated uint8_t hunk[MEM_HUNK_BYTES]; of 4mb that is used from both sides:

> A bump allocator takes bytes from the front of the hunk. This is used for everything that persists for many frames. When the game starts, it loads a bunch of assets that are needed everywhere (UI graphics, ship models and textures etc.) into this bump allocater and then remembers the high water mark of it. When you load a race track, it loads all assets needed on top. After finishing a race, the bump allocator is reset to the previous high water mark.

> On the other side, a temp allocator takes bytes from the end of the hunk. Temporary allocated objects need to be explicitly released again. This is used when loading a file into memory. The file is read at once and unpacked onto the bump allocated side. When done, the temp memory for the file is released again.

> Temporary objects are not allowed to persist over multiple frame. So each frame ends with a check to ensure that the temp allocator is empty.

> Somewhat related, the OpenGL renderer does the same with the textures: It bumps up texture memory (more precisely space in the texture atlas) and resets it to the previous level when a race ends.

if you use a system like this—either malloc() just once (or a few times) at the start of your program and then never manually free(), or just use statically-allocated arrays—then you never have to worry about "meticulous free()ing". I'm not sure why this never seems to be taught in early CS courses that teach C—it seems that basically everyone comes away thinking malloc()/free() OCD is the only way to manage memory with C, and is thus undesirable compared to the ease of use of garbage collection.

Alifatisk2y ago

> I don't think it was ever claimed that C was a low level language.

When I was introduced to C during high school, my teacher presented C as a low-level language compared to what we previously studied (which was Ruby).

And I just ate that up because C looked less readable than Ruby, today (10 years later) I have to disagree with my teacher. C is not a low-level language, it has access to the lower level parts, sure. But it is an high level language!

denton-scratch2y ago

> I don't think it was ever claimed that C was a low level language.

It was introduced to me as "glorified PDP11 Assembly Language". So the claim has been made at least once.

Granted, there are people here commenting that maybe assembly language is not "low-level". I'm lost for words.

Normal_gaussian2y ago

Im curious - what is it about forth that makes you consider it to map more closely to the machine?

I've done a handful of forth projects as part of a code-dojo years ago. I wouldn't have considered it low-level.

somat2y ago

Fort is strange, it requires a virtual forth machine to run(I have heard of hardware that runs forth directly but it is exotic), this should automatically exclude it from the low level camp. however this machine ends up almost trivial to write and is very simple. so once you start writing forth it feels low level, like there is very little between you and the cpu. As a consequence, just like assembly, forth people tend to reinvent the world.

Note that I am not far in the forth rabbit hole at all, any interest I may show is incidental, a side effect of my interest in postscript, which is very much a high level language.

creshal2y ago

Forth is easy to make custom hardware for, even if it's a poor fit for the commonly available hardware architectures. RPN + Stack lends itself to a very simple implementation (no registers needed, easy layouting, etc.).

domust2y ago

It's fortran, not forth.

1 more reply

abainbridge2y ago· 10 in thread

This article is correct that your computer is not a fast PDP-11 but wrong that this has anything to do with C. Eg, "another core part of the C abstract machine's memory model: flat memory. This hasn't been true for more than two decades."

This has nothing to do with C. The hardware insists on this abstraction. And its a good job too, otherwise your programs would stop working when moved to a machine with different cache.

semiquaver2y ago

The article argues that the hardware’s insistence on this abstraction is in large part _because_ of C’s dominance.

mgaunard2y ago

If only that were true. Lots of languages that have nothing to do with C also did it. It's just much easier to program with a unified memory model, that's all there is to it.

Joker_vD2y ago

If by unified memory model you mean "flat address space" then no, it's not. The moment you need two (or more) dynamically-sized arrays, you need to implement realloc with memmov in the unfortunate case. In a world where each array could have it's own segment, this problem doesn't arise because they simply cannot intersect and realloc boils down into increasing/decreasing a segment's extent.

2 more replies

creshal2y ago

Many of those languages indirectly have lots to do with C – even if you ignore the obvious problems like "C is the only ABI supported by most OSes, C FFI is the only cross-language interface supported by most languages and thus most libraries", there's more subtle influenced: Copying e.g. the (very expensive to implement in hardware) cache semantics of C usually "costs" languages nothing, because the hardware is already there, due to C. Not copying them happens, if both language and hardware get developed at the same time, but it's much rarer.

You see similar problems with things like vectorization – Rust was in a good position to define semantics more amenable to ARM SVE / Risc-V VE, but all existing SIMD libraries are written for C and x86 semantics, so that's what Rust is currently stuck with, as are most other languages.

1 more reply

abainbridge2y ago

But we have/had architectures that expose parallelism at the instruction set. Eg itanium and graphcore. And the PS2 made cache management the programmer's problem. I don't think any of these experiments proved successful in the long run.

gpderetta2y ago

Hence the observation that every architecture eventually converges to NUMAcc if physically possible.

nitwit0052y ago

People have written C code that dealt with more complex memory schemes.

The language matters less than the fact that there's a lot of existing code around. That code needs to keep working.

fanf22y ago

Yeah. A lot of the things that make C not low level in the terms of this article happened on IBM mainframes decades before x86:

* tiered memory hierarchy pretending to be flat RAM

* CPUs that are much bigger than the ISA suggests, and which have out-of-order and speculative execution so code can make good use of their resources

* optimizing compilers that further decouple the program as written from its execution

IBM was working on this stuff in the 1970s, well before the rise of C. It’s fair to criticize the model and seek out alternatives, but it isn’t fair to blame C.

JonChesterfield2y ago

Flat memory is a bad thing for performance. Especially cache-coherent flat memory. It is convenient for programmers.

abainbridge2y ago

Yeah, agreed. My comment should really have been, "I'm glad modern ISAs are high level because low level ones would be a massive burden". And, "It isn't C's fault that low level ISAs are a massive burden".

thatjoeoverthr2y ago· 8 in thread

Reminds me of VLIW. As per Wikipedia, from the Itanium page:

> One VLIW instruction word can contain several independent instructions, which can be executed in parallel without having to evaluate them for independence. A compiler must attempt to find valid combinations of instructions that can be executed at the same time, effectively performing the instruction scheduling that conventional superscalar processors must do in hardware at runtime.

If your CPU exposed the single-stream parallelism at the interface, you can do it at compile-time or even decide it with in-line assembler.

I wonder if it hasn't caught due strictly to the business dynamics of the industry, or are there technical reasons this isn't really a good strategy?

Joker_vD2y ago

Well, IIRC it didn't caught on mostly because of a) compilers weren't really that good at that kind of instruction scheduling (and when they improved, Itanium has sunk already), b) conventional ISAs (that is, x86) got quite good at doing this in hardware, at runtime, and actually deliver slightly better results than static scheduling precisely because they do it at runtime, when profiling data is available.

I believe Linus has a good even if tangentially related to this exact topic rant at [0]. "While the RISC people were off trying to optimize their compilers to generate loops that used all 32 registers efficiently, the x86 implementors instead made the chip run fast on varied loads and used tons of register renaming hardware (and looking at _memory_ renaming too)."

[0] https://yarchive.net/comp/linux/x86.html

fanf22y ago

Static scheduling, even with profiling, can never be as good as dynamic scheduling for general-purpose workloads. VLIW/EPIC can do well for HPC-style number crunching, but that isn't everything. https://news.ycombinator.com/context?id=37900987

JonChesterfield2y ago

One can move complexity back and forth between compiler, runtime and processor implementation to some extent. VLIW works really well in some niches. It's harder to program than single instructions that execute in sequence, either by hand or by compiler, but it simplifies the scheduling for the hardware. Works better if the bundled instructions have similar latency.

The key design puzzle at present seems to be that memory access takes many more cycles than arithmetic. Bundling a few cycles of arithmetic with a few hundred cycles of memory load is kind of pointless. So VLIW works well if you know memory access is going to be fast, which roughly means knowing it'll hit in L1 cache or equivalent. I think that's part of why it suits DSP style systems.

Exposed pipelines are an interesting quirk of some of these systems. One instruction in a VLIW bundle writes to a register and subsequent instructions that read from that same register will see the previous value for N subsequent cycles, after which the write becomes visible. They're really confusing to program by hand but compilers can deal with that sort of scheduling.

gpderetta2y ago

Because static scheduling is terrible for non-DSP and non-HPC loads like the typical server or desktop application where the control and data flow is very input dependent. Until recently DSP and HPC were a tiny fraction of the market so architectures capable of dynamic scheduling dominated even those markets as they had more investment.

With GPUs of course things have changed and in fact GPUs relied more on static scheduling, but even there as they expand to more varied loads, they are acquiring more dynamism.

fanf22y ago

See my other comment for why VLIW was technically flawed

https://news.ycombinator.com/context?id=37900987

thatjoeoverthr2y ago

Im reading that TeraScale (AMD) works this way. Itanium is a major attempt to ship it in a CPU. I guess AMD64 and ARM rule the day but maybe in the future we'll see it again.

JonChesterfield2y ago

Terascale was a vliw, worked well as far as I know. The current amdgpu architectures aren't - those are multiple execution port systems, reminiscent of the x64 setup.

Qualcomms' Hexagon is a vliw, I think that's contemporary. Graphcore's IPU is two instructions per word.

lizknope2y ago

Are you asking why VLIW hasn't caught on? There are DSPs that use VLIW concepts. But for general purpose computing look at Itanium and it's failure.

Waterluvian2y ago· 3 in thread

My sense is that this is really a communication issue (when is it not?)

On a relative scale, C is very low level compared to how we program today if you think about levels of abstraction.

If “low level” means “runs on the CPU almost literally as written.” then no it’s not.

fweimer2y ago

It's more than that. C-the-language just doesn't have low-level concepts such as machine addresses, and its facilities for dealing with the types that the abstract machine ascribes to all objects are quite limited.

Ada has System.Address to model machine addresses:

http://ada-auth.org/standards/rm12_w_tc1/html/RM-13-7.html#p...

C++ has std::less specializations for pointer types which provide a strict total order (one aspect of machine addresses):

https://en.cppreference.com/w/cpp/utility/functional/less

There is also placement new and std::launder for more explicit control of typed memory:

https://en.cppreference.com/w/cpp/language/new https://en.cppreference.com/w/cpp/utility/launder

These days, even Java tries to model machine addresses:

https://docs.oracle.com/en/java/javase/21/core/foreign-funct...

kortex2y ago

Yep, this is a linguistic problem, not a technical one. "C is not a low level language" implies that the hi/lo boundary lies below C. What's below C? IR, Asm, and opcodes.

IRs like LLVMIR and various bytecodes. Well, those don't map to the hardware 1:1, not even close. So IR must be HLL.

Sure Asm has to be architecture specific, but even then we are getting pretty good at transpilation. And those codes get translated to opcodes anyways on most modern chips.

Basically, unless you are assembling on an ancient system or embedded processor, you aren't writing in a "low level language". Very few folks nowadays do this, so the term "LLL" doesn't occupy much mindshare in semantic space. That leads folks to populate it with what they perceive as low level - the lowest language on the abstraction tree they are likely to encounter - C.

This divide is only going to expand so I say we just accept the definition of low level language has shifted, and call anything where it does closely match... something else, I don't have a good term. Maybe "hardware level language".

falcrist2y ago

> If “low level” means “runs on the CPU almost literally as written.” then no it’s not.

But doesn't this still depend on what CPU you're talking about? Your C code will map much more closely to the instructions of the machine code of an 8051 or even an M4 than it will to an x86.

Thus any general-purpose language is more or less "low level" depending on the CPU it's running on. This seems like a poor definition.

PhilipRoman2y ago· 2 in thread

>take a property described by a multidimensional value

>project it into a single dimension

>split it in the middle, thus inventing two useless artificial categories ("low level", "high level")

>get a bunch of highly functioning hackernews 0.1xers to argue endlessly about said useless categories

>submit weekly articles "thing X is NOT in my imaginary category Y!!!"

>profit

Arguing whether or not C is a low level language is about as useful as arguing whether dog-headed men have souls

Next up: IO is not a Monad, x86 machine code is not a low level language, RISC-V is not actually RISC, GPL is not actually open source and so on

ad404b8a372f2b92y ago

The taxonomy is not the point of the article. The point of the article is about language and hardware development interactions and whether we are locked in a paradigm in which our reliance on C prevents us from taking advantage of hardware innovations, and in turn create languages which properly makes use of such hardware.

PhilipRoman2y ago

I see that point, but I still think the article is completely wrong.

My disagreement with the article (aside from the flamebait title) is that many of the things the author calls C problems are actually general computing issues. The reason highly threaded processors are not the norm is not that C can't take advantage of them (it does it just as well as 90% of other languages). The problem is that most problems aside from specialized domains are either highly sequential or require too much synchronization.

Regarding the immutable memory model example - C does not place any limitations at all. Just declare that modifying such an immutable object is undefined behaviour and let programmers figure it out. Memory already has its complexities with NUMA and such, C programmers have no issue taking advantage of these features.

Or maybe take TSX as an example - I'm fairly sure the PDP-11 did not have anything remotely close to Intel TSX and yet it is easy to use in C. Include <magic.h>, write __magicXYZ() and it just works.

Sure, existing C programs will run slowly on the author's imagined new processor architecture, but so will programs written in any language except maybe some highly restrictive very high level language (like GLSL on GPUs, etc.). But new programs that are written with such hardware in mind will not in any way be limited by C semantics and if they are (like with mistakes in standard such as errno for math functions...), it will be one compilation switch away from being fixed.

HarHarVeryFunny2y ago· 1 in thread

If the sophistication of modern CPUs makes C no longer a "low level" language, then the same applies to assembly language .. things like out of order execution and register naming applies there too.

I guess the sophistication of compilers in recent decades adds to the argument since even the assembler (object code) the C compiler generates isn't going to be as expected due to hoisting things out of loops, common subexpression elimination, etc, etc.

Still, I think the notion of C being a "low level" language is still a useful label ... if not we need to retire this designation altogether.

marcosdumay2y ago

Assembly is just a bit lazy, macro-expanded, and the computer's memory address a made up concept.

That's indeed an abstraction over the real computer, but it's a lot less things piled up on your virtual computer's model than C. Current assembly is about on the same level as C was when it was created. Current C is so high-level that it doesn't provide any functionality you can't get with a better, more modern language.

But yeah, I do agree that "low" and "high" level aren't useful names nowadays.

openasocket2y ago· 1 in thread

I feel like the article advances on two different lines of argument that are difficult to reconcile. The first is that C is not a low-level language, and gives examples like struct padding and signed overflow being undefined behavior. That part makes sense to me, and the argument seems constructive: it seems to propose language features for a hypothetical "real low-level" language.

The second argument is that, because of the dominance of C, CPU designers have had to bend over backwards to create something that runs C naturally. Here there are examples like register renaming, flat memory, caching, etc. This argument also makes sense to me, but in the context of the first argument, and the title of the article, I'm not sure how it relates. Taken at face value, this seems to imply that it isn't even possible to create a low-level language on modern hardware, and even machine code is "high-level". This seems to argue that we would have to create a new generation of hardware that exposes much more complexity to the instruction set architecture, and only then could we design a low-level language to take advantage of that.

I think both of these arguments have merit, but it's a little disconcerting to put both of them in the same article, and to make the title "C is not a Low-Level Language". I suppose the first argument could go here, and the second argument could have been done in a follow-up article entitled "Machine code is not a Low-Level Language Either".

Phrodo_002y ago

Intel's IA-64 supposedly exposed lower levels of the processor to machine code, but I hear it took ages to compile, and compilers never really got to the optimization levels they were expecting (and not being compatible with x86 also didn't help adoption)

bee_rider2y ago· 1 in thread

I think the title that the authors decided to give this article was unnecessarily provocative in a distracting manner. I’m pretty sure there is a technical definition of low level language they are referencing that excludes C, and pretty much only includes assembly as a low level language. Ok, fine, whatever.

Their bigger point seems to be that C is no longer very mechanically sympathetic to huge modern cores, because the abstraction pretends there’s only one instruction in flight at a time. Is anyone aware of a language that fits the hardware better? Maybe Intel needs to release a “CUDA of CPUs” type language.

ori_b2y ago

It doesn't even include assembly.

bazoom422y ago· 1 in thread

Apparently there is plenty disagreement about what “low level” means. Historically assembler was considered low level, and languages like C with named variables and functions and nested expressions were considered high level. I have also seen C described as mid-level to indicate it is higher level than assembler but “closer to the metal” than say Java. And apparently it is now called low-level by some - wonder what assembler is then?

In any case, at this point, low level and high level are only meaningful relative to other languages.

The article is questioning how “close to the metal” C actually is, but some of the arguments also applies to assembler, which is not that close to the metal either these days.

chefandy2y ago

Yeah-- my intro C Programming class 20+ years ago began with the professor saying "This class is about C, a high-level programming language used for most UNIX system tools because trying to write all of those things in assembly would make you want to burn your keyboard."

It seems like the distinction between C and Assembly these days is less important than the distinction between C and say... Javascript. Which is fine by me- English is descriptive and the people who work in Assembly aren't going to get confused by it.

bluGill2y ago· 1 in thread

Assembly is not the lowest level language you can work in. I've programmed in raw binary opcodes before, that is the lowest level. (though there is a valid argument that microcode is even lower level - I disagree but still acknowledge the argument is valid) Often a single assembly language instruction can be one of more than 30 different opcodes as registers are often encoded in the opcode. Of course at this level you have to have your CPU instruction manual as they are all different.

toinewx2y ago

agreed

cmrdporcupine2y ago· 1 in thread

Yes, C is a set of abstractions like any other language (even assembly.) which attempt to mimic a machine of far less complexity.

Unfortunately it's also the wrong set of abstractions for the contemporary era.

That said, if you're working in low-level embedded microcontroller world, C's memory model and program structure does in fact look a lot more like those systems.

AnimalMuppet2y ago

What would you say is the right set of abstractions for the contemporary era? Especially for writing things like OSes and device drivers?

jansan2y ago· 1 in thread

If you have never heard of the PDP-11 before like I did until yesterday (I should probably be banned for this from HN), this is really something worth learning about. There is an awesome project for a PDP-11 front panel replica running an emulator on a Rhaspberyy PI (the whole thing is called PiDP-11, haha). Here is more information:

https://obsolescence.wixsite.com/obsolescence/pidp-11

fsniper2y ago

Mandatory xkcd: https://xkcd.com/1053/

vivekv2y ago· 1 in thread

I am neither a compiler writer or an OS guru. Just an old c programmer. In this article it looks like the entire CPU instruction set was designed to emulate pdp11 to ensure c compatibility. So my naive question. What is stopping a Microprocessor manufacturer to have two instruction sets one that is compatible and one that allows us to fully utilise the modern CPU but with a different programming paradigm? Is that too expensive or hard to do? I genuinely don't know.

GTP2y ago

I think that there's just isn't any commercial incentive, as we have a ton of legacy C code and the CPU will have to run anyway in the "legacy" mode to run your OS, be it Linux, Windows or MacOS.

ori_b2y ago· 1 in thread

If you accept the premise of the article, you also need to accept that assembly is not a low level language, and that it is impossible to program any CPU currently for sale in a low level language.

The abstraction CPUs give you is more or less a fast pdp11 with some vector registers bolted on.

The implementation internally is not.

variadix2y ago

This is pretty much my issue with this article. By its criteria assembly isn’t low level, which makes the claim pretty uninteresting. You could argue there are/were processors that had low level ISAs (VLIW, no OoO exec, no memory reordering, no caches, no branch prediction, etc.) but they are all niche (usually low power embedded DSPs) or failed to capture the market (Itanium) because they failed to deliver performance comparable to the highly abstracted CPU supposedly “designed” to run C (also a questionable claim, I think the reality is it has more to do with sequential execution being the way humans think, a point Jim Keller has made in several interviews).

jokoon2y ago· 1 in thread

I once read that C is the new assembly, because all CPU have a C compiler.

I then decided to make a language that compiles to C, it's just about adding strings, list and tuple. I almost finished the parser and the "translator" will take more time (I encourage anybody to try lexy as a parser combinator). Basically it will use a lot of the C semantics and even give C compiler errors, so it will save me a lot of work.

Of course I am very scared that I will run into awful problems, but that will be fun anyways.

lebuffon2y ago

Everyone knows that CPU stands for "C" processing Unit...

:-)

wzdd2y ago

This is now five years old, and while obviously the premise is more correct than ever (computers don't look much like a PDP-11 architecturally), the conclusion ("imagining a non-C processor") seems less strong. We are seeing (and were seeing, even in 2018) a strong separation between linear and highly-parallel code, most obviously in the rise of Python for machine learning and scientific computing. It is still very convenient, when performance isn't paramount, to write in a single-threaded style and to a flat memory model. When performance is important, it's then appropriate to switch to a language better suited to parallel programming -- one of the computational-graph languages in something like Pytorch, some other set of primitives on top of CUDA, or even something more experimental like Futhark. Performance-critical code has always had its domain-specific languages, and they seem to be becoming more common, not less, and the hardware is being built to match -- as the CPU+GPU combination common to desktop PCs, as vector extensions to x86 (which have their own primitives making, essentially, a DSL of their own), or things like the M1, which bolt a GPU to a CPU to give both high-speed access to the same system memory.

In other words, perhaps what's really out of date is not C, but the concept of a general-purpose language which is equally well-suited to any type of task.

wrsh072y ago

This is one of the most interesting programming articles I've read in a while. And it's well written and easy to read! Don't stop at the (inflammatory?) title.

* We all agree that c gives you a lot of control to write efficient sequential code

* Modern processors aren't merely sequential processors

* Optimizing c code for a modern processor is hard because c is over-specified - in order to allow humans to manually optimize their programs (given the c memory model etc), it's hard for compilers to make assumptions about what optimizations they can make

It doesn't seem like this is a fundamental problem, though, and c could provide symbols that denote "use a less strict model here" (or even a compiler flag, although I bet incremental is the way to go)

karmakaze2y ago

This is a great article (worth reading if interested in performance/parallel computing) but the complications it gets into are mostly in the CPU architecture/hardware to which compilers add additional complexity. Even without the compiler optimizations there's still branch prediction and associated parallel execution of serial machine code.

To anyone debating whether C is low/not-low level language note that this discussion is at a much lower level so 'low' has a lower than common meaning.

titzer2y ago

> On a modern high-end core, the register rename engine is one of the largest consumers of die area and power.

Another red herring. Register rename isn't the result of some PDP fetishizing. It is a direct result of using more hardware resources than are exposed in the architectural model. Even if it were a stack machine or a dataflow graph architecture, register renaming is what you do when you have more dynamic names for storage than static names in the ISA.

layer82y ago

> Consider another core part of the C abstract machine's memory model: flat memory.

The C abstract machine only has a flat memory model within a given malloc allocation (and within each local or static object). Relational pointer comparison between different allocations is UB (see e.g. https://stackoverflow.com/a/34973704).

So C is perfectly fine with a non-flat memory model as long as each object is confined within a flat memory region (by virtue of being allowed to alias it as a char array). You can imagine a C runtime library that provides functions to obtain pointers to different types of memory that don’t share a flat address space.

The only restriction is that pointers must carry enough information to compare unequal if they point to different objects. Of course, you might be able to construct a virtual flat memory model from the bit representation of void* or char*, but that’s not quite the same as imposing an actual flat memory model.

hooby2y ago

"Low-level" is not a perfectly well-defined technical term, and does mean (slightly) different things to different people.

I feel that the article does explain well enough, how the author defines "low-level" for the sake of this article - and the definition being used seems just as fine as any other. And sticking with this specific definition, the conclusions of the article do seem to check out. (But I'm no expert on the subject matter, so I might be wrong about that).

I feel that the "value" of the article lies in challenging certain conceptions about C.

To me, it doesn't really matter if the article is (completely) right or not - the somewhat indignant response I see happening to the title of the article, and the discussion I see about what "low-level" actually means, seems to prove that some dogmatic beliefs about C are pretty deep-seated.

I feel it's always worthwhile to question such dogmatic beliefs.

titzer2y ago

> The root cause of the Spectre and Meltdown vulnerabilities was that processor architects were trying to build not just fast processors, but fast processors that expose the same abstract machine as a PDP-11.

No, Spectre is the direct result of processors speculatively executing code without respecting the conditions that guard the code. Hands down, processors hallucinate conditions in code. It has nothing to do with the particular computational model, but would happen in any system that speculates conditions.

And not just one branch, but a whole series of them. In fact, the processor is usually running with a whole buffer full of instructions that are executing in parallel, having been loaded into the reorder engine using nothing more than (normally highly accurate) statistical predictions.

falcrist2y ago

To make an article about how C maps to the processor and fail to make any distinction between application programming and embedded programming seems strange to me. After all, C is by far the most common language for programs running on micro-controllers, and it actually does map well to many micro-controller architectures in use today.

I'm clearly not the target audience for this article, but I still feel like the author would be well advised to put a little note at the top that says "we're talking about CISC and high-end microprocessors rather than microcontrollers."

I'm also not seeing suggestions for languages that do map well to modern microprocessors.

HumblyTossed2y ago

I've been programming since the mid 80s, started with the C=64. People have been having the argument that C is low-level vs c is not since at least then.

Why do so many smart people waste their friggin' time on such nonsense?

cat_plus_plus2y ago

Computation is only a small part of computing, addressed by languages such as OpenCL and by no means simple, observe constant GameReady driver releases from Nvidia to support each new major game. C is still pretty good at many other parts of low level computing, such as managing state of hardware or allocation of system memory to different tasks. Such tasks are not well suited to parallelism, as they must maintain a globally consistent state.

It is perhaps true that CPUs and compilers should execute C code mostly as it is, with only local optimizations to spare programmer of having to decide whether x + x, or x * 2 or x << 1 is faster for example. This would improve system security and reliability while freeing up time to work on great compute languages for vectorizable computations.

But, at the end of the day, CPU makers and compiler writers are humans motivated by both career success and less tangible bragging rights. So OF COURSE they will chase benchmarks at the expense of everything else, even when benchmarks have little to do with real life performance in an average case. I have a 13 year old 17 inch MacBook pro I use for some favorite old games. When I fire it up, I don't see any differences in my computing experience vs a 2023 laptop. So whatever advances in CPU/compiler design were made since do not seem to help with tasks I am actually interested in.

vonwoodson2y ago

This article begins with victim blaming the software engineers in full-throated support of hardware engineers. If, and I do mean if, anyone should be exalted it is the fact that software engineers have been coping with C as a stable-but-difficult programming language specifically for the benefit of the hardware engineers’ desire to have a stable target. The fact that the specification is ambiguous at all is so that hardware manufacturers can port a reasonably small, expressive, and powerful language to their hardware. And, no, making a new language that targets the platform for the ease of hardware development and exploitation of system-specific benefits is not the answer. In fact, it’s the literal reason why C is still as popular as it is.

Nobody wants to learn your programming language, write thousands-to-millions of dollars worth of software, just to have it become obsolete two days after the new-hotness processor comes out. Been there, done that.

Alternatively, perhaps, we can place the blame on hardware manufacturers who were looking to cut corners for improved performance and produced insecure machines because they lied to us non-expert hardware users about how fast their systems could go and what we were getting for our money.

adamrezich2y ago

I've been working on making games for the Playdate (https://play.date) over the past few weeks, using their C SDK. it's my first time using C in a decade, since I first learned it in college, and I'm having a surprisingly great time with it. sure, there's tons of weird quirks that take some getting used to—but there's a lot that I've been surprised to find that I missed about it! it's fun to write code that does what you tell it to do, without having to worry about object ownership or any higher-level concerns like that—you just manage the memory yourself, so you know where everything is, and write functions that operate on it straightforwardly. if it's been awhile since you've touched C, I highly recommend giving it a try for a small game project.

intalentive2y ago

“The abstract machine C assumes no longer resembles modern architectures” implies that it might be nice to have a language that maps more directly to what is really happening under the hood. I agree. It would be nice to take the guesswork out of, “How should I write this so that the compiled code has fewer cache misses?”

Maybe there is a sweet-spot level of abstraction that allows for more fine-grained control of the modern machine, in the sense that compiled code more or less reflects written code, but not so fine-grained as to be unwieldy or non-portable.

Vectorized code that is native to the language could be done with either map functions or Python / NumPy / PyTorch style slicing, which is fairly intuitive. Multithreaded OTOH I’m not sure there is an easy answer.

OnlyMortal2y ago

When compared to assembler, I’d agree.

I grew up with 6502 and 68k. To me, back in the early 90s, C (Mac MPW C to be precise) was an abstract assembler. The code-gen was perfectly readable.

Compared to the likes of Python, it most certainly is low-level. These types of language allow developers to rapidly get something going and not just because of the libraries.

I’d find it very hard to justify a business position where C has any other role than binding and breaking out into something more abstract. Be that Go or C++, for an example.

An argument I used to hear was “performance” from C. I’m not entirely convinced as in a higher language your algorithm may well be better as you can deal with the abstraction.

But… people make money coding C.

eigenform2y ago

Even before that, this is ultimately about that fact that an ISA for a general-purpose computer can be seen as a way to abstract away parallelism. Even in your favorite assembly language, the effects are largely supposed to happen one after another.

That abstraction is leaky, but the alternative is VLIW machines - even in that case, you probably end up using a compiler so that you don't have to worry about parallelism. Reasoning about parallel things is hard, that's why we spend so much time trying to avoid it ¯\_(ツ)_/¯

phendrenad22y ago

[delayed]

hluska2y ago

If we ever wonder why more people don’t get into low level programming, this article and the responses are an excellent case study. We’re allowed to make what we know accessible to newcomers and many of us should tone down our arrogance when we have really deep technical conversations.

assimpleaspossi2y ago

The paper talks about how C is designed for a PDP architecture and that's the problem. Is there any language that is not that way and can handle parallelism and all the things mentioned in the paper?

Yes, I do see Erlang mentioned but I don't think it was considered a solution.

charles_f2y ago

Interesting take, but I think it goes out of its way to prove the definition of low-level to be wrong, while missing that the definition it gives and claim is wrong, in itself is very flexible.

What is irrelevant? To a data-scientist, typescript is low-level. You're required to think about structure and compile stuff!

To a web developer, C# and Java are low-level because you need to think about the execution platform

To an IT developer, C and C++ are low level because you need to think about memory.

To a game developer assembly is low level because you need to think about everything.

To electronocians everything is high level. To accountants VBA in Excel is low level. To a product manager a word document with any sort of technical words is too low level.

If you need to optimize your software to the point where some CPU specific instructions are required, C is too high level because its hiding stuff that is not irrelevant.

mgaunard2y ago

With the same argument you could even argue that the x86 ISA is a high-level language, since under the hood it's decomposed to micro-ops which are scheduled on a superscalar infrastructure and run out of order.

mpweiher2y ago

I am really surprised that such a bad take has gotten so much airtime, almost as much as that such a gifted developer came up with it.

The only way that the title is true is one that is not mentioned in the article: when C became popular, anything that was not assembly was a "high level language". Heck, even some Macro assemblers were considered high level, IIRC.

The factors that are mentioned in the article fall roughly into two categories:

1. The machine now works differently.

This may be true, but it does so almost entirely invisibly, and the exact same arguments given in the article apply in the same way not just to assembly language, but even to raw machine language.

I have a hard time seeing how machine level is not low level. But I guess opinions can differ. What seems inarguable is that machine language is the lowest level available. And if the lowest available level does not qualify as "low" in your taxonomy, then maybe you need to rethink your taxonomy.

2. C compilers do crazy shit now

This is also true, but it is true exactly because C is a low level language. As a low-level language, it ties execution semantics to the hardware, resulting in lots of undefined (and implementation defined) behavior that makes a lot of optimisations that some people really, really want to do (but which are far less useful than they claim) really really hard.

So C compiler engineers have defined a new language C' which has semantics that are much more amenable to optimisation. Nowadays they try to infer that language C' from the C source code and then optimize that program. And manhandle the C standard, which is intentionally somewhat loose, in order to make the C'' language that looks like C but maps to C' the official C language.

Since they were moderately successful, it can now be argued that C has morphed or been turned into a language that is no longer low level. However, the shenanigans that were and continue to be necessary to accomplish this make it pretty obvious that it is not the case that this "is" C.

Because, once again, those shenanigans were only necessary because C is a low level language that isn't really suited to these kinds of optimisations. Oh, and of course the rationale document(s) for the original ANSI C standard, which explicitly state that C should be suitable as a "portable assembly language".

But then again we already established that assembly is no longer a low level language...so whatever.

ngrilly2y ago

Wasn't it the idea of RISC to have a simpler CPU and push the optimization responsibility towards the programmer and the compiler?

ultra_nick2y ago

I just write English these days and have my LLM compile it to Python, so...

danielmarkbruce2y ago

On the flip side, maybe CPUs are trying to be too general purpose.

BeefyMcGhee2y ago

(2018)

mbfg2y ago

PDP-11 is a fast machine?

dboreham2y ago

2017

acqq2y ago

(2018)

j / k navigate · click thread line to collapse

396 comments

249 comments · 50 top-level

GuB-422y ago· 42 in thread

grotorea2y ago

> C is low level for at least one reason: manual memory management. Especially with modern hardware, memory management is at the center of programming.

Chabsff2y ago

A lot of that is just a property of modern OSs, with good reason, intentionally not exposing these features to userspace processes. It's not really a function of the language itself.

grotorea2y ago

Hmm, true for virtual memory, didn't think of that, but CPU caches are inside the processor, can even the kernel control it at all?

1 more reply

theamk2y ago

The fact that modern memory load operation involves cache, protection, memory mapping, etc.. is not a property of language, but rather of the environment (CPU + OS).

grotorea2y ago

But those aren't abstractions that we can treat as black boxes, we need to know them and how to code taking them into account without actually having control inside the black box.

wang_li2y ago

You can control what goes into cache if you want to. The effort to make an open source bios do this in order to have working memory before the DRAM controllers are initialized.[0]

0. https://www.coreboot.org/data/yhlu/cache_as_ram_lb_09142006....

ilyt2y ago

neither can assembler so it is useless distinction

CJefferson2y ago

2 more replies

quatrevingts2y ago

cmrdporcupine2y ago

squeaky-clean2y ago

> Unless you're working on a microcontroller class system, or other system without an MMU but that's a whole other kettle of fish

jstimpfle2y ago

The point of a handle is that it's use to hold objects, to keep them alive. Raw pointers don't do that.

saagarjha2y ago

Would such a model be generally useful, though?

actionfromafar2y ago

I have thought so for a long time. It could open up execution of functional languages on a truly distributed runtime. Something like the fabled Tao operating system I guess.

cmrdporcupine2y ago

Definitely useful in some systems context, especially e.g. database page buffer management.

anonymous_sorry2y ago

> It would be entirely possible to walk away from the libc & C model entirely and work in a world of pure references interacting directly with VM subsystem pages

Is this possible in Ring 3? Or would everyone be running in kernel mode at that point.

Even if you do away with that layer, then there may still be a hypervisor lying to the kernel about memory.

pornel2y ago

You don't get direct access to the stack in C either. Stack frames are abstracted away, and you only get longjmp.

If you pay attention to Undefined Behavior and strict aliasing, you don't even get that much access to poking around memory.

1 more reply

pjmlp2y ago

BASIC also can do manual memory management, not only that, it had a whole computer generation for itself, in computers not able to have a full ISO C implementation.

theamk2y ago

pjmlp2y ago

That is hardly different from malloc(count * size), REDIM exists (aka realloc()) and many BASICs do offer the free variant as well.

In fact, there is hardly any difference between VMS BASIC and VMS C in terms of what is possible, if we want to take the discussion outside of 8 bit versions.

marcosdumay2y ago

> Why is C unsafe? Mostly memory.

C can't even do all of integral arithmetic safely. It's a language that goes really out of its way to add unsafety.

bensecure2y ago

Hirrolot2y ago

> I'm pretty sure that this is one of the unsafeties that rust borrows from c

But integer arithmetic is safe in terms of Rust.

> Checking every addition adds a massive slowdown

It only does so for debug mode. In release mode, it uses modular arithmetic.

5 more replies

shrimp_emoji2y ago

But 53 years later, it's added the `<stdckdint.h>` header, offering `ckd_add()` and friends. :D Better late than never!

diogenes42y ago

Integers are an abstraction on top of words; words are perfectly safe.

rewmie2y ago

> C can't even do all of integral arithmetic safely.

Your comment reads like nonsense. Are you able to provide what you feel is the best example that substantiates your claim?

DashAnimal2y ago

When comparing a signed and unsigned integer, the signed integer is promoted to unsigned integer.

So if you have a = -1, b = 1000 and compare the two, a > b is actually true.

1 more reply

aidenn02y ago

Signed integer overflow is UB in C.

1 more reply

theamk2y ago

monocasa2y ago

There's plenty of cases in C where the use of a + operator doesn't result in any form of add instruction being emitted.

MrBuddyCasino2y ago

> C is low level for at least one reason: manual memory management.

johnnyjeans2y ago

> The anemic abstractions provided in the language and the tiny stdlib means it takes a lot of work to achieve something

imtringued2y ago

C programmers like their doubly linked lists, but when you think about it, it is actually kind of a difficult problem to formalize and analyze in its full generality.

1 more reply

MrBuddyCasino2y ago

> Manual memory management being faster than GC is a function of controlling memory layout

Control over memory layout and manually allocating and freeing memory are orthogonal issues.

> forcing you to be a bit smarter about how you do things, to be less wasteful

Yes this is what I meant.

jstimpfle2y ago

Love your words. 1000 upvotes if I could.

For balance, the faster machines get, the more problems are most effectively solved by throwing the kitchen sink at them.

1 more reply

trealira2y ago

> controlling memory layout

As a result of compaction, memory allocation with garbage collection is just a pointer bump in the best case, whereas allocation with just malloc usually requires searching a free list or a tree.

[1]: https://www.cs.princeton.edu/techreports/1988/191.pdf

kuchenbecker2y ago

Cache invalidation is hard :) almost as hard as semantically naming things in a way that is clear now, and in the future.

staunton2y ago

... the famous Two Hard Things, together with off-by-one errors.

tmtvl2y ago

concurrency With 3 it's Things Hard.

1 more reply

2-718-281-8282y ago

very interesting observation. never thought about how memory is central to all those concepts and technologies.

systemBuilder2y ago

crabbone2y ago

C is neither fast nor low-level... none of these descriptors have any meaning.

It's a pointless discussion when you don't care to explain how you use the words that obviously have many related but different ways to interpret them.

pizlonator2y ago· 37 in thread

If you’re new to the language and want to understand how to use it like a pro then ignore this post - it will only confuse you and reduce your ability to use C effectively.

jerf2y ago

jstimpfle2y ago

shadowgovt2y ago

1 more reply

crabbone2y ago

> A concise syntax to define structs and functions, with a usable expression syntax. [...] I've always found it ridiculous for people to claim it's holding hardware back.

Here are some reasons why C is awful.

3 more replies

imtringued2y ago

>Well, for one, C's semantics aren't that serial, there is a large degree of freedom for compilers and CPUs how to schedule the execution of C expressions and statements.

>Even though that stuff happens in parallel, any instruction encoding is by necessity serial. Or is anyone proposing we should switch to higher-dimensional code (and address spaces)?

Uh, you know we can just encode the program as a graph? Graph reduction machines are a thing, you know.

1 more reply

circuit102y ago

“instruction encoding is by necessity serial. Or is anyone proposing we should switch to higher-dimensional code”

That is sort of a thing: https://en.m.wikipedia.org/wiki/Very_long_instruction_word

If you have multiple instructions grouped together like this you could think of it as being a 2D array of instructions

xscott2y ago

And if there isn't a good alternative, I think C (or Rust, or WASM) are a pretty good fit for what you've actually got to work with at the low level.

2 more replies

hawk_2y ago

What language(s) in your opinion have the right low-level where the access to the real machine doesn't feel foreign?

JonChesterfield2y ago

I have a pet theory that lisp macros over an assembler is the right high level language for systems programming but that hasn't made it off the whiteboard yet.

3 more replies

pjmlp2y ago

Assembly, or what ESPOL was already doing in 1961 a decade before C was even an idea, compiler intrisics.

giancarlostoro2y ago

The only one I can think of would-be Assembly, but I don't do much low-level work, I code in much higher-level languages. Genuinely curious what the answer is.

2 more replies

jerf2y ago

Per my last paragraph, I am not convinced about any of them.

(And, as the sibling comments point out, yeah, assembler technically, but that's kind of a cop out.)

3 more replies

grotorea2y ago

> It gives you low-level access to a machine that your real machine actually has to somewhat laboriously emulate

pizlonator2y ago

> Isn't C the language (x86_64) processors are designed to be fast for?

Yup

I mean they also optimize for Java and JS and .NET and probably Swift and Rust.

But C still takes precedence, I bet

kllrnohj2y ago

> Isn't C the language (x86_64) processors are designed to be fast for?

pizlonator2y ago

snvzz2y ago

>they forgot to add some instructions (like add with overflow check).

If you actually read the spec, you would have found that they didn't "forget" these.

They carefully studied them and judged the encoding space is better used elsewhere.

1 more reply

fanf22y ago

Multiprocessing. Atomics. Vectors. GPGPUs. All foreign to C when they were introduced.

1 more reply

quelsolaar2y ago

gavinhoward2y ago

C programmer and fan of yours.

I agree with you, but if you could convince WG14 to remove a lot of the stupid UB, that would be closer to the case.

(I know you're trying from your "One Word Broke C" article. Which, by the way, is putting up a server error right now.)

pif2y ago

> it is not a portable assembler

And it never was!

Just keeping this point in mind would reduce the plethora of discussions about undefined behaviour to the essential, i.e. the useful discussions, i.e. the 0.1%.

JonChesterfield2y ago

1 more reply

titzer2y ago

pizlonator2y ago

Just toss enough compiler flags at clang and make sure to occasionally use inline asm snippets to throw off the compiler's optimizations.

Then you're GTG

pizlonator2y ago

Depends on what you mean by "portable assembler". It is exactly that in a lot of ways, but exactly not that in others.

cmsonger2y ago

The author is playing a semantic game.

I don't think the author's point is that "C is not a good language for systems programming." You are not going to have an equivalent to volatile int *dma_register = SCATTER_GATHER_BASE; in Haskell.

In many ways this is a "call to programming model action" and cites GPU as illustrating the potential when "new programming model" and "silicon to support it" are done in concert.

bunderbunder2y ago

"Low-level" is a word with multiple meanings.

  1st: Machine
  2nd: Assembly
  3rd: General-purpose
  4th: Application-specific

I also like that this way we can recognize .NET IL, WebAssembly, and Java bytecode as very high-level 2nd generation languages, which, at the very least, is fun.

Oh, and Forth is a 3rd generation language. Fight me, Chuck.

fanf22y ago

hardware2win2y ago

>use it professionally

I think this post goes way way way above boringness of day2day jobs.

Yea, this post is not about how to use hammer, but more like curious consideration whether using hammers everywhere is not limiting us (C design)

lelanthran2y ago

> Yea, this post is not about how to use hammer, but more like curious consideration whether using hammers everywhere is not limiting us (C design)

Maybe it [EDIT: the post] is, but the title is obviously nowhere near accurate - if C is not a portable low-level language, what on earth is?

bayindirh2y ago

C is only portable between systems which emulate PDP-11 at hardware level and if and only if you don't use any compiler-specific extensions.

2 more replies

scythe2y ago

>if C is not a portable low-level language, what on earth is?

Which, for what it's worth, is also what I was taught in school. C was consistently described as a high-level language by my professors, even if it is "lower-level" than almost everything else.

1 more reply

rfoo2y ago

The post argues that there is no portable low-level languages, including C.

i.e. truly low-level languages can't be portable and is bound to the architecture.

1 more reply

pjmlp2y ago

Only when taking into account language extensions that are compiler specific and not part of ISO C.

Also a reminder that any language can have toolchains with extensions exposing low level features.

1 more reply

hcks2y ago

Funny how the top comment on "hacker" news is an *unsubstantial* comment about how, actually, TFA is wrong.

Even worse, adding a comment on how actually you shouldn’t be curious and understand how things really work.

1 more reply

titzer2y ago

spion2y ago

Can you elaborate?

wolframhempel2y ago· 20 in thread

Joker_vD2y ago

> exposing you to a lot of machine primitives like memory and thread management

Except it doesn't really, the standard leaves most of the really machine-dependent parts undefined; only very few things are left implementation-defined.

mytailorisrich2y ago

> Except it doesn't really, the standard leaves most of the really machine-dependent parts undefined

Well that's because it is low level and, especially, simple, and doesn't try to abstract things.

TheOtherHobbes2y ago

It's a certain kind of low level - specifically a PDP-11 kind of low level.

If your hardware is significantly different, it only looks low level. In reality plenty of mapping and conversion goes on behind the scenes - sometimes with hilarious consequences.

pornel2y ago

> and doesn't try to abstract things.

And C abstracts away almost everything about stack, stack frames, and all the complexities of memory and cache hierarchies. They are abstracted to be uniform linear address space.

mrpopo2y ago

Can you or someone expand further on that? Which platforms are trying to use segmented addressing, and what benefits does it have?

Joker_vD2y ago

CHERI project [0]. Look at figure 2.1: it's an improvement and further development of the segments of yore but the origins are quite visible.

[0] https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-941.pdf

1 more reply

agentultra2y ago

We did ok with the 8086 processors, no?

Joker_vD2y ago

Yes, juggling near and far pointers was somewhat annoying but then Intel, as a part of the 32-bit transition, modified their ISA to be a more pleasant target for C implementations.

Incidentally, C never really became popular on 6502 because, arguably, that ISA is somewhat hostile towards efficient implementations of higher-level languages.

2 more replies

robmccoll2y ago

By that do you mean exposing a non-uniform memory hierarchy as separate addressable spaces (but with coherent views from each hardware thread) or something like thread-local scratch pads?

danielvaughn2y ago

lebuffon2y ago

I would say Forth is lower level than C. The mental model is a two stack machine plus memory, rather than a PDP-11.

And it is very reasonable if you are under 50 years of age, that you haven't heard of it.

1 more reply

fiedzia2y ago

Conceptually, you can consider LLVM IR to be such language, and there are people who use it that way.

grotorea2y ago

trealira2y ago

While staying portable across architectures, probably not. But you can make a little language that's nicer to jse than assembly for a particular CPU.

Here's the PDF that outlines it: https://dl.acm.org/doi/pdf/10.1145/270941.270947

CJefferson2y ago

Also there is quite a lot in modern assembler that you can’t really get to from C, like prefetch and cache flushing instructions.

danielvaughn2y ago

That last bit is really interesting. Do newer languages make use of those features? Sorry I’m fairly ignorant of this level of the stack.

1 more reply

xigoi2y ago

> is there any language at all between C and Assembly?

LLVM or QBE, for example.

titzer2y ago

Yes, WebAssembly is higher-level than machine code but lower than C.

eesmith2y ago

1990 is calling:

clnq2y ago

You have high expectations for accuracy in an article titled "C Is Not a Low-level Language".

ndiddy2y ago· 15 in thread

danielmarkbruce2y ago

rewmie2y ago

> he's saying c's model just doesn't work well anymore.

The author argues that C's model does not fit the model he defined himself and claims to be the same model used by everyone.

After going through the article, I'm left with the impression that the author's thesis is flawed and relies on a series of strawmen arguments. Among the strawmen we find:

* arguing that speculative execution "were added to let C programmers continue to believe they were programming in a low-level language".

* claiming that "modern processors are trying to emulate "the same abstract machine as a PDP-11"

* "Creating a new thread is a library operation known to be expensive, so processors wishing to keep their execution units busy running C code rely on ILP (instruction-level parallelism)."

* etc etc etc.

I don't think this opinion piece is grounded on reality, let alone is an objective take.

danielmarkbruce2y ago

He doesn't define a model. He just discusses the gap between c's model and a few details of a modern CPU and talks about a few other models.

In your opinion, what is wrong with the statement that modern processors are trying to emulate an abstract machine like PDP-11? To me it seems largely right.

bayindirh2y ago

monocasa2y ago

PREFETCHh: Prefetch Data Into Caches

https://c9x.me/x86/html/file_module_x86_id_252.html

theamk2y ago

generic x86 does not, but SSE extension (present on all modern CPUs) does have it! and you naturally can use C to call it via intrinsics, because C is low level after all...

https://stackoverflow.com/questions/48994494/how-to-properly...

pjmlp2y ago

C# also has the same intrisics, I guess C# is low level after all...

monocasa2y ago

And it's mandated to be available on all x86-64.

bee_rider2y ago

fanf22y ago

JonChesterfield2y ago

You may like the x86 instruction called 'prefetch'.

frankreyes2y ago

Wasn't this also similar for itanium? Where the branch burden would be on the compiler?

dataflow2y ago

> One example of this is branch delay slots

That's the only example I'm aware of. Are there others? (I'm sure you could do it poorly if you wanted to, but how much history is there to extrapolate from?)

JonChesterfield2y ago

In general leaking microarch weirdness matters less if you don't have backwards compatibility.

fanf22y ago

usrnm2y ago· 15 in thread

Of course it isn't, but what's the alternative?

lproven2y ago

My favourite article about C in years.

Limbo, Occam (Occam-pi, etc.), APL (I/J, Aplus, etc.), Oberon (Oberon 2, Oberon 07, Active Oberon, Zennon)...

usrnm2y ago

I'm not familiar with these languages, but which of them is closer to the actual modern hardware than C, while still being abstract enough to be portable?

lproven2y ago

In what way did I imply that any of them were in any way closer?

That was not my intention at all.

You asked what alternatives there were. C is a systems implementation language, designed to be compiled to object code that will run on the bare metal.

I offered some examples of alternatives to that role, as I thought you asked. I did say that they explored different aspects of the problem.

As I said to someone else upthread:

It does not need to be a relative statement in order to be correct.

The statement "C is not close to the instruction set of a modern CPU" does not need to be validated by specifying examples of languages that are closer.

1 more reply

agumonkey2y ago

While I'm reading about limbo and occam, what do you think apl and oberon can express that C cannot ? talking about low level electronics benefits (apl array idioms are superb for sure)

lproven2y ago

I recommend Sophie Wilson's talk on CPU architectures for some interesting insight into this.

https://www.youtube.com/watch?v=6lOnpQgn-9s

It's worth the time, IMHO, and I dislike video presentations. This one is different.

She designed the ARM processor (and BBC BASIC before that).

pjmlp2y ago

Bounds checking by default.

Actors, more precisely active objects in Active Oberon, the only one still actively being developed at ETHZ from Oberon linage.

2 more replies

olafura2y ago

One example given in the article is Erlang VM which maps a lot better to modern processors.

We currently have a problem where we can't have thousands of cores because, even today, so much code is designed to be fast on one core.

We really have to move the asynchronous programming because synchronizing async hardware is both complex and inefficient.

RISC V is probably going to help since it allows for a lot of experimentation.

creshal2y ago

The article does mention a few areas of interest:

- Languages with "better" (=more modern hardware friendly) loop constraints are easier to parallelize (Fortran, Erlang, …)

- CPU architectures with better programmable vectorization (ARM SVE, Risc-V VE) are much easier to work with, if the language primitives allow it (see above)

In terms out "but what can I easily use as drop-in replacement?" Yeah, we're kinda stuck with C and languages that inherit its problems (current Rust/Zig/etc. included).

xet72y ago

Rust/Zig does not have enough portability, there is errors trying to compile to s390x:

https://github.com/wekan/wekan-node20#trying-to-compile-llvm...

C89 compiles to 30+ CPU/OS:

https://github.com/xet7/darkesthour

ahoka2y ago

GNU assembler, nasm, if you really need to go low level, usually you don’t.

papruapap2y ago

isnt assembler a high level language nowadays?

d_tr2y ago

1 more reply

illys2y ago

rfoo2y ago

Rewrite the world in Rust /s

The real answer is: none. There are two problems, the first is you have to rewrite the world with the new language and hardware.

The second is, unfortunately, language enthusiasts who are willing to rewrite the world AND can get job done want a language to target a sequential abstract machine (i.e. look like C).

pjmlp2y ago

C++ is serving me well staying away from C as much as possible, since 1993.

Gazoo1012y ago· 13 in thread

unblough2y ago

If I program (map inc [0 1 2 3]) is it really any more difficult to conceptualize the (inc ) function performing on each element sequentially than in parallel?

I think the difficulty of parallel programming is less innate and more two fold:

1) languages often default to sequential so to do async requires introducing additional primitives to the programmer

2) knowing when to effectively use parallel programming

When I have a list or stream that I know has independent elements that require wholly independent calculations then parallel programming is straightforward

mgaunard2y ago

Most problems are not embarrassingly parallel.

(Fun fact: I once had someone call HR on me because they didn't know embarrassingly parallel was a technical term, and they thought I was belittling them)

rfoo2y ago

Prefix scan is not embarrassingly parallel. Yet OP's statement still works when you change it to scanl (+) 0 [0 1 2 3]

1 more reply

Gazoo1012y ago

I agree that if we define the individual instructions to always be wholly independent, then sure, it is more straightforward.

While I'd probably argue that it is still more difficult to conceptualize, the statement we're discussing is presented as broad and general. I'd call it far less misleading if it said something like:

There is a common myth in software development that parallel programming *has* to be hard.

kortex2y ago

The whole reason async is even a thing is due to slow, side-effect producing operations. Of course pure functions are easy to parallelize.

I don't think folks so much "shoe horn async where it is unnecessary" as the red/blue problem causes async code in most languages to spread.

Or by "async" do you just mean concurrent code? I'm reading "async" to mean lightweight coroutines or similar.

1 more reply

wrsh072y ago

I don't think this is right. Thinking about operations on matrices is not complex. Defining how a single agent should act on its environment is not complex

When you say "without further details or specifics" you're saying "using my default framework of a c/ c descendent world"

The author's point is that sequential programming is one type of simple programming, but it's not the only type, and it doesn't map easily to modern hardware

Gazoo1012y ago

I agree that there are languages that are explicitly built to make parallel programming easy. But in general, and not just related to c or c descendant languages, parallel programming is hard.

wrsh072y ago

My point (and I think the points of others responding to you) is that parallel programming is not always hard. That's also what the author is saying.

The common myth - you're doing parallel programming? That sounds hard

It's not always hard. It really isn't! You don't need to be a genius or an expert to write parallel code.

There is a broad spectrum of parallel programs. Somebody using a web server implementation? They've made a parallel application

Somebody running tensorflow or pytorch? Also parallel! Even for simple stuff!

1 more reply

ethbr12y ago

Agreed. The potential state space in parallel processing is a lot larger, which makes it more complex, which makes it harder.

That Erlang exists and people use it successfully does not mean that harder things aren't.

fanf22y ago

danmaz742y ago

grotorea2y ago

wrsh072y ago

A factory is parallel

Or do you mean an individual can't do things in parallel?

Like.... Pushing all of those grocery carts in a long line is moving them in parallel

Or do you mean processing? Like thinking?

throwaway875432y ago· 12 in thread

Other than assembly, which barely qualifies as a language, what programming language is lower than C?

lproven2y ago

It does not need to be a relative statement in order to be correct.

The statement "C is not close to the instruction set of a modern CPU" does not need to be validated by specifying examples of languages that are closer.

danielvaughn2y ago

But if you're going to say that "C is not a low-level language", then yeah you kinda do need other languages beneath it.

lproven2y ago

Well, firstly, I'm not saying it.

But no. That is what I meant when I said this is not a relative statement.

If the title said "C is not the lowest-level language" then your objection would be valid... but it doesn't and it's not saying that.

But before I go into some lengthy explanation: have you read the article, or are you responding to the title alone?

dist1ll2y ago

In general terms, any language aiming to be lower-level than C should

- have an "abstract" machine that is more concrete than C (and by extension less portable)

- be easier to lower into optimal assembly (especially loop ops)

- give you strong and precise compile-time guarantees about memory layout (padding, bitfields), variable sizes, register spilling, stack usage, etc.

pjmlp2y ago

Plenty to chose from since 1958's introduction of JOVIAL, when one cares to research what has happened in the world of systems programming outside Bell Labs, and UNIX/C taking over the server room.

casparvitch2y ago

Forth/joy maybe?

humanrebar2y ago

LLVM IR

giaour2y ago

Fortran, maybe?

drsopp2y ago

Joker_vD2y ago

Or T3X9, if you're prefer Algol-style syntax.

Nzen2y ago

I think there is an argument that Brainfuck [0], et al, is lower than C, given that it eschews variables and functions.

[0] https://esolangs.org/wiki/Brainfuck

AlecSchueler2y ago

Low level means close to the processor, not small in scope.

You could argue brainfuck is machine language for a theroetical infinite tape machine, but such a machine can only exist when implemented in high-level software.

somat2y ago· 12 in thread

Lower level is something that maps more directly to machine operation (assembly, maybe forth).

Higher level is something that has it's own semantics of operation and need to be converted to into the machine operation, the more conversion the higher the level.

woodruffw2y ago

philipov2y ago

coldtea2y ago

Huh? Tons of languages at levels "higher than C" existed at that time C was created, and they were popular too.

LISP (1960), Smalltalk (1972), BASIC (1963), FORTRAN (1957), COBOL (1959) and countless others. Heck, ALGOL (1958, 1968) was much higher level than C too.

pjmlp2y ago

Yes it did, outside Bell Labs.

noirscape2y ago

It's also that the definition of high vs low level has shifted in the past decade.

adamrezich2y ago

> in C you basically have to be really meticulous about free()-ing memory

only if you malloc()/free() for every allocation/deallocation. if you use any other allocation strategy then this is never an issue.

for example: see the "Rewriting the memory management" section in this article: https://phoboslab.org/log/2023/08/rewriting-wipeout

> Instead, there's a fixed size statically allocated uint8_t hunk[MEM_HUNK_BYTES]; of 4mb that is used from both sides:

> Temporary objects are not allowed to persist over multiple frame. So each frame ends with a check to ensure that the temp allocator is empty.

> Somewhat related, the OpenGL renderer does the same with the textures: It bumps up texture memory (more precisely space in the texture atlas) and resets it to the previous level when a race ends.

Alifatisk2y ago

> I don't think it was ever claimed that C was a low level language.

When I was introduced to C during high school, my teacher presented C as a low-level language compared to what we previously studied (which was Ruby).

denton-scratch2y ago

> I don't think it was ever claimed that C was a low level language.

It was introduced to me as "glorified PDP11 Assembly Language". So the claim has been made at least once.

Granted, there are people here commenting that maybe assembly language is not "low-level". I'm lost for words.

Normal_gaussian2y ago

Im curious - what is it about forth that makes you consider it to map more closely to the machine?

I've done a handful of forth projects as part of a code-dojo years ago. I wouldn't have considered it low-level.

somat2y ago

Note that I am not far in the forth rabbit hole at all, any interest I may show is incidental, a side effect of my interest in postscript, which is very much a high level language.

creshal2y ago

domust2y ago

It's fortran, not forth.

1 more reply

abainbridge2y ago· 10 in thread

This has nothing to do with C. The hardware insists on this abstraction. And its a good job too, otherwise your programs would stop working when moved to a machine with different cache.

semiquaver2y ago

The article argues that the hardware’s insistence on this abstraction is in large part _because_ of C’s dominance.

mgaunard2y ago

If only that were true. Lots of languages that have nothing to do with C also did it. It's just much easier to program with a unified memory model, that's all there is to it.

Joker_vD2y ago

2 more replies

creshal2y ago

1 more reply

abainbridge2y ago

gpderetta2y ago

Hence the observation that every architecture eventually converges to NUMAcc if physically possible.

nitwit0052y ago

People have written C code that dealt with more complex memory schemes.

The language matters less than the fact that there's a lot of existing code around. That code needs to keep working.

fanf22y ago

Yeah. A lot of the things that make C not low level in the terms of this article happened on IBM mainframes decades before x86:

* tiered memory hierarchy pretending to be flat RAM

* CPUs that are much bigger than the ISA suggests, and which have out-of-order and speculative execution so code can make good use of their resources

* optimizing compilers that further decouple the program as written from its execution

IBM was working on this stuff in the 1970s, well before the rise of C. It’s fair to criticize the model and seek out alternatives, but it isn’t fair to blame C.

JonChesterfield2y ago

Flat memory is a bad thing for performance. Especially cache-coherent flat memory. It is convenient for programmers.

abainbridge2y ago

thatjoeoverthr2y ago· 8 in thread

Reminds me of VLIW. As per Wikipedia, from the Itanium page:

If your CPU exposed the single-stream parallelism at the interface, you can do it at compile-time or even decide it with in-line assembler.

I wonder if it hasn't caught due strictly to the business dynamics of the industry, or are there technical reasons this isn't really a good strategy?

Joker_vD2y ago

[0] https://yarchive.net/comp/linux/x86.html

fanf22y ago

JonChesterfield2y ago

gpderetta2y ago

With GPUs of course things have changed and in fact GPUs relied more on static scheduling, but even there as they expand to more varied loads, they are acquiring more dynamism.

fanf22y ago

See my other comment for why VLIW was technically flawed

https://news.ycombinator.com/context?id=37900987

thatjoeoverthr2y ago

Im reading that TeraScale (AMD) works this way. Itanium is a major attempt to ship it in a CPU. I guess AMD64 and ARM rule the day but maybe in the future we'll see it again.

JonChesterfield2y ago

Terascale was a vliw, worked well as far as I know. The current amdgpu architectures aren't - those are multiple execution port systems, reminiscent of the x64 setup.

Qualcomms' Hexagon is a vliw, I think that's contemporary. Graphcore's IPU is two instructions per word.

lizknope2y ago

Are you asking why VLIW hasn't caught on? There are DSPs that use VLIW concepts. But for general purpose computing look at Itanium and it's failure.

Waterluvian2y ago· 3 in thread

My sense is that this is really a communication issue (when is it not?)

On a relative scale, C is very low level compared to how we program today if you think about levels of abstraction.

If “low level” means “runs on the CPU almost literally as written.” then no it’s not.

fweimer2y ago

Ada has System.Address to model machine addresses:

http://ada-auth.org/standards/rm12_w_tc1/html/RM-13-7.html#p...

C++ has std::less specializations for pointer types which provide a strict total order (one aspect of machine addresses):

https://en.cppreference.com/w/cpp/utility/functional/less

There is also placement new and std::launder for more explicit control of typed memory:

https://en.cppreference.com/w/cpp/language/new https://en.cppreference.com/w/cpp/utility/launder

These days, even Java tries to model machine addresses:

https://docs.oracle.com/en/java/javase/21/core/foreign-funct...

kortex2y ago

Yep, this is a linguistic problem, not a technical one. "C is not a low level language" implies that the hi/lo boundary lies below C. What's below C? IR, Asm, and opcodes.

IRs like LLVMIR and various bytecodes. Well, those don't map to the hardware 1:1, not even close. So IR must be HLL.

Sure Asm has to be architecture specific, but even then we are getting pretty good at transpilation. And those codes get translated to opcodes anyways on most modern chips.

falcrist2y ago

> If “low level” means “runs on the CPU almost literally as written.” then no it’s not.

But doesn't this still depend on what CPU you're talking about? Your C code will map much more closely to the instructions of the machine code of an 8051 or even an M4 than it will to an x86.

Thus any general-purpose language is more or less "low level" depending on the CPU it's running on. This seems like a poor definition.

PhilipRoman2y ago· 2 in thread

>take a property described by a multidimensional value

>project it into a single dimension

>split it in the middle, thus inventing two useless artificial categories ("low level", "high level")

>get a bunch of highly functioning hackernews 0.1xers to argue endlessly about said useless categories

>submit weekly articles "thing X is NOT in my imaginary category Y!!!"

>profit

Arguing whether or not C is a low level language is about as useful as arguing whether dog-headed men have souls

Next up: IO is not a Monad, x86 machine code is not a low level language, RISC-V is not actually RISC, GPL is not actually open source and so on

ad404b8a372f2b92y ago

PhilipRoman2y ago

I see that point, but I still think the article is completely wrong.

Or maybe take TSX as an example - I'm fairly sure the PDP-11 did not have anything remotely close to Intel TSX and yet it is easy to use in C. Include <magic.h>, write __magicXYZ() and it just works.

HarHarVeryFunny2y ago· 1 in thread

If the sophistication of modern CPUs makes C no longer a "low level" language, then the same applies to assembly language .. things like out of order execution and register naming applies there too.

Still, I think the notion of C being a "low level" language is still a useful label ... if not we need to retire this designation altogether.

marcosdumay2y ago

Assembly is just a bit lazy, macro-expanded, and the computer's memory address a made up concept.

But yeah, I do agree that "low" and "high" level aren't useful names nowadays.

openasocket2y ago· 1 in thread

Phrodo_002y ago

bee_rider2y ago· 1 in thread

ori_b2y ago

It doesn't even include assembly.

bazoom422y ago· 1 in thread

In any case, at this point, low level and high level are only meaningful relative to other languages.

The article is questioning how “close to the metal” C actually is, but some of the arguments also applies to assembler, which is not that close to the metal either these days.

chefandy2y ago

bluGill2y ago· 1 in thread

toinewx2y ago

agreed

cmrdporcupine2y ago· 1 in thread

Yes, C is a set of abstractions like any other language (even assembly.) which attempt to mimic a machine of far less complexity.

Unfortunately it's also the wrong set of abstractions for the contemporary era.

That said, if you're working in low-level embedded microcontroller world, C's memory model and program structure does in fact look a lot more like those systems.

AnimalMuppet2y ago

What would you say is the right set of abstractions for the contemporary era? Especially for writing things like OSes and device drivers?

jansan2y ago· 1 in thread

https://obsolescence.wixsite.com/obsolescence/pidp-11

fsniper2y ago

Mandatory xkcd: https://xkcd.com/1053/

vivekv2y ago· 1 in thread

GTP2y ago

I think that there's just isn't any commercial incentive, as we have a ton of legacy C code and the CPU will have to run anyway in the "legacy" mode to run your OS, be it Linux, Windows or MacOS.

ori_b2y ago· 1 in thread

If you accept the premise of the article, you also need to accept that assembly is not a low level language, and that it is impossible to program any CPU currently for sale in a low level language.

The abstraction CPUs give you is more or less a fast pdp11 with some vector registers bolted on.

The implementation internally is not.

variadix2y ago

jokoon2y ago· 1 in thread

I once read that C is the new assembly, because all CPU have a C compiler.

Of course I am very scared that I will run into awful problems, but that will be fun anyways.

lebuffon2y ago

Everyone knows that CPU stands for "C" processing Unit...

:-)

wzdd2y ago

In other words, perhaps what's really out of date is not C, but the concept of a general-purpose language which is equally well-suited to any type of task.

wrsh072y ago

This is one of the most interesting programming articles I've read in a while. And it's well written and easy to read! Don't stop at the (inflammatory?) title.

* We all agree that c gives you a lot of control to write efficient sequential code

* Modern processors aren't merely sequential processors

karmakaze2y ago

To anyone debating whether C is low/not-low level language note that this discussion is at a much lower level so 'low' has a lower than common meaning.

titzer2y ago

> On a modern high-end core, the register rename engine is one of the largest consumers of die area and power.

layer82y ago

> Consider another core part of the C abstract machine's memory model: flat memory.

hooby2y ago

"Low-level" is not a perfectly well-defined technical term, and does mean (slightly) different things to different people.

I feel that the "value" of the article lies in challenging certain conceptions about C.

I feel it's always worthwhile to question such dogmatic beliefs.

titzer2y ago

falcrist2y ago

I'm also not seeing suggestions for languages that do map well to modern microprocessors.

HumblyTossed2y ago

I've been programming since the mid 80s, started with the C=64. People have been having the argument that C is low-level vs c is not since at least then.

Why do so many smart people waste their friggin' time on such nonsense?

cat_plus_plus2y ago

vonwoodson2y ago

adamrezich2y ago

intalentive2y ago

OnlyMortal2y ago

When compared to assembler, I’d agree.

I grew up with 6502 and 68k. To me, back in the early 90s, C (Mac MPW C to be precise) was an abstract assembler. The code-gen was perfectly readable.

Compared to the likes of Python, it most certainly is low-level. These types of language allow developers to rapidly get something going and not just because of the libraries.

I’d find it very hard to justify a business position where C has any other role than binding and breaking out into something more abstract. Be that Go or C++, for an example.

An argument I used to hear was “performance” from C. I’m not entirely convinced as in a higher language your algorithm may well be better as you can deal with the abstraction.

But… people make money coding C.

eigenform2y ago

phendrenad22y ago

[delayed]

hluska2y ago

assimpleaspossi2y ago

The paper talks about how C is designed for a PDP architecture and that's the problem. Is there any language that is not that way and can handle parallelism and all the things mentioned in the paper?

Yes, I do see Erlang mentioned but I don't think it was considered a solution.

charles_f2y ago

Interesting take, but I think it goes out of its way to prove the definition of low-level to be wrong, while missing that the definition it gives and claim is wrong, in itself is very flexible.

What is irrelevant? To a data-scientist, typescript is low-level. You're required to think about structure and compile stuff!

To a web developer, C# and Java are low-level because you need to think about the execution platform

To an IT developer, C and C++ are low level because you need to think about memory.

To a game developer assembly is low level because you need to think about everything.

To electronocians everything is high level. To accountants VBA in Excel is low level. To a product manager a word document with any sort of technical words is too low level.

If you need to optimize your software to the point where some CPU specific instructions are required, C is too high level because its hiding stuff that is not irrelevant.

mgaunard2y ago

mpweiher2y ago

I am really surprised that such a bad take has gotten so much airtime, almost as much as that such a gifted developer came up with it.

The factors that are mentioned in the article fall roughly into two categories:

1. The machine now works differently.

This may be true, but it does so almost entirely invisibly, and the exact same arguments given in the article apply in the same way not just to assembly language, but even to raw machine language.

2. C compilers do crazy shit now

But then again we already established that assembly is no longer a low level language...so whatever.

ngrilly2y ago

Wasn't it the idea of RISC to have a simpler CPU and push the optimization responsibility towards the programmer and the compiler?

ultra_nick2y ago

I just write English these days and have my LLM compile it to Python, so...

danielmarkbruce2y ago

On the flip side, maybe CPUs are trying to be too general purpose.

BeefyMcGhee2y ago

(2018)

mbfg2y ago

PDP-11 is a fast machine?