Threadripper 3990X: The Quest To Compile 1B Lines Of C++ On 64 Cores (opens in new tab)

(blogs.embarcadero.com)

230 pointsfmxexpress5y ago178 comments

178 comments

92 comments · 25 top-level

PragmaticPulp5y ago· 28 in thread

Fun experiment.

The more pedestrian 5950X or the now bargain 3950X are great for anyone doing a lot of compiling. With the right motherboard they even have ECC RAM support. Game changer for workstations in the $1000–$2000 range.

The more expensive Threadripper parts really shine when memory bandwidth becomes a bottleneck. In my experience, compiling code hasn’t been very memory bandwidth limited. However, some of my simulation tools don’t benefit much going from 8 to 16 cores with regular Ryzen CPUs because they’re memory constrained. Threadripper has much higher memory bandwidth.

ska5y ago

I suspect the biggest (build time) benefit to most c++ workflows and toolchains was the move to ubiquitous SSD. Prior to that in my experience excepting expensive RAID array dedicated build machines, it was really easy to build a system that would always be IO bound on builds. There of course were tricks to improve things but you still tended to hit that wall unless your CPUs were really under spec.

edit: to be clearer, I'm not thinking of dedicated build machines here (hence RAID comment) but over all impact on dev time by getting local builds a lot faster.

PragmaticPulp5y ago

SSDs help, but nothing beats core count X clock speed when compiling.

Source code files are relatively small and modern OSes are very good at caching. I ran out of SSD on my build server a while ago and had to use a mechanical HDD. To my surprise, it didn’t impact build times as much as I thought it would.

mrmuagi5y ago

I did a test a while back where I had a workstation compiling linux with SSD and one with a HDD -- it turns out all the files were cached in the memory (measely 8gb). But for general usage and user experience I would reccomend SSD without any question.

ska5y ago

Hmm. Maybe the tradeoff has changed since I last tested this (to be fair, a few years ago). But I'm also not focused on build servers especially, it's always been possible to make those reasonably fast. Unless you have a very specific sort of workflow anyway, your devs are doing way more local builds than on the server and that sped up a ton moving to SSD, in my experience anyway. YMMV of course.

2 more replies

lumost5y ago

You know I wonder how much of an impact this has had on the recent move back to statically typed and compiled languages vs. interpreted languages. I had assumed most of the compilation speedups were due to enhancements to the compiler toolchain - but my local laptop moving from 100 IOPS to > 100k IOPS and 3GB/s throughput may have more to do with it.

rbanffy5y ago

CPUs got faster too. My MacBook pro is a lot faster than the 6yo top of the line Mac Mini.

In fact, if compile times are being limited by storage there should be some quick wins in configuration terms - building intermediates to RAM, cache warming, etc - that can enable better performance than faster storage.

trhway5y ago

i (and some teamates) actually put HDDs on some workstations as SSD just die after 2-3 years of active build on them and with modern HDDs you have practically unlimited storage while you can have only limited number of 400G builds on SSD (the org has psychological barriers to having more than 1-2Tb SSD in a machine) and the SSD start to have perf issues when at 70-80% capacity . With HDD the build time didn't change much - the machines have enough memory for the system to cache a lot (256-512G RAM).

outworlder5y ago

> i actually put HDDs on some workstations as SSD just die after 2-3 years of active build on them

That sounds very low for modern SSDs, even consumer-grade. Have you tried different vendors?

1 more reply

EricE5y ago

A way to have your cake and eat it too - check out Primocache. It's pretty inexpensive disk caching software (especially for Windows Server which is where I really leverage it!).

Pair it with an Optane for L2 cache and it will speed up normal SSD use too ;)

1 more reply

ska5y ago

Those are pretty beefy workstations, does every developer have one or are these really build servers? As I noted you could always throw money at this and end up somewhere reasonable, but it introduces workflow considerations for your devs.

1 more reply

fctorial5y ago

What are the signs of an SSD that's about to die?

2 more replies

fctorial5y ago

Don't SSDs have a finite TBW? 50GB of writes everyday (possible on large projects) will consume that in a couple of months.

nickjj5y ago

I've had a Crucial 256GB SSD (MX100) since early 2015 and I use it with Windows 10. WSL 2's file system is on there along with Docker, which I've been using full time since then. That means all of my source code, installing dependencies, building Docker images, etc. is done on the SSD.

The SMART stats of the drive says it's at 88% health out of 100%, AKA it'll be dead when it reaches 0%. This is the wear and tear on the drive after ~6 years of full time usage on my primary all around dev / video creating / gaming workstation. It's been powered on 112 times for a grand total of 53,152 running hours and I've written 31TB total to it. 53,152 hours is 2,214 days or a little over 6 years. I keep my workstation on all the time short of power outages that drain my UPS or if I leave my place for days.

Here's a screenshot of all of the SMART stats: https://twitter.com/nickjanetakis/status/1357127351772012544

I go out of my way to save large files (videos) and other media (games, etc.) on a HDD but generally most apps are installed on the SSD and I don't really think about the writes.

1 more reply

bluedino5y ago

Something like the Samsung EVO 960 (typical mid-range SSD) will take 400TB of writes in it's lifetime. So that's 8,000 days of 50GB writes.

2 more replies

magicalhippo5y ago

The larger your SSD the more flash cells you have, so the more data you can write to it before it fails.

You can see this from the warranty for example, which for the Samsung 970 EVO[1] goes linearly from 150TBW for the 250GB model up to 1200TBW for the 2000GB model.

So if you take the 1000GB model with its 600TBW warranty, you can write 50BG of data per day for over 32 years before you're exhausted the drive write warranty.

[1]: https://www.samsung.com/semiconductor/minisite/ssd/product/c... (under "MORE SPECS")

frizkie5y ago

They do but it's really large. The Tech Report did an endurance test on SSDs 5-6 years ago [0]. The tests took 18 months before all 6 SSDs were dead.

Generally you're looking at hundreds of terabytes, if not more than a petabyte in total write capacity before the drive is unusable.

This is for older drives (~6 years old as I said), and I don't know enough about storage technology and where it's come since then to say, but I imagine things probably have not gotten worse.

[0]: https://techreport.com/review/27909/the-ssd-endurance-experi...

1 more reply

dekhn5y ago

I've swapped hundreds of terabytes to a terabyte SSD (off the shelf cheapie) with no recognizable problems (the gigapixel panoramas look fine).

1 more reply

KingOfCoders5y ago

We had ultra fast HDDs as developers with sound proof housings because they were so loud. Glad for SSDs.

masklinn5y ago

> The more expensive Threadripper parts really shine when memory bandwidth becomes a bottleneck.

Threadripper can be useful for IO, especially for C++ (which is famously quite IO intensive) owing to its 128 PCIe lanes, you can RAID0 a bunch of drives and have absolutely ridiculous IO.

bmurphy19765y ago

Where can you get decent ECC ram for a reasonable price? I was on the hunt recently for ECC RAM for my new desktop and I gave up and pulled the trigger on low latency non-ECC RAM. Availability seems to be pretty terrible at the moment.

558734452161115y ago

You can get ECC UDIMMs from Supermicro. They are rebranded Micron DIMMs. ECC memory is not going to go as high of frequencies as you might be looking for. They will only go up to the officially validated speed of the CPUs. https://store.supermicro.com/16gb-ddr4-mem-dr416l-cv02-eu26....

amarshall5y ago

The rated speeds are not as high, but ECC memory can be overclocked just as non-ECC; memory overclock support is mostly up to the motherboard. I have some DDR4-2666 ECC overclocked to 3200 MHz with slightly tighter timings on TRX40.

1 more reply

account425y ago

> They will only go up to the officially validated speed of the CPUs.

You won't be able to buy ECC ram marketed for speeds beyond JEDEC standards but that does not mean that you cannot clock them higher.

bmurphy19765y ago

Oh thanks, I didn't even think to check Supermicro! The prices are reasonable as well.

mng25y ago

Kingston has some, I got a couple of KSM32ED8/16HD recently. They are 3200 CL22, though they probably have some room to tighten up timings.

m4635y ago

same for everything. 5950? where would you even get one?

wait - seems you can get one, just pay 2x list price.

2 more replies

PartiallyTyped5y ago

IIRC AMD has EEC support in all x70/x50 motherboards and cpu combinations. If I may, what kind of simulations are you running?

I am trying to build a system for Reinforcement Learning research and seeing many things depend on python, I am not certain how to best optimise the system.

peferron5y ago

Yep, with permanent WFH due to the pandemic I started working on a desktop with 5950X + 64 GB memory and it's been a huge upgrade over my work laptop (and probably any laptop available at the moment).

It's much quieter under load as well.

ahepp5y ago· 9 in thread

>C++Builder with TwineCompile is a powerful productivity solution for multi-core machines compiling 1 million lines of code very quickly and can work better than the MAKE/GCC parallel compilation Jobs feature due to it’s deep IDE integration

You're claiming this plugin has deeper IDE integration than `make`? I find that really, really difficult to believe. And if it's true, it seems like the solution is to either use a better IDE, or improve IDE support for the de facto standard tools that already exist, as opposed to writing a plugin for the -j flag.

fmxexpressOP5y ago

Yes, it could be as simple as having Dev-C++ run a build every time a file is saved. Currently it does not do this. Remember, Dev-C++ didn't have -j support at all until I added it. TwineCompile does do this (background compile). Therefore the IDE is providing this functionality and has nothing really do to with make or the compiler.

TwineCompile is not a plugin wrapping the -j flag. It is a separate thing entirely unique to C++Builder. It does offer integration with MSBuild though.

The second part of that was the fall off. With the 1 million size files it only ever used half of the cores and each successive round of core compiles it would use even less cores. TwineCompile didn't seem to have that problem but this post was not about TwineCompile vs. MAKE -j so I did not investigate this farther.

I was expecting MAKE/GCC to blow me away and use all 64 cores full bore until complete and it did not do this.

klodolph5y ago

Make forces you to choose between being able to do full parallel builds or using recursive make, you can’t do both.

StillBored5y ago

I might be misunderstanding something, but the common gnumake does no such thing.

https://www.gnu.org/software/make/manual/html_node/Job-Slots...

klodolph5y ago

I didn’t go into great detail of the reasons, but the jobserver doesn’t address the problem except for the most trivial cases—the core problem is that you can’t cross to a recursive make invocation via multiple edges.

This is fairly common in larger projects, so you end up having to do some hackery to manually sequence make invocations if you want to use recursive make (which is pretty awful).

Honestly, for large projects, Make is an insane choice, notwithstanding the fact that people who are sufficiently savvy at Make can sometimes make it work. (If your tools are bad, you can make up for it with extra staff time and expertise.)

3 more replies

ChuckMcM5y ago

It may be a Windows thing. I too started reading the post and was thinking, why not -j128 or -j64 depending on if HT was on and then realized that the author's system wasn't one that had been tuned for decades to build software as quickly as possible :-).

It would be an interesting academic exercise to create a setup on Linux that did the same thing but using the extant build tools, and then to do a deep dive into how much of the computer was actually compiling code and how much of it was running the machinery of the OS/IDE/Linkages. A LONG time ago we used to do that at Sun to profile the "efficiency" of a machine in order to maximize the work it did for the user per unit time, vs other things. That sort of activity however became less of a priority as the CPUs literally became 1000x faster.

1 more reply

zajio1am5y ago

Why would you do recursive make? That is setup discouraged for decades ...

ahepp5y ago

(for reference: "Recursive Make Considered Harmful", https://accu.org/journals/overload/14/71/miller_2004/)

Too5y ago

Because you included a third-party project into your build which is not compatible with your build system and you don't want to rewrite all their ninjafiles into makefiles to match.

Someone should really formalize a standard for declaring dependencies which build systems can share between each other.

1 more reply

klodolph5y ago

Because it can be hard to maintain non-recursive make systems for large projects. Just because recursive make is discouraged does not mean that the alternative is without its own drawbacks.

renewiltord5y ago· 8 in thread

Hahaha, fuck me, CPUs are fast. That's wicked. 15 mins. A billion lines of C. Insane. Wonder if there's some IO speed to be gained from ramdisking the inputs.

titzer5y ago

17,300 lines/sec per core. That's embarrassingly slow IMHO.

bjoli5y ago

That depends completely on what optimizations are being done.

But alas, I have said for some time that a fast compiler should be able to compile about 1MLOC/S with some basic optimization work.

Macha5y ago

It is pointed out that the threadripper does worse per core when under full load than even high core count consumer CPUs like the 3950x/5950x. That's the tradeoff you make for huge core count CPUs. 4x 3950x might do better, but then you need to build 3 other PCs, and for actual processing tasks, co-ordinate stuff to run across multiple systems.

bserge5y ago

What can perform better?

titzer5y ago

Lots of single-pass compilers can achieve 1MLOC/s. But the main problem is that C++ has an O(n^2) compilation model due to header explosion. Also, expanding C++ templates is very computationally intensive.

viktorcode5y ago

Other languages do perform better

3 more replies

mhh__5y ago

I can build the D compiler (500k lines?) Warts and all in a second in my machine - and that's code that's not particularly complicated, but not at all optimized for compile times realistically.

1 more reply

josephg5y ago

Go, Jai, V8 (weirdly), some hobbyist C compilers.

1 more reply

formerly_proven5y ago· 7 in thread

[Not "real" C++ code, benchmark is for compiling 14492754 copies of a fairly simple C function]

blt5y ago

Yeah, I would say this title is a little misleading. The example doesn't use any of the C++ features that cause long compile times, like templates and the STL.

jandrese5y ago

Seems like he tried more complex examples too, but ran into roadblocks like a 2GB limit on executables and running into a commandline length limit restriction that dates back to early DOS days which made it impossible to link.

Both of those problems seemed solvable if he was willing to chunk up his application into libraries, maybe 1024 files per library then linked to the main application.

simcop23875y ago

I believe this is one of the reasons for object libraries (or archives, the foo.a files on linux/unix), you can then link in all of the object files from one of those at link time without having to list them all at once. That won't get past the 2GB limit on executables but it will get past the command line length.

com2kid5y ago

This is correct, a .lib file on Windows has a bunch of .obj files in it that you can then link together.

You can also use command files[1] to pass options in instead of using the command line.

[1] https://docs.microsoft.com/en-us/cpp/build/reference/linking... to pass

2 more replies

jandrese5y ago

I was going to suggest that, but he's running on Windows and I don't know if they are supported there. I guess they probably are since they're a compiler feature.

account425y ago

> a commandline length limit restriction that dates back to early DOS days which made it impossible to link.

MinGW's linker supports passing the list of objects as a file for this reason and CMake will use that by default.

jpaul235y ago

Does there exist some kind of random C code generator?

bullen5y ago· 5 in thread

In my experience multi-core compilation does not work.

make -j>3 just locks the process and fails.

jcelerier5y ago

You just need more ram. I 'ever compile at less than -j$(ncpu). Hard with less than 32 GB tho - a single clang instance can easily eat upwards of 1gb of ram

bullen5y ago

Aha, I only compile on ARM so I got no room to increase RAM...

Is it the same with g++? I have 4GB so I should be able to compile with 4 cores, but the processes only fill 2-3 cores even when I try make -j8 on a 8 core machine and then locks the entire OS until it craps out?!

Something is fishy...

ahepp5y ago

Why are you compiling on ARM with only 4GB RAM? Wouldn't it make more sense to cross compile from a machine with more resources, if you cared about build speed? (maybe there's a good reason not to do that, idk)

If it's crapping out when you give it -j8, that seems to strongly suggest you're running into limited resources somewhere.

I'm no expert in the intricacies of parallel builds, but as far as I know you can still have dependencies between targets that will limit parallelism.

pirocks5y ago

This is just a low end/uncommon hardware problem. I typically do make -j16 on a 4 core x86 system and it just works. You are probably running out of ram and the swapping resulting in that instability.

1 more reply

drmpeg5y ago

A swap file is your friend.

1 more reply

coliveira5y ago· 4 in thread

It is a good thing that Embarcadero is keeping alive this technology to create desktop apps from the early 2000s that was abandoned by MS and other large companies in favor of complex Web-based apps.

dvfjsdhgfv5y ago

If only they had made Delphi Community Edition available a decade earlier...

cosmotic5y ago

Someone has to build the native electron wrapper

dvfjsdhgfv5y ago

If you mean wrapping native widgets, this wouldn't solve much - you would still need some language to take care of the logic, like a JavaScript engine. At this point just using Electron is simply easier for devs, and as much as we hate it, realistically speaking it's still better than nothing.

phendrenad25y ago

Qt is still around and doing well. Some people still need desktop apps.

trhway5y ago· 2 in thread

Lucky sons of gun. We are stuck with Xeons. Have to wait 3 hours for our 20M C/C++ on the 2x14cores Xeon machine after a pull/rebase. Ryzen/TR would probably be faster 2-3x times for the same money, yet it is a BigCo, so no such luck (and our product is certified only for Xeons, so our customers can't run AMD too - thus we're de-facto part of the Great Enterprise Wall blocking AMD from on-premise datacenter).

maccard5y ago

I upgraded from 2x 12 core xeons to a 64 core thread ripper - compile times dropped from 45m to 12m

AshamedCaptain5y ago

Industrial software will always grow in size to use all available compilation time ... I have seen large Xeon distcc farms and the total build walltime was still measured in hours...

gm5y ago· 1 in thread

That article mentioned Delphi and Object Pascal, and it brought back many fond memories. I absolutely LOVED Delphi and Object Pascal back in the day. So clean and so fun to program in. If Borland hadn't f-ed it up and had stayed around until now, I'd be the biggest Delphi fanboy.

Alas, that was not to be. Modern languages are fun and all, but not Delphi-back-in-the-day level fun :-).

nick__m5y ago

2 actively maintained version of Delphi still exist, the original one maintained by Embarcadero, and an open-source one available at https://www.lazarus-ide.org/ .

barkingcat5y ago· 1 in thread

there's something much easier to bring 64 cores to its knees - chromium takes a loooong time to compile.

mrlonglong5y ago

Two and half hours on my trusty Threadripper 2920x. Firefox only takes 20 mins.

andy_ppp5y ago· 1 in thread

Does anyone have reviews of this on their JS test suite. The quicker the tests run the better my life, I have around 2000 quite slow tests... 76s MacBook 15” 2016, 30s M1 Apple Silicon Mac Mini, what should I expect with loads more cores like this?

nevi-me5y ago

How parallel do the tests run? The Threadrippers have massive number of cores, but their per-core performance is lower than say a Ryzen 9.

Yuioup5y ago· 1 in thread

Embarcadero? Are they still around?

colejohnson665y ago

They’re still selling Delphi, for what it’s worth

dboat5y ago

After liking this article, I wanted to check out others on the site, and am shocked at the terrible usability of their front page. I can't finish reading the titles of their articles before the page just keeps moving things around on me. It is so frustrating, which is unfortunate because I would otherwise have been interested to see more of their content. Experience completely ruined by awful design judgment.

peter_d_sherman5y ago

This seems to be a little bit related to this quest for fast compilation:

The "mold" linker:

https://github.com/rui314/mold

>"Concretely speaking, I wanted to use the linker to link a Chromium executable with full debug info (~2 GiB in size) just in 1 second. LLVM's lld, the fastest open-source linker which I originally created a few years ago, takes about 12 seconds to link Chromium on my machine. So the goal is 12x performance bump over lld. Compared to GNU gold, it's more than 50x."

robinei5y ago

This shows that if you are making a not-very-fast compiler (most compilers these days), then the much maligned C compilation model has some serious advantages on modern and future hardware, due to its embarrassingly parallell nature.

ianhanschen5y ago

Great read. I wonder if the make -j modification wasn’t scaling things across all cores because it was using the physical core count (number of cores) versus the logical core count (number of core threads).

Or perhaps the code wasn’t modified to spread the work across all processor core groups (a Windows thing to support more than 64 logical cores).

https://bitsum.com/general/the-64-core-threshold-processor-g...

dboreham5y ago

They finally got around to reusing mainframe model numbers.

Tade05y ago

The images remind me of "Bad Apple!" as displayed on a CPU load graph of a 896 core machine:

https://youtu.be/RY5_gutA_Vw

tester7565y ago

Just try to compile LLVM - maybe not 1b of LoC, but that's definitely going to be challenging

Daho0n5y ago

Great article but for the love of god don't use Passmark. They are extremely bad on AMD scores. Now this is luckily two CPU's from AMD so it isn't bad but it is a bad comparison site as they heavily favour Intel.

zelly5y ago

On Linux I would just use Bazel. It can burn through 1B lines of code on all cores.

muststopmyths5y ago

Interesting. It would be cool to compare this against Visual Studio + Incredibuild, in my experience the most solid distributed C++ compilation tool.

solinent5y ago

> 1B Lines of C++

Seems like our code is inflating quite rapidly. I remember when 1M was the biggest project. /snark

throwaway815235y ago

How many times are they going to repeat the search phrases like "one billion lines"? It's reached the point where SEO obstructs human readability. It was cool that Object Pascal (maybe a descendant of Turbo Pascal) compiled 1e9 lines of Pascal in 5 minutes on the 64 core box. Scrolling way through the article, it looks like they had enough trouble setting up their parallel Windows C++ build environment on 64 cores that they ended up running 4 instances on 16 cores each, and splitting the source files among the instances. The build then took about 15 minutes on 64 cores, which is faster than I'd have expected.

This all seems kind of pointless since distributed C++ compilation has been a thing for decades, so they could have used a cluster of Ryzens instead of "zowie look at our huge expensive single box".

czbond5y ago

1B Lines? And this is just from a "rails new" command. Had to for some levity.

einpoklum5y ago

A Billion lines, eh?

  int
  main
  ()
  {
    /* 
     _______ _     _       _               _                                                               
    |__   __| |   (_)     (_)             | |                                                              
       | |  | |__  _ ___   _ ___    __ _  | | ___  _ __   __ _   _ __  _ __ ___   __ _ _ __ __ _ _ __ ___  
       | |  | '_ \| / __| | / __|  / _` | | |/ _ \| '_ \ / _` | | '_ \| '__/ _ \ / _` | '__/ _` | '_ ` _ \ 
       | |  | | | | \__ \ | \__ \ | (_| | | | (_) | | | | (_| | | |_) | | | (_) | (_| | | | (_| | | | | | |
       |_|  |_| |_|_|___/ |_|___/  \__,_| |_|\___/|_| |_|\__, | | .__/|_|  \___/ \__, |_|  \__,_|_| |_| |_|
                                                          __/ | | |               __/ |                    
                                                         |___/  |_|              |___/                   
    */
    return 0;
  }

j / k navigate · click thread line to collapse

178 comments

92 comments · 25 top-level

PragmaticPulp5y ago· 28 in thread

Fun experiment.

ska5y ago

edit: to be clearer, I'm not thinking of dedicated build machines here (hence RAID comment) but over all impact on dev time by getting local builds a lot faster.

PragmaticPulp5y ago

SSDs help, but nothing beats core count X clock speed when compiling.

mrmuagi5y ago

ska5y ago

2 more replies

lumost5y ago

rbanffy5y ago

CPUs got faster too. My MacBook pro is a lot faster than the 6yo top of the line Mac Mini.

trhway5y ago

outworlder5y ago

> i actually put HDDs on some workstations as SSD just die after 2-3 years of active build on them

That sounds very low for modern SSDs, even consumer-grade. Have you tried different vendors?

1 more reply

EricE5y ago

A way to have your cake and eat it too - check out Primocache. It's pretty inexpensive disk caching software (especially for Windows Server which is where I really leverage it!).

Pair it with an Optane for L2 cache and it will speed up normal SSD use too ;)

1 more reply

ska5y ago

1 more reply

fctorial5y ago

What are the signs of an SSD that's about to die?

2 more replies

fctorial5y ago

Don't SSDs have a finite TBW? 50GB of writes everyday (possible on large projects) will consume that in a couple of months.

nickjj5y ago

Here's a screenshot of all of the SMART stats: https://twitter.com/nickjanetakis/status/1357127351772012544

I go out of my way to save large files (videos) and other media (games, etc.) on a HDD but generally most apps are installed on the SSD and I don't really think about the writes.

1 more reply

bluedino5y ago

Something like the Samsung EVO 960 (typical mid-range SSD) will take 400TB of writes in it's lifetime. So that's 8,000 days of 50GB writes.

2 more replies

magicalhippo5y ago

The larger your SSD the more flash cells you have, so the more data you can write to it before it fails.

You can see this from the warranty for example, which for the Samsung 970 EVO[1] goes linearly from 150TBW for the 250GB model up to 1200TBW for the 2000GB model.

So if you take the 1000GB model with its 600TBW warranty, you can write 50BG of data per day for over 32 years before you're exhausted the drive write warranty.

[1]: https://www.samsung.com/semiconductor/minisite/ssd/product/c... (under "MORE SPECS")

frizkie5y ago

They do but it's really large. The Tech Report did an endurance test on SSDs 5-6 years ago [0]. The tests took 18 months before all 6 SSDs were dead.

Generally you're looking at hundreds of terabytes, if not more than a petabyte in total write capacity before the drive is unusable.

This is for older drives (~6 years old as I said), and I don't know enough about storage technology and where it's come since then to say, but I imagine things probably have not gotten worse.

[0]: https://techreport.com/review/27909/the-ssd-endurance-experi...

1 more reply

dekhn5y ago

I've swapped hundreds of terabytes to a terabyte SSD (off the shelf cheapie) with no recognizable problems (the gigapixel panoramas look fine).

1 more reply

KingOfCoders5y ago

We had ultra fast HDDs as developers with sound proof housings because they were so loud. Glad for SSDs.

masklinn5y ago

> The more expensive Threadripper parts really shine when memory bandwidth becomes a bottleneck.

Threadripper can be useful for IO, especially for C++ (which is famously quite IO intensive) owing to its 128 PCIe lanes, you can RAID0 a bunch of drives and have absolutely ridiculous IO.

bmurphy19765y ago

558734452161115y ago

amarshall5y ago

1 more reply

account425y ago

> They will only go up to the officially validated speed of the CPUs.

You won't be able to buy ECC ram marketed for speeds beyond JEDEC standards but that does not mean that you cannot clock them higher.

bmurphy19765y ago

Oh thanks, I didn't even think to check Supermicro! The prices are reasonable as well.

mng25y ago

Kingston has some, I got a couple of KSM32ED8/16HD recently. They are 3200 CL22, though they probably have some room to tighten up timings.

m4635y ago

same for everything. 5950? where would you even get one?

wait - seems you can get one, just pay 2x list price.

2 more replies

PartiallyTyped5y ago

IIRC AMD has EEC support in all x70/x50 motherboards and cpu combinations. If I may, what kind of simulations are you running?

I am trying to build a system for Reinforcement Learning research and seeing many things depend on python, I am not certain how to best optimise the system.

peferron5y ago

It's much quieter under load as well.

ahepp5y ago· 9 in thread

fmxexpressOP5y ago

TwineCompile is not a plugin wrapping the -j flag. It is a separate thing entirely unique to C++Builder. It does offer integration with MSBuild though.

I was expecting MAKE/GCC to blow me away and use all 64 cores full bore until complete and it did not do this.

klodolph5y ago

Make forces you to choose between being able to do full parallel builds or using recursive make, you can’t do both.

StillBored5y ago

I might be misunderstanding something, but the common gnumake does no such thing.

https://www.gnu.org/software/make/manual/html_node/Job-Slots...

klodolph5y ago

This is fairly common in larger projects, so you end up having to do some hackery to manually sequence make invocations if you want to use recursive make (which is pretty awful).

3 more replies

ChuckMcM5y ago

1 more reply

zajio1am5y ago

Why would you do recursive make? That is setup discouraged for decades ...

ahepp5y ago

(for reference: "Recursive Make Considered Harmful", https://accu.org/journals/overload/14/71/miller_2004/)

Too5y ago

Because you included a third-party project into your build which is not compatible with your build system and you don't want to rewrite all their ninjafiles into makefiles to match.

Someone should really formalize a standard for declaring dependencies which build systems can share between each other.

1 more reply

klodolph5y ago

Because it can be hard to maintain non-recursive make systems for large projects. Just because recursive make is discouraged does not mean that the alternative is without its own drawbacks.

renewiltord5y ago· 8 in thread

Hahaha, fuck me, CPUs are fast. That's wicked. 15 mins. A billion lines of C. Insane. Wonder if there's some IO speed to be gained from ramdisking the inputs.

titzer5y ago

17,300 lines/sec per core. That's embarrassingly slow IMHO.

bjoli5y ago

That depends completely on what optimizations are being done.

But alas, I have said for some time that a fast compiler should be able to compile about 1MLOC/S with some basic optimization work.

Macha5y ago

bserge5y ago

What can perform better?

titzer5y ago

viktorcode5y ago

Other languages do perform better

3 more replies

mhh__5y ago

I can build the D compiler (500k lines?) Warts and all in a second in my machine - and that's code that's not particularly complicated, but not at all optimized for compile times realistically.

1 more reply

josephg5y ago

Go, Jai, V8 (weirdly), some hobbyist C compilers.

1 more reply

formerly_proven5y ago· 7 in thread

[Not "real" C++ code, benchmark is for compiling 14492754 copies of a fairly simple C function]

blt5y ago

Yeah, I would say this title is a little misleading. The example doesn't use any of the C++ features that cause long compile times, like templates and the STL.

jandrese5y ago

Both of those problems seemed solvable if he was willing to chunk up his application into libraries, maybe 1024 files per library then linked to the main application.

simcop23875y ago

com2kid5y ago

This is correct, a .lib file on Windows has a bunch of .obj files in it that you can then link together.

You can also use command files[1] to pass options in instead of using the command line.

[1] https://docs.microsoft.com/en-us/cpp/build/reference/linking... to pass

2 more replies

jandrese5y ago

I was going to suggest that, but he's running on Windows and I don't know if they are supported there. I guess they probably are since they're a compiler feature.

account425y ago

> a commandline length limit restriction that dates back to early DOS days which made it impossible to link.

MinGW's linker supports passing the list of objects as a file for this reason and CMake will use that by default.

jpaul235y ago

Does there exist some kind of random C code generator?

bullen5y ago· 5 in thread

In my experience multi-core compilation does not work.

make -j>3 just locks the process and fails.

jcelerier5y ago

You just need more ram. I 'ever compile at less than -j$(ncpu). Hard with less than 32 GB tho - a single clang instance can easily eat upwards of 1gb of ram

bullen5y ago

Aha, I only compile on ARM so I got no room to increase RAM...

Something is fishy...

ahepp5y ago

If it's crapping out when you give it -j8, that seems to strongly suggest you're running into limited resources somewhere.

I'm no expert in the intricacies of parallel builds, but as far as I know you can still have dependencies between targets that will limit parallelism.

pirocks5y ago

1 more reply

drmpeg5y ago

A swap file is your friend.

1 more reply

coliveira5y ago· 4 in thread

It is a good thing that Embarcadero is keeping alive this technology to create desktop apps from the early 2000s that was abandoned by MS and other large companies in favor of complex Web-based apps.

dvfjsdhgfv5y ago

If only they had made Delphi Community Edition available a decade earlier...

cosmotic5y ago

Someone has to build the native electron wrapper

dvfjsdhgfv5y ago

phendrenad25y ago

Qt is still around and doing well. Some people still need desktop apps.

trhway5y ago· 2 in thread

maccard5y ago

I upgraded from 2x 12 core xeons to a 64 core thread ripper - compile times dropped from 45m to 12m

AshamedCaptain5y ago

Industrial software will always grow in size to use all available compilation time ... I have seen large Xeon distcc farms and the total build walltime was still measured in hours...

gm5y ago· 1 in thread

Alas, that was not to be. Modern languages are fun and all, but not Delphi-back-in-the-day level fun :-).

nick__m5y ago

2 actively maintained version of Delphi still exist, the original one maintained by Embarcadero, and an open-source one available at https://www.lazarus-ide.org/ .

barkingcat5y ago· 1 in thread

there's something much easier to bring 64 cores to its knees - chromium takes a loooong time to compile.

mrlonglong5y ago

Two and half hours on my trusty Threadripper 2920x. Firefox only takes 20 mins.

andy_ppp5y ago· 1 in thread

nevi-me5y ago

How parallel do the tests run? The Threadrippers have massive number of cores, but their per-core performance is lower than say a Ryzen 9.

Yuioup5y ago· 1 in thread

Embarcadero? Are they still around?

colejohnson665y ago

They’re still selling Delphi, for what it’s worth

dboat5y ago

peter_d_sherman5y ago

This seems to be a little bit related to this quest for fast compilation:

The "mold" linker:

https://github.com/rui314/mold

robinei5y ago

ianhanschen5y ago

Or perhaps the code wasn’t modified to spread the work across all processor core groups (a Windows thing to support more than 64 logical cores).

https://bitsum.com/general/the-64-core-threshold-processor-g...

dboreham5y ago

They finally got around to reusing mainframe model numbers.

Tade05y ago

The images remind me of "Bad Apple!" as displayed on a CPU load graph of a 896 core machine:

https://youtu.be/RY5_gutA_Vw

tester7565y ago

Just try to compile LLVM - maybe not 1b of LoC, but that's definitely going to be challenging

Daho0n5y ago

zelly5y ago

On Linux I would just use Bazel. It can burn through 1B lines of code on all cores.

muststopmyths5y ago

Interesting. It would be cool to compare this against Visual Studio + Incredibuild, in my experience the most solid distributed C++ compilation tool.

solinent5y ago

> 1B Lines of C++

Seems like our code is inflating quite rapidly. I remember when 1M was the biggest project. /snark

throwaway815235y ago

This all seems kind of pointless since distributed C++ compilation has been a thing for decades, so they could have used a cluster of Ryzens instead of "zowie look at our huge expensive single box".

czbond5y ago

1B Lines? And this is just from a "rails new" command. Had to for some levity.

einpoklum5y ago

A Billion lines, eh?

  int
  main
  ()
  {
    /* 
     _______ _     _       _               _                                                               
    |__   __| |   (_)     (_)             | |                                                              
       | |  | |__  _ ___   _ ___    __ _  | | ___  _ __   __ _   _ __  _ __ ___   __ _ _ __ __ _ _ __ ___  
       | |  | '_ \| / __| | / __|  / _` | | |/ _ \| '_ \ / _` | | '_ \| '__/ _ \ / _` | '__/ _` | '_ ` _ \ 
       | |  | | | | \__ \ | \__ \ | (_| | | | (_) | | | | (_| | | |_) | | | (_) | (_| | | | (_| | | | | | |
       |_|  |_| |_|_|___/ |_|___/  \__,_| |_|\___/|_| |_|\__, | | .__/|_|  \___/ \__, |_|  \__,_|_| |_| |_|
                                                          __/ | | |               __/ |                    
                                                         |___/  |_|              |___/                   
    */
    return 0;
  }

j / k navigate · click thread line to collapse