If they thought Itanium was bad, they should have looked into the i860. Itanium was an attempt to fix a bunch of the i860 ideas. i860 quickly went from a supercomputer chip to a cheap DSP alternative (where it had at least the hope of hitting more than 10% of its theoretical performance).
Intel iAPX 432 was preached as the second coming back in the 80s, but failed spectacularly. The i960 was take 2 and their joint venture called BiiN also shuttered. Maybe Rekursiv would be worthy of a mention here too.
We now know that Core 2 dropped all kinds of safety features, resulting in the Meltdown vulnerabilities. It also partially explains why AMD couldn't keep up, as these shortcuts gave a big advantage (though security papers at the time predicted that Meltdown-style attacks existed due to the changes).
Rather than an "honorable mention", the Cell processor should have easily topped the list of designs they mentioned. It was terrible in the PS3 (with few games if any able to make full use of it) and it was terrible in the couple supercomputers that got stuck with it.
I'd also note that Bulldozer is also maligned more than it should be. There's a lot to like about the concept of CMT and for the price, they weren't the worst. I'd even go so far as to say that if AMD wasn't so starved for R&D money during that period, they may have been able to make it work. ARM's latest A510 shares more than a few similarities. A big/little or big/little/little CMT architecture seems like a very interesting approach to explore in the future.
As for Bulldozer, I was saddled with one for a while. Where it really fell down was (surprise!) its floating point performance. That FPU shared between two integer units makes for some "interesting" performance characteristics when trying to run multiple FP-heavy tasks, but overall, it was merely mediocre rather than terrible. I'm glad AMD hit it out of the park with Zen.
The wrong expectations and false advertising centered on the fact that the first Bulldozer was described as an 8-core CPU which would easily crush its 4-core competition from Intel (Sandy Bridge).
What the AMD bloggers forgot to mention was that the new Bulldozer cores were much weaker than the cores of the previous CPU generations: they could execute only 2 instructions per cycle, while an Intel core could execute 4 (and the previous AMD cores could execute 3). So for multi-threaded tasks a Bulldozer core had only the performance of one of the 2 threads of an Intel core, with the additional disadvantage that the resources of 2 AMD cores could not be allocated to a single thread when the second core of a module was idle.
So an 8-core Bulldozer could barely match the multi-threaded performance of a 4-core Sandy Bridge, while being much slower on single-thread tasks.
Had it been known from the beginning that the Bulldozer cores were intentionally designed to be much weaker than both the old AMD cores and the Intel cores, this would not have been a surprise, and everybody who cared more about the price/performance ratio than about absolute performance would have been happy to buy Bulldozer CPUs.
However, after many months during which AMD claimed that their supposedly 8-core CPU would be better than any other CPU with fewer cores, there was huge disappointment at the first tests after launch, which immediately revealed the pathetic performance of the new cores, which on single-threaded tasks were much slower than the previous AMD CPUs.
So all the hate was caused by the stupid actions of AMD management and marketing, who lied continuously about Bulldozer even though they should have realized it was pointless, because independent benchmarks would reveal the truth immediately after launch.
To set expectations correctly about Bulldozer vs. Sandy Bridge, what AMD called a 4-module, 8-core CPU should have been called a 4-core, 8-thread CPU, one with dynamic allocation inside a core ("module" in AMD jargon) only for the FPU, while the integer resources are allocated statically. With this correct description there would have been no surprise about Bulldozer's behavior.
Part of the hate is also due to some engineering decisions whose reasons remain a mystery even now: if you had randomly queried a thousand logic design engineers before 2011, all or almost all would have called them bad decisions, so it is hard to understand how they were proposed and approved inside the AMD design teams.
For example, from the Opteron launch in 2003 until Intel launched Sandy Bridge in 2011, the largest performance advantage of the AMD CPUs was in computations with large numbers, because the AMD CPUs could do integer multiplications much faster than the Intel CPUs.
Intel's designers recognized that this was a problem, and during the 2006-2011 interval they decreased every year the number of clock cycles required for operations like multiplication and division, so that Penryn began to approach AMD's throughput per clock cycle, Nehalem & Westmere matched it, and Sandy Bridge achieved double the throughput of the old AMD CPUs.
While Intel worked diligently to improve the performance of their cores, what did AMD do?
Someone at AMD decided, for unknown reasons, that Bulldozer did not need to keep the existing computational performance; it was deemed enough to have integer multipliers with half the previous throughput, and only a quarter of that of the Sandy Bridge competition. (Intel had announced well in advance, more than a year before launch, that Sandy Bridge would double the integer multiplication throughput over Nehalem, and it was in any case an obvious trend in the evolution of their previous cores, so the higher performance of the competition could not have been a surprise for the AMD designers.)
The downgraded integer multipliers crippled the performance of the new AMD CPUs in exactly those applications where their previous CPUs had been the best, while enabling only a negligible reduction in core area.
Nobody cuts prices more than they have to, but everyone adjusts prices to where they need to go to sell the product. Bulldozer was priced low because it was genuine garbage; it was actually slower than Phenom in a lot of cases (which blows the "it was about price to performance!" argument out of the water - nobody regresses performance on purpose).
(and before people wind up about the obvious counterexample: Ryzen was priced low because a 1800X was genuinely a lot slower than a 5960X in productivity tasks due to latency and poor AVX performance, and got completely smoked in gaming. If they had tried to go head-to-head with Intel at $1000 pricing they wouldn't have sold anything because it would have been a far inferior package to what Intel offered, they had to cut prices by around half to make it a compelling offering. And even then it was not that appealing compared to, say, a 5820K.)
Companies need to make enough of a showing to attract consumers but if a company prices something super aggressively, there's often a catch. And that's bulldozer in a nutshell. Oh shit the product sucks. What can we charge for a mediocre "8-core" (sorta) that underperforms the 4-core i7? Offer it at i5 pricing and see if anyone bites. If they had managed to achieve good performance, they would have priced it appropriately.
(the other thing is - people prefer to make the comparison about the FX-8350, but that's not Bulldozer, that's Piledriver. Bulldozer was the FX-8150/FX-6100, which actually did outright regress performance vs a Phenom X6, and was priced relatively steeply due to "8 real cores". Bulldozer went up against Sandy Bridge, Piledriver was more of an Ivy Bridge/Haswell competitor, and that's where prices really started to drop. It isn't a huge difference but Intel was making some progress too in those days.)
Price chart: https://www.anandtech.com/show/4955/the-bulldozer-review-amd...
- The Intel i432 - too far ahead of its time, an Itanium for the 1980s. https://en.wikipedia.org/wiki/Intel_iAPX_432
- The TI TMS320 series of DSPs. So full of silicon bugs it hurt TI badly.
- The Transputer T9000 - very ambitious, but vapourware for so long it killed its parent company. https://en.wikipedia.org/wiki/Transputer#T9000
Sony gave you 6 of the 8 SPE cores to use (I think they reserved two, but it's been ages). They are indeed very fast; however, they have no cache-coherent access to main RAM and only 256k of memory for each element. So, you have to meticulously write DMA scheduling code to keep them fed. If you're a simpleton like me, you double buffer your SPE memory, cutting it in half: 128k to work with, 128k for paging into, and you hope to be done paging before it's needed. Latency to memory is on the order of 2,000 cycles to first byte, but then the data arrives fast.
So, what you do is decompose your problem into data streams that can be crunched through, but in such a way that you minimize the need to randomly access much memory. It's often cheaper to recompute things locally than to fetch them from RAM. Random access into your RAM is pointless, so you have to marshal all your input into DMA buffers, do some work, marshal all your output into other DMA buffers, and send it back to the host CPU.
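Roughly, the inner loop ends up looking like the sketch below - written from memory against the Cell SDK's spu_mfcio.h MFC intrinsics, so treat the buffer size and the process() kernel as placeholders rather than anything from an actual project:

    /* Double-buffered SPE streaming sketch (from memory, not production code).
     * While one 16 KB chunk is being processed, the DMA for the next chunk is
     * already in flight, hiding the ~2,000-cycle latency to main RAM. */
    #include <spu_mfcio.h>
    #include <stdint.h>

    #define CHUNK (16 * 1024)

    static char buf[2][CHUNK] __attribute__((aligned(128)));

    extern void process(char *data, unsigned size);   /* hypothetical per-chunk kernel */

    void stream(uint64_t src_ea, unsigned nchunks)
    {
        int cur = 0;

        /* Prime the pipeline: start fetching chunk 0 (tag 0). */
        mfc_get(buf[cur], src_ea, CHUNK, cur, 0, 0);

        for (unsigned i = 0; i < nchunks; i++) {
            int next = cur ^ 1;

            /* Kick off the DMA for the next chunk before touching this one. */
            if (i + 1 < nchunks)
                mfc_get(buf[next], src_ea + (uint64_t)(i + 1) * CHUNK, CHUNK, next, 0, 0);

            /* Wait only for the current buffer's tag, then crunch it while the
             * other transfer overlaps with the work. */
            mfc_write_tag_mask(1 << cur);
            mfc_read_tag_status_all();
            process(buf[cur], CHUNK);

            cur = next;
        }
    }

Output goes back the same way: marshal results into another pair of buffers and DMA them to the host while the next input chunk is being consumed.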
Anyhow, I got this working. Meshes were being skinned at a very high rate, but it was very frustrating. The PPE was really slow, so you had to offload as much as you could to those SPEs. But hey, I may be complaining, but it sure beats dealing with the "Emotion Engine" on the PS2. I can tell you which emotion that engine brings up.
If the chip were so wonderful to work on, then it would still be in use today as the theoretical performance per area beats everything else by a wide margin.
Roadrunner was built in 2008. It would still be just barely off the top 500 list in 2021, but was decommissioned just FIVE years later in 2013. Its x86 replacement was already underway in 2010 TWO years after its launch.
I'm glad you got to work with the architecture you loved for so many years, but I think the rest of the world disagrees with your assessment.
Until today, I've never once seen someone "singing its praises" who had actually written code for one. At best, they'd curse it under their breath while admitting it had its benefits. Usually, however, it was a full-throated rant about how bad the experience was.
Cell (for example) was an asymmetric/hybrid multicore CPU; Apple Silicon is perhaps a modern example of asymmetric performance vs. efficiency cores, and also features special-purpose accelerator cores such as the neural engine.
The 432 had capability-based addressing. Speed-over-security has had a good run, but with some disastrous consequences. We may be seeing the return of capabilities with CHERI/ARM.
The 960 was an early superscalar design, supported tag bits, and was also a successful product.
"RISC instruction sets I have known and disliked."
https://www.jwhitham.org//2016/02/risc-instruction-sets-i-ha...
https://news.ycombinator.com/item?id=11607119
I might also say that Sun's UltraSPARC was constantly beaten by Fujitsu's SPARC64. It would have been better to outsource.
As for the Cell, it was an overly complex architecture that had remarkable performance under very optimized code. The hope was that hand-tuned libraries would address this and that compiler optimizations would take care of the rest. Neither happened in a meaningful way. We did two major projects with the Cell, using it for real-time HDTV compression/direct broadcast applications.
Another one not on the list was the inmos Transputer. Again, similar to the Cell: very complex and fast for its time, but not easy to extract that performance. That was my first job as an EE - we used it on a GPS receiver ISA card in the early days of GPS. It was a good choice because it was very fast and could keep up with the signal processing, which let us roll out code updates to add major features as various changes to the GPS signals arrived (P-code on L2, SA being turned off, and later the unencrypted CA code on L2). Our competitors had to redesign ASICs to get these new features, which meant long product cycles and hardware replacement.
Today I find myself doing a lot on the M1 series, as well as Epyc. Now you can give zero shits about clean optimized code and it still runs amazingly fast. Last time I had to do assembler or intrinsics was many many years ago - and I sort of miss that intimacy with the hardware to get the most out of it.
It runs rings around workstations!
Curiously, every other out-of-order chip designer except AMD also designed CPUs with Meltdown flaws. That's, per their own documentation, ARM, IBM (both POWER and mainframe), and SPARC - and I think MIPS too, but they weren't entirely clear about it.
I have an old X11 terminal that I believe has an i960 in it. I'm shocked that thing was capable of running CDE desktops, when it stutters on FVWM even over a network much faster than anything it was ever intended to see.
SGI, Compaq and HP mothballed development of their own CPUs (MIPS/Alpha/PA-RISC) as they all settled on Itanium for future products.
After Itanium turned out to be a flop, those companies adopted x86-64 - Intel killed off 3 competing ISAs by shipping a bad product.
I think a modern compiler could likely do a good job with Itanium nowadays. However, when it first came out, there simply wasn't the ability to keep those instruction bundles full. Compiler tech was too far behind to work well with the hardware.
Signetics made the 2650, a nice processor with a highly regular architecture and a condition code register. After every arithmetic operation, including loads and stores, the ALU updated the condition code register.
The National 32032 processor was a wonderful part with a clarity of design that made it a great choice for a workhorse processor. Unix running on the machine was stable and efficient, except that every few weeks there would be a disastrous crash. With a tremendous amount of effort the source of the problem was found: a race condition in the interrupt control logic that returned from the wrong stack and scribbled over memory.
The Intel i860 exposed the internal computational pipeline to the programmer. Context switching was complicated by the conflict between real-time operating performance requirements and a deep pipeline with no way to grab the context and drain the pipeline. Eventually a dedicated team got a Unix OS running on the part, but it performed poorly.
The Maspar MP-1 was a SIMD machine. It was cool to test new library functions by seeing if, say, sqrt(x)*sqrt(x)==x for all floating point numbers. Customers wanted the Maspar machine to be timeshared, but the architecture made it difficult to do since the CPU state was very large and memory was not mapped.
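These days you could brute-force the same kind of check serially - there are only four billion single-precision bit patterns - though on the MP-1 the point was that SIMD across the whole machine made it quick. A rough C sketch of the idea (illustrative only, not the MasPar code; because of rounding, sqrt(x)*sqrt(x)==x fails for many inputs, so you count mismatches rather than assert):

    /* Walk every 32-bit float bit pattern and count how often
     * sqrtf(x)*sqrtf(x) reproduces x exactly. Illustrates the "test the
     * entire input space" idea; takes a while on one core. */
    #include <math.h>
    #include <stdint.h>
    #include <stdio.h>
    #include <string.h>

    int main(void)
    {
        uint64_t exact = 0, tested = 0;
        for (uint64_t bits = 0; bits <= 0xFFFFFFFFu; bits++) {
            uint32_t b = (uint32_t)bits;
            float x;
            memcpy(&x, &b, sizeof x);      /* reinterpret the bit pattern as a float */
            if (isnan(x) || x < 0.0f)
                continue;                  /* sqrt isn't comparable for these */
            tested++;
            if (sqrtf(x) * sqrtf(x) == x)
                exact++;
        }
        printf("%llu of %llu non-negative floats round-trip exactly\n",
               (unsigned long long)exact, (unsigned long long)tested);
        return 0;
    }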
Intel's 8048 (and simplified versions like the 8021 and enhanced versions like the 8051) did not perform as well in terms of speed or code size as many of the competing microcontrollers. The competition offered very simple, asymmetric architectures which could be programmed (possibly with external hardware assists) to accomplish embedded tasks, but only with significant effort over several days or weeks. The Intel part was not quite as efficient in memory use and speed, but could be programmed in an afternoon. And another engineer/programmer could look at the code and understand it without much deep thought.
The Motorola 68000 was a wonderful machine with a clear instruction set. But the original 68000 could not support virtual memory.
There have been all sorts of different architectures tried which seem strange today but came about because the architecture was thought to provide an engineering solution to an immediate problem. There was a time when register machines were thought to be a bad architecture, far inferior to a simple stack architecture.
I know Intel wanted Itanium to succeed for the same reasons, but the PIV came very close to home since it actually shipped for consumers. Oddly enough, Extreme Tech was a huge shill for Intel back in those days. Funny they don't mention that in this article.
It's a nifty little CPU. There's a lot of hidden little features once you dig in. It can actually address multiple separate 64k memory namespaces: data memory, instruction memory, macroinstruction memory, and mapped memory with the assistance of a then-standard chip. Normally these are all the same space and just need external logic to differentiate them. There's also a completely separate serial and parallel hardware interface bus.
The macroinstruction ("Macrostore") feature is pretty fun. There are sets of opcodes that decode as illegal instructions but, instead of immediately erroring out, go looking for a new PC and workspace pointer (the "registers") in memory and jump there. Their commercial systems like the 990/12 used this feature to add floating point and other features like stack operations.
Yup, there's no stack. Just the 16 "registers," which live in main memory. There are specific branch and return instructions that store the previous PC and register pointer in the top registers of the new "workspace," allowing you direct access to the context of the caller. The assembly language is simple and straightforward with few surprises, but it's also clearly an abstraction over the underlying mechanisms of the CPU. I believe this then classifies this CPU as CISC incarnate.
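If it helps, here's a toy C model of what that workspace mechanism does (BLWP/RTWP on the 9900, if memory serves; the names below are invented and this is purely a mental model, not emulator code): the call fetches a new WP and PC from a vector and drops the caller's WP/PC/ST into R13-R15 of the new workspace, and the return walks back out.

    /* Toy C model of the TMS9900 workspace idea described above. */
    #include <stdint.h>

    typedef struct {
        uint16_t wp;   /* workspace pointer: address of 16 words of RAM acting as R0-R15 */
        uint16_t pc;
        uint16_t st;   /* status register */
    } cpu_t;

    static uint16_t ram[32768];   /* 64 KB of memory, indexed by word here for simplicity */

    /* BLWP @vec: vec holds a new WP and PC; the old WP/PC/ST land in the new
     * workspace's R13/R14/R15, so the callee can reach straight into the
     * caller's "registers" through the saved WP. */
    static void blwp(cpu_t *cpu, uint16_t vec)
    {
        uint16_t new_wp = ram[vec / 2];
        uint16_t new_pc = ram[vec / 2 + 1];

        ram[new_wp / 2 + 13] = cpu->wp;   /* R13 = old WP */
        ram[new_wp / 2 + 14] = cpu->pc;   /* R14 = old PC */
        ram[new_wp / 2 + 15] = cpu->st;   /* R15 = old ST */

        cpu->wp = new_wp;
        cpu->pc = new_pc;
    }

    /* RTWP: restore the caller's context from R13-R15 of the current workspace. */
    static void rtwp(cpu_t *cpu)
    {
        cpu->st = ram[cpu->wp / 2 + 15];
        cpu->pc = ram[cpu->wp / 2 + 14];
        cpu->wp = ram[cpu->wp / 2 + 13];
    }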
There are some brilliant and insane people on the Atari Age forums! One of them managed to extract and post the data for a subset of those floating point instructions, and then broke down how it all worked. Some are building new generations of previous TMS9900 systems. One of them is replicating the CPU in an FPGA. A few others are building things like a full-featured text editor and, of course, an operating system.
I've learned a hell of a lot during this project. I've been documenting what I'm doing and am planning to eventually make it into a pretty build log. I think this is a beautiful dead platform that deserved better.
So, first, it generally had a higher IPC than anything else available (ignoring the P6). So the smart marketing people at Cyrix decided they were going to sell it based on a PR rating, which was the average performance on a number of benchmarks vs a similar Pentium. AKA a Cyrix PR166 (clocked at 133 MHz) was roughly the same perf as a 166 MHz Pentium. Now, had they actually been selling it for an MSRP similar to a Pentium 166 that might have seemed a bit shady, but they were selling it closer to the price of a Pentium 75/90.
Then along comes Quake, which is hand-optimized for the Pentium's U/V pipeline architecture and happens to use floating point too. And since a number of people had pointed out that the 6x86's floating point perf was closer to its actual clock speed than to its "PR" rating, suddenly you have a chip performing at much less than its PR rating, and certain people then proceeded to bring up, at every chance they got, the fact that in Quake it was more like a 90 MHz Pentium than a 166 MHz Pentium (something I'm sure made, say, Intel really happy).
So, yah, here we are 20 years later putting a chip with what was generally a higher IPC than its competitors on a "shit" list mostly because of one benchmark. While hopefully all being aware that these shenanigans continue to this day: a certain company will be more than happy to cherry-pick a benchmark and talk up their product while ignoring all the benchmarks that make it look worse.
Now, as far as motherboard compatibility goes, that was true to a certain extent if you didn't bother to ensure your motherboard was certified for the higher bus rates required by the Cyrix; the other issue was that it tended to require more sustained current than the Intel chips the motherboards were initially designed for. So, yah, the large print said "compatible with Socket 7," the fine print later added that boards needed to be qualified, and the whole thing paved the way for the Super Socket 7 specs which AMD made use of. And of course lots of people didn't put large enough heatsinks/fans on them, which they needed to be stable.
So, people are shitting on a product that gets a bad rep because they were mostly ignorant of what we have all come to accept as normal business when you're talking about differing microarchitectural implementations.
PS: Proud owner of a 6x86 that cost me about the same as a Pentium 75, and not once do I think it actually performed worse than that, while for the most part (compiling code, and running everything else including Unreal) it was significantly better than my roommate's Pentium 75.
IIRC the official excuse when this became public was that an MS engineer turned it off because one of their test machines couldn't complete a stress test with it enabled, but later it turned out the root cause was a bad motherboard. The curious part is that it didn't result in MS immediately issuing a hotfix to turn the cache back on.
edit: found one of the articles mentioning this. https://www.tomshardware.com/reviews/bananas,9.html
Apparently it was just write-back mode that got disabled; either way, that link mentions a 30% perf hit.
The common thread was Intel marketing pushing something that was a dog for marketing reasons
1. It is very amazing, and not in a good way, when you think you have enough inventory but someone from HQ calls up the warehouse and has the older CPUs crushed by a bulldozer (you don't want to throw them out, they are quite usable)
2. It was amazing that sucker ran so hot that tech support got a call about test boxes catching on fire
That didn't last long. Like what, one generation?
Good.
(saying that, but I remember purchasing a dual Pentium II motherboard for two 400 MHz CPUs to speed up 3D Studio 4 renderings under Windows NT4... xD)
Cache was still external at that point. There would be performance benefits from bringing it on-die, but larger chips are more expensive to make, and using two smaller dies (one for the CPU and one for cache, like the Pentium Pro) is still quite expensive.
The middle ground was to put the CPU and cache on a single PCB, so you end up with a cartridge form factor. By the time the next generation rolled around it was possible to put the CPU and cache on the same die at a reasonable cost (Moore's law), making the cartridge form factor obsolete.
I thought it was cool at the time, made me think of a NES cartridge.
(Single use analog pocket cameras)
They were all awful.
It was without a doubt the fastest CPU I had ever had at the time, but boy did it generate heat and need cooling.
That machine sounded like an always-on vacuum cleaner.
[0] https://www.cnet.com/culture/pcs-plagued-by-bad-capacitors/
https://obits.dallasnews.com/us/obituaries/dallasmorningnews...
Worked on Itanium too. It was even more amazing that Microsoft actually had support for it.
The fact that the fault was tiny and that few people were affected is definitely NOT the point.
The so-called Pentium 'bug' was the result of fundamentally terrible engineering on Intel's part in that the underlying design wasn't fit for purpose - it wasn't just a bug.
It seems to me the authors of this story do not understand the implications: what Intel did was fundamentally wrong in that its math processing was flawed by design from the outset, otherwise they would have included the Pentium in their list.
To achieve increased math processing speed, Intel broke the mathematics down into part algorithm and part lookup table - that is, instead of having the algorithm complete the whole task (which is the logical way of doing things). If the algorithm itself were wrong then every calculation would also be wrong and the problem would be obvious from the outset. Adding a lookup table makes calculations faster, but one would then have had to test every entry in the lookup table - and Intel didn't.
Look at the problem like this - think of a set of log or trig tables, now think of the implications if one of those table entries is incorrect. What Intel did was deliberate cheating and it failed to get away with it. Intel would have known this from the outset and thus the problem was an integral design fault rather than a bug.
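To make the failure mode concrete, here's a toy table-seeded divider (nothing like Intel's actual SRT hardware; the sizes and names are made up): corrupt a single entry and only the narrow slice of divisors that index that entry come out silently wrong, in roughly the 4th significant digit, while every other input looks fine - which is exactly why spot-check testing misses it.

    /* Toy table-seeded divider (NOT the Pentium's SRT scheme; sizes made up).
     * One corrupted entry only affects divisors whose leading bits index it,
     * so random spot checks are very unlikely to find the problem. */
    #include <stdio.h>

    #define TBL_BITS 6
    static double recip_seed[1 << TBL_BITS];

    static void build_table(int corrupt_entry)
    {
        for (int i = 0; i < (1 << TBL_BITS); i++)
            recip_seed[i] = 1.0 / (1.0 + (double)i / (1 << TBL_BITS));  /* divisor in [1, 2) */
        if (corrupt_entry >= 0)
            recip_seed[corrupt_entry] *= 0.9;   /* one entry is 10% off */
    }

    static double divide(double n, double d)   /* assumes 1 <= d < 2 */
    {
        int idx = (int)((d - 1.0) * (1 << TBL_BITS));   /* leading fraction bits pick the entry */
        double r = recip_seed[idx];
        r = r * (2.0 - d * r);   /* two Newton-Raphson refinements of the reciprocal: */
        r = r * (2.0 - d * r);   /* a 10% seed error shrinks to ~1e-4, not to zero    */
        return n * r;
    }

    int main(void)
    {
        build_table(37);   /* corrupt a single entry */
        double d_ok = 1.25, d_bad = 1.0 + 37.5 / 64.0;   /* the second one hits entry 37 */
        printf("clean entry: %.12g (exact %.12g)\n", divide(1.0, d_ok),  1.0 / d_ok);
        printf("bad entry:   %.12g (exact %.12g)\n", divide(1.0, d_bad), 1.0 / d_bad);
        return 0;
    }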
Intel knowingly implemented a design that had flawed data integrity at its most fundamental level. What Intel did was so nasty that it's hard to think of how it could have made matters worse than if it had deliberately tried to introduce a fault.
In my opinion, any company that would stoop to such low ethical tactics as Intel did with the Pentium's design would have demonstrated that it cannot be trusted - and I've never trusted Intel from that point onward.
If anyone ever needs a reason for why processors should have open design architectures that are subject to third-party scrutiny then this is the quintessential example.
There's a great writeup with the results of Intel's internal investigation [2], which outlines the challenge in testing production chips for this sort of bug. A key point:
> The fraction of the total input number space that is prone to failure is 1.14 x 10^-10.
So around 1 in 9 billion possible numerator/denominator pairs exhibit the bug. Testing 9 billion double-precision FDIV divides on a 60MHz Pentium would take almost four days, if my math checks out and the CPU could do 2.5 billion divides per 24 hours.
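(Sanity check on that: 9 x 10^9 divides / 2.5 x 10^9 divides per day ≈ 3.6 days just to expect one failing pair from random sampling - and that is nowhere near exhaustively covering the full space of numerator/denominator pairs.)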
[1]: https://en.wikipedia.org/wiki/Division_algorithm#SRT_divisio...
[2]: https://users.fmi.uni-jena.de/~nez/rechnerarithmetik_5/fdiv_...
I'm aware of most of those details as I took a keen interest in the matter at the time. I'm also aware of the argument for the use of said algorithm.
Whether one adopts this approach or not is a philosophical argument, and I just happen to believe it's bad (and ugly) engineering - and in this case witness the outcome: it cost Intel dearly in both monetary and PR terms.
Can you expand on this? I thought all FPUs used lookup tables? Even the 8087 had them.
This is nonsense. There's no functional difference between "lookup table" and "algorithm" (whatever that means) when it comes to a circuit design. Both are perfectly valid ways, nothing inherently wrong with either.
ah well
And the whole thing is built for a world where everybody is writing code in Ada. I bet some compiler makers were salivating at the prospect of collecting all of those huge license fees from developers.
Itanium held the idea that we could accurately predict ILP at compile time (when the halting problem clearly states that we cannot).
Transmeta said VLIW has the best theoretical PPA possible, so let's wrap that in a large, programmable JIT to analyze/optimize stuff to take advantage.
Modern CPUs run quite a bit closer to transmeta, but they largely use fixed-function hardware rather than being able to improve performance at a later time.
If we could nail down that ideal VLIW architecture, we could sell a given chip at various process sizes and then offer various paid "software" upgrades or compatibility packs for various ISAs to run legacy code.
At least it's a pipe dream worth looking into.
I don't know where these notions are coming from.
Compilers can (and do) reorder instructions to extract as much parallelism as possible. Further, SIMD has forced most compilers down a path of figuring out how to parallelize, at the instruction level, the processing of data.
Further, most CPUs nowadays are doing instruction reordering to try and extract as much instruction-level parallelism as possible.
Figuring out what instructions can be run in parallel is a data dependency problem, one that compilers have been solving for years.
Side note: the instruction reordering actually poses a problem for parallel code. Language writers and compiler writers have to be extra careful about putting up "fences" to make sure a read or write isn't happening outside a critical section when it shouldn't be.
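A minimal C11 sketch of that kind of fence, using <stdatomic.h> (the flag/payload names are made up): without the release/acquire pairing, either the compiler or the CPU would be free to let the payload accesses drift across the flag.

    /* Publish a payload behind a flag with release/acquire ordering, so
     * neither the compiler nor the CPU reorders the payload write/read
     * across the flag. Names are illustrative. */
    #include <stdatomic.h>
    #include <stdbool.h>

    int payload;                   /* ordinary data */
    atomic_bool ready = false;     /* the "flag" */

    void producer(void)
    {
        payload = 42;                                   /* 1: write the data */
        atomic_store_explicit(&ready, true,
                              memory_order_release);    /* 2: publish; the store above can't sink below this */
    }

    int consumer(void)
    {
        while (!atomic_load_explicit(&ready, memory_order_acquire))
            ;                                           /* spin; the read below can't hoist above this */
        return payload;                                 /* guaranteed to see 42 */
    }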
The weak memory model:
https://devblogs.microsoft.com/oldnewthing/20170817-00/?p=96...
Inability to address low-power designs:
https://en.m.wikipedia.org/wiki/StrongARM
"According to Allen Baum, the StrongARM traces its history to attempts to make a low-power version of the DEC Alpha, which DEC's engineers quickly concluded was not possible."
The other major problem with the Alpha was the high license costs of DEC operating systems, which greatly helped put it in the grave.