Nvidia Speeds Key Chipmaking Computation by 40x (opens in new tab)

(spectrum.ieee.org)

123 pointsafrcnc3y ago40 comments

40 comments

28 comments · 6 top-level

ChuckNorris893y ago· 12 in thread

It's just sad how absent AMD is from such innovative uses of GPUs. It wouldn't bother me too much if at least their GPUs were much more affordable than Nvidia's, but they're not. They're at best playing catch-up months or years later with slight price discounts (the Steam HW survey still reflects this market discrepancy)

And I'm not talking about gimmicks like RTX, I'm talking about all these cool use cases around ML and DL like background noise cancelling, video upscaling, camera eye contact real-time deepfakes and now this. And that's if you ignore all the mind-blowing research papers put out by Nvidia which aren't featured in consumer apps yet.

This is Nvidia's biggest moat and AMD isn't even in the race here and for some reason Lisa Su seems to not give enough of a shit to compete.

I hate Nvidia for their price gouging and anti-consumer practices, but at least they haven't gotten complacent and are innovating on all fronts to keep pushing the envelope. Massive respect for the tech leadership at Nvidia.

sorenjan3y ago

The GPGPU situation is absolutely ridiculous. Imagine if you couldn't run code for an Intel CPU on an AMD chip, who would buy them? But somehow we accept that our GPUs, small super computers that often cost 50% of the total computer cost, can only run games and vendor proprietary code. I thought we would have this figured out a decade ago.

I can't even blame Nvidia, of course they're gonna do what's best for them, and it has worked. I blame AMD for completely dropping the ball on the GPU compute segment, and I blame users for preferring Cuda libraries instead of OpenCL.

I hope Intel and maybe AMD can get the GPGPU market to something that resembles something open and most importantly interoperable. But Nvidia has a big head start.

dragonwriter3y ago

> Imagine if you couldn't run code for an Intel CPU on an AMD chip, who would buy them?

Enough people that the Intel standard would lose out and you'd need to ask the reverse question? (See IA64 vs. AMD64.)

But if you mean, “what if there was a major microprocessor line incompatible with AMD64”, well, its called ARM and lots of people buy it.

1 more reply

fnordpiglet3y ago

Sort of like how you can’t run ARM on x86 or PowerPC or risc-v?

1 more reply

kkielhofner3y ago

Completely agree.

Whenever use of AMD GPUs for ML comes up on HN I echo your points with the added personal experiences (PAIN) I've had trying to actually use an AMD GPU for anything other than driving a display.

In terms of the tech leadership at Nvidia, all you need to do is peek at the mind-blowing number of repos they have on Github - literally hundreds of component software pieces across every layer that at this point can do anything from drastic performance increases to completely unique (CUDA only of course) functionality.

On HN especially the whole "proprietary driver on Linux desktop situation" has hurt Nvidia significantly in terms of hearts and minds. As I said, then you look at almost any other software provided by Nvidia and realize they're actually a huge champion and supporter of open source for just about every aspect of the ecosystem other than the driver itself - and they're working on open sourcing the driver as well.

AMD occupies a weird space in GPU compute - there are massive HPC deployments of AMD GPUs. Presumably they only work because AMD is throwing a ton of essentially one-off support at them for deployment. On the other end you have their "support" for low to mid-range GPU compute. I say "support" with quotes because you realize very quickly it's absolutely pathetic to the point of useless and run back screaming to Nvidia/CUDA.

I'm not an Nvidia fanboy but my opinion at this point is the hardware markups for Nvidia GPU essentially subsidize all of the incredible (largely open source) software and ecosystem support they provide. Yes, they engage in anti-competitive practices but please show me a large corporation that doesn't. Fact is Nvidia has invested a massive amount of resources for well over a decade to earn their dominance of GPU compute.

Spending twice as much (somewhat factual but overblown popular opinion on HN) on Nvidia hardware becomes an obvious choice when you realize you're going to burn A TON of time (and still fail) trying to get AMD GPU hardware to actually do anything in ML.

0dayz3y ago

Sure, but I think you can't really blame AMD for it.

Nvidia has dominated the market for a long time, and unlike Intel they up the prices and wisely spent enough of it on R&D, Nvidia is just reaping the rewards for it.

blihp3y ago

Sure you can. The difference between the CPU and GPU markets is that AMD is a competent competitor in one of them. Pre-Ryzen AMD knew they weren't competitive and AMD CPU pricing reflected that. The reason nVidia is getting away with their pricing is that AMD has been producing mediocre hardware (and drivers) and pricing it as if it were premium and competitive for the money. AMD's continually declining GPU market share indicates that customers aren't buying their argument.

iforgotpassword3y ago

But it applies to AMD in every way. I could understand that pre-ryzen when they were really cash restricted and had nothing to compete anyways. So you don't throw money at software devs. but the software side apart from (Windows) gaming is still a shit show with AMD today.

We wanted to use MI50s at work because it was promised they can do SRIOV, but we never got any further with AMD support than "it should work". They took ages to respond, and could not tell what was wrong from the extensive logs and hwinfo we provided them with.

Also the PCI reset bug that plagued multiple generations. There's a guy maintaining a kernel module that works around that issue in a whacky way. According to his research and reverse engineering, AMD could fix that with a firmware update to those cards. Even got in contact with AMD engineers briefly and outlined what the problem was. Then radio silence, and a couple months later AMD added a very similar workaround in their kernel module, the amdgpu driver. It's just that a fix in there doesn't make any sense whatsoever, because you need that fix when you do PCI passthrough, in which case you explicitly do not load the amdgpu module, as you don't use the GPU on the host machine but, well, pass it through to the VM.

1 more reply

amdgpunope3y ago

I applied for a job with AMD's GPU division. I am senior and qualified, and the hiring manager was a petty tyrant. At a company as large as AMD, departments have their own culture. Lisa Su has done an incredible job revitalizing their position in the CPU market, but the GPU side needs some serious attention.

snvzz3y ago

The "compute" market is about to be disrupted[0].

AMD might be right not to put effort into manually optimizing for the old approach.

0. https://www.youtube.com/watch?v=yHrdEcsr9V0

newZWhoDis3y ago

RT is not a gimmick, it’s the rapidly-approaching future

KaoruAoiShiho3y ago

Is it possible that AI rendering will make RT moot.

3 more replies

oblak3y ago

Current uses, with the notable exception of Quake 2 and possible that nvidia sponsored Portal mod, are pure gimmicks. The real thing requires ridiculous resources which are not available for real-time graphics.

bottlepalm3y ago· 6 in thread

That's pretty cool they're using GPUs to design GPUs.

amelius3y ago

This is not fundamentally new. We've been using CPUs to design CPUs for ages.

carlmr3y ago

And compilers to compile the next version of themselves.

LoganDark3y ago

Well, they're using GPUs to perform the light calculations required to create a photomask that will get the designs they've already produced onto real silicon. But sure, they're using GPUs to design GPUs. I'm sure this technology will create more opportunities to use inverse lithography and therefore have huge implications for the actual designs, which may no longer have to hold back quite as much due to compute concerns.

imhoguy3y ago

Add some AI and we can sit back and see it building Dyson sphere soon /s

tysam_and3y ago

Introducing the RTX for T 80!: the worlds' first nearly fully recursive GPU.

hackernewds3y ago

What about using GPUs to design GPUs that will design GPUs

iandanforth3y ago· 2 in thread

Aside: Nvidia named this cuLitho, which I learned from my Spanish speaking mother-in-law basically looks like they named it 'butt'. So if you see a bunch of rear-end related memes about this software you know why!

wslh3y ago

I am a native Spanish speaker, you can check what it means in google images (NSFW): https://www.google.com.ar/search?client=safari&hl=en-ar&sxsr...

majewsky3y ago

Or you can just look at a dictionary:

  culito, noun. Diminuitive of culo.

  culo, noun. Slang for arse.

https://en.wiktionary.org/wiki/culito https://en.wiktionary.org/wiki/culo

1 more reply

amelius3y ago· 2 in thread

How much does that mean in practice, considering:

- the computation output depends only on local features (my guess)

- most transistors look the same, so you can cache these results heavily

- the same holds for the interconnect layers

pjc503y ago

> - the computation output depends only on local features (my guess)

This is not correct: the features on the mask are larger than the features required on the target, which is the whole problem, so you need a diffraction pattern over a large area to produce a target feature. But then you need to overlap with the diffraction pattern of the next feature, and so on. I suspect that every pixel on the output gets determined by almost every pixel on the input, which is why it takes so much computation in the first place.

tomxor3y ago

According to the article, rule based patterns are already used, but inverse lithography is required in more unique or problematic cases. Ideally they would use it in all cases, maybe we end up with slightly wobbly but tolerable features for the more generic or less esoteric features that use the heuristics based approach.

> Even a change to the thickness of a material can lead to the need for a new set of photomasks

You can imagine the light spreading out through the other side of the mask, diffracting and interfering with neighbouring features through some radius. That it's not trivial to compute suggests the number of combinations within this radius is large enough to not be highly cacheable.

jmartrican3y ago

I noticed TSMC and ASML were mentioned but not Intel. I wonder if this will set Intel further behind TSMC.

ftxbro3y ago

The chipmaking computation is inverse lithography, and the Nvidia system that is speeding it up is cuLitho on DGX H100.

I like how inverse lithography and neural network backpropagation were both techniques introduced in the 1980s and now we are finally seeing them both come to life, so to speak, with our sufficiently advanced GPUs.

j / k navigate · click thread line to collapse

40 comments

28 comments · 6 top-level

ChuckNorris893y ago· 12 in thread

This is Nvidia's biggest moat and AMD isn't even in the race here and for some reason Lisa Su seems to not give enough of a shit to compete.

sorenjan3y ago

I hope Intel and maybe AMD can get the GPGPU market to something that resembles something open and most importantly interoperable. But Nvidia has a big head start.

dragonwriter3y ago

> Imagine if you couldn't run code for an Intel CPU on an AMD chip, who would buy them?

Enough people that the Intel standard would lose out and you'd need to ask the reverse question? (See IA64 vs. AMD64.)

But if you mean, “what if there was a major microprocessor line incompatible with AMD64”, well, its called ARM and lots of people buy it.

1 more reply

fnordpiglet3y ago

Sort of like how you can’t run ARM on x86 or PowerPC or risc-v?

1 more reply

kkielhofner3y ago

Completely agree.

Whenever use of AMD GPUs for ML comes up on HN I echo your points with the added personal experiences (PAIN) I've had trying to actually use an AMD GPU for anything other than driving a display.

0dayz3y ago

Sure, but I think you can't really blame AMD for it.

Nvidia has dominated the market for a long time, and unlike Intel they up the prices and wisely spent enough of it on R&D, Nvidia is just reaping the rewards for it.

blihp3y ago

iforgotpassword3y ago

1 more reply

amdgpunope3y ago

snvzz3y ago

The "compute" market is about to be disrupted[0].

AMD might be right not to put effort into manually optimizing for the old approach.

0. https://www.youtube.com/watch?v=yHrdEcsr9V0

newZWhoDis3y ago

RT is not a gimmick, it’s the rapidly-approaching future

KaoruAoiShiho3y ago

Is it possible that AI rendering will make RT moot.

3 more replies

oblak3y ago

bottlepalm3y ago· 6 in thread

That's pretty cool they're using GPUs to design GPUs.

amelius3y ago

This is not fundamentally new. We've been using CPUs to design CPUs for ages.

carlmr3y ago

And compilers to compile the next version of themselves.

LoganDark3y ago

imhoguy3y ago

Add some AI and we can sit back and see it building Dyson sphere soon /s

tysam_and3y ago

Introducing the RTX for T 80!: the worlds' first nearly fully recursive GPU.

hackernewds3y ago

What about using GPUs to design GPUs that will design GPUs

iandanforth3y ago· 2 in thread

wslh3y ago

I am a native Spanish speaker, you can check what it means in google images (NSFW): https://www.google.com.ar/search?client=safari&hl=en-ar&sxsr...

majewsky3y ago

Or you can just look at a dictionary:

  culito, noun. Diminuitive of culo.

  culo, noun. Slang for arse.

https://en.wiktionary.org/wiki/culito https://en.wiktionary.org/wiki/culo

1 more reply

amelius3y ago· 2 in thread

How much does that mean in practice, considering:

- the computation output depends only on local features (my guess)

- most transistors look the same, so you can cache these results heavily

- the same holds for the interconnect layers

pjc503y ago

> - the computation output depends only on local features (my guess)

tomxor3y ago

> Even a change to the thickness of a material can lead to the need for a new set of photomasks

jmartrican3y ago

I noticed TSMC and ASML were mentioned but not Intel. I wonder if this will set Intel further behind TSMC.

ftxbro3y ago

The chipmaking computation is inverse lithography, and the Nvidia system that is speeding it up is cuLitho on DGX H100.

j / k navigate · click thread line to collapse