silicon taken up that could've been used for a few more compute units on the GPU, which is often faster at inference anyway and way more flexible, programmable, and well documented.
The thing is, when you take a picture on an Apple device, it's performing ML tasks while barely sipping battery.
Microsoft maybe shouldn't be chasing Apple, especially since they don't actually have any market share in tablets or phones, but I see what they're getting at: they are probably tired of their OS living on devices that get half the battery life of their main competition.
And here's the thing: Qualcomm's solution blows Intel out of the water. The only reason not to use it is that Microsoft can't pull off the kind of architecture transition that Apple does. Apple can get 100% of their users to switch architectures in about 7 years whenever they want.
When you use an Apple device, it’s performing ML tasks while barely using any battery life. That’s the whole point of the NPU. It’s not there to outperform the GPU.
The Core Ultra lineup is supposed to be low-power, low-heat, right? If you want more compute power, pick something from a different product series.
I think that "dark silicon" mentality is mostly lingering trauma from when the industry first hit a wall with the end of Dennard scaling. These days, it's quite clear that you can have a chip that's more or less fully utilized, certainly with no "dark" blocks that are as large as a NPU. You just need to have the ability to run the chip at lower clock speeds to stay within power and thermal constraints—something that was not well-developed in 2005's processors. For the kind of parallel compute that GPUs and NPUs tackle, adding more cores but running them at lower clock speeds and lower voltages usually does result in better efficiency in practice.
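To make the "more cores at lower clocks" argument concrete, here's a back-of-the-envelope sketch using the standard dynamic power model P ≈ C·V²·f. All numbers are illustrative assumptions, not measurements from any real chip; the point is just that halving frequency lets you drop voltage, and voltage enters the power equation squared.

```python
# Dynamic switching power is roughly proportional to C * V^2 * f.
# Numbers below are hypothetical, chosen only to illustrate the scaling.

def dynamic_power(cap, voltage, freq):
    """Dynamic power in arbitrary units: capacitance * voltage^2 * frequency."""
    return cap * voltage**2 * freq

# One core at full speed: 3 GHz, requiring 1.0 V.
p_one = dynamic_power(cap=1.0, voltage=1.0, freq=3.0)
work_one = 3.0  # for embarrassingly parallel work, throughput ~ total core-GHz

# Two cores at half speed: 1.5 GHz each, which (we assume) allows 0.8 V.
p_two = 2 * dynamic_power(cap=1.0, voltage=0.8, freq=1.5)
work_two = 2 * 1.5  # same total core-GHz, i.e. same throughput

print(p_one, work_one / p_one)  # 3.0 units of power, 1.0 work per unit power
print(p_two, work_two / p_two)  # 1.92 units of power, ~1.56 work per unit power
```

Same throughput, roughly two-thirds the power, at the cost of extra die area: that's the tradeoff wide, slow-clocked blocks like GPUs and NPUs are built around.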
The real answer to the GPU vs NPU question isn't that the GPU couldn't grow, but that the NPU has a drastically different architecture, making power-versus-performance tradeoffs that carve out a niche of inference tasks where the NPU is the better choice.