It's a shame to see so many people dismissing this work as marketing. I see lots of clever people working hard on really novel and interesting stuff, and I really do think that ML has real potential to customize a design much more "deeply" than traditional automation tools.
It's very much the sort of problem crypto is having: there are so many grifters that actual interesting uses of the technology are very hard to identify and take seriously.
On the other hand: if the people who do serious work in this area don't call out this nonsense, they must accept that their (serious) work becomes devalued.
> It's very much the sort of problem crypto is having: there are so many grifters that actual interesting uses of the technology are very hard to identify and take seriously.
Here, the same holds.
https://en.wikipedia.org/wiki/Travelling_salesman_problem
which is closely related to the canonical NP-complete problem of logic, Boolean satisfiability
https://en.wikipedia.org/wiki/Boolean_satisfiability_problem
as well as the playing of games like Chess, Poker, etc.
Modern neural networks also have optimization as a theme, even when the output is a classification or something else that doesn't look like optimization: the network itself is trained to minimize an error function. People used these kinds of algorithms back in the 1980s to lay out chips
https://en.wikipedia.org/wiki/A*_search_algorithm
and it's only natural that new optimization techniques (both direct and through heuristics like the neural network used in AlphaGo) are applied to chip design today.
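A toy sketch of that "training is optimization" point (my own illustration, not from the article): fitting a one-parameter model by plain gradient descent on a squared-error loss.

```python
# Fit y = w*x to data by minimizing the squared error with gradient descent.
# Hypothetical data, roughly following y = 2x.
data = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2)]

w = 0.0     # initial parameter
lr = 0.02   # learning rate
for _ in range(500):
    # d/dw sum((w*x - y)^2) = sum(2*(w*x - y)*x)
    grad = sum(2 * (w * x - y) * x for x, y in data)
    w -= lr * grad

print(round(w, 2))  # close to the true slope of ~2
```

The same loop, scaled up to millions of parameters and a fancier update rule, is all that "training" means here; placement and routing just swap in a different objective.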
Using RL to automate DRC fixes, and modeling standard cells as graph/flow problems are things I'd love to learn more about. What papers would you recommend reading to get started (for a grad student already familiar with machine learning basics)?
https://ti.arc.nasa.gov/m/pub-archive/1244h/1244%20(Hornby)....
Have there been any odd, surprising or wildly efficient chip designs that have come out of the AI designs?
It's hard to read a talk like this, delivered from a pulpit, and not see shout-outs to the incredibly innovative open-source projects like OpenROAD, which has been shipping amazingly well-routed-by-AI chips for a while now. There are papers galore you can cite, and many open-source designs[1].
It's not like Nvidia is promising anyone else will benefit from this work. This seems to be very high-level coverage of what their R&D department is looking at, and perhaps (or perhaps not) using. The article makes it hard to find out what is available and what has been published or otherwise deeply discussed (which I think is the best we can hope for from Nvidia, short of real participation). There's only one paper linked, on NVCell[2], described as:
> The first is a system we have called NVCell, which uses a combination of simulated annealing and reinforcement learning to basically design our standard cell library.
This just feels like so much else going on in computing: WSL coming to Windows, the recent Unity vs Unreal topic[3]. It's hard to imagine refusing to participate with others. It's hard to imagine not being part of the open-source community, working shoulder to shoulder to push for better. NVidia patently doesn't get it, patently isn't participating, patently isn't there. It's cool we can hear what they're up to, but it's also extremely NVidia that they're doing it all on their own. Anyhow, looking forward to more AI-based chip power-system design starting to emerge; that sounds like a good idea, NV.
[1] https://theopenroadproject.org/
[2] https://research.nvidia.com/publication/2021-12_nvcell-stand...
[3] https://news.ycombinator.com/item?id=31064552 (412 points, 3 days ago, 311 comments)
Having a lead in chip design is literally their bread and butter. I think it's extremely "publicly traded company" more than "NVidia". Do you have an example of a company releasing an open-source version of their secret sauce (the foundation of their profits)?
Worked out great for them /s (albeit the writing was already on the wall for them by that point).
The chip design itself should be the secret sauce, not the tools you make the chip with. Nvidia is resolutely not contributing. Many other companies are starting to get on board with open chip design. This doesn't mean the chips have to be open, but the tooling needs to be something shared & co-developable. If this is a little pet research project, that's one thing, but there really needs to be ongoing workforce development, a strong advance. The NSF's TILOS, a strong alliance/nexus of researchers within & around the OpenROAD community, gets this[1]:
> TILOS – The Institute for Learning-enabled Optimization at Scale – is an NSF National AI Research Institute for advances in optimization, partially supported by Intel Corporation. The institute began operations in November 2021 with a mission to "make impossible optimizations possible, at scale and in practice".
> There are six universities in TILOS: UCSD, MIT, National University, Penn, UT-Austin, and Yale. The institute seeks a new nexus of AI and machine learning, optimization, and use in practice. Figure 4 shows four virtuous cycles envisioned for the institute: 1. mutual advances of AI and optimization provide the foundations; 2. challenges of scale, along with breakthroughs from scaling, bind together foundations and the use domains of chip design, networks and robotics; 3. the cycle of translation and impact brings research and the leading edge of practice closer together; and 4. the cycle of research, education, and broadening participation grows the field and its workforce.
The virtues written here are self-evident. Trying to get good on your own without helping advance the field, forgoing the scale of many working together, staying out of open research, accepting the risks of isolated teams, sitting out the cycles of development: whatever the NVidia or "publicly traded company" worlds think they're doing, they're missing out, and hurting everyone, especially themselves, with this oldschool zero-sum competitive thinking.
There are plenty of companies releasing the chips too: Google's OpenTitan[2] security chip, or WD's SweRV RISC-V core, which replaces the ARM R-series in their drive controllers[3]. Open standards, if not open chips, like UCIe for chiplets or CXL for interconnect are again examples of literally everyone but NVidia playing well together, trying for better, standardizing a future for participation & healthy competition & growth. Nvidia, again and again, is the company which simply will not play with others.
I challenge you to answer your own question in reverse: are any companies other than Nvidia embarking on AI/ML chipmaking in a closed fashion? There probably are; let's follow & watch them.
[1] https://theopenroadproject.org/news/leveling-up-a-trajectory...
Sounds tasty, I'll have to take a trip to the nvidia cafe some time =)
https://arxiv.org/pdf/2012.10597.pdf
https://research.nvidia.com/publication/2020-07_grannite-gra...
https://research.nvidia.com/sites/default/files/pubs/2020-07...
To me this sounds like a good use case for AI and neural nets. It doesn't appear to be looking to replace the traditional tools, just augment them.
I hope you don't have the idea that chip routing is done manually.
I worked on MCU layout around 2011, and only the digital logic was autorouted/placed.
These require:
1) Longer bit length arithmetic
32-bit float simply isn't enough. 64-bit float is close, but limited. You really want 128-bit integers. And nVidia isn't delivering that.
2) Real algorithmic improvements
We're still stuck with computational geometry algorithms that don't parallelize. It would be awfully useful if nVidia would actually research some new algorithms instead of just waving around the ML/AI marketing wand.
But, then, this is the company that built itself on benchmarketing, so ...
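The 32-bit-float point is easy to demonstrate (my own illustration; using nanometres as the unit is an assumption): at the scale of a 3 cm die, single precision cannot even distinguish features 1 nm apart.

```python
import struct

def to_f32(x):
    """Round a Python double through IEEE-754 single precision and back."""
    return struct.unpack('f', struct.pack('f', x))[0]

# A coordinate near the far edge of a 3 cm die, expressed in nanometres.
coord_nm = 30_000_000.0       # 3 cm
neighbour = coord_nm + 1.0    # a feature 1 nm away

# At 3e7 the spacing between adjacent float32 values is 2 nm,
# so both coordinates collapse to the same number.
print(to_f32(coord_nm) == to_f32(neighbour))  # True
```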
Back-of-the-napkin maths: a chip that is 3 cm on each side (which is huge) can be subdivided into 0.007-nanometre increments using 32-bit integers. That's about 1/7th of the radius of a hydrogen atom!
The resolution with 64-bit floats (let alone integers) would be absurd: roughly two million times finer-grained still. That's probably enough to simulate individual electrons zipping around in their orbitals with acceptable precision.
Even if the simulation codes did something silly like simply assigning 1.0 = 1cm, a 64-bit float still allows resolutions of something like a billionth of a nanometre...
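For what it's worth, the napkin arithmetic checks out (a quick sanity-check script of my own):

```python
# Grid pitch of a 3 cm die addressed with 32-bit integer coordinates.
die_nm = 3e7                 # 3 cm in nanometres
step_int32 = die_nm / 2**32
print(step_int32)            # ~0.007 nm, far below atomic scale

# A float64 mantissa carries 53 bits, so it resolves 2**(53-32)
# (~2 million) times more steps across the same die than a 32-bit integer.
print(2**(53 - 32))
```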
Absolutely.
Even if you start with 32 bits, you often have polygons with many sides. In the worst case, you are modeling a "circle" and have to increase your precision to a level sufficient to stay accurate. (Note that nobody in their right mind in VLSI would ever draw a "circle"; however, you wind up with an "implied" one due to DRC. More on that below...)
The problem is that line-sweep intersection checks in DRC require approximately 3n + a couple of bits to differentiate intersections that may be close to degenerate, or multiple intersections near each other. So, if you start with 32-bit numbers, you require approximately 96 bits plus a little for your intermediate calculations. (See Hobby, "Practical segment intersection with finite precision output". I'll let people find their own copy of the paper so HN doesn't splatter some poor site that I link.)
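A quick illustration of that bit growth (my own sketch, using Python's exact rationals rather than Hobby's rounding scheme): the standard line-intersection formula multiplies a 2n-bit cross product by an n-bit difference, so intermediates reach roughly 3n bits even when the final coordinates are small.

```python
from fractions import Fraction

def intersect(p1, p2, p3, p4):
    """Exact intersection of lines p1-p2 and p3-p4 (integer endpoints)."""
    (x1, y1), (x2, y2), (x3, y3), (x4, y4) = p1, p2, p3, p4
    c1 = x1 * y2 - y1 * x2                # cross products: ~2n bits
    c2 = x3 * y4 - y3 * x4
    den = (x1 - x2) * (y3 - y4) - (y1 - y2) * (x3 - x4)
    nx = c1 * (x3 - x4) - (x1 - x2) * c2  # ~3n-bit intermediates
    ny = c1 * (y3 - y4) - (y1 - y2) * c2
    return Fraction(nx, den), Fraction(ny, den)

A = 2**31  # endpoints near the 32-bit coordinate limit
x, y = intersect((1, A - 1), (A - 1, 3), (2, A - 2), (A - 2, 5))
print(x < A)  # the intersection point itself is small...

# ...but the intermediate products are not: ~93 bits for ~31-bit inputs.
c1 = 1 * 3 - (A - 1) * (A - 1)
print((c1 * (2 - (A - 2))).bit_length())  # 93
```

Fixed 64-bit (or float) intermediates silently truncate exactly these values, which is why near-degenerate intersections go wrong.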
You would think that doesn't matter since VLSI tends to limit itself to rectilinear and 45 degree angles. Unfortunately life isn't that simple.
If you take a simple rectangle and say "Nothing can be within distance x", you get a slightly larger rectangle parallel to the sides. Easy. The problem is that you also wind up with an implied quarter circle (told you this would come back) near each corner. Not so easy.
Put those circles such that they overlap only very slightly and you may have segments that are pretty close to tangent. Super not easy. Unfortunately, VLSI design often consists of putting those metals such that they are riiiight at the limit of spacing. Consequently, your super-not-easy case also becomes a very common case. Ouch.
Of course, you could just move the rectangle completely outward so that you have squares at the corners. However, that gives up a non-trivial amount of area that most places aren't willing to concede.
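A toy version of that corner case (my own sketch, not a real DRC engine): a naive check against the rectangle grown by the spacing s flags points near the corner that the true Euclidean rule, with its implied quarter circle, does not. That over-reporting is exactly the area you give up by squaring off the corners.

```python
import math

# Rectangle [0, w] x [0, h] with minimum spacing s around it (toy units).
w, h, s = 100, 50, 10

def violates_spacing(px, py):
    """True rule: Euclidean distance from the point to the rectangle < s."""
    dx = max(0 - px, 0, px - w)
    dy = max(0 - py, 0, py - h)
    return math.hypot(dx, dy) < s

def violates_bbox(px, py):
    """Naive rule: point inside the rectangle grown by s on every side."""
    return -s < px < w + s and -s < py < h + s

# A point diagonally off the corner: inside the grown box, but outside
# the implied quarter circle (hypot(8, 8) ~ 11.3 > s).
p = (w + 8, h + 8)
print(violates_bbox(*p), violates_spacing(*p))  # True False
```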
There is a reason why Siemens (née Mentor) Calibre is so egregiously expensive.
Careful there! Floating-point numbers do not form a proper field; addition is not even associative, so they don't form a semigroup. Due to rounding and the uneven distribution of representable values, the field axioms don't hold (e.g. both associativity and distributivity can be violated), and great care has to be taken to assure the numeric stability of computations.
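Easy to demonstrate in any IEEE-754 language; with Python doubles, for instance:

```python
# Addition of IEEE-754 doubles is commutative, but not associative:
a, b, c = 0.1, 0.2, 0.3
print((a + b) + c == a + (b + c))   # False
print((a + b) + c, a + (b + c))     # 0.6000000000000001 vs 0.6

# Distributivity fails too:
print(100 * (a + b) == 100 * a + 100 * b)  # False
```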
How much slower (per unit area) is that to do in software, compared to a full 128-bit hardware unit?
See:
https://developer.nvidia.com/blog/cuda-11-6-toolkit-new-rele...
https://developer.nvidia.com/blog/implementing-high-precisio...
“Is the minimal distance between all metal routing > 10 nm” etc.
Can you explain why high precision is needed for that?
I'm less confident about it when it comes to anything that involves calculating anything electromagnetic because I just don't know that subfield.
The economics of the software industry (or at least of the products I work on) depend on the assumption that the cost of computing (including storage) diminishes exponentially over time! <3
Of course exponential growth will help, but relying on it seems like a bit too much risk.
"Sorry Dave - I can't quite do that ..."