The development of this chip shows that it doesn't (and shouldn't!) matter to the ML teams at Meta how 'fast ML is evolving.'
Indeed, what it demonstrates is that a huge, global, trillion-dollar business has operationalized an existing ML technology to the extent that it can invest in, and deploy, customized hardware to solve a business problem.
How ML "evolves" is irrelevant. They have a system which solves their problem, and they're investing in it.
You've gotta learn to walk before you can run
And building out specialized hardware does lock you in to a certain extent. Want to use more than 128GB of memory? Too bad, your $10B chip doesn’t support that.
Which is probably why Meta is also buying Nvidia's biggest datacenter cards by the shipload. Conversely, there is no need to run inference for a small model, say a text-ad recommendation system, on an H100 with its attendant electricity and cooling costs.
You don’t always need a Ferrari to go to the store
It’s custom silicon designed for a specific, known workload. It’s not designed to be a general purpose part or to be future proofed for unknown future applications.
When a new application comes along with new requirements, the teams will use their experience to create a new chip targeting that new application.
That’s the great part about custom silicon: You’re not hitting general specs for general applications that you may not even know about yet. You’re building one very specific thing to do a very specific job and do it very well.
At Facebook's scale, spherical-cow raw performance stats matter far less than real-world workloads per operations dollar. They can also repurpose their GPUs for other workloads and let their custom chips handle the boring baseline stuff.