It will be a niche product with poor sales.
While the 4090 can run models that use less than 24GB of memory at blistering speeds, models are going to continue to scale up, and 24GB is fairly limiting. Because LLM inference can take advantage of splitting layers among multiple GPUs, high-memory GPUs that aren't super expensive are desirable.
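To make that concrete, layer splitting is already a one-liner in common tooling. A minimal sketch with Hugging Face transformers + accelerate; the model id and per-card caps are placeholders, not a recommendation:

```python
# Sketch: shard an LLM's layers across two 24GB cards.
# Requires the accelerate package; the model id is hypothetical.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "some-org/model-too-big-for-one-card"  # placeholder

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",                     # let accelerate place layers on visible GPUs
    max_memory={0: "22GiB", 1: "22GiB"},   # leave headroom on each 24GB card
)

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```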
To share a personal perspective: I have a desktop with a 3090 and an M1 Max Studio with 64GB of memory. I use the M1 for local LLMs because I can give them up to ~57GB of memory, even though the output (in terms of tok/s) is much slower than with models that fit on the 3090.
I would gladly buy a card that ran a touch slower but had massive VRAM, especially if it was affordable, but I guess that puts me in that camp of enthusiasts you mentioned.
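For what it's worth, my workflow is basically llama.cpp with everything offloaded to Metal, where the GPU can address most of the unified memory. A minimal sketch with llama-cpp-python, assuming a quantized GGUF model (the path is a placeholder):

```python
# Sketch: run a local model on Apple Silicon with llama-cpp-python.
# n_gpu_layers=-1 offloads every layer to the Metal backend.
from llama_cpp import Llama

llm = Llama(
    model_path="./models/some-large-model.Q4_K_M.gguf",  # placeholder path
    n_gpu_layers=-1,  # offload all layers to the GPU / unified memory
    n_ctx=4096,
)

out = llm("Q: Why buy a high-VRAM card? A:", max_tokens=64)
print(out["choices"][0]["text"])
```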
>24GB is fairly limiting
Can I take a moment to suggest that maybe we're very spoiled?
24GB of VRAM is more than most people's system RAM, and that is "fairly limiting"?
To think Bill once said 640KB would be enough.
The fact is large language models require a lot of VRAM, and the more interesting ones need more than 24GB to run.
The people who are able to afford systems with more than 24GB of VRAM will go buy hardware that gives them that, and when GPU vendors release products with insufficient VRAM, they limit their market.
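The back-of-envelope math makes the point: weights alone take roughly parameter count times bytes per parameter, before you add KV cache and runtime overhead. A quick sketch (my numbers, nothing vendor-specific):

```python
# Rough weight footprint: params x bits-per-param / 8, in GiB.
# Ignores KV cache and runtime overhead, so real needs are higher.
def weight_gib(params_billion: float, bits_per_param: float) -> float:
    return params_billion * 1e9 * bits_per_param / 8 / 2**30

for params in (7, 13, 34, 70):
    for bits in (16, 8, 4):
        print(f"{params:>3}B @ {bits:>2}-bit: {weight_gib(params, bits):6.1f} GiB")
```

A 13B model at fp16 is already ~24 GiB of weights, which doesn't fit on a 24GB card, and a 70B model needs ~33 GiB even at 4-bit.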
I mean inequality is definitely increasing at a worrying rate these days, but let's keep the discussion on topic...
But selling to machine learning enthusiasts is not a bad place to be. A lot of these enthusiasts are going to go on to work at places that are deploying enterprise AI at scale. Right now, almost all of their experience is with CUDA, and they're likely to recommend hardware they're familiar with. By making consumer Intel GPUs attractive to ML enthusiasts, Intel would make their enterprise GPUs much more interesting to enterprise buyers.
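As a small illustration of that lock-in: device-agnostic PyTorch is possible, but "cuda" is the device everyone types first, and everything else is the fallback. (A sketch; torch.xpu is the Intel GPU backend in recent PyTorch releases.)

```python
# Sketch: the device-selection dance needed to run common ML code
# anywhere other than an NVIDIA card.
import torch

if torch.cuda.is_available():
    device = torch.device("cuda")   # the default almost all code assumes
elif hasattr(torch, "xpu") and torch.xpu.is_available():
    device = torch.device("xpu")    # Intel GPUs
elif torch.backends.mps.is_available():
    device = torch.device("mps")    # Apple Silicon
else:
    device = torch.device("cpu")

x = torch.randn(1024, 1024, device=device)
print((x @ x).sum().item(), "on", device)
```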
It doesn't need to be consumer grade, and it doesn't need to be ultra high-end either.
It needs to be cheap enough for my department to expense it via petty cash.
It doesn't even matter if that's your primary goal or not.
Frustrated AMD customers willing to put their money where their mouth is?
>4090
That's noob hardware. The A6000 is my choice.
Which really only further emphasizes your point.
>CPU based is a waste of everyone's time/effort
>GPU based is 100% limited by VRAM, and is what you are realistically going to use.
It's not like they don't have a monopoly on pre-installed OSes.
If Intel sells a stackable kit with a lot of RAM and a reasonable interconnect, a lot of corporate customers will buy it. It doesn't even have to be that good, just halfway between PCIe 5.0 and NVLink.
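For scale, PCIe 5.0 x16 is ~64 GB/s per direction while NVLink on Hopper-class parts is quoted at ~900 GB/s per GPU; "halfway" below is just my own geometric-mean reading of that gap:

```python
# Rough, publicly quoted per-GPU bandwidth figures (GB/s).
pcie5_x16 = 64   # PCIe 5.0 x16, one direction
nvlink4 = 900    # NVLink on Hopper-class parts, aggregate

halfway = (pcie5_x16 * nvlink4) ** 0.5  # geometric mean
print(f"~{halfway:.0f} GB/s would sit between the two.")
```

A couple hundred GB/s per device at a sane price would already be genuinely useful for multi-card inference.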
But it seems they are still too stuck in their old ways. I wouldn't count on them waking up. Nor AMD. It's sad.