So with proprietary blobs that give you more trouble that they're worth?
No one seriously cares about this running Windows. We want Steam and CUDA/Ollama, and Windows just gets in the way. nVidia are simply not that oblivious, but I have to admit in their position I'd have considered the Microsoft involvement more trouble than it's worth, which is among the many reasons I'm not a billionaire.
Maybe they think the RAM market is so terrible it will kill the whole initiative regardless.
Has Steam finally started to push for native Linux games instead of translating Windows ones?
I expect we'll get there in a few years, so perhaps this is Nvidia taking an early step in that direction.
In that case, this goes against Anthropic and OpenAI's business models. Which is a double whammy after Jensen Huang's recent comment about how agentic coding will only increase demand for software engineers, not reduce it.
So it also feels like a part of a budding shift in the competitive tension between the various parts of the AI supply chain.
Non-techy consumers may never do it, but at some point businesses are going to start asking when do they stop paying per token and start running models themselves. Right now the hardware is cost prohibitive, but I doubt that'll always be the case. Eventually the hardware will get cheaper and more available, and Nvidia seems to be betting on that.
They don't care where inference happens, so long as it happens on Nvidia hardware.
And when that happens, the pitch to non-techy users is "Free ChatGPT you can use offline with zero privacy risk". Once hardware accessibility and LLM efficiency advance to the point that this becomes feasible, I suspect it'll result in a much bigger hit to the cloud AI market than many expect.
After a few generations (and over a decade) that was indistinguishable from the CPU chip itself.
It's a long hyperbole, I know, but I think local inference is inevitable; and the big fishes know it.
Will that be a complex technical setup? An appliance? An additional chip in your motherboard? So transparent it's burned right into the CPU? Those are just implementation details. We're probably just one generational breakthrough away from it.
They will. As some point in the future, people will want everything, they'll prompt full movies because they're bored and want to watch something.
Local AI capabilities are growing at a rapid pace, but so is hosted AI. While you can do a surprising amount of useful work with a model occupying a few to a few hundred gigs of VRAM, the hosted models are going to be way ahead for a long time.
AI inference is different. You get the outcome by passing text through some weights at the time you need it. There's no ongoing work besides training and releasing new models. If I had something that rivalled Opus 4+ I could use locally, I would switch in a heartbeat.
Google/Microsoft and hosting your own email is a byproduct of how difficult (socially, not technically) hosting your own email has become - mostly because SMTP protocol is inherently broken by spam and patched by social construct (trusted nodes, abuse@, 3+ DNS entries and counting, etc). Purely technical solutions, such HashCash etc, got discontinued in exchange for social ones. Central providers made (sometimes in exchange for, sometimes as excuse of, spam protection) self-hosting socially hard.
Now, I wonder if, and how, once Anthropic and OpenAI need to demonstrate profitability, could hamstring local AI. Which has been /so far/ very valuable for me in doing things that hosted providers don't want liability for, and align against (even if totally lawful and fair use!).
- v4.5: 1x cost, 100% quality, 100% speed but maybe sometimes 80% speed because of load - v4.6: 3x cost, 105% quality, 80% speed most of the time depends - v4.7: 9x cost, 115% quality, 90% speed most of the time
Then people will either stick with v4.5 for everything it can do and, if knowledgeable, use v4.7+ for critical or specific tasks.
But if we add the option of:
LocalLLM: one time hardware + electricity cost, good enough quality for 90% of work, good enough speed for 90% of work, no vendor lock in/sudden cost spikes...
Then there is an edge to running it yourself unless you can burn investor cash to get to the next level.
I think the recent headlines on org token spend plus my own experience just today (June 1) with the new Copilot Pro limits is going to push those with the compute to run locally.
As of about 1pm today I did something to hit 47% of my entire June premium requests (copilot Pro, not converted).
As of 2pm I'm using Gemma 4 E4B on a 12gb GPU (with large context window) off my desktop to power VS Code with Copilot on my laptop. I'm going to build an AMD Strix Halo system next week when parts arrive so I can queue up a few models in parallel or work with something I need that much RAM for.
I'm not lifting the earth with my LLM setup. Gemma 4 E4B is solid for accelerating my current projects. and it's costing me pennies more per hour vs blowing half my Copilot Pro plan in a distracted morning.
I'm at a vendor conference this weekend that is showing off their Agent/Agentic workflows. Nobody can tell me how they balance the cost long term. Hopefully whoever the vendor is paying for their cloud LLM token usage doesn't spike cost in a year (or the vendor themselves) after companies convert and are trapped VMware style with these agent processes. You can bring your own (cloud) model subscription. I need to find out if we can point it back to our own local LLM endpoint and try local models for the same processes. Even if it takes 5x longer, it could be cheaper and more secure.
That said, Apple's vertical integration is a massive competitive advantage here, IMO. Nvidia's reliance on Microsoft & Windows for software support likely makes competing w/ Apple an uphill battle.
If/when Local AI gets good enough to compete with Cloud AI on most inference workloads, Apple starts to look like Nvidia's biggest competitor.
While this is admittedly a dream scenario, the biggest downside would be Apple effectively having a monopoly in "Agent-ready" consumer electronics. Hopefully local AI both becomes the norm, and there is sufficient competition among the consumer platforms.
Side-note: I would love to see an "RTX Spark" Framework 13 mainboard at some point.
Apple's vertical integration has led to a Siri overhaul that took half a decade to roll out, and it won't even run locally. They built an NPU coprocessor that's basically dark silicon for expensive inference, and then shipped MLX to stop Tensorflow and Pytorch from replacing Apple's role in the stack entirely. Mac owners are pleading for signed CUDA drivers for the PCIe or Thunderbolt in their $5,000+ Mac Pros. Apple's ecosystem is pure liability for AI, they're not moving any product for datacenter inference and can't even sell the hardware to themselves: https://9to5mac.com/2026/03/02/some-apple-ai-servers-are-rep...
Nvidia's profit margins are safe. Even if the RTX Spark is a completely failed product, Apple is not encroaching on the markets that Nvidia dominates.
But I like it. It's a copy of Apple's SoC design philosophy, same as AMD's Strix Halo, which I always thought was really cool both for laptops and home PCs. NVidia's traditional consumer cards pull way too much power and are too noisy to comfortably put them in a living or office environment.
The writing is on the wall, neither Anthropic nor OpenAI are anywhere near close to sustainability and if one or, worse, both fail the entire demand bubble for NVDA crashes.
It's smart to set up alternative destination markets while they can do so in peace.
> Over 100 Windows software providers such as Adobe, Blackmagic Design, Blender, CapCut, ComfyUI and OTOY, and game developers such as KRAFTON, NetEase, Remedy Entertainment, Riot Games and XBOX are embracing the new RTX Spark platform. [...] NVIDIA is partnering with Adobe to rearchitect Adobe Premiere and Photoshop for RTX Spark. [0]
> Gaming on Arm is finally coming of age thanks to the NVIDIA partnership. Native anti-cheat solutions from Epic and BattlEye are fully supported on the RTX Spark platform. Major developers are jumping on board, with Riot Games bringing League of Legends and Valorant natively to the architecture, alongside KRAFTON bringing PUBG Battlegrounds. [1]
Also, Nintento Switch is an Nvidia/Arm gaming device so many game publishers already have some experience with the combo.
[0] https://nvidianews.nvidia.com/news/nvidia-microsoft-windows-...
[1] https://www.windowslatest.com/2026/06/01/microsoft-builds-it...
The big news is more so on the games side, which is probably where Nvidia had some pull.
I'm curious what "rearchitect for RTS Spark" means in practice though. Sounds like its less convincing them to make an arm build for windows, but they are maybe taking advantage of some hardware specific features? If so, what does that mean for the Snapdragon X series I wonder?
Microsoft pulls in their weight as well, so this seems like it has a decent chance of getting industry support.
Only the ones which explicitly list something like the Riot Games mention are really related to the device/Nvidia. The thing which really pushes this along is user adoption/market share, not big names. This device will help that, especially in the gaming space, but it's easy to get over eager as it being from Nvidia means everyone else who has been waiting will just now jump on board too because of that.
So if anything, we need to push more game studios to use open source dependencies which will make porting easier.
What would push more games would be Valve actually making it worthwhile to natively target Linux.
1. in order to run LLMs, especially the best ones, you need complicated devices which are expensive
2. if you buy one for your personal use, you are probably not going to utilize it all the time and it will be idle a lot
It seems to me that it will always be more economical that the LLM-running devices are in a datacenter where it is easier to make sure they are always utilized
The price of a mini-PC with Intel Panther Lake is at least double in comparison with the price of a mini-PC with Arrow Lake H having similar specifications, and I am talking about barebones, before adding DRAM and SSDs, whose prices have risen even more.
The rise in prices is somewhat obfuscated by the confusing names of CPUs, i.e. some old and new CPUs may seem to be at similar prices and they have similar names, but the new CPU actually corresponds to a lower segment of the market, by having e.g. a smaller GPU and a lower clock frequency, while the CPU model that really corresponds to the old is named such that it seems to belong to the class corresponding to its present price.
As a concrete example of this obfuscation, which may confuse the buyers of laptops or mini-PCs, I have an ASUS 15 Pro with "Core Ultra 5 225H". If I would buy an ASUS 16 Pro now, the corresponding CPU model, the cheapest which is not worse than what I have, would be "Core Ultra X7 358H".
The whole replacing people angle is just the short term use case the more ghoulish executives are thinking about. In practice, lots of lots of new use cases have been made possible by LLMs. A lot of which can be done locally. But whatever capacity you have locally, they can have more of and for cheaper, and they manage the model instead of you doing it yourself. I think you put it nicely though, their moat will be thinned, and I doubt they'll be as profitable as their funding suggests, but at the same time the demand for them won't go away either. I don't know if OpenAI and Anthropic will be viable, but I'm nearly certain Deepseek is.
The tipping point will be power usage, if a local llm can run the same workload for less power that would be a game changer. Nvidia might get decimated, but even Google and others have moved on from GPUs already, they have faster and more power efficient TPUs. Add to that network bandwidth and availability issues, their moat remains. Also consider that even for graphics capabilities, user devices just don't have a consistent spec to make things like widespread 3d graphics and webgl usage viable. Someone's cheap android phone will never run a local llm reliably,same as it won't a 3d game. even if they have a high-end iphone, network providers aren't always performant as they are in western countries, and then there are people that won't want to install your app or local software, and then browser based exposure of the capability to sites which will have similar hardware spec issues, OS instabilities, competing tabs,etc...
- bulk discounts - cheaper electricity - high utilisation to spread the costs among many users
I don't see how PCs could ever compete against it. Most users AI demands would probably result in >90% idle time on the GPU.
But I really do question how well Windows on Arm is really going to work out long term.
For Apple it worked because they were able to force the issue. If you wanted a new Mac it was going to be Arm and we all knew eventually (this year or is it next year?) Intel support would drop. Over time we have seen M series exclusive features.
Developers were forced to update or abandon Mac which gave users a great experience (with some early growing pains).
This is something that Windows will never be able too do. They will always be stuck maintaining an emulator and a likely large subset of apps only supporting one over the other. (also does this work the other way around with an Arm only app working on x86?)
This seems like a repeat of when it was not uncommon for games to only support Intel or AMD or NVIDIA or AMD. But worse since they are not both x86. Sure at least we have emulation but just like with Rosetta2 it shouldn't ever be the long term solution.
Qualcomm is also working on a really good ARM ISA CPU with their acquisition of NuVia and subsequent Oryon architecture.
Meanwhile this is just using off-the-shelf ARM CPUs in a MediaTek SoC with blackwell bolted to the side of it. ARM's CPUs so far have been subpar for laptop-class chips. Hence why neither Apple nor Qualcomm are using them.
MediaTek is involved in the SoC but both the CPU & GPU from Nvidia are bolted on to it. I.e. it's not a standard MediaTek CPU with an Nvidia GPU added.
tbh, I always read this as Intel doing some sales magic here.
Apple: "Hey, we're making a product that has a 15w thermal envelope, do you have anything?"
Intel: "Yes!"
(Unspoken: their products will throttle down to fit, in fact, they will try to run always at 99ºC so you always get the best performance! FEATURE!)
Apple: "uhhhh..."
Consumers: "HEH IS IT EVEN A PRO DEVICE IF IT DOESN"T HAVE <INTEL MARKETING BRAND TERM>?"
Apple: "UHHHH... Guess we'll do it ourselves"
But the bigger problem in my opinion: How much of the Windows userbase actually sticks to Windows because of its backwards-compatibility?
--> What would happen if they break this model and the OS is only judged based on its user experience and available applications...?
I'm not sure it would stand any chance to compete in the B2C space. If I think about it, there's not a single new feature in Windows of the last ~20 years I particularly care about.
Without backwards compatibility, there's barely any ecosystem. MacOS on the other hand is full of ecosystem features, improving collaboration, connectivity, handoff across devices, etc.
True, but if you're only in the ecosystem as a mac user, in many ways it's felt like a mixed bag. I still wildly prefer mac over other operating systems, but if upgrades had a price, I think those sales would mostly go to iPhone users. Even at free, I'm yet to find a compelling reason to install Tahoe, and will probably just continue waiting until the next one.
I can easily run Qwen3.6 35B-A3B with Q5_K_M with a 260k+ context window with some vram to spare. It easily runs probably 80tps. It took me quite a while to find the
Compared to GHCP Claude Sonnet 4.5 or 4.6, I have full parity. The wall clock time is faster for agentic workflows, and rule following is about on par.
With either, doing something kind of novel or obscure takes more hand holding compared to just generate a GUI or crud app. For example, trying to build an actual program that performs a complicated process correctly requires quite a bit of hand holding to get it to properly help.
Sure, it isn't Opus or something, but I think with the right harness, it probably can get close. I think most of the issues these days is the harnesses are lacking.
It was suspected to come soon enough, but it was a nice cheap road for my small hobby stuff. When they announced the price changes, I started to explore alternatives, and with the news of Qwen3.6 35B being both and having quality, I figured it was worth a try out, and self-hosting made the most sense to me, since that meant I was free from being a forever-renter.
I’ve got both (single R9700, dual B70) and they do nicely for about anything I throw at them, such that the latter has a visible improvement when the model is well-cached.
I'll probably try to figure that problem out in about a month. Worst case is I move it to another even older desktop to replace the 9800 GTX+ inside of that one.
Perhaps the next generation of the spark will improve on the bandwidth and RAM size numbers. Yes it's a lot like a Strix Halo, but this has CUDA, which will be of interest to developers who want that.
I was looking for AMD AI Max+ 395 laptops recently, and the only ones I've found were 13 inch models, which seems odd from a heat dumping standpoint. I'm looking for 16 inches, I guess the 13 inch form factor would make it easy for commutes where you're taking it to dock to a large monitor at work or home, but no 14 inch screens?
https://www.servethehome.com/nvida-introduces-rtx-spark-an-a...
M5 Max beats it, but for the price of an M5 Max, you are better off just getting a desktop with 2 3090s, which will be cheaper even at current prices.
I've heard there's still a large backlog of both software problems, and hardware problems with the platform. The software problems could be fixed with time, but they'll still give a shitty first impression. I'd have thought Nvidia would just bury this and try again with a successor run of silicon with a new design.
This thing seems practically destined to just be a repeat of the Snapdragon laptop debacle.
that's what nvidia is hoping for
What would be interesting to me would be how quickly developers start targeting ARM64 directly.
https://docs.nvidia.com/dgx/dgx-spark-porting-guide/porting/...
One reason it works surprisingly well on modern systems is how much is offloaded to the GPU. You aren't going to get great power optimization or anything without it being truly native though.
There are games which are CPU limited though, and it will be interesting how those do. Curiously those also tend to be in engines with Arm support already.
When you lay out the software stack it is essentially OS > Game code > APIs. Both the OS and APIs are native code, it is only that middle point that needs the real work.
This is why x86 to ARM doesn't have such a heavy performance cost. So games can be CPU heavy but if it is heavy at the API end, that isnt a huge issue.
Very cool.
https://www.techpowerup.com/gpu-specs/gb10.c4342 https://www.nvidia.com/en-us/products/rtx-spark/
Basically the same tradeoff as macmini with unified memory.
I think its gonna be another failure as we are used to see with the PC market these days.
Why do I have the feeling it's been intentionally made to be bad in order to get you on to their most pensive datacenter gear.
At this point, your cost-efficient options include used 3090s, "frankenrigs" using recycled data center cards, and a handful of "workstation" class cards, where the originally high margins and the long enterprise purchasing cycles have kept prices from going up too fast.
In contrast, a lot of these "personal" AI systems are basically a GPU-like core wired to larger amounts of slow RAM. Which is still semi-affordable. Generally speaking, they make for OK chatbots but extremely slow coding agents. Whereas you can run a modestly useful coding agent at reasonable speed on a 3090.
So yeah, a lot of these systems are bit scammy. But not because it's a secret conspiracy to protect data center cards. Rather, there simply isn't enough fast RAM in the entire world. So they'll flog you disappointly slow RAM instead.
TL;dr: Might be useful for some use cases, but benchmark very carefully.
I think they make a great "second device" where you have something meatier to fall back to if something doesn't quite work right. I'm not sure if it's ready to take on the "main device" role just yet. But it's a far far better experience than the Surface RT days.
- 5090/6000 Pro: 1792GB/s
- 5080:: 960GB/s
- 5070Ti: 892GB/s
- M3 Ultra: 819GB/s
- DGX Spark: 273GB/s (less than an M5 Pro at 307GB/s)
Memory bandwidth isn't everything but it will cap inference rate pretty heavily. Also, the M3 Ultra is for an almost 2 year old Mac Studio. It's widely expected that it'll be refreshed in Q3 with a likely M5 or M4 Ultra with >1000GB/s. I really hope Apple realizes what a market opportunity Apple has here.
The above shows just how good value the 5090 really is. It basically a RTX 6000 Pro with less RAM (and ~12% fewer CUDA units), which is a ~$10k card, for 20-30% of the price. This also demonstrates how NVidia uses VRAM for market segmentation. As an aside, the true data center cards (eg B100, H100) use HBM memory at ~3.2TB/s.
[1]: https://wccftech.com/nvidia-enters-pc-space-with-rtx-spark/
This is much better value than 5090, you can run much bigger models.
> tl;dr - For software development, Qwen3.6 27B, 5090 gives you ~3x speed over M5 Max, letting you plow through code, while M5 Max gives you ~4x memory, letting you use higher quantization and bigger context. Which would you choose and why?
I've read a number of things from which the consensus seems to be that yes you can run a larger model and/or have more context with a 128GB+ Mac but the performance gap is still massive and with current hardware we're still talking about inference rates that matter. By this I mean there's a big difference between 10tok/s vs 30. Once we get to t apoint where it's 100 vs 300, it won't be as big of a deal, a bit like FPS in games.
Oh and there are similar concerns with the DGX Spark [2].
[1]: https://www.reddit.com/r/LocalLLaMA/comments/1t5v2gr/need_ad...
[2]: https://www.reddit.com/r/LocalLLaMA/comments/1sqk333/dgx_spa...
The biggest thing where this will crush Apple is the initial prefill phase. 6000+ cores vs 32/40, + active cooling with fans. For local llm models, this matters quite a bit more than tokens/second.
In the end, neither are really worth it for llm use compared to just building a desktop and just port forwarding over ssh to ollama.
Of course, DGX Spark is a miniPC, so laptops will likely be slower due to power limits/throttling.
+ Windows
+ Screen
- ConnectX-7 Smart NIC
Can the link type be toggled between Ethernet and Infiniband? (Don't think I've ever heard of a laptop with IB.)
> "Our goal is to deliver unmetered intelligence to every home and every desk with Windows," said Satya Nadella, chairman and head of Microsoft.
Then:
> However, Ian Fogg, Research Director at industry analyst firm FDM CCS Insight said the change was "likely to come with a significant price tag" and Nvidia would be targeting "those looking for workstation-class performance".
So... not every desk with Windows.
It just feels too much like what they said about Apple II and early Windows. A play at nostalgia instead putting real thought into it.
My question is, what happens to the people who use RTX cards for gaming? This new solution isn't meant for that. Do they need an "AI accelerator" and a gaming-centric GPU?
Even in the analytics side most of the stuff is some shonky ass numpy or excel gank.
I don’t know what the market is. I just can’t see it.
bechmarks with DGX arnt spectacular for NVIDIAs software and CUDA lead.
wouldnt count on this being a price/compute challenger. especially with overpriced VRAM.
All those CUDA cores in the sparks but they're starved for memory bandwidth.
I am still waiting for NVidia to release a system that legit beats 3090 maxxing for the home gamer...
And is it really a way to lock in people? With AI coding tools, isn’t it trivial to write software on top of CUDA and rewrite it to target some other hardware?
Also I heard the tensor core instructions on the dgx are gimped and you’re better off with a rtx pro x000. Is that the same with these machines?
Geekbench Single Thread Score:
- DGX Spark (same CPU as RTX Spark): 3125
- Snapdragon X1 Elite: 2950
- Snapdragon X2 Elite Extreme: 4050
- AMD Ryzen 9 9955HX: 3225
- Intel Core Ultra 9 290HX Plus: 3175
- Apple M5 Max: 4350
I'm happy to be wrong about Qualcomm's latest X2 chip performance, even if it is shipping in only a single product so far. Their previous best was the lowest in this list.
Around 2-3K USD something with a good GPU + CPU + 128GB of integrated RAM is just going to be an awesome experience.
Considering Mac options are north of 5K+ even on a regular day.
https://www.bhphotovideo.com/c/product/1957120-REG/apple_mbp...
I'm not sure if I like this. Sure for a laptop this might be not a big problem but if this ARM ecosystem is a success it will spread to desktop computers and I fear we could lose the existing modularity.
"Introducing the NVIDIA RTX Spark™ Superchip. The fusion of NVIDIA AI and RTX graphics in a single chip redefines Windows PCs and delivers amazing creating, AI development, and gaming—on the slimmest, most beautiful RTX laptops ever and small, ultra-efficient desktops."
With MLX, Apple is building an answer to CUDA, and if people start switching from ChatGPT & Claude to some app that runs on their M5, suddenly Apple starts to look like Nvidia's biggest competitor.
If Nvidia doesn't have a pathway towards getting hardware into the hands of consumers, it could be a really difficult road ahead for them.
I'm here for it. Local models can do a lot of what I need at almost no cost, plus the fun of making them work better or building a new system to handle that aspect of my home lab. A Strix Halo system may not be amazingly fast but at 128gb of RAM it can keep up with most open models worth exploring.
Based on June 1 Copilot Pro plan premium token burn and cost, unless you REALLY know how to use cloud AI efficiently and are tooled up to do so a local LLM on hardware you may already own is very appetizing.
I converted a lot of work today to a 6.5gb local LLM on a 12gb GPU and no, it's not as good. But it is 'free' or at least feels that way, especially when I need to redo something and my copilot premium request % doesn't change.
I think more announcements will follow soon from other companies.
Nvidia really threw stuff over the wall with the DGX Spark release. They don't seem to really care. I sort of think they'll spend a little more time on Windows, where there's no pesky upstreaming to do and they can just do whatever, but man, it's such typical hubris from Nvidia to build such an expensive box with good chips but make it basically unsupportable and roasty hot all the time.
You also generally have to run an ever more stale two year old Ubuntu derived DGX OS to get anywhere, with bespoke kernel and drivers all. None of it is well supported, none of it just works like a comparable PC or even well behaved arm system would.
As for other ARM, there were rumors AMD Sound Wave is/was going to be a ~10W arm APU, but there hasn't been much said about it lately. Honestly given the ram crunch, it's maybe just not worth trying to build a system with a cheap core, if the rest of your costs are going to stay so stratospheric. https://www.techpowerup.com/341848/amd-sound-wave-arm-powere...
Looking at devices like the NVIDIA Shield gives me some hope that NVIDIA will be better than Qualcomm here. I just hope this is not a case where the OEM has to purchase X years of driver support from the chip vendor beforehand, and that NVIDIA will provide support directly itself.
It's just worse Strix Halo, as you are landing square in middle of Windows ARM problems
I 'd say that is an improvement if you want to run local llm inference. Still well below with what you can achieve with Apple chips though.
NVIDIA and Microsoft Reinvent Windows PCs for the Age of Personal AI
https://news.ycombinator.com/item?id=48352705
NVIDIA DGX Station for Windows Puts a Trillion-Parameter AI Supercomputer on Every Enterprise Desk
https://news.ycombinator.com/item?id=48352691
Introducing Surface Laptop Ultra: Made for world makers
https://news.ycombinator.com/item?id=48352627
Introducing a powerful new chapter for Windows PCs, accelerated by NVIDIA RTX Spark
In university a friend of mine had a large hardcover book she kept in her dorm freezer. I asked here WTF she had a big book in there. She said it was for minecraft - she'd place her laptop on top of it while playing. The book was cold but also quite dry. I wonder how well it worked.
I was lucky that iteration 1 (sans towel) didn't ruin the laptop...
All I care about is if I can get one of these for significantly less than a dgx and get Linux on it for some cuda Blackwell kerneling.
What does AMD or Intel have here?
I think the future will be 50/50 x64 vs arm64 for PCs.
NVIDIA nailed it
I don't think so.
This most likely be a winmodem situation, again
A powerful new chapter for Windows PCs, accelerated by Nvidia RTX Spark
https://news.ycombinator.com/item?id=48352693
Surface Laptop Ultra: Made for World Makers
Eventually a lot of inference will get right-sized into something you affordably run yourself.
knowing nvidia and their approach to consumer hardware (especially those on notebooks), i'd take a heavy grain of salt on getting a good deal.
also, they've been bad to their ecosystem partners, but now they've been able to boss them around into supporting them so far. i suppose getting anticheat vendors to finally support arm might be enough of a nudge for linux support? one can only dream
Maybe the Nth time's the charm and Microsoft+Nvidia will manage to make Windows on ARM a viable platform.
Saying that I think this is product is kinda dead on arrival.
Geekbench cpu bench leaks indicate they aren’t as good as m3 at single core even.
Will they support booting into a Linux installer?
Guess I need to postpone my gamer PC renewal to end 2030.
More seriously, obviously a ton of work in an incredibly competitive space, and an incredible machine (without getting into competitive comparisons/minutiae). Was watching a techtechpotato[0] quick post pre-launch about "why is this even being tried?", which was also interesting. What an age we live in.
However, I'd jump from Mac in a Heartbeat if this supported Linux.
https://www.gartner.com/en/newsroom/press-releases/2026-4-10...