This is the biggest threat to the GPU economy – software breakthroughs that enable inference on commodity CPU hardware or specialized ASIC boards that hyperscalers can fabricate themselves. Google has a stockpile of TPUs that seem fairly effective, although it’s hard to tell for certain because they don’t make it easy to rent them.
I don't think we will need to wait for anything as unpredictable as a breakthrough. Optimizing inference for the most clearly defined tasks, which are also the tasks where value is most readily quantified, such as coding, is already underway.