undefined | Better HN

0 pointsrob_c1y ago0 comments

Given I know people running gemma3 on local devices for over almost a month now this is either a very slow news day or evidence of finger missing the pulse... https://blog.google/technology/developers/gemma-3/

0 comments

3 comments · 1 top-level

simonw1y ago· 2 in thread

This is new. These are new QAT (Quantization-Aware Training) models released by the Gemma team.

rob_cOP1y ago

There's nothing more than an iteration on the topic, gemma3 was smashing local results a month ago and made no waves as it dropped...

simonw1y ago

Quoting the linked story:

> Last month, we launched Gemma 3, our latest generation of open models. Delivering state-of-the-art performance, Gemma 3 quickly established itself as a leading model capable of running on a single high-end GPU like the NVIDIA H100 using its native BFloat16 (BF16) precision.

> To make Gemma 3 even more accessible, we are announcing new versions optimized with Quantization-Aware Training (QAT) that dramatically reduces memory requirements while maintaining high quality.

The thing that's new, and that is clearly resonating with people, is the "To make Gemma 3 even more accessible..." bit.

1 more reply

j / k navigate · click thread line to collapse