Why is this news—which is mostly technical and incremental—causing such panic?
https://arxiv.org/abs/2501.12948
Keep an eye on the effort to reproduce here: https://github.com/huggingface/open-r1
We will see in time whether the (over?)reaction matches reality. The media sure loves to whipsaw us all around.
Another thing to consider is that society and the market are not currently rational: lots of wild swings, overreactions, and messaging and beliefs at the extremes.
They successfully distilled the reasoning capabilities of larger models into much smaller ones. For example, their distilled 14B model outperforms some 32B models.
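For anyone wondering what "distilled" means in practice, here is a rough sketch. It assumes distillation here means fine-tuning the small model on text generated by the larger one (rather than matching logits), with bad generations filtered out first. Everything in it is hypothetical: `toy_teacher` is a stand-in for a large reasoning model, the `<think>` tag format and the `build_distillation_set` helper are illustrative, and none of it is the authors' actual pipeline.

```python
from dataclasses import dataclass


@dataclass
class Example:
    prompt: str
    completion: str  # teacher's reasoning trace plus its final answer


def toy_teacher(prompt: str) -> str:
    """Hypothetical stand-in for a large reasoning model; solves 'a+b' prompts.

    It deliberately gets the answer wrong when a == b, so the filter below
    has something to reject.
    """
    a, b = (int(x) for x in prompt.split("+"))
    total = a + b + 1 if a == b else a + b
    return f"<think>{a} plus {b} is {total}</think> {total}"


def final_answer(completion: str) -> str:
    # Treat everything after the closing think tag as the final answer.
    return completion.split("</think>")[-1].strip()


def build_distillation_set(prompts, references, teacher):
    """Keep only teacher outputs whose final answer matches the reference."""
    data = []
    for prompt, ref in zip(prompts, references):
        completion = teacher(prompt)
        if final_answer(completion) == ref:
            data.append(Example(prompt, completion))
    return data


prompts = ["2+3", "10+7", "4+4"]
references = ["5", "17", "8"]
dataset = build_distillation_set(prompts, references, toy_teacher)
```

The surviving prompt/completion pairs would then be used as ordinary supervised fine-tuning data for the smaller model, so the student learns to imitate the teacher's reasoning traces and not just its answers.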