Why is this news, which is mostly technical and incremental, causing such panic?
https://arxiv.org/abs/2501.12948
Keep an eye on the effort to reproduce here: https://github.com/huggingface/open-r1
In time, we will see whether the (over?)reaction matches reality. The media sure loves to whipsaw us all around.
They successfully distilled the reasoning capabilities of larger models into much smaller ones. For example, their 14B model outperforms larger 32B models.