story
A debut in the sense that it's good enough to be getting mainstream attention.
Unfortunately, LLMs are shifting compute from train time to test time. I don't really like this, and frankly it shows that architectures, datasets, etc. are stalling.