E.g. https://www.anthropic.com/research/tracing-thoughts-language...
I'm just an end user who tried to use these "frontier models" to actually solve real olympiad problems. They're useless.