Actually in this case they possibly are estimates.
It's been known for some years[1] that LLMs do regression in-context. Frontier models have been trained against many, many issue text that include task break downs and estimates.
Interesting. So it may have learned how to estimate as a human but doesn’t understand that it doesn’t operate at that speed :D
I wonder if there’s a reasonable way to give an llm parameters that give it a concept of its own execution speed. Seems that could be useful for multiple purposes