I suspect this applies especially to language- or logic-related tasks.
We will likely never find out: the variance seems to be extremely high, while the actual differences are very small.