They did 40 runs of compilations (10 each, twice). At the very least, a scatter or bar graph would be helpful, as it would let you see the data distribution (those numbers are good as uninterpretable on their own).
Saying that, 40 isn't very many times to draw statistical conclusions so you'd probably need more data.