What is exactly is being measured in, for example, the R script? The same code running in the same modern version of R on Ubuntu 12 and 18? Or is the R version dating to Ubuntu 12? Or is it contemporaneous for both releases?
I really wish these websites would repeat these 100x or something so we could get an idea of the variability in the measurements.