There are programming techniques where you offload all the computation (via OpenGL!) onto the GPU. I did this for image processing where we needed to do histogram normalization. It was not fun.
But the GFLOP numbers given are maybe an order of magnitude or two off from the performance you actually achieve for something like LAPACK.