Big difference is Google does the entire stack and also has scale to conceptually be able to create a better solution. But that is theoretical.
Here we can see the results where the Google TPU gets almost twice result per dollar over Nvidia. But then Google should be able to iterate more quickly.
Take the move from using CNN to using capsule networks. The idea for Capsules came from Hinton and Google is going to be there first to optimize in hardware. This is the benefit of playing in all layers of the stack.
Or the using NN for text to speech and offering at scale. Google just has inherent advantages over Nvidia and now we get to see a little more concrete results. But hope we get a lot more similar and see if the Google advantage holds up.