Computational neuroscience. But since your username suggests earth science, I'd mention that one of the core algorithms that gets quite a bit faster is the spherical harmonic transform.
Not the parent commenter, but I know a few scientific tools in the genomics field that had exactly this demand. The bottleneck was always memory bandwidth. 64-bit floating point and higher memory bandwidth would make a huge difference for such tools.