They're not faster in all cases. Keeping data close together in memory works great for some edge cases, such as looping over an array of data stored in one block.
That is very much a special case, though. In many non-benchmark situations it isn't so simple: once you add the overhead of memory copying and the lack of identity, things get more complex.
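
To make the copying and identity point concrete, here is a minimal Python sketch. It is only an illustration under my own assumptions: a packed `array` of doubles stands in for the contiguous block, a small hypothetical `Point` class stands in for objects with identity, and the helper names (`sum_x_packed`, `read_point`, `write_point`) are made up for the example. It shows semantics only and says nothing about speed.

```python
from array import array


class Point:
    """Reference-style point (hypothetical): each instance has its own identity."""

    def __init__(self, x, y):
        self.x = x
        self.y = y


def sum_x_packed(coords):
    """Sum the x fields of points stored as [x0, y0, x1, y1, ...]."""
    return sum(coords[0::2])


def read_point(coords, i):
    """Copy point i out of the packed buffer as an (x, y) tuple."""
    return (coords[2 * i], coords[2 * i + 1])


def write_point(coords, i, point):
    """Copy an (x, y) tuple back into slot i of the packed buffer."""
    coords[2 * i], coords[2 * i + 1] = point


# Packed layout: one contiguous buffer of doubles, no per-point object.
packed = array("d", [1.0, 2.0, 3.0, 4.0])

# The favorable case: a straight loop over the block.
total = sum_x_packed(packed)

# Reading an element yields a copy; changing the copy leaves the buffer alone.
p = read_point(packed, 0)
p = (p[0] + 10.0, p[1])
unchanged = packed[0]

# The update only lands once it is explicitly copied back.
write_point(packed, 0, p)

# Reference layout: two names, one object, so a change through either is shared.
shared = Point(1.0, 2.0)
alias = shared
alias.x += 10.0
```

The loop over `packed` is the easy case. Everything after it is the part benchmarks tend to skip: every read and write of a point is a copy, and nothing plays the role of `alias is shared` for the packed data.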