There is HDFS, which has aged better, and the old MapReduce 'query' processing system which has aged worse. (Replaced by Spark and about 10 other things.)
There is a large supply of firms that would like to dethrone HDFS, because they think customers think that paying to 3x replicate the data is too much. (The winner is Amazon S3 where you pay even more!)
Maybe the scene has changed, ceph has made some inroads, but HDFS has the amazing property of being almost as fast running in degraded mode as it is normally, thus being fast enough that it can regrade faster than it degrades.
A big cluster is going to be partially degraded a lot so it matters.