All in all it's the easiest relational database I've ever worked with in terms of stability, speed, and scalability. I know this sounds like an ad for Aurora, but I just really like it.
* 1000+ node clusters
* Petabyte scale data
* 10s of millions of reads and writes per second
Given my preface, the "how" is scale out on top of Cassandra, of course. Not SQL, and hard to do if you have a highly relational model, but many stories of success at those kinds of scale.
As a Cassandra devops / data modeler myself, I would be fascinated to read more details about your scaling challenges. Do you guys have an engineering blog I could read?
- hot vs cold data ratio of the total size - read vs write data ratio - if read/writes are split - how partitioning, if used, is done - total machine(s) resources disk/ram - average (read)query response time - how machine/node failure is handled
This didn't seem to be a problem. It was the simultaneous write operations that created real limits, banging on the disks/disk controller like that.
MongoDB handled about 13K ops/sec at peak times with around 5-8K of these being writes.
MySQL was probably around 2-3K ops/sec.