> Tweets alone generate petabytes of data a year
Nope. It's not Tweets that generate that data. It's the insane amount of (mostly unnecessary) noise that gets thrown into the mix: analytics, logs, metrics, you name it.
Every time you scroll, Twitter sends multiple events to the server. That alone will generate a large chunk of those petabytes.
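Back-of-envelope, to show how quickly scroll events could add up. Every input here is a made-up round number for illustration, not a real Twitter figure, and `yearly_event_bytes` is just a hypothetical helper:

```python
def yearly_event_bytes(daily_users, events_per_user_per_day, bytes_per_event):
    """Raw telemetry bytes per year, before compression or replication."""
    return daily_users * events_per_user_per_day * bytes_per_event * 365


# Assumed inputs (hypothetical, chosen only to make the arithmetic easy):
total = yearly_event_bytes(
    daily_users=200_000_000,       # assumed daily active users
    events_per_user_per_day=500,   # assumed scroll/impression events per user
    bytes_per_event=1_000,         # assumed size of one event payload
)

petabytes = total / 1e15
print(f"{petabytes:.1f} PB/year")  # 36.5 PB/year
```

Swap in whatever numbers you think are realistic. The point is only that events times users times days gets into petabytes without counting a single tweet.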