https://news.ycombinator.com/item?id=38425415
You should probably also point out the big red text on the subredditstats pages. I didn't see it when I posted the links, since I'm colorblind and hues of red are entirely invisible to me. Also I have trouble counting past the number of fingers on my hands, so I didn't notice that the numbers were a bit off. If I had noticed that, I still would've needed one of the very clever people here to explain the significance.
> If I had noticed that, I still would've needed one of the very clever people here to explain the significance.
I don't know if that's sarcastic or serious or if that's some dig at me? What an odd comment.
I also suspect that smaller subs are a more useful measure than these huge subs, because I'd expect them to die off much quicker than the huge ones with a lot of inertia.
That said, I understand it can be annoying having 16 people tell you you're wrong all in slightly different ways. It's the price of posting on the internet I'm afraid. But it was (and still is!) the top post on this thread, even though it's not just factually wrong, but spectacularly factually wrong – which is fine, everyone is wrong sometimes – but people do have the tendency to point that out. As long as it's not a pedantic point I don't think that's wrong even if there's a comment already, if you think you can make that point better.
The data is useless as an absolute measure of activity, sure, as described in https://news.ycombinator.com/item?id=38425501. The message at the top of the main subredditstats page says, "...the data collector is not robust, and so the numbers should only be used as a general guide." You can read that. Let's assume I can read that, too.
But it does track as a representative sample of trends. Picking something less noisy than a niche sub, we can ask whether there have been recent newsworthy events that might show up as spikes in this data. And, there is: look again at the posts/day and comments/day graphs on https://subredditstats.com/r/worldnews, and you see clear spikes in activity right around October 7 -- well after Reddit's API changes would have affected subredditstats.
If the data collector has only been able to pick up, say, approximately 20% of the site's activity for each subreddit, then trends are still trends as long as the data collection hasn't changed in a radically new way. And, sure, that could be the case after Reddit's API changes, but as I pointed out in https://news.ycombinator.com/item?id=38425150 (and as supported in another user's reply using an entirely different source of data), the API changes don't line up precisely with the change in user activity, and Reddit is clearly, observably, less active now in all of its large subs.
Now, for my part, I assumed this would all be pretty obvious stuff. I'm not doing a terribly deep analysis here; I'd expect anyone else to see the same things at a glance. But a few people seem to think that because the numbers aren't a perfect match, the entire point just collapses and clearly Reddit is now busier than ever, and those people are completely missing the point. Using a 12,000-subscriber sub with the noisiest possible data to try to disprove sitewide trends is even more wrong, and then smugly saying, "everyone is wrong sometimes", is not just condescending, but frankly embarrassing.
The diametrically opposing argument here is that Reddit is perfectly healthy and the API changes and blackout protest had no significant impact on the site, and that checks notes 12,000 subscribers in r/thethickofit are sufficient evidence for this. And, like, okay, if that's your argument, cool, carry on, just come out and say so.