Bugmenot is great for sites like this.
So, instead of discussing the topics in the video, the majority of commenters here are discussing the flaws of the website it's hosted on or debating whether or not reddit is profitable. Neither of which has ANYTHING to do with scaling.
I expected better, people. Seriously.
In this and some other technical topics, people end up discussing their personal tastes with web site's design, their individual UI frustrations with some button on the web site, the font, the color, and other random non-relevant topic; like now the profitability.
Except for that reddit is not profitable.
http://blog.reddit.com/2013/08/reddit-myth-busters_6.html
http://www.reddit.com/r/TheoryOfReddit/comments/1ihwy8/rathe...
When that blog post went up I was as surprised as you to see it wasn't profitable when I was there.
I can definitely see millions of networked memcache calls being a bottleneck, and if the batching adds another ms per req on average, but removes the bottleneck, then they can serve a lot more users at a cost of 1ms per req.
Is there anything in TFA that would support my theory? I don't know. I don't care enough to endure InfoQ. (I did for a Rich Hickey talk once, lo these various months, and yea it were a minor inconvenience).
Edit: whoa jedbergo!
1. Use AWS
2. Use Postgres
3. Use AWS
4. Use Cassandra
5. Use python, so later you can write C when shit needs to go super fast
That's what I got.
Those are some of the important lessons, although use (postgres|cassandra) are really too prescriptive. More like "use the right tool or tools for the job".
Also, use consistent key hashing where appropriate is another important less that I should emphasize more.
And "build for 3" is another important lesson. It makes scaling much easier.
I'll probably be back to check out more of the videos, but definitely not because of the site. If the editing is good, YouTube is just fine, otherwise SlideShare plus an audio file is just perfect.
Edit:
[F] First Byte Time
[C] Keep-alive Enabled
[C] Compress Transfer
[A] Compress Images
[A] Progressive JPEGs
[F] Cache static content
[ ] Effective use of CDN
Source: http://www.webpagetest.org/result/130816_PE_AYH/A full transcript with interleaved slide images takes a few minutes to read at most, and lets you control the pace of information absorption. A video with slides, especially when you cannot 2x the talking speed, is a painfully slow data transfer method.
Video + Slides = analog modem
Transcript w/ embedded slides = Google fiber :-)
Of course, if you're into audio books instead of reading, maybe you consider that a feature.
// I push video for a living. It's great for visual explanation like DIY instruction (e.g. woodworking, swapping RAM on a Mac Mini), emotional content, personal story telling, etc. Systems architecture is generally not in one of those categories.
I don't have a solid block of 40 undisturbed minutes to listen to a talk. Give me a transcript and I can read a paragraph here and there as I do other things at my own pace. I might have ten minutes here, ten minutes there. I don't want to be constantly pausing/unpausing the video, or worse - switching between the video and my music.
Plus, if I concentrate, I could read a 40 minute talk in 20 minutes or less.
Basically, when I'm reading, I control the pace. I rarely watch videos that are longer than about 5 minutes (that aren't entertainment, which is entirely different).
Memcache has no guarantees about durability, but is very fast, so the vote data is stored there to make rendering of pages as quick as possible.
Cassandra is durable and fast, and gives fast negative lookups because of its bloom filter, so it was good for storing a durable copy of the votes for when the data isn't in memcache.
Postgres is rock solid and relational, so it was a good place to store votes as a backup for Cassandra (we could regenerate all the data in Cassandra from Postgres if necessary) and also for doing batch processing, which sometimes needed the relational capabilities.
A lot of those people aren't interested in that content, so it will suddenly get an influx of downvotes.
I'm glad there is an explanation based on user behavior for this phenomenon because admin level vote tampering is such a tired theory.
It's important to keep in mind that reddit was already doing twice as much traffic as Digg before they launched v4.
I, for one, went from a /r/php lurker to real user at that time so I thought there was dozens of us, dozens !
Funny story about that. When we were owned by Conde, the accounting was a little different (they took on some of the charges, like Akamai), and so we were actually told we were slightly profitable.
When that blog post went up I was as surprised as you to see it wasn't profitable when I was there.
Firstly, as with Wikipedia, if Reddit were forced to close because of money issues, Reddit could simply post a 'donate now or reddit shuts down' post and they would likely be rolling in millions of dollars.
Second, simply because reddit itself is not profitable does not mean people are not making a lot of money off reddit. The moderator system lends itself very well to a kind of 'corporate capture' of communities where moderators can be (and are) bought off for very tidy sums.
From what I remember, this is kind of why they started Reddit Gold.
Reddit succeeded largely because the company that bought them is making money elsewhere.
I don't think grandparent should be downvoted, he raises a good point. Tons and tons of people use Reddit, but Reddit has a hard time making a living.
Checkup alladvantage - they paid people to surf. Had millions of users, but ultimately failed because their "business model" was idiotic.
Getting millions of users is pretty easy if you pay them to be a user. Someday though, it's only worthwhile if you can build a sustainable business which at least doesn't lose money hand over fist.
If Reddit hadn't got bought and supported by other profitable businesses, I doubt it would have survived.
Yep, the site is still in the red. We are trying to finish the year at break-even (or slightly above, to have a margin of error) though. [1]
1: http://www.reddit.com/r/TheoryOfReddit/comments/1ihwy8/rathe...
Someday, the money will run out, and they'll have to try and turn a profit.
Their business model is fundamentally bad.
Source?
[1] http://www.reddit.com/r/CrazyIdeas/comments/1i7crj/antigold_...
[2] http://blog.reddit.com/2013/08/reddit-myth-busters_6.html