An Update from Robinhood’s Founders (opens in new tab)

(blog.robinhood.com)

212 pointsBeowolve6y ago277 comments

277 comments

On the profession side of this, if you're an engineer at RH in the thick of this - many have been there. It seems dire now, but in a few years the fog, panic, and haze of no sleep will become a story you tell your peers at happy hour.

Many will cast stones - but they have been there too. If they haven't, well maybe their day will also come. You may feel bad at the moment - but the best way professionally forward is "We try our best tomorrow"

cheschire6y ago

If this were an outage directly caused by a natural disaster, I could understand. This outage was an availability problem. This clearly points to some prioritization problems within the leadership layers if robust and resilient infrastructure was not emphasized.

The prioritization problems may not be due to ignorance or malice though, and may be justifiable if there are other fires that are burning brighter. It's still pointing to problems though, and I think it's completely legitimate for engineers to question the stability of the company when this sort of thing happens.

At the very least as an engineer I would be asking some pointed questions of my leadership. Maybe not dusting off the resume yet, but still I'd want to get reassurance from internally that the leadership problems that caused this are being addressed.

malux856y ago

Sometimes you just have to cut them some slack. Have you engineered a highly available cluster before? I'm not talking about the hot-standby postgres master that gets called on once every 2 years, but I'm talking about a 180 node Cassandra cluster thats doing 15,000 writes a second 24/7 and peaking at 60,000 writes a second every day, and you have to do node replacements every week or two because of the high load.

Or I'm talking about a 200 node hadoop cluster thats doing the electrical metering and billing for 8 million people, and is NOT allowed to stop.

Or the trading platform thats running sub millisecond trades and downtime means 300,000 $ USD per minute.

These are systems I have engineered over the last 10 years, and I can say: These things are complex and have failures in 1000 different ways, and while you're monitoring 999 of them that one thing you're not looking at is festering under the surface (your monitoring system is tracking IRQ hardware interrupt response times, right???)

Part of being in a team is everyone pulling together, and yes it's stressful at the time, but even very good management cant see all ends, just like very good engineering cant predict everything. I don't think it's useful to start pointing the finger at management and "asking some pointed questions at leadership" because sometimes everyone is doing their best. Yes we should analyse our failures so we can do better, but your tone is very accusatory, and I believe that a better approach is an all inclusive chat about how we can do better, and management saying "great job engineering" for fixing it, and giving them a break after the stressful event.

TheCondor6y ago

Does the duration of their downtime suggest a “1/1000” unmonitored oversight? Or is it more like a threshold that was meet and probably could/should have been observed?

And FWIW, they have down time every day and weekend, at least in a virtual sense; the load does drop off in a very real sense too. You are spiritually correct, they should pull together and sort it out, and they owe nobody money here (don’t use a discount broker if you want some sort of guarantee about trades) but as a general rule you should ever feel too sorry for banker under just about any circumstances. The harshest lesson here, for everybody, was the only thing they would do for you was give you some commission free trades but that won’t work with this one, so a non-apology is what you get.

2 more replies

raiyu6y ago

No doubt there are many complex systems and they inevitably go down. Every provider has suffered meaningful outages.

I think the issue here isn’t so much that the system went down but the blog post.

It’s very light on details and doesn’t go far enough in terms of re-establishing trust with the customers that were affected. Which by the looks of it is everyone attempting any trade most of the day on Monday.

luckylion6y ago

On the other hand, they've had plenty of time and resources to do just that in a reliable fashion, it's not like it's one guy in his bedroom (I hope!). It's not like they are volunteers doing this open source for the community, they are getting paid (very well, I assume) to run the system. And Management is getting paid (even better, I assume) to make sure the priorities are right and correct decisions are taken. "Who could've known there might be a lot more traffic" sounds like somebody failed in Management, and engineering might have failed by not foreseeing the issue and/or informing Management.

Sure, don't burn people at the stake, but "hey, it's hard, don't blame them, they are doing their best" doesn't cut it for me. I'm sure they're expecting to be paid and not for someone to "do their best" to pay them.

1 more reply

Ntrails6y ago

> Or the trading platform thats running sub millisecond trades and downtime means 300,000 USD per minute.

I mean, I'll bite. Assuming you only traded 6 hours a day (ie US) that'd be a 27bn dollar a year strategy, and the only way for returns to be linear and trading to be sub milli is market making/arbitrage.

That is a lot of half spreads...

2 more replies

techie1286y ago

Kudos, these are moderate sized systems you've built over your career. There are lot bigger and more mission critical systems in the world and you might build them one day.

I understand GP's tone wasn't exactly nice here. But here's the rub with RH's outage. RH is unfortunately in an industry (Finance, Healthcare, Aviation, Food, etc.) where people _need_ to trust them to be successful. The consequences of failure in these industries is very catastrophic not only for them but their clients. Sure failures happen but the scale at which RH has failed and the lukewarm response they've put out has pissed off people. I don't recall any brokerage, old or new, that has failed so catastrophically and has responded to it so poorly. If you think you have a worse example, I am all ears.

6 more replies

LaserToy6y ago

It is not about scale, it is about the fact that people lost real money. If you can’t make it work you should not be in that business, and I don’t really care how hard they work.

I’m taking my account off their platform.

3 more replies

dirtydroog6y ago

If you used Scylla you'd have only needed 90 nodes. (Don't believe the instability rumours)

1 more reply

donavanm6y ago

Ive seen bigger, scarier, potentially costlier time based bugs personally. I dont think this would make me reevaluate my employment if I was at robinhood. As the parent says you either learn these lessons the hard way or you havent learned them yet. Thats doesnt translate to being a “leadership failure.”

Your smaller point about prioritization is spot on though. I dont believe Ive seen any similar incidents lead to business ending outcomes. I personally point to sony or, more recently, equifax as examples of the disparity between actual business impact and technical abhorrence. In light of that why is it worth trying to preemptively solve technical challenges instead of business needs? Every calorie spent on “what if” subtracts from “whats needed.”

kerng6y ago

Reminds me of the book Showstopper and the personal stories in - its about the creation of Windows NT. Pretty interesting how things where not so differnet some 30 years ago

In case anyone is interested: https://www.amazon.com/Show-Stopper-Breakneck-Generation-Mic...

dmix6y ago

Interesting, so it took 5yrs, $150-million, and 250-employees to get NT shipped. Adding this one to my reading list!

bertil6y ago

Important step though: have a retro, many maybe and write a report explaining what was messed up and how you might mitigate in the future. It looks like it’s going to be a good one. If you can share a sanitised version publicly, that would hopefully make it all a little bit more worth it.

I think I speak for everyone here if I say that, if that report is public and interesting, everyone on this thread will be happy to get you a drink.

vinaypai6y ago

This is all true for a company that is actually pushing any boundaries as opposed to failing pathetically at a well solved problem.

indecisive_user6y ago

Robinhood opened up stock trading to a large portion of the population that would otherwise not have been interested in traditional trading platforms with high commissions.

Their success helped to pressure companies such as TD and Schwab to mostly get rid of commissions as well, which is great for the average trader

I think Robinhood has a lot of problems, but to say they're not pushing any boundaries ignores the huge changes they've brought to the industry.

C1sc0cat6y ago

This is the 21st century low cost trading has been around for several decades now

1 more reply

unicornmama6y ago

Pushing the boundaries? They wrote an app that gamifies stock and options trading...

RayVR6y ago

Having worked as a professional investor since 2012, I can say these outages can happen anywhere. I've seen day long outages at exchanges where tens or hundreds of billions of dollars would have been trading, at brokers where who knows how much would have traded. I've also experienced these outages at retail companies that are more established, including TD Ameritrade (I become a customer when ThinkOrSwim was acquired.) I have also seen brokers screw over individuals on a significant scale without real ramifications.

The fact that Robinhood is telling people anything about the outage is only because they are the company they are, operating in the startup world/mentaity.

To the people thinking they should be compensated in some way...If you are doing >$1m daily volume, maybe you can contact them to see what they can do but even then, I doubt it. The way this should be handled is to have multiple executing brokers. You can implement offsetting positions if needed and transfer positions when your main account becomes available, if you are using a broker that can clear. Right now it seems Robinhood is working to implement clearing but you could still go to neutral or put on your positions.

twic6y ago

> The fact that Robinhood is telling people anything about the outage is only because they are the company they are, operating in the startup world/mentaity.

Yep. Intercontinental Exchange and Eurex, two huge capital markets exchanges, routinely have multi-hour outages and don't even acknowledge that they've happened, let alone explain them.

whb076y ago

multi hour isn't day and a half.

Itsdijital6y ago

I have mixed feelings of sympathy about this whole RH thing.

Anyone who has used RH regularly should be well aware of how inept it is. Any spikes in volume or volatility, even on a single stock, bring it to it's knees pretty often. Like not just the last week, but even during calm periods. I've personally lost 20-30% on positions solely because RH was bugging out, thankfully I use RH just for "fun trades" usually <$100.

I cannot fathom having the balls to trade any real amount of money on the platform while being aware of these long term issues.

On the flipside I feel for new users and perhaps even generally inactive users who weren't aware of RH's incredible flakiness. I'd imagine (or hope to) the losses of most of those users were small, assuming they were new or casual and just testing the waters.

Even if one of my small plays hit it big on RH, the money would just go to my main account on TD (which has been smooth all week shy of a few hiccups Fri morning during record volume). It's been obvious for a long time that RH should not and cannot be trusted. If you're trading options with a $60K account on RH, well, I don't even have words for that level of ignorance.

stef256y ago

I abandoned Coinbase after having difficulties getting a few 1000 bucks out of there. It worked out in the end.

Problems with my data I can tolerate up to a point. Problems with my money I absolutely can not tolerate. As you said, it's unfathomable how people can trade money on a platform that's flaky.

robinson-wall6y ago

The interesting thing about working for a UK challenger bank - I now have visibility into all of the outages going on at large, high-street banks here.

Complete outages are rare, and well-publicised, but things go wrong a lot more[1] than you might think without any communications to customers that anything is wrong, sometimes outright denying[2] that there's a problem.

1: https://twitter.com/nickrw/status/1141058572547215360

2: https://twitter.com/nickrw/status/1164162320672669696

twic6y ago

IIRC the SLA for FPS is 2 hours. So if a bank stops processing them for an hour, that's within tolerance, and they don't need to tell anyone.

I think your point is that it's a very different mindset to the native internet world, and that is certainly true!

0x8BADF00D6y ago

It’s another example of why DevOps has become a buzzword and most teams just pay lip service to it.

UncleMeat6y ago

Everything has outages. Is this the new narrative now that we've moved on from the leap year thing? That RobinHood is just a bunch of shitty engineers?

There are no public details about the root cause.

I think RH is bad for people in general, but this pile-on is outrageous.

Itsdijital6y ago

Robinhood crashing isn't an isolated unfortunate "well it happens to everyone" moment.

RH has constantly had issues at least since I started using it over a year ago. I didn't notice it really at first, but I also didn't know much about anything trading related back then. It didn't take long though for me to have my first "incident" where my market orders were seemingly vanishing into the abyss as the underlying moved. I'm not talking seconds, I'm talking minutes. For a market order on high liquidity options. Never mind trying to get filled at anything besides the ask (buying) or bid (selling).

RH has had serious underlying issues for a long time now. This incident didn't happen in vacuum. The writing has been in huge block letters on the wall for a long time.

1 more reply

kortilla6y ago

No, a brokerage being down for an entire day is not the norm.

cheez6y ago

There are a couple of situations where outages are not normal or acceptable:

1. Dealing with other people's money 2. Monitoring/managing other people's health

3 more replies

bob10296y ago

Saying 'everything has outages' is kind of disingenuous. There are many computer systems in the world today that can be considered to have practically perfect up-time. Mainframes have uptime measured in decades. I realize the concept of 1 gigantic iron box in a heavily-fortified installation with 2N+1 redundancies throughout is still not enough to ensure 100% uptime. But, when is the last time you swiped your credit card and had a failure to process the transaction?

1 more reply

frockington16y ago

> That RobinHood is just a bunch of shitty engineers?

It is confirmed they are worse than virtually any reputable brokerage. It might not be their fault directly but its 2020, not 1998

jennyyang6y ago

I know quite a few people that were personally affected by this and lost money due to the two outages and they are all pulling their money from Robinhood. The fact that they can't offer any compensation might be a big problem for them, since they already have zero trading fees, which is what most brokerages offer as compensation.

Personally it doesn't pass the smell test for me. The load was much higher the previous week and load problems go away once the load disappears. They probably had a lot less load the rest of the day, so the fact they were down the entire day suggests it was something else. I would need a fully transparent post mortem before I believed anything they said.

solidasparagus6y ago

Failures due to high load can take a while to resolve - you often need to fix the broken infrastructure, process the backlog, and catch up to live.

hcknwscommenter6y ago

You can't process the backlog on a trading platform. If i put in a trade at 2:20 pm and the system goes down, I don't want my trade to execute next morning at market open. That's insane. Especially the RH flavor of YOLO infinite leverage call option nonsense.

radicaldreamer6y ago

Exactly, you have to default to fill or kill within the trading day. You just can’t treat certain products like a standard queue... sometimes time is the most important component

1 more reply

solidasparagus6y ago

True for RH trades but I'm sure there is a lot of other data being handled - such as market data.

afc6y ago

Load problems don't go away when the load disappears. If the system isn't engineered very carefully (this takes a lot of work!), you may have cascading failures that may take hours to resolve, especially if you have bad retry policies (their mention of thundering herd problem seems to indicate that they might).

We wrote a bit about this here: https://landing.google.com/sre/sre-book/chapters/addressing-...

I would strongly caution anyone who thinks this subject is trivial, just add a bit of load shedding and you're done. I wrote a bit about my team's work (including a simplified view of some of the considerations that go into how we do retries) here: https://landing.google.com/sre/sre-book/chapters/handling-ov...

jennyyang6y ago

They specifically said it lead to a DNS failure. They didn't mention anything else, like corrupt data, etc. Sure there are plenty of ways that outages, not just load problems, can cause significant outages, but what Robinhood specifically said was that they had load issues that lead to a DNS failure. They should be more forthcoming with exactly happened if they want people to trust them.

rpdillon6y ago

I'm not sure it's fair to assume that service gets automatically restored when load dissipates after failures due to high load.

driverdan6y ago

This isn't something new, downtime is the norm for Robinhood. Anyone trusting them with more than play money is foolish.

neuronic6y ago

This is the correct sentiment. People who put anything more than play money into Robinhood should not be surprised when their financial life is ruined.

balls1876y ago

How did they lose money?

jiqiren6y ago

Quick example: They bought puts on Friday and couldn't unload them for a full day + following morning.

Monday morning puts were down - it was obvious the market was recovering in a big way. Instead of cutting losses at ~20% in the morning they lost ~99% of their position. Some lost 100% since the options expired EOD.

balls1876y ago

Thank you for the explanation.

UncleMeat6y ago

> it was obvious the market was recovering in a big way

Was it? Markets started up today but ended way lower.

2 more replies

rolltiide6y ago

> The fact that they can't offer any compensation might be a big problem for them, since they already have zero trading fees

Robinhood makes the most money than any known firm on Wall Street by getting paid specifically to leak user's trades to other traders.

SEC requires a periodic report on that which shows compensation.

Can't believe people are still buying Robinhood's pitch of misdirection.

LatteLazy6y ago

Is there a source you can cite for that? Why would anyone want retail investor order data? Especially since most of their orders execute immediately, so you can just get the trade data from the venue...

a2h6y ago

Rule 606 disclosure for the source. And it's not about the trade data it's the order itself. Second link is a very thorough explanation.

https://cdn.robinhood.com/assets/robinhood/legal/RHS%20SEC%2...

https://www.google.com/url?sa=t&source=web&rct=j&url=http://...

JumpCrisscross6y ago

> Why would anyone want retail investor order data?

Former market maker here.

Retail flow is low risk. If I buy $100mm of institutional flow, I could get a bunch of corporate hedging orders. Or I could make a single bet against George Soros. With retail, one tends to find lots of small orders. Even if there are some with high information, i.e. they're smart money and I'm going to lose money trading against them, they're small enough to be manageable.

Retail is also low information. At an old job, we bought a prominent retail broker's options flow. The number of in-the-money unexercised options that would come through that pipe was mind-blowing. (Today, whoever was buying Robinhood's flow likely got the same.)

SifJar6y ago

I think you may misunderstand the concept of "Payment for Order Flow"

rolltiide6y ago

In what way?

How would you describe it in a way everyone can understand in as few words?

1 more reply

RestlessMind6y ago

This is such an empty update. At the very least, they should have published a detailed postmortem or committed to one by a certain date. How are we supposed to know that they have learned their lessons?

harikb6y ago

I don’t work for them, but I am pretty sure we can blame the litigious nature of this industry for the lack of detail in the postmortem. Not everyone can afford to be cloudflare :)

Even for Cloudflare, I thought the company will get sued out of existence after the proxy data leak, but finance industry/SEC etc is a completely different ballgame.

dx0346y ago

I believe it's the fear of litigation rather than actual litigation. Other companies also manage to publish postmortems and don't get sued out of existence.

elliekelly6y ago

The compliance world isn’t quite as fast-moving as tech. Even a “high priority” business continuity post mortem at a financial institution is going to take at least a week for all of the lawyers & senior management to agree on the language.

dilly_li6y ago

Start from the email notification. They have been asking themselves the easy questions.

Just look at the top questions in their email:

* Are the funds in my account safe? Yes, your funds are safe.

* Was my personal information affected? No, your personal information was not affected.

* Can I use my Robinhood debit card? Yes. If you have a debit card, you should have been—and should still be able to—use your card, but you may have had issues receiving notifications, viewing your balance, and seeing transactions in your app.

------------

The real question is: How is Robinhood compensating for the missed trades?

Stop asking yourself the easy questions, RH.

throwsprtsdy6y ago

I think it's unlikely that Robinhood (or any brokerage) would compensate people for losses on hypothetical trades that could have been made during an outage. Such a policy would allow customers to pick their entry and exit points, and extract money from the brokerage at will.

Even if the trades were well-defined at the time the outage occurred, there would still be an asymmetry between people demanding compensation on their profitable trades while eschewing losses on their bad trades. It's doubtful any brokerage would be willing to eat that.

Execution risk is a risk.

asah6y ago

Are expiring in-the-money options a "hypothetical trade" ?

benmanns6y ago

Those are automatically exercised at expiration.

2 more replies

throwsprtsdy6y ago

That's an interesting question. I suppose it's hypothetical in the sense that they now have to look at "what if" those options had been exercised; but unlike a spot trade that someone "would have" done, Robinhood might already have had obligations on its end of the original options trade.

hcknwscommenter6y ago

Absolutely. Why wouldn't they be?

topherpalmtree6y ago

Yeah seriously if you have > a few hundred in options on robinhood. And you’re waiting until the day they expire to unload them. You’re dumb or don’t care about your money.

kccqzy6y ago

No brokerage will do that. Here's an excerpt from the account agreement of Schwab, a respected discount broker:

> During periods of heavy trading and/or wide price fluctuations ("Fast Markets"), there may be delays in executing your order or providing trade status reports to you. […] Schwab is not liable to you for any losses, lost opportunities or increased commissions that may result from you being unable to place orders for these stocks through the Electronic Services.

throway98126y ago

This is absolutely not true. Broker-dealers and brokerages routinely credit clients for execution out of line with the market. Schwab does in fact give price adjustments for slowly or incorrectly handled orders.

The reason nobody will be compensated here is due to two things,

(1) There is no way to determine what a fair execution would have been, since clients couldn't submit orders in the first place.

(2) Clients will adversely select their losing trades for corrections and this would bankrupt Robinhood in about five minutes.

Source: work at a wholesaler.

vel0city6y ago

I mean, you say its "absolutely not true" and yet that's literally verbatim from their Brokerage Account Agreement.

https://www.schwab.com/public/schwab/nn/agreements/schwab_br...

Maybe in some cases they go above and beyond their account agreement if they like you as a customer, but according to the agreement you sign with them its not their problem if things go bad in this way.

yesiamyourdad6y ago

> There is no way to determine what a fair execution would have been, since clients couldn't submit orders in the first place.

On the flip side, clients have no guarantee that there would have been a counterparty for their order.

neom6y ago

I had to wait on hold for well over an hour to get through to the HSBC trading desk, HSBC isn't going to compensate me.

driverdan6y ago

Wait, people still trade over the phone?

Scoundreller6y ago

Dunno about HSBC, but in Canada, often you have to call your broker when you want to sell a stock on a different market than you bought it.

E.g. Buying TD in Canada, and wanting to sell on NYSE for US$.

manigandham6y ago

Unlikely to have compensation for trades, and only people with limit orders set before the outage would be able to claim damages.

It's no different than you breaking your phone or losing your network connection. Nothing is guaranteed to work all the time. RH might face fines for the extended nature of the outage though, specially since they've managed to avoid them for plenty of past mistakes so far.

floatingatoll6y ago

If they compensate for missed trades due to service outages, then an attacker could take a position, repeatedly DDOS Robinhood until the position is favorable during a DDOS, and then demand reimbursement since they "would have" cashed out that favorable position.

It follows that Robinhood must never reimburse for outages.

jsf016y ago

I’d be interested to read a deep technical post-mortem like those which have become fairly standard among other big tech companies. Hoping Robinhood does the right thing here.

0xy6y ago

Still silence on the traders who lost tens of thousands of dollars? Are they going to be compensating or not?

This blog post doesn't appear to say anything. It's not an apology, it's not an explanation, it doesn't say what they're going to do in response.

This is after the incident in which there was no status updates or support availability for multiple hours of time. Why can't they commit to updates every hour or every 30 minutes?

SkyPuncher6y ago

I'm having a really hard time understanding this argument.

Unless I have an SLA with a provider outlining penalties, they don't owe me anything if they go down. How is this any different?

titanomachy6y ago

You could view it as a business decision. Will they lose reputation and customers if they don't compensate for the outage? Do they expect that the long-term cost of that loss would be more than the one-time hit of paying out now?

They may not have a legal/contractual obligation here, but that doesn't mean that treating their customers poorly is without consequence.

solidasparagus6y ago

That one-time hit would be massive. With the benefit of hindsight, everyone is going to say they lost money by missing the perfect trades.

topherpalmtree6y ago

Yeah but your business decision was to be a high stakes gambling platform to begin with

ivalm6y ago

Almost certainly they cannot compensate. If average user lost $1k that's a cool $10b they would need to compensate.

0xy6y ago

The difference is regulation. There are very few regulations and oversight on cloud compute providers, whereas an average person cannot just spin up an app and begin selling securities in a month as you can being a cloud provider.

While RH's ToS does theoretically absolve them of technical issues, they are obligated to comply with 'best execution' securities mandates, no? Separately, it'd be extremely bad for business if they refused compensation.

The point is moot anyway, since they're offering "case-by-case" compensation.

https://techcrunch.com/2020/03/03/robinhood-outage-cause/

toomuchtodo6y ago

Robinhood will have to deal with a flood of FINRA and SEC complaints from these outages. I'm unsure how much longer FINRA will allow them their broker dealer license with a copious amount of failure in the rear view mirror.

Arbitration is forced, but Robinhood is on the hook for the fees for everyone who decides to arbitrate. Robinhood users might not get anything, but they can still cause pain.

1 more reply

hcknwscommenter6y ago

You mean "case-by-case" denial delay and obfuscation.

crystaldev6y ago

> This blog post doesn't appear to say anything. It's not an apology, it's not an explanation, it doesn't say what they're going to do in response.

On the advice of any good lawyer.

ska6y ago

I agree the level of feedback isn't great, but what would people be compensated for? Did they misplace actual orders?

CamelCaseName6y ago

There were some people claiming that RH erroneously exercised their options on r/wallstreetbets. Could be a hoax, but if it isn't, then that seems like grounds for compensation.

Of course, no one complains when RH makes a mistake in the client's favor.

ivalm6y ago

Those people don’t know what is pin risk. Basically their long puts got exercised because automatic execution is determine at 4pm and they didn’t object (creating a short position in equity), their short puts didn’t get exercised because the stock rallied by 5pm and their counterparty was diligent and prevented auto execution (thus no long equities position to compensate the short equities position). Robinhood couldn’t rebuy the short equity position because the actual price rose above the put price leading to a net loss.

1 more reply

conanbatt6y ago

You couldn't execute or cancel orders.

alkonaut6y ago

That means there are no orders mishandled either. If no one has an SLA then just switching the servers off without thinking about whether customers were planning on trading seems fully in their right. This is terrible for their reputation, but that does't mean they are going to start handing out money because people argue they could have avoided losses if the servers had been up. It's going to be extremely difficult for any customers to back that up legally.

1 more reply

mandelbrotwurst6y ago

People lost the opportunity to place orders. Determining the actual cost is of course impossible since you don't know what orders people would have placed.

ska6y ago

Absent a contract on availability that doesn’t sound like something you would have a case for.

1 more reply

cdurth6y ago

Yesterday was the largest upswing in market history and the entirety of RH missed out.

majormajor6y ago

"Missed out" doesn't seem like the right phrase here. If you already owned the stock, you still held it, no?

So people who were going to continue to sell off got lucky that they couldn't make that trade, and people who were going to buy got unlucky?

Does anyone seriously expect compensation, or think that it's deserved, or is it group wishful thinking? How would it even work? Would they just take people's word for their supposed intent? Or are people wanting some sort of "here's a gift card" type deal?

This is not to defend RobinHood - I've personally kept my money with well-established companies cause conservative, old, proven systems seem like a good thing for a product in this space - but shit happens, no? There will be more good days, and more bad days, in the market, it's a long-run game anyway, and it's pretty easy to vote with your wallet in this space.

3 more replies

endorphone6y ago

The close of today is effectively the open yesterday, so everyone is back where they were.

Of course the problem with the "compensate me" arguments is that a lot of people were going to make decisions that would have turned out poorly yesterday (indeed, the market is balanced and every transaction has a counterparty), though of course with the amazing clarity of hindsight few would recognize or admit that. So if they need to compensate for illusory lost trades, do some people have to pay them for losses they would have incurred?

[I get that there are some complex options that can legitimately be all downside when trading isn't available, but that's a less common option]

1 more reply

hcknwscommenter6y ago

Dude. They are NOT compensating. This is clear.

dang6y ago

Recent and related:

https://news.ycombinator.com/item?id=22477567

https://news.ycombinator.com/item?id=22475019

https://news.ycombinator.com/item?id=22468361

https://news.ycombinator.com/item?id=22465178

aloknnikhil6y ago

Genuine question: With no commission trading at places like Schwab and eTrade, is it even worth trading on Robinhood? For as far as I could remember (about 2 years ago), Robinhood has always failed to scale.

manigandham6y ago

Options are completely free on Robinhood while they still have a per-contract fee at other brokerages. If you don't care about that then no, there's no reason to stick with Robinhood.

benmanns6y ago

Additionally Robinhood self clears options (or for some other reason?) and does not charge the Options Clearing Corp fee of $0.055/contract or the Options Regulatory Fee of $0.0388/contract which all other brokers charge (incl. ones with $0 or flat rate commissions/fees like WeBull, Gatsby, Tradier). All you pay is the FINRA and SEC fees on sells of about a penny each for small trades.

Actually, if anyone knows of another broker who _doesn't_ charge these, please let me know. If you're first for the broker I'll give you $20 for the tip.

Itsdijital6y ago

Trust me, please trust me, you really really really want to be paying a competent broker when trading options.

If it's chump change you're trading, sure, use RH.

If it's serious money, the $0.65/contract or whatever pays for itself many times over. Even if it's just the ability to regularly get filled between the spread it pays for itself.

2 more replies

kccqzy6y ago

What kind of options are you trading such that the fee of a few cents per contract is noticeable? The bid-ask spread is wider than that.

1 more reply

mjs336y ago

Their DNS system failed? How?! Unless DNS stands for “Do Not Sell”

tbrock6y ago

This happened to us at Hustle years ago. Basically if you run on AWS there’s a DNS server provided inside each VPC that usually works fine but which has no observable load metrics etc... so you don’t really know you are slamming it and are about to have a problem unless you audit your entire codebase.

Why? Well that tiny DNS server has certain capacity constraints and if you don’t cache DNS lookups by using a http/https agent for example (in NodeJS) you wind up looking up the same dns info over and over and churning sockets like it’s going out of style. If you run really really hot the poor thing falls over (rightly so).

The limits are high and DNS is fast so you usually don’t notice but when you are under load bugs like this come out of the woodwork. When it falls down you look up the AWS docs, lean back in your chair upon finding this isn’t an “elastic” part of AWS and say “FUUUUUUUUCK” so loud it can be heard from outer space.

If you are Robinhood though don’t you have some former Netflix SRE/DevOps beast on staff that knows this and so you run your own DNS and monitor it?

jcheng6y ago

I read this and thought, “surely there’s an OS-level DNS cache?”

Apparently not on Linux! https://stackoverflow.com/questions/11020027/dns-caching-in-...

anaphor6y ago

Well, there is https://www.freedesktop.org/software/systemd/man/systemd-res... but you may or may not think that's part of the "OS".

JdeBP6y ago

That's misleading. The way that this has worked for decades on Linux-based operating systems and on Unices is that one installs a local caching DNS proxy, choosing one of the many available: ISC's BIND, Bernstein's dnscache, unbound, dnsmasq, PowerDNS, MaraDNS, and so forth.

Every Unix system having a local caching DNS proxy was and is as much a norm as every Unix system having a local MTS. A quarter of a century ago, this would have been BIND and Sendmail. Things are more variable, now.

To illustrate that this was considered the norm, here is a random book from the 1990s. Smoot Carl-Mitchell's _Practical Internetworking with TCP/IP and UNIX_ says, quite unequivocally:

> You must run a DNS server if you have Internet connectivity. The most common UNIX DNS server is the Berkeley Internet Name Daemon (BIND), which is part of most UNIX systems.

People sometimes think that this is not the case nowadays, and the fact that a computer is a personal computer magically means that a Unix or Linux-based operating system should offload this task and not perform it locally. They are wrong, and that is DOS Think. Ironically, they don't even get to play the resource allocation card nowadays. The amount of memory and network bandwidth that needs to be devoted to caching proxy DNS service on a personal computer is dwarfed by the amounts nowadays consumed by WWW browsers and HTTP(S).

There's no similar argument for a node in a datacentre.

Ideally, not only should every machine have a (forwarding/resolving) caching proxy DNS server, every organization (or LAN, or even machine) should have a local root content DNS server. A lot of (quite valid) DNS lookups stop at the root with fixed or negative answers. Stopping that from leaving the site/LAN/machine is beneficial.

Ironically, putting a forwarding caching proxy DNS service on the local end of any congested, slow, expensive, or otherwise limited link is advice that I and others have been handing out for over 20 years. It's exactly what one should be doing with things like Amazon's non-local proxy DNS server limited to 1024 packets/second/interface.

* http://jdebp.uk./FGA/dns-server-roles.html#ChoosingProxy

So the question is not whether there a local DNS cache mechanism exists. It's whether it's set up by the company dishing out the VMs, and if not why not. Amazon provides instructions on how to add dnsmasq, and clearly labels this as how to reduce DNS outages. So it's not even the case that Amazon is wrongly discouraging having local caching proxy DNS servers.

* https://aws.amazon.com/premiumsupport/knowledge-center/dns-r...

2 more replies

ajsharp6y ago

Wait, what?? There's an invisible DNS server running inside your VPC? I get what you're saying wrt cached DNS lookups but this seems wild.

ra1n856y ago

It's a DNS resolver that runs on the hypervisor hosting every instance.

1 more reply

andreareina6y ago

This allows them to hand out private network addresses (IIRC they use 172.x.x.x) when the DNS query happens from within AWS.

rconti6y ago

"Invisible?" I mean, everyone who builds AWS infra, even just single ec2 instances, is aware of it. It's definitely possible that application engineers aren't aware, though.

PaywallBuster6y ago

AWS should simply provide monitoring and alerting by default on these footnote service limits.

manigandham6y ago

What scenarios cause this many DNS lookups though? Connections should be kept-alive after the IP translation, so if it's really new connections being setup constantly then wouldn't that show up as a major bottleneck first?

aeyes6y ago

Running on Kubernetes this is easy, it's one of the first issues you hit.

Every DNS request for external domains turns into 10 if you don't explicitly configure FQDNs (dot at the end). This is because in the default configuration the resolver runs with ndots 5 to search all the possible internal Kubernetes and cloud-provider names. Then you have lookups for IPv4 and IPv6 in parallel. So for every external name you look up, you storm the upstream DNS with 10 requests for non existing domains.

Furthermore, the current default DNS service in Kubernetes doesn't have any kind of caching for these kinds of lookups (especially not NXDOMAIN) enabled.

But like I said, this is one of the first issues you hit running Kubernetes on Amazon. It is widely known and can easily be fixed by scaling up some more instances, changing ndots settings, using FQDNs or configuring caching. There is no way that this was the issue, it is plastered all over the internet, the logs are clear and the fixes can be implemented in minutes.

It also doesn't go down completely, the rate-limiter is packets/s on the interface.

dilyevsky6y ago

It’s easy to have tens of thousands of dns lookups per sec if you don’t know what you’re doing or didn’t pay attention. Connections wouldn’t be bottleneck if the are outbound.

tempsy6y ago

Sad that there isn’t an actual apology anywhere to be found in the letter at all.

And now with the fed rate cut the interest on cash is only 1.3%, with more cuts expected later in the year, which was the last big differentiator. I don’t see how they don’t see massive net withdrawals going forward.

CamelCaseName6y ago

> And now with the fed rate cut the interest on cash is only 1.3%, with more cuts expected later in the year, which was the last big differentiator. I don’t see how they don’t see massive net withdrawals going forward.

This isn't really an issue because the fed rate cut impacts everyone. Other institutions will cut their interest rates as well. I know of a few banks (Canadian) that have already lowered their GIC rates.

If anything, this is actually good for RH. Now instead of comparing 1.8% at RH and 1% at another Financial Institution, you're comparing 1.3% and 0.5% -- a much bigger multiple.

tempsy6y ago

most brokerages don’t actually pay anything. With another cut it’s going to be <1% vs 0%. Hardly anything even with a six figure balance. That’s my point.

GaryNumanVevo6y ago

Yeah because if they're culpable then they can be sued via class-action

xyst6y ago

The boys down in the salt mines of WSB will want a blood sacrifice.

Founders should be fired. CTO/CIO should be replaced.

vinaypai6y ago

Historic... Unprecedented... Thundering herd, a bunch of excuses to explain why they couldn't handle the volume that most real brokerages handle every second.

alishan-l6y ago

I heard it was related to the leap year. Apparently they had downtime 4 years ago as well.

ablekh6y ago

I'm curious about your thoughts on why a technical infrastructure, which, by nature of being cloud-native, is supposed to be (and likely has been) architected as a highly elastic platform, have not stood the test of time in this regard.

Based on the in information from Robinhood's careers site, their platform is largely based on the following technology stack:

  - Python, Django, Django Rest Framework
  - Go
  - PostgreSQL
  - Container and container orchestration technologies (Docker, Kubernetes)
  - Microservice-oriented architectures and related OSS technologies (Kafka, Celery/RabbitMQ, nginx, Redis, Memcached, Airflow, Consul)
  - Cloud-native infrastructure (AWS, GCP)
  - Infrastructure as Code and configuration management (Terraform, SaltStack, Ansible, Chef, Puppet)
  - CI/CD and test automation frameworks (Cypress.io, Jenkins, Appium, UIAutomation, Bazel)

vbtemp6y ago

On Reddit I've been trying to ask this ELI5:

Why would you use RH instead of a normal, mainstream brokerage like Vanguard, Fidelity, etc that already has (1) an app and (2) commission-free trades?

lkbm6y ago

Easy answer: As someone who's used Vanguard for index funds and the like for a couple decades now, I had no idea they had an app or commission-free trades. They don't market this at all.

As a secondary answer, normal, mainstream brokerages have pretty bad tech, tbh. I don't expect it to be worse than Robinhood in terms of things like security, and I expect UX to be worse. (Side note: I just discovered that Vanguard actually has a secret security key option hidden under Account maintenance, so I can finally switch from sms 2fa. +1 to Vanguard.)

infinite8s6y ago

> Side note: I just discovered that Vanguard actually has a secret security key option hidden under Account maintenance, so I can finally switch from sms 2fa. +1 to Vanguard.

It looks like you still need security codes setup:

"You'll need to register for both security codes and security keys, however. That's because keys and codes go hand in hand—if you lose your key or don't have it, we'll need to send you a code in order for you to log on. In addition, you'll always need a code to access your accounts from a mobile device."

If an attacker can skip the security key you might as well not use one.

fny6y ago

My brother has a Fidelity account and apparently even he was blocked from putting in orders online last Thursday, so I'm not sure they're immune either.

acchow6y ago

Can't wait for the post mortem

joobus6y ago

I don't think we will get a postmortem. Their lawyers will kill it because it will be an admission of guilt and open them up to even more legal liability.

wbl6y ago

The SEC did one for Knight Capital.

SkyPuncher6y ago

Being down for a trading day is not the same as actively selling $440 million in assets.

1 more reply

numlock866y ago

Maybe in another four years when they finally realize they still haven't fixed the leap bug. Didn't work out for this year apparently. Last leap year had the exact same problem. The problem is that the ticket is very low priority because right now it is working again and won't happen again until at least 2024 ... By then it will most likely be forgotten. Again.

lemmox6y ago

I'm always amazed by how tricky DNS failures can be.

Ambele6y ago

I can't help but think this glitch was a good thing and Robinhood investors would do better if they traded less anyhow. According to an OpenFolio correlational study, traders who trade more than 12 times per year make 0.5% less than traders who trade less than 12 times per year. OpenFolio was one of the first three websites to have an API integration with Robinhood portfolios.

VWWHFSfQ6y ago

every time I see a company that has "co-CEOs" I always wonder what kind of weird stuff is going on in that company

winrid6y ago

Wasn't Steve Jobs a "co-CEO" of Apple for a while? (Edit, I mean after he came back from NeXT)

LaserToy6y ago

Blame the load. If only Robinhood were not famous for saying they are hiring only the best...

The best do not go down like that.

vsareto6y ago

That’s always marketing speak and never to be believed.

dirtydroog6y ago

Companies like Robinhood regularly go down when markets are volatile. It was quite frustrating when the financial crisis was in full swing not being able to log in to my trading account. I reckon I would have made a killing.

0xDEEPFAC6y ago

"Multiple factors contributed to the unprecedented load that ultimately led to the outages. The factors included, among others, highly volatile and historic market conditions; record volume; and record account sign-ups. "

What a sad press release, I am sure people at their corporate office were sweating over this. The long and short of it is that users trusted the service would work and had possibly a great deal invested only to get a comment when everything breaks down deflecting blame "OMG we weren't prepared for what our users did!"

We live in a sad state of software. I expect things like this and the Equifax scandal to continue if things like software security, reliability, and performance aren't taken into account.

homero6y ago

This is common in Bitcoin exchanges. Never thought I'd see a stock broker go down but i guess it's the same issues.

buryat6y ago

> Traditionally depicted dressed in Lincoln green, he is said to have robbed from the rich and given to the poor.

Does the name still stand?

shrimpx6y ago

Wow this is Coinbase in 2017. Trading mass hysteria takes down unprepared Silicon Valley trading service.

shiado6y ago

Does Robinhood have stop-loss orders an if so did they execute when it was down?

sitzkrieg6y ago

volatility always shakes out the trading companies with lame infrastructure

gadders6y ago

I mean this is OK as an apology, but is there an actual post mortem anywhere was can read?

c9306y ago

dnsmasq is your friend.

egdod6y ago

Pretty light on information.

ilrwbwrkhv6y ago

wasnt this cause by leap year and them not taking that into account?

illnewsthat6y ago

They denied that on Twitter: https://twitter.com/AskRobinhood/status/1234861941413351434

tomc19856y ago

So what of the screenshot in the original twitter post? Was it doctored? Showing GMT?

ajhurliman6y ago

I don't know if it had anything to do with leap year, but I also checked dev tools and saw the same issue (requests for market data on March 3, 2020 on March 2, 2020 8AM PST). However, it was busted for both the website as well as the Android app (and I'm guessing iOS too) so it doesn't seem like it's purely a client-side problem unless all of their clients were built from the same source.

beart6y ago

Not according to the linked blog post.

wyxuan6y ago

I feel like it was that and a bunch of other things that led to this

lowdose6y ago

Did any of the complaining people actually pay RH for their service or is this 1st world entitlement?

yaur6y ago

"That in turn led to a “thundering herd” effect—triggering a failure of our DNS system."

I'm just a spectator but I can not imagine that this was somehow caused by a DNS failure.

zippergz6y ago

I’m not sure I can even count the number of outages I’ve been involved in that had DNS issues as at least part of the cause.

yaur6y ago

sure, I've seen outages that are caused by DNS config problems. But I don't think I've ever seen one caused by a "thundering herd" overwhelming DNS servers.

Another give away that this is a lie is that support emails were getting a stock postfix error message which means that MX records at least were resolving.

rhizome6y ago

Is every bitcoin company run by ex cellphone store employees? Just the exchanges?

dickjocke6y ago

Robinhood isn't a bitcoin company. That's just a feature they have. Its main product offering is the commision free trading--and their presence pushed a lot of big players to adopt the same offering. The wallstreetbets gang is silly and all, but I think they have really democratized stock trading, and made the whole idea seem much more accesible. I think they founders are former finance guys. I hope this doesn't sound like guerilla marketing. I don't even use the app, I have used it but I'm just not that interested in picking stocks. I just think it's cool as an ex-code monkey to entreprenur story.

triceratops6y ago

> I think they have really democratized stock trading

I would think Vanguard did that already. Most people should be trading ETFs, not individual stocks.

ska6y ago

That's a separate issue I think. They obviously do different things.

idnefju6y ago

I believe you still need a broker for ETFs, which usually carry brokerage fees.

3 more replies

Itsdijital6y ago

Wallstreetbets doesn't trade stocks.

tree36y ago

> I think they have really democratized stock trading

How does "free" = "democratizing"? Stocks have been easily accessible for years to retail investors.

> their presence pushed a lot of big players to adopt the same offering

Misleading, big brokers were already going down this path.

robjan6y ago

How much did it cost to place a trade for a $100 stock previously? RH definitely helped more people gain access to directly trading shares on the stock market, regardless of whether or not they were responsible in doing so.

mkchoi2126y ago

It's cool that the founders of the company publish blog posts like this for a short outage. Hope other CEOs learn from this and become even more transparent in the future :D

tmpz226y ago

A day long outage for a $7B company is a big deal. They don’t deserve credit for this.

manigandham6y ago

Short outage? They were down for almost the entire trading yesterday and hours today. And there's barely any transparency in this post compared to standard post-mortems.

rpdillon6y ago

Was this their post-mortem? I didn't see anything to indicate that.

tomc19856y ago

It was a useless puff piece that said absolutely nothing of interest. How is that worthy of a pat on the back?

bayonetz6y ago

Just use Square’s Cash App! Free stock trades AND you can buy fractional shares AND a bunch of other stuff like P2P payments and bitcoin. I work there and so can say with some authority that we can handle more volume without going down than RH can.

frockington16y ago

As an alternative use a real brokerage that has a history of success. It's becoming clearer everyday that fintech startups are not responsible

minimaxir6y ago

If you actually work at Square, it's poor form to advertise in this manner.

redis_mlc6y ago

Actually, bayonetz's posting is the only useful one in the comments for this article. Most of us are here for information from actual industry insiders, and this qualifies.

Here's some more inside info ...

If your "financial app" provider doesn't have a banking charter, run. None of the recent trendy fintech companies have a charter, and are thus clown cars.

astura6y ago

Fidelity offers banking services and doesn't have a banking charter but they aren't a "clown car," they are one of the largest financial institutions in the world.

bayonetz6y ago

Disagree. I’m suggesting a better alternative at a contextually relevant time based on personally earned experience.

pensatoio6y ago

Or they’re just taking pride in their work?

bayonetz6y ago

Indeed!

kortilla6y ago

Everyone is a Super Bowl winner when they’re armchair quarterbacking.

j / k navigate · click thread line to collapse

277 comments

czbond6y ago

cheschire6y ago

malux856y ago

Or I'm talking about a 200 node hadoop cluster thats doing the electrical metering and billing for 8 million people, and is NOT allowed to stop.

Or the trading platform thats running sub millisecond trades and downtime means 300,000 $ USD per minute.

TheCondor6y ago

Does the duration of their downtime suggest a “1/1000” unmonitored oversight? Or is it more like a threshold that was meet and probably could/should have been observed?

2 more replies

raiyu6y ago

No doubt there are many complex systems and they inevitably go down. Every provider has suffered meaningful outages.

I think the issue here isn’t so much that the system went down but the blog post.

luckylion6y ago

1 more reply

Ntrails6y ago

> Or the trading platform thats running sub millisecond trades and downtime means 300,000 USD per minute.

That is a lot of half spreads...

2 more replies

techie1286y ago

Kudos, these are moderate sized systems you've built over your career. There are lot bigger and more mission critical systems in the world and you might build them one day.

6 more replies

LaserToy6y ago

It is not about scale, it is about the fact that people lost real money. If you can’t make it work you should not be in that business, and I don’t really care how hard they work.

I’m taking my account off their platform.

3 more replies

dirtydroog6y ago

If you used Scylla you'd have only needed 90 nodes. (Don't believe the instability rumours)

1 more reply

donavanm6y ago

kerng6y ago

Reminds me of the book Showstopper and the personal stories in - its about the creation of Windows NT. Pretty interesting how things where not so differnet some 30 years ago

In case anyone is interested: https://www.amazon.com/Show-Stopper-Breakneck-Generation-Mic...

dmix6y ago

Interesting, so it took 5yrs, $150-million, and 250-employees to get NT shipped. Adding this one to my reading list!

bertil6y ago

I think I speak for everyone here if I say that, if that report is public and interesting, everyone on this thread will be happy to get you a drink.

vinaypai6y ago

This is all true for a company that is actually pushing any boundaries as opposed to failing pathetically at a well solved problem.

indecisive_user6y ago

Robinhood opened up stock trading to a large portion of the population that would otherwise not have been interested in traditional trading platforms with high commissions.

Their success helped to pressure companies such as TD and Schwab to mostly get rid of commissions as well, which is great for the average trader

I think Robinhood has a lot of problems, but to say they're not pushing any boundaries ignores the huge changes they've brought to the industry.

C1sc0cat6y ago

This is the 21st century low cost trading has been around for several decades now

1 more reply

unicornmama6y ago

Pushing the boundaries? They wrote an app that gamifies stock and options trading...

RayVR6y ago

The fact that Robinhood is telling people anything about the outage is only because they are the company they are, operating in the startup world/mentaity.

twic6y ago

> The fact that Robinhood is telling people anything about the outage is only because they are the company they are, operating in the startup world/mentaity.

Yep. Intercontinental Exchange and Eurex, two huge capital markets exchanges, routinely have multi-hour outages and don't even acknowledge that they've happened, let alone explain them.

whb076y ago

multi hour isn't day and a half.

Itsdijital6y ago

I have mixed feelings of sympathy about this whole RH thing.

I cannot fathom having the balls to trade any real amount of money on the platform while being aware of these long term issues.

stef256y ago

I abandoned Coinbase after having difficulties getting a few 1000 bucks out of there. It worked out in the end.

Problems with my data I can tolerate up to a point. Problems with my money I absolutely can not tolerate. As you said, it's unfathomable how people can trade money on a platform that's flaky.

robinson-wall6y ago

The interesting thing about working for a UK challenger bank - I now have visibility into all of the outages going on at large, high-street banks here.

1: https://twitter.com/nickrw/status/1141058572547215360

2: https://twitter.com/nickrw/status/1164162320672669696

twic6y ago

IIRC the SLA for FPS is 2 hours. So if a bank stops processing them for an hour, that's within tolerance, and they don't need to tell anyone.

I think your point is that it's a very different mindset to the native internet world, and that is certainly true!

0x8BADF00D6y ago

It’s another example of why DevOps has become a buzzword and most teams just pay lip service to it.

UncleMeat6y ago

Everything has outages. Is this the new narrative now that we've moved on from the leap year thing? That RobinHood is just a bunch of shitty engineers?

There are no public details about the root cause.

I think RH is bad for people in general, but this pile-on is outrageous.

Itsdijital6y ago

Robinhood crashing isn't an isolated unfortunate "well it happens to everyone" moment.

RH has had serious underlying issues for a long time now. This incident didn't happen in vacuum. The writing has been in huge block letters on the wall for a long time.

1 more reply

kortilla6y ago

No, a brokerage being down for an entire day is not the norm.

cheez6y ago

There are a couple of situations where outages are not normal or acceptable:

1. Dealing with other people's money 2. Monitoring/managing other people's health

3 more replies

bob10296y ago

1 more reply

frockington16y ago

> That RobinHood is just a bunch of shitty engineers?

It is confirmed they are worse than virtually any reputable brokerage. It might not be their fault directly but its 2020, not 1998

jennyyang6y ago

solidasparagus6y ago

Failures due to high load can take a while to resolve - you often need to fix the broken infrastructure, process the backlog, and catch up to live.

hcknwscommenter6y ago

radicaldreamer6y ago

Exactly, you have to default to fill or kill within the trading day. You just can’t treat certain products like a standard queue... sometimes time is the most important component

1 more reply

solidasparagus6y ago

True for RH trades but I'm sure there is a lot of other data being handled - such as market data.

afc6y ago

We wrote a bit about this here: https://landing.google.com/sre/sre-book/chapters/addressing-...

jennyyang6y ago

rpdillon6y ago

I'm not sure it's fair to assume that service gets automatically restored when load dissipates after failures due to high load.

driverdan6y ago

This isn't something new, downtime is the norm for Robinhood. Anyone trusting them with more than play money is foolish.

neuronic6y ago

This is the correct sentiment. People who put anything more than play money into Robinhood should not be surprised when their financial life is ruined.

balls1876y ago

How did they lose money?

jiqiren6y ago

Quick example: They bought puts on Friday and couldn't unload them for a full day + following morning.

balls1876y ago

Thank you for the explanation.

UncleMeat6y ago

> it was obvious the market was recovering in a big way

Was it? Markets started up today but ended way lower.

2 more replies

rolltiide6y ago

> The fact that they can't offer any compensation might be a big problem for them, since they already have zero trading fees

Robinhood makes the most money than any known firm on Wall Street by getting paid specifically to leak user's trades to other traders.

SEC requires a periodic report on that which shows compensation.

Can't believe people are still buying Robinhood's pitch of misdirection.

LatteLazy6y ago

a2h6y ago

Rule 606 disclosure for the source. And it's not about the trade data it's the order itself. Second link is a very thorough explanation.

https://cdn.robinhood.com/assets/robinhood/legal/RHS%20SEC%2...

https://www.google.com/url?sa=t&source=web&rct=j&url=http://...

JumpCrisscross6y ago

> Why would anyone want retail investor order data?

Former market maker here.

SifJar6y ago

I think you may misunderstand the concept of "Payment for Order Flow"

rolltiide6y ago

In what way?

How would you describe it in a way everyone can understand in as few words?

1 more reply

RestlessMind6y ago

harikb6y ago

I don’t work for them, but I am pretty sure we can blame the litigious nature of this industry for the lack of detail in the postmortem. Not everyone can afford to be cloudflare :)

Even for Cloudflare, I thought the company will get sued out of existence after the proxy data leak, but finance industry/SEC etc is a completely different ballgame.

dx0346y ago

I believe it's the fear of litigation rather than actual litigation. Other companies also manage to publish postmortems and don't get sued out of existence.

elliekelly6y ago

dilly_li6y ago

Start from the email notification. They have been asking themselves the easy questions.

Just look at the top questions in their email:

* Are the funds in my account safe? Yes, your funds are safe.

* Was my personal information affected? No, your personal information was not affected.

------------

The real question is: How is Robinhood compensating for the missed trades?

Stop asking yourself the easy questions, RH.

throwsprtsdy6y ago

Execution risk is a risk.

asah6y ago

Are expiring in-the-money options a "hypothetical trade" ?

benmanns6y ago

Those are automatically exercised at expiration.

2 more replies

throwsprtsdy6y ago

hcknwscommenter6y ago

Absolutely. Why wouldn't they be?

topherpalmtree6y ago

Yeah seriously if you have > a few hundred in options on robinhood. And you’re waiting until the day they expire to unload them. You’re dumb or don’t care about your money.

kccqzy6y ago

No brokerage will do that. Here's an excerpt from the account agreement of Schwab, a respected discount broker:

throway98126y ago

The reason nobody will be compensated here is due to two things,

(1) There is no way to determine what a fair execution would have been, since clients couldn't submit orders in the first place.

(2) Clients will adversely select their losing trades for corrections and this would bankrupt Robinhood in about five minutes.

Source: work at a wholesaler.

vel0city6y ago

I mean, you say its "absolutely not true" and yet that's literally verbatim from their Brokerage Account Agreement.

https://www.schwab.com/public/schwab/nn/agreements/schwab_br...

yesiamyourdad6y ago

> There is no way to determine what a fair execution would have been, since clients couldn't submit orders in the first place.

On the flip side, clients have no guarantee that there would have been a counterparty for their order.

neom6y ago

I had to wait on hold for well over an hour to get through to the HSBC trading desk, HSBC isn't going to compensate me.

driverdan6y ago

Wait, people still trade over the phone?

Scoundreller6y ago

Dunno about HSBC, but in Canada, often you have to call your broker when you want to sell a stock on a different market than you bought it.

E.g. Buying TD in Canada, and wanting to sell on NYSE for US$.

manigandham6y ago

Unlikely to have compensation for trades, and only people with limit orders set before the outage would be able to claim damages.

floatingatoll6y ago

It follows that Robinhood must never reimburse for outages.

jsf016y ago

I’d be interested to read a deep technical post-mortem like those which have become fairly standard among other big tech companies. Hoping Robinhood does the right thing here.

0xy6y ago

Still silence on the traders who lost tens of thousands of dollars? Are they going to be compensating or not?

This blog post doesn't appear to say anything. It's not an apology, it's not an explanation, it doesn't say what they're going to do in response.

This is after the incident in which there was no status updates or support availability for multiple hours of time. Why can't they commit to updates every hour or every 30 minutes?

SkyPuncher6y ago

I'm having a really hard time understanding this argument.

Unless I have an SLA with a provider outlining penalties, they don't owe me anything if they go down. How is this any different?

titanomachy6y ago

They may not have a legal/contractual obligation here, but that doesn't mean that treating their customers poorly is without consequence.

solidasparagus6y ago

That one-time hit would be massive. With the benefit of hindsight, everyone is going to say they lost money by missing the perfect trades.

topherpalmtree6y ago

Yeah but your business decision was to be a high stakes gambling platform to begin with

ivalm6y ago

Almost certainly they cannot compensate. If average user lost $1k that's a cool $10b they would need to compensate.

0xy6y ago

The point is moot anyway, since they're offering "case-by-case" compensation.

https://techcrunch.com/2020/03/03/robinhood-outage-cause/

toomuchtodo6y ago

Arbitration is forced, but Robinhood is on the hook for the fees for everyone who decides to arbitrate. Robinhood users might not get anything, but they can still cause pain.

1 more reply

hcknwscommenter6y ago

You mean "case-by-case" denial delay and obfuscation.

crystaldev6y ago

> This blog post doesn't appear to say anything. It's not an apology, it's not an explanation, it doesn't say what they're going to do in response.

On the advice of any good lawyer.

ska6y ago

I agree the level of feedback isn't great, but what would people be compensated for? Did they misplace actual orders?

CamelCaseName6y ago

There were some people claiming that RH erroneously exercised their options on r/wallstreetbets. Could be a hoax, but if it isn't, then that seems like grounds for compensation.

Of course, no one complains when RH makes a mistake in the client's favor.

ivalm6y ago

1 more reply

conanbatt6y ago

You couldn't execute or cancel orders.

alkonaut6y ago

1 more reply

mandelbrotwurst6y ago

People lost the opportunity to place orders. Determining the actual cost is of course impossible since you don't know what orders people would have placed.

ska6y ago

Absent a contract on availability that doesn’t sound like something you would have a case for.

1 more reply

cdurth6y ago

Yesterday was the largest upswing in market history and the entirety of RH missed out.

majormajor6y ago

"Missed out" doesn't seem like the right phrase here. If you already owned the stock, you still held it, no?

So people who were going to continue to sell off got lucky that they couldn't make that trade, and people who were going to buy got unlucky?

3 more replies

endorphone6y ago

The close of today is effectively the open yesterday, so everyone is back where they were.

[I get that there are some complex options that can legitimately be all downside when trading isn't available, but that's a less common option]

1 more reply

hcknwscommenter6y ago

Dude. They are NOT compensating. This is clear.

dang6y ago

Recent and related:

https://news.ycombinator.com/item?id=22477567

https://news.ycombinator.com/item?id=22475019

https://news.ycombinator.com/item?id=22468361

https://news.ycombinator.com/item?id=22465178

aloknnikhil6y ago

manigandham6y ago

Options are completely free on Robinhood while they still have a per-contract fee at other brokerages. If you don't care about that then no, there's no reason to stick with Robinhood.

benmanns6y ago

Actually, if anyone knows of another broker who _doesn't_ charge these, please let me know. If you're first for the broker I'll give you $20 for the tip.

Itsdijital6y ago

Trust me, please trust me, you really really really want to be paying a competent broker when trading options.

If it's chump change you're trading, sure, use RH.

If it's serious money, the $0.65/contract or whatever pays for itself many times over. Even if it's just the ability to regularly get filled between the spread it pays for itself.

2 more replies

kccqzy6y ago

What kind of options are you trading such that the fee of a few cents per contract is noticeable? The bid-ask spread is wider than that.

1 more reply

mjs336y ago

Their DNS system failed? How?! Unless DNS stands for “Do Not Sell”

tbrock6y ago

If you are Robinhood though don’t you have some former Netflix SRE/DevOps beast on staff that knows this and so you run your own DNS and monitor it?

jcheng6y ago

I read this and thought, “surely there’s an OS-level DNS cache?”

Apparently not on Linux! https://stackoverflow.com/questions/11020027/dns-caching-in-...

anaphor6y ago

Well, there is https://www.freedesktop.org/software/systemd/man/systemd-res... but you may or may not think that's part of the "OS".

JdeBP6y ago

To illustrate that this was considered the norm, here is a random book from the 1990s. Smoot Carl-Mitchell's _Practical Internetworking with TCP/IP and UNIX_ says, quite unequivocally:

> You must run a DNS server if you have Internet connectivity. The most common UNIX DNS server is the Berkeley Internet Name Daemon (BIND), which is part of most UNIX systems.

There's no similar argument for a node in a datacentre.

* http://jdebp.uk./FGA/dns-server-roles.html#ChoosingProxy

* https://aws.amazon.com/premiumsupport/knowledge-center/dns-r...

2 more replies

ajsharp6y ago

Wait, what?? There's an invisible DNS server running inside your VPC? I get what you're saying wrt cached DNS lookups but this seems wild.

ra1n856y ago

It's a DNS resolver that runs on the hypervisor hosting every instance.

1 more reply

andreareina6y ago

This allows them to hand out private network addresses (IIRC they use 172.x.x.x) when the DNS query happens from within AWS.

rconti6y ago

"Invisible?" I mean, everyone who builds AWS infra, even just single ec2 instances, is aware of it. It's definitely possible that application engineers aren't aware, though.

PaywallBuster6y ago

AWS should simply provide monitoring and alerting by default on these footnote service limits.

manigandham6y ago

aeyes6y ago

Running on Kubernetes this is easy, it's one of the first issues you hit.

Furthermore, the current default DNS service in Kubernetes doesn't have any kind of caching for these kinds of lookups (especially not NXDOMAIN) enabled.

It also doesn't go down completely, the rate-limiter is packets/s on the interface.

dilyevsky6y ago

It’s easy to have tens of thousands of dns lookups per sec if you don’t know what you’re doing or didn’t pay attention. Connections wouldn’t be bottleneck if the are outbound.

tempsy6y ago

Sad that there isn’t an actual apology anywhere to be found in the letter at all.

CamelCaseName6y ago

If anything, this is actually good for RH. Now instead of comparing 1.8% at RH and 1% at another Financial Institution, you're comparing 1.3% and 0.5% -- a much bigger multiple.

tempsy6y ago

most brokerages don’t actually pay anything. With another cut it’s going to be <1% vs 0%. Hardly anything even with a six figure balance. That’s my point.

GaryNumanVevo6y ago

Yeah because if they're culpable then they can be sued via class-action

xyst6y ago

The boys down in the salt mines of WSB will want a blood sacrifice.

Founders should be fired. CTO/CIO should be replaced.

vinaypai6y ago

Historic... Unprecedented... Thundering herd, a bunch of excuses to explain why they couldn't handle the volume that most real brokerages handle every second.

alishan-l6y ago

I heard it was related to the leap year. Apparently they had downtime 4 years ago as well.

ablekh6y ago

Based on the in information from Robinhood's careers site, their platform is largely based on the following technology stack:

  - Python, Django, Django Rest Framework
  - Go
  - PostgreSQL
  - Container and container orchestration technologies (Docker, Kubernetes)
  - Microservice-oriented architectures and related OSS technologies (Kafka, Celery/RabbitMQ, nginx, Redis, Memcached, Airflow, Consul)
  - Cloud-native infrastructure (AWS, GCP)
  - Infrastructure as Code and configuration management (Terraform, SaltStack, Ansible, Chef, Puppet)
  - CI/CD and test automation frameworks (Cypress.io, Jenkins, Appium, UIAutomation, Bazel)

vbtemp6y ago

On Reddit I've been trying to ask this ELI5:

Why would you use RH instead of a normal, mainstream brokerage like Vanguard, Fidelity, etc that already has (1) an app and (2) commission-free trades?

lkbm6y ago

Easy answer: As someone who's used Vanguard for index funds and the like for a couple decades now, I had no idea they had an app or commission-free trades. They don't market this at all.

infinite8s6y ago

> Side note: I just discovered that Vanguard actually has a secret security key option hidden under Account maintenance, so I can finally switch from sms 2fa. +1 to Vanguard.

It looks like you still need security codes setup:

If an attacker can skip the security key you might as well not use one.

fny6y ago

My brother has a Fidelity account and apparently even he was blocked from putting in orders online last Thursday, so I'm not sure they're immune either.

acchow6y ago

Can't wait for the post mortem

joobus6y ago

I don't think we will get a postmortem. Their lawyers will kill it because it will be an admission of guilt and open them up to even more legal liability.

wbl6y ago

The SEC did one for Knight Capital.

SkyPuncher6y ago

Being down for a trading day is not the same as actively selling $440 million in assets.

1 more reply

numlock866y ago

lemmox6y ago

I'm always amazed by how tricky DNS failures can be.

Ambele6y ago

VWWHFSfQ6y ago

every time I see a company that has "co-CEOs" I always wonder what kind of weird stuff is going on in that company

winrid6y ago

Wasn't Steve Jobs a "co-CEO" of Apple for a while? (Edit, I mean after he came back from NeXT)

LaserToy6y ago

Blame the load. If only Robinhood were not famous for saying they are hiring only the best...

The best do not go down like that.

vsareto6y ago

That’s always marketing speak and never to be believed.

dirtydroog6y ago

0xDEEPFAC6y ago

We live in a sad state of software. I expect things like this and the Equifax scandal to continue if things like software security, reliability, and performance aren't taken into account.

homero6y ago

This is common in Bitcoin exchanges. Never thought I'd see a stock broker go down but i guess it's the same issues.

buryat6y ago

> Traditionally depicted dressed in Lincoln green, he is said to have robbed from the rich and given to the poor.

Does the name still stand?

shrimpx6y ago

Wow this is Coinbase in 2017. Trading mass hysteria takes down unprepared Silicon Valley trading service.

shiado6y ago

Does Robinhood have stop-loss orders an if so did they execute when it was down?

sitzkrieg6y ago

volatility always shakes out the trading companies with lame infrastructure

gadders6y ago

I mean this is OK as an apology, but is there an actual post mortem anywhere was can read?

c9306y ago

dnsmasq is your friend.

egdod6y ago

Pretty light on information.

ilrwbwrkhv6y ago

wasnt this cause by leap year and them not taking that into account?

illnewsthat6y ago

They denied that on Twitter: https://twitter.com/AskRobinhood/status/1234861941413351434

tomc19856y ago

So what of the screenshot in the original twitter post? Was it doctored? Showing GMT?

ajhurliman6y ago

beart6y ago

Not according to the linked blog post.

wyxuan6y ago

I feel like it was that and a bunch of other things that led to this

lowdose6y ago

Did any of the complaining people actually pay RH for their service or is this 1st world entitlement?

yaur6y ago

"That in turn led to a “thundering herd” effect—triggering a failure of our DNS system."

I'm just a spectator but I can not imagine that this was somehow caused by a DNS failure.

zippergz6y ago

I’m not sure I can even count the number of outages I’ve been involved in that had DNS issues as at least part of the cause.

yaur6y ago

sure, I've seen outages that are caused by DNS config problems. But I don't think I've ever seen one caused by a "thundering herd" overwhelming DNS servers.

Another give away that this is a lie is that support emails were getting a stock postfix error message which means that MX records at least were resolving.

rhizome6y ago

Is every bitcoin company run by ex cellphone store employees? Just the exchanges?

dickjocke6y ago

triceratops6y ago

> I think they have really democratized stock trading

I would think Vanguard did that already. Most people should be trading ETFs, not individual stocks.

ska6y ago

That's a separate issue I think. They obviously do different things.

idnefju6y ago

I believe you still need a broker for ETFs, which usually carry brokerage fees.

3 more replies

Itsdijital6y ago

Wallstreetbets doesn't trade stocks.

tree36y ago

> I think they have really democratized stock trading

How does "free" = "democratizing"? Stocks have been easily accessible for years to retail investors.

> their presence pushed a lot of big players to adopt the same offering

Misleading, big brokers were already going down this path.

robjan6y ago

mkchoi2126y ago

It's cool that the founders of the company publish blog posts like this for a short outage. Hope other CEOs learn from this and become even more transparent in the future :D

tmpz226y ago

A day long outage for a $7B company is a big deal. They don’t deserve credit for this.

manigandham6y ago

Short outage? They were down for almost the entire trading yesterday and hours today. And there's barely any transparency in this post compared to standard post-mortems.

rpdillon6y ago

Was this their post-mortem? I didn't see anything to indicate that.

tomc19856y ago

It was a useless puff piece that said absolutely nothing of interest. How is that worthy of a pat on the back?

bayonetz6y ago

frockington16y ago

As an alternative use a real brokerage that has a history of success. It's becoming clearer everyday that fintech startups are not responsible

minimaxir6y ago

If you actually work at Square, it's poor form to advertise in this manner.

redis_mlc6y ago

Actually, bayonetz's posting is the only useful one in the comments for this article. Most of us are here for information from actual industry insiders, and this qualifies.

Here's some more inside info ...

If your "financial app" provider doesn't have a banking charter, run. None of the recent trendy fintech companies have a charter, and are thus clown cars.

astura6y ago

Fidelity offers banking services and doesn't have a banking charter but they aren't a "clown car," they are one of the largest financial institutions in the world.

bayonetz6y ago

Disagree. I’m suggesting a better alternative at a contextually relevant time based on personally earned experience.

pensatoio6y ago

Or they’re just taking pride in their work?

bayonetz6y ago

Indeed!

kortilla6y ago

Everyone is a Super Bowl winner when they’re armchair quarterbacking.

j / k navigate · click thread line to collapse