Why Monzo's bank transfers weren't working on the 30th of May

(monzo.com)

284 points · robinson-wall · 7y ago · 94 comments


robinson-wallOP7y ago· 12 in thread

I just posted this semi-technical post-mortem on Monzo's blog about why we had an outage with Faster Payments (UK bank transfers) last month.

I'll hang around here to answer any more technical questions if anyone's interested.

kennydude7y ago

Nice to see how Faster Payments actually work in a nice understandable way.

That does sound weird, how it was caused by a corrupted date formatter. I'm assuming it's something like the formatter reset itself back to the language default.

robinson-wallOP7y ago

Yeah, instead of `20190530` we were getting `['bert', 'time', 1559, 238096, 0]`. My understanding is a race condition put the code into a situation where it couldn't determine how to format that field, and we just got a default string representation of the underlying value.
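For anyone curious how the two values above relate, here is a minimal sketch. It assumes the corrupted value is a standard BERT time term (`{bert, time, Megaseconds, Seconds, Microseconds}`) and that the date field is expected in UTC. The function names are hypothetical and this is not the Gateway's code; the fallback branch only imitates the "default string representation" behaviour described above.

```python
from datetime import datetime, timezone


def bert_time_to_yyyymmdd(term):
    """Convert a BERT time term to a YYYYMMDD string (UTC is an assumption)."""
    tag, kind, megaseconds, seconds, _microseconds = term
    if (tag, kind) != ("bert", "time"):
        raise ValueError("not a BERT time term")
    # BERT splits the Unix timestamp into megaseconds and remaining seconds.
    epoch_seconds = megaseconds * 1_000_000 + seconds
    moment = datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)
    return moment.strftime("%Y%m%d")


def format_date_field(term, formatter):
    """Hypothetical failure mode: with no usable formatter, fall back to str()."""
    if formatter is None:
        return str(list(term))
    return formatter(term)


term = ("bert", "time", 1559, 238096, 0)
print(format_date_field(term, bert_time_to_yyyymmdd))  # 20190530
print(format_date_field(term, None))  # ['bert', 'time', 1559, 238096, 0]
```

So the garbage value is the same timestamp (1559238096 seconds since the epoch, on 30 May 2019), just never rendered into the expected format.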

Dunedan7y ago

That reminds me of a race condition I encountered a while ago:

There was this massive Java application processing hundreds of parallel requests per second. For each request it wrote a line with billing information into a log file. Those lines were fine from a quick glance, but when we tried processing them later we encountered invalid dates. Those log records contained future dates as well as invalid dates like 2019-02-30. Long story short: In the end we figured out that this was caused by the date formatting not being done thread safe (might have been SimpleDateFormat, but I don't remember the details anymore), causing the date components of multiple threads to get interleaved. Ouch, I guess somebody learned a lesson back then.

tuukkah7y ago

Which platform gives you unformatted data if a format can't be determined? Or was the code similar to getDate(format) and format happened to be null? Was the race condition in your code or in the platform? (EDIT: I suppose these would be questions more for the developers of the third-party gateway.)


CornishPasty7y ago

Was the original BERT formatting from the Hub or from something inside the Gateway's network? I always associated BERT with Erlang, do you know if it was involved or if it was something else?

Cheers for this blog post, by the way. It was really informative about the issue, and about how FPS works.

robinson-wallOP7y ago

It's used exclusively inside the Gateway's infrastructure. They're a mix of Java and Erlang but I'm not sure on the proportion.

FPS uses ISO8583 for its messaging format, and I suspect at the edge the Gateway translates it to a BERT blob for passing around internally.


hc917y ago

You guys are using Form3 for FPS, is this correct?

robinson-wallOP7y ago

Ah "any more technical questions"... except this one.

Sorry, but I can't name our partner.

wrboyce7y ago

We can take an educated guess...

https://status.form3.tech/incidents/wyhyxydxgh30

robinson-wallOP7y ago

I would note that this status page says "Our FPS Direct gateway provider".


BillinghamJ7y ago

I believe it's PayPort, which is run by Vocalink - who also run FPS itself.

As far as I understand, PayPort was/is the recommended option for all new "direct" connections. Though it seems that it is also possible to go more-direct into the FPS system itself.

jayelbe7y ago

You might be interested to know that Monzo's API refers to Faster Payments transactions as "payport_faster_payments".

retube7y ago· 12 in thread

What I still don't understand with bank transfers is: what control is there to ensure that debits and credits are offsetting? Doesn't this rely on the bank being honest? Can't the sending bank just not debit the sender's account?

pjc507y ago

Then they've lost money.

The important thing to understand is "clearance and settlement". Banks either maintain accounts with each other ("nostro/vostro accounts") or, within a country, at the central bank. So e.g. Halifax and Monzo will have accounts at the Bank of England.

Settlement will either be immediate or delayed. For immediate, at the same time as Halifax is sending a "please credit £10 to Bob" message to Monzo, they will send a message to the Bank of England to transfer £10 between their account and Monzo's.

For delayed settlement, the banks wait until the end of the day, add up the total money in each direction, subtract the difference, and transfer that.

A lot of work goes into making sure all the necessary entries line up. So, in the example, if the bank sent a payment message but didn't debit their user's account, either they would have made the central bank transfer (in which case they've lost £10 and effectively given it to their user's account), or they haven't, in which case Monzo will notice and demand payment for the discrepancy.

Banking is eventually consistent, and has been for centuries.

jessaustin7y ago

The sending bank is on the hook for that money when everything gets settled up, so it has a strong incentive to perform that debit.

retube7y ago

what is this settling up? how does that work?

lukevp7y ago

I can speak to how it works for card processing and ACH is most likely similar. To participate in payment processing in the banking network, you have to have a Merchant ID that is tied to a bank account. The processor or gateway is holding a suspense/escrow account on your behalf throughout the day and when a batch of transactions settles, it will resolve the balance difference with your bank account.

The amount of payments allowed into or out of your escrow account is set by the processors based on your company's financial health and a risk analysis since if you just debited say $10 million from the escrow account and you only had $5 in your account, the processor would need to collect that debt from you, and they do not have a guarantee that they'll be able to do so. This is how it works for debit cards and bank accounts since $ amounts are real.

It's slightly different for credit cards because the $ amount is in a way fictional, so they don't do the escrow holding and just temporarily "allocate" part of the credit limit (this is called an authorization) and when it is settled this is "captured", which enqueues the authorization for future processing. A few days later it will process and be included in a lump sum of funding into your merchant account.

This reply is my personal understanding and meant for educational reasons and doesn't represent opinions or viewpoints of any company, and should not be considered advice of any kind and it may be inaccurate.

adambyrtek7y ago

Another blog post from Monzo explains this in more details, see the Net Settlement section: https://monzo.com/blog/2016/01/20/how-do-bank-payments-work/

jacobush7y ago

Search for bank clearing house

twic7y ago

It could, but then it has effectively extended a loan to the sender. Would they want to do that?

retube7y ago

why is that? two records have been updated in two databases. this isn't blockchain, there's nothing to enforce they are consistent (that I can tell)

PeterisP7y ago

Bank account balances aren't money, they're IOU's, a record of debt - a bank handing out a statement that shows an account balance of $100 quite literally means the bank saying "we acknowledge that [as of date X] we owe you $100" and nothing else.

The standard money transfer from Bob to Joe is a deal where the bank says "ok, Bob, we owed you $100 but if you want that then we'll now owe that $100 to Joe instead".

It's also worth noting that's just a record of debt not reality - there has to be some legal basis for that transaction to actually change the liability between the bank and the account holder, simply changing the balance in a database doesn't change the amount of debt but just the record i.e. "bank's opinion" of that debt; and if that record/opinion is wrong, then that balance can and will be disputed, and if the dispute can't be resolved otherwise, then it'll be up to courts to decide if that debt is valid or not.

If you record just the credit without the debit, then it's the equivalent of the bank unilaterally agreeing to new debt, the bank asserting that it now owes $100 to Joe just because. It's free to do that, but it would mean that its "books won't balance" i.e. their accounting isn't consistent with itself and doesn't match reality, so to properly account for that transaction they'd have to book a debit to their profit&loss statement since they lost money by acknowledging that balance increase i.e. debt without an offsetting balance/debt decrease to someone else.

robinson-wallOP7y ago

The sending bank will have their Bank of England settlement account debited by the central FPS system, based on the central FPS system's view of the world, at the end of the settlement cycle.

The recipient bank will receive the money into their settlement account at that time. If the sending bank doesn't debit their customer then both sending and receiving customers will have the money in their accounts, but the sending bank will be out of pocket.
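A toy ledger makes the "out of pocket" point concrete. This is only a sketch: the `Bank` class, its fields, the customer names and the £10 amount are all hypothetical, and real settlement involves far more than two integers. It treats the settlement balance as the bank's asset and customer balances as its liabilities.

```python
from dataclasses import dataclass, field


@dataclass
class Bank:
    """Toy model: settlement balance is an asset, customer balances are liabilities."""
    settlement_balance: int  # pence held at the central bank
    customers: dict = field(default_factory=dict)  # name -> pence owed to customer

    def net_worth(self):
        return self.settlement_balance - sum(self.customers.values())


def settle(sender, receiver, amount):
    """End-of-cycle settlement: move funds between the two settlement accounts."""
    sender.settlement_balance -= amount
    receiver.settlement_balance += amount


sender = Bank(settlement_balance=100_000, customers={"alice": 5_000})
receiver = Bank(settlement_balance=100_000, customers={"bob": 0})
before = sender.net_worth()

# The receiving bank credits Bob, but the sending bank never debits Alice.
receiver.customers["bob"] += 1_000
settle(sender, receiver, 1_000)

print(before - sender.net_worth())  # 1000 pence: the sending bank has lost £10
print(receiver.net_worth())         # 100000: the receiving bank is unaffected
```

Both customers hold the money, the receiving bank is square, and the missing debit shows up as a loss on the sending bank's own books.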

nathankunicki7y ago

The enforcing entity here is the central bank they both have accounts with. If a customer of Monzo sends £10 to a customer of RBS, then that money never leaves the central bank, and both records are just updated. But the total amount in the central bank must still add up to the accounts of both Monzo and RBS, otherwise there is a discrepancy.

The settlement process through a central bank is a way of ensuring that banks don't need to literally send truckloads of cash to each other at the end of the day.

Monzo says to the central bank, "today I sent RBS £1,500,000", and RBS says to the central bank, "today I sent Monzo £1,200,000". So the central bank just debits Monzo's account with them by £300,000, and credits RBS's account with them by £300,000. The total amount in the central bank remains the same.

So, sure, a bank could claim they sent less money to another bank than they did, but eventually the numbers wouldn't add up, and it would trigger a bucketload of auditing, likely resulting in revocation of banking licenses, and legal issues for both the bank and people involved.
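The arithmetic in the Monzo/RBS example can be sketched in a few lines. This is an illustration of net settlement in general, not of how the Bank of England or FPS actually computes it; `net_positions` is a hypothetical helper and the figures are the ones quoted above.

```python
from collections import defaultdict


def net_positions(gross_payments):
    """Return each bank's net position: positive = credited, negative = debited."""
    positions = defaultdict(int)
    for sender, receiver, amount in gross_payments:
        positions[sender] -= amount
        positions[receiver] += amount
    return dict(positions)


# Gross totals for the day, in pounds, taken from the example above.
gross = [
    ("Monzo", "RBS", 1_500_000),
    ("RBS", "Monzo", 1_200_000),
]

positions = net_positions(gross)
print(positions)                # {'Monzo': -300000, 'RBS': 300000}
print(sum(positions.values()))  # 0
```

The net positions always sum to zero, which is the "total amount in the central bank remains the same" invariant; a bank misreporting its gross figures would break that sum.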

twic7y ago

You've had a few answers about the use of central bank accounts; i'll note that this process is called net settlement:

https://en.wikipedia.org/wiki/Net_settlement

There is also a technique involving things called nostro/vostro accounts, where banks have money on deposit with each other, and the sending bank's deposit with the receiving bank is used to cover transfers:

https://en.wikipedia.org/wiki/Nostro_and_vostro_accounts

Of course, then they need to keep their accounts topped up, and they can do that by transfers through other banks, which might be central banks or commercial ones. The nostro/vostro system is suitable for use where banks don't trust each other so much, eg because they are in different countries. I think it was used more in the past, before reliable central settlement schemes were established, but i'm not sure.

You can think of net settlement as being a bit like nostro/vostro where the accounts have infinite free overdraft facilities, and so the banks never build up a credit balance, and just settle their debts at the end of the day.

mwexler7y ago· 7 in thread

What I think is fascinating is not just that we applaud Monzo for this, but that we allow other important services that control our lives to get away with revealing nothing about what happened or what they've changed to prevent it. Can you imagine any large bank (for the US, say JP Morgan Chase, Citi, Bank of America, etc.) putting out a note with this level of transparency, accountability, and clear direction to change?

For more about what makes a good apology, see https://withoutbullshit.com/?s=apology&submit=Search by Josh Bernoff, a former Forrester editor and a very direct writer.

FigmentEngine7y ago

In the UK they do have to: https://www.fca.org.uk/firms/fca-mandated-and-voluntary-info...

dx7tnt7y ago

I can't imagine one of those giant banks having this kind of outage.

robinson-wallOP7y ago

It happens all the time, and they just don't tell you. We get automated notices when banks connect and disconnect from the Faster Payments network, something happens every few days. Not always this length, but occasionally.

Just yesterday a major high street bank stopped sending payments for an hour, and was telling customers on Twitter that there were no problems.

Hell, the central system (what I called the Hub in this article) had a 12 hour split brain meltdown last July which had banks emailing each other spreadsheets back and forth for two weeks afterwards.


kbody7y ago

Various banking (sub)systems break all the time. We just never get any postmortems or public apologies.

This reminds me of the saying "Never admit a wrongdoing and you'll never be wrong".

It's great that we get several of those new startup banks (Monzo, N26 etc.) that provide superior experience and slowly show what horrible things traditional banks were getting away with.

thedanbob7y ago

Some months ago I transferred some money between two accounts at different banks. The money arrived _twice_ in two different accounts of mine at the destination bank. I contacted them and asked them to cancel the second transfer, but they just told me it would get automatically denied for insufficient funds at the origin. They also warned me that if it happened again my account might have restrictions placed on it.

An apology would have been nice, but I suppose unwarranted threats are more in character.

CloudNetworking7y ago

Google TSB / Banc Sabadell

clankstar7y ago

Fast Payments break between EU banks all the time.

ziddoap7y ago· 5 in thread

Clear, detailed but accessible, plans in place moving forward, apology read sincerely, providing support to affected customers immediately, and answering follow up questions to technical users who are interested in more detail.

A+ job on handling the unfortunate situation, Monzo.

We can only hope more companies follow this great example.

ccrush7y ago

Every time I see a company say "we're sorry" I can't help but think about the South Park episode where the BP CEO says "we're sorry!" It's either that or the one with the Time Warner employees with the nursing flaps in their shirts.

_carl_jung7y ago

Using "we" in any public announcement, writeup, blog post, etc. is always in danger of sounding contrived.

ziddoap7y ago

Personally, I prefer something written in a relaxed style rather than a formal-voice only in most cases, especially for blog posts.

Formal only generally comes across, to me, as cold and distant. Great for a persuasive essay or other mediums where you want to remove the topic from the author, not so great for communicating with your audience and wanting to come across as sincere.

If anything, a strict formal-voice only blog post would come across, to me, as contrived.

To each their own.


hombre_fatal7y ago

> Time Warner employees with the nursing flaps in their shirts.

I had to look that one up: https://www.dailymotion.com/video/x15ij62 (3min)

tudorizer7y ago

Not to take away from their good communication, but they are still relatively small. If they keep this up while growing, then they might have some secret sauce.

playpause7y ago· 5 in thread

This is a perfect post-mortem. Their communication and support has always been really good. I've been using Monzo as my primary bank account ever since they registered as a bank, and I've converted a lot of friends to it. But... over the last year, the iOS app has fallen in quality: long UI freezes, frequent sign-outs with no explanation, silly UI bugs. My non-technical friends have noticed the same issues. It's a real shame.

Nextgrid7y ago

Agreed. This caused me to leave them for Starling Bank, though I’m considering switching back - I’d rather take a faulty app but good customer support than a good app but no support at all.


kingofspain7y ago

Recently updated the iOS app and it's definitely got quite laggy, especially on the pots screen (I only have about 5 pots too). Used to be so nippy as well.

madeofpalk7y ago

I made the adventurous mistake of upgrading my main iPhone to iOS 13 and the Pots screen just refuses to load - tapping the icon freezes the app. As I keep most of my money in pots, I didn't have any money until I got out an old phone and installed Monzo on it.

andyrew7y ago

This is fixed for me in the latest Monzo update (came out yesterday)

_fzslm7y ago

If you still have problems after the latest App Store update, their TestFlight build fixed that issue.

yingw7877y ago· 4 in thread

@robinson-wall Nice writeup, definitely raises the standards in the banking industry! I have a few questions:

1. Was this post-mortem part of an official process or something of an individual initiative? I saw it published on the blog, but it might be helpful to have this information disambiguated from marketing material on a separate site: https://status.cloud.google.com/summary

2. I'm not sure how payment processors work, but would having multiple payment processors from Monzo's interface make sense from a cost/benefit perspective?

3. Any plans to expand to the U.S. anytime soon, or recommend any banks that follow Monzo's best practices? ;-)

robinson-wallOP7y ago

1. A mix of both, we have a culture of being transparent by default - it's one of the first things that attracted me to come and work here. I was the incident lead for this on the day, and volunteered to write up this post-mortem. I did have help from colleagues in the marketing team to try and make this as accessible as possible.

As another poster mentioned we already have a status page where we post about incidents as they happen (though obviously not in quite as much detail as here). Personally I think our main blog is a reasonable place to have this.

2. Multiple redundant payment processors would be great, but ultimately infeasible. As a settling FPS participant we have to have a single Bank of England settlement account, tied 1:1 to a "bank code". Multiple sort codes map to a single bank code, and migrating sort codes between bank codes is non-trivial.

It'd be great if we could migrate sort codes easily between redundant connections, but as we build our own Gateway we'll have complete control over how our failover mechanisms work. Here's to much greater uptime in the future!

3. As another commenter mentioned - yes! We're just doing staff testing for now, but we've got a waiting list up. It'll be a prepaid product issued by another bank before we get a US banking license, just like we were in the UK a couple of years ago.

breakingcups7y ago

I'll ask unashamedly, any plans for the EU?

gr-eg7y ago

3. https://monzo.com/blog/2019/06/13/monzo-usa/

adwww7y ago

The article links to a status page - https://monzo.statuspage.io/

spiderfarmer7y ago· 4 in thread

Somewhat related question: How can Monzo offer 1.55% interest while the interest with most banks is around 0.3%?

djhworld7y ago

The savings accounts are offered by third party banks, not Monzo.

The 1.55% rate is fixed term for 12 months with no withdrawals

justusthane7y ago

Maybe this comment was about UK banks which I can't speak to, but we have banks in the US (Ally as one that I'm familiar with and use, but there are many more) that offer high-interest savings accounts at above 2% interest.

bradstewart7y ago

Could be keeping a good chunk of their deposited cash in money-market funds (or similar) which are currently paying around 2-2.5%, while providing customers with immediate access to funds with their remaining cash (insulating their customers from the delays associated with buying/selling those funds and so forth).

Jonnerz7y ago

Likely something to do with the fact Monzo are extremely lean and have minimal overheads compared to traditional banks.

baby7y ago· 4 in thread

I've been using Monzo less and less since I moved to the US due to the cost of topping it up. It's really sad that there is no true equivalent to Monzo here :(

afarrell7y ago

Monzo just launched in the US. See their announcement at https://monzo.com/blog/2019/06/13/monzo-usa/

There is a waitlist to join though.

zn447y ago

https://monzo.com/blog/2019/06/13/monzo-usa/

Qasaur7y ago

TransferWise has a debit card and full banking facilities, I assume they are available in the United States?

xchaotic7y ago

Yes, I used the TransferWise borderless card in the US in 2017.

PhantomGremlin7y ago· 3 in thread

The software bug at the heart of the problem:

The bug was in a computer program the Gateway uses to translate payment messages between two formats. When the program was operating under load, the system tried to clear memory it believed to be unused (a process known as garbage collection).

But because it was using an unsafe method to access memory, the code ended up reading memory that had already been cleared away, causing it not to know how to translate the date field in payment messages.

So apparently a dangling reference.

seanmcdirmid7y ago

Is that really proper use of the term garbage collection? If you are doing memory management manually, it sounds more like the lack of garbage collection. Unless they were using an unsafe GC for C/C++?

jey7y ago

Sounds like they were hanging onto a pointer to an object allocated by GC. For example, in Python/C API if you use a borrowed reference PyObject* after it has gone out of scope and been GC'd.

rahilb7y ago

I'm pretty sure they're a Java shop, and they're referring to sun.misc.Unsafe and the goodies inside that let you manually allocate memory.

sandGorgon7y ago· 2 in thread

Just curious - whats the stack you guys run ?

I'm wondering what do you use to call these external processing APIs. I assume these are blocking calls.

robinson-wallOP7y ago

There's a good writeup by Oliver, our head of engineering, about our tech stack on our blog[1] with an accompanying Kubecon talk[2].

TL;DR- Largely Go microservices running on k8s, with http-based RPC calls for synchronous communication, and kafka for asynchronous communication.

As for sending and receiving of this kind of payment message, they are largely async but it does depend on the payment system we're talking about. When we build our own FPS gateway we're going to have to have something to manage "sessions" (TCP connections) which will block waiting for a response to an individual payment message. Right now our communication with our third party Gateway is via a queue.

[1]: https://monzo.com/blog/2016/09/19/building-a-modern-bank-bac...

[2]: https://www.youtube.com/watch?v=YkOY7DgXKyws

sandGorgon7y ago

actually I kind of like this one - https://softwareengineeringdaily.com/wp-content/uploads/2018...

We have learnt a lot from you guys as we build out similar systems in India. Thank you for putting this stuff out!

Quick question that I have always wondered about - would you have used something like Uber Cadence (https://github.com/uber/cadence) as the core of your infrastructure if it had been available back then?

kjlfhg87y ago· 2 in thread

Not related to the outage, but any plans to provide banking on pc's instead of just phones and any plans to provide small businesses accounts in the future?

lol7687y ago

Yes, they've had job positions open for more web work. I think they will expand this.

They already offer business accounts. I have one open for my Ltd company.

ownagefool7y ago

I registered my interest a while ago but they haven't given me one yet, so I wouldn't say they offer business accounts, so much as they're going to/are testing this.

Starling does offer business accounts now, but you can only have one Person of Significant Control, i.e. over 25% owner. There is no monthly fee with their offering though, so it's probably the better offer.


peteretep7y ago· 2 in thread

> They later tell us they believed that datacentre was introducing the corruption

What now? Their datacentre was ... rewriting (presumably) encrypted packets?

robinson-wallOP7y ago

Sorry, perhaps this isn't very clear as I've tried to simplify the explanation to make it accessible to a wide audience.

What I meant here is they could tell that the corruption was being introduced by some component in their infrastructure, and they were only observing it for messages passing through one of their two active-active sites.

noir_lord7y ago

I understood that as you intended.

It's a fine line between being understandable to laymen and people being pernickety, sadly.

edraferi7y ago· 1 in thread

This is a very well-written postmortem. It’s clear enough that a non-technical customer affected by the outage could understand the explanation, at least at a high level. It’s also detailed enough that a technical person can trace the root cause to a buggy garbage collector in a format transformation function. The whole thing uses clear language with a bare minimum of jargon. Nice work!

aeorgnoieang7y ago

> the root cause to a buggy garbage collector

Or, rather, unsafe access of memory managed by a garbage collector:

> The bug was in a computer program the Gateway uses to translate payment messages between two formats. When the program was operating under load, the system tried to clear memory it believed to be unused (a process known as garbage collection).

> But because it was using an unsafe method to access memory, the code ended up reading memory that had already been cleared away, causing it not to know how to translate the date field in payment messages.

gregdoesit7y ago

This is a well-written post-mortem for public reading. I encourage people to read through it.

Being someone who also works in the payments space currently, relying on gateways, I have gone through several similar outages, where we detected a gateway issue causing an outage, notified the gateway who ack’d... and then we waited. More than one time, like Monzo, we built a workaround on our end, before the gateway provider could even mitigate the outage.

Hats off to the Monzo team, who clearly have a solid oncall and incident mitigation strategy in-place. They determined an outage happening in 4 minutes, built a workaround as best they could and deployed it in 2 hours, while it took the gateway provider 9 hours only to mitigate their change that caused the issue the first place. Granted the issue seemed complex, this is still slow.

Unfortunately, in cases like this, the best one can do is make sure there is a clear SLA in-place with the third party, with a contract stating financial liability in case the third party fails to meet this SLA. Monzo will not tell us much about this part, but I suspect the gateway will have to pay a hefty fee to Monzo, as their availability dropped to under 99% for this month, which should trigger payments/fee reductions from the third party with a well-written contract. It is good to see they are pushing the third party to do a proper post-mortem and prevention actions, as well as holding them accountable.

Nice work!
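As a rough check on the "under 99% for this month" figure, here is a back-of-the-envelope sketch. It assumes the full 9 hours counts as downtime and that availability is measured over the calendar hours of May; the actual contract terms are not public, so both are assumptions, and `availability` is a hypothetical helper.

```python
HOURS_IN_MAY = 31 * 24  # 744 hours in the measurement period (assumed)


def availability(downtime_hours, period_hours=HOURS_IN_MAY):
    """Fraction of the period during which the service was up."""
    return 1 - downtime_hours / period_hours


monthly = availability(9)
print(f"{monthly:.2%}")               # 98.79%
print(f"{0.01 * HOURS_IN_MAY:.2f}")   # 7.44 hours of downtime allowed at 99%
```

Under those assumptions, anything beyond about seven and a half hours of downtime in a 31-day month falls below a 99% target.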

ablation7y ago

Thank you for posting this. Great read, and nice to see the team at Monzo sharing this level of detail: consumable but still detailed.

GordonS7y ago

The project I'm currently working on has a QA lag of 4-5 days for code to reach production.

I'm seriously impressed they were able to deploy mitigations to production twice in the same few hours, especially given they are a bank (and a small one, at that), and the consequences of fucking up are enormous.

It's been said here many times already, but I'll join those saying "well done" for handling this so well, and for the extraordinary level of transparency!

osrec7y ago

Tldr; unsafe memory management in a third party's software corrupted dates (under high load, due to garbage collection), causing transactions to fail or get reversed.

nvr2197y ago

You can do anything at monzocom

j / k navigate · click thread line to collapse

94 comments

81 comments · 18 top-level

robinson-wallOP7y ago· 12 in thread

I just posted this semi-technical post-mortem on Monzo's about why we had an outage with Faster Payments (UK bank transfers) last Month.

I'll hang around here to answer any more technical questions if anyone's interested.

kennydude7y ago

Nice to see how Faster Payments actually work in a nice understandable way.

That does sound weird how it happened by a corrupted date formatter. I'm assuming it's something like the formatter reset itself back to the langauge default

robinson-wallOP7y ago

Dunedan7y ago

That reminds me of a race condition I encountered a while ago:

tuukkah7y ago

3 more replies

CornishPasty7y ago

Was the original BERT formatting from the Hub or from something inside the Gateway's network? I always associated BERT with Erlang, do you know if it was involved or if it was something else?

Cheers for this blog post, by the way. It was really informative about the issue, and about how FPS works.

robinson-wallOP7y ago

It's used exclusively inside the Gateway's infrastructure. They're a mix of Java and Erlang but I'm not sure on the proportion.

FPS uses ISO8583 for its messaging format, and I suspect at the edge the Gateway translates it to a BERT blob for passing around internally.

1 more reply

hc917y ago

You guys are using Form3 for FPS, is this correct?

robinson-wallOP7y ago

Ah "any more technical questions"... except this one.

Sorry, but I can't name our partner.

wrboyce7y ago

We can take an educated guess...

https://status.form3.tech/incidents/wyhyxydxgh30

robinson-wallOP7y ago

I would note that this status page says "Our FPS Direct gateway provider".

1 more reply

BillinghamJ7y ago

I believe it's PayPort, which is run by Vocalink - who also run FPS itself.

As far as I understand, PayPort was/is the recommended options for all new "direct" connections. Though it seems that it is also possible to go more-direct into the FPS system itself.

jayelbe7y ago

You might be interested to know that Monzo's API refers to Faster Payments transactions as "payport_faster_payments".

retube7y ago· 12 in thread

pjc507y ago

Then they've lost money.

For delayed settlement, the banks wait until the end of the day, add up the total money in each direction, subtract the difference, and transfer that.

Banking is eventually consistent, and has been for centuries.

jessaustin7y ago

The sending bank is on the hook for that money when everything gets settled up, so it has a strong incentive to perform that debit.

retube7y ago

what is this settling up? how does that work?

lukevp7y ago

adambyrtek7y ago

Another blog post from Monzo explains this in more details, see the Net Settlement section: https://monzo.com/blog/2016/01/20/how-do-bank-payments-work/

jacobush7y ago

Search for bank clearing house

twic7y ago

It could, but then it has effectively extended a loan to the sender. Would they want to do that?

retube7y ago

why is that? two records have been updated in two databases. this isn't blockchain, there's nothing to enforce they are consistent (that I can tell)

PeterisP7y ago

The standard money transfer from Bob to Joe is a deal where the bank says "ok, Bob, we owed you $100 but if you want that then we'll now owe that $100 to Joe instead".

robinson-wallOP7y ago

The sending bank will have their Bank of England settlement account debited by the central FPS system, based on the central FPS system's view of the world, at the end of the settlement cycle.

nathankunicki7y ago

The settlement process through a central bank is a way of ensuring that banks dont need to literally send truckloads of cash to each other at the end of the day.

twic7y ago

You've had a few answers about the use of central bank accounts; i'll note that this process is called net settlement:

https://en.wikipedia.org/wiki/Net_settlement

https://en.wikipedia.org/wiki/Nostro_and_vostro_accounts

mwexler7y ago· 7 in thread

For more about what makes a good apology, see https://withoutbullshit.com/?s=apology&submit=Search by Josh Bernoff, a former Forrester editor and a very direct writer.

FigmentEngine7y ago

In the UK they do have to: https://www.fca.org.uk/firms/fca-mandated-and-voluntary-info...

dx7tnt7y ago

I can't imagine one of those giant banks having this kind of outage.

robinson-wallOP7y ago

Just yesterday a major high street bank stopped sending payments for an hour, and was telling customers on Twitter that there were no problems.

Hell, the central system (what I called the Hub in this article) had a 12-hour split-brain meltdown last July which had banks emailing each other spreadsheets back and forth for two weeks afterwards.

kbody7y ago

Various banking (sub)systems break all the time. We just never get any postmortems or public apologies.

This reminds me of the saying "Never admit a wrongdoing and you'll never be wrong".

It's great that we get several of those new startup banks (Monzo, N26 etc.) that provide a superior experience and slowly show what horrible things traditional banks were getting away with.

thedanbob7y ago

An apology would have been nice, but I suppose unwarranted threats are more in character.

CloudNetworking7y ago

Google TSB / Banc Sabadell

clankstar7y ago

Fast Payments break between EU banks all the time.

ziddoap7y ago· 5 in thread

A+ job on handling the unfortunate situation, Monzo.

We can only hope more companies follow this great example.

ccrush7y ago

_carl_jung7y ago

Using "we" in any public announcement, writeup, blog post, etc. is always in danger of sounding contrived.

ziddoap7y ago

Personally, I prefer something written in a relaxed style rather than a strictly formal voice in most cases, especially for blog posts.

If anything, a blog post written only in a strictly formal voice would come across, to me, as contrived.

To each their own.

hombre_fatal7y ago

> Time Warner employees with the nursing flaps in their shirts.

I had to look that one up: https://www.dailymotion.com/video/x15ij62 (3min)

tudorizer7y ago

Not to take away from their good communication, but they are still relatively small. If they keep this up while growing, then they might have some secret sauce.

playpause7y ago· 5 in thread

Nextgrid7y ago

Agreed. This caused me to leave them for Starling Bank, though I’m considering switching back - I’d rather take a faulty app but good customer support than a good app but no support at all.

kingofspain7y ago

Recently updated the iOS app and it's definitely got quite laggy, especially on the pots screen (I only have about 5 pots too). Used to be so nippy as well.

madeofpalk7y ago

andyrew7y ago

This is fixed for me in the latest Monzo update (came out yesterday)

_fzslm7y ago

If you still have problems after the latest App Store update, their TestFlight build fixed that issue.

yingw7877y ago· 4 in thread

@robinson-wall Nice writeup, definitely raises the standards in the banking industry! I have a few questions:

2. I'm not sure how payment processors work, but would having multiple payment processors from Monzo's interface make sense from a cost/benefit perspective?

3. Any plans to expand to the U.S. anytime soon, or recommend any banks that follow Monzo's best practices? ;-)

robinson-wallOP7y ago

breakingcups7y ago

I'll ask unashamedly, any plans for the EU?

gr-eg7y ago

3. https://monzo.com/blog/2019/06/13/monzo-usa/

adwww7y ago

The article links to a status page - https://monzo.statuspage.io/

spiderfarmer7y ago· 4 in thread

Somewhat related question: How can Monzo offer 1.55% interest while the interest with most banks is around 0.3%?

djhworld7y ago

The savings accounts are offered by third party banks, not Monzo.

The 1.55% rate is for a fixed term of 12 months with no withdrawals.

justusthane7y ago

bradstewart7y ago

Jonnerz7y ago

Likely something to do with the fact Monzo are extremely lean and have minimal overheads compared to traditional banks.

baby7y ago· 4 in thread

I've been using Monzo less and less since I moved to the US due to the cost of topping it up. It's really sad that there is no true equivalent to Monzo here :(

afarrell7y ago

Monzo just launched in the US. See their announcement at https://monzo.com/blog/2019/06/13/monzo-usa/

There is a waitlist to join though.

zn447y ago

https://monzo.com/blog/2019/06/13/monzo-usa/

Qasaur7y ago

TransferWise has a debit card and full banking facilities, I assume they are available in the United States?

xchaotic7y ago

Yes, I used the TransferWise borderless card in the US in 2017.

PhantomGremlin7y ago· 3 in thread

The software bug at the heart of the problem:

So apparently a dangling reference.

seanmcdirmid7y ago

jey7y ago

Sounds like they were hanging onto a pointer to an object allocated by GC. For example, in Python/C API if you use a borrowed reference PyObject* after it has gone out of scope and been GC'd.
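
A safe analogue of that failure mode can be sketched in pure Python with `weakref`. This is only an illustration of a non-owning reference outliving its object, not the gateway's actual code; the `Timestamp` class and its value are assumptions. It relies on CPython's reference counting, with `gc.collect()` added as a belt-and-braces step.

```python
import gc
import weakref


class Timestamp:
    """Hypothetical stand-in for the object a formatter would read from."""

    def __init__(self, seconds):
        self.seconds = seconds


owner = Timestamp(1559238096)  # illustrative value only
borrowed = weakref.ref(owner)  # non-owning: does not keep the object alive

alive_before = borrowed() is not None  # the owner still holds a strong reference

del owner     # last strong reference goes away
gc.collect()  # not strictly needed in CPython, kept for clarity

alive_after = borrowed() is not None   # the weak reference now returns None

print(alive_before, alive_after)  # True False
```

The difference from the C case is that a `weakref` reports the object is gone, whereas a raw borrowed pointer would still point at memory that may since have been reused for something else.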

rahilb7y ago

I'm pretty sure they're a Java shop, and they're referring to sun.misc.unsafe and the goodies inside that let you manually allocate memory.

sandGorgon7y ago· 2 in thread

Just curious - what's the stack you guys run?

I'm wondering what you use to call these external processing APIs. I assume these are blocking calls.

robinson-wallOP7y ago

There's a good writeup by Oliver, our head of engineering, about our tech stack on our blog[1] with an accompanying Kubecon talk[2].

TL;DR: Largely Go microservices running on k8s, with HTTP-based RPC calls for synchronous communication, and Kafka for asynchronous communication.

[1]: https://monzo.com/blog/2016/09/19/building-a-modern-bank-bac...

[2]: https://www.youtube.com/watch?v=YkOY7DgXKyws

sandGorgon7y ago

actually I kind of like this one - https://softwareengineeringdaily.com/wp-content/uploads/2018...

We have learnt a lot from you guys as we build out similar systems in India. Thank you for putting this stuff out!

kjlfhg87y ago· 2 in thread

Not related to the outage, but are there any plans to provide banking on PCs instead of just phones, and any plans to provide small business accounts in the future?

lol7687y ago

Yes, they've had job positions open for more web work. I think they will expand this.

They already offer business accounts. I have one open for my Ltd company.

ownagefool7y ago

I registered my interest a while ago but they haven't given me one yet, so I wouldn't say they offer business accounts, so much as they're going to/are testing this.

peteretep7y ago· 2 in thread

> They later tell us they believed that datacentre was introducing the corruption

What now? Their datacentre was ... rewriting (presumably) encrypted packets?

robinson-wallOP7y ago

Sorry, perhaps this isn't very clear as I've tried to simplify the explanation to make it accessible to a wide audience.

noir_lord7y ago

I understood that as you intended.

It's a fine line between being understandable to laymen and people being pernickety, sadly.

edraferi7y ago· 1 in thread

aeorgnoieang7y ago

> the root cause to a buggy garbage collector

Or, rather, unsafe access of memory managed by a garbage collector:

gregdoesit7y ago

This is a well-written post-mortem for public reading. I encourage people to read through it.

Nice work!

ablation7y ago

Thank you for posting this. Great read, and nice to see the team at Monzo sharing this level of detail: consumable but still detailed.

GordonS7y ago

The project I'm currently working on has a QA lag of 4-5 days for code to reach production.

It's been said here many times already, but I'll join those saying "well done" for handling this so well, and for the extraordinary level of transparency!

osrec7y ago

TL;DR: unsafe memory management in a third party's software corrupted dates (under high load, due to garbage collection), causing transactions to fail or get reversed.

nvr2197y ago

You can do anything at monzocom
