Oh I see dropship is mentioned in the paper, great :)
In any case, it's interesting that they found some previously unknown security holes this way. This again proves that security through obscurity, at least for client software, doesn't work. When will people learn? You can't hide anything on the client from the user, at least not for long.
Presumably Dropbox, through its enormous distribution, is a very fat target, and I find it hard to believe that this published effort is the first instance of such an undertaking. Your average blackhat isn't going to publish his hack but will market it for all it's worth.
Then you get pages like these:
http://1337day.com/exploit/description/19604
(click 'ok')
I don't think the Dropbox team obfuscates their code as a security measure; more likely they do it to deepen their moat a little and to make it a bit harder to write third-party clients against their unpublished APIs.
"The contrast with my visitor the next day couldn't be greater. Through a former colleague I got an introduction to Drew Houston, co-founder and CEO of the vastly successful start-up company Dropbox.
Python plays an important role in Dropbox's success: the Dropbox client, which runs on Windows, Mac and Linux (!), is written in Python. This is key to the portability: everything except the UI is cross-platform. (The UI uses a Python-ObjC bridge on Mac, and wxPython on the other platforms.) Performance has never been a problem -- understanding that a small number of critical pieces were written in C, including a custom memory allocator used for a certain type of objects whose pattern of allocation involves allocating 100,000s of them and then releasing all but a few. Before you jump in to open up the Dropbox distro and learn all about how it works, beware that the source code is not included and the bytecode is obfuscated. Drew's no fool. And he laughs at the poor competitors who are using Java."
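The allocation pattern the quote describes (allocate hundreds of thousands of small objects, then release all but a few) is the classic free-list idea. A toy pure-Python sketch of that pattern; the names (`Node`, `NodePool`) are hypothetical and this is not Dropbox's actual allocator, which the quote says was written in C:

```python
# Toy free-list allocator sketch: recycle released objects instead of
# handing them back to the general-purpose allocator each time.
class Node:
    __slots__ = ("value", "next")  # keep per-object overhead small

class NodePool:
    """Reuses released Node objects via an intrusive free list."""
    def __init__(self):
        self._free = None  # head of the free list

    def alloc(self, value):
        node = self._free
        if node is None:
            node = Node()          # free list empty: really allocate
        else:
            self._free = node.next  # pop a recycled node
        node.value = value
        node.next = None
        return node

    def release(self, node):
        node.value = None
        node.next = self._free      # push onto the free list
        self._free = node

pool = NodePool()
nodes = [pool.alloc(i) for i in range(100_000)]
for n in nodes[5:]:     # release all but a few, as in the quote
    pool.release(n)
reused = pool.alloc("x")  # served from the free list, no new object
```

In CPython this kind of pooling hooks in at the C level via `tp_alloc`/`tp_free` on the type object, which is presumably what the custom allocator mentioned above does.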
Sometime after that, Drew poached Guido from Google. I remember this post. :)
[1] http://docs.python.org/2/c-api/typeobj.html#PyTypeObject.tp_alloc

This "weakness" is no different from the weakness of two-factor authentication in any scenario where login is persistent. I have two-factor authentication for Gmail with "remember me" set so I don't have to log in every day. If someone steals my laptop and gets my cookies, they can log in as me regardless of two-factor authentication, until the cookie authentication expires.
I did this precisely because the laptop is a single point of failure. Steal somebody's laptop and bam, you've got access to everything important to that person.
My Android phone is also encrypted (with a much weaker password) and I can also remotely delete everything on it through Google Apps.
I'd also think that authentication was (should be) a server-side thing: and that at that point you'd get some form of session/token/ticket.
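A minimal sketch of what such a server-side session token could look like, assuming an HMAC-signed, expiry-stamped design; this is purely illustrative and not Dropbox's actual scheme:

```python
# Hypothetical session-token scheme: after the client authenticates
# (password + second factor), the server issues a signed token and then
# verifies it on each request without re-running the full login.
import hashlib
import hmac
import time

SECRET = b"server-side-secret"  # never leaves the server

def issue_token(user_id: str, ttl: int = 3600) -> str:
    expires = str(int(time.time()) + ttl)
    payload = f"{user_id}:{expires}"
    sig = hmac.new(SECRET, payload.encode(), hashlib.sha256).hexdigest()
    return f"{payload}:{sig}"

def verify_token(token: str) -> bool:
    try:
        user_id, expires, sig = token.rsplit(":", 2)
    except ValueError:
        return False
    payload = f"{user_id}:{expires}"
    expected = hmac.new(SECRET, payload.encode(), hashlib.sha256).hexdigest()
    # constant-time comparison, plus an expiry check
    return hmac.compare_digest(sig, expected) and time.time() < int(expires)
```

The point being: the token is the thing that persists on the client, so stealing the client's storage bypasses two-factor auth in any such design, exactly as with the Gmail cookie above.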
It's like when a new iPhone comes out and they throw the custom silicon under electron microscopes. It's entertaining, and I'm sure fun for the people doing it, but fighting information wars against ourselves just seems silly.
There are large problems humans don't have answers to, yet we're busy making things and then figuring out how the things we made work. Madness ensues.
Many technologies have been developed or accelerated through the need to reverse engineer something. I would argue the techniques developed to break the Enigma code during WWII had profound effects on computing generally.
Often reverse engineering a technology can also allow you to make improvements the other party has yet to realise, catalysing new ideas and research.
Not that all this means you are necessarily wrong, although perhaps it is a little too idealistic to hope for a world where information isn't a valuable currency?
Imagine a company where Team Database releases a binary-only library to the rest of the company. They won't tell you how it works and you can't talk to them, but it seems to work well enough. Then one day, Team Website wants to do something else with the database (a new type of query, new type of storage model, something non-trivial). In this backwards company, Team Website spends months reverse engineering the library and protocol to hack their own functionality into it. That's mad, right?
A broad view presents two categories of knowledge: things humans know, and things humans don't know. We keep circling around, rediscovering what other people have already done, while they sit there quite able to tell us what we want to know.
Now, adversarial conditions prevent such blanket sharing: capitalism, sovereign nations, war, etc.
Think of Intel. In some ways, they control the pinnacle of CPU design that humanity can surface at this point in time. We don't have anybody to ask "well, what comes next?" in the 10 year CPU roadmap—we have to discover the future along the way.
We should spend more time asking "well, what comes next?" and less time rediscovering what people already know how to do (modulo it making you better at actually discovering new things, or just for fun, or for cyberwar, etc).
Simply asking Dropbox how this stuff worked would've (probably) never uncovered these security issues.
Edit:
Just wanted to add one more benefit of this attempt at reverse engineering, from the whitepaper's introduction:
> Our work reveals the internal API used by Dropbox client and makes it straightforward to write a portable open-source Dropbox client
Is it not bad enough that the Microsoft and Adobe hegemony forces the entire world to have an attack surface wider than Jupiter, to be exploited at the whims of Eastern European teenagers?
Say I needed to write a custom GPU driver for some device, either to improve performance for a specific application or to work outside the dependency or API constraints of the binary blob (like porting it to another OS). Vendors usually provide no register-level documentation for graphics hardware, so the only way to do this is by reverse engineering.
Another reason for reverse engineering can be to find backdoors and security vulnerabilities (like these guys did) or even for legal reasons to find whether some copyrighted (or GPLed) code was used.
No madness needed at all. Or maybe just a bit.
So the attack only works if either:

- you feel like cracking a 256-bit random value remotely (you can't locally bruteforce it), or
- you have filesystem access.
I'd say both are irrelevant. You can't crack 256-bit values locally, let alone if you have to check the value remotely, and with filesystem access I imagine you can do a whole lot more than just uploading files to someone's Dropbox.
Bypassing two-factor authentication with either of the options is possible though, and I can see the issue, but this is by design. I don't think you want to have to enter your credentials (username, password, second factor) every single time you store a file or check for updates.
But I'm glad to hear that they found no "actual" weakness, that would enable a hacker with only my account name, or who is on my WiFi, to access my Dropbox.
To a _conservative, lean organization_, it's better to constrain customer use cases to known good clients than to handle fallout such as "I lost all my data!" "What rev of client were you running?" "zAxX0r'2 m0D51c|< ph3y3Ldr.0p 0.0.69r."
That said, I could hope for Dropbox to evolve to a more open (ssh-based?) model, though I'm not a security architect :)
> We found that two-factor authentication (as used by Dropbox) only protects against unauthorized access to the Dropbox’s website. The Dropbox internal client API does not support or use two-factor authentication!
Fun find from the source code: There's a module named "gandolf.py" which appears to have something to do with version control.