The FBI stole an Instapaper server in an unrelated raid (opens in new tab)

(blog.instapaper.com)

581 pointsgarethr15y ago250 comments

250 comments

112 comments · 26 top-level

Xk15y ago· 29 in thread

Instapaper stores only salted SHA-1 hashes of passwords, so those are relatively safe.

Obligatory statement on NEVER USING SHA-1 HASHES to make passwords "safe".

Any normal person can brute force millions of SHA-1 hashes (salted however much you want) per second on a GPU.

If the FBI so wanted (although I don't believe they do) I'm sure they could brute force almost every single password in that database. Granted, it's the government and they have better ways of obtaining such information, but if there is someone the FBI is watching on Instapaper's databases and they so wanted, storing the SHA-1 hash of the password all but handed them over to the FBI.

I am now glad my Instapaper password was generated randomly, 16 characters long, and I will now change it just to be safe.

For anyone running a database which stores ussername/passwords, take a look at bcrypt or scrypt. They're millions (no, I am not exaggerating) of time better than SHA-1.

(Edit: Grammar)

dspace15y ago

This is more "crypto nerd imagination", a la the XKCD comic. The FBI doesn't care about the encrypted passwords because it has access to all the content in plaintext. And what else would they need the passwords for? Other accounts on other services? They can just confiscate those servers too, where the content is most likely also in plaintext.

So in this case, where the FBI is involve, using a SHA-1 hash poses no extra security vulnerability.

spoondan15y ago

They can just confiscate those servers too, where the content is most likely also in plaintext.

I imagine that many companies are better prepared to deal with the FBI than this data center was. I have a hard time imagining the FBI going into a Google data center and easily walking out with a few racks. But even if that's too optimistic, I doubt the FBI could go about seizing servers for very long. If nothing else, this would eventually piss off big companies who will lobby Congress to curtail the FBI.

tankenmate15y ago

In this case since the warrant probably didn't allow for the seizure of Instapaper's servers / data you run a serious Fourth Amendment risk of any evidence within being inadmissible. That said even if it is inadmissible the FBI now know things they might not have known before. There is the obvious point that there is very little likelihood of any direct evidence of a crime in Instapaper's data, there maybe indirect or circumstantial evidence though.

adrianscott15y ago

"So in this case, where the FBI is involve, using a SHA-1 hash poses no extra security vulnerability."

meeeh...

remember the fbi is not a person, it's an organization. the org can have bad actors in it who might be able to access the encrypted passwords but not be able to confiscate servers.

also, confiscating a server(s) is much more visible / detectable...

getsat15y ago

> can brute force millions

Modern consumer video cards can do billions per second now. You might as well just store them in plaintext instead of using SHA1/MD5 with or without salting. :/

mwytock15y ago

I dont understand. If you can use mixed-cased, letters and symbols you have 26 * 2 + 20 = 72 possible characters.

72^8 >> 1e9

It would still take more than 8 days to brute force at 1 billion/sec. And using a longer password (16 chars?) would make this a very long time.

Or is there other trick that makes this fast? Or, is it simply that people don't choose random, long passwords?

4 more replies

dragonsky15y ago

As they have the Web code base, you must assume that they have the salt to the hashing... If they actually want to get these passwords, all they have to do is generate rainbow tables using SHA1 and the appropriate salt. We're back to relying on the length and bit depth (range of characters) of the passwords you are trying to find.

1 more reply

IgorPartola15y ago

I have been thinking about switching everything to bcrypt, but there is definitely way too much confusion about bcrypt vs scrypt, how many rounds to set for bcrypt, etc. What is the definitive source for figuring out what the new standard should be? Does anyone have any links to something that's peer-reviewed and approved for use by someone with enough authority to do so?

tptacek15y ago

No there isn't. You only think that because when geeks discuss anything that involves one or more knobs, a huge debate must necessarily ensue about the proper values of those knobs.

Just use the bcrypt defaults. You will be fine. You will in particular be so much better off than salted SHA-1 that this topic will be mooted. Later on, maybe in 5-10 years, you can re-engage with the debate about what a good cost factor for bcrypt will be in 2020.

4 more replies

seiji15y ago

scrypt slides: http://www.tarsnap.com/scrypt/scrypt-slides.pdf

Takeaway: Cost to crack one MD5 password: $1. Cost to crack one scrypt password: $50M to $200B.

You want your login to be slow compared to the rest of your application. It's okay to take half a second to verify a login.

1 more reply

oskarkv15y ago

I'm just wondering: What if I used SHA-1 a million times on the password, i.e. hashing the hash over and over. Wouldn't that make it much more time-consuming for an attacker? Or am I missing something? The input every time but the first would be a random-looking 160 bit number, so it would be hard to guess. And if the attacker wanna look for common passwords in a dictionary the attacker must hash them a million times, no?

rdl15y ago

Absolutely. That's essentially PBKDF2 (http://en.wikipedia.org/wiki/PBKDF2).

You usually add a salt (an additional string which is stored in the clear, but which makes your local instance globally unique, so the attacker can't precompute value to hash mappings ("Rainbow Tables" [which are faster to make if you have alien technology, from what I've heard]) for all sites.

I'd still suggest using bcrypt or scrypt.

1 more reply

joevandyk15y ago

So if I'm using SHA-1 already to store passwords, what are my options for moving to a different system? I assume there's no way to rehash the passwords?

Xk15y ago

You have two choices that I see:

1. The next time a user logs in to your system and you verify against the SHA-1 hash that they are who they say they are, recompute the correct hash for bcrypt. Then, delete the SHA-1 hash. It does you no good to have a bcrypt version if you keep the SHA-1 version around.

2. Generate the bcrypt hash from the SHA-1 hash. That is, pretend that the SHA-1 hash is the user's password. This isn't as clean (your password authentication software will then have to do SHA-1 followed by bcrypt) but it means you'll be able to migrate your entire database all at once if you so choose. This also causes a very (very, very) slightly higher chance of password collisions, although there's not much to worry about from that.

1 more reply

holdenk15y ago

I've had the "joy" of doing password migrations in the past when moving authentication systems. We made our auth chain support both methods and then when someone logged in with an old method it stored the password in the new method and deleted the old record. This was on a system with only a few hundred users though, so YMMV.

mapgrep15y ago

Convert as sessions expire and people sign in again. You'll obviously need a db column to keep track of which passwords are converted/unconverted, e.g. password_algo.

meow15y ago

You can bcrypt your current sha1 hashes.

kindly15y ago

I currently use a sha hash (with salt) but rehash it x amounts of times. I have changed x over the years to be larger to get an acceptable trade off in computation time. Why is bcrypt much better than this? Is it because the algorithm is less gpu friendly?

Xk15y ago

What you describe is basically PBKDF1. If you wanted to make it slightly better, you could go with PBKDF2. It's true that bcrypt is better in some ways, but you're fine with what you're doing now. If you really wanted to improve on things you could go with scrypt which eats memory also, but it's more difficult to get things to work right.

mbreese15y ago

That was my first thought too. Not only does the FBI have the salted hashes, but they also have a copy of the code for the website. So they know what the salt values are. This makes it even easier to brute force the hashes.

tptacek15y ago

You've been downmodded because password hash salts are public nonce values; usually, schemes that depend on "secret salts" are crackpot alternatives to secure password hashes.

(I didn't downmod you).

2 more replies

cheez15y ago

Can you describe why it's better?

tptacek15y ago

http://codahale.com/how-to-safely-store-a-password/

(Be prepared for your comment score to visit the grey depths if you attempt to relitigate Coda's blog post here and don't know exactly what you're talking about.)

2 more replies

5l15y ago

He already did:

"Any normal person can brute force millions of SHA-1 hashes (salted however much you want) per second on a GPU."

This is not true of bcrypt.

tbrownaw15y ago

It takes much longer to compute the hash of a given password, which essentially makes it as if everyone chose passwords with a couple extra bytes of entropy in them.

marbletiles15y ago

Was far happier when he didn't store passwords at all, tbh.

lwat15y ago

Are you joking?

2 more replies

derrickpetzold15y ago

Just store in plaintext because I am already assuming you are. All the this talk about sha-1 vs bcrpyt vs scrpyt is nice and all but I have little faith that most companies care about this as much as HN does. I believe that most people are using the default password storage mechanism for their framework which are already known to be easy to break if the database is compromised. But all of that is mute anyways. Unless you have access to the site's source how would you know if they are hashing at all much less which one they are using? The best practice is to use a random password for each site you use. I just don't see any point in having an rememberable password for websites and hashing just leaves a false sense of security as illustrated by md5.

Xk15y ago

... what?

> Just store in plaintext because I am already assuming you are.

No, actually, I don't think I will store plaintext passwords.

> All the this talk about sha-1 vs bcrpyt vs scrpyt is nice and all but I have little faith that most companies care about this as much as HN does.

So what? Just because other people don't do it doesn't mean you don't have to also. Fortunately for us, there are a lot of startup founders here who might read this and learn something.

> I believe that most people are using the default password storage mechanism for their framework which are already known to be easy to break if the database is compromised.

I disagree. I think most people use SHA-1 because they know better than to store plaintext passwords. What they don't know is that it's terribly broken.

> But all of that is mute anyways.

No, it's really not.

> Unless you have access to the site's source how would you know if they are hashing at all much less which one they are using?

There are two problems here. (1) If you have access to the site's password database, there's a really good chance you have access to the entire database, and can look up how they're doing it. (2) Even if you can't lookup how they're doing it, you just try them all and find which one it is. I'd bet you money that if someone's hashing passwords, they're using one of {MD4, MD5, SHA0, SHA1, SHA2, DES}. If, god forbid, they're not using one of those and actually wrote their own hashing algorithm, you have even more to worry about.

> The best practice is to use a random password for each site you use.

For sure, no doubt about it. But what we're talking about here is the best practice for application developers, not the users. The users can't do anything about how their password is stored.

> I just don't see any point in having an rememberable password for websites and hashing just leaves a false sense of security as illustrated by md5.

Or, you know, you could use bcrypt and be secure about it.

1 more reply

yuvadam15y ago· 15 in thread

I'm trying to think of an analogy which can explain why this might be reasonable from the FBIs perspective.

Suppose you were using a shared storage space (shared servers, or server farm) with several other dudes. One of them is a drug dealer. One day the police/FBI decide to raid the storage space since the drug dealer has been using it to store illegal drugs.

Is it not reasonable to consider this collateral damage (which, granted, is totally unnecessary) during law enforcement operations?

I'm not saying this is OK in any case, but might this not be a reasonable move by the law enforcement agencies?

cheald15y ago

It is not reasonable if the FBI does not have a warrant for your servers(/storage space). Instapaper is completely right to call this "theft".

If his servers are included in the warrant because they were suspected of housing whatever it is the FBI was after, and the court granted the FBI the right to seize them, then yeah, it's reasonable.

If he was sharing a physical machine with the bad guys, then yeah, sorry, that's collateral damage. However, if he was on his own separate leased machine, there is absolutely no reason for the FBI to seize it. It'd be like them executing a seizure warrant on one of those self-storage spaces, and seizing the contents of all the adjoining compartments (which the person being investigated would have had no access to) just because.

pavel_lishin15y ago

Do we know what the warrant stated? If it authorized them to take the rack containing the server they were after, then this is legal, if unfortunate.

If the police have a warrant for my apartment, and you happen to leave your backpack and server, your stuff will most likely be confiscated, along with mine, if it interests the police.

1 more reply

nkassis15y ago

I'm guessing they could have asked to take the whole rack as to not have to tell the hosting company about the raid and risk alerting the target. They also did the raid in the middle of the night which shows they were probably trying to avoid alerting the target.

They probably didn't have anyway to know which machine it was just which rack it was. They also probably didn't have to tell the hosting company directly just the facility that they were raiding.

1 more reply

GHFigs15y ago

The problem is that with blade servers like DigitalOne provided, both of these things can be true at the same time.

1 more reply

m0nastic15y ago

I can certainly think of scenarios in which this action was reasonable from the FBI perspective.

I don't like to be in the position of defending the FBI (my own personal and professional relationship with them is complicated), but I think the following situation is plausible (which isn't to say it's what happened, as we don't know):

FBI determines the originating IP address of whatever their investigation is targetting (based on published information, it looks like a "scareware" operation").

FBI determines the IP address is "owned" by an overseas hosting provider, and that the physical servers are in a datacenter in the U.S.

FBI obtains a warrant for the seizure of all associated computing equipment (which may very well include the upstream devices used by the hosting provider).

FBI executes warrant at datacenter, sees that the servers are actually blades in a chasis; takes entire chasis (as reconstructing the data later on may require that the servers be bootable.)

The very last forensic case I worked involved having to acquire evidence from a server which was hosting a web application by a hosting provider. This was a shared hosting scenario, so in addition to acquiring the targeted information, all other customers on the server were also effectively offline (as the server was being imaged, and later as the original hard drives were entered as evidence).

Now, obviously, that isn't the exact same situation as what is described here, but in the event that the servers were blades, I don't think it's outside the realm of possibility to think that the entire chasis would need to be retrieved.

gwright15y ago

Consider an analogy. The FBI gets a valid warrant for the servers belonging to a company with a street address of "101 Main St, Somewhere, DC". The building at 101 Main St. is a multi-tenant, multi-story, office building.

If the FBI seized all the computer equipment in the entire building or even just the computers on the same floor as the targeted company but belonging to other companies who just happen to be physically adjacent to the targeted company, would it seem reasonable?

1 more reply

PatrickTulskie15y ago

It seems like either the FBI didn't pay attention to the information given to them by DigitalOne or DigitalOne had poor information about where their servers are located.

The picture painted here is that the FBI came in and hastily took a bunch of equipment without making sure they were taking the right stuff. If that is accurate, then it's likely they might have missed a server with data on it that they needed for their case. Moving quickly and causing collateral damage in a relatively safe environment where you actually have the time to triple check your work is inexcusable on all fronts.

nkassis15y ago

I was under the impression that DigitalOne wasn't even informed (they were in sweden or something) until 3 hours after the incident (from the NYT article).

smallblacksun15y ago

Or they don't trust that DigitalOne (or some employees of DigitalOne) aren't collaborating with their target.

notatoad15y ago

this is not shared hosting. the server taken belonged to instapaper. being located in the same datacenter should not be grounds for seizure.

if you're looking for a metaphor, think about a self-storage facility ([one of these places](http://www.moversandpackers.org/wp-content/uploads/2010/10/s...). imagine you're renting one of those units, and somebody renting a unit on the other side of the yard is a drug dealer. the FBI comes in, and in the process of seizing the assets of the drug dealer across the yard, they also seize all the stuff in your storage unit. There is no way that is reasonable.

xuki15y ago

The server belonged to Digital One.

I didn’t own the hardware — I was leasing it from DigitalOne.

2 more replies

mrcharles15y ago

No. Even if it was shared space, it should be possible to, through software and IT, extract the necessary data and bar it from further operation.

falcolas15y ago

I think it's always important to remember that the first order of business in a raid is to preserve evidence against deletion or modification. This means that their first task is to remove the hardware from anybody's hands but theirs. At which point they can peruse the data as they are able.

Why did they take an entire rack, instead of a few servers? I can think of a couple of potential reasons. - VM's, which could potentially live on any physical server in a VM pool - Insufficient information on which physical servers belong to their suspects - They just don't trust the colo operators to not be involved, and thus limit the suspect data to the servers they provide.

While I wholly agree that it's unfortunate that Instapaper and Pinboard were affected, it's not an unexpected consequence of having your servers alongside (or on the same physical machines) of people you don't know.

1 more reply

tptacek15y ago

I don't think the IT skill required to reliably extract evidence from an arbitrary hosting operation (of potentially arbitrary complexity) is simply "on tap" for the FBI.

If you want to say "tough luck that's just what it costs to collect evidence in 2011", fine, but it's probably not fair to say that the FBI should just naturally have that capability.

2 more replies

sharth15y ago

Unless you don't trust the hosting provider. Then their best bet is to take down the proper machines.

nbpoole15y ago· 8 in thread

So, the FBI has a copy of Instapaper's complete database and a copy of their website code. The database includes:

- Salted SHA-1 hashed passwords for Instapaper

- Encrypted passwords for linked Pinboard accounts (with the encryption key stored in the website code)

- OAuth tokens for linked Facebook/Twitter/Tumblr accounts (and presumably also the secret keys used by Instapaper to use those tokens).

That's (potentially) a lot of personal information.

midnightmonster15y ago

Perhaps even more important, they have a list of hundreds (thousands?) of pages I thought were interesting enough to read later. Seems like a fine base from which to build or enhance profiles of thousands of citizens.

foobarbazoo15y ago

Even better, they can use that information to build or enhance profiles WITHOUT getting any kind of judicial approval or oversight. Yay, Patriot Act (not).

imrehg15y ago

As a real practical question out of curiosity: how would you design their system differently so unauthorized people having only your hard drives couldn't get any data at all?

rosser15y ago

On my collocated server, I use encrypted LVM for all of my filesystems (except /boot, of course). On my next hardware upgrade cycle, I'm going to install a USB gyroscope (inside the chassis, using one of the front USB headers) and write a daemon that will issue a `umount -lfa && halt -n` if the box is ever moved.

Note that this isn't simply to keep prying eyes off my data; I live near an overdue earthquake fault line. When it does finally give, I should have a (slightly) better chance of the machine coming through intact.

1 more reply

kijinbear15y ago

Full-disk encryption. You enter the key whenever the system needs to be rebooted. I know at least one company that does this with all of their US-hosted servers.

2 more replies

pavel_lishin15y ago

You could always hash the e-mails, although this would make resetting your password impossible.

How much data do Facebook's OAuth tokens contain? By looking at one, can you tell that it's linked to Pavel Lishin's account?

3 more replies

forgotAgain15y ago

If the hardware is in hand then I don't see any practical way to protect the information. The only way that works is to not store the information in cloud servers.

1 more reply

boucher15y ago

Presumably most or all of these services will let you rotate your secret key, and Instapaper will do that. Assuming it was done rather quickly, then there would be no practical risk of unwanted access for any of the OAuth accounts. If the services don't let you rotate your secret key, well... they probably should.

Astrohacker15y ago· 8 in thread

I think it may be prudent to begin encrypting all data on disk that can reasonably be encrypted while being able to set up the server remotely so that no one can just snatch your server and get all your data.

This could work by encrypting your database in a truecrypt volume that must be mounted by entering the password. Thus, the data is only ever saved on disk in encrypted form, and the key to access the data is not saved on the disk. Of course, it is still in principle possible for anyone to access that information if they have physical access to the computer while it's running, but at least this makes that much harder.

pavel_lishin15y ago

How fast is Truecrypt? How much would this slow down database and file access?

ineedtosleep15y ago

Truecrypt is significantly slower, especially on the higher strength encryption methods. The program itself has a benchmark in it, so download it and check it out for yourself if you're that curious. (Note that it is relative to your hard drive's speed)

Astrohacker15y ago

I don't know. It would obviously slow down database access. It would be nice if someone tested this.

Wilya15y ago

I suspect this would only be reasonably applicable if you manage to reduce disk accesses to the very minimum. I'm not very familiar with these setups, but I assume they slow down disk accesses quite a lot.

foobarbazetc15y ago

Yeah, and watch your database IOPs fall by 1000x.

It's not feasible to run databases on encrypted block devices. Some databases let you encrypt certain tables or columns, though.

cheez15y ago

I don't think this is reasonable. If you lose power, the the volume is toast as I understand it.

andrewcooke15y ago

if i'm understanding you correctly, you're wrong. it's just another layer in the system (one more function in the mapping from your data to magnetic patterns - a function that's completely reasonable, predictable, etc, just hard to guess). the problem is that you need to enter the password on boot, which makes automated startup difficult.

for example, my laptop disk's main partition is encrypted. i need to enter the password when i boot, but nothing terrible happens if i lose power or the system crashes or whatever.

Astrohacker15y ago

Obviously you should back it up, like you would be doing anyway.

mrcharles15y ago· 7 in thread

All the more reason for data havens to exist. Run your server from a country where the police can't just take it with impunity.

dkubb15y ago

I think it's probably safer to proceed with the assumption that any sufficiently motivated government can seize your physical machine anytime they wish.

I should note that I'm not disagreeing with you, I just think there are more important considerations to make before physical location of the data.

pavel_lishin15y ago

Of course, then you have to keep careful tabs on that country's politics. The police can't confiscate your data in June, until a new bill is passed in July, and suddenly it's up for grabs.

Furthermore, I'm not sure I'd want to host my data in a country where the police cannot pursue digital criminals.

5l15y ago

Like it or not it's still the wild west, and I suspect most people here trust their own ability to protect themselves more than they trust the sheriff who only investigates crimes against the mayor and can't even ride a horse or shoot straight.

16BitTons15y ago

Do you have any suggestions on safe countries? As far as I can tell, the USA is still has the best mixture of freedom and protection available.

mrlase15y ago

The Netherlands, Sweden, etc. provide pretty good coverage and is where a lot of the seedboxes for torrenting are held. If you have something you want the government to have a almost nonexistent (depending on what it is) chance of getting to, go with Russia, China, etc. and other countries that probably don't have the best relations with the United States.

1 more reply

cma15y ago

The CIA doesn't even need warrants.

gte910h15y ago

In the US? The CIA can't operate in the US.

2 more replies

jarin15y ago· 3 in thread

Looks like the FBI is operating from the Department of Homeland Security playbook now.

ktsmith15y ago

The FBI has been doing these kinds of raids for years and years, there just hasn't been one in the news lately.

Symmetry15y ago

Its pretty similar to what the US Secret Service was doing in 1990. http://www.sjgames.com/SS/

idonthack15y ago

what do you mean "now"? DHS got their handbook from the FBI

drjoem15y ago· 3 in thread

i am wondering why these companies wern't using EC2?

seiji15y ago

That brings up an interesting point: can the FBI seize EC2 servers?

lwat15y ago

Of course they can. If they get a judge to sign their warrant they can seize anything they please.

chow15y ago

EC2 is not always a good substitute for dedicated servers for numerous reasons, I/O performance chief among them.

jsdalton15y ago· 2 in thread

Surely there is a legal precedent which provides at least some framework for what can or cannot be seized during a warrant search? This can't be the first time government agents have mistakenly seized property in an otherwise lawful search.

Also, while I completely understand Instapaper's unwillingness to pursue this through the courts, that is the way our legal system is structured. If you believe you have been harmed in some way by a government action, the courts are the avenue through which you must obtain recourse.

(Not a lawyer, so if I'm wrong about any of the above please correct me.)

cheez15y ago

It's called the constitution of the United States. If the enforcers don't follow it, your only recourse is the Supreme Court which will probably throw out your claim for national security reasons.

danielsoneg15y ago

Not necessarily. I (also) am not a lawyer, but the question isn't whether the FBI has the authority, constitutional or otherwise, to seize the servers owned by the target of the warrant, the question is whether they overstepped their bounds in seizing three whole racks of servers. If it's shown they were careless or did not take sufficient caution in their raid to avoid seizing unrelated servers, they could be held liable for damages. In this case in particular, the number of unrelated companies that have been affected by this (and the number of servers present in 3 racks) makes a case for negligence.

Again, though, the question isn't whether they had the right to seize the servers they had warrants for - they did, and you won't get that questioned by any court - but whether they did so properly, and it's not unheard of for a law enforcement agency to get slapped for overstepping their bounds. It's not Common, but it's not unheard of, and it's not a 4th amendment issue either.

2 more replies

bestes15y ago· 2 in thread

I think the OP was unreasonably harsh on DigitalOne (never heard of them let alone have any interests). It is very possible that they are consumed with FBI questioning, gag orders or who knows what else. I would give them a pass for a few days until more detail comes out.

lamnk15y ago

I think so too. He says:

   I have no idea whether I’ll ever see the server again

In this case the host probably doesn't know better than him. According to the NYTimes they are a swiss company, they only rent space and connectivity from the data center.

I see people jump up and down accusing their host being a bad host when their websites go down for 10 minutes. The thing is, shit like this happens all the time. Some years ago even Rackspace was taken offline because a truck hit their data center. Bizarre, right? Yes, but it did happen.

idlewords15y ago

The problem with DigitalOne was a complete lack of communication around this event. It was a long time (and a lot of badgering) before any of us learned anything about what had happened. I can sympathize with being busy during a crisis, but total silence for 24+ hours, with no working website, email, status page, or twitter account, is not acceptable.

1 more reply

johngalt15y ago· 2 in thread

Why isn't Facebook having their servers seized? Google? Amazon? If the FBI is really targeting the "badguys" I'm sure there have been more badguys using facebook/gmail/AWS than any single colo.

Why haven't there been similar seizures of any larger corporate entities? Even if the current FBI practices are valid, should the application of those practices be a function of size/wealth/power? Which servers of Sony's were seized after distributing rootkits?

maw15y ago

Good question. No solid answers here, but my guess would be some combination of more redundancy, better and more active lawyers, and the large players not talking about it when it does go down.

epoxyhockey15y ago

FB, Google, etc all provide a nice procedure for LEO to query all desired info. It is not necessary to seize equipment. Example: https://www.eff.org/files/filenode/social_network/Facebook20... (pdf)

mrcharles15y ago· 1 in thread

The more I think about it, the more I think this should be treated the same as any of the other thefts of data information to have happened in the past few months. Sony, Toyota, Sega, etc. A potentially hostile group now has a ton of personal info. People should know.

pavel_lishin15y ago

A potentially hostile group that also has much greater resources at its disposal than LulzSec, and much more ambiguous motivations.

smackfu15y ago· 1 in thread

To be clear, the server stopped responding, and the host he is paying for the server has not responded at all. The server could simply be unplugged, or all the network cables were unplugged during the raid. Who knows? I guess "The FBI stole my server is a better headline" though.

In my experience with our leased data center cages, we are expected to fly in to town if we ever need to physically manipulate the servers or even plug things in. The data center employees don't even go into the locked cages.

If the FBI forced open a locked cage, and did stuff in there, I would not expect anything to be addressed until DigitalOne showed up to fix it.

protomyth15y ago

If DigitalOne's people are out of country, a truly evil tactic for the FBI would be to ask customs to reject any reps entry.

justinweiss15y ago· 1 in thread

Looks like it's back:

http://twitter.com/instapaper/status/84106275796946944

"As of 2 minutes ago, my DigitalOne server is back online. The logs indicate that it was off and not booted during the time it was missing."

m0nastic15y ago

But that would mean that the FBI weren't bumbling morons who salted the earth after tearing out everything in the datacenter with a power supply...

I'm not sure I can deal with the possibility.

teoruiz15y ago· 1 in thread

I can't help to compare this raid with the feds raid to the Novus Ordo Seclorum hosting company pictured in Cryptonomicon.

lukejduncan15y ago

Every HN post can easily have a Stephenson reference.

bproper15y ago· 1 in thread

You think it's a coincidence they nabbed Whitey Bulger this morning, after 16 years on the run?

His Instapaper account was probably full of stories about Santa Monica.

VladRussian15y ago

i think his Instapaper account was full of stories about Whitey Bulger and his old friends/partners/etc... and this is how they "Big Data"-sifting-found him :)

gokhan15y ago· 1 in thread

What's the proper way of storing OAuth tokens in this situation? Given that all the tokens of users and your private key is on the server (even if it's embedded in code), there's no way for Instapaper for keeping those tokens secure in case of a compromise (by FBI or Lulzdudes or anyone).

Seems like Instapaper should change it's private key for, say, Facebook.

roc15y ago

I would think encrypting the third-party tokens with the user's password would be a decent start.

When the user's password is verified, it could be used to unlock those tokens and store them in the active session structure in RAM. There'd still be some exposure, particularly in the case of being rooted, but an attacker couldn't just dump the database.

leon_15y ago· 1 in thread

Hmm. I've built something similar to instapaper for myself. (Using a native OS X app). People were making jokes at me how I was re-inventing the wheel.

Now I'm somewhat happy having done the extra work. At least the FBI doesn't have my "read later" bookmarks. (Which often consist of the words 'hack', 'malware' and 'reverse engineering'.)

I guess I will reinvent the wheel instead of using cloud services more often in the future.

pavel_lishin15y ago

An in between solution would be better - write an open source version of Instapaper that people could install on their own servers, instead of everyone rolling their own.

tritchey15y ago

"a Swiss hosting company leasing blade servers"

If they are truly blade servers, then they were possibly sharing the same chassis, power supply and backplane. Could the FBI have pulled just the blades in question? Possibly. But I can very easily imagine the entire blade chassis being viewed as a monolithic component that they would want to be able to perform whatever forensic analysis they are planning. They could also have pulled whatever blades they were not after, and left them, but until you replace the chassis, you are dead in the water.

iqster15y ago

Turns out the server was not stolen!

https://twitter.com/#!/instapaper/status/84106275796946944

ChuckMcM15y ago

It would make for an interesting Freedom of Information (equipment) request. "Give me my damn server back." But the damage is of course done.

If you are a voting citizen of the US I recommend you write (not email, write a letter, put postage on it and everything) to your elected congressional representatives and ask that Congress immediately put curbs on the police powers of the FBI when it comes to infrastructure seizures.

mmaunder15y ago

Contact the ACLU, they will probably take your case.

neckbeard15y ago

Update: http://blog.instapaper.com/post/6854208028

andrewcooke15y ago

is there a better solution that encrypting data and putting the password in the source? obviously this is for cases where you can't use a hash.

it seems to me that, at least, it would make sense to have the db and web server physically separate in that case (although i guess someone stealing hardware is not normally a common scenario).

engtech15y ago

Julian Assange stated that the feds have backdoor, no court order access to gmail, yahoo, facebook, et all.

Why worry about this?

bhartzer15y ago

yet another reason to make regular backups of your site.

gcb15y ago

who watches the watchers?

j / k navigate · click thread line to collapse

250 comments

112 comments · 26 top-level

Xk15y ago· 29 in thread

Instapaper stores only salted SHA-1 hashes of passwords, so those are relatively safe.

Obligatory statement on NEVER USING SHA-1 HASHES to make passwords "safe".

Any normal person can brute force millions of SHA-1 hashes (salted however much you want) per second on a GPU.

I am now glad my Instapaper password was generated randomly, 16 characters long, and I will now change it just to be safe.

For anyone running a database which stores ussername/passwords, take a look at bcrypt or scrypt. They're millions (no, I am not exaggerating) of time better than SHA-1.

(Edit: Grammar)

dspace15y ago

So in this case, where the FBI is involve, using a SHA-1 hash poses no extra security vulnerability.

spoondan15y ago

They can just confiscate those servers too, where the content is most likely also in plaintext.

tankenmate15y ago

adrianscott15y ago

"So in this case, where the FBI is involve, using a SHA-1 hash poses no extra security vulnerability."

meeeh...

remember the fbi is not a person, it's an organization. the org can have bad actors in it who might be able to access the encrypted passwords but not be able to confiscate servers.

also, confiscating a server(s) is much more visible / detectable...

getsat15y ago

> can brute force millions

Modern consumer video cards can do billions per second now. You might as well just store them in plaintext instead of using SHA1/MD5 with or without salting. :/

mwytock15y ago

I dont understand. If you can use mixed-cased, letters and symbols you have 26 * 2 + 20 = 72 possible characters.

72^8 >> 1e9

It would still take more than 8 days to brute force at 1 billion/sec. And using a longer password (16 chars?) would make this a very long time.

Or is there other trick that makes this fast? Or, is it simply that people don't choose random, long passwords?

4 more replies

dragonsky15y ago

1 more reply

IgorPartola15y ago

tptacek15y ago

No there isn't. You only think that because when geeks discuss anything that involves one or more knobs, a huge debate must necessarily ensue about the proper values of those knobs.

4 more replies

seiji15y ago

scrypt slides: http://www.tarsnap.com/scrypt/scrypt-slides.pdf

Takeaway: Cost to crack one MD5 password: $1. Cost to crack one scrypt password: $50M to $200B.

You want your login to be slow compared to the rest of your application. It's okay to take half a second to verify a login.

1 more reply

oskarkv15y ago

rdl15y ago

Absolutely. That's essentially PBKDF2 (http://en.wikipedia.org/wiki/PBKDF2).

I'd still suggest using bcrypt or scrypt.

1 more reply

joevandyk15y ago

So if I'm using SHA-1 already to store passwords, what are my options for moving to a different system? I assume there's no way to rehash the passwords?

Xk15y ago

You have two choices that I see:

1 more reply

holdenk15y ago

mapgrep15y ago

Convert as sessions expire and people sign in again. You'll obviously need a db column to keep track of which passwords are converted/unconverted, e.g. password_algo.

meow15y ago

You can bcrypt your current sha1 hashes.

kindly15y ago

Xk15y ago

mbreese15y ago

tptacek15y ago

You've been downmodded because password hash salts are public nonce values; usually, schemes that depend on "secret salts" are crackpot alternatives to secure password hashes.

(I didn't downmod you).

2 more replies

cheez15y ago

Can you describe why it's better?

tptacek15y ago

http://codahale.com/how-to-safely-store-a-password/

(Be prepared for your comment score to visit the grey depths if you attempt to relitigate Coda's blog post here and don't know exactly what you're talking about.)

2 more replies

5l15y ago

He already did:

"Any normal person can brute force millions of SHA-1 hashes (salted however much you want) per second on a GPU."

This is not true of bcrypt.

tbrownaw15y ago

It takes much longer to compute the hash of a given password, which essentially makes it as if everyone chose passwords with a couple extra bytes of entropy in them.

marbletiles15y ago

Was far happier when he didn't store passwords at all, tbh.

lwat15y ago

Are you joking?

2 more replies

derrickpetzold15y ago

Xk15y ago

... what?

> Just store in plaintext because I am already assuming you are.

No, actually, I don't think I will store plaintext passwords.

> All the this talk about sha-1 vs bcrpyt vs scrpyt is nice and all but I have little faith that most companies care about this as much as HN does.

So what? Just because other people don't do it doesn't mean you don't have to also. Fortunately for us, there are a lot of startup founders here who might read this and learn something.

> I believe that most people are using the default password storage mechanism for their framework which are already known to be easy to break if the database is compromised.

I disagree. I think most people use SHA-1 because they know better than to store plaintext passwords. What they don't know is that it's terribly broken.

> But all of that is mute anyways.

No, it's really not.

> Unless you have access to the site's source how would you know if they are hashing at all much less which one they are using?

> The best practice is to use a random password for each site you use.

For sure, no doubt about it. But what we're talking about here is the best practice for application developers, not the users. The users can't do anything about how their password is stored.

> I just don't see any point in having an rememberable password for websites and hashing just leaves a false sense of security as illustrated by md5.

Or, you know, you could use bcrypt and be secure about it.

1 more reply

yuvadam15y ago· 15 in thread

I'm trying to think of an analogy which can explain why this might be reasonable from the FBIs perspective.

Is it not reasonable to consider this collateral damage (which, granted, is totally unnecessary) during law enforcement operations?

I'm not saying this is OK in any case, but might this not be a reasonable move by the law enforcement agencies?

cheald15y ago

It is not reasonable if the FBI does not have a warrant for your servers(/storage space). Instapaper is completely right to call this "theft".

If his servers are included in the warrant because they were suspected of housing whatever it is the FBI was after, and the court granted the FBI the right to seize them, then yeah, it's reasonable.

pavel_lishin15y ago

Do we know what the warrant stated? If it authorized them to take the rack containing the server they were after, then this is legal, if unfortunate.

If the police have a warrant for my apartment, and you happen to leave your backpack and server, your stuff will most likely be confiscated, along with mine, if it interests the police.

1 more reply

nkassis15y ago

They probably didn't have anyway to know which machine it was just which rack it was. They also probably didn't have to tell the hosting company directly just the facility that they were raiding.

1 more reply

GHFigs15y ago

The problem is that with blade servers like DigitalOne provided, both of these things can be true at the same time.

1 more reply

m0nastic15y ago

I can certainly think of scenarios in which this action was reasonable from the FBI perspective.

FBI determines the originating IP address of whatever their investigation is targetting (based on published information, it looks like a "scareware" operation").

FBI determines the IP address is "owned" by an overseas hosting provider, and that the physical servers are in a datacenter in the U.S.

FBI obtains a warrant for the seizure of all associated computing equipment (which may very well include the upstream devices used by the hosting provider).

FBI executes warrant at datacenter, sees that the servers are actually blades in a chasis; takes entire chasis (as reconstructing the data later on may require that the servers be bootable.)

gwright15y ago

1 more reply

PatrickTulskie15y ago

It seems like either the FBI didn't pay attention to the information given to them by DigitalOne or DigitalOne had poor information about where their servers are located.

nkassis15y ago

I was under the impression that DigitalOne wasn't even informed (they were in sweden or something) until 3 hours after the incident (from the NYT article).

smallblacksun15y ago

Or they don't trust that DigitalOne (or some employees of DigitalOne) aren't collaborating with their target.

notatoad15y ago

this is not shared hosting. the server taken belonged to instapaper. being located in the same datacenter should not be grounds for seizure.

xuki15y ago

The server belonged to Digital One.

I didn’t own the hardware — I was leasing it from DigitalOne.

2 more replies

mrcharles15y ago

No. Even if it was shared space, it should be possible to, through software and IT, extract the necessary data and bar it from further operation.

falcolas15y ago

1 more reply

tptacek15y ago

I don't think the IT skill required to reliably extract evidence from an arbitrary hosting operation (of potentially arbitrary complexity) is simply "on tap" for the FBI.

If you want to say "tough luck that's just what it costs to collect evidence in 2011", fine, but it's probably not fair to say that the FBI should just naturally have that capability.

2 more replies

sharth15y ago

Unless you don't trust the hosting provider. Then their best bet is to take down the proper machines.

nbpoole15y ago· 8 in thread

So, the FBI has a copy of Instapaper's complete database and a copy of their website code. The database includes:

- Salted SHA-1 hashed passwords for Instapaper

- Encrypted passwords for linked Pinboard accounts (with the encryption key stored in the website code)

- OAuth tokens for linked Facebook/Twitter/Tumblr accounts (and presumably also the secret keys used by Instapaper to use those tokens).

That's (potentially) a lot of personal information.

midnightmonster15y ago

foobarbazoo15y ago

Even better, they can use that information to build or enhance profiles WITHOUT getting any kind of judicial approval or oversight. Yay, Patriot Act (not).

imrehg15y ago

As a real practical question out of curiosity: how would you design their system differently so unauthorized people having only your hard drives couldn't get any data at all?

rosser15y ago

1 more reply

kijinbear15y ago

Full-disk encryption. You enter the key whenever the system needs to be rebooted. I know at least one company that does this with all of their US-hosted servers.

2 more replies

pavel_lishin15y ago

You could always hash the e-mails, although this would make resetting your password impossible.

How much data do Facebook's OAuth tokens contain? By looking at one, can you tell that it's linked to Pavel Lishin's account?

3 more replies

forgotAgain15y ago

If the hardware is in hand then I don't see any practical way to protect the information. The only way that works is to not store the information in cloud servers.

1 more reply

boucher15y ago

Astrohacker15y ago· 8 in thread

pavel_lishin15y ago

How fast is Truecrypt? How much would this slow down database and file access?

ineedtosleep15y ago

Astrohacker15y ago

I don't know. It would obviously slow down database access. It would be nice if someone tested this.

Wilya15y ago

foobarbazetc15y ago

Yeah, and watch your database IOPs fall by 1000x.

It's not feasible to run databases on encrypted block devices. Some databases let you encrypt certain tables or columns, though.

cheez15y ago

I don't think this is reasonable. If you lose power, the the volume is toast as I understand it.

andrewcooke15y ago

for example, my laptop disk's main partition is encrypted. i need to enter the password when i boot, but nothing terrible happens if i lose power or the system crashes or whatever.

Astrohacker15y ago

Obviously you should back it up, like you would be doing anyway.

mrcharles15y ago· 7 in thread

All the more reason for data havens to exist. Run your server from a country where the police can't just take it with impunity.

dkubb15y ago

I think it's probably safer to proceed with the assumption that any sufficiently motivated government can seize your physical machine anytime they wish.

I should note that I'm not disagreeing with you, I just think there are more important considerations to make before physical location of the data.

pavel_lishin15y ago

Of course, then you have to keep careful tabs on that country's politics. The police can't confiscate your data in June, until a new bill is passed in July, and suddenly it's up for grabs.

Furthermore, I'm not sure I'd want to host my data in a country where the police cannot pursue digital criminals.

5l15y ago

16BitTons15y ago

Do you have any suggestions on safe countries? As far as I can tell, the USA is still has the best mixture of freedom and protection available.

mrlase15y ago

1 more reply

cma15y ago

The CIA doesn't even need warrants.

gte910h15y ago

In the US? The CIA can't operate in the US.

2 more replies

jarin15y ago· 3 in thread

Looks like the FBI is operating from the Department of Homeland Security playbook now.

ktsmith15y ago

The FBI has been doing these kinds of raids for years and years, there just hasn't been one in the news lately.

Symmetry15y ago

Its pretty similar to what the US Secret Service was doing in 1990. http://www.sjgames.com/SS/

idonthack15y ago

what do you mean "now"? DHS got their handbook from the FBI

drjoem15y ago· 3 in thread

i am wondering why these companies wern't using EC2?

seiji15y ago

That brings up an interesting point: can the FBI seize EC2 servers?

lwat15y ago

Of course they can. If they get a judge to sign their warrant they can seize anything they please.

chow15y ago

EC2 is not always a good substitute for dedicated servers for numerous reasons, I/O performance chief among them.

jsdalton15y ago· 2 in thread

(Not a lawyer, so if I'm wrong about any of the above please correct me.)

cheez15y ago

It's called the constitution of the United States. If the enforcers don't follow it, your only recourse is the Supreme Court which will probably throw out your claim for national security reasons.

danielsoneg15y ago

2 more replies

bestes15y ago· 2 in thread

lamnk15y ago

I think so too. He says:

   I have no idea whether I’ll ever see the server again

In this case the host probably doesn't know better than him. According to the NYTimes they are a swiss company, they only rent space and connectivity from the data center.

idlewords15y ago

1 more reply

johngalt15y ago· 2 in thread

Why isn't Facebook having their servers seized? Google? Amazon? If the FBI is really targeting the "badguys" I'm sure there have been more badguys using facebook/gmail/AWS than any single colo.

maw15y ago

Good question. No solid answers here, but my guess would be some combination of more redundancy, better and more active lawyers, and the large players not talking about it when it does go down.

epoxyhockey15y ago

FB, Google, etc all provide a nice procedure for LEO to query all desired info. It is not necessary to seize equipment. Example: https://www.eff.org/files/filenode/social_network/Facebook20... (pdf)

mrcharles15y ago· 1 in thread

pavel_lishin15y ago

A potentially hostile group that also has much greater resources at its disposal than LulzSec, and much more ambiguous motivations.

smackfu15y ago· 1 in thread

If the FBI forced open a locked cage, and did stuff in there, I would not expect anything to be addressed until DigitalOne showed up to fix it.

protomyth15y ago

If DigitalOne's people are out of country, a truly evil tactic for the FBI would be to ask customs to reject any reps entry.

justinweiss15y ago· 1 in thread

Looks like it's back:

http://twitter.com/instapaper/status/84106275796946944

"As of 2 minutes ago, my DigitalOne server is back online. The logs indicate that it was off and not booted during the time it was missing."

m0nastic15y ago

But that would mean that the FBI weren't bumbling morons who salted the earth after tearing out everything in the datacenter with a power supply...

I'm not sure I can deal with the possibility.

teoruiz15y ago· 1 in thread

I can't help to compare this raid with the feds raid to the Novus Ordo Seclorum hosting company pictured in Cryptonomicon.

lukejduncan15y ago

Every HN post can easily have a Stephenson reference.

bproper15y ago· 1 in thread

You think it's a coincidence they nabbed Whitey Bulger this morning, after 16 years on the run?

His Instapaper account was probably full of stories about Santa Monica.

VladRussian15y ago

i think his Instapaper account was full of stories about Whitey Bulger and his old friends/partners/etc... and this is how they "Big Data"-sifting-found him :)

gokhan15y ago· 1 in thread

Seems like Instapaper should change it's private key for, say, Facebook.

roc15y ago

I would think encrypting the third-party tokens with the user's password would be a decent start.

leon_15y ago· 1 in thread

Hmm. I've built something similar to instapaper for myself. (Using a native OS X app). People were making jokes at me how I was re-inventing the wheel.

Now I'm somewhat happy having done the extra work. At least the FBI doesn't have my "read later" bookmarks. (Which often consist of the words 'hack', 'malware' and 'reverse engineering'.)

I guess I will reinvent the wheel instead of using cloud services more often in the future.

pavel_lishin15y ago

An in between solution would be better - write an open source version of Instapaper that people could install on their own servers, instead of everyone rolling their own.

tritchey15y ago

"a Swiss hosting company leasing blade servers"

iqster15y ago

Turns out the server was not stolen!

https://twitter.com/#!/instapaper/status/84106275796946944

ChuckMcM15y ago

It would make for an interesting Freedom of Information (equipment) request. "Give me my damn server back." But the damage is of course done.

mmaunder15y ago

Contact the ACLU, they will probably take your case.

neckbeard15y ago

Update: http://blog.instapaper.com/post/6854208028

andrewcooke15y ago

is there a better solution that encrypting data and putting the password in the source? obviously this is for cases where you can't use a hash.

it seems to me that, at least, it would make sense to have the db and web server physically separate in that case (although i guess someone stealing hardware is not normally a common scenario).

engtech15y ago

Julian Assange stated that the feds have backdoor, no court order access to gmail, yahoo, facebook, et all.

Why worry about this?

bhartzer15y ago

yet another reason to make regular backups of your site.

gcb15y ago

who watches the watchers?

j / k navigate · click thread line to collapse