HealthCare.gov Sends Personal Data to Dozens of Tracking Websites (opens in new tab)

(eff.org)

346 pointsmarkolschesky11y ago94 comments

94 comments

73 comments · 18 top-level

mindslight11y ago· 10 in thread

What else could one possibly expect when an industry has succeeded at convincing the government to make buying their product mandatory?!

I know the EFF focuses specifically on informational issues, but stirring outrage over one abuse of a captive market when such abuses are by design is a disservice to general sanity.

dublinben11y ago

The entire issue with the ACA and private insurance aside, this particular website does not need to have 18 tracking scripts on it. I'm sure this is just another symptom of the convoluted development process behind the Healthcare.gov mess.

threeseed11y ago

Not sure if you've ever worked on an enterprise website before but this would have nothing to do with the development process. This is all the work of the various marketing/product teams wanting to use the tools they are accustomed to.

It's ridiculous how many times I've seen this happen before.

mindslight11y ago

That depends entirely on whose perspective you use for the definition of need, which is the point of my original comment.

threeseed11y ago

The ACA model in the US is very similar to what exists in many places in the world e.g. here in Australia. And it was driven from the needs of the government not the needs of the health care industry. Although they are a beneficiary. That said the model really does work.

The fact is that uninsured people has a devastating effect on the economy. It prevents movement of labour, affects productivity, promotion to higher socio-economic levels, prevents people starting businesses, affects crime and countless other social effects. You need to force people who don't think they need it to have it.

brc11y ago

From experience, an Australian doesn't have the correct frame of reference to even engage in the US healthcare debate. I have tried to understand the issues many times but it comes from such a fragmented starting point it's difficult to understand unless you've been in it for a long, long time.

Your points about uninsured are valid, but it's much more complicated than just saying 'hey, you guys should insure everyone'. So I generally try and observe from the sidelines.

mindslight11y ago

We're clearly coming from very different places with regards to whether governments exist for their people, or people exist for their government. FWIW, forcing someone to purchase something they otherwise wouldn't actually hurts movement of labour, promotion to higher socio-economic levels, and people starting businesses.

UrMomReadsHN11y ago

>The ACA model in the US is very similar to what exists in many places in the world e.g. here in Australia.

Does your boss decide what your health insurance will be?

Are health insurance companies in Australia publicly traded for profit corporations?

There's other questions I have but I wont ask because I feel that people may take them as attacks. (I don't mean to attack)

>it was driven from the needs of the government not the needs of the health care industry.

The white paper that became thd affordable care act was written by Liz Fowler who was a VP at one of the largest health insurance companies. After the ADA was passed she became a lobbyist for a major pharmaceutical company.

kalleboo11y ago

> What else could one possibly expect when an industry has succeeded at convincing the government to make buying their product mandatory?!

There are plenty of non-buggy software projects in captive markets. It's not an excuse. This is not a political problem, it's an engineering one.

spacemanmatt11y ago

Do not underestimate the back-end complexity of data integration. The front end may be a small engineering problem, but I assure you, the back end is fraught with more political, protocol, transmission, and format problems than you could imagine. Or maybe you can. But please do consider the number of disparate businesses that had to be technically unified for this purpose, and the tendency for technical unity to break constantly across political/organizational boundaries.

programmarchy11y ago

Yeah... this is what happens when the customer is not the end-user of the product. Markets break down because there's no feedback loops.

Kind of like those toilet paper dispensers in public bathrooms that require a key to open and make it as difficult as possible to unroll a few sheets of what feels like sandpaper. Georgia Pacific has no reason to care about the person sitting there, powerless, on the toilet. Terrible user experience!

jayess11y ago· 9 in thread

Isn't this a HIPAA violation?

markolscheskyOP11y ago

While the data itself would fit the description of PHI, I don't know if healthcare.gov itself qualifies since it isn't a "health care provider, health plan, public health authority, employer, life insurer, school or university, or health care clearinghouse". That doesn't mean that it's against best practices. I built an analytics platform for a project with the VA on https://catalyze.io/baas (I also work there), so their are some alternatives to analytics when HIPAA is a concern.

troyastorino11y ago

It could depend on whether healthcare.gov signed Business Associate Agreements (BAAs) with the insurers that it's connecting to. If it did have to sign BAAs, then heathcare.gov would be covered under the scope of those BAAs, and would likely have to be complying with the security rule and the privacy rule.

UrMomReadsHN11y ago

I think only lawyers can say since HIPAA seems like rather complex legislation. According to Wikipedia, PHI seems like it can be almost anything...

http://en.wikipedia.org/wiki/Protected_health_information

brc11y ago

Don't governments always write themselves an exemption from following laws for everyone else? I imagine they are exempt by law from things that would land others in court or jail.

navait11y ago

When I worked as a researcher the NIH, we took privacy very seriously, as we were liable for information leakage caused by our negligence. The federal government is liable for HIPPA violations.

spacemanmatt11y ago

Doesn't the federal government enjoy pretty nearly complete immunity from its own laws? In practice that seems to be the case, anyway.

Kikawala11y ago

No. I don't see any protected health information.

honksillet11y ago

# of pregnancies is a violation. One could infer whether the individual is pregnant, has had a miscarriage or even an abortion.

markolscheskyOP11y ago

The user's IP address (which I imagine gets collected by doubleclick) is one of the 18 identifiable attributes which makes data PHI. But, I don't think that healthcare.gov needs to comply by HIPAA.

zaroth11y ago· 7 in thread

This is pretty shocking. What is PII doing in the query string in the first place? Disclosing pregnancy status from an insurance application sounds like a possible HIPPA violation and runs afoul of various state laws around 'Insurance Information and Privacy Protection'. E.g http://www.leginfo.ca.gov/cgi-bin/displaycode?section=ins&gr.... See Section 791.13(k). That's just CA law but many states followed with their own version. (IANAL)

I think the really big penalties come into play when medical information is 'personally identifiable'. Since this data is going to Google, Facebook, and Twitter (really?!) with 3rd party cookies, or even without, it would be hard to argue this data is not personally identifiable.

It's not like they didn't know they weren't sending this data out. Or perhaps the highly advanced debugging prowess of "Chrome Inspector" is beyond their pay grade.

Edit: Oh it's not even just Referral leak it's actually it the request in some cases, so blatantly intentional. :-(

threeseed11y ago

> Oh it's not even just Referral leak it's actually it the request in some cases, so blatantly intentional.

Be careful about throwing the term intentional around. There is nothing to suggest this is the case. It's just a shocking breakdown in security/testing processes and/or a bug. We see security/privacy issues everyday. They are almost never intentional.

Alupis11y ago

> They are almost never intentional.

Somebody had to specifically code the application to concatenate into the string:

> smoker=1&parent=&pregnant=1&mec=&zip=85601&state=AZ&income=35000

acdha11y ago

Reading the example more closely, that's part of a URL:

https://4037109.fls.doubleclick.net/activityi;src=4037109;ty...?

Unfortunately, a quick Google search doesn't explain what the oref parameter does but from the name I'm assuming it's something like "original referrer".

You don't need malice to explain this – it's entirely plausible to imagine that some people wanted to track user activities and they had a staggering lapse in HIPAA auditing due to the rush of getting the site out and stabilized.

4 more replies

darkerside11y ago

It could have been passed along as a returnUrl. Never attribute to malice what can be explained by incompetence.

scintill7611y ago

Here's the full URL, which was embedded into another URL:

  https://www.healthcare.gov/see-plans/85601/results/?county=04019&age=40&smoker=1&parent=&pregnant=1&mec=&zip=85601&state=AZ&income=35000&step=4

Looks like a plausible search URL from a <form> element with GET. Putting it into the querystring instead of a POST body is a bit surprising, but I think not utterly negligent. Then some Javascript code (maybe not even healthcare.gov's in-house code) looked at window.location.href and put it into another URL, and nobody noticed or stopped it. That is negligent, but more understandable, and fair to presume as unintentional, I think.

There's plenty to be legitimately upset about here, but your comment ("specifically code the application to concatenate") seems to imply the code has something outrageous like "&pregant="+currentUser.pregnant+"&smoker="+currentUser.smoker somewhere, without you giving any evidence that's the case.

2 more replies

brandonb11y ago

Not intentional. This was a bug.

zaroth11y ago

Except when it is:

Medicare spokesman Aaron Albright said outside vendors "are prohibited from using information from these tools on HealthCare.gov for their companies' purposes." The government uses them to measure the performance of HealthCare.gov so consumers get "a simpler, more streamlined and intuitive experience," he added.

http://apnews.myway.com/article/20150120/us--health_overhaul...

EdSharkey11y ago· 7 in thread

More government doing shitty things not in its charter. I'm numb to this abuse. Next up: increased taxes + inflation.

I hope I live to see the day that the laws are twisted and shredded such that all corporate-government data about every person is available for purchase. I'd love to have that detailed record of everything I've said, thought, places I've been, etc since ~Y2K. How cool would that be?

I've heard it said that future cultural anthropologists of the future will absolutely love mining the rich personal data coming out of this period of time.

at-fates-hands11y ago

>>> I've heard it said that future cultural anthropologists of the future will absolutely love mining the rich personal data coming out of this period of time.

Former Anthropologist here.

While culturally speaking it will be interesting, up to a certain point in human history there has always been physical things left behind by cultures to denote their existence.

As our whole lives have become digital, once the servers are gone, the pseudo physical evidence will vanish. One of my professors told me in passing in the early aughts that, "This generation (meaning the Y generation) will barely leave a trace of its existence in 200 years."

He inferred that once technology has evolved past our current rate of burn, the mechanisms by which we preserve our memories will be forever wiped out. He made a note of saying, "When was the last time you used something physical to create, retain or share your memories?" When was the last time you printed a photograph? Listened to a music album? Once the devices by which we save our memories become obsolete, so does our existence.

It caught me off guard, and was. . one of those times where you stop and wonder what people will dig up in 2-300 years from now and discover about our civilization? Will it all just be zero's and one's on a server somewhere?

paulfurtado11y ago

Couldn't you say that hard drives, SSDs, tape backups, etc are all still physical mediums? While these mediums lose data over time, forensics will still be able to recover partial data, similar to other physical mediums (pen and paper, photos, etc).

jcrites11y ago

Those are usually destroyed when their useful life ends, exactly because someone might dig them up later and extract data from them. Large corporate data centers, for example, physically destroy hard disks and never allow them to leave the facility intact.

There will be hard drives left around by individual consumers, I suppose, but the vast majority of all those that exist today are likely to be deliberately destroyed. We're so good at copying and replicating data these days that we no longer rely on hard drives for data permanence over long periods.

warble11y ago

(pure conjecture warning)

Perhaps, but I tend to believe the data, as we depend more and more on the 'cloud' won't be tied to physical mediums (or particular physical mediums) and instead be towed along as technology and the mediums improve.

100 terabytes of information now will likely be absurdly easy to store 100 years from now, and we don't lose what we have now because data centers will just upgrade and move the data to better storage platforms as they are invented and deployed.

Athropology of the future may not include digging up hard drives in garbage dumps. Instead you just run the latest google search.

deciplex11y ago

I don't buy it. You could make a similar case against stone tablets.

at-fates-hands11y ago

Except stone tablets have endured and lasted over thousands of years.

Why do you think most megalithic structures were made of concrete? The Ancients intended them to last a lifetime.

1 more reply

EdSharkey11y ago

Delicious 0's and 1's, all indexed and cross-referenced for your convenience.

devindotcom11y ago· 6 in thread

Spokesman Aaron Albright said outside vendors "are prohibited from using information from these tools on HealthCare.gov for their companies' purposes." The government uses them to measure the performance of HealthCare.gov so consumers get "a simpler, more streamlined and intuitive experience," he added.

It's one thing to send session length, general location, usage stuff like that to see where, for example, awareness campaigns might be needed. But really:

  smoker=1&parent=&pregnant=1&mec=&zip=85601&state=AZ&income=35000

That's a bit much! And I suppose DoubleClick is carefully siloing this information so it doesn't accidentally perform all kinds of analysis on it for comparison with its other huge databases? Perhaps they are barred from selling it wholesale to data brokers but I can't imagine they are unable to use it for plenty of their own purposes.

joering211y ago

I think the analogy would be like opening a Government Bank where the doors are always wide open and there are no locks. Everyone can see everyone account status or simply come in and grab their jewls or gold coins, but they won't because generally

[...] citizens "are prohibited by law from stealing other people's belongings"

Alupis11y ago

It would be interesting to hear a perspective from some of the HNers that actually worked on the re-build of HealthCare.gov

argc11y ago

I agree. On the surface, this looks ridiculous and I am very surprised. Hopefully there is a good explanation... but pregnancy status as a query parameter? I don't understand what that could be other than data mining.

glass-11y ago

And knowing that someone is pregnant is extremely valuable to advertisers[0]

[0] http://mashable.com/2014/04/26/big-data-pregnancy/

Arnor11y ago

I'm sure you're right that the information is getting used by DoubleClick.

To the other point about this information not being appropriate for the purposes Albright mentioned: Isn't this exactly the information that a health insurance company wants to know for outreach? If I know that all the pregnant women in Tuscon are signing up but none from Pheonix, I suddenly know where to put my next billboard or field office.

If this was a private sector company, nobody would be surprised at collecting this data. It would also be a different story if the data was being stored and analyzed in house or even if the doubleclick request happened on the server side instead of the client.

I agree with the general sentiment that this is a privacy violation, but that's because of the way that the data is collected and who processes it, not the collection and use of the data generally.

Arnor11y ago

Confused...

declan11y ago· 6 in thread

An additional problem, as I see it, is that the Obama administration made unambiguous assurances that no PII was being collected as part of Healthcare.gov's use of web measurement tools. Here's the excerpt from the privacy policy:

HealthCare.gov uses a variety of Web measurement software tools. We use them to collect the information listed in the “Types of information collected” section above. The tools collect information automatically and continuously. No personally identifiable information is collected by these tools. https://www.healthcare.gov/privacy/

Note the last sentence is in bold on the actual web page.

A Department of Health and Human Services organ called the Centers for Medicare & Medicaid Services is responsible for the site. An enterprising HN reader might want to skim through the CMS (very long) privacy impact assessment to see if there are any other incorrect claims about Healthcare.gov: http://www.hhs.gov/pia/cms-pia-summary-fy12q4.pdf

It will be interesting to see if anyone gets fired as a result of this particular privacy screwup. The buck should stop somewhere, right?

jobposter123411y ago

>A Department of Health and Human Services organ called the Centers for Medicare & Medicaid Services is responsible for the site. An enterprising HN reader might want to skim through the CMS (very long) privacy impact assessment to see if there are any other incorrect claims about Healthcare.gov: http://www.hhs.gov/pia/cms-pia-summary-fy12q4.pdf

Is there any way to split this up so each person is responsible for a section? you'd miss a lot by missing context... but if the section readers bullet pointed everything, that could be combined into a larger context.

Or, in HN speak, we could crowdsource a real-world Map/Reduce job to support big data in the citizen-scientist.

declan11y ago

I love the idea of a real-world map/reduce job. :) But before spending any time on this, please make sure it's the right PDF. It does mention Healthcare.gov, but only a few times, and I'm no expert on HHS organizational structure. Here's the full directory of PIAs: http://www.hhs.gov/pia/

zaroth11y ago

Nice find. Considering the bug is literally staring every single user in the face on the URL bar, I would imagine it would be hard to pin blame on an individual.

I guess this is the final nail in the coffin for the 'many eyes' theory though.

At least it will make a good t-shirt;

  "Query String Parameters Are Not Private"
  "Friends Don't Let Friends Store PHI in Query Parameters"

OK, never mind about the t-shirt.

maxerickson11y ago

I think the 'someone needs to be fired' is just press release journalism. It makes for an easy narrative. "There's a problem at healthcare.gov" is the first story. "What happened at healthcare.gov" is the second story. "Blah Jones has resigned" has everybody wiping their hands and looking for the next press release story to write about.

It's certainly possible that a given individual is meaningfully responsible for a problem and that they are incompetent, but it isn't necessarily the case. If the actual problem is organizational, a scapegoat just papers over it, it won't fix anything.

declan11y ago

I didn't say "someone needs to be fired" -- that's a paraphrase of what I typed, not a quote.

My point is a broader one: When you have committees and subcommittees and working groups and HHS IT people and CMS IT people and task forces and contractors and subcontractors and new replacement contractors (Accenture) and undersecretaries and sub-sub contractors and assistant secretaries and White House aides and political consultants and PR firms and deputy chiefs of staff and deputy undersecretaries all participating to some extent in the $1B+ process that is the supremely functional Healthcare.gov site we all know and love, the buck can be passed endlessly.

But in all that morass of a process, someone was or should have been responsible for ensuring that standard privacy practices were followed. To her credit, Kathleen Sebelius resigned last year (though not immediately) as a result of what the NYT called the "disastrous rollout" of Helathcare.gov. It is worth looking at whether there is any accountability in the form of dismissals or resignations with this privacy snafu.

If there is not, we should draw our own conclusions.

maxerickson11y ago

Sorry, I didn't mean to imply I was quoting you, limitations of the format.

To my point, Sebelius resigning didn't do anything to prevent this (apparent) mistake.

bagels11y ago· 4 in thread

I'm wondering whose doubleclick account those ad dollars are ending up in.

dangrossman11y ago

I was initially curious why Google/DoubleClick were in there myself, since there aren't ads on the healthcare.gov site. Those requests look to be retargeting tags so they have the ability to do things like show banner ads on CNN only to people who have an incomplete marketplace application, along with conversion tracking so they can see which marketing campaigns led to completed applications or other goals. Presumably whoever controls the rest of the healthcare.gov marketing budget also runs the DoubleClick/AdWords account.

btian11y ago

Not ad account. I think doubleclick is used for surveys.

JeremyMorgan11y ago

Nope. http://en.wikipedia.org/wiki/DoubleClick

DoubleClick is a subsidiary of Google which develops and provides Internet ad serving services

btian11y ago

I'm saying in this case. Doubleclick accounts are usually for ads.

j_s11y ago· 3 in thread

Paging HN user brandonb and 'a bunch of other Google, Facebook, and Y Combinator alums' -- did this exist while you worked on the site?

  > I've been working on healthcare.gov for the last few months

https://news.ycombinator.com/item?id=7312442

brandonb11y ago

We weren't involved with this specific part of the site but folks are on it!

(I wrapped up my involvement several months ago, but others helping out with this open enrollment period.)

bashinator11y ago

> folks are on it!

Am I correct in thinking that the cheery use of passive voice means you're under quite a serious NDA?

brandonb11y ago

There is an NDA but I used "folks" since I, personally, have returned to the startup world and am not involved in the details of fixing this particular incident. But yes, rest assured that the people who currently work on healthcare.gov are busy testing a fix, which is why they're not posting on HN.

seccess11y ago· 1 in thread

This is certainly scary stuff, but I was a bit annoyed with the line:

"...consequences such as when Target notified a woman's family that she was pregnant before she even told them. "

I've heard this story referenced time and again with respect to motivating people to care about privacy and tracking. I'm all for privacy, but I feel like: (a) we should have more recent anecdotes about the consequences of tracking than a story from 2012, (b) the mechanism that Target used to infer this is far less intrusive (not making it OK) than what we see here, and (c) its really not strong enough an example.

Not that speculation is the way to go, but what about the possibility of someone being turned down for life insurance due to this information?

protomyth11y ago

Well, it is a simple example and has the virtue of being true instead of the often quoted but misrepresented McDonalds hot coffee story. Simple examples showing a situation are best, and much like iOS bug statistics, the parties who would have the statistics on situations caused by tracking are never going to make them public.

dkroy11y ago· 1 in thread

I heard something awhile back about the us government(NSA) leveraging the cookie in a way that they could use it as a surveillance beacon. I doubt there is any relation, but it makes you think a bit.

[1] https://www.eff.org/deeplinks/2013/12/nsa-turns-cookies-and-...

bhhaskin11y ago

I am sure the NSA are involved with at least some aspect of all .gov addresses.

kumarm11y ago· 1 in thread

One of the heavily trafficked sites in India (Railway Booking) has been showing Google adsense ads. Someone is making a Million dollars a month in Government :)

https://www.irctc.co.in/eticketing/loginHome.jsf

536 Global Rank (50 in India): http://www.alexa.com/siteinfo/www.irctc.co.in

nathanaldensr11y ago

What does this have to do with the topic of this thread?

drylight11y ago

Give $563M to Accenture and you get some really shoddy work http://www.healthcaredive.com/news/accenture-snags-new-5-yea...

garazy11y ago

Looks like a few of the tracking companies only just started to appear -

http://builtwith.com/detailed/healthcare.gov

The only non-ad tool they added was the Twitter Platform to their homepage. Lots of data leakage points though.

tedunangst11y ago

Don't blame the browsers for continuing to send Referer headers though. Because browsers take your privacy seriously.

jtheory11y ago

They don't even get into the repercussions of loading externally-hosted JavaScript into a secure page.

We avoid this entirely (also hosting medical data), though it's been a bit of extra work to do so.

I'm sure Chartbeat, Mathtag, Mixpanel, Google, etc. are reasonably careful about their security, and of course they would suffer as well if one of the servers/scripts was compromised and the breach was made public.

But in short -- healthcare.org's security relies on the idea that none of these many 3rd parties will ever have a CDN server compromised, for example. Or (in other situations) have the NSA demand access.

It just takes one -- and then an "improved" script could be delivered to only clients visiting a single targeted site, or even specific targeted clients. The normal customer just sees the lock icon and can verify that there's a secure connection to the main host; but there are actually many other connections going on to other hosts, and any of them may provide a script that can access any sensitive data on the page.

fubarred11y ago

Currently, https://disconnect.me/ browser extension says https://www.healthcare.gov/ uses:

- 0 Facebook, 3 Google, 0 Twitter

- 0 Advertising

- 6 Analytics: 1 ClickTale, 4 MixPanel, 1 Chartbeat

- 0 Social

- 6 Content: 3 Google, 3 Optimizely

stephenhess11y ago

If you're looking for a better place to go than healthcare.gov, give us a try at stridehealth.com. Bunch of ex-privacy folks and healthcare folks - can shop from your phone. Pretty shocking to see such a novice mistake by an org I think we were all expecting to take it up a level this year.

natmaster11y ago

Don't worry guys, Obama's friend's companies will use that information to sell you better products. It's for your own good!

j / k navigate · click thread line to collapse

94 comments

73 comments · 18 top-level

mindslight11y ago· 10 in thread

What else could one possibly expect when an industry has succeeded at convincing the government to make buying their product mandatory?!

I know the EFF focuses specifically on informational issues, but stirring outrage over one abuse of a captive market when such abuses are by design is a disservice to general sanity.

dublinben11y ago

threeseed11y ago

It's ridiculous how many times I've seen this happen before.

mindslight11y ago

That depends entirely on whose perspective you use for the definition of need, which is the point of my original comment.

threeseed11y ago

brc11y ago

Your points about uninsured are valid, but it's much more complicated than just saying 'hey, you guys should insure everyone'. So I generally try and observe from the sidelines.

mindslight11y ago

UrMomReadsHN11y ago

>The ACA model in the US is very similar to what exists in many places in the world e.g. here in Australia.

Does your boss decide what your health insurance will be?

Are health insurance companies in Australia publicly traded for profit corporations?

There's other questions I have but I wont ask because I feel that people may take them as attacks. (I don't mean to attack)

>it was driven from the needs of the government not the needs of the health care industry.

kalleboo11y ago

> What else could one possibly expect when an industry has succeeded at convincing the government to make buying their product mandatory?!

There are plenty of non-buggy software projects in captive markets. It's not an excuse. This is not a political problem, it's an engineering one.

spacemanmatt11y ago

programmarchy11y ago

Yeah... this is what happens when the customer is not the end-user of the product. Markets break down because there's no feedback loops.

jayess11y ago· 9 in thread

Isn't this a HIPAA violation?

markolscheskyOP11y ago

troyastorino11y ago

UrMomReadsHN11y ago

I think only lawyers can say since HIPAA seems like rather complex legislation. According to Wikipedia, PHI seems like it can be almost anything...

http://en.wikipedia.org/wiki/Protected_health_information

brc11y ago

Don't governments always write themselves an exemption from following laws for everyone else? I imagine they are exempt by law from things that would land others in court or jail.

navait11y ago

When I worked as a researcher the NIH, we took privacy very seriously, as we were liable for information leakage caused by our negligence. The federal government is liable for HIPPA violations.

spacemanmatt11y ago

Doesn't the federal government enjoy pretty nearly complete immunity from its own laws? In practice that seems to be the case, anyway.

Kikawala11y ago

No. I don't see any protected health information.

honksillet11y ago

# of pregnancies is a violation. One could infer whether the individual is pregnant, has had a miscarriage or even an abortion.

markolscheskyOP11y ago

The user's IP address (which I imagine gets collected by doubleclick) is one of the 18 identifiable attributes which makes data PHI. But, I don't think that healthcare.gov needs to comply by HIPAA.

zaroth11y ago· 7 in thread

It's not like they didn't know they weren't sending this data out. Or perhaps the highly advanced debugging prowess of "Chrome Inspector" is beyond their pay grade.

Edit: Oh it's not even just Referral leak it's actually it the request in some cases, so blatantly intentional. :-(

threeseed11y ago

> Oh it's not even just Referral leak it's actually it the request in some cases, so blatantly intentional.

Alupis11y ago

> They are almost never intentional.

Somebody had to specifically code the application to concatenate into the string:

> smoker=1&parent=&pregnant=1&mec=&zip=85601&state=AZ&income=35000

acdha11y ago

Reading the example more closely, that's part of a URL:

https://4037109.fls.doubleclick.net/activityi;src=4037109;ty...?

Unfortunately, a quick Google search doesn't explain what the oref parameter does but from the name I'm assuming it's something like "original referrer".

4 more replies

darkerside11y ago

It could have been passed along as a returnUrl. Never attribute to malice what can be explained by incompetence.

scintill7611y ago

Here's the full URL, which was embedded into another URL:

  https://www.healthcare.gov/see-plans/85601/results/?county=04019&age=40&smoker=1&parent=&pregnant=1&mec=&zip=85601&state=AZ&income=35000&step=4

2 more replies

brandonb11y ago

Not intentional. This was a bug.

zaroth11y ago

Except when it is:

http://apnews.myway.com/article/20150120/us--health_overhaul...

EdSharkey11y ago· 7 in thread

More government doing shitty things not in its charter. I'm numb to this abuse. Next up: increased taxes + inflation.

I've heard it said that future cultural anthropologists of the future will absolutely love mining the rich personal data coming out of this period of time.

at-fates-hands11y ago

>>> I've heard it said that future cultural anthropologists of the future will absolutely love mining the rich personal data coming out of this period of time.

Former Anthropologist here.

While culturally speaking it will be interesting, up to a certain point in human history there has always been physical things left behind by cultures to denote their existence.

paulfurtado11y ago

jcrites11y ago

warble11y ago

(pure conjecture warning)

Athropology of the future may not include digging up hard drives in garbage dumps. Instead you just run the latest google search.

deciplex11y ago

I don't buy it. You could make a similar case against stone tablets.

at-fates-hands11y ago

Except stone tablets have endured and lasted over thousands of years.

Why do you think most megalithic structures were made of concrete? The Ancients intended them to last a lifetime.

1 more reply

EdSharkey11y ago

Delicious 0's and 1's, all indexed and cross-referenced for your convenience.

devindotcom11y ago· 6 in thread

It's one thing to send session length, general location, usage stuff like that to see where, for example, awareness campaigns might be needed. But really:

  smoker=1&parent=&pregnant=1&mec=&zip=85601&state=AZ&income=35000

joering211y ago

[...] citizens "are prohibited by law from stealing other people's belongings"

Alupis11y ago

It would be interesting to hear a perspective from some of the HNers that actually worked on the re-build of HealthCare.gov

argc11y ago

glass-11y ago

And knowing that someone is pregnant is extremely valuable to advertisers[0]

[0] http://mashable.com/2014/04/26/big-data-pregnancy/

Arnor11y ago

I'm sure you're right that the information is getting used by DoubleClick.

I agree with the general sentiment that this is a privacy violation, but that's because of the way that the data is collected and who processes it, not the collection and use of the data generally.

Arnor11y ago

Confused...

declan11y ago· 6 in thread

Note the last sentence is in bold on the actual web page.

It will be interesting to see if anyone gets fired as a result of this particular privacy screwup. The buck should stop somewhere, right?

jobposter123411y ago

Or, in HN speak, we could crowdsource a real-world Map/Reduce job to support big data in the citizen-scientist.

declan11y ago

zaroth11y ago

Nice find. Considering the bug is literally staring every single user in the face on the URL bar, I would imagine it would be hard to pin blame on an individual.

I guess this is the final nail in the coffin for the 'many eyes' theory though.

At least it will make a good t-shirt;

  "Query String Parameters Are Not Private"
  "Friends Don't Let Friends Store PHI in Query Parameters"

OK, never mind about the t-shirt.

maxerickson11y ago

declan11y ago

I didn't say "someone needs to be fired" -- that's a paraphrase of what I typed, not a quote.

If there is not, we should draw our own conclusions.

maxerickson11y ago

Sorry, I didn't mean to imply I was quoting you, limitations of the format.

To my point, Sebelius resigning didn't do anything to prevent this (apparent) mistake.

bagels11y ago· 4 in thread

I'm wondering whose doubleclick account those ad dollars are ending up in.

dangrossman11y ago

btian11y ago

Not ad account. I think doubleclick is used for surveys.

JeremyMorgan11y ago

Nope. http://en.wikipedia.org/wiki/DoubleClick

DoubleClick is a subsidiary of Google which develops and provides Internet ad serving services

btian11y ago

I'm saying in this case. Doubleclick accounts are usually for ads.

j_s11y ago· 3 in thread

Paging HN user brandonb and 'a bunch of other Google, Facebook, and Y Combinator alums' -- did this exist while you worked on the site?

  > I've been working on healthcare.gov for the last few months

https://news.ycombinator.com/item?id=7312442

brandonb11y ago

We weren't involved with this specific part of the site but folks are on it!

(I wrapped up my involvement several months ago, but others helping out with this open enrollment period.)

bashinator11y ago

> folks are on it!

Am I correct in thinking that the cheery use of passive voice means you're under quite a serious NDA?

brandonb11y ago

seccess11y ago· 1 in thread

This is certainly scary stuff, but I was a bit annoyed with the line:

"...consequences such as when Target notified a woman's family that she was pregnant before she even told them. "

Not that speculation is the way to go, but what about the possibility of someone being turned down for life insurance due to this information?

protomyth11y ago

dkroy11y ago· 1 in thread

I heard something awhile back about the us government(NSA) leveraging the cookie in a way that they could use it as a surveillance beacon. I doubt there is any relation, but it makes you think a bit.

[1] https://www.eff.org/deeplinks/2013/12/nsa-turns-cookies-and-...

bhhaskin11y ago

I am sure the NSA are involved with at least some aspect of all .gov addresses.

kumarm11y ago· 1 in thread

One of the heavily trafficked sites in India (Railway Booking) has been showing Google adsense ads. Someone is making a Million dollars a month in Government :)

https://www.irctc.co.in/eticketing/loginHome.jsf

536 Global Rank (50 in India): http://www.alexa.com/siteinfo/www.irctc.co.in

nathanaldensr11y ago

What does this have to do with the topic of this thread?

drylight11y ago

Give $563M to Accenture and you get some really shoddy work http://www.healthcaredive.com/news/accenture-snags-new-5-yea...

garazy11y ago

Looks like a few of the tracking companies only just started to appear -

http://builtwith.com/detailed/healthcare.gov

The only non-ad tool they added was the Twitter Platform to their homepage. Lots of data leakage points though.

tedunangst11y ago

Don't blame the browsers for continuing to send Referer headers though. Because browsers take your privacy seriously.

jtheory11y ago

They don't even get into the repercussions of loading externally-hosted JavaScript into a secure page.

We avoid this entirely (also hosting medical data), though it's been a bit of extra work to do so.

fubarred11y ago

Currently, https://disconnect.me/ browser extension says https://www.healthcare.gov/ uses:

- 0 Facebook, 3 Google, 0 Twitter

- 0 Advertising

- 6 Analytics: 1 ClickTale, 4 MixPanel, 1 Chartbeat

- 0 Social

- 6 Content: 3 Google, 3 Optimizely

stephenhess11y ago

natmaster11y ago

Don't worry guys, Obama's friend's companies will use that information to sell you better products. It's for your own good!

j / k navigate · click thread line to collapse