Data portability, the forgotten right of GDPR (opens in new tab)

(alias.dev)

312 pointsmehdim5y ago217 comments

217 comments

122 comments · 20 top-level

maxdo5y ago· 28 in thread

I'll re-phrase. Imagine I'm a startup. If government force me to to delete some data, it makes my life easier, no data - no privacy issues. if someone tells me , I want to port my data to competitor, because my UI better then theirs, but they still prefer competitor, why should I care about this requests, why should i spent a single second of my engineers time to implement that?

TeMPOraL5y ago

> if someone tells me , I want to port my data to competitor, because my UI better then theirs, but they still prefer competitor, why should I care about this requests, why should i spent a single second of my engineers time to implement that?

Because you're a good person and care about providing value to your users, and not just extracting value from them.

But since in practice, we can't rely on every business to be run by good citizens, this needs to be made a legal requirement, to remove the competitive advantage from being predatory and locking users down.

ryandrake5y ago

While I agree 100% with this response, let's for the sake of argument assume OP is not a good person, and doesn't actually care about providing value to users.

An answer to "why" that does not depend on voluntary goodness is: Enough people is the world generally think representative democracy is a good thing. We stand by that system for making the rules. Enough people in part of the world think there is enough of a problem to the point where a rule was made. If you want to do business in that part of the world, you need to be bound by that rule. That's why you should spend time on it. As incomprehensible as it might be, it's important enough to those citizens that they are willing to levy a penalty on you if you don't.

...and so far, at least three people actually took time out of their day to go find the "down" arrow on this obviously raving insane viewpoint :) I love you guys!

3 more replies

toolz5y ago

the spirit of such a law is great, but there's a huge problem - what does the implementation even look like? Are we going to have regulatory committees oversee which types of data should be portable and when? Who writes the protocols?

The implementation of such a law is impossible as far as I can tell and opens up huge vulnerabilities to smaller companies.

Just imagine when large companies can hire lobbyists that can force a data protocol on the smaller businesses.

The spirit of many laws is great, the implementation is unfortunately, what actually matters and I don't see solutions to these hard problems.

Allow me to go on a soapbox here, but far too many laws are created with good intentions that are destroying competition and hurting the end users.

1 more reply

golergka5y ago

> Because you're a good person and care about providing value to your users, and not just extracting value from them.

That's a false dichotomy that is also misrepresents the nature of a typical business transaction.

If you're a good person and a business owner, you're looking to make mutually beneficial business transactions. If someone is looking to move away from using your business, then it's them who's trying to extract value from you, without giving anything in return.

Of course, sometimes, as just a good person, you want to do good for other people without anything in return — but you can do it as a private person, putting your profits into charity funds. Separation of concerns is a good thing that make things clear. Also, from any moral point of view, money spent on engineer salary that allows some food app user to migrate to a competitor is probably not spent as well as feeding hungry or providing health care to sick anyway.

2 more replies

est315y ago

If you are a startup, then such a law directly benefits you because you might want to convince users to migrate to your services. If the big established competitor of yours has to offer data exports, such a migration is made easier for you, enabling your startup to grow faster, and giving users the ability to enjoy more innovation in the market.

goodpoint5y ago

Because removing an exit barrier means removing lock-in.

Not holding customers data hostage can increase your service adoption.

E.g. many companies would not pay for a web-only email service where you cannot download and backup emails.

E.g. A lot of people pay for non-locked books (epubs) that can be carried over across different devices.

Governments across the world broke lock-in mechanisms for decades (e.g. carrying phone numbers, being able to buy gas/car oil/car tires/PC components/ from independent vendors)

WA5y ago

You don't write an API to port stuff to your competitor. You write a JSON or CSV export and competitors can then make an import tool for your data format (and vice versa).

Is this really an effort? It's basically a JOIN over a bunch of tables or maybe the JSON state tree of your SPA and that's about it.

Chances are, your startup works with all data of a user and has a way to request all data from the DB anyways.

dariosalvi785y ago

A company that builds houses would very much avoid building those pointless and expensive security features. Why would they spend a second of their architects' time on that?

1 more reply

croes5y ago

I'll re-phrase: why should I care about the requests of my users? Now you know why they prefer your competitor over your better UI. Your UI may be better, but your UX sucks.

jcelerier5y ago

You're exactly the kind of person I hope my government protects me of. Companies are not meant to enrich yourself but to make the world better.

google2341235y ago

Companies are not meant to make the world better...

1 more reply

ClumsyPilot5y ago

I'll rephrase. Imagine I'm a startup. If someone tell me, I want to transfer my savings to a conpetitor, why should I care about this request?

The answer should be obvious, it's their data just like it's their savings.

jensus5y ago

the mental shift seems to be to not regard your customers data as your product but rather focus on your service as your product

kspacewalk25y ago

That will make a whole lot of business models out there not feasible. The result will be fewer free services (to put it differently, fewer services and fewer choices). If you don't pay for stuff with your data, you can't have it for free. Are we sure we want to use government regulations to impose this on consumers of services, from the top down? Instead of, say, letting them decide?

(Yes, of course it's an industry talking point. The best kind - one that's true and valid, and so far not effectively refuted).

7 more replies

tomcooks5y ago

Because it's not your data, it's mine?

cromulent5y ago

Barriers to exit are also barriers to entry.

shuntress5y ago

Because this is also required of your competitor and will allow users port their data into your startup which gives you a chance to compete.

williamtwild5y ago

Being required and complying with that requirement are two different things.

1 more reply

toomuchtodo5y ago

Isn’t engineering time cheaper than legal counsel time when your customers file complaints with the government against your org for not adhering to the law?

JumpCrisscross5y ago

> engineering time cheaper than legal counsel time

For a Silicon Valley based company hiring EU lawyers, no. Engineers are more expensive. Also, for a Silicon Valley company with limited or no EU presence, the time value of money may make incurring that deferred cost worth the saves near-term engineering time.

Laws should be followed. But laws must be enforced. OP’s point is valid. The EU passed a law and delegated enforcement to its various members, each of whom have varying levels (and interpretations) of enforcement around different parts of the text.

Until that changes, GDPR compliance will remain a courtesy. Not a right.

2 more replies

MattGaiser5y ago

Is any legal counsel time actually being spent on this? It seems like all the disability legislation. In theory it applies to websites. In practice, few give it a 2nd thought.

I have yet to hear of a company significantly harmed by failing to consider accessibility.

sam_lowry_5y ago

Yes unless you already have lawers on staff.

bombcar5y ago

This covers a good argument as to why: https://www.joelonsoftware.com/2000/06/03/strategy-letter-ii...

And it's true - there are a number of services for work that we've never tried because there's no easy way "back".

matheusmoreira5y ago

Why should your company be allowed to lock in other people's data in your company's computers and then refuse to give it back? This is obviously abusive. Why should your company be allowed to abuse its customers? Why should an abusive company even be allowed to exist?

La1n5y ago

I bet there are more laws that a company would love not to follow, but it's the law and thus you'll need to spend time implementing it.

dbetteridge5y ago

Because the data isn't yours, it belongs to the customer.

That is the opinion that GDPR encodes into law

pmlnr5y ago

Erm... because you need to follow laws. Your company would file tax records, right? And follow fire and building regulations in the office, correct? So why would it not follow GDPR?

1 more reply

0xbadcafebee5y ago

It's not your responsibility to help your customers use a competitor's service, so you definitely don't have to care about that. However, you might care if you practice "dogfooding".

The idea of eating one's own dog food is to understand the experience of the customer and improve the product. It demonstrates confidence in your product and helps you empathize with your customers. If you do this & are confident in your product, then a portability feature (to allow your customers to try out your competitors) should not be a threat.

Assuming you can convey to your customers why your product is superior, they won't have need of the porting tool. If one day they think, "Hmm, I wonder if the competitor is better", and try to use the porting tool to use the competitor, and find out it's a huge pain because the competitor's product isn't as good (or doesn't work the way yours does), they may decide they just don't feel like switching. People might also use your product just because they can switch if they ever need to.

Pandora is a great example of a shitty company that does not believe in its own product. If you use the free version, you are constantly bombarded with dark patterns and direct advertisements to get you to upgrade to their paid account. It's annoyware. If you eventually pay for the product, the only value add is fewer ads. There's no improved functionality, there's no easier experience, no better algorithm. Just slightly less pain. It's like upgrading from dogfood that tastes like shit, to dogfood that only smells like shit. If Pandora created a data portability tool, they would be screwing themselves, because they know their product is shit. If they had a great product, portability wouldn't be a threat to their business.

somethingAlex5y ago· 17 in thread

What are consumers intuitively expecting compliance with this law to look like?

Data from one service may be in an entirely different schema than the service you want to import it too - let alone format. Service A may summarize your data and throw away the granular stuff, but service B runs on the granular data.

Are consumers going to implement ETL pipelines to achieve portability? Are they expecting to hook up streaming mechanisms for enormous swathes of data?

Just as an example, if I wanted to get a list of every song I liked on Spotify and import it into Apple Music, how would that even work? The songId of Spotify is undoubtedly different than the one Apple uses. Are Apple and Spotify supposed to agree on a common file format?

I agree with the intent of the law but I'm not surprised most services do not offer an automated way to take out data. It's a rare case, often a heavy workload, and there's really no way to guarantee the data you receive is actually portable.

izacus5y ago

> Just as an example, if I wanted to get a list of every song I liked on Spotify and import it into Apple Music, how would that even work? The songId of Spotify is undoubtedly different than the one Apple uses. Are Apple and Spotify supposed to agree on a common file format?

Why wouldn't it work? Desktop apps had m3u playlist formats which could be read by multiple players - from Winamp on desktop, iTunes on a Mac or even car headunits. It's now kinda wierd to say that rockstar engineers of Apple/Spotify can't find a way to export playlists and liked songs (a global singletons essentially) they got from the SAME publishers and probably ingest from the SAME content owner data sources.

sethhochberg5y ago

m3u (or really anything that relies on file names to reference media assets) would be a potential disaster... all kinds of weird stuff makes it into your parser when you're dealing with a large enough catalog. I used to work in streaming media, including with some m3u-based legacy systems, and dealt with a pile of edge cases a mile high.

But thankfully, the industry solved this problem themselves: ISRC (International Standard Recording Code) is already used all over the royalty reporting side of the industry because it specifically solves the problem of referencing an individual recording of a work.

DDEX is a content delivery manifest format the industry also uses for this kind of purpose (sharing complex metadata about recordings in a standardized way), but its an 800lb gorilla of a format and not super consumer-friendly.

These are things that are all over the back offices of your favorite streaming service, but mostly transparent to the consumer.

1 more reply

redwall_hp5y ago

I was thinking the other day about that, sort of. Apple in the early 2000s was really into things like CalDAV and WebDAV. Safari had integrated RSS reading at one point. They embraced standardization and interoperability for many things, at least where important user data was concerned. Then something happened after the iPhone took off and iCloud became a thing, and they became all about vendor lock-in. I assume it comes from being a market leader instead of only having a relatively unpopular computing platform.

1 more reply

dsr_5y ago

There are incentives for, say, Mastodon to be able to ingest your tweeting history, or for Linked-In to eat your Facebook social graph.

There's no incentive other than the law for Twitter or Facebook to make that data exportable.

StopHammoTime5y ago

That’s why laws exist.

There’s generally no incentive to not kill someone except going to jail.

4 more replies

Guillaume865y ago

I understand your point but FYI music is a poor example as there is solutions to port metadata in that case. MusicBrainz aims to standardize music metadata and it is pretty commonly used. An example I know is the lastfm service, their APIs accept an optional mbid: https://www.last.fm/api/show/track.updateNowPlaying.

cbm-vic-205y ago

Should music streaming services be compelled to support MusicBrainz to support this GDPR case simply because it is commonly used? Who decides that mbid is the GDPR-accepted track identifier?

2 more replies

kelnos5y ago

Artist + Song Title (+ Album and track number, if it's from an album) should be enough to disambiguate in enough cases for someone to consider this "portable".

Beyond that, we have music fingerprint IDs that a service could output in the data dump along with their own service-specific ID.

> Are Apple and Spotify supposed to agree on a common file format?

For something as simple as this, yes, absolutely they should. It's bonkers that they don't and wouldn't, aside from garbage anti-competitive lock-in reasons.

908B64B1975y ago

If I was Spotify I would export that as an SQLite DB. Maybe the metadata catalog as a standalone DB too.

Apple Music has an API[0] so it's already mostly possible to import a list of songs in it.

> I agree with the intent of the law but I'm not surprised most services do not offer an automated way to take out data. It's a rare case, often a heavy workload, and there's really no way to guarantee the data you receive is actually portable.

"Data Portability" is so vaguely defined that I can't help but see it as yet another law that EU bureaucrats will use to fleece (American) "Evil Tech Giants".

[0] https://developer.apple.com/documentation/applemusicapi

anticristi5y ago

"Data portability" is vague so that the law is stable and flexible. As a comparison, "drivers need to adapt driving speed to weather conditions" is equally vague. It would simply be infeasible to publish an hourly speed limit chart based on rain, fog, snow, etc. It is the responsibility of driving instructors to raise awareness on reasons to adapt the speed. Drivers need to then interpret that clause to their situation.

Similarly, it is up to industry -- either via standardization bodies or courts -- to clarify what exactly is "data portability".

2 more replies

hnick5y ago

> Are consumers going to implement ETL pipelines to achieve portability? Are they expecting to hook up streaming mechanisms for enormous swathes of data?

As a dev I hate the fact that something like Zapier apparently has to exist in this messy world, but non-technical people like my wife tend to find it intuitive and relatively easy to use so that's one option.

Though for your example I'd argue that the ingester (Apple) has a vested interest in allowing import from many formats to poach customers. Much like how Apple went to the effort of creating the Move to iOS app on android. I wonder whether having the data exported with just a song id would be sufficient under the law, because you could just normalise all useful data away and export a list of IDs to the customer which seems clearly against the purpose of the law. Showing just IDs is not my data which would mean the actual songs I like.

capableweb5y ago

Yeah, that'd be great! We didn't get the web as we know it today until bunch of people and companies got together and created standards for everyone to rely on. Why can't we do that same for SaaS businesses?

I think the test is something like: If the concept is the same, you should be able to import/export it. For example, you have a SaaS having photo upload + being able to put the photos into a custom gallery. Then you should be able to export that gallery in a format that you can recreate the same gallery in another SaaS that also has photo upload + custom galleries.

The article itself is clear that it's not always technically feasible to offer this import/export. For example, it doesn't make sense to be able to export Facebook posts and import them into Twitter, because those are two different formats with different restrictions.

This is from the actual article:

> In exercising his or her right to data portability pursuant to paragraph 1, the data subject shall have the right to have the personal data transmitted directly from one controller to another, where technically feasible.

The full article of "Data Portability" is not that long, you can read it here: https://gdpr-info.eu/art-20-gdpr/

Helmut100015y ago

I agree, most SaaS concepts are similar and have large overlaps in feature and functionality. Just for Social Media, we've written a common data structure format (lbsn.vgiscience.org) where it is possible to import/export from all services (this one is specifically tailored for visual analytics and exploration of research/privacy questions). When working on the structure, it became clear that most Social Media concepts exist in a similar form on multiple sites. There is very little functionality that is unique to a single SaaS.

1 more reply

irrational5y ago

Isn’t “Where technically feasible” a huge loophole?

1 more reply

account425y ago

The emportant part that this law should achieve is to get the data out of the service. This at least allows competing services to provide importers, which is in their interest.

anticristi5y ago

I agree. I could develop my own ETL and wasn't sure what to do with this right. Where would I import my Klarna Checkout history and for what purpose?

I guess this law is there to ensure your images can be transferred from Dropbox to Google Drive to Apple Cloud, without any of them being tempted to pull the plug.

1 more reply

mulmen5y ago

This is so sad. That we have already forgotten how easy this is. And that we do not see that data integration is an obvious case for open standards and development.

Back in the "bad" (aka glorious) days of p2p file sharing we had no problems keeping things straight. Even Windows XP natively knows how media libraries work. Any service that makes this hard will be at a disadvantage to ones that make it easy, and maybe get roasted in court on GDPR grounds.

The only reason services do not offer you data export in an easily digestible format is that they want you to stay in the app.

tester345y ago· 14 in thread

where can I download my HN's data?

notRobot5y ago

You can't. There's also no easy way to request all profile data deletion, unfortunately.

However, they do respond to privacy requests, see:

https://news.ycombinator.com/item?id=26959559

https://news.ycombinator.com/item?id=26410165

capableweb5y ago

Have you tried emailing hn@ycombinator.com and it got denied? Or what you mean there is no easy way to request the data deletion? AFAIK they don't scrub the comments but if you request it, your username will be replaced with [deleted] for all your comments.

2 more replies

capableweb5y ago

Last time I "archived" my account data on HN I used https://github.com/HackerNews/API which seems to be working good enough for my needs.

thatguy09005y ago

Hn has no EU presence so doesn't have to follow EU laws, no? Or do they have to ip block Europeans? What would the EU actually do to hn if they did decide to enforce the rules here?

burntoutfire5y ago

Typical approach is issue a fine and then seize the assets in the EU that belong to HN's owners (if there are any).

alexaholic5y ago

GDPR is about data, not companies. It applies to all entities regardless of where they are established as long as they're doing business in the EU or processing data of EU citizens.

1 more reply

tremon5y ago

Indeed, my answer would be no. But IANAL, IANYL and TINLA.

There's https://gdpr.eu/companies-outside-of-europe/ :

> Article 3.2 goes even further and applies the law to organizations that are not in the EU if two conditions are met: the organization offers goods or services to people in the EU, or the organization monitors their online behavior.

Recital 23 clarifies what is meant by the organization offers goods or services to people in the EU: https://gdpr.eu/Recital-23-Applicable-to-processors-not-esta...

> In order to determine whether such a controller or processor is offering goods or services to data subjects who are in the Union, [..] the mere accessibility of the controller’s, processor’s or an intermediary’s website in the Union, of an email address or of other contact details, or the use of a language generally used in the third country where the controller is established, is insufficient to ascertain such intention, factors such as the use of a language or a currency generally used in one or more Member States with the possibility of ordering goods and services in that other language, or the mentioning of customers or users who are in the Union, may make it apparent that the controller envisages offering goods or services to data subjects in the Union.

Profiling is clarified in recital 24: https://gdpr.eu/Recital-24-Applicable-to-processors-not-esta...

> it should be ascertained whether natural persons are tracked on the internet including potential subsequent use of personal data processing techniques which consist of profiling a natural person, particularly in order to take decisions concerning her or him or for analysing or predicting her or his personal preferences, behaviours and attitudes.

So, I'd say no. The mere fact that HN is accessible to people in the EU does not show intent. HN is an English forum, which is the native language of the country where it is established, and does not offer its services in additional European languages, and does not advertise products in the Euro currency. I'm unable to know for sure, but I don't believe HN is using my posts here to predict or analyse my personal preferences either.

1 more reply

vincnetas5y ago

There is public API for HN data

https://github.com/HackerNews/API

Does it count like ability to download your data?

user-the-name5y ago

No. It needs to be accessible to everyone, not just to programmers with lots of free time.

dahart5y ago

Question: are you an EU citizen, and is there any way for HN to know whether you are an EU citizen? (Your public profile page has no personally identifiable information.)

GDPR is an EU law that applies to sites that market directly to EU citizens. How and whether it applies to sites outside the EU has been debated. GDPR can prevent a site from operating in the EU. But GDPR does not apply to a US citizen using a US-run web site.

https://en.wikipedia.org/wiki/General_Data_Protection_Regula...

https://gdpr-info.eu/art-3-gdpr/

Edit: speaking of personally identifiable information, GDPR defines the information that is subject to download as “personal” information, only when it can be identified. Do you have data on HN servers that is subject to GDPR even if you live in the EU? (I don’t think I do.)

See 4.1: https://gdpr-info.eu/art-4-gdpr/

La1n5y ago

>only when it can be identified.

Note that it also includes indirect identification, which means that if combined with other data it would identify you. Recital 30 might be of use here too;

>Natural persons may be associated with online identifiers provided by their devices, applications, tools and protocols, such as internet protocol addresses, cookie identifiers or other identifiers such as radio frequency identification tags. This may leave traces which, in particular when combined with unique identifiers and other information received by the servers, may be used to create profiles of the natural persons and identify them.

Rygian5y ago

My HN username (Rygian) is PII because it can be used to identify me indirectly (HN has a log of my username connecting from IP x.y.z.w, and my IP address is PII).

2 more replies

pbhjpbhj5y ago

Most people have an email address on their profile, that's PII. One could post one's name, that's definitely PII and AIUI that affects all the data then on the site, as it's now associated.

1 more reply

M2Ys4U5y ago

>GDPR is an EU law that applies to sites that market directly to EU citizens.

That is wrong. The GDPR does not make reference to citizenship.

It explicitly notes that it applies when either when the data subject is physically in the EU/EEA, or when the data controller/processor is based in the EU/EEA.

1 more reply

kijin5y ago· 9 in thread

Exporting your personal data is only half of the story. Importing is the other half.

Suppose I exported all of my posts, photos, contacts, and a bunch of metadata from social network A. Perhaps I could view the contacts in Excel and browse the photos in my favorite gallery app. But unless I can upload it all to social network B and continue as if I've been using B all along, the data is not really "portable". It's just a backup, a frozen snapshot that can't be unpacked anywhere else.

I'm not even sure if it makes any sense to import one's Twitter feed into Instagram or one's Facebook profile into Reddit. Edit: I'm not saying this is because of anti-competitive behavior on anyone's part. The services simply are so drastically different.

Mordisquitos5y ago

As you say, the data domains of different services may simply be logically incompatible, which is fair enough. As critical as I am of social media platforms, I don't think they should have any obligation to implement the abstract ability to "import" data in general. As long as they provide their users with the ability to export their data in a reasonable fashion, in a well-defined and consistent open format, and as parseable as technically feasible, I believe they have done their duty.

What's important is that a Twitter, or Facebook, or Reddit alternative should be able to implement data import from their respective competitor, without facing intentional anti-competitive difficulties. Let "the market" decide which services want to implement the import of which kind of data.

ncallaway5y ago

Absolutely. Service providers have a real incentive to implement _import_ functionality, since it means an opportunity for new users on the platform.

Since the incentives for the service providers line up with the users, I don't see a need for the government to regulate importing data.

Export is the exact opposite. The incentives for the service providers go directly against a user. After all, if the provider export a use's data, then then that user can just leave!

toyg5y ago

Where this is a real need, something will come up - somebody will develop a service, a utility that can be a bridge between the exported data and the new target. Without the export, this would not even be possible.

dane-pgp5y ago

> Exporting your personal data is only half of the story. Importing is the other half.

If we're wishing for things, I'd like to add the ability to have "live" exports/imports. By that I mean two services with comparable data types should be able to keep your accounts in synch, such that a change made on one service is reflected (after only a short delay) on the other service.

You would still have to visit Facebook if you wanted to see what your friends there were saying, but they would be able to see what you were posting on Mastodon, for example, without needing to create a Fediverse account.

BiteCode_dev5y ago

Putting a square into a round role does not make sense, but you may now be able to design a compatible square or round hole that one can move to.

beckingz5y ago

On the other hand every single app at one point would happily import your contacts no matter how they were formatted...

The incompatibility is by design.

1 more reply

matharmin5y ago

Without something like the GDPR, incentives are very different for exporting vs importing: Import functionality makes it easier for someone to switch to your service, while export functionality makes it easier for someone to switch away.

Additionally, if a service is missing import functionality, you can choose to not use it. While you'd often only realize you need export functionality while you've already used a service for years.

Maybe the case of porting data between social media services doesn't make much practical sense, but there are many other services where it does. Fitness tracking apps as an example - you'd likely want to take your history with if you switch services.

And usually these services would not be using the exact same data format for import vs export. But that's not as big an issue in practice - you'd often find third-party scripts or services that can do the conversion for you.

goodpoint5y ago

Data from a popular social network is gold.

Other social networks would implement import functions in a day.

dariosalvi785y ago

The gdpr actually says that data should be automatically transferred from one controller to another where technically feasible. https://gdpr-info.eu/art-20-gdpr/

mihaic5y ago· 5 in thread

Overall, I think GDPR is a positive force that protects consumers. It does have one major downside though, and that's that it treats entities of all sizes in the same way.

Placing the same regulatory burdens on start-ups as on big tech is a drag on innovation, and it's frustrating that there is no minimal cap on users before GDPR comes into effect, given how the EU has constant exception for artisanal food and goods manufacturers.

Lawyers and third parties want to get a piece of the pie, so they'll present themselves as indispensable. It's almost as if this is the EU version of TurboTax.

dheera5y ago

I think a much stronger solution would be to require all browsers to disable cookies by default, and let the user opt into websites that you need to sign into.

In its current state GDPR has effectively just littered the website with popups that don't let you disable "functional" cookies in the popup. Functional my ass. The pages work fine without them. I disable all cookies except on a short list of sites I need to log in. Unfortunate side effect is the goddamn GDPR popups keep popping up on all those news sites.

chrizel5y ago

Yes, I don't understand why this is not handled on the browser level. Back then in the 90s browsers asked the user for every web page that wanted to store a cookie. The situation we have now is not much worse.

But now every web page (at least in the EU) makes some kind of ugly popover and many of them try to convince you to accept all tracking with some kind of dark patterns of UI design.

I don't understand why we won't stop all this nonsense and build this stuff into web browsers as opt-in or opt-out (let the user decide) and therefore ensure that the UI is always the same without any dark patterns.

1 more reply

TheManInThePub5y ago

*sigh*

Here we go again.

The GDPR has no problem AT ALL with cookies. Use as many as you like with no need for popups. However, if you are using cookies to track or personally identify me (advertisers take a bow), then you need to ask my permission to do so. And so you should.

I am unaware of how a browser may possibly be used to block only personally identifying cookies, and besides, putting the onus to do so onto the data owner is against the principle of the GDPR; that personal data is MINE and you must ask my permission to use it.

capableweb5y ago

> and that's that it treats entities of all sizes in the same way

Go back and read GDPR again because it doesn't. There are exemptions for smaller business. I don't remember exactly what they are, but I remember reading some of the provisions are excluded for companies under 250 employees or something like that.

M2Ys4U5y ago

>it's frustrating that there is no minimal cap on users before GDPR comes into effect, given how the EU has constant exception for artisanal food and goods manufacturers.

Privacy (including the right to protection of one's personal data) is a human right. Creating artisanal food is not.

beyondcompute5y ago· 4 in thread

Absolutely! I remember asking to export my data from one of the services and the support pretty much ignored me (they replied in general but “forgot” to mention anything related to that question).

grishka5y ago

I wanted to get my data out of ask.fm because I answered quite a lot of questions there back when it was fun. The GDPR export option was nowhere to be found. Opened a support ticket, they asked me for a EU ID... Well, yeah, I don't have one, I'm not a EU resident, I wanted to piggyback on the laws of countries that actually care about their people. But it just struck me that they hate their users this much. Even Facebook didn't go this low.

On an absolutely unrelated note, I reverse engineered ask.fm's client API back when I was actually using it.

wizzwizz45y ago

Under GDPR I think they're not allowed to require an EU ID. So just say “I'm not required to give you my personal data for this”.

3 more replies

varispeed5y ago

Companies think that the data that is portable is your email address, profile picture, address, IP addresses - but other things like posts, comments are not. It is actually not well defined in GDPR and if portability means transferring your profile (e.g. username, email and some details about you only), then GDPR is pretty much useless in that regard.

account425y ago

> Companies think

Which ones have you tried exporting your data from?

varispeed5y ago· 4 in thread

The problem is that the what actually has to be made portable is not well defined in GDPR. It basically says that it means any data by which the user can be identified by. When I requested a data export from some companies their legal team argued, that they cannot send me my projects in a structured format nor they won't allow import of project files from another service, because this is not a personal data (or rather not a PII). So this is actually another area where GDPR is all bark but no bite. I could only request my personal data, that is my name, address, email and IP addresses...

dariosalvi785y ago

What the law doesn't specify is if data associated to personal data should be included or not. It's kind of paradoxical because I may have a table in the DB with the details about the user, like name and email, and another table about, say, sexual preferences (usually considered quite sensitive). I could argue that the sexual preferences table is not personal of viewed alone, but of course it is very personal when linked through an ID.

I think that, as long as the ID is there, and that data is therefore linkable, that data IS personal.

IANAL, but that company's reply is bullshit and could be easily brought to court.

varispeed5y ago

> IANAL, but that company's reply is bullshit and could be easily brought to court.

Nobody has money and time for that unfortunately and if you are using a platform that you like or need, then it is impossible, because you'll risk getting your account deleted.

1 more reply

pmlnr5y ago

> The problem is that the what actually has to be made portable is not well defined in GDPR

Wait, is there anything well defined in GDPR?

EDIT: k, so for the downvoters: I mean it. GDPR is muddy at best. Think IP addresses: in most cases, they are _not_ PII, especially when it's a dynamic IP from an ISP, yet nearly everything insists on hiding IPs or converting them into geo information.

TeMPOraL5y ago

> Think IP addresses: in most cases, they are _not_ PII, especially when it's a dynamic IP from an ISP

In other words: they can be PII, and you can't easily determine which of them aren't - the way they're being assigned is out of your control.

(Even in cases people think IPs aren't PII, they become so when combined with other datasets - dynamic IPs can be quite stable.)

> Wait, is there anything well defined in GDPR?

In terms of singling out particular technologies? No. In terms of defining principles and criteria? Very much yes. If GDPR went the other way, it would be trivial to work around by subtly changing the technologies or data involved.

mxmilkiib5y ago· 3 in thread

Remembering http://dataportability.org etc

djdeutschebahn5y ago

Maybe these two are also relevant:

https://github.com/google/data-transfer-project

https://datatransferproject.dev/

ot11385y ago

Do you know what happened to them (and/or some of the other companies/projects/initiatives that launched with the same goals)?

mxmilkiib5y ago

I've a messy collection of links (many of which I need to web.archive.org fix) on https://wiki.thingsandstuff.org/Open_social#DataPortability that you might be interested in.

Basically, companies thought it more profitable to not put any effort into letting users escape their service (or keep chat federated, etc.)

Various threads are still around though.

1 more reply

jFriedensreich5y ago· 3 in thread

it took me fighting 6 months with viacom support to get my song plays for last.fm . spotify improved from 2 weeks to 2 days but its still ridiculous to call something true data portability that is not automatic and not instant. a lot of companies tried giving me semi obfuscated pdfs or html without classes or classes that were random strings, we need to improve the law to enforce instant availability and an industry standard format like json or xml. also this needs to be completely automatable without having to do it myself.

capableweb5y ago

> it took me fighting 6 months with viacom support to get my song plays for last.fm

Sue them. It should be faster than 30 days according to GDPR.

> a lot of companies tried giving me semi obfuscated pdfs or html without classes or classes that were random strings

Giving you obfuscated data is also against GDPR as the data needs to be clearly machine-readable. Again, sue them as you now have two points against them.

jeroenhd5y ago

The GDPR does not give you any way to sue them directly. You can report the company to your country's DPA, which should look into the issue and might take it with the offending party in a court of law. That is assuming that the parent is actually an EU citizen or a foreign citizen living in the EU; if they aren't, the GDPR doesn't apply to them.

I see a lot of (mostly non-EU) commenters thinking that the GDPR is grounds for any individual to sue any company for practically anything because privacy is hard, (which is probably why everyone was so hyped to hate on the GDPR) but that's just not how it works.

As much as I value data portability, I'd much rather see a DPA sue the hell out of the companies that make those ridiculous, illegal cookie walls and popovers filled with dark patterns instead.

ot11385y ago

Sue them under what law or jurisdiction?

Xavdidtheshadow5y ago· 3 in thread

For what it's worth Facebook and Instagram (also owned by FB, but is fairly separate product-wise) have pretty good export tools. You make a request in the web UI and a short time later, can download a zip with a bunch of JSON files. I was pleasantly surprised by how much they included.

fossislife5y ago

Under GDPR, they have to include everything (they admit) they have about you, isn't that right?

anticensor5y ago

Unsurprisingly enough, they only include what you entered yourself, but not derived data about you.

Xavdidtheshadow5y ago

I think, but I'm not sure how accessible it has to be. I was mostly commenting on how approachable the data format was. Formatted json with descriptive keys.

I have no idea what the law requires about the data format, so they could be doing the absolute minimum.

robin_reala5y ago· 3 in thread

The best use of GDPR for data portability that I’ve ever seen was right here on HN: https://news.ycombinator.com/item?id=24764371

Long story short, Confiks takes Spotify to task for removing the API that SongKick used to retrieve playlist data; a short time and several factual emails later they restore API access.

okamiueru5y ago

It's great that it made Spotify activate a useful API they had little good reason to terminate. However, that "victory" always left me thinking a lot of people missed the point, including the person who challenged Spotify, as well as Spotify employee who responded to those emails, who confirmed they had little good reason to disable/remove the API.

To summarize, Spotify can of course terminate any API they so wish, and there is absolutely nothing in GDPR that compels them to keep it up. As long as they of course still follow the requirements by GDPR. As annoying as a workaround would be for Confiks, emailing the data within those 30 days to SongShift is acceptable and in compliance. There is nothing in GDPR that says "data transfer options once enabled cannot be removed and/or changed".

Quoting the clause "the data subject shall have the right to have the personal data transmitted directly from one controller to another, where technically feasible.", always seemed off with what is being demanded, because, they would be complying with that requirement by compiling and archive of the data, a big dump, and send it to SongShift. "I demand that you re-enable the API", on the other hand has no basis whatsoever in GDPR. The quote follows with "... or allow for some other method to allow me to exercise my rights under the GDPR", which is exactly right.

Spotify re-enabled the API because they chose to, not because they had to.

To clarify: I think it was great that Confiks convinced Spotify to re-enable the API. But, the arguments presented were not particularly convincing, and I took this as a case of Spotify doing the right thing, rather than a "David vs Goliath".

wizzwizz45y ago

> Spotify re-enabled the API because they chose to, not because they had to.

Spotify re-enabled the API because:

• they already had the API, so they couldn't claim it was an undue development burden to re-enable it;

• it was cheaper than setting up another system based on manually emailing large chunks of the database; and

• it was good press.

2 more replies

gpm5y ago

Not that this impacts your broader point, but

> the data within those 30 days

This is a common misconception. The requirement is

> GDPR information must be provided without undue delay but at latest within one month. Only in reasoned cases may this one-month deadline be exceptionally exceeded.

The operative requirement is without undue delay, one month is a maximum on the definition of undue delay, not a minimum. Intentionally delaying compliance for the better part of a month after you have demonstrated the ability to send the data within seconds would undoubtedly constitute undue delay... (Some exceptions apply)

1 more reply

jbverschoor5y ago· 3 in thread

What about data-portability of in-game assets?

gpm5y ago

You want a csv saying what "this account has this item" flags are set for your account in the database? Ya, you can probably get that, for all the good it will do you.

55606752605y ago

Theoretically you could ask for your account data in a game and for it to be sent to developers of another game. But there are very few if any incentives for receiving party to honour your purchases somewhere else. And even if they would chose to do this - they will not be able to provide you same assets (unless we are talking about some unity store bought models).

selfhoster115y ago

Let's solve this with NFTs!

Partly joking, but partly serious. Maybe we could use something like this.

slver5y ago· 3 in thread

Facebook's data worth $1294 per user per year is probably BS. It'd mean Facebook generating 3.5 TRILLION in PROFIT every year.

loeg5y ago

The website currently says, "up to $1294." The phrase suggests that at least one user is worth that much and the rest are worth less. One might imagine that that user is clicking on a ton of ads every year (and buying the products).

mehdimOP5y ago

author here. We divided the number of revenues and the marketcapitalization per regional revenues US user : $1294 market cap in average, EU user : $494 market cap in average, Asia $109, Rest of the world $80 It is explained in more detail in the report

bombcar5y ago

Maybe they took Facebook's market cap (about a trillion) and divided by yearly active users or something.

LeonM5y ago· 2 in thread

Though an important part of privacy, I think this data portability is really useless, for both consumers and businesses.

1. There is no format defined in which the data must be given

2. There is no feasible way of defining such format anyway

3. Thanks to 1 and 2, there is no feasible way of importing this data into another service.

AFAIK, a consumer has the right of requesting the export in any chosen format, but it says nowhere in the GDPR that the data controller must supply this for free [0]. As a business you are allowed to charge a fee to cover the cost of exporting the data, as long as this fee is considered 'reasonable'.

[0] Note: IANAL, a befriended paralegal told me this.

ocdtrekkie5y ago

If the right exists, tools can be built to exploit it. For example, one could write a tool that ingests Google Takeout data to convert it for other platforms, because the data output exists in a... somewhat... usable form.

Presumably, someone wanting to build a service that encourages people to migrate from Google services could build an import tool to their platform with it.

simpss5y ago

data needs to be provided in a machine readable format.

From there, a specific parser/converter needs to be built by the competing service for imports.

capableweb5y ago· 1 in thread

Not sure it got forgotten, I think members of https://datatransferproject.dev/ didn't start actively moving until GDPR came into effect, and it seems they are doing _something_, although still it's very basic.

jvalencia5y ago

https://github.com/google/data-transfer-project

nicbou5y ago

There are still some issues with it (incomplete data, manually triggered data exports), but it's a notable improvement nonetheless.

It's particularly valuable when it lets you export instant messaging conversations and shared photo albums. It means that companies cannot hold your data hostage to keep you on their platform.

I use GDPR exports for a personal data thing I'm building [0][1]. It simply wouldn't work without GDPR, because public APIs are increasingly rare. Most of your personal data is locked and GDPR data exports are usually the only way to access it on your own terms.

[0] Intro: https://nicolasbouliane.com/projects/timeline

[1] Code: https://github.com/nicbou/timeline

mehdimOP5y ago

Co-author here of the research. The most simple and effective and rapid solution would be to impose API neutrality. As explained in the report, it would just obliges API providers to give back the same API access to users than they give to their partners. For instance, why I get less data from Facebook if I ask my personal data, than if I create an app and ask maximum app permission (all OAuth scopes)? API neutrality already works. For instance, Open banking in UK and PSD2 in Europe apply API neutrality. Any 3rd party can access to a bank API if they are granted by the user to do so. After 2 years, for instance, up to 20% of the UK online banking population beneficiated from it as "Banking data Portability via APIS" . 20% is huge. If FAMGAs and all other big companies data was accessible via "neutral APIs" to users, data portability would be "a thing"

Also, the fact that you don't know what to do with you data dump in JSON is a blocker. With APIs, integrations by 3rd parties are simpler and more user oriented.

Last point, with API neutrality, no need of maximizing "interoperablity" (even is is always useful and makes things simpler, we have seen that with DataTransferProject it does not work really as companies don't work with the same data model) Developers will do the matching work between the original app and the destination app, no worries, when incentive is here, middleware glue will come. The problem these days is that the source of data is useless, has no value, so no incentive. You can look at this study with GDPR Facebook data value for developers https://www.law.nyu.edu/centers/engelberg/pubs/2019-11-06-Da... The main question is : Why a Facebook GDPR Data dump/takeout has no value for developers where Facebook API has value for millions of applications developers and businesses? With API neutrality it will have maximum value for users (as it has already value for partners) and minimizing fatigue to implement portability (an API is lot more developer friendly than a JSON dump that you receive in 30 days via email and that the user need to upload somewhere)

brutuscat5y ago

ZKP all the way https://www.aepd.es/en/prensa-y-comunicacion/blog/encryption...

dundarious5y ago

2 of the 6 ways portability is broken are duplicates of each other.

Danski05y ago

Forgotten? It's a basic GDPR article (article 20) that every somewhat serious EU company know.

Link bait by a company making their profit of GDPR confusion.

j / k navigate · click thread line to collapse

217 comments

122 comments · 20 top-level

maxdo5y ago· 28 in thread

TeMPOraL5y ago

Because you're a good person and care about providing value to your users, and not just extracting value from them.

ryandrake5y ago

While I agree 100% with this response, let's for the sake of argument assume OP is not a good person, and doesn't actually care about providing value to users.

...and so far, at least three people actually took time out of their day to go find the "down" arrow on this obviously raving insane viewpoint :) I love you guys!

3 more replies

toolz5y ago

The implementation of such a law is impossible as far as I can tell and opens up huge vulnerabilities to smaller companies.

Just imagine when large companies can hire lobbyists that can force a data protocol on the smaller businesses.

The spirit of many laws is great, the implementation is unfortunately, what actually matters and I don't see solutions to these hard problems.

Allow me to go on a soapbox here, but far too many laws are created with good intentions that are destroying competition and hurting the end users.

1 more reply

golergka5y ago

> Because you're a good person and care about providing value to your users, and not just extracting value from them.

That's a false dichotomy that is also misrepresents the nature of a typical business transaction.

2 more replies

est315y ago

goodpoint5y ago

Because removing an exit barrier means removing lock-in.

Not holding customers data hostage can increase your service adoption.

E.g. many companies would not pay for a web-only email service where you cannot download and backup emails.

E.g. A lot of people pay for non-locked books (epubs) that can be carried over across different devices.

Governments across the world broke lock-in mechanisms for decades (e.g. carrying phone numbers, being able to buy gas/car oil/car tires/PC components/ from independent vendors)

WA5y ago

You don't write an API to port stuff to your competitor. You write a JSON or CSV export and competitors can then make an import tool for your data format (and vice versa).

Is this really an effort? It's basically a JOIN over a bunch of tables or maybe the JSON state tree of your SPA and that's about it.

Chances are, your startup works with all data of a user and has a way to request all data from the DB anyways.

dariosalvi785y ago

A company that builds houses would very much avoid building those pointless and expensive security features. Why would they spend a second of their architects' time on that?

1 more reply

croes5y ago

I'll re-phrase: why should I care about the requests of my users? Now you know why they prefer your competitor over your better UI. Your UI may be better, but your UX sucks.

jcelerier5y ago

You're exactly the kind of person I hope my government protects me of. Companies are not meant to enrich yourself but to make the world better.

google2341235y ago

Companies are not meant to make the world better...

1 more reply

ClumsyPilot5y ago

I'll rephrase. Imagine I'm a startup. If someone tell me, I want to transfer my savings to a conpetitor, why should I care about this request?

The answer should be obvious, it's their data just like it's their savings.

jensus5y ago

the mental shift seems to be to not regard your customers data as your product but rather focus on your service as your product

kspacewalk25y ago

(Yes, of course it's an industry talking point. The best kind - one that's true and valid, and so far not effectively refuted).

7 more replies

tomcooks5y ago

Because it's not your data, it's mine?

cromulent5y ago

Barriers to exit are also barriers to entry.

shuntress5y ago

Because this is also required of your competitor and will allow users port their data into your startup which gives you a chance to compete.

williamtwild5y ago

Being required and complying with that requirement are two different things.

1 more reply

toomuchtodo5y ago

Isn’t engineering time cheaper than legal counsel time when your customers file complaints with the government against your org for not adhering to the law?

JumpCrisscross5y ago

> engineering time cheaper than legal counsel time

Until that changes, GDPR compliance will remain a courtesy. Not a right.

2 more replies

MattGaiser5y ago

Is any legal counsel time actually being spent on this? It seems like all the disability legislation. In theory it applies to websites. In practice, few give it a 2nd thought.

I have yet to hear of a company significantly harmed by failing to consider accessibility.

sam_lowry_5y ago

Yes unless you already have lawers on staff.

bombcar5y ago

This covers a good argument as to why: https://www.joelonsoftware.com/2000/06/03/strategy-letter-ii...

And it's true - there are a number of services for work that we've never tried because there's no easy way "back".

matheusmoreira5y ago

La1n5y ago

I bet there are more laws that a company would love not to follow, but it's the law and thus you'll need to spend time implementing it.

dbetteridge5y ago

Because the data isn't yours, it belongs to the customer.

That is the opinion that GDPR encodes into law

pmlnr5y ago

Erm... because you need to follow laws. Your company would file tax records, right? And follow fire and building regulations in the office, correct? So why would it not follow GDPR?

1 more reply

0xbadcafebee5y ago

It's not your responsibility to help your customers use a competitor's service, so you definitely don't have to care about that. However, you might care if you practice "dogfooding".

somethingAlex5y ago· 17 in thread

What are consumers intuitively expecting compliance with this law to look like?

Are consumers going to implement ETL pipelines to achieve portability? Are they expecting to hook up streaming mechanisms for enormous swathes of data?

izacus5y ago

sethhochberg5y ago

These are things that are all over the back offices of your favorite streaming service, but mostly transparent to the consumer.

1 more reply

redwall_hp5y ago

1 more reply

dsr_5y ago

There are incentives for, say, Mastodon to be able to ingest your tweeting history, or for Linked-In to eat your Facebook social graph.

There's no incentive other than the law for Twitter or Facebook to make that data exportable.

StopHammoTime5y ago

That’s why laws exist.

There’s generally no incentive to not kill someone except going to jail.

4 more replies

Guillaume865y ago

cbm-vic-205y ago

Should music streaming services be compelled to support MusicBrainz to support this GDPR case simply because it is commonly used? Who decides that mbid is the GDPR-accepted track identifier?

2 more replies

kelnos5y ago

Artist + Song Title (+ Album and track number, if it's from an album) should be enough to disambiguate in enough cases for someone to consider this "portable".

Beyond that, we have music fingerprint IDs that a service could output in the data dump along with their own service-specific ID.

> Are Apple and Spotify supposed to agree on a common file format?

For something as simple as this, yes, absolutely they should. It's bonkers that they don't and wouldn't, aside from garbage anti-competitive lock-in reasons.

908B64B1975y ago

If I was Spotify I would export that as an SQLite DB. Maybe the metadata catalog as a standalone DB too.

Apple Music has an API[0] so it's already mostly possible to import a list of songs in it.

"Data Portability" is so vaguely defined that I can't help but see it as yet another law that EU bureaucrats will use to fleece (American) "Evil Tech Giants".

[0] https://developer.apple.com/documentation/applemusicapi

anticristi5y ago

Similarly, it is up to industry -- either via standardization bodies or courts -- to clarify what exactly is "data portability".

2 more replies

hnick5y ago

> Are consumers going to implement ETL pipelines to achieve portability? Are they expecting to hook up streaming mechanisms for enormous swathes of data?

capableweb5y ago

This is from the actual article:

The full article of "Data Portability" is not that long, you can read it here: https://gdpr-info.eu/art-20-gdpr/

Helmut100015y ago

1 more reply

irrational5y ago

Isn’t “Where technically feasible” a huge loophole?

1 more reply

account425y ago

The emportant part that this law should achieve is to get the data out of the service. This at least allows competing services to provide importers, which is in their interest.

anticristi5y ago

I agree. I could develop my own ETL and wasn't sure what to do with this right. Where would I import my Klarna Checkout history and for what purpose?

I guess this law is there to ensure your images can be transferred from Dropbox to Google Drive to Apple Cloud, without any of them being tempted to pull the plug.

1 more reply

mulmen5y ago

This is so sad. That we have already forgotten how easy this is. And that we do not see that data integration is an obvious case for open standards and development.

The only reason services do not offer you data export in an easily digestible format is that they want you to stay in the app.

tester345y ago· 14 in thread

where can I download my HN's data?

notRobot5y ago

You can't. There's also no easy way to request all profile data deletion, unfortunately.

However, they do respond to privacy requests, see:

https://news.ycombinator.com/item?id=26959559

https://news.ycombinator.com/item?id=26410165

capableweb5y ago

2 more replies

capableweb5y ago

Last time I "archived" my account data on HN I used https://github.com/HackerNews/API which seems to be working good enough for my needs.

thatguy09005y ago

Hn has no EU presence so doesn't have to follow EU laws, no? Or do they have to ip block Europeans? What would the EU actually do to hn if they did decide to enforce the rules here?

burntoutfire5y ago

Typical approach is issue a fine and then seize the assets in the EU that belong to HN's owners (if there are any).

alexaholic5y ago

GDPR is about data, not companies. It applies to all entities regardless of where they are established as long as they're doing business in the EU or processing data of EU citizens.

1 more reply

tremon5y ago

Indeed, my answer would be no. But IANAL, IANYL and TINLA.

There's https://gdpr.eu/companies-outside-of-europe/ :

Recital 23 clarifies what is meant by the organization offers goods or services to people in the EU: https://gdpr.eu/Recital-23-Applicable-to-processors-not-esta...

Profiling is clarified in recital 24: https://gdpr.eu/Recital-24-Applicable-to-processors-not-esta...

1 more reply

vincnetas5y ago

There is public API for HN data

https://github.com/HackerNews/API

Does it count like ability to download your data?

user-the-name5y ago

No. It needs to be accessible to everyone, not just to programmers with lots of free time.

dahart5y ago

Question: are you an EU citizen, and is there any way for HN to know whether you are an EU citizen? (Your public profile page has no personally identifiable information.)

https://en.wikipedia.org/wiki/General_Data_Protection_Regula...

https://gdpr-info.eu/art-3-gdpr/

See 4.1: https://gdpr-info.eu/art-4-gdpr/

La1n5y ago

>only when it can be identified.

Note that it also includes indirect identification, which means that if combined with other data it would identify you. Recital 30 might be of use here too;

Rygian5y ago

My HN username (Rygian) is PII because it can be used to identify me indirectly (HN has a log of my username connecting from IP x.y.z.w, and my IP address is PII).

2 more replies

pbhjpbhj5y ago

Most people have an email address on their profile, that's PII. One could post one's name, that's definitely PII and AIUI that affects all the data then on the site, as it's now associated.

1 more reply

M2Ys4U5y ago

>GDPR is an EU law that applies to sites that market directly to EU citizens.

That is wrong. The GDPR does not make reference to citizenship.

It explicitly notes that it applies when either when the data subject is physically in the EU/EEA, or when the data controller/processor is based in the EU/EEA.

1 more reply

kijin5y ago· 9 in thread

Exporting your personal data is only half of the story. Importing is the other half.

Mordisquitos5y ago

ncallaway5y ago

Absolutely. Service providers have a real incentive to implement _import_ functionality, since it means an opportunity for new users on the platform.

Since the incentives for the service providers line up with the users, I don't see a need for the government to regulate importing data.

Export is the exact opposite. The incentives for the service providers go directly against a user. After all, if the provider export a use's data, then then that user can just leave!

toyg5y ago

dane-pgp5y ago

> Exporting your personal data is only half of the story. Importing is the other half.

BiteCode_dev5y ago

Putting a square into a round role does not make sense, but you may now be able to design a compatible square or round hole that one can move to.

beckingz5y ago

On the other hand every single app at one point would happily import your contacts no matter how they were formatted...

The incompatibility is by design.

1 more reply

matharmin5y ago

Additionally, if a service is missing import functionality, you can choose to not use it. While you'd often only realize you need export functionality while you've already used a service for years.

goodpoint5y ago

Data from a popular social network is gold.

Other social networks would implement import functions in a day.

dariosalvi785y ago

The gdpr actually says that data should be automatically transferred from one controller to another where technically feasible. https://gdpr-info.eu/art-20-gdpr/

mihaic5y ago· 5 in thread

Overall, I think GDPR is a positive force that protects consumers. It does have one major downside though, and that's that it treats entities of all sizes in the same way.

Lawyers and third parties want to get a piece of the pie, so they'll present themselves as indispensable. It's almost as if this is the EU version of TurboTax.

dheera5y ago

I think a much stronger solution would be to require all browsers to disable cookies by default, and let the user opt into websites that you need to sign into.

chrizel5y ago

But now every web page (at least in the EU) makes some kind of ugly popover and many of them try to convince you to accept all tracking with some kind of dark patterns of UI design.

1 more reply

TheManInThePub5y ago

*sigh*

Here we go again.

capableweb5y ago

> and that's that it treats entities of all sizes in the same way

M2Ys4U5y ago

>it's frustrating that there is no minimal cap on users before GDPR comes into effect, given how the EU has constant exception for artisanal food and goods manufacturers.

Privacy (including the right to protection of one's personal data) is a human right. Creating artisanal food is not.

beyondcompute5y ago· 4 in thread

grishka5y ago

On an absolutely unrelated note, I reverse engineered ask.fm's client API back when I was actually using it.

wizzwizz45y ago

Under GDPR I think they're not allowed to require an EU ID. So just say “I'm not required to give you my personal data for this”.

3 more replies

varispeed5y ago

account425y ago

> Companies think

Which ones have you tried exporting your data from?

varispeed5y ago· 4 in thread

dariosalvi785y ago

I think that, as long as the ID is there, and that data is therefore linkable, that data IS personal.

IANAL, but that company's reply is bullshit and could be easily brought to court.

varispeed5y ago

> IANAL, but that company's reply is bullshit and could be easily brought to court.

Nobody has money and time for that unfortunately and if you are using a platform that you like or need, then it is impossible, because you'll risk getting your account deleted.

1 more reply

pmlnr5y ago

> The problem is that the what actually has to be made portable is not well defined in GDPR

Wait, is there anything well defined in GDPR?

TeMPOraL5y ago

> Think IP addresses: in most cases, they are _not_ PII, especially when it's a dynamic IP from an ISP

In other words: they can be PII, and you can't easily determine which of them aren't - the way they're being assigned is out of your control.

(Even in cases people think IPs aren't PII, they become so when combined with other datasets - dynamic IPs can be quite stable.)

> Wait, is there anything well defined in GDPR?

mxmilkiib5y ago· 3 in thread

Remembering http://dataportability.org etc

djdeutschebahn5y ago

Maybe these two are also relevant:

https://github.com/google/data-transfer-project

https://datatransferproject.dev/

ot11385y ago

Do you know what happened to them (and/or some of the other companies/projects/initiatives that launched with the same goals)?

mxmilkiib5y ago

I've a messy collection of links (many of which I need to web.archive.org fix) on https://wiki.thingsandstuff.org/Open_social#DataPortability that you might be interested in.

Basically, companies thought it more profitable to not put any effort into letting users escape their service (or keep chat federated, etc.)

Various threads are still around though.

1 more reply

jFriedensreich5y ago· 3 in thread

capableweb5y ago

> it took me fighting 6 months with viacom support to get my song plays for last.fm

Sue them. It should be faster than 30 days according to GDPR.

> a lot of companies tried giving me semi obfuscated pdfs or html without classes or classes that were random strings

Giving you obfuscated data is also against GDPR as the data needs to be clearly machine-readable. Again, sue them as you now have two points against them.

jeroenhd5y ago

As much as I value data portability, I'd much rather see a DPA sue the hell out of the companies that make those ridiculous, illegal cookie walls and popovers filled with dark patterns instead.

ot11385y ago

Sue them under what law or jurisdiction?

Xavdidtheshadow5y ago· 3 in thread

fossislife5y ago

Under GDPR, they have to include everything (they admit) they have about you, isn't that right?

anticensor5y ago

Unsurprisingly enough, they only include what you entered yourself, but not derived data about you.

Xavdidtheshadow5y ago

I think, but I'm not sure how accessible it has to be. I was mostly commenting on how approachable the data format was. Formatted json with descriptive keys.

I have no idea what the law requires about the data format, so they could be doing the absolute minimum.

robin_reala5y ago· 3 in thread

The best use of GDPR for data portability that I’ve ever seen was right here on HN: https://news.ycombinator.com/item?id=24764371

Long story short, Confiks takes Spotify to task for removing the API that SongKick used to retrieve playlist data; a short time and several factual emails later they restore API access.

okamiueru5y ago

Spotify re-enabled the API because they chose to, not because they had to.

wizzwizz45y ago

> Spotify re-enabled the API because they chose to, not because they had to.

Spotify re-enabled the API because:

• they already had the API, so they couldn't claim it was an undue development burden to re-enable it;

• it was cheaper than setting up another system based on manually emailing large chunks of the database; and

• it was good press.

2 more replies

gpm5y ago

Not that this impacts your broader point, but

> the data within those 30 days

This is a common misconception. The requirement is

> GDPR information must be provided without undue delay but at latest within one month. Only in reasoned cases may this one-month deadline be exceptionally exceeded.

1 more reply

jbverschoor5y ago· 3 in thread

What about data-portability of in-game assets?

gpm5y ago

You want a csv saying what "this account has this item" flags are set for your account in the database? Ya, you can probably get that, for all the good it will do you.

55606752605y ago

selfhoster115y ago

Let's solve this with NFTs!

Partly joking, but partly serious. Maybe we could use something like this.

slver5y ago· 3 in thread

Facebook's data worth $1294 per user per year is probably BS. It'd mean Facebook generating 3.5 TRILLION in PROFIT every year.

loeg5y ago

mehdimOP5y ago

bombcar5y ago

Maybe they took Facebook's market cap (about a trillion) and divided by yearly active users or something.

LeonM5y ago· 2 in thread

Though an important part of privacy, I think this data portability is really useless, for both consumers and businesses.

1. There is no format defined in which the data must be given

2. There is no feasible way of defining such format anyway

3. Thanks to 1 and 2, there is no feasible way of importing this data into another service.

[0] Note: IANAL, a befriended paralegal told me this.

ocdtrekkie5y ago

Presumably, someone wanting to build a service that encourages people to migrate from Google services could build an import tool to their platform with it.

simpss5y ago

data needs to be provided in a machine readable format.

From there, a specific parser/converter needs to be built by the competing service for imports.

capableweb5y ago· 1 in thread

jvalencia5y ago

https://github.com/google/data-transfer-project

nicbou5y ago

There are still some issues with it (incomplete data, manually triggered data exports), but it's a notable improvement nonetheless.

It's particularly valuable when it lets you export instant messaging conversations and shared photo albums. It means that companies cannot hold your data hostage to keep you on their platform.

[0] Intro: https://nicolasbouliane.com/projects/timeline

[1] Code: https://github.com/nicbou/timeline

mehdimOP5y ago

Also, the fact that you don't know what to do with you data dump in JSON is a blocker. With APIs, integrations by 3rd parties are simpler and more user oriented.

brutuscat5y ago

ZKP all the way https://www.aepd.es/en/prensa-y-comunicacion/blog/encryption...

dundarious5y ago

2 of the 6 ways portability is broken are duplicates of each other.

Danski05y ago

Forgotten? It's a basic GDPR article (article 20) that every somewhat serious EU company know.

Link bait by a company making their profit of GDPR confusion.

j / k navigate · click thread line to collapse