Taking Action Against Scraping for Hire (opens in new tab)

(about.fb.com)

23 pointsthebestmoshe3y ago11 comments

11 comments

11 comments · 8 top-level

ftyhbhyjnjk3y ago· 2 in thread

What hypocrite these companies(fb, google, linkedin, twitter, etc.) are.... Almost entirety of the content on their sites is user generated and shared with public scope. Then why the hell these companies think it's wrong to scrape that very "public" data.

The reason is simple: they just want to maintain their monopoly on the data. That's it. These companies themselves run their advertising companies based on the very same data and train their machine learning models, and also, actually sell it(cambridge analytics, ... possibly 1000's more through 3rd party partnerships to take the blame if things go wrong).

US companies are truly monstrous... by size, and their ability to care for others.

Nextgrid3y ago

Not disagreeing with your other points but Facebook didn't sell anything to Cambridge Analytica and in fact wasn't even really complicit. All they did was provide them with a way to request access to people's accounts as a standard OAuth2 flow. Idiots agreed to the access requests and as a result gave not only their own information but the information entrusted to them by friends.

The proper response to CA would've been to send everyone affected a notification saying "Your friend <name> shared your data with a third-party that is now known to be malicious. <Unfriend> <Learn More>".

The real issue with CA is that people are dumb enough to trust anything they see on the internet without a second thought. Why does a stupid "personality test" require access to my Facebook account? It doesn't and they should've declined the access prompt right there and flagged the post as malicious.

bornfreddy3y ago

To be fair, granting access to one's Facebook account is a pretty good indicator of one's personality. :)

Dayshine3y ago· 1 in thread

I use LinkedIn for two reasons:

1. So that other humans can use it in a personal capacity to find out about me.

2. As a way of boxing all recruiter spam into a single location.

The LinkedIn terms and conditions restrict usage of my data to those purposes. That's why I'm comfortable publishing my data there.

Having my personal data indexed by Google makes it easier for people to accomplish (1) and the copyright restrictions on that data result in (2).

To those saying that scraping of all "public" data should be unrestricted: how would you suggest I meet my above goals?

Nextgrid3y ago

If you don’t provide contact details besides a LinkedIn account then they can’t spam you anywhere else even if they got the data from scraping.

Obviously if you make your email public then all bets are off.

Nextgrid3y ago

So much bad faith in this press release but not surprising from such a disgusting company, with of course some China-related fear-mongering despite no evidence of wrongdoing.

> After paying for access to the scraping software, customers self-compromised their Facebook and Instagram accounts by providing their authentication information to Octopus.

They didn't "self-compromise" their account. They trust Octopus to act on their behalf, and unlike Facebook, Octopus' interests are most likely more aligned with their users' since their service is paid. This is no different from handing your Facebook credentials to your social media manager or secretary. There's no evidence that Octopus misused this access in any way.

> Octopus designed the software to scrape data accessible to the user when logged into their accounts, including data about their Facebook Friends such as email address, phone number, gender and date of birth, as well as Instagram followers and engagement information such as name, user profile URL, location and number of likes and comments per post.

This is either information people intend to be public or information they trust their friends to keep private. Now if Octopus was leaking the private information to third-parties it would be one thing, but so far I see no evidence Octopus was disclosing the scraped information to anyone but their customer (who is already authorized to access it).

> Meta is an industry leader in taking legal action to protect people from scraping and exposing these types of services

Translation: Meta is an industry leader in protecting its disgusting business model that hinges on making public data behind a walled garden with an unacceptable "privacy" policy. There wouldn't be a market for Octopus (or other scrapers) if Facebook already allowed customers to efficiently access information they're already entitled to, but that would be against their interests as their entire business hinges on information being held hostage.

They've created a problem, are selling the cure (well in this case monetizing it via ads) and are now pissed off that someone else is selling the cure for cheaper.

ftyhbhyjnjk3y ago

Scraping publicly shared data should actually be made legal. It's "public" data.

These companies themselves scrape entire internet. How come facebook content lands up in google search? Is that not scraping? Facebook also runs it's own scrapers over internet. What about those?

It's simple: a user choosing to share their data publicly "is" public data and NOT facebook's property. Fuck zuckerberg! The asshole.

srvmshr3y ago

I am surprised Meta took this aggressive approach considering they automatically populate autogenerated public pages with content drawn from Wikipedia whenever available (at least thats how it looks, with a tiny attribution to the source. But most importantly, Wikipedia infobox & page photo show up majority of the cases as primary information).

So Meta scraping Wikipedia is okay, but other entities scraping Facebook-owned media isn't?

mlboss3y ago

Didn't linkedin lost a lawsuit related to scraping https://www.zdnet.com/article/court-rules-that-data-scraping...

epups3y ago

Scraping these tech giants should not only be allowed, but encouraged.

asdadsdad3y ago

Not surprised at all, if your business is all about selfishly leveraging people's personal data.

j / k navigate · click thread line to collapse

11 comments

11 comments · 8 top-level

ftyhbhyjnjk3y ago· 2 in thread

US companies are truly monstrous... by size, and their ability to care for others.

Nextgrid3y ago

bornfreddy3y ago

To be fair, granting access to one's Facebook account is a pretty good indicator of one's personality. :)

Dayshine3y ago· 1 in thread

I use LinkedIn for two reasons:

1. So that other humans can use it in a personal capacity to find out about me.

2. As a way of boxing all recruiter spam into a single location.

The LinkedIn terms and conditions restrict usage of my data to those purposes. That's why I'm comfortable publishing my data there.

Having my personal data indexed by Google makes it easier for people to accomplish (1) and the copyright restrictions on that data result in (2).

To those saying that scraping of all "public" data should be unrestricted: how would you suggest I meet my above goals?

Nextgrid3y ago

If you don’t provide contact details besides a LinkedIn account then they can’t spam you anywhere else even if they got the data from scraping.

Obviously if you make your email public then all bets are off.

Nextgrid3y ago

So much bad faith in this press release but not surprising from such a disgusting company, with of course some China-related fear-mongering despite no evidence of wrongdoing.

> After paying for access to the scraping software, customers self-compromised their Facebook and Instagram accounts by providing their authentication information to Octopus.

> Meta is an industry leader in taking legal action to protect people from scraping and exposing these types of services

They've created a problem, are selling the cure (well in this case monetizing it via ads) and are now pissed off that someone else is selling the cure for cheaper.

ftyhbhyjnjk3y ago

Scraping publicly shared data should actually be made legal. It's "public" data.

These companies themselves scrape entire internet. How come facebook content lands up in google search? Is that not scraping? Facebook also runs it's own scrapers over internet. What about those?

It's simple: a user choosing to share their data publicly "is" public data and NOT facebook's property. Fuck zuckerberg! The asshole.

srvmshr3y ago

So Meta scraping Wikipedia is okay, but other entities scraping Facebook-owned media isn't?

mlboss3y ago

Didn't linkedin lost a lawsuit related to scraping https://www.zdnet.com/article/court-rules-that-data-scraping...

epups3y ago

Scraping these tech giants should not only be allowed, but encouraged.

asdadsdad3y ago

Not surprised at all, if your business is all about selfishly leveraging people's personal data.

j / k navigate · click thread line to collapse