Why is Stack Overflow trying to start audio?

(meta.stackoverflow.com)

909 pointsiokanuon7y ago402 comments

180 comments · 44 top-level

Nick-Craver7y ago· 29 in thread

I just wanted to chime in from Stack Overflow here and let people know: we are aware of the issue. And we're NOT okay with it. We're trying to sort out how to kill the audio behavior now. It's not very straightforward to find where it's coming from, but we are working on it. We've also reached out to Google for their assistance in tracking it down. If anyone can offer advice, we'll more than happily take it.

- Nick Craver, Architecture Lead at Stack Overflow

coldpie7y ago

Why are you allowing arbitrary javascript to be served to your users?

nerdponx7y ago

Wish I could upvote this 1,000 times.

It's ridiculous. It's a text-based ad. At worst, it's a clickable image. At what point did it become okay in your minds to let advertisers run arbitrary code?

I've left ads turned on specifically on StackOverflow because 1) I want to support StackOverflow, and 2) I trust them not to run malicious ads.

I don't even care that they're running ads network-wide. But if they're going to be running these kinds of ads anywhere on the site, they're going right on the ad block list along with everyone else.

Ajedi327y ago

I think this comment[1] on the linked Meta question explains it pretty well:

> To the people confused why ads need to run their own Javascript (even ones that are just static images): The short answer is that Ad Networks do not and cannot trust website operators. They need to run their own JavaScript served from their own servers in order to verify that a real user saw the ad and for how long, and they can't trust the website operator to tell them. And these pieces of JavaScript tend to be more invasive and privacy-destroying than the website's JS because they care, far more than the actual website does, that the "user" is not a bank of iphones in a sweatshop in China.

[1]: https://meta.stackoverflow.com/questions/386487/why-is-stack...

wlesieutre7y ago

Not just arbitrary JavaScript, arbitrary JavaScript where they can’t easily even see where it came from! Sheesh.

Could we require advertisers to sign their ad code to have a trail of where it came from, prevent tampering, and make it easier to pull the plug on bad actors?

The people bearing the costs of the internet ad economy aren’t the people in any position to do anything about it. So there’s very little pressure to fix anything.

Maybe if the US government started threatening to enact something like GDPR unless the ad tech industry gets its shit together.
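A minimal sketch of the tamper-evidence half of this idea, assuming the publisher reviews an ad script once and pins its hash. The value format follows Subresource Integrity (`sha384-` plus a base64 digest); it does not cover signing or provenance, and the script contents here are hypothetical.

```python
import base64
import hashlib


def sri_digest(script_bytes: bytes) -> str:
    """Return an SRI-style value (sha384, base64) for a script body."""
    digest = hashlib.sha384(script_bytes).digest()
    return "sha384-" + base64.b64encode(digest).decode("ascii")


def matches(script_bytes: bytes, expected: str) -> bool:
    """Check whether a fetched script still matches the pinned value."""
    return sri_digest(script_bytes) == expected


# Hypothetical ad script, reviewed once and pinned by its digest.
reviewed = b"console.log('static banner');"
pinned = sri_digest(reviewed)

print(matches(reviewed, pinned))                            # unchanged script
print(matches(reviewed + b"new AudioContext();", pinned))   # modified script
```

A pinned hash only works for scripts that never change, which is likely why it is a poor fit for ad code that is regenerated per request.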

m0dest7y ago

The solution is in sight. It's called Feature Policy.

https://feature-policy-demos.appspot.com/

https://developers.google.com/web/updates/2018/06/feature-po...

_eht7y ago

Why are you allowing arbitrary JavaScript to run on your device?

zhangjunphy7y ago

Revenues are important. The users will not notice unless something happens. And when something happens they forget fast.

runn1ng7y ago

More money that way

gotodengo7y ago

From the post:

"The ad is attempting to use the Audio API as one of literally hundreds of pieces of data it is collecting about your browser in an attempt to "fingerprint" it... Your browser may be blocking this particular API, but it's not blocking most of the data."

Seems like killing the audio is the metaphorical putting a finger in the dyke of serving arbitrary JavaScript to your users.

Benjammer7y ago

Maybe in the dyke holding back user outrage, but the dyke of serving arbitrary JavaScript was never built in the first place.

inferiorhuman7y ago

Nick, how did things go so wrong from three years ago?

e.g. https://news.ycombinator.com/item?id=20289841

Nick-Craver7y ago

I don’t know. I am so very much trying to find out and push to make things better.

Coding_Cat7y ago

> we are aware of the issue.

> We're trying to sort out how to kill the audio behavior now.

Are you really aware of the issue? The issue people have here is not the fact that the ad is trying to access the audio api per se but that it is trying to fingerprint the users.

wtmt7y ago

If you're "NOT okay with it", how about stopping ads completely until you resolve this problem? That should give a bigger impetus to solve it ASAP as the bottom line gets hit for multiple stakeholders.

This is not just ads, but about fingerprinting and tracking users somehow or the other by third parties. It's plain evil, and not a decent thing to continue foisting on your unsuspecting users after you've known it. Tell management to take an ethical stance and preserve the reputation of SO.

stevenjohns7y ago

Probably not his call. By "we" he's probably talking about the engineering team, which in many cases is nothing more than a conduit for whims of the marketing and sales teams.

The only time they'd do that is if the marketing team decided that the value-add from taking ads off cancelled out the profit loss from taking the ads off.

MzHN7y ago

So, we have:

- Stack Overflow makes a blog post about not using dynamic ads.

- Dynamic ads found on Stack Overflow, with aggressive fingerprinting.

- Architecture Lead doesn't know how this happened and is getting serious.

I have so many questions. I hope this gets a post-mortem.

amluto7y ago

The fundamental problem seems to be that you are including non-sandboxed JavaScript that you don’t control.

Perhaps you should stop doing that.

shostack7y ago

Would something like SafeFrame have avoided this issue?

https://www.iab.com/guidelines/safeframe/

geocar7y ago

Hi Nick,

If you're serious about this, I've built tools for the publisher side for stopping exactly this.

My email address is in my profile.

Nick-Craver7y ago

I’m very interested and very serious. Email sent.

JeremyBanks7y ago

I just saw this post, where a potential justification was provided for a similar script in the past: https://meta.stackoverflow.com/questions/335956/adzerk-servi...

It's hard to read the obfuscated code and be sure what's being done with the browser environment information. This script seems to generate some hash and put in some global variables, presumably for some other script to consume. I don't know whether such scripts send it to a server, compare it locally to a previously-known value, or ignore it.
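To make "generate some hash" concrete, here is a minimal sketch of how a set of browser attributes could be reduced to one identifier. It is not the script under discussion; the attribute names and values are hypothetical, and SHA-256 over canonical JSON is an assumption chosen for illustration.

```python
import hashlib
import json


def fingerprint(attributes: dict) -> str:
    """Hash a dict of browser attributes into a stable hex identifier."""
    # Sorting keys makes the result independent of collection order.
    canonical = json.dumps(attributes, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Hypothetical attributes a script might collect.
visitor = {
    "user_agent": "Mozilla/5.0 (X11; Linux x86_64; rv:67.0) Gecko/20100101 Firefox/67.0",
    "screen": "1920x1080",
    "timezone": "UTC+2",
    "audio_context": "blocked",
}

print(fingerprint(visitor))
# Changing a single attribute yields an unrelated identifier.
print(fingerprint({**visitor, "audio_context": "available"}))
```

The hash alone says nothing about what happens next: the same value could be sent to a server, compared locally, or ignored.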

jf7y ago

I would pay for an ad-free version of Stack Overflow. Take my money, please.

minitoar7y ago

I think the data in aggregate is worth more than people like you would pay for an ad-free service.

pushedx7y ago

It looks like something using fingerprintjs2.

This library is very popular.

https://github.com/Valve/fingerprintjs2/blob/master/fingerpr...

detaro7y ago

Not sure how that plays with rules about how you can place ads etc, but <iframe> with a feature policy can stop access to audio I think.

IloveHN847y ago

Why don't you block all the JavaScript not coming from your origin and just display a simple link+PNG as advertising?

colek427y ago

This is exactly why I block third party advertisements for myself and everyone that uses my network.

ragerino7y ago

I hear from multiple people that they receive ads about topics they only talked about with friends but never entered into a search engine.

Google is currently about as far as it could be from its previous, world-famous "don't be evil" corporate culture.

Other examples include AMP, where Google wants to make it harder to de-individualise URLs. This is being driven to an extent where Chrome on Android makes it harder to edit the URL.

Or games like Ingress or PokemonGo, which in my opinion help Google constantly update their WiFi SSIDs-to-GPS-location database. This database is then used to track users' location through a little permission called "WiFi Control", which also cannot be found in the regular App Permissions settings entry.

To me, "WiFi Control" sounds nothing like location tracking. But I have to admit, I am not a native speaker, so I might be misunderstanding something.

tjpnz7y ago

"Don't be evil" was replaced by "Do the right thing" years ago. Great piece of corporate speak right there.

jackdh7y ago· 25 in thread

Has there been any serious thought / discussion about how the cat and mouse chase of the ads vs ad blockers is going to end?

It would be interesting to see where we are in ten years.

keithwinstein7y ago

There's a passage of Carl Sagan's "Contact" that's on point and interesting to read 34 years later. The billionaire who helps to decode the Message (from outer space) and ends up building the working copy of the Machine made his fortune by selling tools to detect and block ads from television.

There is some discussion of the technical cat-and-mouse game he has to play as advertisers try to make their content avoid detection and blend in with the regular programming. In this version of the future, the ad blockers eventually win and network television is destroyed. (The book also features networked computers and email ("telefax"), but the concept of ads appearing on them was still too futuristic for 1985.)

https://books.google.com/books?id=Q6o51-W_z8MC&lpg=PP1&dq=go...

Adnix and Preachnix were the essence of capitalist entrepreneurship, he argued repeatedly. The point of capitalism was supposed to be providing people with alternatives.

"Well, the _absence_ of advertising is an alternative, I told them. There are huge advertising budgets only when there's no difference between the products. If the products really were different, people would buy the one that's better. Advertising teaches people not to trust their judgment. Advertising teaches people to be stupid. A strong country needs smart people. So Adnix is patriotic. The manufacturers can use some of their advertising budgets to improve their products. The consumer will benefit. Magazines and newspapers and direct mail business will boom, and that'll ease the pain in the ad agencies. I don't see what the problem is."

Adnix, much more than the innumerable libel suits against the original commercial networks, led directly to their demise. For a while there was a small army of unemployed advertising executives...

rocky11387y ago

I feel that it may go the other way: that receiving communication from a source that is supported by ad revenue while knowingly and actively bypassing those same ads will be seen as theft. I fully expect lobbyists to push for this and see some success in the next 10 years.

yjftsjthsd-h7y ago

> In this version of the future, the ad blockers eventually win and network television is destroyed.

I love utopian visions of the future.

Gibbon17y ago

I think ad blocking is a misnomer. What people are trying to do when blocking ads is prevent marketing people from spying on them. And the performance and resource consumption that comes from that.

Personal opinion: Laws are needed to make what advertisers are doing illegal. Advertisers are spying on people to the extent where if the government did it they'd need a warrant.

pjc507y ago

I'm only mildly bothered by the tracking, since it seems so inaccurate, but the ads themselves always drive me to adblockers. Taboola were running pictures of rotten teeth for a while which was intolerable; Youtube ads are often louder than the videos.

yoz-y7y ago

I disagree. The tech crowd is using adblockers to prevent spying and resource consumption. But the majority of people running adblockers just don't want to see ads.

perl4ever7y ago

The sense of being spied on wasn't really what drove me to use an ad blocker. It was the fact that once or twice I got what appeared to be malicious code trying to take over my browser, go to pages I didn't want to go to, and prevent me from leaving, in order to promote some scam. I'm not in fact (even if it's naive) particularly scared of legitimate businesses or the CIA or whatever monitoring me.

crispinb7y ago

I'm doing both. I don't want to be spied on, and there is no room in any sector of my life for corporate propaganda.

hombre_fatal7y ago

Adsense is just going to start providing content for you to inline into your site.

Kind of like how https://old.reddit.com/r/gaming/ is just a sequence of ads being flawlessly delivered to an ad-averse demographic that eats the ads up.

usrusr7y ago

I find it outright puzzling that CDN edge servers have not morphed into ad splicers yet; that business seems so obvious to me. The closest to a "guessplanation" I can come up with for why this is not happening is that there might be trust issues (overreporting/underreporting impressions) in the triangle of publisher, ad network and CDN/ad splicer. But I'm not convinced at all that this would outweigh the anti-ad-blocker advantages.

eof7y ago

Seems obvious without thought to me that it’s mostly moot. Very few people will be running machines like we have for the last 30-40 years, most will be on Android/iOS where ad blocking will be minimal.

Savvy users will continue to block on machines that aren’t walled gardens and through pi-hole style blocking.

I think the cat and mouse aspect will be completely overshadowed by tech giants continually neutering their users' ability to block ads.

saagarjha7y ago

> most will be on Android/iOS where ad blocking will be minimal

Safari on iOS allows for content blocking, and Firefox for Android allows users to install extensions.

yellowapple7y ago

I'm hoping in 10 years the world will have figured out that allowing arbitrary Turing-complete code to automatically run on one's personal machine is a terrifically terrible idea, and that the World Wide Web will instead orient itself around something that doesn't make security and privacy extraordinarily difficult to achieve (whether that's still HTML/CSS or something entirely new).

At the very least, though, eventually advertising agencies will hopefully figure out that this sort of tracking is pointless; "newspaper-style" ads are more likely to actually engage with the people encountering those ads (since said ads would be selected based on the page content rather than the person reading that content). This is how DuckDuckGo's ads work; the sponsored results are selected entirely by the actual search query. If content-driven ads (plus affiliate links, but I somehow doubt that's enough of DDG's traffic to be a deciding factor here) are enough to pay for enough computational power (and the development team to run it) to serve up 30+ million queries a day, then there's no reason they can't be enough for any other site.
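A toy sketch of content-driven selection, assuming nothing more than keyword overlap between the page (or query) text and each ad's keyword set. The inventory and the scoring rule are hypothetical and are not a description of how DuckDuckGo or any ad network works; the point is only that the input is the content, with no visitor data.

```python
import re


def pick_ad(page_text, inventory):
    """Return the name of the ad whose keywords overlap the text most, or None."""
    words = set(re.findall(r"[a-z0-9]+", page_text.lower()))
    best_name, best_score = None, 0
    for name, keywords in inventory.items():
        score = len(words & keywords)
        if score > best_score:
            best_name, best_score = name, score
    return best_name


# Hypothetical inventory: ad name -> keywords the advertiser bought.
INVENTORY = {
    "postgres-hosting": {"postgres", "sql", "database"},
    "rust-course": {"rust", "borrow", "lifetime"},
}

print(pick_ad("How do I index a Postgres database for this SQL query?", INVENTORY))
print(pick_ad("Best pasta recipe", INVENTORY))
```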

laughinghan7y ago

With absolutely no disrespect intended, the hope that we'll forget about the WORA dream is delusional. WORA is inevitable and the Web, for all its flaws (and they are plentiful), is far and away the closest we've ever come. Even on mobile, which was a bit of a setback for the Web as WORA, JS has only been getting better over time. There's just no turning back the clock.

Security-wise, I think the best we can hope for is more and more OS-like sandboxing and isolation, capability-based security, and other defense-in-depth measures.

Privacy-wise, for defeating tracking and the like, ideally I'd hope for technical countermeasures to win the battle, but if we do end up having to rely on legal measures, they have my full support, GDPR and CCPA included.

(Random idea for a technical countermeasure against fingerprinting: have you heard of those projects trying to defeat behavioral tracking where, whenever you visit a page, it simultaneously opens a bunch of other random pages in the background, hidden from you, and simulates activity on them, the idea being that Facebook has no idea what actual websites you like to visit because it's lost in the noise? What if instead, whenever you visit a page, your browser or a plugin or a proxy or whatever opened the same page simultaneously in a bunch of hidden background windows, with a random configuration of audio enabled/disabled, user agent, screen resolution etc fingerprinted characteristics?)

dorgo7y ago

> Turing-complete code

You can't build apps without turing complete code. We would be back to downloading and executing applications/programs.

kodablah7y ago

Desktop-wise, I've often thought [evergreen] client-side tools would emerge for content extraction via local [headless] browser automation. It's something I've contemplated building myself.

dillonmckay7y ago

Like gopher?

https://en.m.wikipedia.org/wiki/Gopher_(protocol)

yoz-y7y ago

There is Weboob which might interest you http://weboob.org

tunesmith7y ago

Is there an ad blocker that intercepts your profile (the data that would normally be sent to the ad company), lets you edit/alter it, and allows the resultant profile to be sent to the ad company? As a consumer, I prefer relevant ads to irrelevant ads, and I might even prefer very relevant ads to no ads, but I don't want the ad companies to know stuff about me that isn't okay with me.

rodgerd7y ago

Google control most mobile OSes, almost all of the web browser market, and have more or less taken over the web standards process.

They've already won.

dorgo7y ago

Tracking is more than just ads. A website owner wants to know who his visitors are, where they come from, and which devices they use. Maybe he can support another language, optimize for other devices, or offer deals to a group of customers. But he doesn't want to risk being fined under GDPR, so he skips all this. Less optimisation, fewer/worse contacts: everybody loses.

jimktrains27y ago

I don't think most people have an issue with a single site tracking usage on said site. (Also, much of that can be obtained with server logs.)

The issue is cross-domain tracking, as we see with ad networks that profile you over many different sites.
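A minimal sketch of the server-log route, assuming access logs in the common "combined" format where each line ends with the quoted referrer and user agent. The sample lines and addresses are made up for illustration.

```python
import re
from collections import Counter
from urllib.parse import urlparse

# Matches the trailing "referrer" "user-agent" pair of a combined-format line.
TAIL = re.compile(r'"(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"\s*$')


def summarize(log_lines):
    """Count referrer hosts and user agents from access-log lines."""
    referrers, agents = Counter(), Counter()
    for line in log_lines:
        match = TAIL.search(line)
        if match is None:
            continue  # not in the assumed format; skip rather than guess
        host = urlparse(match.group("referrer")).netloc or "(direct)"
        referrers[host] += 1
        agents[match.group("agent")] += 1
    return referrers, agents


FIREFOX = "Mozilla/5.0 (X11; Linux x86_64; rv:67.0) Gecko/20100101 Firefox/67.0"
IPHONE = "Mozilla/5.0 (iPhone; CPU iPhone OS 12_3 like Mac OS X)"
SAMPLE_LOG = [
    f'203.0.113.7 - - [27/Jun/2019:10:00:00 +0000] "GET /questions/1 HTTP/1.1" 200 512 "https://www.google.com/" "{FIREFOX}"',
    f'203.0.113.8 - - [27/Jun/2019:10:00:05 +0000] "GET /questions/2 HTTP/1.1" 200 734 "https://www.google.com/search?q=x" "{IPHONE}"',
    f'203.0.113.9 - - [27/Jun/2019:10:00:09 +0000] "GET / HTTP/1.1" 200 128 "-" "{FIREFOX}"',
]

referrers, agents = summarize(SAMPLE_LOG)
print(referrers.most_common())
print(agents.most_common())
```

Everything here stays on the site's own server, which is the distinction being drawn from cross-domain profiling.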

lifeeeeee7y ago

Neural networks scan the final rendered image of the page for ads and remove them. You can't dodge that.

ceejayoz7y ago

Sure you can, the same way TV shows have done it - by subtly incorporating it into the text of the article.

penagwin7y ago

It will always be a game of cat and mouse. Even with Neural networks involved :D.

[0] https://arxiv.org/abs/1412.1897 [1] https://arxiv.org/abs/1710.08864

dabeeeenster7y ago· 11 in thread

"It's not very straightforward to find where it's coming from, but we are working on it."

This encapsulates the entire problem with the current state of digital advertising in 1 simple sentence.

ehnto7y ago

It's amazing to me that an advert can run arbitrary javascript at all. It wasn't long ago that this would have been seen as a massive security risk by site owners and ad platforms alike. I'll give them the benefit of the doubt and assume that perhaps it's a structured service provided by the ad network, and the ad buyer just checks a box. But as a site owner even that would be too much for me.

I've been playing around with ideas about more ethical analytics and advertising, and I think they're pretty easily built platforms. But the question is are they marketable? Would GloboCorp and MarketingCo give up the ability to track consumers so closely in favor of a more ethical approach, or has it been too valuable for them to give up?

dTal7y ago

>It's amazing to me that an advert can run arbitrary javascript at all.

I'm not fully up to date with how these things are usually set up - is there anything in the web security model that prevents "ads" from exfiltrating arbitrary information from any page that they're on? Could an ad read my keystrokes, or scrape private messages?

vbsteven7y ago

It’s not the internet we want but it is the one we have. In the end it is your own responsibility for which code gets executed on your machine.

I run NoScript with all JS blocked by default and only whitelist the domains I want. On most sites you can get by with no JS or just scripts from the same domain whitelisted. Maybe a CDN from time to time

keyle7y ago

But you know, we wouldn't stop serving ads until we work it out... no no imagine the loss in revenues.

JohnBooty7y ago

They have to pay a bunch of engineers and pay for a bunch of servers to keep things running.

And they certainly do contribute a massive amount of value to our community -- and as far as I can tell, they've always tried their very best to be good folks.

I'm not going to tell you how to think, but they have built up a lot of trust and goodwill in my book over the past decade.

I believe them when they say they'll work hard to do the right thing.

oconnor6637y ago

Easy to say when it's not my loss in revenues.

PopeDotNinja7y ago

Or you could just not use Stack Overflow.

craftinator7y ago

Let's be adults here. This is SO, and I imagine you've used and enjoyed the use of their services just like the rest of us. Support them by letting passive ads sit on the edges of the page, and appreciate that they are actually trying to solve this issue.

manigandham7y ago

It is straightforward, but not for publishers. The adexchanges know, and do business with shady companies on both sides because they make money from volume without any consequences.

dang7y ago

We detached this subthread from https://news.ycombinator.com/item?id=20289590.

m4637y ago

It's like targeted advertising with a flash suppressor.

kylegordon7y ago· 10 in thread

And this is why, even with the best intentions of site operators, my browser will continue to use the best ad-block tools I can get, and my networks will be protected by tools like PiHole.

MRD857y ago

In the 2005 era when I was a young video gamer I used to play World of Warcraft. There was a site, Thottbot, that players would use to find out information about things in game. I picked up a keylogger malware from their adservers. One of the advertisers had been hacked and was serving Malware every few thousand ads. Since that day I've used an adblocker and I'll always continue to do so.

mrosett7y ago

I wonder if that’s how I got hacked....

andrenotgiant7y ago

Exactly. Market solutions for market problems. I'd love to see the Raspberry PI foundation develop and sell a home router with PIHole for regular consumer use.

Considering the alternatives, that sounds really appealing for me. I'd also buy it for my less tech-literate parents.

hannasanarion7y ago

You can't profit your way out of a problem you profited yourself into. There will never be enough people setting up PiHoles to offset the value of spying, and it's publishing platforms like StackOverflow that suffer.

alecco7y ago

That's not enough, sadly.

Besides disabling JavaScript you can put hosts file blocklists.

Simple corporation block list (e.g. Facebook, Google) https://github.com/jmdugan/blocklists/tree/master/corporatio...

"Someone Who Cares" list http://someonewhocares.org/hosts/

Ultimate Hosts Blacklist: 1 million blocked domains (once in a while you might need to unblock something) and also a bonus known hacking IP blocklist. https://github.com/mitchellkrogza/Ultimate.Hosts.Blacklist
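For readers unfamiliar with the format, here is a minimal sketch of how a hosts-style blocklist can be read and queried. It assumes the usual layout of a sink address (`0.0.0.0` or `127.0.0.1`) followed by one or more hostnames, with `#` comments; the sample entries are hypothetical and not taken from the lists above.

```python
SINK_ADDRESSES = {"0.0.0.0", "127.0.0.1"}


def parse_hosts(text):
    """Return the set of hostnames that a hosts-format blocklist sinks."""
    blocked = set()
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        parts = line.split()
        if len(parts) >= 2 and parts[0] in SINK_ADDRESSES:
            blocked.update(h.lower() for h in parts[1:] if h != "localhost")
    return blocked


def is_blocked(hostname, blocked):
    """Hosts files match exact hostnames only; there are no wildcards."""
    return hostname.lower() in blocked


SAMPLE = """
# hypothetical entries
127.0.0.1 localhost
0.0.0.0 ads.example.com tracker.example.net  # two names on one line
0.0.0.0 Metrics.Example.org
"""

blocked = parse_hosts(SAMPLE)
print(sorted(blocked))
print(is_blocked("ads.example.com", blocked), is_blocked("cdn.ads.example.com", blocked))
```

The exact-match behaviour is why these lists grow so large: every subdomain needs its own entry.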

jimmaswell7y ago

This seems melodramatic for something as trivial as an audio request.

mikeash7y ago

It’s incredibly disrespectful. Nobody wants some random ad listening to their microphone. That they’re trying it anyway indicates that they’re hoping to get some people with browsers that don’t block it, or trick some people into saying yes.

It’s not harmful, as long as you’re not one of the people who gets tricked. But it does indicate that they want to do you harm, and try to. That they failed doesn’t make it all better.

yifanl7y ago

Arbitrary code execution isn't really that trivial.

mort967y ago

Did you click the link..? It's fingerprinting, not just trying to legitimately play audio.

luckylion7y ago

It's not just audio requests btw. Google regularly serves ads that automatically redirect users to scam sites. A client I work with gets hit with that about every six months. The ads are targeting only mobile users which makes it even harder to debug. There's nothing a publisher can do to prevent this but disabling Google Adsense completely. There is no support from Google. After a few days, a week maybe, Google disables the malicious ad (or they see that the credit card didn't work) and it stops.

superasn7y ago· 7 in thread

Maybe it's to identify users behind a VPN as this is fingerprinting the device, not the connection.

That's why I think the idea of running each site in a container is so effective.

And while we're at it the container should just spit out random shit like different resolution, audio api, user agent, once in a while (unless the user turns it off) to thwart such attempts.

Unfortunately, when the creator and maintainer of 67% of all browsers is an ad company that is exploiting this in the first place, there is no chance that this could happen.

apetresc7y ago

> And while we're at it the container should just spit out random shit like different resolution, audio api, user agent, once in a while (unless the user turns it off) to thwart such attempts.

Wouldn't that break the legitimate feature-detection uses for these APIs? Asking the user to identify and whitelist each call is impractical, especially since the fail-case in this scenario would be subtle (you'd still see the page but it might randomly be in the wrong mode, or images might be scaled incorrectly, etc). At that point you might as well just turn Javascript off.

superasn7y ago

Yes, I thought about it; that's why I added the "unless the user turns it off" comment in parens. I think that out of the 100 sites I visit every day, no website needs to access the audio API without my consent, except maybe one or two, which I can whitelist. Same for the user agent: I don't think a site should break if the container says I'm running Firefox v65 or v67, etc.

laughinghan7y ago

Have you heard of those projects trying to defeat behavioral tracking where, whenever you visit a page, it simultaneously opens a bunch of other random pages in the background, hidden from you, and simulates activity on them, the idea being that Facebook has no idea what actual websites you like to visit because it's lost in the noise? What if instead, whenever you visit a page, your browser or a plugin or a proxy server or whatever opened the same page simultaneously in a bunch of hidden background windows, with a random configuration of audio enabled/disabled, user agent, screen resolution etc fingerprinted characteristics?

That way, the page displays correctly for you, but the server has no idea your actual fingerprint.

There's some trickiness to get this to work right; the collection of fake fingerprints would have to have a certain amount of persistence, because if it was regenerated every pageload, the server could probably tell that only one fingerprint kept showing up repeatedly. Maybe each fake fingerprint should have a completely realistic-seeming browsing session, happening in parallel with your real one, with half the collection continuing on browsing even after you're done? Except wait, ads could just separately target every fingerprint, and it doesn't matter if 99% of them are fake as long as its accuracy for your real one is still good. To defeat that you need the randomized activity using your real fingerprint.

The ideal would be if this was done through a proxy server, which would then know every fingerprint ever sent to a website. It could then provide you with a random collection of past fingerprints that have actually visited the same website, so every visitor gets a collection of fingerprints randomly drawn from the same "bag", rendering visitors indistinguishable.
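A rough sketch of the "bag" idea, assuming a proxy that records every fingerprint it has forwarded to a site and hands each new visitor a random sample from that same pool. The class name, the fingerprint fields and the sampling rule are all hypothetical; the persistence and realism problems described above are not addressed.

```python
import random


class FingerprintBag:
    """Per-site pool of previously seen fingerprints, sampled at random."""

    def __init__(self, seed=None):
        self._seen = {}                   # site -> list of fingerprint dicts
        self._rng = random.Random(seed)   # seedable so runs can be reproduced

    def record(self, site, fingerprint):
        """Remember a fingerprint that was actually sent to this site."""
        self._seen.setdefault(site, []).append(fingerprint)

    def draw(self, site, count):
        """Return up to `count` distinct past fingerprints for this site."""
        pool = self._seen.get(site, [])
        return self._rng.sample(pool, min(count, len(pool)))


bag = FingerprintBag(seed=1)
for resolution in ("1920x1080", "1366x768", "2560x1440", "1280x720"):
    bag.record("example.com", {"screen": resolution, "audio": resolution != "1366x768"})

decoys = bag.draw("example.com", 3)
print(decoys)
```

Because every visitor draws from the same pool, the site sees the same population of fingerprints regardless of who is actually browsing.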

vokep7y ago

Maybe, but that seems better than the current mess. I'd rather no features than features which act against my interests.

patrick54157y ago

I’d prefer legitimate api usage be broken than suffer through all the abuses.

nerdponx7y ago

That, and it should be pretty easy to filter out this kind of fake "chaff" data.

m4637y ago

everyone should be running the same container though.

johnwheeler7y ago· 7 in thread

I wonder if the top brass at alphabet ever worry that their trillion dollar empire is based on fragile foundations like web audio fingerprinting, etc.

that sure would keep me up at night.

obviously, i know google does more, but it seems like a large chunk of their revenue must be dependent on shady technical tricks like these working.

colinbartlett7y ago

They realized it was a risk so they built their own browser to have more control. And it worked. Only now, users are wising up and moving to Firefox.

H8crilA7y ago

No meaningful fraction of users is moving to Firefox [1] [2]. I wish this was the case, but it sure is not.

[1] https://en.m.wikipedia.org/wiki/Usage_share_of_web_browsers#...

[2] https://netmarketshare.com/browser-market-share.aspx

gdw27y ago

Is firefox less fingerprintable?

la_barba7y ago

They don't have to worry because they control the browser too, and so those tricks will continue to work for the foreseeable future.

wang_li7y ago

Which is why one of several things should happen. The first option is that there be legal requirements to adhere to public standards if you are a content distributor. And that any standards compliant client side software be allowed to use the service. In some areas we're in a world where your telephone carrier sells you your telephone, e.g. twitter, apple's imessage, etc.

Other options would be that if you are a content distribution company, e.g. youtube, google, facebook, twitter, instagram, etc. then you cannot have any control of the client side applications that consume the content. Trustbusting would come into play here.

Or legal obligations to follow a user's desire not to be tracked with real criminal fines and jail time applied to executives, managers, and developers who failed to follow the law.

frogpelt7y ago

They have a pretty big moat around their business.

If you want to buy advertising online you're probably gonna end up dealing with them either directly or indirectly.

xhgdvjky7y ago

the internet is already a shady hack that only usually works

ndiscussion7y ago· 4 in thread

How We Make Money at Stack Overflow: 2016 Edition: Quality ads. "...we don’t want to use an automated system that selects some ads for us. We looked at this. It didn’t allow us the control we required to maintain the level of quality we want to maintain."

How We Make Money at Stack Overflow: 2019 Edition: Taking money from Microsoft and Google fingerprinting our users 100+ ways

source: https://stackoverflow.blog/2016/11/15/how-we-make-money-at-s...

rsj_hn7y ago

Your options, as I see them.

1. Text based ads only (no third party js)

2. HTML based ads but no js (run it through DOMPurify https://github.com/cure53/DOMPurify)

3. Look for a js sandbox -- this _will_ break arbitrary js, will not be supported in all browsers, and will require dev work on your side:

  * Google Caja  https://github.com/google/caja

  * MentalJS  https://github.com/hackvertor/MentalJS

other options are available as well, in varying levels of maturity and support.

I think using a sandbox iframe is not going to be able to defeat browser fingerprinting, because the sandbox control options are not rich enough. You would need to block all JS.

lostmsu7y ago

> HTML based ads but no js (run it through DOMPurify https://github.com/cure53/DOMPurify)

Or use iframe.sandbox, which was designed for it. https://www.w3schools.com/tags/att_iframe_sandbox.asp
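A minimal sketch of that approach, assuming the publisher receives the ad as an HTML fragment and embeds it via `srcdoc`. An empty `sandbox` attribute applies every restriction, including blocking script execution, which corresponds to the "block all JS" case mentioned in the parent comment. The helper name and the ad markup are hypothetical.

```python
from html import escape


def sandboxed_ad(ad_html: str) -> str:
    """Wrap ad markup in an iframe with all sandbox restrictions applied."""
    # Escaping keeps the markup inside the srcdoc attribute value;
    # the browser decodes it again when it renders the frame.
    return f'<iframe sandbox="" srcdoc="{escape(ad_html, quote=True)}"></iframe>'


# Hypothetical ad markup that tries to bring its own script along.
AD = '<a href="https://advertiser.example/"><img src="banner.png"></a><script>new AudioContext()</script>'

print(sandboxed_ad(AD))
```

The script tag is still delivered but should not run inside the frame; stripping it entirely would be the job of a sanitizer such as DOMPurify.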

akavel7y ago

4. Images! Why would they need anything else? Why would they need JS?

baroffoos7y ago

There are plenty of ad networks that do not allow advertisers to run JS. You have to run the ad networks script but that's the only one.

inglor7y ago· 4 in thread

Why is this surprising to anyone? It is clear that ads use tracking mechanisms and cookies and this is no different.

Audio feature detection isn't even a novel technique.

I've seen trackers look at download stream patterns to detect whether or not BBR congestion control is used, I have seen mouse latency measured from the difference between mouse ups and downs in double clicks, and I have seen speed-of-interaction checks in mouse movements.

Just checking for the constructor of something an ad might legitimately use (like audio) is relatively benign, to be honest. It is naive to expect ads not to do this, and it is why I use an ad blocker even on sites without annoying ads.

teraflop7y ago

One reason it's surprising is that, until recently, SO was particularly resistant to allowing invasive and/or obnoxious ads.

See also the recent decision to allow animated banner ads on various Stack Exchange network sites.

ehsankia7y ago

But for code that's supposed to be so smart in trying to fingerprint people without them knowing, calling an API that throws a warning in the browser seems like a really stupid move. Especially since that can be checked through feature detection, which is literally what this code is doing...

inglor7y ago

And as a fun fact, network timing fingerprinting attacks work even if you don't have JavaScript enabled, and I have been able to make a PoC that was very accurate (I did not release it, but I did disclose some bits to relevant parties).

saagarjha7y ago

I hope "relevant parties" includes "browser vendors" and not "adtech companies" :)

ReedJessen7y ago· 4 in thread

Is this a scandal?

ProAm7y ago

No, but SO has always prided themselves on reasonable, pro-consumer, safe advertisements. So the fact that SO is allowing this speaks to them a little. It's unlikely they know what's happening, but it's still a little gross.

kapep7y ago

I don't think it's a scandal. It's not new or surprising that ads use tracking techniques like this. Stack Exchange recently announced [1] that they will use ad networks as an experiment. That announcement was quite unpopular and met with resistance and pleas to allow only static images to avoid annoying ads as well as sophisticated tracking. So this is no scandal, since they were open about it and knew the risks. It seems they ignored the community, though, so they probably lost some of the community's trust. I wonder if they will take action and stop the experiment.

[1]: https://meta.stackexchange.com/questions/329763/were-testing...

dymk7y ago

It's 2019, everything is a scandal

dRaBoQ7y ago

And everyone is outraged.

iamnotacrook7y ago· 4 in thread

It's ok. SO's policy on abusive ads is to mention it on meta and hope a moderator notices and then acts upon it.

gortok7y ago

As a community elected moderator (https://stackoverflow.com/users/16587 ) I can tell you with certainty that moderators have no control over ads; only the development team (and maybe the community team) does. In this case we would do the same thing the OP did; in addition, we would reach out in Stack Overflow chat to the community team to inform them of the situation.

iamnotacrook7y ago

Well perhaps you should get your story straight because on this page:

https://meta.stackexchange.com/questions/329763/were-testing...

which is being prominently announced in a yellow "featured on Meta" box you can read:

"If you see any ads that are inappropriate or have any questions about this experiment, please let me know by starting a new question and tagging it with advertising"

and

"If you wish to report an advertisement, please take a screenshot of the ad and paste the URL (if possible) along with the site where you saw it to a comment or answer. I'll report it to the ads team and we can track it down to investigate."

Screenshots? Start a new question with a tag? Track it down? Shouldn't you cut to the chase and have a "report this ad" button built-in so you can immediately be alerted to malware/abusive/inappropriate ads? Perhaps it's not moderators who have the power here. As a non-moderator/employee I couldn't care less what you call the people who do it; it seems entirely inadequate. Run the ads now and if enough people complain or it gets embarrassing - like google and/or microsoft spying on users - then publish a theatrical apology. No, that doesn't work for me.

No, my ad-blocker is never coming off.

TheOtherHobbes7y ago

Ironic that content moderation is annoyingly aggressive, but ad moderation is annoyingly permissive.

fredsanford7y ago

>> Ironic that content moderation is annoyingly aggressive, but ad moderation is annoyingly permissive.

It's not ironic at all if you think about it a bit.

>>annoyingly aggressive,

Volunteer labor from nerds who expect you to match their idea of perfection

>> ad moderation is annoyingly permissive

Done by employees so it costs SO money.

EGreg7y ago· 4 in thread

I don’t get how it can get the fingerprint to be so unique as to attribute ads. Most mobile browsers are exactly the same, you have the same screen resolution and so on. And most desktop browsers when maximized are the same resolution. I mean there must be groups of thousands of users for each combination of fingerprinted features. So it’s not all the way down to the person, right? It’s just correlations?

jupp0r7y ago

You can try out https://panopticlick.eff.org/, which estimates the entropy of information they can extract from your browser. For me it's ~18 bits, which isn't great but probably enough to infer who I am if tracked across multiple sites. They don't even use the more exotic things like audio devices/codecs mentioned in the stack overflow question.
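To put the "~18 bits" figure in perspective, a rough back-of-the-envelope sketch: each bit of identifying information halves the crowd you blend into. The population number below is an assumption picked for illustration, not something Panopticlick reports.

```python
import math


def surprisal_bits(share_of_users: float) -> float:
    """Bits of identifying information in a trait seen in this share of users."""
    return -math.log2(share_of_users)


def anonymity_set(bits: float, population: int) -> float:
    """Expected number of users in `population` who share a fingerprint
    carrying `bits` bits of information."""
    return population / 2 ** bits


# 18 bits means roughly 1 browser in 2**18 = 262,144 looks like yours.
print(2 ** 18)
# Among a hypothetical one billion browsers, that leaves a crowd of a few thousand.
print(round(anonymity_set(18, 1_000_000_000)))
```

If additional signals (audio stack, codecs, and so on) were independent of the ones already measured, their bits would simply add, and each extra bit would cut that crowd in half again. Real signals are correlated, so treat this as an upper bound on how fast the crowd shrinks.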

hyperpape7y ago

https://amiunique.org/fp gives a unique fingerprint for both my Mac and my iPhone. To be honest, I don't know how they manage to fingerprint the iPhone, but they claim it's a unique fingerprint.

appleiigs7y ago

No, it's not all the way down to the person. Yes, it's just correlations. Even if the fingerprint was so unique and it went down to 1 user, it wouldn't be able to actually identify that person's name etc.

The most likely use-case here is ad fraud detection anyway.

Cpoll7y ago

> The most likely use-case here

I'm not so sure. There's a lot of market value in knowing that User 2341423 went to Site A, then Site B, then bought this item, etc.

meerita7y ago· 4 in thread

Has anyone checked how much of our data plan goes to advertising? I bet it's 30%-40%.

chance_state7y ago

I have been using uBlock Origin for about three years and I browse the web heavily (4-6 hours/day). In that time it has blocked 13% of requests (10% on mobile).

I don't have enough info to quantify the amount of data blocked though.

gorhill7y ago

It's often the case that what is blocked prevented more scripts from being pulled, which could in turn pull even more scripts, and so on. Those subsequent waves of scripts are not counted as blocked because they never had a chance to be pulled by the first wave of blocked scripts.

I have a tweet in my timeline which illustrates this: https://twitter.com/gorhill/status/934474012377444352
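A toy simulation of that undercounting effect, in case it helps. The load graph and script names are entirely hypothetical; the point is only that a blocker's counter sees the request it stopped, not the requests that never happened as a result.

```python
from collections import deque

# Hypothetical load graph: each entry maps a resource to the scripts it would request.
LOADS = {
    "page": ["site.js", "adloader.js"],
    "adloader.js": ["bidder.js", "tracker.js"],
    "bidder.js": ["creative.js"],
    "tracker.js": ["fingerprint.js"],
}


def simulate(root, blocked):
    """Return (fetched, blocked_count) for a breadth-first load starting at root."""
    fetched, blocked_count = [], 0
    queue = deque(LOADS.get(root, []))
    while queue:
        script = queue.popleft()
        if script in blocked:
            blocked_count += 1  # this is all the blocker's counter ever sees
            continue            # the script's children are never requested
        fetched.append(script)
        queue.extend(LOADS.get(script, []))
    return fetched, blocked_count


without_blocker, _ = simulate("page", set())
with_blocker, counted = simulate("page", {"adloader.js"})
print(len(without_blocker), len(with_blocker), counted)
```

In this made-up graph the blocker reports one blocked request, but five of the six scripts that would otherwise have loaded never arrive, so a "13% of requests blocked" statistic can understate the traffic actually avoided.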

meerita7y ago

I tried to check my uBlock stats, but it didn't work well. It seems it doesn't fully work on Firefox.

meerita7y ago

10% on mobile isn't that bad. It's quite a lot!

helloworm7y ago· 3 in thread

Has anyone made a plugin that does a DOS on each ad server(s) detected? Then, we have built-in DDOS on the ad servers, if enough users install it.

anfilt7y ago

While the idea is cute you do realize that would have criminal repercussions for people who install said plugin in certain countries.

progval7y ago

Not exactly a DoS, but there's a browser extension designed to click on all ads and blur the signal: Ad Nauseam

icebraining7y ago

Good luck getting that past the Chrome Web Store censors. I doubt even Mozilla would accept it.

synthmeat7y ago· 3 in thread

It's most likely for web scraper detection. State of the art was using video codec availability as a fairly reliable data point, and I haven't seen audio being used for this. Quite interesting.

jupp0r7y ago

What makes you think it would be for web scraper detection vs user fingerprinting?

yjftsjthsd-h7y ago

Is there a difference? I mean, slightly different ends, but both very much benefit from fingerprinting.

synthmeat7y ago

Because they had a lot of trouble with sham sites generated by their content.

jasonjayr7y ago· 2 in thread

Why can't Google come up with an AMP for ads? That will transpile a restricted javascript (or whatever) into a runtime that just doesn't do these things?

This would get rid of the greasy ads, and Google could focus on making tools that allow site owners to filter by "features used in ad", and ad developers could actually return to delivering ads, rather than collecting fingerprints?

progval7y ago

> That will transpile a restricted javascript (or whatever) into a runtime that just doesn't do these things?

They already invented that: https://github.com/google/caja

"Caja uses an object-capability security model to allow for a wide range of flexible security policies, so that your website can effectively control what embedded third party code can do with user data."

skoocda7y ago

Here you go:

https://amp.dev/documentation/guides-and-tutorials/learn/int...

z3t47y ago· 2 in thread

I guess it's part of Google Ads's endless battle against "robot" clicks. A site as big as SO should not use Google Ads, but instead use their own ad service. Just make an automated system where people can sign up and show an ad. Make it cost $1 per 100 page views. That would probably earn SO two orders of magnitude more than they get from Google Ads.

cameronbrown7y ago

> $1 per 100 page views

Eh, that's like 10x average CPM nowadays. And advertisers usually are paying per click, not impression.

z3t47y ago

As an advertiser, yes, but on Google Ads you know that 90% of those will be fake ¹. And as a publisher on Google Ads you only get something like $1 per 10,000 impressions ². Advertising directly on SO, you know all views are not only legit but also targeted at developers, so I think advertisers are willing to pay more. While most advertisers are paying per click, the whales only care about impressions, not clicks (TV commercials).

1) Measured by analyzing the traffic I got from Google Ads

2) That's what I get from Google Ads as a publisher, but you used to get a lot more in the epoch, like $5-10 CPM
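The numbers in this sub-thread are easier to compare once everything is converted to CPM (price per 1,000 impressions). A quick sketch using only the figures quoted above; the helper name is made up.

```python
def cpm(price_dollars: float, impressions: int) -> float:
    """Cost per mille: what 1,000 impressions cost at the given rate."""
    return price_dollars * 1000 / impressions


proposed = cpm(1.00, 100)      # "$1 per 100 page views"
received = cpm(1.00, 10_000)   # "$1 per 10,000 impressions" as a Google Ads publisher
print(proposed, received, proposed / received)
```

So the proposed direct rate works out to a $10 CPM against a reported $0.10 CPM, which is where the "two orders of magnitude" comes from. That multiple only holds if all the inventory actually sells at the higher price, which is the part being disputed in the reply about average CPMs.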

amadeusw7y ago· 2 in thread

Does Microsoft (ad owner) or Google (ad provider) perform the fingerprinting in this case?

dymk7y ago

Google

dudus7y ago

It seems that the specific script comes from https://integralads.com/ as stated by another commentator. I think the blame is to be shared here.

integralads is guilty of developing and selling this technology. Microsoft is guilty of buying it and using it. Google is guilty of serving it. And why not: StackOverflow is also guilty of offering that space to advertisers without enough vetting of their ads.

After reading about integralads I'm not even sure if the purpose is to fingerprint, it seems to be more targeted towards detecting fraud, which does not require fingerprinting necessarily.

My point is that it's not as easy as pointing to one company and blaming them. This is a problem that concerns anyone in the ad space.

JimBrimble357y ago· 2 in thread

Aside from the obvious usability benefits, this kind of thing makes it abundantly clear why much of the web has gone to javascript dependent SPAs. If you need JS to run the site, then you also have to leave it on to be tracked/fingerprinted.

Kind of makes sense why companies like Google and Facebook have invested so much in creating open-source front-end frameworks. The ROI is probably phenomenal.

I get that stackoverflow isn't an SPA, it just made me think of this point.

Side-note: you can block JS on stackoverflow and still view answers. That works for 98% of my usecase for the site.

__jal7y ago

> If you need JS to run the site

... Then I move on. Those dorky little crapware widgets are basically never worth looking at in any case, and I do take that sort of strategic tooling decision as a signal that I probably don't want to accept the 'bargain' being offered.

JimBrimble357y ago

That's fair. My point is that in many cases (a rapidly growing number of cases), the entire site is JS. If you need the service, then you have to accept the tracking.

sergiotapia7y ago· 2 in thread

Is there something I can use to randomly fuzz every tab individually as I browse the web?

They can track me through websites and I don't want that. Already using ublock origin.

fimdomeio7y ago

Not exactly what you asked for but got this from mozilla today: https://blog.mozilla.org/firefox/hey-advertisers-track-this/

kevin_thibedeau7y ago

NoScript + Decentraleyes + Random user agent + Self-destructing cookies.

rkagerer7y ago· 2 in thread

TLDR: A case of invasive fingerprinting triggered by a Microsoft ad delivered by Google.

rkagerer7y ago

Are all fingerprinting techniques used in the wild pretty generally well-known? Do any browsers have an option to blindly return a standard set of values regardless of actual client capabilities/metrics? (i.e. make it difficult to achieve more granular results than browser agent).

I know Mozilla made an anti-fingerprinting announcement recently but IIRC all it does is check scripts against a blacklist: https://blog.mozilla.org/futurereleases/2019/04/09/protectio...

sfink7y ago

There's an option in Firefox, yes. privacy.resistFingerprinting or something, you can search for it. It tends to break a number of sites, iiuc.

atoav7y ago· 1 in thread

I don’t get the modern ad stuff, any reasonable person uses an adblocker anyway, because ads are often slow, problematic in terms of privacy and security.

The fact that even people at a big site like Stack Overflow don’t instantly know where it comes from is only further proof that using an adblocker is a reasonable decision.

Maybe it is naive, but in my eyes all an ad should be is a picture and something that counts the page views. And when you are a site that has ads as its main income, you should have at minimum one employee who knows and tests each ad before it gets accepted and put onto your server.

Only then your customers will trust the ads you use and only then any reasonable person can even consider deactivating the adblocker for your site.

I am pretty sure somebody explored this idea before me, why doesn’t it work?

bongobongo7y ago

It works, it just won’t happen because all the structural incentives point to the status quo. Another reason to love our current crop of monopolists...

captn3m07y ago· 1 in thread

A little bit of corporate newspeak (and digging):

Ad URL: https://static.adsafeprotected.com/sca.17.4.95.js

JS Domain: adsafeprotected.com

Domain Owner: Integral Ad Science, Inc[0]

Google's recent stance on the matter of fingerprinting[2]:

>Chrome also announced that it will more aggressively restrict fingerprinting across the web. When a user opts out of third-party tracking, that choice is not an invitation for companies to work around this preference using methods like fingerprinting, which is an opaque tracking technique. Google doesn’t use fingerprinting for ads personalization because it doesn't allow reasonable user control and transparency. Nor do we let others bring fingerprinting data into our advertising products.

The important part being: _Nor do we let others bring fingerprinting data into our advertising products._

The same company advertises their fingerprinting capabilities:

>Browser and Device Analysis: We analyze the technological fingerprints of browsers and devices in order to uncover bots fraudulently posing as human users. We can validate what type of mobile or desktop device a browser is running on, providing additional context with which to identify fraud.

And it is this fingerprinting that gets them selected as a Google Brand Safety and Viewability Preferred Measurement Partner[1]

>New York, NY – Integral Ad Science (IAS) has been selected as a preferred partner in Google’s Measurement Program for both brand safety and viewability. Partners were selected after meeting rigorous standards for accuracy and using reliable methodologies to measure KPIs that matter for marketers. The program is designed to make it easier for advertisers to source trusted, third-party measurement providers.

The gist of it being that Google has heavy cognitive dissonance, with their advertising wing rewarding partners that fingerprint users (against their own policies), and the Chrome team barely managing to introduce some anti-fingerprint measures, which are clearly not enough.

[0]: https://integralads.com/capabilities/ad-fraud/

[1]: https://integralads.com/news/google-selects-ias-brand-safety...

[2]: https://blog.google/products/ads/transparency-choice-and-con...

pdkl957y ago

> Google has heavy cognitive dissonance

Perhaps, but I think some of that behavior only appears dissonant. Like the NSA, Google often uses carefully constructed language that is designed to sound like a statement about a topic of concern without saying anything actually useful. For example:

> Google doesn’t use fingerprinting for ads personalization

The only reason to add "...for ads personalization" is if they are using fingerprinting for other purposes. This could include other ad-related purposes like attribution.

Google's claims about not using specific data for a specific purpose are probably true. They simply fingerprint (and probably correlate) everything else.

kabwj7y ago· 1 in thread

If you don’t use an ad blocker you should expect your browser to behave in strange ways.

If you don’t use an ad blocker you should consider your computer compromised.

penagwin7y ago

It's been known that ads are commonly used to spread viruses / invasive tracking for years. And I've used adblock for almost 10 years!

Honestly, how are ads still allowed to execute javascript at all?! I get it if the ad-manager still executed javascript, but how is it okay to let random 3rd parties run js on your website?

pnw_hazor7y ago· 1 in thread

Programmers make these tools. When challenging said programmers who work for companies that promote this kind of behavior (G) they suggest that they work for these evil companies because their job is interesting and it pays well.

This practice could stop tomorrow if the best and brightest of us decided so.

luckylion7y ago

I doubt that. If "the best and the brightest" wouldn't do it, the second best and second brightest would be asked. At some point, somebody will do it. Also, isn't Google already selecting for moral flexibility? I find it hard to believe that a principled developer would start at Google, much like a pacifist engineer wouldn't work at a Pentagon contractor. So they are getting the best and the brightest whose limits of what they won't do because of personal ethics don't include ad tech, surveillance etc.

I'm not so sure that education would help either, it's my impression that ethics is just individually set. Of the people that understand Kant's categorical imperative, some will act accordingly and others will ignore their knowledge because doing so gets them more money.

6gvONxR4sf7o7y ago· 1 in thread

I would love for this to be illegal.

dymk7y ago

Thank God we don't live in a direct democracy

ploxiln7y ago

It's pretty obvious that the only real fix is to accept money in exchange for putting an image with a hyperlink on your website.

Anything involving javascript will do shenanigans for various reasons. Fingerprinting via any means possible is industry standard ad-network behavior at this point. No one in the industry could imagine doing any less - it's impractical, it's absurd. But targeting! But fraud! But the only fix is to just give it all up, go back to how it was done in the 90s.

miohtama7y ago

I like the comment on SO: "Deanonymizing via fingerprinting - illegal in EU"

lol7687y ago

It's insane to me the extent to which companies will go in order to prevent cross-site scripting attacks.. and yet they're perfectly happy to include unvetted, potentially malicious JavaScript on the same origin in the form of ads.

There is no reason these ads should be anything other than a linked image.

mappu7y ago

There's something up with my PulseAudio (maybe changing audio output formats?) that means I hear a very loud "pop" when pages try to do this.

e.g. Browsing to an arstechnica.com article, with speakers on but nothing else playing.

ddtaylor7y ago

How about stop letting remote sites execute arbitrary Javascript on your pages?

crispyporkbites7y ago

As a website publisher, is there an ad network available for me to use that doesn’t allow advertisers to run JavaScript?

If so, what kind of rates can I get?

thelazydogsback7y ago

This issue (along with many others) is due to one simple fact -- the internet is still primarily about presentation and rendering not information. We had both client-side template-based rendering and Semantic Web initiatives -- these failed for various technical and non-technical reasons at the time, but I'm hoping we go in that general direction again at some point. Nobody else should be able to (definitively) decide what information I want and how it should be presented to me. We only get the Internet that the majority are willing to put up with.

louhike7y ago

Gosh, it's incredible the lengths they will go to in order to de-anonymize user data. I guess I will think twice next time a website I like asks me to add them to my ad blocker whitelist.

miguelmota7y ago

Seems like classic fingerprinting behavior from Google Ads. It's unfortunate and hope they fix it quick but most importantly figure out a way to prevent it in the future

boomlinde7y ago

Tangentially related anecdote: I came across a site the other day that requested access to the MIDI API for no apparent reason. Is this a common tracking vector? The available MIDI interfaces can say something about the system but in 99% of cases (the 99% that don't have any physical MIDI interfaces) I don't imagine that you'll discover anything other than operating system family.

nvr2197y ago

Always use ublock (origin)

alinspired7y ago

this is the time to appreciate uBlock Origin's advanced mode, since 3rd party JS is blacklisted by default https://github.com/gorhill/uBlock/wiki/Advanced-user-feature...

unixhero7y ago

"Probably it tries to use the AudioContext for browser fingerprinting. – Bergi 11 hours ago"

avip7y ago

If you're a newcomer to this long thread, pls CTRL+F manigandham and read all his comments as a primer. Lots of misinformed couch-comments here. If you'd like to reasonably rant about ad-tech (and that's welcome), understand the value it provides first.

eyeball7y ago

I’ve been noticing horrible battery drain on my iOS devices lately. The battery monitor in settings says the worst offender is “safari audio”. I wonder if it’s something similar.

zaphirplane7y ago

If this is caused by accepting JS-enabled ads, what’s to stop the ad from changing the DOM or redirecting the browser to an SO phishing site?

emmelaich7y ago

It now makes sense that you’re rewarded for staying logged in.

paulcarroty7y ago

Ultradisgusting case on StackOverflow: 99.999% of top answers are edited by moderators - they just promote themselves with free content.

We need a real alternative - without stupid ads and master-slave karma-based community relations.

unixhero7y ago

Post closed due to wrong category.

Nick-Craver7y ago· 29 in thread

- Nick Craver, Architecture Lead at Stack Overflow

coldpie7y ago

Why are you allowing arbitrary javascript to be served to your users?

nerdponx7y ago

Wish I could upvote this 1,000 times.

It's ridiculous. It's a text-based ad. At worst, it's a clickable image. At what point did it become okay in your minds to let advertisers run arbitrary code?

I've left ads turned on specifically on StackOverflow because 1) I want to support StackOverflow, and 2) I trust them not to run malicious ads.

I don't even care that they're running ads network-wide. But if they're going to be running these kinds of ads anywhere on the site, they're going right on the ad block list along with everyone else.

Ajedi327y ago

I think this comment[1] on the linked Meta question explains it pretty well:

[1]: https://meta.stackoverflow.com/questions/386487/why-is-stack...

wlesieutre7y ago

Not just arbitrary JavaScript, arbitrary JavaScript where they can’t easily even see where it came from! Sheesh.

Could we require advertisers to sign their ad code to have a trail of where it came from, prevent tampering, and make it easier to pull the plug on bad actors?

The people bearing the costs of the internet ad economy aren’t the people in any position to do anything about it. So there’s very little pressure to fix anything.

Maybe if the US government started threatening to enact something like GDPR unless the adtech industry gets its shit together.
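On the "trail of where it came from" idea: a weaker but readily available cousin of signing is pinning a script by its hash, the way Subresource Integrity does. The sketch below is that swapped-in technique, not real code signing, and the allow-list workflow and function names are assumptions for illustration.

```python
import base64
import hashlib


def sri_hash(script_bytes: bytes) -> str:
    """Return a Subresource-Integrity-style value: 'sha384-' plus the base64 digest."""
    digest = hashlib.sha384(script_bytes).digest()
    return "sha384-" + base64.b64encode(digest).decode("ascii")


def is_approved(script_bytes: bytes, approved: set) -> bool:
    """True only if this exact script body was reviewed and pinned earlier."""
    return sri_hash(script_bytes) in approved


# Hypothetical review step: the publisher pins the script it looked at.
reviewed = b"console.log('ad');"
approved = {sri_hash(reviewed)}

print(is_approved(reviewed, approved))
print(is_approved(reviewed + b"new AudioContext();", approved))
```

Hash pinning detects tampering and silent swaps, but unlike a signature it does not say who produced the script, and it breaks the moment an ad network changes its code, which is presumably part of why it is not used for ad tags.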

m0dest7y ago

The solution is in sight. It's called Feature Policy.

https://feature-policy-demos.appspot.com/

https://developers.google.com/web/updates/2018/06/feature-po...
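For reference, a Feature Policy is delivered as a response header (or an `allow` attribute on an iframe) listing features and who may use them. A minimal sketch of building such a header value follows; the choice of features is an assumption, and whether any of them would actually stop a script from merely constructing an AudioContext is not something this thread establishes.

```python
def feature_policy_header(disabled):
    """Build a Feature-Policy header value that disables each named feature
    for every origin, using the "feature 'none'" directive form."""
    return "; ".join(f"{feature} 'none'" for feature in disabled)


# Hypothetical selection of features a text-and-image ad should never need.
value = feature_policy_header(["autoplay", "microphone", "midi"])
print("Feature-Policy: " + value)
```

The same directive string can be placed in an iframe's `allow` attribute to restrict only the ad frame rather than the whole page.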

_eht7y ago

Why are you allowing arbitrary JavaScript to run on your device?

zhangjunphy7y ago

Revenues are important. The users will not notice unless something happens. And when something happens they forget fast.

runn1ng7y ago

More money that way

gotodengo7y ago

From the post:

Seems like killing the audio is the metaphorical putting a finger in the dyke of serving arbitrary JavaScript to your users.

Benjammer7y ago

Maybe in the dyke holding back user outrage, but the dyke of serving arbitrary JavaScript was never built in the first place.

inferiorhuman7y ago

Nick, how did things go so wrong from three years ago?

e.g. https://news.ycombinator.com/item?id=20289841

Nick-Craver7y ago

I don’t know. I am so very much trying to find out and push to make things better.

Coding_Cat7y ago

> we are aware of the issue.

> We're trying to sort out how to kill the audio behavior now.

Are you really aware of the issue? The issue people have here is not the fact that the ad is trying to access the audio api per se but that it is trying to fingerprint the users.

wtmt7y ago

stevenjohns7y ago

Probably not his call. By "we" he's probably talking about the engineering team, which in many cases is nothing more than a conduit for whims of the marketing and sales teams.

The only time they'd do that is if the marketing team decided that the value-add from taking ads off cancelled out the profit loss from taking the ads off.

MzHN7y ago

So, we have:

- Stack Overflow makes a blog post about not using dynamic ads.

- Dynamic ads found on Stack Overflow, with aggressive fingerprinting.

- Architecture Lead doesn't know how this happened and is getting serious.

I have so many questions. I hope this gets a post-mortem.

amluto7y ago

The fundamental problem seems to be that you are including non-sandboxed JavaScript that you don’t control.

Perhaps you should stop doing that.

shostack7y ago

Would something like SafeFrame have avoided this issue?

https://www.iab.com/guidelines/safeframe/

geocar7y ago

Hi Nick,

If you're serious about this, I've built tools for the publisher side for stopping exactly this.

My email address is in my profile.

Nick-Craver7y ago

I’m very interested and very serious. Email sent.

JeremyBanks7y ago

I just saw this post, where a potential justification was provided for a similar script in the past: https://meta.stackoverflow.com/questions/335956/adzerk-servi...

jf7y ago

I would pay for an ad-free version of Stack Overflow. Take my money, please.

minitoar7y ago

I think the data in aggregate is worth more than people like you would pay for an ad-free service.

pushedx7y ago

It looks like something using fingerprintjs2.

This library is very popular.

https://github.com/Valve/fingerprintjs2/blob/master/fingerpr...

detaro7y ago

Not sure how that plays with rules about how you can place ads etc, but <iframe> with a feature policy can stop access to audio I think.

IloveHN847y ago

Why don't you block all the JavaScript not coming from your origin and just display a simple link+PNG as advertising?
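What this would look like in practice is a Content-Security-Policy header that only allows scripts from the site's own origin while still letting ad images load from elsewhere. A minimal sketch; the ad image host is a placeholder, and a real site would need more directives than this.

```python
def csp_header(image_hosts):
    """Build a Content-Security-Policy value: scripts only from the page's own
    origin, images from the page's origin plus the listed hosts."""
    img_sources = " ".join(["'self'", *image_hosts])
    return f"default-src 'self'; script-src 'self'; img-src {img_sources}"


# Hypothetical host serving static ad images.
print("Content-Security-Policy: " + csp_header(["https://ads.example.com"]))
```

With `script-src 'self'` and no `'unsafe-inline'`, third-party script tags and inline scripts are both refused by the browser, so the ad is reduced to the link and image suggested above.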

colek427y ago

This is exactly why I block third party advertisements for myself and everyone that uses my network.

ragerino7y ago

I hear from multiple people that they receive ads about topics they only talked to friends about but never entered into a search engine.

Google is currently very far away from their formerly world-famous "don't be evil" corporate culture.

Other examples are AMP, where Google wants to make it harder to de-individualise URLs. This is being driven to an extent where Chrome on Android makes it harder to edit the URL.

To me, WiFi-Control sounds nothing like location tracking. But I have to admit, I am not a native speaker. Therefore I might be misunderstanding something.

tjpnz7y ago

"Don't be evil" was replaced by "Do the right thing" years ago. Great piece of corporate speak right there.

jackdh7y ago· 25 in thread

Has there been any serious thought / discussion about how the cat and mouse chase of the ads vs ad blockers is going to end?

It would be interesting to see where we are in ten years.

keithwinstein7y ago

https://books.google.com/books?id=Q6o51-W_z8MC&lpg=PP1&dq=go...

Adnix and Preachnix were the essence of capitalist entrepreneurship, he argued repeatedly. The point of capitalism was supposed to be providing people with alternatives.

Adnix, much more than the innumerable libel suits against the original commercial networks, led directly to their demise. For a while there was a small army of unemployed advertising executives...

rocky11387y ago

yjftsjthsd-h7y ago

> In this version of the future, the ad blockers eventually win and network television is destroyed.

I love utopian visions of the future.

Gibbon17y ago

I think ad blocking is a misnomer. What people are trying to do when blocking ads is prevent marketing people from spying on them. And the performance and resource consumption that comes from that.

Personal opinion: Laws are needed to make what advertisers are doing illegal. Advertisers are spying on people to the extent where if the government did it they'd need a warrant.

pjc507y ago

yoz-y7y ago

I disagree. The tech crowd is using adblockers to prevent spying and resource consumption. But the majority of people running adblockers just don't want to see ads.

perl4ever7y ago

crispinb7y ago

I'm doing both. I don't want to be spied on, and there is no room in any sector of my life for corporate propaganda.

hombre_fatal7y ago

Adsense is just going to start providing content for you to inline into your site.

Kind of like how https://old.reddit.com/r/gaming/ is just a sequence of ads being flawlessly delivered to an ad-averse demographic that eats the ads up.

usrusr7y ago

eof7y ago

Savvy users will continue to block on machines that aren’t walled gardens and through pi-hole style blocking.

I think the cat and mouse aspect will be completely overshadowed by tech giants continually neutering their users' ability to block ads.

saagarjha7y ago

> most will be on Android/iOS where ad blocking will be minimal

Safari on iOS allows for content blocking, and Firefox for Android allows users to install extensions.

yellowapple7y ago

laughinghan7y ago

Security-wise, I think the best we can hope for is more and more OS-like sandboxing and isolation, capability-based security, and other defense-in-depth measures.

dorgo7y ago

> Turing-complete code

You can't build apps without Turing-complete code. We would be back to downloading and executing applications/programs.

kodablah7y ago

Desktop-wise, I've often thought [evergreen] client-side tools would emerge for content extraction via local [headless] browser automation. It's something I've contemplated building myself.

dillonmckay7y ago

Like gopher?

https://en.m.wikipedia.org/wiki/Gopher_(protocol)

yoz-y7y ago

There is Weboob which might interest you http://weboob.org

tunesmith7y ago

rodgerd7y ago

Google control most mobile OSes, almost all of the web browser market, and have more or less taken over the web standards process.

They've already won.

dorgo7y ago

jimktrains27y ago

I don't think most people have an issue with a single site tracking usage on that site. (Also, much of that can be obtained from server logs.)

The issue is cross-domain tracking, like we see with ad networks that profile you across many different sites.

lifeeeeee7y ago

Neural networks scan the final rendered image of the page for ads and remove them. You can't dodge that.

ceejayoz7y ago

Sure you can, the same way TV shows have done it - by subtly incorporating it into the text of the article.

penagwin7y ago

It will always be a game of cat and mouse. Even with Neural networks involved :D.

[0] https://arxiv.org/abs/1412.1897 [1] https://arxiv.org/abs/1710.08864

dabeeeenster7y ago· 11 in thread

"It's not very straightforward to find where it's coming from, but we are working on it."

This encapsulates the entire problem with the current state of digital advertising in 1 simple sentence.

ehnto7y ago

dTal7y ago

>It's amazing to me that an advert can run arbitrary javascript at all.

vbsteven7y ago

It’s not the internet we want but it is the one we have. In the end it is your own responsibility for which code gets executed on your machine.

keyle7y ago

But you know, we wouldn't stop serving ads until we work it out... no no imagine the loss in revenues.

JohnBooty7y ago

They have to pay a bunch of engineers and pay for a bunch of servers to keep things running.

And they certainly do contribute a massive amount of value to our community -- and as far as I can tell, they've always tried their very best to be good folks.

I'm not going to tell you how to think, but they have built up a lot of trust and goodwill in my book over the past decade.

I believe them when they say they'll work hard to do the right thing.

oconnor6637y ago

Easy to say when it's not my loss in revenues.

PopeDotNinja7y ago

Or you could just not use Stack Overflow.

craftinator7y ago

manigandham7y ago

It is straightforward, but not for publishers. The adexchanges know, and do business with shady companies on both sides because they make money from volume without any consequences.

dang7y ago

We detached this subthread from https://news.ycombinator.com/item?id=20289590.

m4637y ago

It's like targeted advertising with a flash suppressor.

kylegordon7y ago· 10 in thread

And this is why, even with the best intentions of site operators, my browser will continue to use the best ad-block tools I can get, and my networks will be protected by tools like PiHole.

MRD857y ago

mrosett7y ago

I wonder if that’s how I got hacked....

andrenotgiant7y ago

Exactly. Market solutions for market problems. I'd love to see the Raspberry PI foundation develop and sell a home router with PIHole for regular consumer use.

Considering the alternatives, that sounds really appealing to me. I'd also buy it for my less tech-literate parents.

hannasanarion7y ago

alecco7y ago

That's not enough, sadly.

Besides disabling JavaScript you can put hosts file blocklists.

Simple corporation block list (e.g. Facebook, Google) https://github.com/jmdugan/blocklists/tree/master/corporatio...

"Someone Who Cares" list http://someonewhocares.org/hosts/

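For readers unfamiliar with the format: a hosts-file blocklist like the ones linked above is plain text, where each entry maps a hostname to a sink address such as 0.0.0.0 so that lookups for it go nowhere. A minimal Python sketch of reading one, assuming the common "address hostname" layout. The sample entries and the helper names `parse_hosts_blocklist` and `is_blocked` are made up for illustration.

```python
# Addresses that blocklists commonly use as a "sink" for unwanted hosts.
SINK_ADDRESSES = {"0.0.0.0", "127.0.0.1", "::", "::1"}
# Names that legitimately point at the local machine and are not "blocked".
LOCAL_NAMES = {"localhost", "localhost.localdomain", "broadcasthost"}

# Hypothetical sample in hosts-file syntax (example.* domains only).
SAMPLE = """
# comment line
127.0.0.1 localhost
0.0.0.0 ads.example.com tracker.example.net  # trailing comment
0.0.0.0 Metrics.Example.org
"""


def parse_hosts_blocklist(text: str) -> set:
    """Return the set of hostnames that the hosts-format text sinks."""
    blocked = set()
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        parts = line.split()
        if len(parts) < 2 or parts[0] not in SINK_ADDRESSES:
            continue  # not a blocking entry
        for host in parts[1:]:
            host = host.lower().rstrip(".")
            if host not in LOCAL_NAMES:
                blocked.add(host)
    return blocked


def is_blocked(hostname: str, blocked: set) -> bool:
    """Exact-name match, which is how a hosts file behaves."""
    return hostname.lower().rstrip(".") in blocked


if __name__ == "__main__":
    blocked = parse_hosts_blocklist(SAMPLE)
    print(sorted(blocked))
    print(is_blocked("ads.example.com", blocked))
```

Hosts files match exact names only, so an entry for `ads.example.com` does not cover `cdn.ads.example.com`. That is one reason these lists grow so long.
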
jimmaswell7y ago

This seems melodramatic for something as trivial as an audio request.

mikeash7y ago

It’s not harmful, as long as you’re not one of the people who gets tricked. But it does indicate that they want to do you harm, and try to. That they failed doesn’t make it all better.

yifanl7y ago

Arbitrary code execution isn't really that trivial.

mort967y ago

Did you click the link..? It's fingerprinting, not just trying to legitimately play audio.

luckylion7y ago

superasn7y ago· 7 in thread

Maybe it's to identify users behind a VPN as this is fingerprinting the device, not the connection.

That's why I think the idea of running each site in a container is so effective.

And while we're at it the container should just spit out random shit like different resolution, audio api, user agent, once in a while (unless the user turns it off) to thwart such attempts.

Unfortunately, when the creator and maintainer of 67% of all browsers is an ad company that is exploiting this in the first place, there is no chance that this could happen.
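
To make the idea concrete: a fingerprinting script typically combines many browser attributes into one stable identifier, so changing even one reported value per session yields a different identifier. A toy Python sketch, assuming a fingerprint is simply a hash over attribute values. Real scripts use far more signals, and every attribute name and value below is invented.

```python
import hashlib
import random

# Invented placeholder values; not real browser strings.
COMMON_RESOLUTIONS = ["1920x1080", "1366x768", "1440x900", "2560x1440"]
COMMON_USER_AGENTS = ["ExampleBrowser/1.0", "ExampleBrowser/2.0", "OtherBrowser/5.1"]


def fingerprint(attrs: dict) -> str:
    """Hash the attributes in a canonical (sorted-key) order."""
    canonical = "|".join(f"{key}={attrs[key]}" for key in sorted(attrs))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def randomized(attrs: dict, rng: random.Random) -> dict:
    """Return a copy with some reported values swapped for common ones."""
    noisy = dict(attrs)
    noisy["resolution"] = rng.choice(COMMON_RESOLUTIONS)
    noisy["user_agent"] = rng.choice(COMMON_USER_AGENTS)
    return noisy


if __name__ == "__main__":
    real = {
        "resolution": "3440x1440",
        "user_agent": "RareBrowser/0.9",
        "audio_sample_rate": 44100,
    }
    print(fingerprint(real))
    print(fingerprint(randomized(real, random.Random())))
```

Whether this helps in practice depends on how many signals the tracker combines and whether it can spot implausible combinations.
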

apetresc7y ago

> And while we're at it the container should just spit out random shit like different resolution, audio api, user agent, once in a while (unless the user turns it off) to thwart such attempts.

superasn7y ago

laughinghan7y ago

That way, the page displays correctly for you, but the server has no idea your actual fingerprint.

vokep7y ago

Maybe, but that seems better than the current mess. I'd rather no features than features which act against my interests.

patrick54157y ago

I’d prefer legitimate api usage be broken than suffer through all the abuses.

nerdponx7y ago

That, and it should be pretty easy to filter out this kind of fake "chaff" data.

m4637y ago

everyone should be running the same container though.

johnwheeler7y ago· 7 in thread

I wonder if the top brass at alphabet ever worry that their trillion dollar empire is based on fragile foundations like web audio fingerprinting, etc.

that sure would keep me up at night.

obviously, i know google does more, but it seems like a large chunk of their revenue must be dependent on shady technical tricks like these working.

colinbartlett7y ago

They realized it was a risk so they built their own browser to have more control. And it worked. Only now, users are wising up and moving to Firefox.

H8crilA7y ago

No meaningful fraction of users is moving to Firefox [1] [2]. I wish this was the case, but it sure is not.

[1] https://en.m.wikipedia.org/wiki/Usage_share_of_web_browsers#...

[2] https://netmarketshare.com/browser-market-share.aspx

gdw27y ago

Is firefox less fingerprintable?

la_barba7y ago

They don't have to worry because they control the browser too, and so those tricks will continue to work for the foreseeable future.

wang_li7y ago

Or legal obligations to follow a user's desire not to be tracked with real criminal fines and jail time applied to executives, managers, and developers who failed to follow the law.

frogpelt7y ago

They have a pretty big moat around their business.

If you want to buy advertising online you're probably gonna end up dealing with them either directly or indirectly.

xhgdvjky7y ago

the internet is already a shady hack that only usually works

ndiscussion7y ago· 4 in thread

How We Make Money at Stack Overflow: 2019 Edition: Taking money from Microsoft and Google fingerprinting our users 100+ ways

source: https://stackoverflow.blog/2016/11/15/how-we-make-money-at-s...

rsj_hn7y ago

Your options, as I see them.

1. Text based ads only (no third party js)

2. HTML based ads but no js (run it through DOMPurify https://github.com/cure53/DOMPurify)

3. Look for a js sandbox -- this _will_ break arbitrary js, will not be supported in all browsers, and will require dev work on your side:

  * Google Caja  https://github.com/google/caja

  * MentalJS  https://github.com/hackvertor/MentalJS

other options are available as well, in varying levels of maturity and support.

I think using a sandbox iframe is not going to be able to defeat browser fingerprinting, because the sandbox control options are not rich enough. You would need to block all JS.
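
Option 2 above boils down to allowlist filtering: keep a small set of harmless tags and attributes and drop everything else, including scripts and inline event handlers. The following is a rough Python sketch of that idea using only the standard library. It is not DOMPurify and not a vetted sanitizer; the `AdSanitizer` class, the `sanitize` helper, and the particular allowlists are assumptions chosen for illustration.

```python
from html import escape
from html.parser import HTMLParser
from urllib.parse import urlparse

# Illustrative allowlists (assumptions, not taken from any real ad policy).
ALLOWED_TAGS = {"a", "b", "br", "div", "em", "i", "img", "p", "span", "strong"}
ALLOWED_ATTRS = {"a": {"href", "title"}, "img": {"src", "alt", "width", "height"}}
VOID_TAGS = {"br", "img"}
DROP_WITH_CONTENT = {"script", "style", "iframe", "object"}
SAFE_SCHEMES = {"http", "https"}


def _safe_url(value: str) -> bool:
    """Accept only absolute http(s) URLs; rejects javascript:, data:, etc."""
    try:
        return urlparse(value.strip()).scheme.lower() in SAFE_SCHEMES
    except ValueError:
        return False


class AdSanitizer(HTMLParser):
    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.skip_depth = 0  # > 0 while inside a dropped element

    def handle_starttag(self, tag, attrs):
        if tag in DROP_WITH_CONTENT:
            self.skip_depth += 1
            return
        if self.skip_depth or tag not in ALLOWED_TAGS:
            return
        kept = []
        for name, value in attrs:
            if value is None or name not in ALLOWED_ATTRS.get(tag, set()):
                continue  # drops onclick, style, and anything unlisted
            if name in {"href", "src"} and not _safe_url(value):
                continue
            kept.append(f' {name}="{escape(value, quote=True)}"')
        self.out.append(f"<{tag}{''.join(kept)}>")

    def handle_endtag(self, tag):
        if tag in DROP_WITH_CONTENT:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif not self.skip_depth and tag in ALLOWED_TAGS and tag not in VOID_TAGS:
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(escape(data))


def sanitize(ad_html: str) -> str:
    parser = AdSanitizer()
    parser.feed(ad_html)
    parser.close()  # flush any buffered text
    return "".join(parser.out)


if __name__ == "__main__":
    print(sanitize('<a href="https://example.com" onclick="x()">Buy</a>'
                   '<script>track()</script>'))
```

In the example at the bottom, the link and its text are kept while the `onclick` handler and the whole script element are dropped. Real ad markup is far messier, which is why a maintained library is the safer choice.
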

lostmsu7y ago

> HTML based ads but no js (run it through DOMPurify https://github.com/cure53/DOMPurify)

Or use iframe.sandbox, which was designed for it. https://www.w3schools.com/tags/att_iframe_sandbox.asp

akavel7y ago

4. Images! Why would they need anything else? Why would they need JS?

baroffoos7y ago

There are plenty of ad networks that do not allow advertisers to run JS. You have to run the ad networks script but that's the only one.

inglor7y ago· 4 in thread

Why is this surprising to anyone? It is clear that ads use tracking mechanisms and cookies and this is no different.

Audio feature detection isn't even a novel technique.

teraflop7y ago

One reason it's surprising is that, until recently, SO was particularly resistant to allowing invasive and/or obnoxious ads.

See also the recent decision to allow animated banner ads on various Stack Exchange network sites.

ehsankia7y ago

inglor7y ago

saagarjha7y ago

I hope "relevant parties" includes "browser vendors" and not "adtech companies" :)

ReedJessen7y ago· 4 in thread

Is this a scandal?

ProAm7y ago

kapep7y ago

[1]: https://meta.stackexchange.com/questions/329763/were-testing...

dymk7y ago

It's 2019, everything is a scandal

dRaBoQ7y ago

And everyone is outraged.

iamnotacrook7y ago· 4 in thread

It's ok. SO's policy on abusive ads is to mention it on meta and hope a moderator notices and then acts upon it.

gortok7y ago

iamnotacrook7y ago

Well perhaps you should get your story straight because on this page:

https://meta.stackexchange.com/questions/329763/were-testing...

which is being prominently announced in a yellow "featured on Meta" box you can read:

"If you see any ads that are inappropriate or have any questions about this experiment, please let me know by starting a new question and tagging it with advertising"

and

No, my ad-blocker is never coming off.

TheOtherHobbes7y ago

Ironic that content moderation is annoyingly aggressive, but ad moderation is annoyingly permissive.

fredsanford7y ago

>> Ironic that content moderation is annoyingly aggressive, but ad moderation is annoyingly permissive.

It's not ironic at all if you think about it a bit.

>>annoyingly aggressive,

Volunteer labor from nerds who expect you to match their idea of perfection

>> ad moderation is annoyingly permissive

Done by employees so it costs SO money.

EGreg7y ago· 4 in thread

jupp0r7y ago

hyperpape7y ago

https://amiunique.org/fp gives a unique fingerprint for both my Mac and my iPhone. To be honest, I don't know how they manage to fingerprint the iPhone, but they claim it's a unique fingerprint.

appleiigs7y ago

The most likely use-case here is ad fraud detection anyway.

Cpoll7y ago

> The most likely use-case here

I'm not so sure. There's a lot of market value in knowing that User 2341423 went to Site A, then Site B, then bought this item, etc.

meerita7y ago· 4 in thread

Has anyone checked how much of our data plan we cede to advertising? I bet it's 30%-40%.

chance_state7y ago

I have been using uBlock Origin for about three years and I browse the web heavily (4-6 hours/day). In that time it has blocked 13% of requests (10% on mobile).

I don't have enough info to quantify the amount of data blocked though.

gorhill7y ago

I have a tweet in my timeline which illustrates this: https://twitter.com/gorhill/status/934474012377444352

meerita7y ago

I tried to check my uBlock stats, but it didn't work well. It seems it doesn't fully work on Firefox.

meerita7y ago

10% on mobile isn't that bad. It's quite a lot!

helloworm7y ago· 3 in thread

Has anyone made a plugin that does a DOS on each ad server(s) detected? Then, we have built-in DDOS on the ad servers, if enough users install it.

anfilt7y ago

While the idea is cute you do realize that would have criminal repercussions for people who install said plugin in certain countries.

progval7y ago

Not exactly a DoS, but there's a browser extension designed to click on all ads and blur the signal: Ad Nauseam

icebraining7y ago

Good luck getting that past the Chrome Web Store censors. I doubt even Mozilla would accept it.

synthmeat7y ago· 3 in thread

It's most likely for web scraper detection. State of the art was using video codec availability as a fairly reliable data point, and I haven't seen audio being used for this. Quite interesting.

jupp0r7y ago

What makes you think it would be for web scraper detection vs user fingerprinting?

yjftsjthsd-h7y ago

Is there a difference? I mean, slightly different ends, but both very much benefit from fingerprinting.

synthmeat7y ago

Because they had a lot of trouble with sham sites generated by their content.

jasonjayr7y ago· 2 in thread

Why can't Google come up with an AMP for ads? That will transpile a restricted javascript (or whatever) into a runtime that just doesn't do these things?

progval7y ago

> That will transpile a restricted javascript (or whatever) into a runtime that just doesn't do these things?

They already invented that: https://github.com/google/caja

skoocda7y ago

Here you go:

https://amp.dev/documentation/guides-and-tutorials/learn/int...

z3t47y ago· 2 in thread

cameronbrown7y ago

> $1 per 100 page views

Eh, that's like 10x average CPM nowadays. And advertisers usually are paying per click, not impression.

z3t47y ago

1) Measured by analyzing the traffic I got from Google Ads 2) That's what I get from Google ads as a publisher, but you used to get a lot more in the epoch, like $5-10 CPM
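
For reference, CPM is revenue per 1,000 impressions, so the figures in this exchange can be checked with a few lines of Python. The function name is made up, and the numbers are the ones quoted above.

```python
def cpm(revenue_usd: float, impressions: int) -> float:
    """Revenue per 1,000 impressions (CPM, 'cost per mille')."""
    if impressions <= 0:
        raise ValueError("impressions must be positive")
    return revenue_usd * 1000 / impressions


if __name__ == "__main__":
    quoted = cpm(1.00, 100)        # "$1 per 100 page views"
    implied_average = quoted / 10  # if that really is "10x average CPM"
    print(f"quoted rate: ${quoted:.2f} CPM")
    print(f"implied average: ${implied_average:.2f} CPM")
```

So "$1 per 100 page views" is a $10 CPM, at the top of the "$5-10 CPM" range mentioned in the reply, and calling it 10x the average implies an average of about $1 CPM.
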

amadeusw7y ago· 2 in thread

Does Microsoft (ad owner) or Google (ad provider) perform the fingerprinting in this case?

dymk7y ago

Google

dudus7y ago

It seems that the specific script comes from https://integralads.com/ as stated by another commentator. I think the blame is to be shared here.

After reading about integralads I'm not even sure if the purpose is to fingerprint, it seems to be more targeted towards detecting fraud, which does not require fingerprinting necessarily.

My point is that it's not as easy as pointing to one company and blaming them. This is a problem that concerns everyone in the ad space.

JimBrimble357y ago· 2 in thread

Kind of makes sense why companies like Google and Facebook have invested so much in creating open-source front-end frameworks. The ROI is probably phenomenal.

I get that stackoverflow isn't an SPA, it just made me think of this point.

Side-note: you can block JS on stackoverflow and still view answers. That works for 98% of my usecase for the site.

__jal7y ago

> If you need JS to run the site

JimBrimble357y ago

That's fair, my point is that in many cases (a rapidly growing number of cases), the entire site is JS. If you need the service, then you have to accept the tracking.

sergiotapia7y ago· 2 in thread

Is there something I can use to randomly fuzz every tab individually as I browse the web?

They can track me through websites and I don't want that. Already using ublock origin.

fimdomeio7y ago

Not exactly what you asked for but got this from mozilla today: https://blog.mozilla.org/firefox/hey-advertisers-track-this/

kevin_thibedeau7y ago

NoScript + Decentraleyes + Random user agent + Self-destructing cookies.

rkagerer7y ago· 2 in thread

TLDR: A case of invasive fingerprinting triggered by a Microsoft ad delivered by Google.

rkagerer7y ago

I know Mozilla made an anti-fingerprinting announcement recently but IIRC all it does is check scripts against a blacklist: https://blog.mozilla.org/futurereleases/2019/04/09/protectio...

sfink7y ago

There's an option in Firefox, yes. privacy.resistFingerprinting or something, you can search for it. It tends to break a number of sites, iiuc.

atoav7y ago· 1 in thread

I don’t get the modern ad stuff, any reasonable person uses an adblocker anyway, because ads are often slow, problematic in terms of privacy and security.

The fact that even the people at a big site like Stack Overflow don’t instantly know where it comes from is only further proof that using an adblocker is a reasonable decision.

Only then your customers will trust the ads you use and only then any reasonable person can even consider deactivating the adblocker for your site.

I am pretty sure somebody explored this idea before me, why doesn’t it work?

bongobongo7y ago

It works, it just won’t happen because all the structural incentives point to the status quo. Another reason to love our current crop of monopolists...

captn3m07y ago· 1 in thread

A little bit of corporate newspeak (and digging):

Ad URL: https://static.adsafeprotected.com/sca.17.4.95.js

JS Domain: adsafeprotected.com

Domain Owner: Integral Ad Science, Inc[0]

Google's recent stance on the matter of fingerprinting[2]:

The important part being: _Nor do we let others bring fingerprinting data into our advertising products._

The same company advertises their fingerprinting capabilities:

And it is this fingerprinting that gets them selected as a Google Brand Safety and Viewability Preferred Measurement Partner[1]

[0]: https://integralads.com/capabilities/ad-fraud/

[1]: https://integralads.com/news/google-selects-ias-brand-safety...

[2]: https://blog.google/products/ads/transparency-choice-and-con...

pdkl957y ago

> Google has heavy cognitive dissonance

> Google doesn’t use fingerprinting for ads personalization

The only reason to add "...for ads personalization" is if they are using fingerprinting for other purposes. This could include other ad-related purposes like attribution.

Google's claims about not using specific data for a specific purpose are probably true. They simply fingerprint (and probably correlate) everything else.

kabwj7y ago· 1 in thread

If you don’t use an ad blocker you should expect your browser to behave in strange ways.

If you don’t use an ad blocker you should consider your computer compromised.

penagwin7y ago

It's been known that ads are commonly used to spread viruses / invasive tracking for years. And I've used adblock for almost 10 years!

Honestly, how are ads still allowed to execute javascript at all?! I get it if the ad-manager still executed javascript, but how is it okay to let random 3rd parties run js on your website?

pnw_hazor7y ago· 1 in thread

This practice could stop tomorrow if the best and brightest of us decided so.

luckylion7y ago

6gvONxR4sf7o7y ago· 1 in thread

I would love for this to be illegal.

dymk7y ago

Thank God we don't live in a direct democracy

ploxiln7y ago

It's pretty obvious that the only real fix is to accept money in exchange for putting an image with a hyperlink on your website.

miohtama7y ago

I like the comment on SO: "Deanonymizing via fingerprinting - illegal in EU"

lol7687y ago

There is no reason these ads should be anything other than a linked image.

mappu7y ago

There's something up with my PulseAudio (maybe changing audio output formats?) that means i hear a very loud "pop" when pages try to do this.

e.g. Browsing to an arstechnica.com article, with speakers on but nothing else playing.

ddtaylor7y ago

How about stop letting remote sites execute arbitrary Javascript on your pages?

crispyporkbites7y ago

As a website publisher, is there an ad network available for me to use that doesn’t allow advertisers to run JavaScript?

If so, what kind of rates can I get?

thelazydogsback7y ago

louhike7y ago

Gosh, it's incredible the lengths they will go to in order to de-anonymize user data. I guess I will think twice next time a website I like asks me to add them to my ad blocker whitelist.

miguelmota7y ago

Seems like classic fingerprinting behavior from Google Ads. It's unfortunate, and I hope they fix it quickly and, most importantly, figure out a way to prevent it in the future.

boomlinde7y ago

nvr2197y ago

Always use ublock (origin)

alinspired7y ago

this is the time to appreciate uBlock Origin's advanced mode, since 3rd party JS is blacklisted by default https://github.com/gorhill/uBlock/wiki/Advanced-user-feature...

unixhero7y ago

"Probably it tries to use the AudioContext for browser fingerprinting. – Bergi 11 hours ago"

avip7y ago

eyeball7y ago

I’ve been noticing horrible battery drain on my iOS devices lately. The battery monitor in settings says the worst offender is “safari audio”. I wonder if it’s something similar.

zaphirplane7y ago

If this is caused by accepting JS-enabled ads, what’s to stop the ad from changing the DOM or redirecting the browser to an SO phishing site?

emmelaich7y ago

It now makes sense that you’re rewarded for staying logged in.

paulcarroty7y ago

Ultradisgusting case on StackOverflow: 99.999% of top answers are edited by moderators - they just promote themselves with free content.

We need a real alternative - without stupid ads and master-slave karma-based community relations.

unixhero7y ago

Post closed due to wrong category.
