AI browser extensions are a security nightmare (opens in new tab)

(kolide.com)

260 pointsterracatta3y ago122 comments

122 comments

62 comments · 12 top-level

kypro3y ago· 24 in thread

> Yes, large language models (LLMs) are not actually AI in that they are not actually intelligent, but we’re going to use the common nomenclature here.
I'm sorry for the off-topic comment, but why do I keep seeing this? What am I missing here – is it that some people define intelligence as >= human, or that LLM are not intelligence because they're *just* statistical models?

VoodooJuJu3y ago

It's a way for the author to distinguish himself as one who is neither a purveyor of, nor fooled by, the magic, grift, and cringy sci-fi fantasizing that currently comprises the majority of AI discussion.

Currently, most mentions of AI, outside of a proper technical discussion, are coming from crypto-tier grifters and starry-eyed suckers. Even further, a lot of discussions from otherwise technical people are sci-fi-tier fearmongering about some ostensible Skynet, or something, it's not quite clear, but it's clearly quite cringe. The latter is one of the many calibers of ammunition being used by AI incumbents to dig regulatory moats for themselves.

Anyway, I understand why the author is distinguishing himself with his LLM...AI disclaimer, given the above.

dguest3y ago

In my field it's accepted (by some) that you write "AI" for your grant proposal and say "ML" when you talk to colleagues and want to be taken seriously.

It feels a bit wrong to me, because as you say it's arguably a grift, in this case on the taxpayer who funds science grants. More charitably it might just be the applicant admitting that they have no idea what they are doing, and the funding agency seeing this as a good chance to explore the unknown. Still, unless the field is AI research (mine isn't) it seems like funding agencies should giving money to people who understand their tools.

1 more reply

shagie3y ago

I think its the "just" statistical models part.

If you pull up the TOC for an AI textbook, you'll find lots of things that aren't "intelligent". Machine learning is just a subset of it. I recall a professor in the AI department back in the 90s working on describing the shape of an object from a photograph (image to text) based on a number of tools (edge detection was one paper I recall).

Also in AI is writing a deductive first order logic solver is covered in there as are min-max trees and constraint satisfaction problems.

http://aima.cs.berkeley.edu

https://www.cs.ubc.ca/~poole/ci/contents.html (note chapter 4)

https://www.wiley.com/en-us/Mathematical+Methods+in+Artifici...

People are trying to put a box around "AI" to mean a particular thing - maybe they want AI to mean "artificial general intelligence" rather than all the things that are covered in the intro to AI class in college.

I ultimately believe that trying to use a term that has been very broad for decades to apply to only a small subset of the domain is going to end up being a fruitless Scotsman tilting at windmills.

... And you know what, I think it does a pretty good job at being intelligent. https://chat.openai.com/share/01d760b3-4171-4e28-a23b-0b6565...

bee_rider3y ago

Very clever people have located true intelligence in the gaps between what an machine can do and what a human can. Therefore, to show that you aren’t a starry-eyed rube you put a disclaimer that you aren’t really talking about intelligence, but something that just looks and acts like it.

True intelligence is, of course, definitionally the ability to do things like art or… err, wait, sorry, I haven’t checked recently, where have we put the goalposts nowadays?

ethanbond3y ago

I’m hesitant to even call this moving the goal posts. Intelligence has never been solidly defined even within humans (see: IQ debate; book smart vs street smart; idiot savants).

It’s unsurprising that creating machines that seem to do some stuff very intelligently and some other things not very intelligently at all is causing some discontent with regard to our language.

I see a whole lot more gnashing of teeth about goalposts moving than I do about people proposing actual solid goalposts.

So what’s your definition?

2 more replies

pixl973y ago

Heh, Computers will never be intelligent, we will just moving the bar until humans can no longer be classified as intelligent.

hospitalJail3y ago

Stable Diffusion doesnt make art, it makes photos. We can deem them art.

Its denoising software.

1 more reply

wongarsu3y ago

There's long been a divide between what people call hard vs soft AI, or strong vs weak AI, or narrow vs general. The definitions are a bit fuzzy, but generally a hard AI or strong AI would be able to think for itself, develop strategies and skills, maybe have a sense of self. Soft AI in contrast is a mere tool where you put something in and get something out.

Now some people don't like using the term AI for soft/weak/narrow AI, because it's a fleeting definition, mostly applied to things that are novel and that we didn't think computers were able to do. Playing chess used to be considered AI, but a short time after AI beat the human chess world master it was no longer considered AI. If you buy a chess computer capable of beating Magnus Carlsen today that's considered a clever algorithm, no longer AI. You see the same thing playing out in real time right now with LLMs, where they go from AI to "just algorithms" in record time.

JohnFen3y ago

Because we don't have a real handle on what "intelligence" actually is, any use of the word without defining it is essentially just noise.

ethanbond3y ago

Yeah this is exactly it. It’s interesting seeing a precision-oriented discipline (engineering) running into the inherently very, very muddy world of semantics.

“What do you mean it’s not intelligent?! It passed Test X!”

“Yes and now that tells us Test X was not a good test for whatever it is we refer to as ‘intelligence’”

sublinear3y ago

> LLM are not intelligence because they're just statistical models

This is exactly it for me.

xigency3y ago

Are you intelligent or just a bunch of cells? Given that I can query it for all sorts of information that I don’t know, I would consider LLMs to, at the very least, contain and present intelligence…artificially.

1 more reply

ericd3y ago

And if your brain is mostly a statistical model of the world, with action probabilities based on what parts of it happen to be excited at the moment?

2 more replies

hospitalJail3y ago

Its interesting to see what it thinks about some ideas, like I ask, what 5 companies are best at marketing. My goal here is to be hypercritical of the companies it says because they are masters at manipulation. GPT3.5 was awful and confused advertising and marketing. GPT4 was perfect (Apple, Nike, Coke, Amazon, P&G)

As much as chatgpt doesnt want to give you answers because the fuzziness, it has the ability to make judgements on things like "This is the best" or "This is the worst".

Ofc with bias.

1 more reply

ravenstine3y ago

> is it that some people define intelligence as >= human

I just want to say that this seems to be how many, if not most people define intelligence internally. If an LLM gets something wrong or doesn't know something, then it must be completely unintelligent. (as if humans never get anything wrong!)

xigency3y ago

Clearly the test isn’t >= as ChatGPT is already more coherent than large swaths of the population. The AI test for some is that its intelligence >>> human intelligence. Which is funny because by that point in time, their opinion will be more than worthless.

ethanbond3y ago

Like with humans, there are intelligent ways to be wrong and unintelligent ways to be wrong.

LLMs do a whole lot of “wrong in a way that indicates it is not ‘thinking’ the way an intelligent human would.”

1 more reply

majormajor3y ago

AI's a very soft term, and there's long been a technical vs "casual" split in what it means. Five or ten years ago you'd say your photo was retouched with AI dust removal, say, and we'd all know what that means. And that there was a big gulf between that and the sci-fi "AI" of Blade Runner or Her or Star Wars, etc.

The user interface to Chat GPT and similar tools, though, has made a lot of people think that gap is gone, and that instead of thinking they are using an AI tool in the technical sense, they now think they're talking to a full-fledged other being in the sci-fi sense; that that idea has now come true.

So a lot of people are careful to distinguish the one from the other in their writing.

russdill3y ago

It's statistical models all the way down.

ryanklee3y ago

That is not a very good reason to call an entity unintelligent. There are uncontroversial models of human intelligence that are Bayesian.

2 more replies

nathan_compton3y ago

I say that large language models are not intelligent because of the way they fail to do things. In particular, they fail in such a way as to indicate they have no mental model of the things they parrot. If you give them a simple, but very unusual, coding problem, they will confidently give you an incorrect solution even though they seem to understand programming when dealing with things similar to their training data.

An intelligent thing should easily generalize in these situations but LLMs fail to. I use GPT4 every day and I frequently encounter this kind of thing.

NumberWangMan3y ago

Is there a definition of intelligence that rules out large language models, but that does not also rule out large portions of humanity? A lot of people would readily admit that they don't have programming aptitude and would probably end up just memorizing things. Do we say those people are not intelligent?

It seems to me that the perceived difference is mostly in being able to admit that you don't know something, rather than make up an answer -- but making up an answer is still something that humans do sometimes.

1 more reply

LudwigNagasena3y ago

> is it that some people define intelligence as >= human

Just like some people define stupid as <= them. Aptitude is a multivariate spectra. It is already hard to come up with a cutoff on a single measure, way harder to do so for a bunch of different skills that for some reason happen to correlate in humans (and sometimes they diverge wildly as in the case of savant syndrome).

guy982387103y ago

More like intelligence == human. ChatGPT is superhuman in many ways.

amelius3y ago· 12 in thread

Actually, aren't all browser extensions a security nightmare?

Or has something changed recently?

kpw943y ago

Yeah parts of the article would still be as valid if this was about regular extensions.

The main difference is that AI extension, by design, send the content of the pages you browse to a server.

A malicious "calculator" extension could also send all the content to a server, and extension users don't really have an idea of what each extension is actually doing.

So skip the "Malware posing as AI browser extension" section, it's same kind of security issues as a malware calculator extension.

The legitimate AI extension's problems are more interesting.

Article wastes a bit more time on other security issues you get from using AI LLM in general. Those apply whether you're using a browser extension or chat.openai.com directly.

The valid point that applies to narrowly AI browser extension are:

1) it could send sensitive data you wouldn't have sent otherwise. Most people would know what they're doing when they explicitly paste the stuff on chat.openai.com. But when it's now automated via the extension DOM scraping, it's a bit harder to realize how much you're giving away.

2) And the hidden text prompt injection. That's interesting as now your attacker could be the website you browse, if you have configured too many plugins (Zapier plugin giving access to your email)

These 2 parts of TFA are imo novel security issues that only exist with AI browser extension, and are interesting.

mattigames3y ago

If a calculator extension is caught sending any data at all over the network they immediately would be suspicious, but evey AI extension has plausible deniability when making any requests, most can send all the webpage including form inputs and still have such deniability.

madeofpalk3y ago

That's actually what I thought the title was until reading your comment, and I agreed vehemently.

jprete3y ago

No, because a typical safe-to-run browser extension is written in such a way that it can be examined to see what it does. AI-based tools can’t be analyzed based on their code, so the only way to make them safe is by limiting their capabilities. Any such capability limit is likely to be either too constraining, not constraining enough, or require as much planning ability as the AI itself.

majormajor3y ago

When you talk about not being able to analyze these based on their code do you mean because today they're all just calling out to OpenAI or whoever?

The risks listed in the article itself mostly seem to fall under the same, non-AI-extension, core problem of "you're given them all your data." And that's a risk for non-AI-based extensions too, but if you look at the code of an AI one, it's gonna be obvious that it's shipping it off to a third party server, right? And once that happens... you can't un-close that door.

(The risks about copyright and such of content you generate by using AI tools are interesting and different, but I don't know that I'd call them security ones.)

The prompt injection one is pretty interesting, but still seems to fall under "traditional" plugin security issues: if you authorize a plugin to read everything on your screen, AND have full integration with your email, or whatever, then... that's a huge risk. The AI/injection part makes it triggerable by a third-party, which certainly raises the alarm level a lot, but also: bad idea, period, IMO.

2 more replies

amelius3y ago

The problem is the permission system. Like apps, extensions have an all-or-nothing attitude to permissions. Browsers should allow the user to be more specific about permissions, and let extensions think the user gave more permissions than they actually did. E.g. if extension insists that they need "access to entire filesystem", the browser should make the extension believe they have access to the entire filesystem, but of course the entire thing is sandboxed and the user can restrict the access behind the scenes.

Without this feature, extensions will keep insisting they need access, and the user will eventually fall for it.

1 more reply

LapsangGuzzler3y ago

shout out to the Arc browser, which has it's own browser sandbox and WYSIWYG tools to build JS snippets that run in your browser. I'm not affiliated with them in any way, but they're really changing the way I look at browsing online.

moffkalast3y ago

Does that come on a CD along with Intel Arc GPUs? :D

moritzwarhier3y ago

Already commented something similar in another thread:

Why is the security policy for extensions still not architected like other web permissions?

There has been a shift on mobile already from "take it or leave it"-style permissions on install towards more fine grained control not overidable by the app manifest.

I think Browser extensions should behave similarly. Especially when it comes to which origins an extensions is allowed to act on.

The user should be able to restrict this regardless of the manifest, even forced to do.

Extensions that need to act on all or an unknown set of origins should require a big and scary prompt after installation, regardless of what the user agrees to during installation.

I say this as a happy user of uBlock origin and React DevTools.

But for the common user the default should be to deny permissions and require user interaction.

dylan6043y ago

you can make a warning as big and scary as you can, and people will just blindly hit accept/agree/ok. the look/design of the banner is not what will stop people from hitting ok, as at this point, i don't think anything will

2 more replies

hgsgm3y ago

Mobile doesn't give you control over which origins it contacts.

1 more reply

notatoad3y ago

an extension developer can scope their extension to only run on certain URLs, and if that list changes then chrome will automatically disable it until the user re-authorizes for the new set of URLs.

so they're not a total security nightmare if they're only authorized to run on sites where you don't enter any private data. for example, looking through my extensions list, the py3redirect that autmatically redirects python2 documentation pages to python3 pages doesn't request access to anything other than python.org.

but otherwise, yeah, you're giving permission to execute arbitrary code on any website you visit, which is about as compromised as your browser can get.

CyberDildonics3y ago· 5 in thread

Browser Extensions Are a Security Nightmare - I guess you can add AI in front to make it seem new.

mahogany3y ago

Exactly - it blows my mind how normalized the permission Access your data for all websites is (I think it's Read and Change all your data on all websites for Chrome). I use only one or two extensions because of this. Why does a procrastination tool need such an insanely broad permission?

hoosieree3y ago

I wrote a Chrome extension[1] that reads no data but places a colored translucent div over the page. It requires that same "change all your data" permission.

My takeaway lesson is that the permissions model for extensions is confusing and nearly useless.

[1] https://chrome.google.com/webstore/detail/obscura/nhlkgnilpm...

3 more replies

waboremo3y ago

If it operates on more than one domain, it needs those permissions to function based on how the permissions system works. You can limit those yourself in the settings page for the extension, but everything else is basically workarounds applied to avoid that permission.

For example, a web clipper operates on multiple domains, but it can avoid it by using activetab permission instead and then offering optional permissions if it wants when you click on the clipper extension icon.

If you want something to be done automatically on multiple domains, this is not possible without that permission. Not unless you want to annoy users with prompts.

hospitalJail3y ago

Just because an extension can do that, doesnt mean they are sending your info to a server.

1 more reply

croes3y ago

Exactly.

But I think at the moment it's easier to get someone to install an extension as long it mentions GPT or AI.

matheusmoreira3y ago· 3 in thread

Pretty much every single extension that isn't uBlock Origin is a security nightmare.

vorticalbox3y ago

Even unblock is, only takes the repository owners login to be taken an update pushed.

hxugufjfjf3y ago

No. There are many good, secure browser extensions.

madeofpalk3y ago

Such as?

1 more reply

ricardo813y ago· 2 in thread

Only skimmed through the article, it seems -AI from the title would be an old story?

Also, that huge 4.7MB image in the head of the article...

Anthony-G3y ago

Another good reason to use uBlock Origin and select the “Block media elements larger than x KB” option (x defaults to 50).

Edit: Wow! I just tried loading the page and see that the ridiculously large image still loads. That’s a particularly obnoxious website: the image’s HTTP header says that its Content-Length is 0 so it still gets downloaded by the browser.

SCUSKU3y ago

SEO, who needs it!

Roark663y ago· 1 in thread

>Actually, the current AI situation may be even more perilous than Jurassic Park. In that film, the misguided science that brought dinosaurs back to life was at least confined to a single island and controlled by a single corporation. In our current reality, the dinosaurs are loose, and anyone who wants to can play with one.

I'm really tired of reading stuff like this above. Seriously, AI is a disruptive tech and some people will oppose any change, but this is too much. All of the "security issues" mentioned in the article are true for browser extensions,and perhaps even software in general.

Then the author talks about "copyright mess" just before describing how it is pretty much resolved in their company (copilot banned).

The only real "problem with AI" is really a "problem with cloud" or more precisely "problem with people's lack of understanding of it". Average people should be interested in finding software alternatives that don't undermine their privacy.

For example look at AI image up scaling. Every single android app other than mine sends user's images to a server somewhere. Are those images retained? Are they scanned for whatever "legal purposes" the maker deems adequate? No one knows. No one cares. Well specifically in the entire world about 90 people seem to care.

Why 90 people? Because that's how many users my android app has 6 months after release. (the app does all processing locally, free version is ad supported, paid version can be used 100% offline).

cyanydeez3y ago

While true, the main problem the ChatGPT era presents is the ability to do powerful things with weakly defined understand.

This is like handing out footgun coupons to all citizens who become "of age" and saying it's cool cause they were already legally allowed to buy footguns.

Tycho3y ago· 1 in thread

I wonder when we’ll start seeing computer viruses that communicate with a remote LLM in order to get help circumventing barriers.

Alternatively, maybe anti-virus software can phone home to get on-the-fly advice.

ronsor3y ago

> Alternatively, maybe anti-virus software can phone home to get on-the-fly advice.

Modern antivirus software already does this, more or less. It's usually called something like "cloud scanning."

deathlight3y ago· 1 in thread

I have perment unstoppable hiccups that have occurred in the last week or so. Nothing I have tried has made them stop in fact I just hit up more every time I try to report record anything. I would like to just breathe without having hiccups and it's not even a choice for me I'm not even permitted to even attempt to stop this Behavior May hiccups are constant and unending. I have run out of ideas of who to pursue for help this is just Agony I can't even breathe without constant hiccup interruption I don't know how to make it stop and I'll do anything at this point.

mptest3y ago

https://pubmed.ncbi.nlm.nih.gov/3395000/

In case you're not joking

FL33TW00D3y ago· 1 in thread

But what is the "AI" ran entirely locally? https://pagevau.lt/

williamstein3y ago

The "Download for Chrome" link on that page is broken. "404. That’s an error. The requested URL was not found on this server. That’s all we know."

drtgh3y ago

I do not understand how is it possible that Internet browsers do not currently have already built-in firewall that allows the user to control where the connection requests in the browser in general -the tab, the loaded web, the addon- are going to and from, and filter them.

Garcia983y ago

The issue is not AI, nor browser extensions per se, the issue is the lackluster permission system that Chrome extensions have, it's pretty similar to what Android had 7 (?) years ago, which should not be acceptable in 2023.

activiation3y ago

Automatic updates should be disabled by default...

j / k navigate · click thread line to collapse