This article from Agoric is extremely relevant here, from a previous such incident (re: the event-stream package on npm): https://medium.com/agoric/pola-would-have-prevented-the-even...
Put simply: in many cases, the dependencies you install don't need nearly as much authority as we give them right now. Maybe some of these packages need network access (I see a few named "logger" which might be shipping logs remotely) but do they need unrestricted filesystem access? Probably not! (They don't necessarily even need unrestricted network access either; what they're communicating with is likely pretty well-known.)
The security manager is an additional layer of security that most languages don't have; however, Java applets have shown it to be full of holes and generally unsuitable for running untrusted code.
The applet security posture has contributed a great deal to the negative opinion of the language; it probably would have been better off never having existed.
In 1996, Java was being overwhelmed by exploits because the mapping of the language to the VM was not well matched. There was a Java summit with lots of interesting people. This summit was also when Sun got confirmation that Microsoft had quite a few engineers working on an independently implemented runtime. To Sun's credit, they did get rather more serious about Java security -- but they had already created a rocky foundation.
It is my opinion that the business model Sun had in mind for Java was a free runtime for everyone, which they controlled, while making money from selling an "official" Java compiler suite.
I do not believe that the Sun Java JVM was created with security in mind.
I could be wrong, but I don't see any mention of permissions on imported code: https://deno.land/manual@v1.7.2/examples/import_export
Simply relying on package signing and the like still permits trusted-but-malicious actors. With permissions configured well, Deno can really lock things down and limit a ton of attack vectors.
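For example, Deno's permission flags scope a script to exactly the resources it names; the host and path below are hypothetical:

    # Allow network access to one host and read access to one directory, nothing else.
    deno run --allow-net=logs.example.com --allow-read=./config main.ts

Any attempt by a dependency to touch the filesystem or other hosts is rejected at runtime.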
Here is an interesting proposal on how to possibly get there in JS with import maps: https://guybedford.com/secure-modular-runtimes
Deno uses ambient permissions for the entire process and unfortunately missed the opportunity to do it right.
1. https://hacks.mozilla.org/2019/11/announcing-the-bytecode-al... (ignore the title, it's irrelevant to the excellent explanation that constitutes 70% of the post)
The main hope at the moment seems to be JS.
I thought it was because operating systems still use access based instead of capabilities based security?
That's how the web/application server containers worked (probably still do, but I've been disconnected). The server classes have different permissions from the application code classes (loaded from the .war/etc files). If an application code method calls into a system class, the permissions which apply are those of the application, since that method is in the calling stack frame.
I wrote this support into several Java web container and J2EE application server products back in the day. AFAIK, all that still works great today in Java.
e.g., If someone gives an app the ability to upload photos, it can silently read all photo metadata, upload all photos to a private server instead of uploading just the single photo that the user picked. This can be solved with OS level standard photo pickers but it hasn't been yet.
Same with package code. Maybe a package needs network access for stuff it genuinely needs to do. However it can (and probably will) at some point go above and beyond in the amount of data it collects. FB Mobile SDK outage is a good example of this. https://www.bugsnag.com/blog/sdks-should-not-crash-apps
Agoric (the company whose blog post I linked to) and the people behind it have done a ton of object capability work over the years.
POLA is good to live by regardless if it can be implemented.
Systemd has some capability to restrict access to system resources. I haven't experimented with the capabilities yet so not sure what's all there.
I've noticed more dev teams succumbing to the temptation of easiness that many modern package managers provide (NPM, Cargo, Ivy, etc.) - especially as someone who has to work with offline systems on a regular basis.
Because of that ease there are fewer tools and tutorials out there to support offline package management. There are more for using caches, though these are often along the lines of either 'the package manager will do this for you and it just works (but in case it doesn't, delete node_modules or cargo clean and re-try)', or 'stand up a dependency server on your own machine with these proxy settings' (which has its own security issues and is frequently disallowed by IT cybersecurity policies).
As an example, many blog articles I found a while back suggest using yumdownloader from the yum-utils package. This is unfortunately not reliable, as there are some packages that get skipped.
I have found I need to script reading a list of dependencies from a file. Then, for each dependency: create a directory for it and use repotrack to download its RPM and its transitive-dependency RPMs into that directory. The script then aggregates all the RPMs into one directory, removes the OS-installed RPMs, uses createrepo to turn that directory into an RPM repository, and finally makes a UDF ISO image out of the directory for transfer onto the offline system and installation.
The fact that pip/npm/gem etc. look for packages in a fallback location if not found in the private repository is a terrible design flaw. One which not all package managers have.
For example, when you add a cargo dependency from a private registry, you have to specify the registry that the dependency comes from, so cargo will never go looking in some other place for that crate. I'm sure many other package managers also have designs that are not vulnerable in this way.
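A sketch of that cargo setup (the registry name and index URL are hypothetical):

    # .cargo/config.toml -- define the private registry
    [registries.my-registry]
    index = "https://crates.corp.example.com/git/index"

    # Cargo.toml -- the dependency names its registry explicitly,
    # so cargo never falls back to crates.io for it
    [dependencies]
    internal-auth = { version = "1.2", registry = "my-registry" }

Because the registry is part of the dependency declaration, publishing a crate named `internal-auth` on crates.io gains an attacker nothing.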
Similarly, many package managers do not support pinning of transitive dependencies (with hashes), or pinning does not happen by default, so that many people are still using floating dependencies.
Proof: https://www.theregister.com/2016/03/23/npm_left_pad_chaos/
Sudden unplanned loss of availability is a catastrophic security problem, the A in the security CIA[1]. Worse is that the dependency that caused that problem was something that should never have been a dependency in the first place.
Proper dependency management requires a degree of trust and integrity validation which are completely counter to automation. Most developers are eager to accept any resulting consequences because they don't own the consequences and because they are fearful of writing original code.
[1] https://en.wikipedia.org/wiki/Information_security#Key_conce...
Look at what happened when the "left-pad" function disappeared from npm a few years ago. IIRC, it broke react. The downside of package managers like this is that many people have no idea what they are using.
Anyone who uses this must have already understood, and just overlooked, this vulnerability at the point where they realized their private package must have a unique name that doesn't match any public package.
In which case, would you not get the same issue, if you do the same attack, but with a transitive dependency which you haven't specified?
Right, that's why we see this kind of attack all the time on Maven Central, but never on npm... oh, wait?! NO! The kinds of simple attacks you see routinely on npm (typo squatting, ownership transfers to malicious authors, now this) just don't happen on Maven Central at all.
You have a username/password to Maven Central and you also have a private key to it.
But in order to be granted a groupID (think of it as an account), you need to prove at the time of account creation that you own the domain that matches the groupID (think account name).
So if you try to register com.foo on Maven Central, at that time you need to own foo.com, otherwise you'll be rejected.
If you do own it at that time, well your account is approved and now you have a username/password to it and a private key you need to use to sign artifacts when you publish them.
If your domain expires and is later bought by someone else, that doesn't make them the new owner of your Maven Central groupID.
I find it hard to believe any high profile organization would allow their domains to expire, or else they would also lose e-mail and websites, right?
How long did it take npm to get scoped packages? Sure, let me create a "paypal" project; they only need one JS project, no?
If Java suffers from excessive bureaucracy, the newer package developers/repos suffer from too much eagerness to ship something without thinking
Not to mention dependency and version craziness. If you want your software to be repeatable you need to be specific with the versions and code you're taking.
The very existence of package-lock grinds my gears and that's before it starts flip flopping because someone mistook URLs for URIs. Of course that only exists because ranged dependencies are a terrible idea, and that's before anybody even mentions things like namespaces or classifiers.
No maven wasn't perfection, and it could be (and has been) improved on - but npm doesn't even get into spitting distance.
What I want as a developer is to establish my trust relationship to developers of libraries I depend on.
`npm install <somepackage>` should first check a record of signing keys in my source code repo, then check a user-level record of signing keys I've trusted before, and then - and only then - add a tentative trust relationship if this is brand new.
`npm release` or whatever (npm is just an example - every system could benefit from this) - would then actually give me the list of new trust relationships needed, so I can go and do some validation that these are the packages I think they are.
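A minimal sketch of that trust-on-first-use check. The package names, fingerprints, and the `trust` dict (standing in for the record of signing keys kept in the source repo) are all hypothetical; this is not any package manager's actual API:

```python
def check_publisher(trust: dict, package: str, key_fingerprint: str) -> str:
    """Return 'trusted', 'NEW', or 'MISMATCH' for a package's signing key.

    On first sight of a package, the key is recorded only tentatively,
    so the release step can list it for manual validation.
    """
    known = trust.get(package)
    if known is None:
        trust[package] = key_fingerprint  # tentative trust-on-first-use
        return "NEW"
    return "trusted" if known == key_fingerprint else "MISMATCH"

trust = {"left-pad": "ab12"}                                     # previously trusted
assert check_publisher(trust, "left-pad", "ab12") == "trusted"   # known key: proceed
assert check_publisher(trust, "new-lib", "cd34") == "NEW"        # flag for review
assert check_publisher(trust, "left-pad", "ff99") == "MISMATCH"  # key changed: stop
```

A real implementation would of course verify signatures against the fingerprint rather than compare strings, but the state machine is the important part.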
The key thing with Go is that all dependencies have a checksum (go.sum file) and that should be committed to the repo.
So even if the domain gets hijacked and a malicious package is served up, then the checksum will fail and it will refuse to build.
People should be using internal module proxies anyway for Go. You can just store the module files in a directory, a git repo or a web service and serve up an internal cache.
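A sketch of that setup using Go's built-in knobs (the proxy host and module path are hypothetical):

    # Fetch modules via an internal proxy, falling back to direct fetches,
    # and skip the public checksum database for internal module paths.
    go env -w GOPROXY=https://goproxy.corp.example.com,direct
    go env -w GOPRIVATE=corp.example.com/*

Public modules still get verified against the committed go.sum, while internal modules never leave the proxy.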
Packages are typically considered immutable once published. If I have a particular package e.g. "FooLib.Bar v 1.2.3" then this zip file should _always_ contain the same bits. If I need to change those bits, e.g. to fix a bug then I need to ship e.g. "FooLib.Bar v 1.2.4"
Also packages aren't always small. So it makes sense to cache a copy locally. On dev machine "package cache" and in an org's "internal feed" and only check upstream if it's not there.
So I shouldn't need to go to the source url to get it. Ideally, I just ask "who has "FooLib.Bar v 1.2.3" for me?"
It also means that tampering can be detected with a hash.
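The tamper check itself is trivial once the hash is pinned; a sketch with hypothetical package bytes:

```python
import hashlib

def verify(package_bytes: bytes, pinned_sha256: str) -> bool:
    """True only if the downloaded archive matches the digest pinned at publish time."""
    return hashlib.sha256(package_bytes).hexdigest() == pinned_sha256

# Digest recorded when "FooLib.Bar v1.2.3" was first published (hypothetical contents).
pinned = hashlib.sha256(b"FooLib.Bar v1.2.3 contents").hexdigest()

assert verify(b"FooLib.Bar v1.2.3 contents", pinned)  # same bits: OK
assert not verify(b"tampered contents", pinned)       # any change is detected
```
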
But the "check upstream" model is now vulnerable to fake new versions.
Free software / open source propels engineering as you can share and leverage the results of collective efforts. However, at no point did the concept come with inherent guarantees about concerns such as security.
esr defined 19 points for "good" open source software development in his seminal essay "The Cathedral and the Bazaar". I feel some of those are sometimes easily thrown out of the window for the sake of "efficiency" or "cost-effectiveness".
This issue resonates with bullet point 17 in particular:
> A security system is only as secure as its secret. Beware of pseudo-secrets.
I think this issue has less to do with package managers, and a lot with companies rushing into the convenience of public code platforms such as Github without properly vetting whether or not they might be inadvertently leaking internal information through packaging manifests.
That is not true at all. The industry, both in Development and even more so in Operations, has been outsourcing responsibility for a long time; that is why we have support contracts, SLAs, and other very expensive services we pay many, many times the cost of the hardware for...
To outsource responsibility... Network down -- Call Cisco... Storage Down Call EMC or Nimble... etc
It seems dumb that they don’t have per repo tokens. I think the issue is with their licensing as if they made proper tokens users could abuse it by giving tokens to their friends. But this should be detectable in a friendly (please don’t do that) way.
I want to be able to give read-only access to private repos.
I want to be able to give fine grained function level and repo level access.
If I’m an admin on multiple repos, I want to be able to issue a token for just a single repo so I can give that to a CI job without worrying if every single repo I admin is at risk.
They allow ssh keys with some similar functionality, but ssh keys can’t be used as much as tokens.
I’ve been waiting for a story about how some third party app granted access to my whole org gets taken over and wreaks havoc. Eventually this will probably be the attack that alters real packages instead of these name overloading packages.
https://docs.github.com/en/developers/apps/scopes-for-oauth-...
https://docs.github.com/en/rest/reference/permissions-requir...
Anyway. I’m sticking with Pop / Linux. But it does make me nervous!
I'd guess distros are generally better off in that respect, but kernel space & user space aren't that different nowadays, when caring about your own security
I'm very happy to finally have a real world example to motivate all the folks that eye-rolled me every time I've raised it in the past. It just resonates better, especially with less technical leadership folks.
what could possibly go wrong?
Random libraries, possibly pulled in by a dependency of a dependency of a dependency... not so much.
curl -sSL https://dot.net/v1/dotnet-install.sh | bash /dev/stdin <additional install-script args>
See https://docs.microsoft.com/en-us/dotnet/core/tools/dotnet-in...
There are other examples I've seen from time to time.
If I'm not mistaken insider knowledge wasn't necessary.
Remember how much was temporarily broken in the leftPad event? Imagine if all that had been silently back-doored instead?
Discovered after seeing a comment on HN about a bill of materials for software, i.e., a list of "approved hashes" to ensure one can audit exactly what software is being installed, which in turn led me to this issue.
Even then, that only gives you a stronger indication that the image hasn't been altered at any point after the image author signed it. It is not a guarantee that the source produced the binary content. It's also not a guarantee that the image author knew what they were signing, though that is a different issue.
Debian has a reproducible builds initiative[1] so people can compile packages themselves and the result matches, byte for byte, what Debian built. Not sure how far they've got with that.
https://tests.reproducible-builds.org/debian/unstable/amd64/...
[1] - https://docs.docker.com/engine/security/trust/#client-enforc...
I could easily find myself in trouble, because:
- There’s no autocomplete or bookmarks, so typos are easy.
- If “mybank” is a name provided by my company’s name server, I could find myself redirected to the public “mybank” entry because Mr. Not-A-Hacker says his name entry is more up to date (or because I forgot to tell ‘goto’ to check the company name server.)
- There’s no “green padlock” to check while I’m actively using the destination site. (Though at this point it’s too late because a few moments after I hit enter the destination site had the same access to my machine & network that I do from my current terminal.)
- A trusted site may later become malicious, which is bad due to the level of unrestricted and unmonitored access to my PC the site can have.
- Using scripting tricks, regular sandboxed browser websites can manipulate my clipboard so I paste something into ‘goto’ that I didn’t realize would be in my clipboard, making me navigate to some malicious site and giving it full access to my machine (if ‘sudo’ was added to the front).
This is just a few cases off the top of my head. If ‘goto’ was a real thing, we’d laugh it into being replaced by something more trustable.
How have current package managers not had these vulnerabilities fixed yet? I don’t understand.
At Google, we have those resources and go to extraordinary lengths to manage the open source packages we use—including keeping a private repo of all open source packages we use internally
I wonder how they built this culture and if it is even realistic for smaller companies to aim for it.
I think it typically comes down to a few key leaders having the political capital/will to enforce policies like this. Google's `third_party` policies[0] were created relatively early on and were, as far as I understand, supported by high level technical leaders.
The ROI of policies like these is not always immediately evident, so you need the faith of key leaders in order to make room for them. Those leaders don't necessarily need to be high in the org chart — they just need to be respected by folks high in the org chart.
As a counterfactual, establishing Google's strong testing culture seems to have been a mostly bottom-up affair. Good article on the history of that at Mike Bland's blog[1].
0. https://opensource.google/docs/thirdparty/ 1. https://mike-bland.com/2011/09/27/testing-grouplet.html
Fortunately there was a hard legal requirement to vet every dependency license, otherwise I am not sure I would have been able to keep this workflow. As other posts say, you need a very strong commitment at the management level for this to work. Besides security (where it often feels like it matters only until it costs money, or until it's even slightly inconvenient), it might be helpful to make a legal case (what if we ship something with a nested dependency on AGPL?) to get some help establishing these procedures.
I have been writing and architecting security related software for pretty much all my career and I find it quite scary how these days so much software delegates so much control to unvetted external dependencies.
We would pay to access their "distribution", a limited set of packages vetted by them. The distribution vendor would screen changes from upstream and incorporate them into their versions.
Of course this is more limited world. It’s like using a paid Linux distribution with certain amount of software covered by the vendors support policies.
It's also useful for your organization to rebuild all of the source code from scratch (for reproducible packages anyway) and compare the new ones to the old ones, looking for things like compiler or hardware injection attacks. Secure build systems are definitely non-trivial.
https://cloud.google.com/security/binary-authorization-for-b...
> a unique design flaw of the open-source ecosystems
This is a big generalization.
Inside Amazon, as well as in various Linux distributions, you cannot do network traffic at build time and you can only use dependencies from OS packages.
Each library has its own package and the code and licensing is reviewed. The only open source distribution that I know to have similar strict requirements is Debian.
[I'm referring to the internal build system, not Amazon Linux]
[Disclaimer: things might have changed after I left the company]
I know, because I wrote an as yet unpublished paper on safely pulling packages from private and public repos.
Using terms correctly is especially important in security: someone who read your comment might incorrectly believe that this did not affect them because they are using the correct names for all of their dependencies.
Installing packages only from a trusted (and signed) source protects against typosquatting, misread or confusing package names and many other risks.
When I first used maven, I was appalled by how hard it was to prevent it from accessing maven central. And horrified to see karaf trying to resolve jars from maven central at run time. What a horrible set of defaults. This behaviour should be opt-in, disabled by default, not opt-out through hard to discover and harder to verify configuration settings.
Also, Maven uses pinned versions, normally, and won't just download whatever newer minor version happens to be published when it builds, which again makes this attack quite hard to pull off.
I guess it's a case of the ease of use proving too great, so convenient in fact that we just kind of swept the implications under the rug.
I can't. It's incredibly wasteful time and resource-wise, and ties your development process to third-party providers (and your ISP), which fall over often enough in practice.
It's a good practice to have a local cache of all the third-party dependencies you use, available to both developers and CI infrastructure.
We have that, it's called Java and .NET, but apparently solved problems aren't interesting anymore.
For a distributed company with developers from all over the globe the "local" here doesn't really make much sense. But from my experience with NPM, you download packages on your developer machine once you set up a project, and then only when something really messes up node_modules, which happens once in three months, on average.
You do re-download packages for every build in CI pipeline as you build a docker image from scratch though, and that's when NPM mirror is usually set up.
Pulling packages from the internet is fine and that's how all Linux distros work but the more important thing is signature verification, imo
At my last gig (Java), developers reviewed all third-party libraries + dependencies and manually uploaded them to a private Ivy server. I don't think that could work in the Node ecosystem, where every module seems to have 100+ dependencies.
EDIT:
There's a real security vs accessibility trade-off here. You can't be a productive web developer, according to modern standards, and review every single transitive dependency that gets pulled into your application. And it's very inefficient to have individual developers at different orgs separately reviewing the same libraries over and over again.
One would naturally turn to repository administrators to enforce stricter security standards. Maybe RubyGems could review all source code for every new version of a package and build it themselves instead of accepting uploads of pre-built artifacts. But these repositories are run by smallish groups of volunteers, and they don't have the resources to conduct those kinds of reviews. And no open-source developer wants to have to go through an App Store-like review process to upload their silly McWidget library.
I try not to think about it too much and have faith in the powers that be
The web really is held together by duct tape and bubble gum.
"That said, we consider the root cause of this issue to be a design flaw (rather than a bug) in package managers that can be addressed only through reconfiguration," a Microsoft spokesperson said in the email.
No, npm has scopes for a reason, why would that not fix this issue?
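For reference, a scoped package is bound to a registry in .npmrc, so an unscoped public package can never shadow it (the scope and URL below are hypothetical):

    # .npmrc
    @myorg:registry=https://npm.corp.example.com/

With that in place, `npm install @myorg/http-client` can only resolve against the internal registry, and nobody else can publish into the @myorg scope on the public registry once the org name is claimed.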
Maybe the bug wasn't explained correctly but if it prefers public over private that seems like a bug.
OTOH, it certainly is an issue that if you forget and happen to test some code without being configured to have the private package server as your default then you'd get public repos.
Maybe instead of named packages companies should be using private URLs for packages. That way you always get what you ask for?
Artifactory apparently didn't, and served up whichever was the highest version of public vs. private. Which is stupid.
But the bottom line is that when using npm, the exact package selection policy is determined by whatever registry implementation you're talking to, and so it's the registry implementation which should prioritize private packages by default.
But that's just NPM, it's an issue in all of the mentioned package managers.
That is, given a Gemfile.lock like, e.g.

    GIT
      remote: https://github.com/thoughtbot/appraisal
      revision: 5675d17a95cfe904cc4b19dfd3f1f4c6d54d3502
      specs:
        appraisal (2.1.0)
          bundler
          rake
          thor (>= 0.14.0)

how would Bundler ever try to download the `appraisal` gem from RubyGems? The Gemfile section is more explicable. While newer Gemfiles look like this:

    source "http://our.own.gem.repo.com/the/path/to/it" do
      gem 'gemfromourrepo'
    end
    # or
    gem 'gemfromourrepo', source: "http://our.own.gem.repo.com/the/path/to/it"

older Gemfiles apparently looked like the following:

    source 'https://rubygems.org'
    source 'http://our.own.gem.repo.com/the/path/to/it'

    gem 'gemfromrubygems1'
    gem 'gemfromrubygems2'
    gem 'gemfromourrepo'

which seems obviously vulnerable to the dependency confusion issue mentioned. So is the understanding that Shopify's CI systems were running `bundle upgrade` or another non-lockfile operation? (possibly as a greenkeeper-like cron job?) Or is `--pure-lockfile` itself more subtly vulnerable?
(I also don't think it's true that the attacker has a "small window of time"—as soon as they get a single RCE, it's over, if they're running on a normal dev machine then they can daemonize into the background, add persistence, and snoop events over time. CI systems are obviously less vulnerable to this by nature.)
However, this section is concerning:
> The presence of a source block in a Gemfile also makes that source available as a possible global source for any other gems which do not specify explicit sources. Thus, when defining source blocks, it is recommended that you also ensure all other gems in the Gemfile are using explicit sources, either via source blocks or :source directives on individual gems.
Yikes! This is yet another easy footgun for people to reintroduce this issue
Nix makes it possible to query the entire build time and runtime dependency graph of a package, and because network access during build time is disabled, such a substitution attack would be harder to pull off.
How the source is downloaded is specified declaratively and can be pinned to a specific commit of a specific Git repository, for instance.
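A sketch of such a pinned source in Nix; the owner, repo, and hashes below are placeholders, not real values:

    # Hypothetical package source, pinned to an exact commit and content hash.
    src = fetchFromGitHub {
      owner  = "example-org";
      repo   = "example-lib";
      rev    = "0000000000000000000000000000000000000000";  # exact commit
      sha256 = lib.fakeSha256;  # replaced with the real fixed-output hash, checked at fetch time
    };

If either the commit or the downloaded content changes, the build fails rather than silently picking up substituted code.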
Fixed versions for as many things as you can (including OS images, apt packages, Docker images, etc) lead to changes in your CI under your control.
Sure, you have to upgrade manually or by a script. But isn't plain build stability worth it? Not even talking about security.
So you wouldn't get a random version even considered.
Version shadowing and overriding is a totally different concern of course.
Pre- and post-install scripts in NPM packages are such a terrible idea. Even when it's not malware, it's usually just a nagging donation request with a deliberate "sleep 5" to slow down your build and keep the text displayed.
pkg managers that do have that: cargo (build.rs), pip (setup.py), npm (install scripts), apt/rpm/pacman (postinstall hooks)
Maybe the only exceptions are Go and Java package managers?
The reason is simple: without them you can't properly bind to system libraries.
And even without build scripts, the supply-chain attack still works, at least against developers, as packages are not just built but also run, often without any additional sandbox (e.g. you run the tests of the library you're building, which pulled in a corrupted package).
The main problem here is not build scripts (they are still a problem, just not the main one) but that some build tools like npm weren't built with security as the priority, only convenience, and security was an afterthought. For example, npm did not (still doesn't?, idk) validate that the lockfile and the project dependencies match, so you could try to sneak in bad dependency sources.
Also, for things which are classical system package managers (i.e. not build tools) like apt/rpm/pacman, build scripts really don't matter at all. The reason is that what you produce will be placed and run on your system without sandboxing anyway, so it's a bit different from a build tool, which is often used to build binaries (installers, etc.) in one place and then distribute them to many other places.
Edit: Another attack vector is to bring in a corrupted package which then "accesses" the code and data of another package. This could use speculative pointer accesses or similar, but in languages like Java, Python, or JavaScript you can often use reflection or override standard functions to achieve this much more reliably.
Such 'nagging donation requests' were banned by npm pretty much days after they first appeared, IIRC, and npm itself is literally a tool for installing code to execute later, so there's no security issue here. If someone wanted to embed malware into a package, they wouldn't need postinstall scripts for it.
This is really a complete nothingburger.
What does "banned by npm" mean? Here's an example from the source of the latest version of nodemailer (with 1.4M weekly downloads) sleeping for 4,100 ms on every install so that it can show a "Sponsor us to remove this lag" message: https://github.com/nodemailer/nodemailer/blob/a455716a22d22f...
> and npm itself is literally a tool for installing code to execute later, so there's no security issue here. If someone wanted to embed malware into a package, they wouldn't need postinstall scripts for it.
It's fine to have a standard mechanism for postinstall steps. It should be opt-in by the end user rather than opt-out. That way people know that they're running additional code and ideally selectively pick which packages are allowed to do so. The vast majority of packages do not need it anyway as they do not have C++ bindings or need to generate data.
The defaults for NPM are such that you have to know quite a bit of how NPM works to download a package and inspect the contents without executing random code.
> This is really a complete nothingburger.
It's defense in depth. With the default being to execute remote code, a single typo could install a package that immediately runs malware.
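For npm, the opt-out exists today but isn't the default; a per-project .npmrc can flip it:

    # .npmrc -- refuse to run lifecycle (pre/post-install) scripts
    ignore-scripts=true

The same switch is available one-off as `npm install --ignore-scripts`, which lets you download and inspect a package's contents without executing anything.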
And yes, you want to sandbox the install too anyway, but it at least needs permissions enough to do its job, i.e. interact with the network somehow. (Although I’m working on a tool to make that fully deterministic so it can never exfiltrate anything.)
There’s also the possibility that there’s no “execute” step at all, like installing a dependency tree just to inspect source, or in theory being able to skip auditing unused code paths.
https://naildrivin5.com/blog/2019/07/10/the-frightening-stat...
https://techbeacon.com/security/check-your-dependencies-gith...
https://thenewstack.io/npm-password-resets-show-developers-n...
Not that I didn't expect someone to immediately take the opportunity to complain about npm, of course, despite it having nothing to do with the problem at hand... as has become tradition in tech circles.
Your code is what it depends on.
sigh... am I the only one that likes environments where you can run simple commands to install stuff and you can generally trust your package managers? All the security folks love to act dumbfounded when people trust things, but post-trust environments have terrible UX in my experience. I hate 2FA, for example, because now I have to tote my phone around at all times in order to be able to access any of my accounts. If I lose my phone or my phone is stolen while travelling, I'm hosed until I can figure out how to get back in.
> So can this blind trust be exploited by malicious actors?
Yes, it can. Trust can always be exploited by malicious actors, and no amount of software can change that. And it creates a world that sucks over time. Show me a post-trust, highly secure environment that isn't a major PITA to use. And not just for computers. I'm sure you could use social engineering to abuse trust of customer service reps (or just people in general) and do bad things, and the end result will be a world where people are afraid do any favors for other people because of the risk of getting burned by a "malicious actor".
How was that design chosen, not just once but in all 3 of those large package ecosystems? Did pypi/gems/node borrow their design from each other given their similarity in other aspects?
Are there any situations where this behavior is desired?
Does any of the other ecosystems have flaws like this (nuget, cargo..)?
I think the issue tends to be more that there's just so many packages (often nested 10+ deep) and it's best practice to keep them as up to date as possible.
When it's fairly typical for a JS project to have thousands of dependencies, there isn't really any practical way to both stay up to date and carefully review everything you pull in.
I think the only viable solution for companies taking this issue really seriously is to keep their numbers of dependencies down and avoid having significant deep/indirect dependencies.
Edit: as an example, in my company's Node stack (for 10 services) - there's >900 dependencies. In our React stack (for 2 sites), more than 1600.
Contrary to what you might think, these are actually pretty small, lightweight systems. So really whatever you might have thought was the worst-case scenario on numbers of deps, the reality is more like 10x that in the modern JS ecosystem.
In many ways, the vast number of tiny dependencies are one of the strongest points of the JS ecosystem. But it doesn't come without caveats.
And I know it's not perfect, but in Python, using Poetry means you get a poetry.lock file with package hashes built in, so that's something.
It seems to me that down through the years ease of deployment trumps security. npm, mongodb, redis, k8s.
Or maybe sysadmin has just become outdated? Maybe front of house still needs a grumpy caretaker rather than your friendly devops with a foot in both camps.
We can now even outsource our security to some impersonal third-party so they can 'not' monitor our logs.
EOG # end of grump
It's a bit of cognitive dissonance having to explain why downloading random shit from the internet during the build is a bad idea, yet here we are.
https://docs.google.com/document/d/1EW6uSZB0_D0qZuDSGuxujuVE...
So this probably wouldn't show up on the final build distributed and deployed somewhere. But it did manage to run arbitrary code on developers' machines of those companies.
It's not possible for an attacker to publish on that name in the public npm registry.
I'm sure the hooks are needed for things NPM can't do by itself, but they shouldn't run by default. That puts pressure on developers to avoid them, and puts pressure on NPM to add whatever functionality is missing from package.json in a safe way.
(and have npmjs.com search rank packages without scripts above those that do)
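For anyone who wants that behavior today: npm already lets you opt out of lifecycle scripts per project, either via `npm install --ignore-scripts` or persistently in a project-level `.npmrc`:

```
# .npmrc — skip preinstall/install/postinstall scripts of all dependencies
ignore-scripts=true
```

The trade-off is that packages which genuinely need a build step (native addons compiled via node-gyp, for example) will then need that step run explicitly.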
This doesn't mean I'm not vulnerable to dependency attacks, but it at least limits the window, because I update these dependencies very, very rarely.
[1] https://github.com/pan-net-security/artifactory-pypi-scanner
You have always been able to specify the `index-url` when installing packages using pip. This can also be added to `requirements.txt` files as well.
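For example, pinning pip to an internal index from the requirements file itself (the URL below is a placeholder for your internal index):

```
# requirements.txt — the first line controls which index pip resolves against
--index-url https://pypi.internal.example.com/simple/
requests==2.25.1
```

Note that `--index-url` replaces PyPI rather than adding to it, which is what you want here; `--extra-index-url` would reintroduce exactly the confusion this thread is about.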
A software supply chain attack is characterized by the injection of malicious code into a software package in order to compromise dependent systems further down the chain.
Backstabber’s Knife Collection: A Review of Open Source Software Supply Chain Attacks
https://link.springer.com/chapter/10.1007%2F978-3-030-52683-...
To be clear, just calling this a "supply chain attack" and omitting "software" is going to cause confusion with traditional supply chains.
The analogy is not quite apt: in a software build system you have complete visibility into the dependency tree, so this attack is less useful, whereas with hardware suppliers you are relying on the security of your vendor.
Not necessarily — plenty of software still ships with the third-party supply chain bits incorporated as binaries, including commercial software. The user is relying on the security of one or more in a chain of upstream vendors.
See Cyberpunk 2077 DLLs for instance.
https://twitter.com/CDPRED_Support/status/135660404767189811...
Cyberpunk “builds” their game with a software build system, but not all of it is them building it.
For example in the case of Facebook, it used to be that users would accept permissions without considering them, and in-turn, various apps would access their data in bad faith.
Likewise for mobile apps.
Eventually Facebook removed many of the overtly powerful permissions entirely, likewise with the mobile operating systems.
In the case of mobile, the concept of "runtime permissions" was also introduced that required explicit approval to be granted at the time of authorization.
On Android, location access now prompts the user in the notification area informing the user of an app that accessed their location.
Can some of these ideas be borrowed by the package/dependency management world? "The package you are about to install requires access to your hard drive, including the following folders: x/y/z. Allow?"
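As a thought experiment (no mainstream package manager does this today), the check behind such a prompt could be a path allowlist declared in the package's manifest, enforced by the installer or runtime. A hypothetical sketch, where the manifest paths are made up:

```python
# Hypothetical sketch: a package declares the paths it needs up front,
# and anything outside that allowlist is denied.
from pathlib import Path

def is_access_allowed(manifest_paths, requested):
    """Return True if `requested` falls under one of the declared paths."""
    req = Path(requested).resolve()
    for allowed in manifest_paths:
        allowed = Path(allowed).resolve()
        if req == allowed or allowed in req.parents:
            return True
    return False

# Paths a hypothetical "logger" package might declare:
manifest = ["/tmp/pkg-cache", "/var/log/myapp"]
print(is_access_allowed(manifest, "/var/log/myapp/out.log"))  # True
print(is_access_allowed(manifest, "/etc/passwd"))             # False
```

This is essentially the model Deno applies at the process level, pushed down to per-dependency granularity.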
The way Nix handles this is that every external resource is cached and hashed, and every reference to an external resource must have a hash integrity check. If someone swaps out a package on a web server somewhere, rebuilds keep working because they don't need to re-fetch (because the hash wasn't changed by an operator), and fresh builds fail with an error indicating the hash is invalid, which should trigger an investigation (in practice, this is exceedingly rare, and IMO always deserves attention).
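This is not Nix's actual implementation, but the core of the integrity check is small enough to sketch: hash what you fetched, compare against the pin, and refuse to proceed on mismatch.

```python
import hashlib

def fetch_checked(data: bytes, expected_sha256: str) -> bytes:
    """Return the fetched bytes only if they match the pinned hash."""
    actual = hashlib.sha256(data).hexdigest()
    if actual != expected_sha256:
        raise ValueError(f"hash mismatch: expected {expected_sha256}, got {actual}")
    return data

# The pin is recorded once, at the time the dependency is vetted.
pin = hashlib.sha256(b"release-1.0 tarball").hexdigest()
fetch_checked(b"release-1.0 tarball", pin)   # passes
# fetch_checked(b"tampered tarball", pin)    # raises ValueError
```

The key property is that the pin lives in your repo, not on the server you fetch from, so an operator swapping the artifact can't also swap the expected hash.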
I dream for when build reproducibility is considered table stakes like version control.
"Dependency Confusion: RCE via internal package name squatting " https://news.ycombinator.com/item?id=26081149
"Dependency Confusion: How I Hacked Into Apple, Microsoft and Dozens of Other Companies, The Story of a Novel Supply Chain Attack, Alex Birsan" https://medium.com/@alex.birsan/dependency-confusion-4a5d60f...
Then, build tools should be configurable such that they only pull in dependencies signed by PGP keys drawn from a whitelist.
Finally, companies need to maintain private repositories of vetted dependencies and avoid pulling from public repositories by default — and this requirement needs to be configurable from the project's build spec and captured in version control.
Adding dependencies on PGP just makes everything worse.
X.509 PKI for code signing is also terrible and very very complicated and error prone.
Also consider the community nature of development. You need to handle all sorts of painful crypto issues now.
And, of course, on production build machines, all packages are local.
This isn't just for "security" -- it's to ensure we can always build the same bits we shipped, and to avoid any surprises when something has a legitimate update that breaks something else.
The pypi maintainer is being ridiculous, it is much better to have this guy poke MSFT than have the Russians do it, he's doing them a favour.
I'll be rethinking using Artifactory in my infrastructure.
To update existing non-maintained public packages, mostly because they were on .NET Framework and a lot moved to .NET Core.
In Visual Studio you can set the priority of where package sources are checked. My own package repo has a higher priority.
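On newer NuGet (6.0 and later), package source mapping makes that priority explicit rather than best-effort: each package ID prefix is bound to exactly one source. A sketch of a NuGet.config, where the internal feed URL and the `MyCompany.*` prefix are placeholders:

```xml
<configuration>
  <packageSources>
    <clear />
    <add key="internal" value="https://nuget.internal.example.com/v3/index.json" />
    <add key="nuget.org" value="https://api.nuget.org/v3/index.json" />
  </packageSources>
  <packageSourceMapping>
    <!-- Internal IDs can only ever resolve from the internal feed -->
    <packageSource key="internal">
      <package pattern="MyCompany.*" />
    </packageSource>
    <packageSource key="nuget.org">
      <package pattern="*" />
    </packageSource>
  </packageSourceMapping>
</configuration>
```

With this in place, a squatted `MyCompany.Whatever` on nuget.org is never even consulted.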
I never thought about using it as an attack vector though.
I dunno, feels like fair game to me
Is this kind of attack possible using the NuGet package manager?
To paraphrase Family Guy: you’re making this harder than it needs to be.
However, this is still very, very dangerous because of day-to-day engineering, really. Any engineer doing a simple `npm install` can inadvertently bring in and execute malicious code on their machine. From there on out it would be somewhat trivial to gain further access to the network the code was run from.
https://pip.pypa.io/en/stable/reference/pip_hash/
https://pip.pypa.io/en/stable/reference/pip_install/#hash-ch...
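Concretely, pip's hash-checking mode means every requirement carries a pinned digest, and pip refuses to install anything that doesn't match. The digest below is a made-up placeholder, not a real hash:

```
# requirements.txt — install with: pip install --require-hashes -r requirements.txt
requests==2.25.1 \
    --hash=sha256:0123456789abcdef...  # placeholder digest, generate with pip hash
```

Once any requirement has a `--hash`, pip requires hashes for everything in the file, which is exactly the all-or-nothing behavior you want against substituted packages.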
Dev 2: try “sudo npm -g package_name”.
As if the Nix peeps are reading the code Nix is wget'ing.
This is not as amenable to CI, but that's the point.
I get that you could in principle namespace things (at least for package managers that support this) and insist on a small set of company-internal signing keys for those namespaces. But managing all that isn't easy and what about for package ecosystems that don't really have namespaces (e.g. PyPI, NuGet)?
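For npm specifically, the namespace story does exist: a scope can be pinned to a registry in `.npmrc`, so packages under that scope can never resolve from the public registry (`@mycompany` and the URL below are placeholders):

```
# .npmrc — everything under @mycompany comes only from the internal registry
@mycompany:registry=https://registry.mycompany.internal/
```

The harder problem, as noted, is the ecosystems without scopes at all, where the only names available are flat and globally contested.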