Tarballs, the ultimate container image format (opens in new tab)

(gnu.org)

342 pointsseverus_snape8y ago191 comments

191 comments

89 comments · 15 top-level

RX148y ago· 14 in thread

I really love the work the guix folk are doing. I'd love to run guixsd on my laptop if it was easy and supported to run plain upstream linux instead of linux-libre. It just seems like such a lovely easy to use project from the little time I've spent playing with it, it's actually a small shame they're part of the "unsexy" GNU project and subject to GNU politics.

nextos8y ago

I found it quite easy to switch to linux from linux-libre.

However, they package IceCat instead of Firefox, and that's a much tougher one. Note IceCat is not very well maintained.

Nonetheless, there are a few third party repos from users with non-GNU-sanctioned software. I hope it becomes a bit like Emacs, where GNU Elpa coexists in harmony with MELPA.

davexunit8y ago

I think eventually we'll have our own firefox package that sticks much closer to upstream and makes minimal branding/config changes. A lot of active community members want it.

1 more reply

RX148y ago

Well pretty much every Wifi card doesn't work in linux-libre, so that's the main thing. I'm sure I'd find a lot more that doesn't work if I tried linux-libre.

1 more reply

mikegerwitz8y ago

> Note IceCat is not very well maintained.

Its maintainer is working on upgrading to the latest ESR now. If anyone is interested in helping maintain IceCat, please e-mail maintainers@gnu.org.

matthewbauer8y ago

NixOS is a pretty good alternative. There are definitely areas where GuixSD is better than NixOS but also lots more places where NixOS is a lot better than GuixSD.

weberc28y ago

I would really like to hear from more people who've used NixOS in anger. We used the Nix package manager (for packinging our application and managing dependencies) in our organization for a while, and it seemed to create a lot of pain, so I'm wondering if we were using it poorly or if the Nix ecosystem just needs to mature.

jolmg8y ago

What GNU politics are you referring to that makes you reconsider using guixsd?

EDIT: Also what's unsexy about GNU? I'm really curious.

tremon8y ago

Its refusal to package firmware binaries, for one, even if that firmware is required to have a useful machine. I'm looking at AMD specifically here, where recent graphics cards (including APU's) don't even do text-mode without the firmware.

(edit: I understand the why of it, and even agree on principle, but it still prevents me from running linux-libre on most of my systems)

3 more replies

pecg8y ago

GNU stands for a philosophy of freedom, thus guixsd won't provide official repositories for installing proprietary software, some users don't like it, even though they might be interested in the technological approach of the system.

GNU utilities, are not only unsexy, they are bloated and messy, and prone to failure; the GNU implementations (coreutils: grep, cat, tail, etc) of standard UNIX tools are not done with simplicity in mind.

But hey, after all GNU is Not Unix. For those of us, who really appreciate the UNIX philosophy still have OpenBSD, which is the only light in a world of chaos, in my opinion.

1 more reply

jcoffland8y ago

It's not currently cool to like Richard Stallman because he has opinions that run contrary to Silicon Valley.

3 more replies

jpeg_hero8y ago

It starts with that weird ink drawing of a goat for a logo, it just screams “70’s green screen”

And this is coming from a genx open source / Linux guy. What it must look like to the current generation?!?!

1 more reply

peterwwillis8y ago

> Also what's unsexy about GNU?

"Gnu's Not Unix": A recursive acronym used as a pun about an operating system from the 1970s, existing solely as a reflection of an aging neckbearded hippie hacker's personal philosophy about software, that is pronounced "GUH-NEW".

1 more reply

rauhl8y ago

I love that they took the NixOS idea and converted it from brackets to S-expressions, but I do wish that they’d used Common Lisp instead of Scheme. Had they gone with the former, I think that we’d be one step closer to computing’s ultimate goal of a Lisp machine on every desk …

rekado8y ago

Guile Scheme is the GNU system's designated extension language. In GNU there are more applications that support Guile scripting/extensions than there are CL applications.

(I'm a Schemer and I'd love to have a Lisp machine user environment using Scheme.)

justinsaccount8y ago· 12 in thread

Articles like this are pointless. I get that guix and nix are neat, and I think that every single time something about one of them is posted, but I don't have the slightest clue how to use either one of them.

Do you want to convince people that something like guix is better than docker? Then take something that is currently distributed using docker and actually show how the guix approach is simpler.

i.e. I have a random app I recently worked on where the dockerfile was something like

  FROM python:2.7
  WORKDIR /app
  ADD requirements.txt /app
  RUN pip install -r requirements.txt
  ADD . /app

  RUN groupadd -r notifier && useradd --no-log-init -r -g notifier notifier
  USER notifier
  EXPOSE 8080/tcp
  CMD ./notify.py

How do I actually take a random application like that and build a guix package of it?

Another project I work on is built on top of zeromq, and it would be great to use something like guix to define all the libsodium+zeromq+czmq+zyre dependancies and be able to spit out an 'ultimate container image' of all of that, but all this post shows me how to do is install an existing guile package.

t0nt0n8y ago

With Guix you get full introspection of your entire package dependency graph, you can check and manipulate every aspect - and it is still simple and easy to work with. With GuixSD you get this same introspection and overview, but of your entire system. creating a container, vm or even a docker image is a simple '$ guix system <container|vm> config.scm' away. And your config.scm is as complex as you like it to.

The simplest way would be to package the app for guix and you could just run '$ guix environment <name-of-package>' and you would be dropped into an environment with all your dependencies and whatever else the application requires in your path ready for hacking, get your sources and editor and start working.

If you need a vm or similar though I'd translate your example above into a system config where:

- packages include python-2.7 and whatever is in requirements.txt (this may mean you have to package a few things, but again this is usually super easy)

- users and groups are added to the config, as they always are, no extra step necessary.

- exposing ports and networking is available as options for qemu script guix produces to launch the vm.

- CMD ./notify.py: create a "simple" service that can be autostarted by the system on boot.

- filesystem access is also handled by arguments to the qemu script.

As always though there are several paths to Rome, and these are just two of them.

Zeromq and libsodium are already packaged on guix, czmq and zyre looks like they would be simple to package, guix is really quite simple to work with, which I think is the reason so many of the users and devs are running it as our daily drivers, even though it is strictly beta (0.14. I think is the last release).

And pointless, come on - what does that even mean? Does it mean you don't value them? I was quite happy to read about a neat new thing I can use my favorite tool for.

justinsaccount8y ago

> With Guix you get full introspection of your entire package dependency graph

Yes, I know all that. It's neat. I would like to learn more about it.

> The simplest way would be to package the app for guix

I was asking how to package the app for guix, and your response is the simplest way would be to package the app for guix...

> If you need a vm or soimilar though I'd translate your example above into a system config where: - packages include python-2.7 and whatever is in requirements.txt (this may mean you have to package a few things, but again this is usually super easy) - users and groups are added to the config, as they always are, no extra step necessary. - exposing ports and networking is available as options for qemu script guix produces to launch the vm. - CMD ./notify.py: create a "simple" service that can be autostarted by the system on boot. - filesystem access is also handled by arguments to the qemu script.

Yes, I'm sure it is super easy. How do I do it?

Do you know how to use the dockerfile I posted above? You run

  docker build -t myapp .
  docker run myapp

that's super easy. 9 lines and 2 commands. You can now add docker expert to your resume.

> Zeromq and libsodium are already packaged on guix, czmq and zyre looks like they would be simple to package,

Well, I was working on a fork of things, so I would have needed to install my forks.

> guix is really quite simple to work with

I'm sure it is!

> And pointless, come on - what does that even mean? Does it mean you don't value them? I was quite happy to read about a neat new thing I can use my favorite tool for.

You are correct, I don't really value posts saying how cool and easy something is and how much better it is than other solutions, when they don't actually present a complete solution someone can actually use.

I get that it is not other peoples job to teach me how to use something like guix, but do people not understand why things like Docker won?

1 more reply

gfosco8y ago

I think you've reinforced the point they were making. It's pitched as easier, but clear examples of common usage aren't provided. You've provided a response longer than the 9 line Dockerfile, and we still don't know how to replicate it with guix.

1 more reply

tscs378y ago

To some extend I sympathize with GP because your post is exactly why I'm currently not using nix or guix.

While it's neat that I can do introspection on my package graph, I don't immediately see any benefit for me when I startup my containers.

I would love to see a full guix/nix script of what GP asked to see a comparison, I like to see hands-on stuff not theoretical.

rekado8y ago

> Do you want to convince people that something like guix is better than docker

No, we show that Guix is a tool that gives you a way to work with software environments at a higher level; but at the same time you don't have to give up on application bundles like Docker. You can simply generate Docker images or other forms of applications bundles from that higher-level representation.

You are welcome to take a look at this paper that I co-authored where we explain why we use Guix for a reproducible bioinformatics pipeline, and the rigorous, declarative functional package management approach instead of the imperative approach of Docker files:

    https://www.biorxiv.org/content/early/2018/04/21/298653

We're also providing Docker images, but we generate them from a higher-level declarative specification that ensures a high degree of bit-reproducibility.

Ruud-v-A8y ago

> it would be great to use something like guix to define all the libsodium+zeromq+czmq+zyre dependancies and be able to spit out an 'ultimate container image'

You define a package for your own project that depends on libsodium/zeromq/etc from GuixSD. Then you export your own package with 'guix pack'. For an example of what a package definition looks like, take a look in /gnu/packages in the GuixSD repository, for instance libsodium [1] or Vim [2].

I did something similar recently to build an Nginx "application bundle" [3]. It uses Nix (previously Guix, but Nix worked better for me in the end) to build a squashfs image. You can then run the binary on that filesystem with systemd-nspawn, or as a regular service by setting RootImage=. Some advantages over the Docker approach are that you can easily customise the build (e.g. changing the ./configure flags for Nginx without having to manually perform all other build steps), and bit by bit reproducibility (if you build the same commit six months from now, on a different machine, you will still get the same image out).

[1]: https://git.savannah.gnu.org/cgit/guix.git/tree/gnu/packages... [2]: https://git.savannah.gnu.org/cgit/guix.git/tree/gnu/packages... [3]: https://github.com/ruuda/miniserver#readme

t0nt0n8y ago

Packaged zyre and czmq. I'll send the patch for their inclusion, but until then here is the code: https://notabug.org/thomassgn/guixsd-configuration/src/maste...

myWindoonn8y ago

From memory, not tested, not spell-checked:

    FROM nixos/nix
    RUN nix-channel --update
    RUN nix-env -i python2.7-{twisted,treq,txgithub}
    WORKDIR /app
    ADD . /app
    EXPOSE 8080/tcp
    CMD python notify.py

The next level would be using the nixpkgs Docker builder directly: https://nixos.org/nixpkgs/manual/#sec-pkgs-dockerTools

davexunit8y ago

It's hard to give you any specific recommendations with so little context, but I will try. For starters, I should point out that you can't really compare Guix directly to Docker. Guix is a package manager, Docker isn't. The article talks about 'guix pack', which makes it possible for Guix to interoperate with non-Guix systems, and one supported system is Docker. You can deploy software with just Guix, too, either on GuixSD or a foreign distro with Guix installed.

Anyway, in your Dockerfile I see that your application uses Python and you do some package management and service management stuff that is mixed together. In Guix, these things are separated. So the first step would be to define a package for your software, and then you would deploy that package. For a real world example of a Python application, here is what the AWS CLI package looks like:

    (define-public awscli
      (package
       (name "awscli")
       (version "1.14.41")
       (source
        (origin
         (method url-fetch)
         (uri (pypi-uri name version))
         (sha256
          (base32
           "0sispclx263lybbk19zp1n9yhg8xxx4jddypzgi24vpjaqnsbwlc"))))
       (build-system python-build-system)
       (propagated-inputs
        `(("python-colorama" ,python-colorama)
          ("python-botocore" ,python-botocore)
          ("python-s3transfer" ,python-s3transfer)
          ("python-docutils" ,python-docutils)
          ("python-pyyaml" ,python-pyyaml)
          ("python-rsa" ,python-rsa)))
       (arguments
        '(#:tests? #f))
       (home-page "https://aws.amazon.com/cli/")
       (synopsis "Command line client for AWS")
       (description "AWS CLI provides a unified command line interface to the
    Amazon Web Services (AWS) API.")
       (license license:asl2.0)))

The package recipe contains all the metadata, build instructions, and dependencies. Now that you have a package, it can be built with Guix and then deployed in a variety of ways. Judging from the Dockerfile, your software is some daemon that listens on port 8080, so:

* You can install the software directly using 'guix package -i your-package-name' and run the notify.py program. Good for trying things out.

* If you are deploying to the Guix system distribution, you could write a service definition so that you can manage the daemon via the init system. The service would take care of creating the notifier user and group, starting the service on boot, etc.

* You could use 'guix pack --format=docker' to export an image suitable for running with 'docker load'

* You could use a different 'guix pack' format (and maybe make it relocatable) for running on some other non-Guix system

I should also add that I don't think the work is fully done yet on handling the entirety of Docker use-cases. It's a work in progress. I can think of a number of things that I want to add to Guix to make this workflow better that I haven't had a chance to hack on yet.

justinsaccount8y ago

That's interesting, but where does it specify which python version is used and the version of all the dependencies?

If the versions are specified in the 'python-botocore' type definitions, how do you install more than one version of a library?

Does guix only track the latest version of dependencies or can you request any version of something?

1 more reply

jancsika8y ago

@justinsaccount: can you give t0nt0n a clue what the contents of requirements.txt are, plus anything else needed to create a complete port to guix?

Then it would be great to see t0nt0n or someone else who knows guix do the port so we can fully compare these two approaches.

justinsaccount8y ago

it's not really application specific, just stuff like

  requests==2.18.4

the actual packages generally aren't important.

The cases were that would become interesting are where they require some C library dependencies first, like libpq-dev. In those cases something like guix/nix would be nice because it could be used to pull in the specific external dependencies as well.

theamk8y ago· 9 in thread

I like simple archives, but can it be not tarballs? For the kinds of application described in this article, tarballs are pretty bad:

Either you extract it from scratch every time you run an app, taking a long time penalty...

... or you extract once to cache, and assume that nothing changes the cache. This is pretty bad from both operational and security perspective:

- backups have to walk through tens of thousands of files, thus becoming much slower

- a damaged disk or a malicious actor can change one file in the cache, making damage which is very hard to detect.

There are plenty of mountable container formats -- ISO, squashfs, even zip files -- which all provide much faster initial access, and much better security/reliability guarantees, especially with things like dm-verity.

2ion8y ago

Yes, most tarballs do not support random access (there are some metadata extensions that allow this). This makes large tarballs annoying to use on systems with slow disk I/O (even a hard disk may be too slow (to the degree of being annoying to work with)). This is by far my biggest gripe with the format. Certainly, smaller tarballs are a very handy format as long as you stay inside the Unixy world of computing – and as long as you keep looking out for the various incompatibilities between the different tar implementations.

textmode8y ago

"... there are some metadata extensions that allow this)."

Where to find these extensions? Are they portable between Linux and BSD?

The 1998 dict project included a utility called "dictzip" for random access to the contents of gzip compressed files.

Dumb question: Is it possible to create a utility or even a hack that performs "random access" into tar archives?

Example use case: the user only wants to untar a small number of selected files from a large tarball such as a source tree. The user has tried both the "-T filelist" option and using memory file systems instead of hard disk drives.

3 more replies

pif8y ago

> This makes large tarballs annoying to use on systems with slow disk I/O

Funny how tar was originally developed for tape drives!

1 more reply

RX148y ago

I'm pretty sure the article implies this is for user-facing applications where the user would manually extract it once to a place of their choosing then run it from there. I think you're missing the point of the whole article.

theamk8y ago

But why would you want to extract if you can mount the file directly? For simple archives, extracting is fine. But for larger archives (like a compiler -- 1000 files or more), loop-mounting is much better than extracting:

- Does not slow down your backup by adding thousands of files

- No need to wait for initial file extraction

- You can quickly and easily verify integrity of the whole archive

And if you are using fuse, it does not require any special privileges either!

justincormack8y ago

Mountable formats have the security issue that the kernel is not that great at protecting against hostile images in mount. On disk format fuzzing has not been common and there are definitely bugs.

theamk8y ago

This is solved very nicely with fuse mounts (and fuse is surprisingly performant on the modern multicore systems)

solatic8y ago

Do you not still pay a significant performance penalty by reverifying the container upon each application load? Especially considering that, if the container is signed, you need to verify the signature itself before trusting the container, and full signature verification - including checking whether the signature has been revoked - involves expensive network calls?

If your operational and security model really frowns on trusting your extraction cache, then perhaps a different workflow is more appropriate - download the container, verify the container, extract, bake the OS plus extracted apps into an image, sign the image, verify the image upon each boot and mount apps read-only. Then you don't need to re-verify anything upon each launch, instead trusting that your image creation process is routinely updating and re-verifying the software in your current images.

theamk8y ago

Verification of a single file is much faster than walking entire tree, especially when there are lots of small files, for example when there is a compiler or large python project inside.

A simple example: my /usr/include is 33037 files, 356M uncompressed. On SSD with cold cache, it takes 6.7 sec to read each file individually, or 0.7 sec to checksum a single 356M archive, a 10x difference.

The difference in the backup time is even more dramatic -- the backup program has to call stat() either 33K times, or just once, a 3,330,000% improvement! The other filesystem tools (What takes all the space? What has changed in the last X hours? Please sync this directory elsewhere.) will have similarly high speed improvements.

So if I had a choice, I would love my dev environment to come in mountable form. Similarly, I don't understand why container runtimes (like docker) don't use loop mounts more -- it seems like many advantages and very few disadvantages.

As for signature verification -- I don't care about 3rd party signature and revocation, I just want to ensure that I am running the same code every time. There are many ways one can damage extraction cache, especially if it is owned by the same user as application (like the topicstarter post described) -- sysadmin errors (`sudo find / -name app-old -delete`), application errors (create cache file in bin dir), disk errors (silent corruption), transfer errors (one file did not get transferred to a new computer). Loop mounting makes disk errors easier to detect, and eliminates other classes of error entirely.

AdmiralAsshat8y ago· 9 in thread

Do tarballs still have that unfixed/unfixable bug where the extracted files will have the permissions of the person who untarr'd the file?

tannhaeuser8y ago

It's a feature: you must be running tar as root or equivalently to restore to uids/gids other than the effective process uid. Otherwise you could happily overwrite any host system file including parts of the O/S. It's a restriction shared by all archivers.

vinceguidry8y ago

You can use the --same-owner flag and extract the tarball as root in order to preserve ownership. The -p flag ensures that the permissions umask will match the archive's as well.

Grue38y ago

I like the amazing "feature" where the act of extracting a tar file into a directory can change permissions on this directory. You have to pass --no-overwrite-dir flag to disable this.

master-litty8y ago

That seems sensible to me, what else would you expect?

AdmiralAsshat8y ago

I expect that they should preserve the ownership and permissions of the original file if I tell it to.

2 more replies

delinka8y ago

I think you mean owner rather than permissions. In most cases, you want to maintain permissions/file mode (read/write/execute) but not the original owner.

AdmiralAsshat8y ago

Except it doesn't do either. I've had files that had 666 user:group permissions/owner that I tar into a backup file, then untar, only to find that the file is now 664 with me:me ownership.

It's brought production to a halt on more than one occasion if I try to "restore" from a backup by extracting the files and moving into production without manually fixing them first.

3 more replies

cyphar8y ago

That's a detail of the extraction tool. In umoci (which extracts tar archives as part of an OCI image)[1] you can remap the users or even extract as yourself and then add an xattr which represents the original owner in the archive (which is then read back when creating a new tar archive from the delta of the rootfs).

[1]: https://github.com/openSUSE/umoci

oconnor6638y ago

Or where the paths in the tarball can start with `..`?

stuaxo8y ago· 8 in thread

Please, can we move to an archive format that isn't so sprawlingly massive ?

peterwwillis8y ago

Or one that can list/extract files without reading the entire archive, or one that can use binary diffs, or one that supports encryption, or one that supports long file names, or one that isn't hamstrung by different implementations of different standards on different platforms, or one that doesn't use 512 byte blocks, or one that is actually usable on modern operating systems, ....

sitkack8y ago

That time is now [0].

[0] https://www.sqlite.org/sar/doc/trunk/README.md

2 more replies

rekado8y ago

Sure. `guix pack` is a neat hack and it isn't tied to any particular archive format.

When using plain Guix you won't need to use any archive format at all; packages simply end up each in their own unique directory and can be used just like that. You can easily spawn a container environment where only the relevant directories under `/gnu/store` are mounted.

It's on my list to add more target formats for `guix pack`, but generally I'd recommend using Guix directly to reap all benefits. `guix pack` is only really useful for cases where you cannot use Guix on the target system.

rekado8y ago

A squashfs backend for `guix pack` exists now:

http://lists.gnu.org/archive/html/guix-patches/2018-05/msg00...

cpburns20098y ago

Are you complaining about the complexity of file format itself? My understanding is it's pretty simple: a linked list of headers with the contents of each file after each header. Or are you complaining that it doesn't do compression itself like ZIPs do?

GrayShade8y ago

One think I dislike about tarballs is the lack of random access support.

1 more reply

TylerE8y ago

He's complaining about the simplicity. He wants something with less suck.

spookthesunset8y ago

What do you mean by massive?

cyphar8y ago· 4 in thread

This is remarkably off-beat for the GNU project. Tar files are far from the most ideal tool for container images because they are sequential archives and thus extraction cannot be done using any parallelism (without adding an index and being in a seekable medium, see the rest of this comment). I should really write a blog post about this.

Another problem is that there is no way to just get the latest entry in a multi-layered image without scanning every layer sequentially (this can be made faster with a top-level index but I don't think anyone has implemented this yet -- I am working on it for umoci but nobody else will probably use it even if I implement it). This means you have to extract all of the archives.

Yet another problem is that if you have a layer which just includes a metadata change (like the mode of a file), then you have to include a full copy of the file into the archive (same goes for a single bit change in the file contents -- even if the file is 10GB in size). This balloons up the archive size needlessly due to restrictions in the tar format (no way of representing a metadata entry in a standard-complying way), and increases the effect of the previous problem I mentioned.

And all of the above ignores the fact that tar archives are not actually standardised (you have at least 3 "extension" formats -- GNU, PAX, and libarchive), and different implementations produce vastly different archive outputs and structures (causing problems with making them content-addressable). To be fair, this is a fairly solved problem at this point (though sparse archives are sort of unsolved) but it requires storing the metadata of the archive structure in addition to the archive.

Despite all of this Docker and OCI (and AppC) all use tar archives, so this isn't really a revolutionary blog post (it's sort of what everyone does, but nobody is really happy about it). In the OCI we are working on switching to a format that solves the above problems by having a history for each file (so the layering is implemented in the archiving layer rather than on top) and having an index where we store all of the files in the content-addressable storage layer. I believe we also will implement content-based-chunking for deduplication to allow us to handle minor changes in files without blowing up image sizes. These are things you cannot do in tar archives and are fundamentally limited.

I appreciate that tar is a very good tool (and we shouldn't reinvent good tools), but not wanting to improve the state-of-the-art over literal tape archives seems a bit too nostalgic to me. Especially when there are clear problems with the current format, with obvious ways of improving them.

JdeBP8y ago

It sounds like you are progressing along the same road that led Rahul Dhesi to invent the ZOO file format.

cyphar8y ago

As far as I can tell the only thing ZOO has over tar archives is having a history of each file (using the VMS concepts of file versions) -- meaning that it probably still has some of the problems I outlined above. While that is useful, it is still not as good as it could be. Also, you don't really want file versions with container images, you want to have conceptual "layers" (which would be sort of like having versioned files but it's more like snapshot IDs -- or like ZFS's birth-times).

1 more reply

Rapzid8y ago

It sounds there is a lot of use cases that overlap with services provided by general file systems. I'd be curious to here your thoughts on that.

cyphar8y ago

You're right that general-purpose filesystems have solved quite a few of the indexing problems already, unfortunately there are a few things stopping general filesystems on a loopback device from being practical (or safe, or the best idea):

* The container (file) for the filesystem must necessarily be larger than the metadata+data for the filesystem because filesystems really don't like almost-full disks. And unless I'm mistaken sparse files are not usable for loopback devices (so you can't hack your way out of it).

* Most filesystems don't have a snapshot-style history so you would have to pick a specific filesystem from that list (otherwise you'd be forced to make CoW duplicates of the filesystem to create snapshots -- which is interestingly how Docker does layered storage with devicemapper) which has slightly similar problems to layered tar archives.

* The kernel's filesystem parsers are not really considered to be safe against an adversary, from what I've been told by filesystem engineers. So mounting random loopback files with filesystems on them might end badly.

* There is no way of looking at the archive using a userspace tool (without mounting), unless you re-implement the kernel parser for the filesystem. To be fair, this is true for any format, but filesystems are far more complicated and harder-to-parse than most other formats.

* Having a single blob as your entire image history and so on will mean that you can no longer have content-addressable storage for your images without adding something like content-defined chunking on top (which is then another layer of storage on top of your underlying storage).

* Using a Linux filesystem would mean you couldn't use the filesystem on different operating systems very easily. Even if it was compatible on whatever other filesystem you are using, userspace has no way of being sure there isn't a bug in either side's parser -- and what happens if one side changes the on-disk format. If the protocol is in userspace then it can be handled there.

* Most filesystems don't let you remap users, so if you wanted to run a container in a user namespace you would need to either rewrite the filesystem structure or mount the filesystem and copy it to another filesystem. To be fair, tar archives require you to do the mapping on extraction which is a similar problem, but far less complicated.

* Everyone would be opinionated about what filesystem to use, which means that you'd have to deal with every filesystem people throw at you, making it harder to be interoperable and adding choices where they aren't necessary. It should be up to the user what filesystem they use for storage, not the image distributor.

Now, this hasn't stopped people from trying to use this. Singularity's internal format is a loopback file with a filesystem inside, and they have privileged suid binaries that mount it. And it does have genuine performance benefits, and if you don't want things like content-addressability then it can work for some usecases.

nerpderp838y ago· 4 in thread

Tarballs don't have a TOC and can't easily index into individual entities.

One could create a utility to make tarballs with a TOC and the ability to index while still remaining compatible with tar and gzip. Pigz is one step in the direction.

matthewbauer8y ago

I think AppImage does this with SquashFS images currently.

tejtm8y ago

to list what is in a tar ball `tar -vtf tarball.tar`

to extract a particular entity 'tar -vxf tarball.tar path_in tarball_to_entity`

edit: good points on it not being efficient for large archives, just demonstrating it is possible.

nerpderp838y ago

A tar is a linked list of file paths and contents, it cannot be indexed to a particular file. A compressed tar has to first be decompressed and then the chain of links traversed. Accessing a file in compressed tar is o(n) with where the file is placed within the compressed tar stream.

It isn't that it is possible, it is that is horribly inefficient.

Zips on other hand unify storage and compression such that one has random access to particular file, hence most modern file formats are zips with xml or json inside.

discreditable8y ago

The problem is that to know what files are in the tarball you have to read the whole thing. If the archive is large that's a lot of reading just to get a file list.

geofft8y ago· 3 in thread

I realize the title is just a hook for the (very cool!) work in the article, but a couple things that tarballs don't/can't specify that Docker containers can:

- environment variables like locales. If your software expects to run with English sorting rules and UTF-8 character decoding, it shouldn't run with ASCII-value sorting and reject input bytes over 127.

- Entrypoints. If your application expects all commands to run within a wrapper, you can't enforce that from a tarball.

You can make conventions for both of these like "if /etc/default/locales exists, parse it for environment variables" and "if /entrypoint is executable, prepend it to all command lines", but then you have a convention on top of tarballs. (Which, to be fair, might be easier than OCI—I have no particular love for the OCI format—but the problem is harder than just "here are a bunch of files.")

catern8y ago

It's not necessarily a good thing for the container to be able to specify locale. Locale should be picked up from the surrounding system; it's just that unfortunately the surrounding system is usually not configured correctly.

And entrypoints/wrappers are definitely possible from a tarball. Just wrap the executables in bin/, replacing them with shell script (or whatever) wrappers pointing to the real executables. That's what Nix/Guix do for languages like Python which require dependencies to be provided by environment variables (as they don't have a way to "close over" the locations of their dependencies).

oconnore8y ago

> Locale should be picked up from the surrounding system; it's just that unfortunately the surrounding system is usually not configured correctly.

And around and around we go

sleepybrett8y ago

Also docker containers are just tarballs of tarballs (one per layer)

digi_owl8y ago· 3 in thread

A quick FYI, Gobolinux operates much the same way.

1. Binary packages are simply compressed archives (tarballs) of the relevant branch in the /Programs tree.

2. branches do not have to actually live inside the /Programs tree. There are tools available to move the branches in and out of /Programs.

All this because Gobolinux leverages symbolic links as much as possible.

matthewbauer8y ago

Gobolinux sort of does this. The main difference is GoboLinux uses “version numbers” while Nix & Guix use hashes. It makes a lot of difference for more complicated stuff.

digi_owl8y ago

True.

I suspect there are ways to introduce hashes to Gobo, if one were so inclined. But so far nobody has.

TylerE8y ago

Nitpick: A vanilla tarball is a concatenation, not a compression.

paulfitz8y ago· 2 in thread

How about sqlar as a container format? https://sqlite.org/sqlar.html A regular sqlite database file, with anything you like in it. Mountable as a file system with sqlarfs. Written by the sqlite guy.

infogulch8y ago

Interesting I didn't know this existed. Is there a way to layer sqlar like docker images? (Besides just tarring them up I guess.)

I wonder if this could be implemented with the WAL/journal system. Make each layer immutably append to the previous layers to make restarting at any layer trivial. I'm not sure if there's such a way to hook into the journal directly like that though.

zaarn8y ago

Should be doable with overlayfs (or similar) or alternatively some extensions to sqlar.

sqlar is after all only a table definition, if you don't need FUSE access or are willing to write your own, SQLite3 can go a long way of providing arbitrary neat functionality.

chx8y ago· 2 in thread

For relocatable ELF binaries, there's also https://github.com/intoli/exodus

foob8y ago

The packages that Exodus produces are actually quite similar to those introduced in this announcement. Both tools generate simple tarballs that can be extracted anywhere to relocate programs along with their dependencies, and both tools bootstrap the program execution using small statically compiled launchers written in C. They contrast guix pack against Snap, Flatpak, and Docker, but Exodus would probably make a more apt comparison in many ways.

civodul8y ago

Interesting! The trick that Exodus uses (invoking ld-linux.so directly) is very smart. Perhaps an option to add to 'guix pack' in the future. :-)

AnIdiotOnTheNet8y ago· 2 in thread

Reinventing Application Bundles only 30 years after NeXTStep, poorly.

matthewbauer8y ago

Why poorly? I don’t see anything worse about this.

AnIdiotOnTheNet8y ago

Really? Seems like an awful lot of tooling for what is essentially "Put binary and dependencies in folder. Move folder around at will" in sane environments.

1 more reply

tannhaeuser8y ago· 1 in thread

That article made me warm up to guix and its practical side. Are guix app bundles just bare tar archives with /usr/local prefix semantics or do they need special metadata files? How are compiled binaries with hardcoded and/or autoconf'd prefixes handled for relocation (I guess using Linux namespaces somehow)?

rekado8y ago

In Guix every package ends up in its own directory, which may have references to other packages in /gnu/store. An application bundle is really just a package closure, i.e. the directory for the package and all directories it references, recursively. One way to bundle up things is with `tar` (the default of `guix pack`), but Guix also supports other bundling targets, such as Docker. No special metadata files are required.

Relocation currently requires a little C wrapper, which uses Linux namespaces, as the blog post indicates.

If you want something more advanced, such as a bundle that includes an init and services, it's best to use `guix system`, which builds VM images among others.

kuwze8y ago· 1 in thread

Does anyone know how this would apply, for example, to sharing a Guile 2.2 application with Debian/Red Hat based distributions? I want to use Guile 2.2 for development, but I am worried because it was only recently was released for major distros (at least with Ubuntu I know it was released with 18.04) and it doesn't seem to support the creation of executables.

sitkack8y ago

See this older discussion on statically linking guile [0], one should be able to bake your source into a C program that statically links Guile 2.2 to create a self contained executable. If that is too cumbersome, I would use a container.

[0] https://lists.gnu.org/archive/html/bug-guile/2013-03/msg0000...

matthewbauer8y ago

Nix has a very similar tool called nix-bundle[1].

[1]: https://github.com/matthewbauer/nix-bundle

j / k navigate · click thread line to collapse

191 comments

89 comments · 15 top-level

RX148y ago· 14 in thread

nextos8y ago

I found it quite easy to switch to linux from linux-libre.

However, they package IceCat instead of Firefox, and that's a much tougher one. Note IceCat is not very well maintained.

Nonetheless, there are a few third party repos from users with non-GNU-sanctioned software. I hope it becomes a bit like Emacs, where GNU Elpa coexists in harmony with MELPA.

davexunit8y ago

I think eventually we'll have our own firefox package that sticks much closer to upstream and makes minimal branding/config changes. A lot of active community members want it.

1 more reply

RX148y ago

Well pretty much every Wifi card doesn't work in linux-libre, so that's the main thing. I'm sure I'd find a lot more that doesn't work if I tried linux-libre.

1 more reply

mikegerwitz8y ago

> Note IceCat is not very well maintained.

Its maintainer is working on upgrading to the latest ESR now. If anyone is interested in helping maintain IceCat, please e-mail maintainers@gnu.org.

matthewbauer8y ago

NixOS is a pretty good alternative. There are definitely areas where GuixSD is better than NixOS but also lots more places where NixOS is a lot better than GuixSD.

weberc28y ago

jolmg8y ago

What GNU politics are you referring to that makes you reconsider using guixsd?

EDIT: Also what's unsexy about GNU? I'm really curious.

tremon8y ago

(edit: I understand the why of it, and even agree on principle, but it still prevents me from running linux-libre on most of my systems)

3 more replies

pecg8y ago

But hey, after all GNU is Not Unix. For those of us, who really appreciate the UNIX philosophy still have OpenBSD, which is the only light in a world of chaos, in my opinion.

1 more reply

jcoffland8y ago

It's not currently cool to like Richard Stallman because he has opinions that run contrary to Silicon Valley.

3 more replies

jpeg_hero8y ago

It starts with that weird ink drawing of a goat for a logo, it just screams “70’s green screen”

And this is coming from a genx open source / Linux guy. What it must look like to the current generation?!?!

1 more reply

peterwwillis8y ago

> Also what's unsexy about GNU?

1 more reply

rauhl8y ago

rekado8y ago

Guile Scheme is the GNU system's designated extension language. In GNU there are more applications that support Guile scripting/extensions than there are CL applications.

(I'm a Schemer and I'd love to have a Lisp machine user environment using Scheme.)

justinsaccount8y ago· 12 in thread

Do you want to convince people that something like guix is better than docker? Then take something that is currently distributed using docker and actually show how the guix approach is simpler.

i.e. I have a random app I recently worked on where the dockerfile was something like

  FROM python:2.7
  WORKDIR /app
  ADD requirements.txt /app
  RUN pip install -r requirements.txt
  ADD . /app

  RUN groupadd -r notifier && useradd --no-log-init -r -g notifier notifier
  USER notifier
  EXPOSE 8080/tcp
  CMD ./notify.py