Djbsort: A new software library for sorting arrays of integers (opens in new tab)

(sorting.cr.yp.to)

284 pointsxmmrm7y ago149 comments

149 comments

70 comments · 17 top-level

nrclark7y ago· 21 in thread

My job involves a lot of packaging/cross-compilation, and djb's libraries always seem consistently hostile to the lowly packaging engineer.

Would it really be all that much work to package in autotools or CMake? Why do I need his special-snowflake build system with its hard-coded assumptions about system paths?

I know that the cult of djb will downvote this into oblivion, but seriously, what is the rationale for a build flow that involves:

  1. Downloading a text file
  2. Parsing it to get a URL
  3. Making a new user
  4. Symlinking the user's HOME directory into the build tree
  5. Run an extremely non-standard build system.
  6. Hope you're not trying to cross-compile, because good luck with that.
  7. Guess at where the files came out (hint: it probably won't be in FHS locations)
  8. Copy the output yourself once you find it.

Would it really be that much harder to give us a git repo and a ./configure or a CMakeLists.txt?

Panino7y ago

I've been using DJB's stuff for ~20 years and I don't like his recent build systems either. Without defending it, I'd just like to offer a theory on why he packages things the way he does.

Back in the day, his build processes were atypical but still much more "normal" than now. He also released his software without licenses. During this time of heavy software development, DJB was concerned about people screwing around with the internals of his software, hurting security or reliability, and then blaming the software rather than the modifications. So his build systems were, I think, designed to lead to the result he wanted, where software behaved and was administered in the same way on various platforms.

In the mid-2000s he re-licensed existing software as public domain and began publishing all new code as public domain as well. Around this time, build systems began to get more wonky. Also, his public work that garnered the most attention shifted away from software toward cryptography. He did some attacks on existing crypto and authored Curve25519, Salsa20, etc.

He's also been putting out a tremendous volume of work in multiple categories. I bet he'd rather work on this stuff than on user-friendly build systems.

So given these points, I think the explanation for his unfriendly build systems is

  A) a very strong aversion to people modifying his stuff where he gets blamed if modifications do harm;
  B) a shift away from software development, where people generally care more about build systems anyway;
  C) a huge level of productivity which results in very atypical Pareto principle choices/tradeoffs;
  D) his public-domain licensing.

Given these 4 points, I think DJB is unwilling to take time away from crypto and other work and put it into build systems he doesn't enjoy that will take more time upfront and more babysitting down the road. Fewer people will package it, but the software is public domain and competent people can just add their own build system. This squares with his available time and interests.

So, I don't like his build systems either, but I think I understand where they come from.

nrclark7y ago

Appreciate the response! Interesting to read.

The Libsodium guys wound up doing exactly what you're suggesting, because of the impossibility of trying to package NaCl as-is.

So they essentially had to re-do/duplicate all of his build work just to make it packageable. And now there are two competing implementations (three if you count tweetnacl). And a bit of a confusing mess in the documentation department.

It seems a little selfish for djb to take the "works on my machine" attitude, because it means that a bunch of other people have to reverse-engineer all that stuff just to make it portable.

But I guess OTOH it's his software, so it's his choice. And maybe he doesn't care whether people choose to use his stuff or not, as long as he's publishing.

1 more reply

JdeBP7y ago

The underpinnings of that argument are flawed. You're accepting the premise that there were relatively "normal" build processes "back in the day" and "abnormal" ones now. This is not in fact the truth of the matter at all.

M. Bernstein's own build system was redo. But he never actually published a redo tool. Other people did that. There are traces of a precursor to redo in one of his packages. But even they were not the actual build system for it as released to the public. Again, it was other people who took them and fleshed them out into a working build system. The slashpackage system, similarly, only existed in (another) one of his packages. And again, it was other people who extended it to other packages.

* http://jdebp.info./FGA/introduction-to-redo.html

* http://jdebp.info./FGA/slashpackage.html

* http://jdebp.info./Softwares/djbwares/

The reality is that the build system evident in djbsort is not a sudden inconsistency. The various packages over the years are all inconsistent. One can in fact derive from them a timeline both of development of ways of building packages and evolution of the various "DJB libraries". But they are all snapshots of these processes at different points. They aren't a coherent single system across multiple packages. I, Paul Jarc, and others had to make that part. (-:

So do not deduce that there's been a change. Deduce, rather, that build systems have always been a lower priority and an unfinished loose end. As such, this is nothing to do with the announcement at Sage Days 6 and the like, nor to do with people "screwing around" as you put it. Indeed, the clear motivations of redo expressed in M. Bernstein's own writings on the subject have nothing to do with either copyright or preventing modifications to packages, and everything to do with problems of make and indeed autotools. Observe the motivations of "honest" dependencies from the build instruction files themselves, from the command-line tools and their options, and from things like the absence of header files.

mcguire7y ago

"He's also been putting out a tremendous volume of work in multiple categories. I bet he'd rather work on this stuff than on user-friendly build systems."

I don't know about that part...a halfway normal build process would be considerably less work to set up than this monstrosity.

mwcampbell7y ago

There's also a world beyond Unix. Prof. Bernstein may not be interested in targeting Windows. But for library developers in general, I'd recommend CMake. Microsoft has adopted CMake as its standard cross-platform C/C++ build system, and has built a cross-platform library package manager [1] on top of it.

[1]: https://github.com/Microsoft/vcpkg

rurban7y ago

As you see in the code this is linux only: CLOCK_MONOTONIC, HW_CPUSPEED, -soname, -rpath, linux/perf_events.h,

Even on linux it would fail: sort.c:386:21: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'djbsort_int32' that is compiled without support for 'sse4.2'

-march=native or -msse4.2 would be needed.

tbirdz7y ago

I really disagree with you on autotools, that has been a big problem for me when cross-compiling. Configure often detects features by compiling a binary and executing it, which can be a problem when you're cross compiling, since the generated target can't be executed on your native cpu. Most software build systems don't even take cross building into account at all, since they just copied someone elses' broken autoconf and automake source that only works on native builds so you have to work around a lot of issues.

Even "good" software packages written by people who understand autotools like glib have issues. Just look at the official glib cross compiling instructions: https://developer.gnome.org/glib/stable/glib-cross-compiling... The "correct" way to cross compile it is to figure out all the autoconfigured values yourself, and write them into the configure cache.

Personally, I have had the least trouble with plain Makefile based packages. Yes you often have to reach into the Makefile's guts and modify it to make a working cross build, and you'll have to set the rpath manually, and install the files yourself, but at least it's easier than having to modify an autotools based build.

Autotools "just works", and when it doesn't it's very hard to fix. Makefiles don't "just work", but it's much easier to fix them yourself.

altfredd7y ago

"Configure often detects features by compiling a binary and executing it" — I am not sure, where this impression comes from. Autotools as whole have excellent cross-compilation support. Autoconf has feature-detection routines, checking for presence of headers, exported symbols and pkg-config files. None of those trigger "execute the binary" part when cross-compiling. In addition, custom-written host-side checks can easily be skipped when cross-compiling (either by code of check itself or by user via environent-variable overrides). Do you you know of any build system, that handles this better?

Cmake and several other buildsystems either don't support cross-compilation at all (because their primary audience is Windows) or use pkg-config only. Few others are nightmarish parody of autotools with much worse support. Most don't have ounce of autotools features.

I know many projects, that offer horrible autotools "support": for example, glib2 autotools scripts can't be cross-compiled to Android without ample application of hacks. But those issues are caused by incompetence and lack of testing, not some innate fault of Autotools. When such projects migrate to something else, their cross-compilation process becomes WORSE.

1 more reply

R0b0t17y ago

To clarify for anyone who doubts autotools has problems - it is true that the documentation mentions cross compiling options, but it appears that the vast majority of tests in use today do not actually use these values (host/target arch, etc).

Some users recommend compiling with qemu-user to avoid fixing the tests, but besides being very slow, that methodology will still generate garbage. What is the point of running feature detection code on a desktop that will be used to configure code compiled to run on e.g. a small ARM system?

The fix is to explicitly set configuration parameters or, even better, to detect features at runtime. However if you try to contact any given project using autotools to help solve these configuration problems you will likely receive "that's not our problem" and no help from the maintainers.

Autotools is in practice less work for me than alternatives, at least while using a FOSS operating system. On Windows CMake is very well developed but handling dependencies is always a chore.

ur-whale7y ago

1. You're assuming the software is intended to be deployed

2. Writing anything sane with autotools or CMakes is an exercise in frustration that borders on torture. Not to mention how either of these systems chafes on one's sense of aesthetics.

3. I've installed a few of djb's package over the years. While indeed non standard, unlike either of the solutions you've mentioned, djb's stuff: a) works b) is usually very simple to understand and change when needed (unlike the autotools steaming monstrosity which in 20 years of using opensource software I've never dared to touch)

nrclark7y ago

Why would somebody release software that they didn't want to see used?

To your other point, Autotools is little crusty for sure. It's Yet Another Language to learn (and so is CMake), but it's really not as bad as people make it out to be. There are many thousands of examples in the wild, good and bad.

Yes, Autoconf tests for a bunch of stupid things that don't matter. But so what? You can add the tests you _do_ care about, and take advantage of the directory control and compiler control aspects.

If you've never actually worked with Autotools (which you just said you haven't), I'd encourage you to actually try it out. It's not as bad as you think.

4 more replies

geocar7y ago

> Why do I need ... assumptions about system paths?

> what is the rationale for a build flow that involves:

It solves problems.

https://cr.yp.to/compatibility.html

https://cr.yp.to/slashpackage/studies.html

https://cr.yp.to/slashpackage/finding.html

https://cr.yp.to/slashpackage/sharability.html

> Would it really be that much harder to give us a git repo and a ./configure or a CMakeLists.txt?

Yes.

nrclark7y ago

FWIW, I read through each of your links. They don't address cross-compilation at all, or the needs of software packagers.

I can 100% promise you that somebody packaging this library for any Linux distro (or for a Yocto/Buildroot system) would grind their teeth in frustration at everything in the those links.

The solution to having inconsistent packaging paths isn't to introduce _yet another_ packaging path system, but this one specific to djb stuff. It's to use a standard build system with overrideable paths, and not to assume the author knows better than the packager.

1 more reply

mcguire7y ago

Reading those complaints cold, I would suspect DJB to be between 16 and 22 years old; old enough to have seen more than his own system, but young enough to be absolutely certain his off-the-cuff solution is better than any of those other idiots can come up with. It helps to be the smartest person in any room, I guess.

"When oaf was integrated into ``the system,'' it was moved to /usr/bin/oaf, and its files moved to /usr/share/oaf, because FHS doesn't let ``the system'' touch /usr/local. What happened in this case is that oaf didn't find files that were put into /usr/local/share/oaf by another package."

Once upon a time, I worked as a sysadmin for a major university computer science department. We supported a half-dozen or so different architectures. We built a long list of software, including all of the X and Gnu programs, so that our users had the same environment essentially everywhere. Oh, and we all built it to live under /lusr, because our convention started before "/usr/local" was a thing. (The weird looks when you started babbling about "slash-loser" were just a fringe benefit.)

imake was a giant pain in the rear. Autotools were also a giant pain in the rear. But everything else, except in very, very simple cases, was much worse. DJB's software would come under the header of "sorry, not supportable without unreasonable effort."

1 more reply

JdeBP7y ago

Bear in mind that steps 1, 3, 4, 6, and 7 are not related to slashpackage. nrclark has got step #2 wrong, as all that it yields is the current source package version number not a URL and there's no real parsing going on (except for the shell stripping the trailing newline); and xyr step #8 is xyr own invention and not actually in the build instructions as given.

That leaves step #5. What that does is both fairly obvious and documented in the instructions. It tries out a whole bunch of compilers, compiler options, and implementations in turn to see which results in the best code. Ironically, it is probably the sort of thing that autotools might be persuaded to do. But in practice no-one would, or it would be buried under reams of copied-and-pasted cargo cult tests for things that the actual source at hand cannot in fact do without in the first place.

This is of course a long-recognized fly in any let's-just-use-autotools ointment, discussed over many years. (-:

* https://news.ycombinator.com/item?id=5276876

* https://queue.acm.org/detail.cfm?id=2349257

* http://freshmeat.sourceforge.net/articles/stop-the-autoconf-...

* https://news.ycombinator.com/item?id=1499738

* https://news.ycombinator.com/item?id=4407486

dilap7y ago

as a user, i'd be pretty fucking happy if ./configure disappeared off the face of the earth.

jedisct17y ago

As a user, I'm super happy with `./configure`, that works the same way across all applications using it, and has been for the past 20 years.

You don't want to look at the actual script, but as a user, it is very convenient. I don't have to read installation guides to know how to change compilation options, compilers, cross-compile, installation paths, run the test suite, etc.

1 more reply

ben5097y ago

Do you know of any guidelines for making libraries easy to package? Or examples of projects that do it right?

wahern7y ago

The most important thing is to put all the compiler flags behind standard variables. This is 80% of what autoconf/automake does. The standard variables for C programs are

  CPPFLAGS = # e.g. -I and -D preprocess options
  CFLAGS = # various ad hoc compiler switches
  LDFLAGS = # e.g. -L options
  LDLIBS = # e.g. -l options
  SOFLAGS = # -shared or -bundle -undefined dynamic_lookup

(NOTE: SOFLAGS is less standard--i.e. not in GNU style guide or GNU make manual--but perfectly fits the pattern and extremely useful to specify independently. You'll often find a variable like this used, if not the same name. IME SOFLAGS it the most common name.)

Library dependencies should follow a similar pattern:

  LIBFOO_CPPFLAGS = # e.g. -I/usr/local/foo/include
  LIBFOO_CFLAGS =
  LIBFOO_LDFLAGS = # e.g. -L/usr/local/foo/lib
  LIBFOO_LDLIBS = -lfoo

To make it easier to control feature-specific options without overriding all other defaults, you should do likewise for feature groups:

  PIC_CFLAGS = -fPIC
  ...
  PTHREAD_LDLIBS = -lpthread
  ...
  WARN_CFLAGS = -Wall -Wextra

If you want to get really fancy, you can use top-level MY* variables so people can easily append flags without overriding existing variables.

Then for most basic target recipes you can pre-compose all the flags to make your rules simpler:

  ALL_CPPFLAGS = $(PIC_CPPFLAGS) $(LIBFOO_CPPFLAGS) $(WARN_CPPFLAGS) $(CPPFLAGS) $(MYCPPFLAGS)
  ALL_CFLAGS = $(PIC_CFLAGS) $(LIBFOO_CFLAGS) $(WARN_CFLAGS) $(CFLAGS) $(MYCFLAGS)
  ALL_LDFLAGS = $(PIC_LDFLAGS) $(LIBFOO_LDFLAGS) $(WARN_LDFLAGS) $(LDFLAGS) $(MYLDFLAGS)
  ALL_LDLIBS = $(MYLDLIBS) $(LDLIBS) $(LIBFOO_LDLIBS) $(WARN_LDLIBS) $(PTHREAD_LDLIBS)

  .o.c:
    $(CC) $(ALL_CFLAGS) $(ALL_CPPFLAGS) -c -o $@ $<

Because of the ubiquity of GCC and clang, and because of the commonality across most modern Unix platforms (e.g. SOFLAGS will be "-shared" on all BSD- and SysV-derived systems, including Linux, and "-bundle -undefined dynamic_lookup" on macOS), the above will allow people to build your software without modifying the actual Makefile in almost all scenarios. It will certainly make packagers' jobs easier.

For installation you should always use the standard prefixes

  DESTDIR = # temporary root to assist in packaging 
  prefix = /usr/local
  bindir = $(prefix)/bin
  datadir = $(prefix)/share
  includedir = $(prefix)/include
  libdir = $(prefix)/lib
  sysconfdir = $(prefix)/etc

and install targets should be written like

  $(DESTDIR)$(libdir)/libacme.so: libacme.so
    ...

This may seem like alot of boilerplate, but the important thing is that it's very simple and time-tested. Autogenerating just this basic boilerplate is overkill and adds unnecessary indirection and complexity. You can get fancier than this scheme, but avoid the temptation as much as you can unless it's an obvious win--remedies a regular, concrete issue that is actually resulting in significant wasted effort. For complex builds (e.g. multiple targets where flags might not have the same semantics across target rules), consider splitting the build into multiple Makefiles. You can create a convenience wrapper, but at least the sub Makefiles will be reusable by packagers or contributors without having to modify anything directly. It's not ideal but there is no ideal solution. (I once heavily advocated non-recursive make, but these days I'm much more averse to complexity.)

There's no easy answer for Windows, particularly because NMake uses significantly different syntax for some constructs. For really basic builds the above might actually work as long as you stick to NMake-compatible syntax. With the advent of Windows Subsystem for Linux (WSL) you could consider requiring WSL for the build (so you have access to POSIX-compatible shell and Make) and perhaps providing a wrapper Makefile with more convenient defaults for compiling with Visual Studio (which is very much still a command-line driven compiler). Requiring clang-cl might be even easier.

For simple stuff IME autotools and CMake are just overkill.[1] For complex stuff they often won't be sufficient, anyhow.[2] So the important thing is to layer your build so the maximum amount of build code can be reused by somebody with unforeseen needs; and reused in a way that is obvious and transparent so their tweaks can be simple. Stick to convention as much as possible. Even if you don't use autotools, autotools has created de facto conventions wrt build variables that will be intuitive to all packagers. Please don't invent your own naming scheme! (It's annoying when someone uses 'PREFIX' instead of 'prefix'. There's logic to the autotools scheme. Uppercase variable are for [build] flags used in rule recipes, lowercase for [installation] paths used to define rule targets and dependencies. Exceptions like DESTDIR are well-known and few.)

Always be mindful of the needs of people doing embedded work--cross-compiling, complex build flags, etc. But don't bend over backwards. All you really need to do is remove obstructions and the necessity to patch. They're used to writing build wrappers, but patching is a maintenance nightmare and impedes tracking upstream.

Ideally you would support out-of-tree builds, but in reality few packages support this properly (even autotools or CMake projects), and if you're not regularly doing out-of-tree builds your build probably won't work out-of-the-box anyhow. That's why I didn't mention the use of macros like VPATH, top_srcdir, etc, in your rules. Do that at your discretion, but beware of descending down the rabbit hole.

The same thing applies to automatic feature detection--unless you're regularly building on esoteric targets your feature detection is unlikely to work out of the box, and bit rot happens quickly as platforms evolve.[3] Don't put too much effort into this; just make sure it's easy to manually toggle feature availability and that the defaults work on the most common platforms and environments. Maximize benefits, minimize costs, but don't think you can automate everything away.

All those fancy builds people tout are ridiculously brittle in reality. People just don't realize it because they're only using the builds on what are in fact relatively homogenous environments. Most of the effort is wasted, and build complexity (including bespoke build environments) is a big reason why people don't reuse software as much as they could.

[1] Indeed, for simple builds it's not much of a burden to provide and maintain a POSIX build and a Visual Studio project file build.

[2] For CMake this is especially true. For feature detection CMake often requires actually updating your version of CMake. That's a gigantic PITA. Anyhow, most feature detection can be done inline using the preprocessor, or by simply defaulting to being available (this is 2018, not 1988--there's much more commonality today.) And as long as your feature-detection macros are controllable from build-time flags, people can easily work around issues by explicitly toggling availability. Thus, the autotools convention of using "#ifdef FEATURE_FOO" for feature flags is bad advice. Always use boolean flags like "#if FEATURE_FOO", and only define FEATURE_FOO in config.h-like files if not already defined. That allows people to do, e.g., "MYCPPFLAGS=-DFEATURE_FOO=0" to override feature detection as necessary.

[3] Autoconf's feature checking (check the feature, not the version) sounds robust in practice, but you'd be surprised how many ways it can break, or how difficult writing a properly robust feature check is. Plus, especially on Linux people expect a feature to work at run-time if the kernel supports it, regardless of whether glibc had a wrapper at build time or has it at run time. These issues are where you'll spend much of your time, and this can't be automated. You can't foresee all these issues! Focus on making it easy to supplement or work around your build!

numbsafari7y ago

Is this software that he presently wants to see disseminated randomly, or is it provided here for academic purposes at this time?

I don't know if it's his intention to have this library become part of a bunch of linux or BSD distros or not. But, it seems kind of presumptuous on your part to assume that he does.

lorenzhs7y ago

I'm a PhD student in theoretical computer science and have an academic interest in this sorter. But his build system is just too much of a pain to deal with voluntarily.

Not to mention that his cpu cycle counting doesn't actually seem to compile on Linux, which doesn't have sysctlbyname? Or maybe build errors are expected behaviour?

yoklov7y ago· 7 in thread

The problem with AVX2 accelerated code (and much of AVX) is that unless you have a lot of it to run you end up with a substantial speed hit that comes from the cost of switching to a different power bin (which often takes 1 or 2ms!) and then running at a lower clock speed.

This often still ends up being an improvement over scalar code (at the cost of higher power usage), but for occasional workloads that don't need to do multiple milliseconds of AVX instructions you tend to have better results from 4-wide vectors, which don't have this cost.

jwr7y ago

See also the excellent writeup at: https://gist.github.com/rygorous/32bc3ea8301dba09358fd2c64e0...

stochastic_monk7y ago

So in a sense, using wider vectors should be done at times when you’re certain to need heavy lifting, much like a GPU, but at a smaller timescale. I should update my vectorization library so that it doesn’t simply use the widest possible.

rphlx7y ago

FWIW Haswell - on which this was benchmarked - does not downclock on AVX2; it just boosts vcore and thermal throttles if the cooling solution can't keep the die below ~100C (which of course, the crappy TIM plus bundled desktop heatsink have essentially zero chance of doing with many extended workloads). To preclude thermal throttling the auto-downclock-on-AVX2 was added later, I think for Skylake.

puzzle7y ago

You also have penalties from context switches, although you can reduce their impact by performing them in a lazy fashion. AVX512 is even worse, of course.

rphlx7y ago

Transferring SIMD/FPU regs to/from memory is actually fast - a modest fraction of the typical context switch cost - on modern x86_64 thanks to XSAVE.

jeffreyrogers7y ago

You need a context switch to use AVX?

1 more reply

yoklov7y ago

That's true, these are always costly though.

robin_reala7y ago· 7 in thread

Why might the installation instructions require the creation of a new user specific to the sorting program? Purely for security of the normal user given that the installation is using a wget / shell script process? https://sorting.cr.yp.to/install.html

VMG7y ago

I thought this guy was joking

> djb presents djbsort, a constant-time djbsolution to a djbproblem you probably don't have. To install, just unpack djbsort.djb and run the ./djb script. Make sure you have djbtools installed first

https://twitter.com/bascule/status/1016904259848192000

jwilk7y ago

This is bizarre. Why not just give users link to the source?

https://sorting.cr.yp.to/djbsort-20180710.tar.gz

And it's certainly not about good security practices:

* The page teaches users to paste stuff copied from web into a terminal, whereas many terminals are still vulnerable to this: https://thejh.net/misc/website-terminal-copy-paste

* The page teaches users to use su to lower privileges, whereas many (most?) su implementations are vulnerable to tty hijacking.

textmode7y ago

"The page teaches users to use su to lower privileges ..."

In the example, he could have used his own utilities for dropping privileges (setuidgid, envuidgid from daemontools).

If I am not mistaken, busybox includes their own copies of setuidgid and envuidgid, meaning it is found in myriad Linux distributions. I believe OpenBSD has their own program for dropping privileges. Maybe there are others on other OS.

Instead he picked a ubiquitous choice for the example, su.

It is interesting to see someone express disdain for the version.txt idea. I had the opposite reaction. To me, it is beautiful in its simplicity.

As a user I like the idea of accessing a tiny text file, version.txt, similar to robots.txt, etc., that contains only a version number and letting the user insert the number into an otherwise stable URL.

This is currently how it works for libpqcrypto.

https://libpqcrypto.org/install.html

I would actually be pleased to see this become a "standard" way of keeping audiences up to date on what software versions exist.

By simplifying "updates" in this way, any user can visit the version.txt page or write scripts that retrieve version.txt to check for updates, in the same way any user can visit/retrieve robots.txt to check for crawl delay times, etc.

It is not necessary to "copy and paste" from web pages. Save the "installation" page containing the stable URL as text, open it in an editor, insert the desired version number into the stable URL.

Save the file. Repeat when version number changes, appending to the file.

I like to keep a small text file containing URLs to all versions so I can easily retrieve them again at any time.

lifthrasiir7y ago

I wonder if it is an instinctive reaction to the common complaints (no, relatively, easier packaging or installation routes) to djb softwares. It was previously seen in TweetNaCl [1] which is arguably an irritated response to libsodium [2] which in turn wraps djb's NaCl library [3] in the way djb considers suboptimal [4]. djb is known to be not very interested in the modern packaging systems or practices, so it might well be his way to protest them.

[1] https://tweetnacl.cr.yp.to/

[2] https://download.libsodium.org/doc/

[3] https://nacl.cr.yp.to/

[4] Page 3 in https://tweetnacl.cr.yp.to/tweetnacl-20140917.pdf

F307y ago

Yes, presumably to run the build as a user which is as unprivileged as possible. Which is a reasonable idea, though it might seem paranoid in today's `curl | sudo sh` world.

hawkice7y ago

I actually really like the authenticity and humility of DJB including that in the instructions. I think it's likely many people trust his code (and he's certainly written a lot of extremely security sensitive stuff), but of course it's a much better practice to not trust him quite so much.

2 more replies

pjc507y ago

Yes, I think so; djb is not the sort of person to reccomend you run arbitary shell scripts from the internet as root.

Ono-Sendai7y ago· 5 in thread

The median sort time for sorting 1048576 elements wth djbsort is 61467822 cycles: https://sorting.cr.yp.to/speed.html

On a say 3.6 Ghz processor that would be around 17ms. So the number of elements sorted per second would be around 61.4 M elements/s.

My parallel radix sort can sort floats (a little harder than integers) at around 165 M elements/s: http://forwardscattering.org/post/34

A serial radix sort should still be similar or faster to djbsort.

SiempreViernes7y ago

Do you also sort in constant time at fixed array size, like djbsort claims it does?

For the 1024 array their cycle count quartiles differ from the median at the 3 promille level, but I don't know if this counts as "constant" for timing attack purposes.

Ono-Sendai7y ago

Hmm, interesting question. Radix sorts, by their nature, execute the same code regardless of the contents of their elements. ( I think, I haven't thought about this before). However you will see some variation in execution time based on varying memory access patterns.

1 more reply

herf7y ago

My single-threaded floating point sort is also twice as fast as djbsort:

http://stereopsis.com/radix.html

This one uses 11-bit radix to save memory bandwidth. Without all the floating-point stuff it would be faster.

AstralStorm7y ago

The point of djbsort is to be constant time while not being super slow.

Of course a fast directly implemented radix sort will be faster, especially if you sprinkle SIMD on top.

herf7y ago

Actually, there is only an advantage to radix after 100,000 elements - djb is much faster for small arrays. Also, while radix is cpu constant time, you'd need to do some kind of secure shuffle in advance to make it truly constant-time (considering memory locality timing).

pushcx7y ago· 4 in thread

The library doesn't have a license on the site or in the tarball. From djb's previous writing and software, he probably intends it to be license-free software, which is an uncommon situation worth investigating before use: https://en.wikipedia.org/wiki/License-free_software

(Trying to describe this neutrally because I've seen enough bickering about it over the last ~20 years and don't have strong feelings about it.)

Tepix7y ago

Three of the source code files contain a notice that they are in the public domain:

* cpucycles/mips/cpucycles.c

* cpucycles/cortex_vct/cpucycles.c

* cpucycles/cortex/cpucycles.c

Regarding the missing license:

I guess if you download the software from his website you are not allowed to distribute it yourself. Is that correct?

masklinn7y ago

> Three of the source code files contain a notice that they are in the public domain:

Making it legally dodgy to dangerous in mainland europe, either way certainly not reliably licensed.

1 more reply

textmode7y ago

https://twitter.com/hashbreaker/status/1016951373005455360

JdeBP7y ago

You are over a decade behind the times and not fully aware of said previous writing. See what M. Bernstein said on 2007-11-11 at Sage Days 6. Hint: You should have read that Wikipedia article. (-:

MrBuddyCasino7y ago· 3 in thread

The 2.5 cycles/byte compared to the 32 cycles/byte that Intel managed pulled off seems like an improbably large improvement over the current state of the art? Is this real?

Cyphase7y ago

The cycles/byte for all the sizes listed in the table on the Speed page[0], courtesy of copy-paste and a one-liner in the Python REPL:

  size      cycles/byte (based on median)
  1         6.0
  2         3.375
  4         11.625
  8         9.625
  16        7.984375
  32        7.515625
  64        6.0390625
  128       4.31640625
  256       2.4873046875
  512       2.29443359375
  1024      2.5048828125
  2048      2.77893066406
  4096      3.0791015625
  8192      3.89566040039
  16384     5.23690795898
  32768     6.35472106934
  65536     7.55914306641
  131072    9.23189163208
  262144    10.789557457
  524288    12.4885950089
  1048576   14.6550707817

[0] https://sorting.cr.yp.to/speed.html

Ono-Sendai7y ago

Well, that's for a very small and particular size. (1024 elements) Intel's library might have more overhead at that size.

bdukic7y ago

My guess is that the "catch" comes in the form of the list of limitations.

carapace7y ago· 2 in thread

(Kind of a tangent, but if you're into sorting check out:

"Generic top-down discrimination for sorting and partitioning in linear time"

https://www.cambridge.org/core/journals/journal-of-functiona...

Abstract: "We introduce the notion of discrimination as a generalization of both sorting and partitioning, and show that discriminators (discrimination functions) can be defined generically, by structural recursion on representations of ordering and equivalence relations. Discriminators improve the asymptotic performance of generic comparison-based sorting and partitioning, and can be implemented not to expose more information than the underlying ordering, respectively equivalence relation. For a large class of order and equivalence representations, including all standard orders for regular recursive first-order types, the discriminators execute in the worst-case linear time. The generic discriminators can be coded compactly using list comprehensions, with order and equivalence representations specified using Generalized Algebraic Data Types. We give some examples of the uses of discriminators, including the most-significant digit lexicographic sorting, type isomorphism with an associative-commutative operator, and database joins. Source code of discriminators and their applications in Haskell is included. We argue that built-in primitive types, notably pointers (references), should come with efficient discriminators, not just equality tests, since they facilitate the construction of discriminators for abstract types that are both highly efficient and representation-independent.")

KirinDave7y ago

I've tried so many times to frontpage that, but I've failed. Maybe we should have another go?

Nothing in djbsort's approach is inapplicable to another sorting algorithm, so maybe we can hope for better primitive support for discrimination sort implementations (or at least american flag sort implementations). I seem to recall reading that discrimination sorts are inherently content-independent.

carapace7y ago

It was probably you I heard about it from! Submission upvoted. ;-)

1 more reply

bluetech7y ago· 1 in thread

I liked this bit, using the fastest compiler for each primitive:

> ./do tries a list of compilers in compilers/c, keeping the fastest working implementation of each primitive. Before running ./do you can edit compilers/c to adjust compiler options or to try additional compilers.

rphlx7y ago

It is sadly necessary; 30%+ performance regressions from, say, gcc 4 to gcc 6 are not uncommon w/ vector intrinsics.

southern_cross7y ago· 1 in thread

I haven't read the algorithm here yet nor all of the comments, so this may have already been covered, but in many cases when it comes to sorting integers and such you don't really need to sort them at all - you just need to count them.

ur-whale7y ago

Yeah, except that when the integers are very large, that tends to fail. Also, as pointed out by many others, the point of this work is to sort in constant time to avoid side-channel attacks. I doubt the histogram sort (which I think you're referring to) has this property.

jonlandrum7y ago· 1 in thread

Am I the only one who thought the name had something to do with Djibouti?

eesmith7y ago

Probably one of the few. On HN, a search for "djb" results in about 60 submissions where "djb" is in the submission title. There are about 6 for Djibouti.

Also, the airport code "DJB" is for Sultan Thaha Airport. The Djibouti–Ambouli International Airport code is JIB.

eggie7y ago· 1 in thread

It's nice when things fit in RAM. But very often they don't. When you want to sort arbitrary-sized binary records on disk look no further than bsort: https://github.com/pelotoncycle/bsort

whazor7y ago

When sorting on disk, there are actually limitations in sorting linear time. Has to do with being able to read blocks in memory and write them back to disk. As everything is in blocks, you are limited to O(n/B log(n/B)) with B being the amount of items per block. For more speed, a merge sort that keeps in mind the block size works quite well. Search for external sorting.

lixtra7y ago

The main feature of the algorithm is to sort fast in constant time (for fixed n) for cryptographic purposes [0].

[0] https://ntruprime.cr.yp.to/ntruprime-20170816.pdf P. 48

JdeBP7y ago

> Other modern Linux/BSD/UNIX systems should work with minor adjustments to the instructions.

I can report that I got it to build and run on slightly out of date FreeBSD by deleting all of the -m32 variants, and deleting all of the -march=haswell variants. I haven't looked into whether this is down to the version of GCC that comes in ports and the version of Clang that comes in base, or something else. No other changes were needed to the build process, though.

    JdeBP /package/prog/djbsort % /tmp/djbsort/command/int32-speed
    int32 implementation int32/portable4
    int32 version -
    int32 compiler clang -fPIC -Wall -O2 -fomit-frame-pointer -fwrapv
    int32 1 72 72 72
    ...
    int32 1048576 1979077401 1979993070 1983745962

amorousf00p7y ago

Create a user and env to run a one-off build + application. DJB cracks me up. He may have the right thing in mind but this type of prophylactic approach is no longer proof against anything.

atesti7y ago

Direct download link (if you don't want to run the script)

https://sorting.cr.yp.to/djbsort-20180710.tar.gz

rurban7y ago

beware, Linux only. Needs the usual BSD/macOS patches for HW_CPUSPEED and CLOCK_MONOTONIC and do away with for linux/perf_event.h.

Unfortunately I have no idea how he deals with patches, I don't think he does.

ibuildoss7y ago

That's pretty cool, are there any bindings for e.g. Go out there?

1 more reply

j / k navigate · click thread line to collapse

149 comments

70 comments · 17 top-level

nrclark7y ago· 21 in thread

My job involves a lot of packaging/cross-compilation, and djb's libraries always seem consistently hostile to the lowly packaging engineer.

Would it really be all that much work to package in autotools or CMake? Why do I need his special-snowflake build system with its hard-coded assumptions about system paths?

I know that the cult of djb will downvote this into oblivion, but seriously, what is the rationale for a build flow that involves:

  1. Downloading a text file
  2. Parsing it to get a URL
  3. Making a new user
  4. Symlinking the user's HOME directory into the build tree
  5. Run an extremely non-standard build system.
  6. Hope you're not trying to cross-compile, because good luck with that.
  7. Guess at where the files came out (hint: it probably won't be in FHS locations)
  8. Copy the output yourself once you find it.

Would it really be that much harder to give us a git repo and a ./configure or a CMakeLists.txt?

Panino7y ago

I've been using DJB's stuff for ~20 years and I don't like his recent build systems either. Without defending it, I'd just like to offer a theory on why he packages things the way he does.

He's also been putting out a tremendous volume of work in multiple categories. I bet he'd rather work on this stuff than on user-friendly build systems.

So given these points, I think the explanation for his unfriendly build systems is

  A) a very strong aversion to people modifying his stuff where he gets blamed if modifications do harm;
  B) a shift away from software development, where people generally care more about build systems anyway;
  C) a huge level of productivity which results in very atypical Pareto principle choices/tradeoffs;
  D) his public-domain licensing.

So, I don't like his build systems either, but I think I understand where they come from.

nrclark7y ago

Appreciate the response! Interesting to read.

The Libsodium guys wound up doing exactly what you're suggesting, because of the impossibility of trying to package NaCl as-is.

It seems a little selfish for djb to take the "works on my machine" attitude, because it means that a bunch of other people have to reverse-engineer all that stuff just to make it portable.

But I guess OTOH it's his software, so it's his choice. And maybe he doesn't care whether people choose to use his stuff or not, as long as he's publishing.

1 more reply

JdeBP7y ago

* http://jdebp.info./FGA/introduction-to-redo.html

* http://jdebp.info./FGA/slashpackage.html

* http://jdebp.info./Softwares/djbwares/

mcguire7y ago

"He's also been putting out a tremendous volume of work in multiple categories. I bet he'd rather work on this stuff than on user-friendly build systems."

I don't know about that part...a halfway normal build process would be considerably less work to set up than this monstrosity.

mwcampbell7y ago

[1]: https://github.com/Microsoft/vcpkg

rurban7y ago

As you see in the code this is linux only: CLOCK_MONOTONIC, HW_CPUSPEED, -soname, -rpath, linux/perf_events.h,

-march=native or -msse4.2 would be needed.

tbirdz7y ago

Autotools "just works", and when it doesn't it's very hard to fix. Makefiles don't "just work", but it's much easier to fix them yourself.

altfredd7y ago

1 more reply

R0b0t17y ago

Autotools is in practice less work for me than alternatives, at least while using a FOSS operating system. On Windows CMake is very well developed but handling dependencies is always a chore.

ur-whale7y ago

1. You're assuming the software is intended to be deployed

2. Writing anything sane with autotools or CMakes is an exercise in frustration that borders on torture. Not to mention how either of these systems chafes on one's sense of aesthetics.

nrclark7y ago

Why would somebody release software that they didn't want to see used?

Yes, Autoconf tests for a bunch of stupid things that don't matter. But so what? You can add the tests you _do_ care about, and take advantage of the directory control and compiler control aspects.

If you've never actually worked with Autotools (which you just said you haven't), I'd encourage you to actually try it out. It's not as bad as you think.

4 more replies

geocar7y ago

> Why do I need ... assumptions about system paths?

> what is the rationale for a build flow that involves:

It solves problems.

https://cr.yp.to/compatibility.html

https://cr.yp.to/slashpackage/studies.html

https://cr.yp.to/slashpackage/finding.html

https://cr.yp.to/slashpackage/sharability.html

> Would it really be that much harder to give us a git repo and a ./configure or a CMakeLists.txt?

Yes.

nrclark7y ago

FWIW, I read through each of your links. They don't address cross-compilation at all, or the needs of software packagers.

I can 100% promise you that somebody packaging this library for any Linux distro (or for a Yocto/Buildroot system) would grind their teeth in frustration at everything in the those links.

1 more reply

mcguire7y ago

1 more reply

JdeBP7y ago

This is of course a long-recognized fly in any let's-just-use-autotools ointment, discussed over many years. (-:

* https://news.ycombinator.com/item?id=5276876

* https://queue.acm.org/detail.cfm?id=2349257

* http://freshmeat.sourceforge.net/articles/stop-the-autoconf-...

* https://news.ycombinator.com/item?id=1499738

* https://news.ycombinator.com/item?id=4407486

dilap7y ago

as a user, i'd be pretty fucking happy if ./configure disappeared off the face of the earth.

jedisct17y ago

As a user, I'm super happy with `./configure`, that works the same way across all applications using it, and has been for the past 20 years.

1 more reply

ben5097y ago

Do you know of any guidelines for making libraries easy to package? Or examples of projects that do it right?

wahern7y ago

The most important thing is to put all the compiler flags behind standard variables. This is 80% of what autoconf/automake does. The standard variables for C programs are

  CPPFLAGS = # e.g. -I and -D preprocess options
  CFLAGS = # various ad hoc compiler switches
  LDFLAGS = # e.g. -L options
  LDLIBS = # e.g. -l options
  SOFLAGS = # -shared or -bundle -undefined dynamic_lookup

Library dependencies should follow a similar pattern:

  LIBFOO_CPPFLAGS = # e.g. -I/usr/local/foo/include
  LIBFOO_CFLAGS =
  LIBFOO_LDFLAGS = # e.g. -L/usr/local/foo/lib
  LIBFOO_LDLIBS = -lfoo

To make it easier to control feature-specific options without overriding all other defaults, you should do likewise for feature groups:

  PIC_CFLAGS = -fPIC
  ...
  PTHREAD_LDLIBS = -lpthread
  ...
  WARN_CFLAGS = -Wall -Wextra

If you want to get really fancy, you can use top-level MY* variables so people can easily append flags without overriding existing variables.

Then for most basic target recipes you can pre-compose all the flags to make your rules simpler:

  ALL_CPPFLAGS = $(PIC_CPPFLAGS) $(LIBFOO_CPPFLAGS) $(WARN_CPPFLAGS) $(CPPFLAGS) $(MYCPPFLAGS)
  ALL_CFLAGS = $(PIC_CFLAGS) $(LIBFOO_CFLAGS) $(WARN_CFLAGS) $(CFLAGS) $(MYCFLAGS)
  ALL_LDFLAGS = $(PIC_LDFLAGS) $(LIBFOO_LDFLAGS) $(WARN_LDFLAGS) $(LDFLAGS) $(MYLDFLAGS)
  ALL_LDLIBS = $(MYLDLIBS) $(LDLIBS) $(LIBFOO_LDLIBS) $(WARN_LDLIBS) $(PTHREAD_LDLIBS)

  .o.c:
    $(CC) $(ALL_CFLAGS) $(ALL_CPPFLAGS) -c -o $@ $<

For installation you should always use the standard prefixes

  DESTDIR = # temporary root to assist in packaging 
  prefix = /usr/local
  bindir = $(prefix)/bin
  datadir = $(prefix)/share
  includedir = $(prefix)/include
  libdir = $(prefix)/lib
  sysconfdir = $(prefix)/etc

and install targets should be written like

  $(DESTDIR)$(libdir)/libacme.so: libacme.so
    ...

[1] Indeed, for simple builds it's not much of a burden to provide and maintain a POSIX build and a Visual Studio project file build.

numbsafari7y ago

Is this software that he presently wants to see disseminated randomly, or is it provided here for academic purposes at this time?

I don't know if it's his intention to have this library become part of a bunch of linux or BSD distros or not. But, it seems kind of presumptuous on your part to assume that he does.

lorenzhs7y ago

I'm a PhD student in theoretical computer science and have an academic interest in this sorter. But his build system is just too much of a pain to deal with voluntarily.

Not to mention that his cpu cycle counting doesn't actually seem to compile on Linux, which doesn't have sysctlbyname? Or maybe build errors are expected behaviour?

yoklov7y ago· 7 in thread

jwr7y ago

See also the excellent writeup at: https://gist.github.com/rygorous/32bc3ea8301dba09358fd2c64e0...

stochastic_monk7y ago

rphlx7y ago

puzzle7y ago

You also have penalties from context switches, although you can reduce their impact by performing them in a lazy fashion. AVX512 is even worse, of course.

rphlx7y ago

Transferring SIMD/FPU regs to/from memory is actually fast - a modest fraction of the typical context switch cost - on modern x86_64 thanks to XSAVE.

jeffreyrogers7y ago

You need a context switch to use AVX?

1 more reply

yoklov7y ago

That's true, these are always costly though.

robin_reala7y ago· 7 in thread

VMG7y ago

I thought this guy was joking

> djb presents djbsort, a constant-time djbsolution to a djbproblem you probably don't have. To install, just unpack djbsort.djb and run the ./djb script. Make sure you have djbtools installed first

https://twitter.com/bascule/status/1016904259848192000

jwilk7y ago

This is bizarre. Why not just give users link to the source?

https://sorting.cr.yp.to/djbsort-20180710.tar.gz

And it's certainly not about good security practices:

* The page teaches users to paste stuff copied from web into a terminal, whereas many terminals are still vulnerable to this: https://thejh.net/misc/website-terminal-copy-paste

* The page teaches users to use su to lower privileges, whereas many (most?) su implementations are vulnerable to tty hijacking.

textmode7y ago

"The page teaches users to use su to lower privileges ..."

In the example, he could have used his own utilities for dropping privileges (setuidgid, envuidgid from daemontools).

Instead he picked a ubiquitous choice for the example, su.

It is interesting to see someone express disdain for the version.txt idea. I had the opposite reaction. To me, it is beautiful in its simplicity.

This is currently how it works for libpqcrypto.

https://libpqcrypto.org/install.html

I would actually be pleased to see this become a "standard" way of keeping audiences up to date on what software versions exist.

It is not necessary to "copy and paste" from web pages. Save the "installation" page containing the stable URL as text, open it in an editor, insert the desired version number into the stable URL.

Save the file. Repeat when version number changes, appending to the file.

I like to keep a small text file containing URLs to all versions so I can easily retrieve them again at any time.

lifthrasiir7y ago

[1] https://tweetnacl.cr.yp.to/

[2] https://download.libsodium.org/doc/

[3] https://nacl.cr.yp.to/

[4] Page 3 in https://tweetnacl.cr.yp.to/tweetnacl-20140917.pdf

F307y ago

Yes, presumably to run the build as a user which is as unprivileged as possible. Which is a reasonable idea, though it might seem paranoid in today's `curl | sudo sh` world.

hawkice7y ago

2 more replies

pjc507y ago

Yes, I think so; djb is not the sort of person to reccomend you run arbitary shell scripts from the internet as root.

Ono-Sendai7y ago· 5 in thread

The median sort time for sorting 1048576 elements wth djbsort is 61467822 cycles: https://sorting.cr.yp.to/speed.html

On a say 3.6 Ghz processor that would be around 17ms. So the number of elements sorted per second would be around 61.4 M elements/s.

My parallel radix sort can sort floats (a little harder than integers) at around 165 M elements/s: http://forwardscattering.org/post/34

A serial radix sort should still be similar or faster to djbsort.

SiempreViernes7y ago

Do you also sort in constant time at fixed array size, like djbsort claims it does?

For the 1024 array their cycle count quartiles differ from the median at the 3 promille level, but I don't know if this counts as "constant" for timing attack purposes.

Ono-Sendai7y ago

1 more reply

herf7y ago

My single-threaded floating point sort is also twice as fast as djbsort:

http://stereopsis.com/radix.html

This one uses 11-bit radix to save memory bandwidth. Without all the floating-point stuff it would be faster.

AstralStorm7y ago

The point of djbsort is to be constant time while not being super slow.

Of course a fast directly implemented radix sort will be faster, especially if you sprinkle SIMD on top.

herf7y ago

pushcx7y ago· 4 in thread

(Trying to describe this neutrally because I've seen enough bickering about it over the last ~20 years and don't have strong feelings about it.)

Tepix7y ago

Three of the source code files contain a notice that they are in the public domain:

* cpucycles/mips/cpucycles.c

* cpucycles/cortex_vct/cpucycles.c

* cpucycles/cortex/cpucycles.c

Regarding the missing license:

I guess if you download the software from his website you are not allowed to distribute it yourself. Is that correct?

masklinn7y ago

> Three of the source code files contain a notice that they are in the public domain:

Making it legally dodgy to dangerous in mainland europe, either way certainly not reliably licensed.

1 more reply

textmode7y ago

https://twitter.com/hashbreaker/status/1016951373005455360

JdeBP7y ago

You are over a decade behind the times and not fully aware of said previous writing. See what M. Bernstein said on 2007-11-11 at Sage Days 6. Hint: You should have read that Wikipedia article. (-:

MrBuddyCasino7y ago· 3 in thread

The 2.5 cycles/byte compared to the 32 cycles/byte that Intel managed pulled off seems like an improbably large improvement over the current state of the art? Is this real?

Cyphase7y ago

The cycles/byte for all the sizes listed in the table on the Speed page[0], courtesy of copy-paste and a one-liner in the Python REPL:

  size      cycles/byte (based on median)
  1         6.0
  2         3.375
  4         11.625
  8         9.625
  16        7.984375
  32        7.515625
  64        6.0390625
  128       4.31640625
  256       2.4873046875
  512       2.29443359375
  1024      2.5048828125
  2048      2.77893066406
  4096      3.0791015625
  8192      3.89566040039
  16384     5.23690795898
  32768     6.35472106934
  65536     7.55914306641
  131072    9.23189163208
  262144    10.789557457
  524288    12.4885950089
  1048576   14.6550707817

[0] https://sorting.cr.yp.to/speed.html

Ono-Sendai7y ago

Well, that's for a very small and particular size. (1024 elements) Intel's library might have more overhead at that size.

bdukic7y ago

My guess is that the "catch" comes in the form of the list of limitations.

carapace7y ago· 2 in thread

(Kind of a tangent, but if you're into sorting check out:

"Generic top-down discrimination for sorting and partitioning in linear time"

https://www.cambridge.org/core/journals/journal-of-functiona...

KirinDave7y ago

I've tried so many times to frontpage that, but I've failed. Maybe we should have another go?

carapace7y ago

It was probably you I heard about it from! Submission upvoted. ;-)

1 more reply

bluetech7y ago· 1 in thread

I liked this bit, using the fastest compiler for each primitive:

rphlx7y ago

It is sadly necessary; 30%+ performance regressions from, say, gcc 4 to gcc 6 are not uncommon w/ vector intrinsics.

southern_cross7y ago· 1 in thread

ur-whale7y ago

jonlandrum7y ago· 1 in thread

Am I the only one who thought the name had something to do with Djibouti?

eesmith7y ago

Probably one of the few. On HN, a search for "djb" results in about 60 submissions where "djb" is in the submission title. There are about 6 for Djibouti.

Also, the airport code "DJB" is for Sultan Thaha Airport. The Djibouti–Ambouli International Airport code is JIB.

eggie7y ago· 1 in thread

It's nice when things fit in RAM. But very often they don't. When you want to sort arbitrary-sized binary records on disk look no further than bsort: https://github.com/pelotoncycle/bsort

whazor7y ago

lixtra7y ago

The main feature of the algorithm is to sort fast in constant time (for fixed n) for cryptographic purposes [0].

[0] https://ntruprime.cr.yp.to/ntruprime-20170816.pdf P. 48

JdeBP7y ago

> Other modern Linux/BSD/UNIX systems should work with minor adjustments to the instructions.

    JdeBP /package/prog/djbsort % /tmp/djbsort/command/int32-speed
    int32 implementation int32/portable4
    int32 version -
    int32 compiler clang -fPIC -Wall -O2 -fomit-frame-pointer -fwrapv
    int32 1 72 72 72
    ...
    int32 1048576 1979077401 1979993070 1983745962

amorousf00p7y ago

Create a user and env to run a one-off build + application. DJB cracks me up. He may have the right thing in mind but this type of prophylactic approach is no longer proof against anything.

atesti7y ago

Direct download link (if you don't want to run the script)

https://sorting.cr.yp.to/djbsort-20180710.tar.gz

rurban7y ago

beware, Linux only. Needs the usual BSD/macOS patches for HW_CPUSPEED and CLOCK_MONOTONIC and do away with for linux/perf_event.h.

Unfortunately I have no idea how he deals with patches, I don't think he does.

ibuildoss7y ago

That's pretty cool, are there any bindings for e.g. Go out there?

1 more reply

j / k navigate · click thread line to collapse