Self-Contained Pure-Go Web Server with Lua, MD, HTTP/2, QUIC, Redis Support (opens in new tab)

(github.com)

216 pointsPropolice7y ago60 comments

60 comments

43 comments · 5 top-level

tyingq7y ago· 19 in thread

"Files that are sent to the client are compressed with gzip, unless they are under 4096 bytes."

That's interesting. Is that a common optimization? I hadn't heard of any other web server doing that.

xyproto7y ago

I did some quick benchmarking and for files under roughly 4096 bytes, not compressing with gzip was faster.

It's not terribly exact, +-1000 bytes would probably not make a big difference, but I think it's a good default.

And of course, some people may have unique use cases where a custom threshold may be better.

nnx7y ago

Nginx uses a similar optimization with a configurable threshold defaulted to 1kb.

nodesocket7y ago

Correct, the NGINX configuration should be something like:

    gzip on;
    gunzip on;
    gzip_http_version 1.1;
    gzip_proxied any;
    gzip_comp_level 5;
    gzip_disable "msie6";
    gzip_vary on;
    gzip_min_length 2048;

DarkWiiPlayer7y ago

I heared somewhere (Some blogs comment section, I believe) that gzip actually reduces the security of HTTPS; maybe someone can confirm / explain that?

tialaramex7y ago

Compression technologies, including gzip, obviously have the goal of making things smaller by predicting later data based on earlier data. If the later data looks more like the earlier data, the result is smaller than if it was random gibberish. Compression!

If an attacker controls /some/ of this data, and would like to read /other parts/, they can abuse compression to measure whether the parts they don't know are "like" the part they control, because if they are then the compression will make the results shorter than otherwise which they can passively measure.

It's not a problem to move a compressed object over a secure channel on its own, the problem arises if either you try to compress the channel which is moving objects from different origins (e.g. a cookie set by a random advertising web site and your Facebook password) or compress a composite object e.g. maybe your backups mixed with a file you downloaded from a dodgy "pirate" video site.

takeda7y ago

In scenario when an attacker can see encrypted message (e.g. monitoring network traffic) and can affect part of the message that is being encrypted, he can use the compression to his advantage. He can for example try different inputs and observe the length of encrypted message, if with the certain input the length drops that means the given input contains string that's similar to another part of the decrypted message and the compressing algorithm did its job and used that to reduce the size.

This was mentioned in 2012 when CRIME[1] (also included BEAST[2] exploit) and later BREACH[2] vulnerability (when it was considered cool to come up with cool sounding name, creating a logo and a website for specific vulnerabilities)

[1] https://en.wikipedia.org/wiki/CRIME

[2] https://en.wikipedia.org/wiki/Transport_Layer_Security#BEAST...

[3] https://en.wikipedia.org/wiki/BREACH

emilfihlman7y ago

This shouldn't be true in any sane system.

hombre_fatal7y ago

I'd be surprised if it didn't exist in every compression middleware.

For example, https://github.com/expressjs/compression/blob/dd5055dc92fdea...

tyingq7y ago

I don't see anything like that documented for apache's mod_deflate/zlib.

1 more reply

codebeaker7y ago

Also, according to the spec HTTP servers may not always honour the value in the `Accept-Encoding` header[0].

> Even if both the client and the server supports the same compression algorithms, the server may choose not to compress the body of a response, if the identity value is also acceptable.

I've actually run into this twice in my career and it has been a surprise to those around me in both cases. Both times in the context of small payloads where the server is applying some heuristic about whether to encode or not. (e.g status page stops sending gzipped output when the server is becoming "unhealthy")

[0]: https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/Ac...

hombre_fatal7y ago

Makes more sense to use the verb "negotiating" with Accept-* headers rather than "honoring".

This makes obvious sense once you consider that the client tells the server which compression formats it supports in every request yet not every data format is compressible, nor does the server necessary support any candidate compression format.

For example, the server wouldn't gzip a jpeg since it's already compressed.

All Accept-* headers are like this. e.g. the server doesn't necessarily support any of the languages requested in the Accept-Language header, but it doesn't hurt to ask. You always have to inspect the response headers to see the result of negotiation.

takeda7y ago

"Accept-Encoding" means only that the client also understands specific encoding (in this case compression) it is still up to the server to chose what to dot. There was a time when browsers didn't support any compression. This header was introduced to signal to server what is acceptable by the client, that's why the header allows specifying multiple compression algorithms.

Similar thing is with header that's quite useful, but for some reason very few sites honor it: "Accept-Language" browser can specify which languages are preferred, but it is up to server to honor it (for example given language version is not available).

arkadiyt7y ago

Cloudfront is similar [1]:

> The file size must be between 1,000 and 10,000,000 bytes.

[1]: https://docs.aws.amazon.com/AmazonCloudFront/latest/Develope...

Traubenfuchs7y ago

Tomcat, one of the most used Java application servers and the default server behind Spring Boot applications defaults to a "compressionMinSize " value of 2048.

The docs do not explain why.

jand7y ago

Not a direct answer, but also interesting to consider:

As everything will end up in a packet when sent through the network stack, you might want to choose your minimum input size in such way, that you generate gzip-compressed output big enough. Why big enought? Nagle's algorithm [1]

So yet another reason to think about 'what to gzip'.

[1] https://en.wikipedia.org/wiki/Nagle%27s_algorithm

tialaramex7y ago

Applications that always know exactly what they want to send disable this algorithm, as that article explains, by setting TCP_NODELAY or its moral equvialents in their framework. A web server will almost invariably set TCP_NODELAY.

More sophisticated algorithms can either decide exactly which packets to send, or use TCP_CORK to shove part of a packet into a buffer before they add the rest of the stuff, e.g. preparing HTTP headers and then adding the static document that goes after them.

tedunangst7y ago

If you read the page you linked, it says the algorithm applies to data of any size.

Malic7y ago

I don't know about "under 4096 bytes" but I have heard of not compressing data that is under ~1500 bytes. Part of the thinking is this - if your result data (plus HTTP overhead) is already smaller than the data payload of an IP packet (MTU settings come into play here), then you are spending CPU time that will not save you any network I/O time.

xyproto7y ago

Yes, also compressing and decompressing a small amount of bytes may take longer than just sending it uncompressed.

est317y ago· 15 in thread

This is quite impressive, but this claim is a bit wrong:

> All in one small self-contained executable.

Size of algernon executable: 24.4 MiB

Size of nginx-full executable: 1.1 MiB

Size of apache2 executable: 648K

sagichmal7y ago

For self-contained architecture-specific server binaries, there is no practical difference between 240KB, or 2.4MB, or 24.4MB, or even, at a stretch, 244MB. It's not worth mentioning or optimizing for, except as novelty. I wish people would stop golfing with these numbers.

takeda7y ago

You have it backwards, apache or nginx size is not for the novelty. Go is just a pig and its size grows every new release, and the size isn't really for being statically linked or debugging symbols, because it's huge even when those options are disabled.

Right now a "hello world" application in Go has comparable size to an OS with full GUI.

2 more replies

PropoliceOP7y ago

Algernon does a bit more than plain Nginx or Apache. 24.4 MiB also includes bloat. See: https://github.com/golang/go/issues/27266 https://github.com/golang/go/issues/2559

Hopefully we can get smaller binaries by Go 1.13.

takeda7y ago

My Apache installation with modules takes 4.3MB, and I'm quite sure Apache with modules can do more than Algernon.

mholt7y ago

If that's the main criticism you have of the project, I'd say that's pretty good.

In fact, the readme of this project is really thorough!

interfixus7y ago

Very civilized comment from mr. Caddy himself. Heartening, not least considering the kind of vendettas some projects - Caddy among them - have to put up with from competing developers.

est317y ago

Well yeah it's definitely an interesting approach. I guess it's much more lean than running a whole Ubuntu container VM that has all of these things installed, or running a gigantic bloated Javascript toolchain just to convert the scss or md files to css or html.

lawl7y ago

A big part is almost certainly the difference between dynamic and static linking.

Can you run ldd on all of these and then report the combined size for each binary+libraries?

marcus_holmes7y ago

As someone else commented, the huge size of Go executables is down to a design decision to include a map of functions for panic reporting. There was a whole discussion on this recently on HN.

I don't know why the grandparent was downvoted. Go binaries are not small and the claim that this is a "small" single executable is untrue.

Hopefully the Go team will give us a flag to decide for ourselves whether to optimise for executable size or initialisation time. I know I'm fed up of uploading 50Mb files over dodgy wifi+vpn connections to update my server.

(edit fix repetition of design)

2 more replies

est317y ago

I don't have nginx installed (got the number from a web download of the .deb package), but running this for apache:

3200

So 3.2 MB of shared library dependencies. 1.8 MB just being the libc which is almost guaranteed to be used by a different program already.

1 more reply

chvid7y ago

I don't think it is fair that you are being downvoted.

Size does matter and not just in sense that it is using resources. The largest part of the 24 MB probably never gets executed but it adds unnecessary complexity that may hide bugs and security flaws.

xyproto7y ago

Algernon would be a lot smaller if it was not statically compiled and compiled with gccgo instead.

Sadly, the Go package that provides support for QUIC does not compile with gccgo, yet.

justinsaccount7y ago

apache2 is mostly modules that are loaded at runtime.

takeda7y ago

On my FreeBSD machine, all files in the entire apache package (including modules, manpages, headers, default pages, graphic for displaying directories in gif and png formats, tools are) take 4.3MB

xyproto7y ago

The docker image is 9MB. https://hub.docker.com/r/xyproto/algernon/tags

marcus_holmes7y ago· 2 in thread

I wonder why they didn't include Let's Encrypt integration - it's completely painless using the acme library, and that would prevent the whole "HTTP or HTTPS?" discussion around HTTP/2

xyproto7y ago

It's in progress. Algernon is an open source project where I am the main contributor, and I develop Algernon in my spare time. Pull requests are welcome.

marcus_holmes7y ago

I'd love to help, but my coding time is already taken building a product.

I pretty much followed the instructions here: https://godoc.org/golang.org/x/crypto/acme/autocert

edit, better here: https://blog.kowalczyk.info/article/Jl3G/https-for-free-in-g...

I didn't believe it could be that simple, but it worked first time and has proven really robust.

1 more reply

a_imho7y ago· 1 in thread

How does this compare to OpenResty?

DarkWiiPlayer7y ago

I suppose this one offers an all-in-one package, while openresty is really just an nginx server with builtin Lua(JIT) support.

mtw7y ago· 1 in thread

What is the benefit of using this? In what scenario would this excel? Thanks.

xyproto7y ago

Good question. I'm not sure if it excels in any scenario. There are specialized web servers that excel at caching or at raw performance. There are dedicated backends for popular front-end toolkits like Vue or React. There are dedicated editors that excel at editing and previewing Markdown, or HTML.

I guess the main benefit is that Algernon covers a lot of ground, with a minimum of configuration, while being powerful enough to have a plugin system and support for programming in Lua. There is an auto-refresh feature that uses Server Sent Events, when editing Markdown or web pages. There is also support for the latest in Web technologies, like HTTP/2, QUIC and TLS 1.3. The caching system is decent. And the use of Go ensures that also smaller platforms like NetBSD and systems like Raspberry Pi are covered. There are no external dependencies, so Algernon can run on any system that Go can support.

The main benefit is that is is versatile, fresh, and covers many platforms and use cases.

For a more specific description of a potential benefit, a more specific use case would be needed.

j / k navigate · click thread line to collapse

60 comments

43 comments · 5 top-level

tyingq7y ago· 19 in thread

"Files that are sent to the client are compressed with gzip, unless they are under 4096 bytes."

That's interesting. Is that a common optimization? I hadn't heard of any other web server doing that.

xyproto7y ago

I did some quick benchmarking and for files under roughly 4096 bytes, not compressing with gzip was faster.

It's not terribly exact, +-1000 bytes would probably not make a big difference, but I think it's a good default.

And of course, some people may have unique use cases where a custom threshold may be better.

nnx7y ago

Nginx uses a similar optimization with a configurable threshold defaulted to 1kb.

nodesocket7y ago

Correct, the NGINX configuration should be something like:

    gzip on;
    gunzip on;
    gzip_http_version 1.1;
    gzip_proxied any;
    gzip_comp_level 5;
    gzip_disable "msie6";
    gzip_vary on;
    gzip_min_length 2048;

DarkWiiPlayer7y ago

I heared somewhere (Some blogs comment section, I believe) that gzip actually reduces the security of HTTPS; maybe someone can confirm / explain that?

tialaramex7y ago

takeda7y ago

[1] https://en.wikipedia.org/wiki/CRIME

[2] https://en.wikipedia.org/wiki/Transport_Layer_Security#BEAST...

[3] https://en.wikipedia.org/wiki/BREACH

emilfihlman7y ago

This shouldn't be true in any sane system.

hombre_fatal7y ago

I'd be surprised if it didn't exist in every compression middleware.

For example, https://github.com/expressjs/compression/blob/dd5055dc92fdea...

tyingq7y ago

I don't see anything like that documented for apache's mod_deflate/zlib.

1 more reply

codebeaker7y ago

Also, according to the spec HTTP servers may not always honour the value in the `Accept-Encoding` header[0].

> Even if both the client and the server supports the same compression algorithms, the server may choose not to compress the body of a response, if the identity value is also acceptable.

[0]: https://developer.mozilla.org/en-US/docs/Web/HTTP/Headers/Ac...

hombre_fatal7y ago

Makes more sense to use the verb "negotiating" with Accept-* headers rather than "honoring".

For example, the server wouldn't gzip a jpeg since it's already compressed.

takeda7y ago

arkadiyt7y ago

Cloudfront is similar [1]:

> The file size must be between 1,000 and 10,000,000 bytes.

[1]: https://docs.aws.amazon.com/AmazonCloudFront/latest/Develope...

Traubenfuchs7y ago

Tomcat, one of the most used Java application servers and the default server behind Spring Boot applications defaults to a "compressionMinSize " value of 2048.

The docs do not explain why.

jand7y ago

Not a direct answer, but also interesting to consider:

So yet another reason to think about 'what to gzip'.

[1] https://en.wikipedia.org/wiki/Nagle%27s_algorithm

tialaramex7y ago

tedunangst7y ago

If you read the page you linked, it says the algorithm applies to data of any size.

Malic7y ago

xyproto7y ago

Yes, also compressing and decompressing a small amount of bytes may take longer than just sending it uncompressed.

est317y ago· 15 in thread

This is quite impressive, but this claim is a bit wrong:

> All in one small self-contained executable.

Size of algernon executable: 24.4 MiB

Size of nginx-full executable: 1.1 MiB

Size of apache2 executable: 648K

sagichmal7y ago

takeda7y ago

Right now a "hello world" application in Go has comparable size to an OS with full GUI.

2 more replies

PropoliceOP7y ago

Algernon does a bit more than plain Nginx or Apache. 24.4 MiB also includes bloat. See: https://github.com/golang/go/issues/27266 https://github.com/golang/go/issues/2559

Hopefully we can get smaller binaries by Go 1.13.

takeda7y ago

My Apache installation with modules takes 4.3MB, and I'm quite sure Apache with modules can do more than Algernon.

mholt7y ago

If that's the main criticism you have of the project, I'd say that's pretty good.

In fact, the readme of this project is really thorough!

interfixus7y ago

Very civilized comment from mr. Caddy himself. Heartening, not least considering the kind of vendettas some projects - Caddy among them - have to put up with from competing developers.

est317y ago

lawl7y ago

A big part is almost certainly the difference between dynamic and static linking.

Can you run ldd on all of these and then report the combined size for each binary+libraries?

marcus_holmes7y ago

As someone else commented, the huge size of Go executables is down to a design decision to include a map of functions for panic reporting. There was a whole discussion on this recently on HN.

I don't know why the grandparent was downvoted. Go binaries are not small and the claim that this is a "small" single executable is untrue.

(edit fix repetition of design)

2 more replies

est317y ago

I don't have nginx installed (got the number from a web download of the .deb package), but running this for apache:

3200

So 3.2 MB of shared library dependencies. 1.8 MB just being the libc which is almost guaranteed to be used by a different program already.

1 more reply

chvid7y ago

I don't think it is fair that you are being downvoted.

Size does matter and not just in sense that it is using resources. The largest part of the 24 MB probably never gets executed but it adds unnecessary complexity that may hide bugs and security flaws.

xyproto7y ago

Algernon would be a lot smaller if it was not statically compiled and compiled with gccgo instead.

Sadly, the Go package that provides support for QUIC does not compile with gccgo, yet.

justinsaccount7y ago

apache2 is mostly modules that are loaded at runtime.

takeda7y ago

On my FreeBSD machine, all files in the entire apache package (including modules, manpages, headers, default pages, graphic for displaying directories in gif and png formats, tools are) take 4.3MB

xyproto7y ago

The docker image is 9MB. https://hub.docker.com/r/xyproto/algernon/tags

marcus_holmes7y ago· 2 in thread

I wonder why they didn't include Let's Encrypt integration - it's completely painless using the acme library, and that would prevent the whole "HTTP or HTTPS?" discussion around HTTP/2

xyproto7y ago

It's in progress. Algernon is an open source project where I am the main contributor, and I develop Algernon in my spare time. Pull requests are welcome.

marcus_holmes7y ago

I'd love to help, but my coding time is already taken building a product.

I pretty much followed the instructions here: https://godoc.org/golang.org/x/crypto/acme/autocert

edit, better here: https://blog.kowalczyk.info/article/Jl3G/https-for-free-in-g...

I didn't believe it could be that simple, but it worked first time and has proven really robust.

1 more reply

a_imho7y ago· 1 in thread

How does this compare to OpenResty?

DarkWiiPlayer7y ago

I suppose this one offers an all-in-one package, while openresty is really just an nginx server with builtin Lua(JIT) support.

mtw7y ago· 1 in thread

What is the benefit of using this? In what scenario would this excel? Thanks.

xyproto7y ago

The main benefit is that is is versatile, fresh, and covers many platforms and use cases.

For a more specific description of a potential benefit, a more specific use case would be needed.

j / k navigate · click thread line to collapse