Redis on the Raspberry Pi: Adventures in unaligned lands (opens in new tab)

https://www.reddit.com/r/redis/comments/4mmrgr/stream_data_s...

yeswecatan8y ago

Here's a discussion on reddit. There's a link to the proposal on github, too.

johnny228y ago

I'm pretty sure I saw implementations that used the existing publish subscribe mechanism in Redis to handle it and seemed happy with it. I have no personal experience with it though.

k__8y ago· 5 in thread

OT: Is blattimwind shadow banned?

make38y ago

? I see his post

yorwba8y ago

Probably someone vouched for it.

retox8y ago

No, but posting while green will usually get your comment downvoted to oblivion, even if you are erudite and contribute to the conversation.

Turn on "show dead comments" and see how many greens are deleted. I screenshot many examples.

taneq8y ago

Is this cause (ie. people downvote greens out of prejudice) or effect (greens are often created to shitpost?

And to concentrate all my meta in one place... Is shadow banning a thing at HN? I thought they just, well, banned you.

icebraining8y ago

I doubt it's downvotes. Probably cases of the spam/ringvoting detector gone wrong.

drewg1238y ago· 4 in thread

I fondly remember unaligned access faults "back in the day" with FreeBSD/alpha. We implemented a fixup for applications, but not for the kernel. I seem to recall that even though x86 could deal with unaligned accesses, it caused minor performance problems, so fixing alignment issues on alpha would benefit x86 as well.

Most (definitely not all) of the mis-alignment problems were in the network stack, and were centered around the fact that ethernet headers are 14 bytes, while nearly all other protocols had headers that were a multiple of at least 4 bytes.

I've said it before, and I'll say it again: If I had a time machine, I would not kill Hitler. I'd go back to the 70s and make the ethernet header be 16 bytes long, rather than 14.

IgorPartola8y ago

Why in god's name did they make it 14?!

pjc508y ago

Ethernet was invented in 1973 and the first 32-bit processors were available in 1979.

While you've got the time machine, can you fix it so that "network byte order" and Intel endianness are the same too?

jandrese8y ago

It's all they needed. 6 bytes per address, and 2 more bytes to mark the protocol. Back in the 70s and 80s memory was very expensive and developers bent over backwards to save bytes everywhere. This is also why IP addresses are only 32 bits long, even though they knew that it wouldn't be enough if the protocol went global.

Hindsight is 20/20, and a lot of times people don't appreciate the constraints these old systems had. This was being developed decade before the Commodore 64 came out with its luxurious 64 kilobytes of memory (39k usable).

jimktrains28y ago

They didn't feel like they needed those 2 bytes and, hey, why waste space?

Also, was a "byte" standardized at the time? Didn't they still have systems working in not-8-bit "byte", nibbles, byte, and 2-byte boundaries?

throwaway0000028y ago· 4 in thread

I'm probably the only weirdo that thinks this, but if you support byte-addressing you'd better as well be happy with byte-alignment. Atomics being the only place where it's reasonable to be different.

Which brings me to padding. I wonder what percentage of memory of the average 64-bit user's system is padding? I'm afraid of the answer. The heroes of yesteryear could've coded miracles in the ignored spaces in our data.

wzdd8y ago

> if you support byte-addressing you'd better as well be happy with byte-alignment

All ARM processors do this. The concept is called "natural alignment" and it's pretty common on non-x86. See e.g. http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.... . The problem here is that a lot of code written for x86 wants more than that, e.g. byte addressing for non-byte-wide values.

I understand. What I mean is that if your word-size is not your addressing-size, you'd better not have a concept of mis-aligned accesses. It's trouble you brought on all by yourself.

pm2158y ago

Alignment requirements are and have historically been very common -- you can see them on the PDP-11, the 680x0, and so on. It's only because a few very popular architectures like x86 have had very loose or no alignment requirements that we've ended up with a lot of code that assumes there is no alignment requirement, and this has dragged other architectures down the "we need to support this" path. If your architecture faults on misaligned accesses it's really not hard to deal with -- you have to be doing something a bit odd to even run into the problem usually.

I can understand the historical requirements for alignment, the necessary transistors, what not. But, much like branch-delay slots, there is no modern reason to expose this to the programmer. Of course, I gave an exception to atomics, but if you will, they're like memory-mapped communication, and now that all I/O is memory-mapped, with no concept of ports, the (ordering) semantics of memory access becomes really important.

I'm also the weirdo that feels process isolation, memory management, and I/O mechanisms need a rethink. But that's something that would take me forever to get into.

One thing I will say, though, is alignment issues "infect" everything. Assume your architecture doesn't allow misaligned access. Now, all your data has to be naturally aligned. Your structs now have to be aligned to the alignment of the largest sub-structure within them. This is all because code is alignment sensitive. Given a pointer to a struct, generic code is unnecessarily larger. Any why would we care? Communication, of course. If we're exchanging data between systems then idiosyncrasies such as this suddenly become globally visible.

Endian-ness must be little. Byte-aligment a non-issue, and network-bit order should be from bit zero up, with any upper layer need, say for cut-through forwarding, expressed as a data ordering requirement, so for example an IP4 address is not a blind 32-bit word, but specifies the structure of those 32-bits.

blattimwind8y ago· 2 in thread

There is a funny mode on ARM processors (turned on in some images, by default) which causes unaligned reads to silently return bogus data (just increasing a kernel counter).

PowerPC, and really, most non-x86 architectures, do this one way or another.

faragon8y ago

PowerPC (and POWER) has reasonable hardware support for unaligned memory access, at least for 32-bit data, and if the data is in the data cache. Depending on the processor, the exceptions that reach the OS can be more or less frequent.

ARM v6-A and later (except for some microcontrollers, like Cortex M0/R0, that don't support hardware unaligned access at all, triggering a exception) is similar to the Intel x86 case (reference in transparent unaligned memory access -except for SIMD, where x86 can raise exceptions, too, in the case of unaligned load/store opcodes-), where there is hardware support for unaligned memory access.

For software that uses intensive non-aligned data access, e.g. data compression algorithms doing string search, PowerPC, ARM v6-A (and later ARM Application processors), new MIPS with HW support for unaligned memory access, and Intel are pretty much the same (i.e. b = * (uint32_t * )(a + 23) will take 1-2 cycles, not requiring doing a memcpy(&b, a + 23, sizeof(uint32_t))).

For SIMD, though, there is no transparent fix, although there are specific opcodes for aligned and unaligned memory access (e.g. load/store, unaligned load/store).

antirez8y ago

I would say that ARM v6 and later is a major step forward, but is v8 that really seems to be similar to Intel finally. The v6 was able to deal only with single fetch/store unaligned instructions, but things like accessing a double or multiple words with the same instruction would raise an exception.

https://github.com/antirez/redis/pull/3889

amelius8y ago· 2 in thread

Could Rust's typesystem catch unaligned pointer dereferences?

bbatha8y ago

Sort of, Rust is supposed to make references to packed structure members unsafe, but currently doesn't. An RFC was accepted to change the behavior but it has not been fully implemented. Here's the tracking issue: https://github.com/rust-lang/rust/issues/27060

wofo8y ago

Considering dereferencing a pointer after doing some arithmetic on it can only be done within unsafe blocks, I would say you are at least warned about it. But it will happily compile.

drej8y ago· 1 in thread

I never deal with such low level issues, so I don't have to read this, but... reading these posts by antirez is such a joy. He makes this topic so clear and understandable, he doesn't assume much, he doesn't use overly complex explanations, he just "says it like it is" :-)

Thanks!

hellwd8y ago

++ :)

MrBuddyCasino8y ago

Accessing memory locations ending in 0x7? Gather round the campfire folks, James Mickens has a story to tell: https://www.usenix.org/system/files/1311_05-08_mickens.pdf

msarnoff8y ago

Recently I've been doing a lot of low-level work with ARMv7-M microcontrollers (specifically, NXP's Kinetis Cortex-M4 chips) and was quite pleased to find out that they are pretty lenient about unaligned accesses. To quote from the ARM Cortex-M4 Processor Technical Reference Manual:

"Unaligned word or halfword loads or stores add penalty cycles. A byte aligned halfword load or store adds one extra cycle to perform the operation as two bytes. A halfword aligned word load or store adds one extra cycle to perform the operation as two halfwords. A byte-aligned word load or store adds two extra cycles to perform the operation as a byte, a halfword, and a byte. These numbers increase if the memory stalls."

However, multi-word memory instructions (LDRD, STRD, LDM, STM, etc.) always require their arguments to be word-aligned.

type08y ago

Great article, this project just begs the name of Redisberry Pi

JefeChulo8y ago

In future project I might be interested in the use of Redis for queuing jobs, this comes very handy to now early the main issues I could get when developing.

dis-sys8y ago

wondering what kind of performance overhead it is going to cause by letting the kernel to handle unaligned access vs. fixing the software to actually always use aligned access?

crncosta8y ago

Nice article!

j / k navigate · click thread line to collapse

59 comments

36 comments · 13 top-level

luhn8y ago· 5 in thread

This sounds like it could be really exciting. Is there anywhere I can find out more?

Specifically, I've been struggling to find an appropriate backend for HTTP Server-Sent Events, could this feature help with that?

antirez8y ago

Hello, please check my two Redis Conf 2017 talks on youtube. There is info about Streams.

luhn8y ago

Thanks antirez! This looks exactly like the feature I've been searching for. :)

For posterity, here's the referenced videos:

General overview: https://youtu.be/U7J33pd3hLU?t=23m54s

Implementation details: https://youtu.be/Wzy8dIjsY6Y

fancy_pantser8y ago

Did my enhancement make it into the skip list implementation being used for the STREAM type? I am hoping it would be in place before you publish benchmarks for it.

https://www.reddit.com/r/redis/comments/4mmrgr/stream_data_s...

yeswecatan8y ago

Here's a discussion on reddit. There's a link to the proposal on github, too.

johnny228y ago

I'm pretty sure I saw implementations that used the existing publish subscribe mechanism in Redis to handle it and seemed happy with it. I have no personal experience with it though.

k__8y ago· 5 in thread

OT: Is blattimwind shadow banned?

make38y ago

? I see his post

yorwba8y ago

Probably someone vouched for it.

retox8y ago

No, but posting while green will usually get your comment downvoted to oblivion, even if you are erudite and contribute to the conversation.

Turn on "show dead comments" and see how many greens are deleted. I screenshot many examples.

taneq8y ago

Is this cause (ie. people downvote greens out of prejudice) or effect (greens are often created to shitpost?

And to concentrate all my meta in one place... Is shadow banning a thing at HN? I thought they just, well, banned you.

icebraining8y ago

I doubt it's downvotes. Probably cases of the spam/ringvoting detector gone wrong.

drewg1238y ago· 4 in thread

I've said it before, and I'll say it again: If I had a time machine, I would not kill Hitler. I'd go back to the 70s and make the ethernet header be 16 bytes long, rather than 14.

IgorPartola8y ago

Why in god's name did they make it 14?!

pjc508y ago

Ethernet was invented in 1973 and the first 32-bit processors were available in 1979.

While you've got the time machine, can you fix it so that "network byte order" and Intel endianness are the same too?

jandrese8y ago

jimktrains28y ago

They didn't feel like they needed those 2 bytes and, hey, why waste space?

Also, was a "byte" standardized at the time? Didn't they still have systems working in not-8-bit "byte", nibbles, byte, and 2-byte boundaries?

throwaway0000028y ago· 4 in thread

wzdd8y ago

> if you support byte-addressing you'd better as well be happy with byte-alignment

I understand. What I mean is that if your word-size is not your addressing-size, you'd better not have a concept of mis-aligned accesses. It's trouble you brought on all by yourself.

pm2158y ago

I'm also the weirdo that feels process isolation, memory management, and I/O mechanisms need a rethink. But that's something that would take me forever to get into.

blattimwind8y ago· 2 in thread

There is a funny mode on ARM processors (turned on in some images, by default) which causes unaligned reads to silently return bogus data (just increasing a kernel counter).

PowerPC, and really, most non-x86 architectures, do this one way or another.

faragon8y ago

For SIMD, though, there is no transparent fix, although there are specific opcodes for aligned and unaligned memory access (e.g. load/store, unaligned load/store).

antirez8y ago