I skimmed the paper, and as far as I understood it:
Most cryptographic hash functions operate in fixed-size blocks (for instance, 32 bytes). Additionally, most cryptographic hash function implementations are designed to be streaming, that is, they do not receive the whole input at once. If you give them a partial input which is not a multiple of their block size, these implementations have to buffer the partial input, so that it can be combined with the next partial input (or flushed, if the next call is to finish the computation and generate the output). The arithmetic overflow which leads to the buffer overflow happens when computing how much it has to buffer, given a partial input and any previous partial input already in its internal buffer.
That is, having a file larger than 4GiB is not enough; it has to also be cut into pieces which are not a multiple of the block size (which is normally a power of two). Most users of a cryptographic hash function will either give it the input in large power-of-two pieces (for instance, 8 KiB or 64 KiB), or give it the input all at once, and thus will not hit the bug.
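A minimal sketch of such a streaming update path (illustrative names and structure, not the actual XKCP code) shows where the "how much to buffer" computation lives:

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical streaming-hash update path; all names are illustrative. */
#define BLOCK 136            /* SHA3-256 rate in bytes */

struct ctx {
    unsigned char queue[BLOCK];  /* buffered partial block */
    size_t queued;               /* bytes currently buffered */
};

/* Absorb `len` bytes; input that doesn't fill a whole block is buffered
 * until the next call. */
void update(struct ctx *c, const unsigned char *data, size_t len) {
    size_t i = 0;
    while (i < len) {
        /* How many bytes can we take this round? This is the kind of
         * computation where a careless 32-bit cast causes the bug. */
        size_t take = BLOCK - c->queued;
        if (take > len - i)
            take = len - i;
        memcpy(c->queue + c->queued, data + i, take);
        c->queued += take;
        i += take;
        if (c->queued == BLOCK) {
            /* process_block(c->queue); -- permute the state here */
            c->queued = 0;
        }
    }
}
```

Feeding this in odd-sized chunks exercises the buffering path: 10 bytes leaves 10 queued, then 130 more completes one block and leaves 4 queued.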
Buffer overflows and integer overflows/UB are very common. UBSan, ASan, and Valgrind tests are missing. Some projects do offer symbolic verification of the algorithm, but not of the implementations.
See my https://github.com/rurban/smhasher#crypto paragraph, and "Finding Bugs in Cryptographic Hash Function Implementations", Nicky Mouha, Mohammad S Raunak, D. Richard Kuhn, and Raghu Kacker, 2017. https://eprint.iacr.org/2017/891.pdf
https://news.ycombinator.com/item?id=31089216 ... and it's not like there wasn't a FOSS test suite for this.
The paper makes no mention of compiler warnings… but shouldn’t this cast trigger a compiler warning?
Basically, this is why you shouldn't use unsigned types unless you explicitly want them to overflow.
-Weverything also emits lots of noisy warnings, which would dissuade many people from running it by default.
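For what it's worth, GCC and Clang will flag the implicit narrowing under `-Wconversion` (which is not enabled by `-Wall` or `-Wextra`), but an explicit cast is precisely how you tell the compiler a truncation is intentional, so the cast in the bug silences exactly that warning. A minimal pair (illustrative functions, not the real code):

```c
#include <stddef.h>

/* With -Wconversion, this implicit narrowing draws a warning:
 * "conversion from 'size_t' to 'unsigned int' may change value". */
unsigned int narrow_implicit(size_t a, size_t b) {
    return a - b;
}

/* The explicit cast suppresses that warning, even though the
 * truncation is just as real. */
unsigned int narrow_explicit(size_t a, size_t b) {
    return (unsigned int)(a - b);
}
```

So the cast that was meant as a precaution is also what keeps the one relevant diagnostic quiet.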
This is such a critical part of the software stack that we need a more reliable way of validating it than a bunch of people staring at code written in C.
A formal verification implementation would catch it at authoring time, yes.
I just can’t get my head round the idea that software written and reviewed by experts and submitted to the “National Institute of Standards and Technology” with a budget of 1 billion dollars can fuck up this way.
I’m no mathematician but I would have thought implementing pure number crunching code is not rocket science.
Buffer overflow, overwrite memory, run arbitrary code, seriously? LOL, WTF.
Even with memory safe languages, there are dangers. Humanity just hasn't figured out how to produce completely bug-free code at the scale we need in general, let alone in a memory-unsafe language.
partialBlock = (unsigned int)(dataByteLen - i);
Where both `dataByteLen` and `i` were actually `size_t`. Assuming this is close enough to C, what happens is that we're converting a difference of two `size_t` values into a mere `unsigned int`, and since they're not the same size on 64-bit platforms, this can give `partialBlock` the wrong value; the whole thing then snowballs into a catastrophic error that is not trivial to test, because it only happens with huge buffer sizes.
The biggest mistake here is having written `(unsigned int)` instead of `(size_t)`. But the reason it happened in the first place is that they tried to do the right thing: writing the cast as a precaution, even though the following would have worked:
partialBlock = dataByteLen - i;
I really can't fault them: because it was a difference, it could theoretically yield a "negative" result, so intuitively the type of a difference should be signed, and casting it back to unsigned makes the intent crystal clear. I knew C was dangerous, but to be honest I didn't expect such a wicked mind game. Now I'm going to have to take a look at my code.
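To make the failure concrete, here is a hedged sketch (illustrative names, not the real XKCP code) of how the truncated difference misbehaves once the remaining input reaches 4 GiB on an LP64 platform:

```c
#include <stddef.h>

#define RATE 136  /* SHA3-256 block ("rate") size in bytes */

/* Correct: the 64-bit difference is clamped to one block. */
size_t partial_len_ok(size_t dataByteLen, size_t i) {
    size_t remaining = dataByteLen - i;
    return remaining < RATE ? remaining : RATE;
}

/* Buggy: the difference is truncated to 32 bits before any clamping,
 * so a remaining length of exactly 2^32 bytes becomes 0, and lengths
 * slightly above 2^32 become small bogus values. */
size_t partial_len_buggy(size_t dataByteLen, size_t i) {
    unsigned int partialBlock = (unsigned int)(dataByteLen - i);
    return partialBlock < RATE ? partialBlock : RATE;
}
```

With 2^32 bytes remaining, the correct version yields a full block of 136 bytes while the buggy one yields 0; with 2^32 + 7 remaining, the buggy one yields 7. Once the byte counter and the internal queue index disagree like this, the out-of-bounds write follows.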
Shipping a bug-free non-trivial program, one with more than a few thousand lines of code, borders on impossible.
It's true that Snowden revealed the NSA had their fingers in NIST's cryptographic standards team, with Dual-EC a specific example that is considered suspicious since the revelations. So a suspicion of malice is not completely unfounded, but as far as I can tell, this code was written by the Keccak team not NIST itself, and for any claim of "they're NSA stooges", I would need evidence.
It also seems to me that the Keccak team are not stupid people. That leaves "honest mistake" as the most likely explanation.
There are lots of studies on human behaviour that show a competent and diligent human will mess up every Nth time they perform a routine task; some forms of probabilistic risk analysis take N = 10^3 as a lower bound for this.
There is a long history of railroad operation in the UK (which invented the steam train, after all) where, occasionally, a signaller would send an express onto a line where they'd forgotten a local was standing, or send two trains head-on down a single line from opposite ends. This led to the development of interlockings and token-working systems as technological mitigations for human error, and later to today's computerised safety systems, because signallers are (almost always) neither stupid nor malicious, but still human. The same can be said for programmers.
(As I understand it, the recent train disaster in Greece was on a line where interlocking should have been in place, but it wasn't active.)
Do you think everything (arguably anything) is released flawless?