My C code works with -O3 but not with -O0 (opens in new tab)

(mulle-kybernetik.com)

177 pointsmulle_nat6y ago159 comments

159 comments

77 comments · 19 top-level

hannob6y ago· 15 in thread

This may be an instance of "you should really know the gcc/clang sanitizers and use them to test your code":

clang test.c -O0 -fsanitize=undefined

./a.out

[...]

test.c:17:12: runtime error: 9.22337e+18 is outside the range of representable values of type 'long'

Interestingly gcc doesn't throw that warning.

htfy966y ago

I Second this. Most weird behaviors in C++ today can be detected by ASAN and UBSAN.

There also exists low-cost random-sampling-based ASAN implementation that can be enabled in production: Google uses GWP-ASAN for all server-side applications as well as Chrome on Windows/Mac. See https://www.youtube.com/watch?v=RQGWMLkwrKc for details.

pjmlp6y ago

According to most surveys, they aren't used as much on real life.

Here 14%, https://www.jetbrains.com/lp/devecosystem-2019/cpp/

Here 40 - 55%, https://www.bfilipek.com/2019/12/cpp-status-2019.html

At CppCon 2015 or something, at Herb's question during his keynote, about 1% of the audience as per his comment on the video.

1 more reply

ndesaulniers6y ago

Just yesterday I had upgraded the version of clang used to compile Android's emulator. It got reverted due to some post submit test failing suddenly. The test case was shifting values in a byte array into one value but the LHS of a left shift didn't have enough bits to represent the shift, which is explicit UB. The statement had multiple shift operations (and other sub expressions with templated types) so it wasn't immediately clear that was the issue. -fsanitize=undefined found it immediately. "Spot the UB" is seemingly becoming my pastime.

tyingq6y ago

Does that verifier have a strong possibility of false positives? I'm curious why C compilers have such a strong history of making reasonable checks optional and hidden behind a bunch of switches.

hannob6y ago

No, actually the false positive rate of these flags is practically zero. (I'm not sure if it's 100% zero, but I used those extensively, reported many bugs and every time a developer told me "this is a false positive" they were wrong.)

The reason they aren't enabled by default is that that's not what they're designed for. They have a significant performance impact, you can't enable them all at once, they conflict with other security features and they may introduce security issues.

These are developer features. They aren't there to run your production code, they are there to test during development and bug finding.

3 more replies

flohofwoe6y ago

In my experience both clang ASAN and UBSAN (and TSAN, the thread sanitizer) are very solid tools, IFIR I haven't seen a false positive yet (of course some code may be specifically written to rely on undefined behaviour, but of course that's a mine field).

On the other hand, the clang static analyzer may have a shocking amount of messages when run first on a large existing code base, and some of those warnings can be considered more "opinions" than warnings. It's still makes sense and is very rewarding to make a code base "static analyzer clean".

The runtime sanitizers in comparison are very precise and always pointed to actual "sleeper bugs", it's almost definitely a good idea to use them and take their warnings serious.

But anyway, clang ASAN, UBSAN, TSAN and the static analyzer are all really excellent and important tools for everybody writing C or C++ code.

PS: the reason why those checks are optional is that they increase compilation time (sometimes dramatically, like 10x slower compilation or more), and they add runtime instrumentation code which both increases the executables size and decreases performance dramatically (also 2..10x times or more, although the clang sanitizers are really quite fast compared to other solutions).

1 more reply

wolf550e6y ago

When most people who know C learned C, this technology didn't exist. Currently John Regehr teaches his students that they must use sanitizers.

Note that ruining sanitizers in prod might be insecure. They're for development.

1 more reply

jzwinck6y ago

UBSan and ASan have essentially zero false positives. They do point out undefined behavior which happens to work on your platform, but at the very least those are still portability bugs.

Liquid_Fire6y ago

This is a runtime check and so has a (small) performance overhead.

mulle_natOP6y ago

That's true. I am using memory sanitizers in my workflow, but I haven't been using the `undefined` sanitizer. This could have saved me a day worth of effort.

gameswithgo6y ago

or just start moving away from using c. the last five years have brought nice alternatives like zig and rust

gnulinux6y ago

So start over 40 years of progress? Everything in my OS is written in C. Why am I always told not to use this language in HN if it's what my computer runs on. It's dangerous, sure, but seems like as a software engineer it's something I need to learn instead of run away from.

2 more replies

pjmlp6y ago

Not only the last 5 years, but I digress.

1 more reply

benibela6y ago

That is why I use Pascal

20 years ago it was advertised as the safe C alternative.

0xdead6y ago

No. I'd rather learn how to use my tools (that I have invested years to be comfortable with) properly than starting over.

2 more replies

ThreeFx6y ago· 10 in thread

A precise integer value is only guaranteed to be representable losslessly in a double if it is up to `64 - 1 (sign) - 11 (exponent) = 52` bits in magnitude.

This should be fairly obvious with knowledge about how floating point numbers are represented internally IMO.

Edit: Be more precise about what can be represented.

stephencanon6y ago

Perhaps it should be, but every 53 bit integer is exactly representable in double, because there's an implicit leading significand bit in the representation.

It's also worth noting that every finite double with magnitude larger than 2^52 has a precise integer value; it's just that once you get beyond 2^53, not every integer is representable.

ThreeFx6y ago

Yes you're right - thanks for clarifying. I meant to say that not every integer in magnitude greater than 52 bit has an exact floating point representation in IEEE doubles.

aidenn06y ago

IIRC not every 53-bit integer is representable in double, since float has two zero representations, but twos-complement integers have only one.

[edit]

Since the extra value is precisely a power of 2 (-2^52), then it will round correctly, however the value is arguably not precisely -2^52 since it has an epsilon of greater than 1.

1 more reply

yoz-y6y ago

I've seen quite a lot of errors stemming up from assumptions about floating point numbers. Not sure there is a good way of handling this in the end, except exert caution. Even basic assumptions like f + 1 > f will have this issue.

magicalhippo6y ago

The problem is that in most languages, to novices floating point numbers swim like a duck, quack like a duck, but it turns out they're alligators.

A good way to get around this is reading https://floating-point-gui.de/ to weed out any preconceptions, but yeah it's difficult to steer novices there without them stepping on one of the pitfalls first.

stephencanon6y ago

Note that f + 1 > f can also fail for integers in many languages (e.g. it does not hold for signed integers in C or C++, because the behavior is undefined when you add 1 to INT_MAX, and unsigned integers always wrap around). This particular gotcha is not unique to floating-point.

1 more reply

andrepd6y ago

i+1>i breaks even for unsigneds

2 more replies

patrec6y ago

Well, it may seem fairly obvious but it's also wrong.

syockit6y ago

When doing floating point arithmetic on the x86 though, it can extend to `80 - 1 (sign) - 15 (exponent) = 64` bits. So if the result of a floating point just so happens to have zero exponent, the mantissa can fit just right in a long int.

stephencanon6y ago

This hasn't been dependably true on x86 for almost two decades. SSE2 does double-precision computation at native width, not in the extended 80-bit format. Some 32b compilers still use 80-bit x87, but almost no 64b compilers do so.

1 more reply

0xff00ffee6y ago· 9 in thread

Hoo boy. This is what happens the first time C programmers start to work with floating point and don't know the fundamentals.

When you work with floating point, you need to remember you work with a tolerance to epsilon for comparisons because you are rounding to 1/n^2 precision and different floating point units perform the conversion in different ways.

You must abandon the idea of '==' for floats.

This is why his code is unpredictable, because you cannot guarantee the conversion of any integer to and from float is the same number. Period. The LSBs of the mantissa can and do change, which is why we mask to a precision or use signal-to-noise comparisons when evaluation bit drift between FP computations.

He has the first part correct, < and > are your friend with FP. But to get past the '==' hurdle, he needs to define his tolerance, the code should be something like:

if (fabs(f1 - f2) > TOLERANCE) ... fits = true.

I was irked by his arrogance when he asks, "Intel CPUs have a history of bugs. Did I hit one of those?" First, learn about floating point, then, work on an FPUnit team for 10 years, and even then, don't assume you're smarter than a team of floating point architects, you're not.

titzer6y ago

Floating point is indeed hard. It's made harder because the C programming language, due to its history supporting a large number of targets, especially for high performance computing, does not even mandate IEEE 754. (C predated IEEE 754). IEEE 754 is actually very precise about rounding, rounding modes, etc. The x87 floating point coprocessor is also heavily to blame, because of its internal 80-bit precision. Decades of headaches. Other headaches come from ARM NEON's vector instructions not implementing subnormals (RTZ mode), which is not IEEE, which makes vector math differ from scalar math in corner cases. GPUs also went through a similar evolution. Slowly the industry is moving to all-IEEE 754 compliant arithmetic. There's a lot more to say, but yes, I agree with you, it's complicated.

Gibbon16y ago

Yeah and then we have the problem that most floating point variables shouldn't be IEEE 754. And we're mostly fucked because most languages don't support anything else.

1 more reply

mark-r6y ago

> you cannot guarantee the conversion of any integer to and from float is the same number.

Totally false. Any integer between -(2^53) and 2^53 can be converted to IEEE 64-bit double without any loss of information.

coldtea6y ago

Read the parent again.

Those are a specific integer range, not "any integer".

1 more reply

chungus_khan6y ago

Absolutely. It's true that Intel CPUs have a history of bugs, and it's always a danger when working with any CPU. But unless you really, really know what you're doing you probably didn't, especially if you are doing something relatively basic and think you've somehow uncovered a new bug.

slavik816y ago

Choosing a tolerance is hard enough when you know the floating point model you're using, but it seems like an impossible task to try to support all possible floating point hardware. I couldn't tell you what guarantees you have when __STDC_IEC_559__ remains undefined.

These days I write my floating point code assuming float is a 32-bit IEEE 754 floating point number, and double is 64-bit. You can get those guarantees on any desktop hardware with the right flags, and the same semantics are commonly available on other platforms too.

Picking a well-defined implementation makes it much easier to reason about conversions between integer and floating point types. In fact, it allows you to reason about a lot of operations, e.g. 1.0 + 2.0 == 3.0 is true; (float)183 == 183.0 is true; 0.1 == (double)0.1f is false; etc.

ambrop76y ago

You don't always need epsilon. I've written lots of floating point code that works well without epsilon checks. See my comment in this thread (double_to_uint64) how what the OP needs can be done correctly without an epsilon check.

1 more reply

thanatropism6y ago

How is == still defined for floats (in typeful languages)? Fixed precision decimals should be on offer and suggested by compilers in response to an equality-on-floats.

ambrop76y ago

Each floating point value that is not a NaN represents a certain real number, -inf or +inf (this can be expressed in terms of the sign bit, exponent and mantissa). Knowing that, a == b when neither operand is a NaN is defined as equality of what they represent, in purely mathematical terms. Similar can be said for inequality operators.

Be aware that +0.0 and -0.0 are different floating point values but represent the same real number, so +0.0 == -0.0 follows.

People who say == means nothing for floating point and you always need epsilon checks are wrong, plain and simple. == is very well defined. Don't confuse the definition of floating point operations with common practices for using them effectively.

You can iterate through all non-NaN values and check that successive ones are indeed not equal:

    #include <math.h>
    #include <stdint.h>
    #include <assert.h>
    #include <stdio.h>
    #include <inttypes.h>

    int main()
    {
        float x = (float)-INFINITY;
        uint64_t count = 1;
        while (x != (float)INFINITY) {
            float y = nextafterf(x, (float)INFINITY);
            assert(y != x);
            x = y;
            ++count;
        }
        printf("Found %" PRIu64 " floats.\n", count);
        return 0;
    }
    
    $ gcc -std=c99 -O3 a.c -lm -o a
    $ ./a
    Found 4278190081 floats.

(a little bit harder for doubles)

Interestingly, this only finds one zero (-0.0), hence the assert doesn't actually fail around zero.

1 more reply

mokus6y ago· 7 in thread

The title is actually wrong - the -O0 version is correct, the -O3 version is not (despite giving the output the author expected).

Casting the value to double ends up converting the long value 0x7fffffffffffffff to the nearest double value: 0x8000000000000000. As the -O0 version CORRECTLY reports, this does not round-trip back to the same value in the "long" type. Many other values, though not all, down to about 1/1024 of that value (1 / 2^(63-53)) will also fail to round-trip for similar reasons.

Unless my coffee-deficient brain is missing something at the moment, it should be the case that any integer with 53 bits or fewer between the first and last 1 bit (inclusive) will roundtrip cleanly. Any other integer will not.

Edit: fixed a typo above, and coded up the idea I expected to work and ran it through quickcheck for a few min, and this version seems to be correct ('int' return rather than bool is just because haskell's ffi doesn't natively support C99 bool):

    #include <limits.h>
    
    int fits(long x) {
      if (x == LONG_MIN) return 0;
    
      unsigned long ux = x < 0 ? -x : x;
      while (ux > 0x1fffffffffffffUL && !(ux & 1)) {
        ux /= 2;
      }
    
      return ux <= 0x1fffffffffffffUL;
    }

OskarS6y ago

So the problem here is that in this line:

      if( value < (double) LONG_MIN || value > (double) LONG_MAX)
          return( 0);

The cast of LONG_MAX to double rounds upwards (which is allowed by the standard, which says that rounding direction is "implementation defined"), so that "value > (double) LONG_MAX" is false, right? Even though the mathematical value of "value" is larger than LONG_MAX?

Which then leads to this line:

   l_val = (long) value;

Where value is cast to a long despite being outside of the range of longs, thus causing undefined behavior. So to be clear, BOTH of the -O0 and -O3 versions are correct, since both invoke undefined behaviour.

When -O0 and -O3 give different results, either at least one of them is incorrect and you've stumbled on a compiler bug, or both of them are correct and you're invoking UB (the far more common situation).

EDIT: no, I think I misunderstood it: it's not that value is larger than (double) LONG_MAX, it IS (double)LONG_MAX, so of course "value > (double)LONG_MAX" is false.

The problem is that the "(long)((double)LONG_MAX))" is undefined behaviour on implementations that round (double)LONG_MAX upwards instead of downwards. Which is allowed by the standard. Ok, cool :)

fargle6y ago

This ^^^^ is the core issue. It's insidious that LONG_MAX is too big to be exactly representable as a double.

The answer by ambrop7 below solves a different problem but appears to be correct. The reason escaped me at first and is super subtle. The trick is that LONG_MAX is 2^63 - 1, not 2^63. And the subtlety is that 2^63 is guaranteed to be exactly representable in IEEE double because it is an even power of 2, which 2^63-1 is not.

I don't care much for runtime ldexp() anyhow. So I'd be tempted to just pre-compute the exact limits -2^63 and nextafter(+2^63, 0) and encode them as doubles manually (omitting some #if method of portably determining the width of LONG):

  #define FLONG_MIN 9223372036854775808.0 // exact -2^63
  #define FLONG_MAX 9223372036854774784.0 // exact nextbefore(2^63)

  if (value < FLONG_MIN || value > FLONG_MAX)
     return(0);

Then the UB is avoided. I think the rest of the module works as is. Now, I question whether the author really wants integers between 2^53 and 2^63 to sparsely return true. So it might be better to just change the whole design to use +-2^53 as a hard limit, for trivially guaranteed round trip of the entire range and dispense with these nasty edge cases.

Joker_vD6y ago

Huh. So, how does one check that a cast from double to long would succeed in C? Just go and do lround()+errno check?

2 more replies

raverbashing6y ago

> converting the long value 0x7fffffffffffffff to the nearest double value: 0x8000000000000000

From what I understand from the spec it should be the nearest in value, no? Not the nearest in memory representation.

eMSF6y ago

Who said anything about memory representations?

The language spec (or at least a summary of it) is linked in the article, and it is pretty loose: nearest higher or nearest lower, chosen by the implementation (regardless of which is nearer).

IEEE double-precision float cannot store LONG_MAX (which is 2^63-1 with 8-byte longs) precisely, so it gets converted to 2^63; which you cannot cast back to a long, because it doesn't fit (resulting in undefined behaviour).

gok6y ago

OP wants the opposite; they're converting from double to long.

Someone6y ago

“Unless my coffee-deficient brain is missing something at the moment”

Add enough zeroes, and you’ll run out of exponent range.

dfranke6y ago· 6 in thread

Clear your floating point exception register by calling feclearexcept(FE_ALL_EXCEPT). Convert to long by calling lrint(rint(x)). Then check your exception register using fetestexcept(). FE_INEXACT will indicate that the input wasn't an integer, and FE_INVALID will indicate that the result doesn't fit in a long.

Edit: check for me whether just calling lrint(x) works. The manpage doesn't specify that lrint() will set FE_INEXACT, but it seems weird to me that it wouldn't.

inetknght6y ago

As someone who's had to read C and C++ code using `double`, it's been a few years since I've heard of `feclearexcept` and how important it is.

Great, thanks, now I have to go back and restart some of those code reviews I've been doing of certain third party matrix math libraries...

clarry6y ago

> The manpage doesn't specify that lrint() will set FE_INEXACT, but it seems weird to me that it wouldn't.

Annex F:

The lrint and llrint functions provide floating-to-integer conversion as prescribed by IEC 60559. They round according to the current rounding direction. If the rounded value is outside the range of the return type, the numeric result is unspecified and the ''invalid'' floating-point exception is raised. When they raise no other floating-point exception and the result differs from the argument, they raise the ''inexact'' floating-point exception.

dfranke6y ago

Thanks. I should file a bug about this against the Linux man-pages project.

pjc506y ago

I had no idea this feature existed! Does it behave usefully in a multithreaded context?

clarry6y ago

Yes it does. N1570 7.6:

> The floating-point environment has thread storage duration. The initial state for a thread's floating-point environment is the current state of the floating-point environment of the thread that creates it at the time of creation.

dfranke6y ago

I think it uses thread-local storage like errno does, but I'd have to verify.

mojuba6y ago· 5 in thread

    if( ! fits)

Why this (constently) terrible formatting though? Never seen anyone using this style.

petee6y ago

Gotta say, without seeing it in context, it's flat-out clear what it means and non ambiguous

mojuba6y ago

Sure, but it's anti-mathematical let's say :) Left and right brackets should have symmetrical formatting. Never seen a style with such asymmetry, it's pretty odd.

1 more reply

inetknght6y ago

Ahh you ought to see my style. I've been told it's unique and quite ugly. Linters tend to very much dislike it. Nonetheless my style has a purpose to me and I'm sure so does the author's.

mojuba6y ago

I think glueing punctuation to the previous word but always having a space before the next one makes the words more readable. Probbaly makes sense although you always need research to back up such claims. For example there's research that says snake_case is more readable than camelCase (and yet most languages encourage camelCase for some reason)

rootlocus6y ago

That's perfectly fine if you're working alone.

CGamesPlay6y ago· 2 in thread

How did you decide that "the method works for LONG_MIN"? Did the method return the expected output of false? Because it really seems like the code is working correctly on `-O0` and incorrectly on `-O3`...

eMSF6y ago

Why would you expect the output to be false? On any typical system around (with 8-byte longs), LONG_MIN has the value of -2^63 which (converted to a IEEE double-precision float) passes the function's checks just fine -- even if the values around it don't.

CGamesPlay6y ago

Ah, my mistake. Still, seems like the -O0 is actually what's correct and the -O3 is reporting an incorrect answer.

pjc506y ago· 1 in thread

> When something very basic goes wrong, I have this hierarchy of potential culprits:

I don't know if this is supposed to be a joke or part of the setup for an explanatory post about undefined behaviour, but that list is in exactly the wrong order.

scoutt6y ago

I agree, but I'd also say that silicon bugs are rarer, so I put them at the end of the list.

NullPrefix6y ago· 1 in thread

The frst rule of floating point comparison is you do not compare them for equality, but instead calculate the difference and check if the difference is less than epsilon.

ska6y ago

This is super common advice but it is generally wrong, at least the second part.

Comparing floats is more subtle than most programmers realize, and there really isn't a one-size-fits-all solution.

Things to consider due to the nature of fp representation - comparing results close to zero is different (i.e. "is a small" needs a differen test than "are a & b close"

- the distance between fp numbers depends on their magnitude, so comparing two large numbers to each other shouldn't have the same bounds as comparing two numbers near 1, say[1].

- if you aren't quite careful you can easily create tests where a == b but b != a , which can cause sorting issues, etc.

Hand-wavily speaking if you want to do this "right", you should probably look at doing the analysis in ULP (units in last place) rather than directly on the floats. Don't do it for values near zero though. And have a fast path for differently signed values.

The above doesn't even get into denormalized values.

[1] note that what people usually mean for epsilon is the version of machine epsilon that is the difference between 1 and the next representable float above 1 [2]. So by definition this is smaller than the representable difference between any two numbers in larger decades

[2] MS .NET somewhat confusingly defines Epsilon as the smallest representable normalized number.

ginko6y ago· 1 in thread

>I am still looking for a better way to check, if a double will convert cleanly to an integer of the same size or not.

I'd say the cleanest would be to decode exponent and mantissa, check if the exponent is within the 64-bit limit of long, then check if there's any bits set below the decimal point. (+plus some extra care for two's complement negative numbers)

The problem with this is of course that this would be platform dependent.

clarry6y ago

I'd say the correct way to is to use lrint or lround and check for errors the standard way.

apta6y ago· 1 in thread

This is why using safe languages is important. Even frequent users of C and C++ end up making mistakes that are difficult to track down.

syockit6y ago

I'm not sure what you mean by safe in this context. A language that forces type casts to be explicit? That throws runtime error when invalid/imprecise cast is done?

g829186y ago

To save some time for new readers, the author is unfamiliar with floating point representation and thinks that a double precision number since it is 64 bits can hold any 64 bit integer and is somewhat confused as to what an xmm register can hold(they believe that it has 128 bits of precision instead of being able to hold 2 64-bit doubles, or 4 32-bit singles). They attempt to find the issue a few ways. The correct solution is not to convert any integer larger than 2^53 in absolute value since only integers that large can be successfully converted to double and back( aside from a few others that exist sparsely).

heftig6y ago

SSE xmm registers might be 128 bits wide, but the precision is still 64 bits. The additional (high) bits are zeroed out.

What you're seeing is not excess precision due to wide registers but excess precision due to optimization and constant propagation, which means GCC calculates a fast path for (argc == 1) that doesn't round correctly and ends up with "it fits".

Interestingly it does optimize to the correct "doesn't fit" with -mfpmath=387 -fexcess-precision=standard, so I guess this is a bug in how GCC treats SSE math. The sanitizer (-fsanitize=float-cast-overflow) also notices the problem.

yuriko6y ago

Based on my experience, this title is a strong hint that some undefined behaviour is triggered.

mulle_natOP6y ago

With the help of your comments, I could now write the conclusion to my article. In a nutshell this is the solution:

    #include <math.h>
    #include <fenv.h>


    int   fits_long( double d)
    {
       long     l_val;
       double   d_val;
 
    // may be needed ?
    // #pragma STDC FENV_ACCESS ON
 
       feclearexcept( FE_INVALID);   
       l_val = lrint( d);            
       d_val = (double) l_val;       
       if( fetestexcept( FE_INVALID))
          return( 0);
 
       return( d_val == d);
    }

The article explains it in more detail. Thanks for the help.

Aardwolf6y ago

The most surprising thing for me out of this is that casting a high positive integer to double will output the nearest double which could be higher, not the highest one smaller than or equal to the integer value.

Is there a way to get the largest double smaller or equal than some positive integer?

correct_horse6y ago

> When something very basic goes wrong, I have this hierarchy of potential culprits: the compiler buggy hardware OS vendor last and least me, because I don’t make mistakes :)

I really dislike the arrogant programmer trope. Can we all stop?

adammunich6y ago

I had something similar happen but with GCC generating an internal compiler error and just plain failing. Still haven't figured out why.

syockit6y ago

I'd say just put volatile and be done with it. Now your -O3 will also break, but at least it's consistent with -O0 :p

j / k navigate · click thread line to collapse

159 comments

77 comments · 19 top-level

hannob6y ago· 15 in thread

This may be an instance of "you should really know the gcc/clang sanitizers and use them to test your code":

clang test.c -O0 -fsanitize=undefined

./a.out

[...]

test.c:17:12: runtime error: 9.22337e+18 is outside the range of representable values of type 'long'

Interestingly gcc doesn't throw that warning.

htfy966y ago

I Second this. Most weird behaviors in C++ today can be detected by ASAN and UBSAN.

pjmlp6y ago

According to most surveys, they aren't used as much on real life.

Here 14%, https://www.jetbrains.com/lp/devecosystem-2019/cpp/

Here 40 - 55%, https://www.bfilipek.com/2019/12/cpp-status-2019.html

At CppCon 2015 or something, at Herb's question during his keynote, about 1% of the audience as per his comment on the video.

1 more reply

ndesaulniers6y ago

tyingq6y ago

Does that verifier have a strong possibility of false positives? I'm curious why C compilers have such a strong history of making reasonable checks optional and hidden behind a bunch of switches.

hannob6y ago

These are developer features. They aren't there to run your production code, they are there to test during development and bug finding.

3 more replies

flohofwoe6y ago

The runtime sanitizers in comparison are very precise and always pointed to actual "sleeper bugs", it's almost definitely a good idea to use them and take their warnings serious.

But anyway, clang ASAN, UBSAN, TSAN and the static analyzer are all really excellent and important tools for everybody writing C or C++ code.

1 more reply

wolf550e6y ago

When most people who know C learned C, this technology didn't exist. Currently John Regehr teaches his students that they must use sanitizers.

Note that ruining sanitizers in prod might be insecure. They're for development.

1 more reply

jzwinck6y ago

UBSan and ASan have essentially zero false positives. They do point out undefined behavior which happens to work on your platform, but at the very least those are still portability bugs.

Liquid_Fire6y ago

This is a runtime check and so has a (small) performance overhead.

mulle_natOP6y ago

That's true. I am using memory sanitizers in my workflow, but I haven't been using the `undefined` sanitizer. This could have saved me a day worth of effort.

gameswithgo6y ago

or just start moving away from using c. the last five years have brought nice alternatives like zig and rust

gnulinux6y ago

2 more replies

pjmlp6y ago

Not only the last 5 years, but I digress.

1 more reply

benibela6y ago

That is why I use Pascal

20 years ago it was advertised as the safe C alternative.

0xdead6y ago

No. I'd rather learn how to use my tools (that I have invested years to be comfortable with) properly than starting over.

2 more replies

ThreeFx6y ago· 10 in thread

A precise integer value is only guaranteed to be representable losslessly in a double if it is up to `64 - 1 (sign) - 11 (exponent) = 52` bits in magnitude.

This should be fairly obvious with knowledge about how floating point numbers are represented internally IMO.

Edit: Be more precise about what can be represented.

stephencanon6y ago

Perhaps it should be, but every 53 bit integer is exactly representable in double, because there's an implicit leading significand bit in the representation.

It's also worth noting that every finite double with magnitude larger than 2^52 has a precise integer value; it's just that once you get beyond 2^53, not every integer is representable.

ThreeFx6y ago

Yes you're right - thanks for clarifying. I meant to say that not every integer in magnitude greater than 52 bit has an exact floating point representation in IEEE doubles.

aidenn06y ago

IIRC not every 53-bit integer is representable in double, since float has two zero representations, but twos-complement integers have only one.

[edit]

Since the extra value is precisely a power of 2 (-2^52), then it will round correctly, however the value is arguably not precisely -2^52 since it has an epsilon of greater than 1.

1 more reply

yoz-y6y ago

magicalhippo6y ago

The problem is that in most languages, to novices floating point numbers swim like a duck, quack like a duck, but it turns out they're alligators.

stephencanon6y ago

1 more reply

andrepd6y ago

i+1>i breaks even for unsigneds

2 more replies

patrec6y ago

Well, it may seem fairly obvious but it's also wrong.

syockit6y ago

stephencanon6y ago

1 more reply

0xff00ffee6y ago· 9 in thread

Hoo boy. This is what happens the first time C programmers start to work with floating point and don't know the fundamentals.

You must abandon the idea of '==' for floats.

He has the first part correct, < and > are your friend with FP. But to get past the '==' hurdle, he needs to define his tolerance, the code should be something like:

if (fabs(f1 - f2) > TOLERANCE) ... fits = true.

titzer6y ago

Gibbon16y ago

Yeah and then we have the problem that most floating point variables shouldn't be IEEE 754. And we're mostly fucked because most languages don't support anything else.

1 more reply

mark-r6y ago

> you cannot guarantee the conversion of any integer to and from float is the same number.

Totally false. Any integer between -(2^53) and 2^53 can be converted to IEEE 64-bit double without any loss of information.

coldtea6y ago

Read the parent again.

Those are a specific integer range, not "any integer".

1 more reply

chungus_khan6y ago

slavik816y ago

ambrop76y ago

1 more reply

thanatropism6y ago

How is == still defined for floats (in typeful languages)? Fixed precision decimals should be on offer and suggested by compilers in response to an equality-on-floats.

ambrop76y ago

Be aware that +0.0 and -0.0 are different floating point values but represent the same real number, so +0.0 == -0.0 follows.

You can iterate through all non-NaN values and check that successive ones are indeed not equal:

    #include <math.h>
    #include <stdint.h>
    #include <assert.h>
    #include <stdio.h>
    #include <inttypes.h>

    int main()
    {
        float x = (float)-INFINITY;
        uint64_t count = 1;
        while (x != (float)INFINITY) {
            float y = nextafterf(x, (float)INFINITY);
            assert(y != x);
            x = y;
            ++count;
        }
        printf("Found %" PRIu64 " floats.\n", count);
        return 0;
    }
    
    $ gcc -std=c99 -O3 a.c -lm -o a
    $ ./a
    Found 4278190081 floats.

(a little bit harder for doubles)

Interestingly, this only finds one zero (-0.0), hence the assert doesn't actually fail around zero.

1 more reply

mokus6y ago· 7 in thread

The title is actually wrong - the -O0 version is correct, the -O3 version is not (despite giving the output the author expected).

    #include <limits.h>
    
    int fits(long x) {
      if (x == LONG_MIN) return 0;
    
      unsigned long ux = x < 0 ? -x : x;
      while (ux > 0x1fffffffffffffUL && !(ux & 1)) {
        ux /= 2;
      }
    
      return ux <= 0x1fffffffffffffUL;
    }

OskarS6y ago

So the problem here is that in this line:

      if( value < (double) LONG_MIN || value > (double) LONG_MAX)
          return( 0);

Which then leads to this line:

   l_val = (long) value;

EDIT: no, I think I misunderstood it: it's not that value is larger than (double) LONG_MAX, it IS (double)LONG_MAX, so of course "value > (double)LONG_MAX" is false.

The problem is that the "(long)((double)LONG_MAX))" is undefined behaviour on implementations that round (double)LONG_MAX upwards instead of downwards. Which is allowed by the standard. Ok, cool :)

fargle6y ago

This ^^^^ is the core issue. It's insidious that LONG_MAX is too big to be exactly representable as a double.

  #define FLONG_MIN 9223372036854775808.0 // exact -2^63
  #define FLONG_MAX 9223372036854774784.0 // exact nextbefore(2^63)

  if (value < FLONG_MIN || value > FLONG_MAX)
     return(0);

Joker_vD6y ago

Huh. So, how does one check that a cast from double to long would succeed in C? Just go and do lround()+errno check?

2 more replies

raverbashing6y ago

> converting the long value 0x7fffffffffffffff to the nearest double value: 0x8000000000000000

From what I understand from the spec it should be the nearest in value, no? Not the nearest in memory representation.

eMSF6y ago

Who said anything about memory representations?

The language spec (or at least a summary of it) is linked in the article, and it is pretty loose: nearest higher or nearest lower, chosen by the implementation (regardless of which is nearer).

gok6y ago

OP wants the opposite; they're converting from double to long.

Someone6y ago

“Unless my coffee-deficient brain is missing something at the moment”

Add enough zeroes, and you’ll run out of exponent range.

dfranke6y ago· 6 in thread

Edit: check for me whether just calling lrint(x) works. The manpage doesn't specify that lrint() will set FE_INEXACT, but it seems weird to me that it wouldn't.

inetknght6y ago

As someone who's had to read C and C++ code using `double`, it's been a few years since I've heard of `feclearexcept` and how important it is.

Great, thanks, now I have to go back and restart some of those code reviews I've been doing of certain third party matrix math libraries...

clarry6y ago

> The manpage doesn't specify that lrint() will set FE_INEXACT, but it seems weird to me that it wouldn't.

Annex F:

dfranke6y ago

Thanks. I should file a bug about this against the Linux man-pages project.

pjc506y ago

I had no idea this feature existed! Does it behave usefully in a multithreaded context?

clarry6y ago

Yes it does. N1570 7.6:

dfranke6y ago

I think it uses thread-local storage like errno does, but I'd have to verify.

mojuba6y ago· 5 in thread

    if( ! fits)

Why this (constently) terrible formatting though? Never seen anyone using this style.

petee6y ago

Gotta say, without seeing it in context, it's flat-out clear what it means and non ambiguous

mojuba6y ago

Sure, but it's anti-mathematical let's say :) Left and right brackets should have symmetrical formatting. Never seen a style with such asymmetry, it's pretty odd.

1 more reply

inetknght6y ago

Ahh you ought to see my style. I've been told it's unique and quite ugly. Linters tend to very much dislike it. Nonetheless my style has a purpose to me and I'm sure so does the author's.

mojuba6y ago

rootlocus6y ago

That's perfectly fine if you're working alone.

CGamesPlay6y ago· 2 in thread

eMSF6y ago

CGamesPlay6y ago

Ah, my mistake. Still, seems like the -O0 is actually what's correct and the -O3 is reporting an incorrect answer.

pjc506y ago· 1 in thread

> When something very basic goes wrong, I have this hierarchy of potential culprits:

I don't know if this is supposed to be a joke or part of the setup for an explanatory post about undefined behaviour, but that list is in exactly the wrong order.

scoutt6y ago

I agree, but I'd also say that silicon bugs are rarer, so I put them at the end of the list.

NullPrefix6y ago· 1 in thread

The frst rule of floating point comparison is you do not compare them for equality, but instead calculate the difference and check if the difference is less than epsilon.

ska6y ago

This is super common advice but it is generally wrong, at least the second part.

Comparing floats is more subtle than most programmers realize, and there really isn't a one-size-fits-all solution.

Things to consider due to the nature of fp representation - comparing results close to zero is different (i.e. "is a small" needs a differen test than "are a & b close"

- the distance between fp numbers depends on their magnitude, so comparing two large numbers to each other shouldn't have the same bounds as comparing two numbers near 1, say[1].

- if you aren't quite careful you can easily create tests where a == b but b != a , which can cause sorting issues, etc.

The above doesn't even get into denormalized values.

[2] MS .NET somewhat confusingly defines Epsilon as the smallest representable normalized number.

ginko6y ago· 1 in thread

>I am still looking for a better way to check, if a double will convert cleanly to an integer of the same size or not.

The problem with this is of course that this would be platform dependent.

clarry6y ago

I'd say the correct way to is to use lrint or lround and check for errors the standard way.

apta6y ago· 1 in thread

This is why using safe languages is important. Even frequent users of C and C++ end up making mistakes that are difficult to track down.

syockit6y ago

I'm not sure what you mean by safe in this context. A language that forces type casts to be explicit? That throws runtime error when invalid/imprecise cast is done?

g829186y ago

heftig6y ago

SSE xmm registers might be 128 bits wide, but the precision is still 64 bits. The additional (high) bits are zeroed out.

yuriko6y ago

Based on my experience, this title is a strong hint that some undefined behaviour is triggered.

mulle_natOP6y ago

With the help of your comments, I could now write the conclusion to my article. In a nutshell this is the solution:

    #include <math.h>
    #include <fenv.h>


    int   fits_long( double d)
    {
       long     l_val;
       double   d_val;
 
    // may be needed ?
    // #pragma STDC FENV_ACCESS ON
 
       feclearexcept( FE_INVALID);   
       l_val = lrint( d);            
       d_val = (double) l_val;       
       if( fetestexcept( FE_INVALID))
          return( 0);
 
       return( d_val == d);
    }

The article explains it in more detail. Thanks for the help.

Aardwolf6y ago

Is there a way to get the largest double smaller or equal than some positive integer?

correct_horse6y ago

> When something very basic goes wrong, I have this hierarchy of potential culprits: the compiler buggy hardware OS vendor last and least me, because I don’t make mistakes :)

I really dislike the arrogant programmer trope. Can we all stop?

adammunich6y ago

I had something similar happen but with GCC generating an internal compiler error and just plain failing. Still haven't figured out why.

syockit6y ago

I'd say just put volatile and be done with it. Now your -O3 will also break, but at least it's consistent with -O0 :p

j / k navigate · click thread line to collapse