From the comment in protobuf source (which does the same thing as Python), mentioned in the Twitter thread:
(...) An arguably better strategy would be to use the algorithm described in "How to Print Floating-Point Numbers Accurately" by Steele & White, e.g. as implemented by David M. Gay's dtoa(). It turns out, however, that the following implementation is about as fast as DMG's code. Furthermore, DMG's code locks mutexes, which means it will not scale well on multi-core machines. DMG's code is slightly more accurate (in that it will never use more digits than necessary), but this is probably irrelevant for most users.
Rob Pike and Ken Thompson also have an implementation of dtoa() in third_party/fmt/fltfmt.cc. Their implementation is similar to this one in that it makes guesses and then uses strtod() to check them. (...)
https://github.com/protocolbuffers/protobuf/blob/ed4321d1cb3...
The C/C++ standards do not require formatting to round correctly or even be portable. I recently had an issue where a developer used this method to round floats for display, and there were differences on PC and on Mac. It literally rounded something like 18.25 to 18.2 on one platform and 18.3 on the other. This led to all sorts of other bugs as some parts of the program used text to transmit data, which ended up in weird states.
The culprit was this terrible method. If you want anything approaching consistency or predictability, do not use formatting to round floating point numbers. Pick a numerically stable method, which will be much faster if done correctly.
Coincidentally, C/C++ do not require any of their formatting and parsing routines to round-trip floating point values correctly (except the newly added hex formatted floats which are a direct binary representation, and some newly added function allowing an obscure trick I do not recall at the moment... )
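By way of contrast, Python does make round-trip guarantees; a quick illustration (Python rather than C, so not the standards gap described above):

```python
x = 0.1

# Since Python 3.1, repr() emits the shortest decimal string that
# parses back to exactly the same double.
assert float(repr(x)) == x

# float.hex() is a direct binary representation (like C's %a hex floats),
# so it round-trips exactly as well.
assert float.fromhex(x.hex()) == x
print(x.hex())  # 0x1.999999999999ap-4
```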
The linked-to method uses PyOS_snprintf(). Its documentation at https://docs.python.org/3/c-api/conversion.html says:
"""PyOS_snprintf() and PyOS_vsnprintf() wrap the Standard C library functions snprintf() and vsnprintf(). Their purpose is to guarantee consistent behavior in corner cases, which the Standard C functions do not."""
Consider that a float's internal representation is in base 2, and you're trying to round in base 10. Even if you didn't use a string, I'd assume you'd have to create an array of ints that contain base 10 digits in order to do the rounding, unless there are some weird math tricks that can be employed that can avoid you having to process all base 2 digits. And an array of ints isn't all that computationally different from a string.
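Python's decimal module can illustrate the point: constructing a Decimal from a float materializes the exact base-10 digits of the underlying binary value, and there are a lot of them even for "0.1".

```python
from decimal import Decimal

# A double is a binary fraction; Decimal(float) converts it exactly,
# exposing every base-10 digit of the nearest double to 0.1.
d = Decimal(0.1)
print(d)  # 0.1000000000000000055511151231257827021181583404541015625
```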
(Realistically, calling wordexp should just abort the program. Now I actually want to make a hacked up musl that aborts in all the various "libc functions no one should ever use" and see how far I get into a Ubuntu boot..)
Perhaps from the perspective of an end user running things from a shell. Generally speaking though, shelling out from within a program is not ideal.
/* XXX this is _not_ designed to be fast */

Laughing so hard :')
And imagine the debug errors:
>perl error X
"But I'm just calling libc ?!?"
$ otool -L /usr/bin/perl
/usr/bin/perl:
/System/Library/Frameworks/CoreFoundation.framework/Versions/A/CoreFoundation (compatibility version 150.0.0, current version 1663.0.0)
/usr/lib/libSystem.B.dylib (compatibility version 1.0.0, current version 1281.0.0)

I always implemented round to a specific digit based on the built-in roundss/roundsd functions, which are native x86-64 assembler instructions (i.e. https://www.felixcloutier.com/x86/roundsd).
I do not understand why this would not be preferable to the string method.
float round( float x, int digits, int base) { float factor = pow( base, digits ); return roundss( x * factor ) / factor; }
I guess this has the effect of not working for numbers near the edge of its range.
One could check this and fall back to the string method. Or alternatively use higher precision doubles internally:
float round( float x, int digits, int base ) { double factor = pow( base, digits ); return (float)( roundsd( x * factor ) / factor ); }
But then what do you do if you have a double rounded and want to maintain all precision? I think there is likely some way to do that by somehow unpacking the double into a manual mantissa and exponent each of which are doubles and doing this manually - or maybe using some type of float128 library (https://www.boost.org/doc/libs/1_63_0/libs/multiprecision/do...)...
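The scale-round-unscale idea above can be sketched in Python (a hypothetical helper using doubles, not the roundss/roundsd intrinsics from the comment), including the range-edge failure mentioned:

```python
import math

def round_by_scaling(x: float, digits: int) -> float:
    # Sketch of scale-round-unscale; a real implementation would need a
    # correctly-rounded fallback (e.g. the string method) when scaling fails.
    factor = 10.0 ** digits
    scaled = x * factor
    if math.isinf(scaled):   # scaling overflowed: give up and return x
        return x
    return math.floor(scaled + 0.5) / factor

# Works for ordinary magnitudes:
print(round_by_scaling(2.5, 0))   # 3.0

# Near the top of the double range, x * factor overflows to infinity:
print(math.isinf(1e307 * 100.0))  # True
```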
But changing this implementation now could cause slight differences, and if someone was rounding and then hashing, this type of change could be horrible if not behind some kind of opt-in.
I don’t know of truly fast algorithms for string to float, although I improved upon our CRT’s performance by 40%.
Ryu is more than 100x slower than something like
rval = floor(100*val+0.5)/100.0
(which is not quite right due to numerical issues, but close, and illustrates the idea).
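One concrete way the fast version differs from a correctly-rounded round(): decimal ties. 0.125 is exactly representable in binary, so rounding it to two places is a true tie; CPython's round() breaks ties to even, while floor(100*val+0.5)/100 always rounds up.

```python
import math

def naive_round2(val: float) -> float:
    # The fast approximation from the comment above, fixed at 2 digits.
    return math.floor(100 * val + 0.5) / 100.0

print(round(0.125, 2))      # 0.12 (ties to even)
print(naive_round2(0.125))  # 0.13 (always rounds the tie up)
```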
Formatting, to get a rounded float, is terribly slow.
Humans generally are afraid of new words (especially weird sounding ones) and often will assume that the subject is complex and might intimidate them.
But unknown words can have extremely simple meanings, or be synonyms of already known words.
By asserting "It's actually pretty simple", you give them confidence that there's no reason to be afraid of the topic or of the words.
If someone has some anxiety about not understanding something, telling them it's actually pretty simple can just reinforce the framing they already have going in that maybe they're too dumb to get it.
I've found it's usually better to acknowledge that it's a little difficult or otherwise totally normal not to already know / have grasped the thing in question.
I think the intent is in the right place when saying "it's actually pretty simple" -- you want to provide optimism. The approach I like is along the lines of "this part is a little tricky, so let's break it down."
> It's actually pretty simple. We'll be looking at something called the "D-Wave P-500", which is a version of the P500 chip for quantum computers.
> It's basically a single bit computer, but with more than 500 qubits. Which means that our "real number" will have more numbers than the number of qubits that are available. That's really important.
> Quantum computers are theoretically able to do more things than just solve equations. For example, the way that a quantum computer uses energy from an electron to solve a classical math problem, or the way that it can break a complex calculation into smaller bits of information that each can solve on its own, is very different from how computers currently work.
> But I am not suggesting that a quantum computer can be used to solve more abstract problems. Because that would be crazy.
> But to give an example of what it could do, imagine doing a number crunching function that was 10× faster than a classical chip, and that had some really useful, and practical things that would be interesting to try.
Because Talk to Transformer is trained on real-world data, this supports the hypothesis that the phrase "It's actually pretty simple" is often followed by an unintelligible and highly technical explanation.
Computers can only natively store integers,
so they need some way of representing decimal numbers.
Really? How do computers "natively" store integers?

> Computers can only natively store integers, so they need some way of representing decimal numbers. This representation comes with some degree of inaccuracy. . . Why does this happen? It's actually pretty simple. When you have a base 10 system (like ours), it can only express fractions that use a prime factor of the base. . .
Perhaps a more formally correct way to put it is that integers (and natural numbers) are the only numbers that a computer can manage in a way that behaves reasonably similarly to the corresponding mathematic set. Specifically, as long as you stick to their numeric range, computer ints behave like a group, just like real integers do. But IEEE floats break the definition of a field in every which way, so they're really not a great match for the rationals.
That said, you could represent the rationals as a pair of integers, and that would be better-behaved, and some programming languages do do that. But I'm not aware of an ISA that supports it directly.
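Python's fractions.Fraction is exactly such a pair-of-integers representation, and it is indeed better-behaved than binary floats for rational arithmetic:

```python
from fractions import Fraction

# The classic float surprise...
assert 0.1 + 0.2 != 0.3

# ...disappears with exact rational arithmetic:
assert Fraction(1, 10) + Fraction(2, 10) == Fraction(3, 10)

# Fraction normalizes, keeping numerator/denominator in lowest terms.
print(Fraction(2, 10))  # 1/5
```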
Perhaps it would be better phrased as "computers can only store whole numbers, so they need some way to represent other information, including integers, rational numbers and approximations to complex numbers."
In some cases, rounding is performed for the primary purpose of displaying a number as a string, in which case it can't be any less complicated than the string conversion function itself.
import math
x = 0.49999999999999994
print(x-0.5)
print(math.floor(x+0.5))
I got these printouts: -5.551115123125783e-17
1
So yes, something less than 1/2, with 1/2 added to it, has a floor of 1 in floating point math.

Yet another reminder that floating point calculations are approximations, and not exact.
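The corner case reproduces directly in CPython 3, whose round() is correctly rounded while the classic floor(x + 0.5) idiom is not:

```python
import math

x = 0.49999999999999994  # the largest double strictly below 0.5

# x + 0.5 rounds up to exactly 1.0 in binary, so the idiom overshoots:
print(math.floor(x + 0.5))  # 1

# round() avoids the intermediate addition and gets it right:
print(round(x))             # 0
```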
Or just explicitly check for 0.5 - 1ulp as a special corner case.
Is there a phrase for the ratio between the frequency of an apparent archetype of a bug/feature and the real-world occurrences of said bug/feature? If not then perhaps the "Fudderson-Hypeman ratio" in honor of its namesakes.
For example, I'm sure every C programmer on here has their favored way to quickly demo what bugs may come from C's null-delimited strings. But even though C programmers are quick to cite that deficiency, I'd bet there's a greater occurrence of C string bugs in the wild. Thus we get a relatively low Fudderson-Hypeman ratio.
On the other hand: "0.1 + 0.2 != 0.3"? I'm just thinking back through the mailing list and issue tracker for a realtime DSP environment that uses single-precision floats exclusively as the numeric data type. My first approximation is that there are significantly more didactic quotes of that example than reports of problems due to the class of bugs that archetype represents.
Does anyone have some real-world data to trump my rank speculation? (Keep in mind that simply replying with more didactic examples will raise the Fudderson-Hypeman ratio.)
I remember even writing a program that tested every possible floating point number (must have only been 32 bit). I think I used ctypes and interpreted every binary combination of 32 bits as a float, turned it into a string, then back and checked equality. A lot of them were NaN.
I seem to recall that ~0.5% of the IEEE 32 bit float space is NaN.
That means there are 24 bits (the sign bit plus the 23 mantissa bits) that we can change while still being either a NaN or an inf. But two of those values are infs, so we need to remove them. Divide that by the entire range and we have (2^24 - 2) / 2^32 = 16777214/4294967296, or about 0.39%.
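The encoding behind that arithmetic can be checked with bit patterns (a sketch, assuming IEEE 754 binary32): an all-ones exponent with a zero mantissa is an infinity, and with a nonzero mantissa is a NaN.

```python
import struct

def bits_to_float(u: int) -> float:
    # Reinterpret a 32-bit pattern as a binary32 float.
    return struct.unpack("<f", struct.pack("<I", u))[0]

# binary32: 1 sign bit, 8 exponent bits, 23 mantissa bits.
assert bits_to_float(0x7F800000) == float("inf")   # exp all ones, mantissa 0
assert bits_to_float(0xFF800000) == float("-inf")
nan = bits_to_float(0x7F800001)                    # nonzero mantissa: NaN
assert nan != nan                                  # NaN compares unequal to itself

# Sign and mantissa are free while the exponent is all ones (24 bits),
# minus the two infinities:
nan_count = 2 ** 24 - 2
print(nan_count / 2 ** 32)  # ~0.0039, i.e. about 0.39% of the space
```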
Example of implementing it the sane way: https://github.com/numpy/numpy/blob/75ea05fc0af60c685e6c071d...
Every step of this function is complex and expensive; printing a float as a decimal in particular is very complex. And round() is routinely used in tight loops.
>>> round(56294995342131.5, 2)
56294995342131.5
>>> round(56294995342131.5, 3)
56294995342131.5
>>> np.round(56294995342131.5, 2)
56294995342131.5
>>> np.round(56294995342131.5, 3)
56294995342131.51

You kind of got me thinking now. The decimal representation of a number is really a string representation (in the sense of a certain sequence of characters). Hence rounding to a certain decimal is essentially a string operation. You can of course do it by (say) dividing by 10^whatever or something else in some numerical fashion, but the more I think about it, the more natural it is to just think of the whole thing as a string.
Or you could flip it around and consider that the string manipulation can also be described numerically, so whether you consider the operation as a string operation or a numerical operation is sort of irrelevant. It's just a point of view.
It would previously scale up, round (ceil/floor really) then scale down. That turned out to induce severe precision issues: https://bugs.python.org/issue1869
The only silly part of ieee754 2008 is the fact that they specified two representations (DPD, championed by IBM, and BID, championed by Intel) with no way to tell them apart.
CPython rounds float values by converting them to string and then back
Jython instead uses BigDecimal::doubleValue: https://github.com/jythontools/jython/blob/b9ff520f4f6523120...
But as another comment noted, BigDecimal::doubleValue can pull a similar trick: https://news.ycombinator.com/item?id=20818586
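A rough sketch of the convert-to-string-and-back approach (illustrative Python, not CPython's actual C implementation, which goes through PyOS_double_to_string):

```python
def round_via_string(x: float, ndigits: int) -> float:
    # Format the double with the requested digit count, then parse it back.
    return float(f"{x:.{ndigits}f}")

# CPython's float formatting is correctly rounded (ties to even),
# so this agrees with round() on these cases:
assert round_via_string(2.675, 2) == round(2.675, 2) == 2.67
assert round_via_string(0.125, 2) == round(0.125, 2) == 0.12
```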
%timeit round(12335423552.33, -6)
%timeit int(12335423552.33 / 1_000_000.0) * 1_000_000.0
500 ns ± 3.88 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)
219 ns ± 1.1 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)