1. Unicode support was actually an anti-feature for most existing code. If you're writing a simple script you prefer 'garbage-in, garbage-out' unicode rather than scattering casts everywhere to watch it randomly explode when an invalid byte sneaks in. If you did have a big user-facing application that cared about unicode, then the conversion was incredibly painful for you because you were a real user of the old style.
2. Minor nice-to-haves like print-function, float division, and lazy ranges just hide landmines in the conversion while providing minimal benefit.
In the latest py3 versions we've finally gotten some sugar to tempt people over: asyncio, f-strings, dataclasses, and type annotations. Still not exactly compelling, but at least something to encourage the average Joe to put in all the effort.
Actually that's the behavior of Python 2: it works fine until you send it invalid characters, then it blows up.
In python 3 it always blows up when you mix bytes with text so you can catch the issue early on.
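A minimal illustration of that early failure (hypothetical snippet, Python 3):

```python
# Python 3 refuses to mix bytes and str implicitly, so the bug surfaces
# at the point of mixing instead of somewhere far downstream.
try:
    greeting = "hello, " + b"world"   # str + bytes
except TypeError as exc:
    print(exc)   # e.g. can only concatenate str (not "bytes") to str

# Python 2 would have silently coerced here and only failed later,
# once a non-ASCII byte actually showed up.
```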
> In the latest py3 versions we've finally gotten some sugar to tempt people over: asyncio, f-strings, dataclasses, and type annotations. Still not exactly compelling, but at least something to encourage the average Joe to put in all the effort.
That's because until 2015 all Python 2.7 features came from Python 3. Python 2.7 was basically Python 3 without the incompatible changes. Once they stopped backporting features in 2015, suddenly Python 3 started looking more attractive.
> In python 3 it always blows up when you mix bytes with text so you can catch the issue early on.
Sometimes you don't care about weird characters being printed as weird things. In Python 2 it works fine: you receive garbage, you pass garbage. In Python 3 it shuts down your application with a backtrace.
Dealing with this was one of my first Python experiences and it was very frustrating, because I realized that simply using #!/usr/bin/python2 would solve my problem but people wanted python3 just because it was fancier. So we played a lot of whack-a-mole to make it not explode regardless of the input. And the documentation was particularly horrible regarding that, not even the experienced pythoners knew how to deal with it properly.
This is definitely the case. I've been wrestling with bytes and strings all the time during the port of a Django application to Python 3 for a customer. I can see myself encoding and decoding response bodies and JSON for the time being. For reasons I didn't investigate, I don't have to do that with projects in Ruby and Elixir. It seems everything is a string there, and yet they work.
Not that I've seen.
Example of where Python 3 has rained shit on my parade: I wrote a program that backs up files on Linux. It works fine in Python 2, but in Python 3 you rapidly learn you must treat filenames as bytes, otherwise your backup program blows up on valid Linux filenames. It's not just decoding errors; it's worse, because not every byte string corresponds to a unique Unicode string, so the round trip (binary -> string -> binary) is not guaranteed to give you back the same binary. If you make the mistake of using that route (which Python 3 does by default), then one day Python 3 will tell you it can't open a file you os.listdir()'d microseconds ago and can clearly see is still there.
Later, you get some sort of error when handling one of those filenames, so you sys.stderr.write('%s: this file has an error' % (filename,)). That worked just fine in Python 2, but in Python 3 it generates crappy-looking error messages even for good filenames. You can't just decode the filename to a string, because that might raise a coding error. Writing bytes directly works (sys.stderr.buffer.write(b'%s: this file has an error' % (filename,))), but then you find you've interpolated other strings into your error messages, and soon the only "sane" thing to do is to convert every string in your program to bytes. Other solutions, like sys.stderr.write('%s: this file has an error' % (filename.decode(errors='ignore'),)), corrupt the filename the user sees and are verbose; worst of all, if you forget one, it isn't caught by unit tests but will still blow up your program in rare instances.
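For what it's worth, the mechanism behind this is the surrogateescape error handler from PEP 383 (what os.fsdecode/os.fsencode use under the hood). A small sketch of the lossless round trip and the formatting problem, with made-up filename bytes:

```python
import sys

raw = b"caf\xe9.txt"   # a valid Linux filename that is not valid UTF-8

# Python 3 decodes filenames with errors='surrogateescape', so the
# bytes round-trip losslessly through str:
name = raw.decode("utf-8", errors="surrogateescape")   # 'caf\udce9.txt'
assert name.encode("utf-8", errors="surrogateescape") == raw

# ...but the lone surrogate means the string can't be encoded normally,
# which is the failure mode described above when formatting messages:
try:
    name.encode("utf-8")
except UnicodeEncodeError:
    pass   # blows up on this filename, fine on ASCII ones

# One compromise: show escape sequences instead of corrupting or crashing.
printable = raw.decode("utf-8", errors="backslashreplace")
sys.stderr.write("%s: this file has an error\n" % printable)
```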
I realise that for people who live in a land of clearly delineated text and binary, such as the Django user posting here, these issues never arise, and the clear delineation between text and bytes is a bonus. But people who use Python 2 as a better bash scripting language than bash don't live in that world. For them Python 2 was a better scripting language than bash, but it is being deprecated in favour of Python 3, which is actually more fragile than bash for their use case. (That's a pretty impressive "accomplishment".) Perhaps they will go back to Perl or something, because as it stands Python 3 isn't a good replacement.
Not always. As far as I can tell, writing garbage bytes to various APIs works fine unless they explicitly try to handle encoding issues. The first time I noticed encoding issues in my code was when writing an XML structure failed on Windows, all because of an umlaut in an error message I couldn't care less about. The solution was to simply kill any non-ASCII character in the string; not a nice or clean solution, but the issue wasn't worth more effort.
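The blunt fix described is a one-liner (the message string here is invented for illustration):

```python
msg = "Fehler: ungültiger Wert"   # error message containing an umlaut

# Kill any non-ASCII character before handing the string to a fussy API.
# Not nice, not clean, but it stops the encoding crashes.
ascii_only = msg.encode("ascii", errors="ignore").decode("ascii")
print(ascii_only)   # Fehler: ungltiger Wert
```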
> In python 3 it always blows up when you mix bytes with text so you can catch the issue early on.
That is nice if your job involves dealing with unicode issues. My job doesn't, any time I have to deal with it despite that is time wasted.
We're talking about simple scripts, the solution is to not send in invalid characters.
Personally, asyncio and type annotations are a big turnoff. I know this is a bit contrarian, but I've always favored the greenlet/gevent approach to doing cooperative multitasking. Asyncio (née Twisted) had a large number of detractors, but now that the red/blue approach has been blessed, it seems like many are just swallowing their bile and using it.
Type annotations really chafe because they seem so unpythonic. I like using Python for its dynamicity and for the clean, simple code. Type annotations feel like an alien invader and make code much more tedious to read. If I want static typing, I'll use a statically typed language.
No one wants to spend energy re-programming to stay in place.
Especially APIs.
- run 2to3
- spend 2h max fixing any failing tests
- shake out any remaining issues in a few days of beta testing, like you'd do for any new release
Now no doubt Python 2.7 is an excellent and solid release and will remain so for as long as anyone keeps the bitrot in check, but to keep using it because porting is 'hard' is patent bs.

https://www.mercurial-scm.org/repo/hg/log?rev=py3&revcount=2...
They've been porting hg to Python 3 for the last 10 years and are only now nearing completion.
I've written a bit more about this in Lobsters:
https://lobste.rs/s/3vkmm8/why_i_can_t_remove_python_2_from_...
The only real killer feature of Python 3 is the async programming model. Unfortunately, the standard library version is numbingly complex. (Curio is far easier to follow, but doesn't appear to have a future.)
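For contrast, the happy path of the stdlib model is short; the complexity complaint is about the full API surface (event loops, futures, transports, protocols), not a minimal case like this sketch:

```python
import asyncio

async def fetch(name, delay):
    await asyncio.sleep(delay)   # stand-in for real I/O
    return f"{name} done"

async def main():
    # Run two coroutines concurrently and collect their results.
    return await asyncio.gather(fetch("a", 0.01), fetch("b", 0.01))

print(asyncio.run(main()))   # ['a done', 'b done']
```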
On the down side, switching to Unicode strings is a major hurdle. It mostly "just works", but when it doesn't, it can be difficult to see what's going on. Probably most programmers don't really understand all of the ins and outs. And on top of that, you get weird bugs like this one, which apparently is simply never going to be fixed.
The model is similar to Golang in many ways, e.g. communication using channels [2] and cancellation [3] reminiscent of context.WithTimeout, except that in Golang you need to reify the context passing.
The author has written some insightful commentary on designing async runtimes [4] and is actively developing the library, so I'm optimistic about its future. There were plans to use it for requests v3 until the fundraiser fiasco [5].
[0] https://github.com/python-trio/trio
[1] https://vorpus.org/blog/announcing-trio/
[2] https://trio.readthedocs.io/en/stable/reference-core.html#us...
[3] https://trio.readthedocs.io/en/latest/reference-core.html#ca...
[4] https://vorpus.org/blog/notes-on-structured-concurrency-or-g...
[5] https://vorpus.org/blog/why-im-not-collaborating-with-kennet...
The link to support requests (which is a great piece of software) is here:
https://cash.app/$KennethReitz
Note: This is NOT a charitable donation, it is a gift to an individual. These are not tax deductible under US law.
Njs has a long attacking blog post saying this needs to go through the PSF (huh?) and that they should be getting most of this money, not the person the funds were directed towards (it's not clear how much they've actually contributed to requests over time). This supposedly may also trigger folks who have suffered from "gaslighting".
Supporting the developer of a piece of software does not, as far as I know, require that they sign up to handle it on a charitable basis. A big to-do is made about the "large" amount raised. The amount is $33K. To be frank, that is almost zero in tech land, at least in the Bay Area, and requests is a very highly used project. I was literally expecting something like $300K or even $1M; silly Kickstarter projects raise far more and deliver nothing. Requests has already delivered a lot of utility.
Just a bit of perspective from someone who wasn't familiar with this "fiasco".
Dropbox invested three years of work, actually hired Python's creator, and are still not done. What are they getting out of it that they wouldn't have gotten if Python2 simply had been maintained?
Who wants to break old SQL? Nobody.
Yes, it's expensive to upgrade from Python 2 to Python 3, but it's also expensive for the Python project to maintain two versions of Python indefinitely. If someone other than the core Python team wants to step up and maintain Python 2, they are free to do so; it's open source. But failing that, expecting the Python team to support the older, less functional version of the code indefinitely is unrealistic. Corporate-owned languages have even shorter lifecycles for exactly this reason.
I understand that this is one of the major features, but I personally never saw the appeal, given that gevent exists and in my experience works well most of the time. It also allows me to multiplex IO operations and doesn't rely on new syntax. I'm probably missing something?
- mandatory keyword arguments
- multi-dict splatting
- nicer yield semantics for generators
- Fixing system-specific encoding ambiguities
- dataclasses
- inline type annotations
- better metaclass support
- more introspection tooling
- pathlib (for nicer path handling)
- mocking pulled into the standard library in a cleaner way
- stable ABIs for extensions
- secrets handling
- ellipsis instead of pass (yeah who cares but I care)
- lots of standard lib API cleanup
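A few of the items above in one illustrative snippet (Python 3.7+; the names are invented for the example):

```python
from dataclasses import dataclass
from pathlib import Path

# Mandatory keyword arguments: callers must spell out 'retries'.
def request(url, *, retries=3):
    return (url, retries)

assert request("http://example", retries=1) == ("http://example", 1)

# Multi-dict splatting: later dicts win on key conflicts.
defaults = {"host": "localhost", "port": 80}
config = {**defaults, **{"port": 8080}}
assert config == {"host": "localhost", "port": 8080}

# Dataclasses: __init__, __repr__, and __eq__ for free.
@dataclass
class Point:
    x: int
    y: int

assert Point(1, 2) == Point(1, 2)

# pathlib: composable, readable path handling.
p = Path("/var/backups") / "2019.tar.gz"
assert p.suffix == ".gz"
```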
All of this is very helpful for making clean applications. But I would say it's _very_ helpful for making good libraries as well. This stuff is about having a strong language foundation to avoid plain weirdness like the click issue.
Obviously it doesn't kill all of them, but there used to be even more of that kind of thing all the time. Library issues would basically get exported to their users, all essentially due to language problems.
pd.read_excel(filepath) will read an entire dataset even if it contains unicode characters.
pd.ExcelFile() silently drops(!!) unicode rows. The resulting object will simply skip unicode-containing rows (in ANY column) without even a warning.
For example, if you had an excel file:
word
---
"hello"
"hello"
你早
你早
"hello"
then pd.read_excel() would give you a dataframe with 5 rows. ExcelFile() on the other hand would return (silently!) a dataframe with only the first two and the last row.
Maybe this is a pandas issue, not a python issue, but it was really horrendous to debug for such a long time only to realize this was the issue.
I understand why it's the way it is, but when it comes to the typical unixy things I need to do (shuffling files around, tar'ing stuff, etc.), it definitely trips me up more than I'd wish.
Migrating from Python 2 to Python 3 is way worse than that -- code changes are required, and because Python is a dynamic language you may not notice bugs until you actually run the code (or even worse, until after you release it to production and some code branch that is rarely invoked somehow gets called...). In other words, the tooling and the type system are not confidence-inspiring and it's really hard to verify that you migrated without breaking stuff.
At a certain point, this sort of compatibility/forward motion of a codebase through big language revisions has to be designed into the language itself. Either the work can be broken into small enough chunks to chew through in pieces (updating one submodule to the new language version without affecting anything else), or the change is completely transparent to the code being run (as happens with C compilers across different standards), or there is a version-to-version automated rewriting mechanism so reliable that its output is never in question (tools like Go's gofix). Python, in my opinion, has only partial solutions to all of these, so it turns into a lot of hand work.
So while there are other languages that may do other things better, there is still a class of programs that are very effective to write in Python, and that's plenty enough reason to keep it around. Do not forget that Python 2 was released in 2000 and Python 3 almost a decade later. On that time scale, many people don't worry about the next release at all; those who do start considering other languages, because it's important to them.
Besides Java and Python already discussed, another big mess of a transition was from Qt 4 to Qt 5, where all the strings became unicode.
Early Python 3 was hell for conversion. The syntax was changed for no good reason. u'word' became illegal. (That later went back in.) The "2 to 3 converter" was a joke. I didn't have the "print statement problem" because my code called a logging function for all debug output.
Many of the P3 libraries didn't work. (The all-Python MySQL connector failed the first time I tried to do a bulk load bigger than a megabyte, indicating that nobody was using it.) It took years before the libraries were cleaned up.
Python 3 got some really weird features, such as type declarations that don't do anything. I can see having type declarations, especially for parameters, but they need to be used both for checking and optimization. CPython boxes everything, which is terrible for numerics and is why most serious math has to be done in C libraries. My comment on that was "Stop him before he kills again."
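To be concrete about "don't do anything": CPython stores annotations but never enforces or optimizes on them at runtime; checking is left to external tools like mypy. A tiny demonstration:

```python
def double(x: int) -> int:
    return x * 2

# The annotations are recorded...
assert double.__annotations__ == {"x": int, "return": int}

# ...but nothing checks them: passing a str "works" by repetition.
assert double("ab") == "abab"
```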
It did, but in a way that chainsaws support sculpting just fine. Technically possible. Very advanced people will know how to handle it. Everybody else is just going to injure themselves randomly.
Most people writing py2 got their text/binary processing working by accident. Things appear to work until you throw actual Unicode into the parameters, and then nobody knows what happens. There are a number of "what does this decoding exception mean" questions on Stack Overflow every day. They're often actual bugs people could ignore before. Now they're surfaced immediately, and I believe that's better.
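The canonical shape of those Stack Overflow questions, for reference (the bytes here are invented for illustration):

```python
data = b"caf\xe9"   # latin-1 bytes handed to code that assumes UTF-8

try:
    data.decode("utf-8")
except UnicodeDecodeError as exc:
    # The exception means the bytes were never UTF-8 to begin with;
    # in py2 this mismatch could pass through unnoticed for years.
    print(exc)

# Decoding with the codec the data actually uses works fine:
assert data.decode("latin-1") == "café"
```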
It didn't help that Py2's IDLE had (has? I don't recall them actually resolving this, just closing the issue) a major bug [1] where even if you explicitly use u-literals (a = u'日本語'), the text will still be encoded in your locale (shift_JIS [2] in the Japanese case) instead of unicode/utf-8. You can imagine how confused people got when they tested py2's unicode support in IDLE and saw this.
Or so you wish, it's not necessarily true though. It's just as likely to pass through gibberish without blowing up.
I have a tiny relay service written in Django that lets me pass messages between my phone and home computer, that I recently upgraded both the python and Django version. The service is only two views of about 3 lines each - and a unicode conversion bug crept in such that it stored "b'text'" in the database instead of "text". No warnings, no errors.
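That particular bug is easy to reproduce: str() on bytes gives you the repr rather than decoding, so a stray conversion quietly pollutes stored data (a sketch, not the actual Django code):

```python
body = b"text"   # e.g. a raw request body

# Calling str() on bytes does NOT decode; it produces the repr,
# which is exactly how "b'text'" ends up in a database column.
assert str(body) == "b'text'"

# Decoding explicitly gives the value you actually wanted to store.
assert body.decode("utf-8") == "text"
```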
Some are quite good and finely detailed, in my opinion. It's really nothing like what you'd expect after hearing "chainsaw". There actually are small chainsaws, maybe even one-handed ones, for doing exactly that.
Case in point: I worked on a project using Ruby. When we migrated from Ruby 2.4.0 to 2.4.6 (yeah, a patch-level upgrade), it broke spectacularly. Trying multiple Ruby versions, the change had actually been introduced in Ruby 2.4.1. After some investigation, a change in the stdlib Net::HTTP library had broken a dependency of a dependency. The fix was just one line of code (we only needed to change the adapter used for HTTP communication), but it was two days of work for a minor upgrade.
My current job tried to migrate from Java 8 to Java 11. It also broke multiple services. This one is still in progress, months later.
Python 2 to Python 3 is bigger than both of those version changes (however it is equivalent to Ruby 1.8 to 1.9 changes), so yeah, it does take more time. And like some projects that are forever running Ruby 1.8 or Java 8 (or even worse, Java 6), we will have projects forever running Python 2 too.
According to my highly unscientific survey of the packages in Gentoo's package repo, there are roughly:
- 2500 packages that work with Python 2 or 3
- 1350 packages that work with Python 2 only
- 350 that work with Python 3 only
My methodology:
- 3122 Python 3 only
- 88 Dual support
- 8 Py2 leaf (standalone packages; may be dropped)
- 77 Not ported (will be dropped unless ported)
- 100 Blocked (require 1 or more "not ported" packages)
- 18 Legacy (will be dropped)
Note that py3only/dual-support only reflects how it is packaged in Fedora, not what upstream provides.
For the same reason why migration to IPv6 is taking so long.
Neither technology solves an immediate problem that end users are facing. Instead, they solve 'nice to fix' problems that few people care about.
I work in an industry where there is basically one 800lb gorilla of a vendor. They update rarely, because their product is a mission-critical, life-or-death sort of thing. Their current product is heavily, heavily integrated with x.y.z version of software from a different vendor in a different segment, but also weighing in at 800lb. Yes, they specify x.y.z, not just x or even x.y. That software comes bundled with a Python 2.7.5 distribution.
Imagine my woes trying to get pip running, which unhelpfully suggests I upgrade Python. Cannot seem to find any other path to even get pip going because of what I call the "lol just upgrade n00b" factor. Perhaps that information once existed but I cannot find it.
So, I am stuck on this version because of some pretty tight integration, at a couple of removes. I think the vendor-linkage can cause some "drag" that folks who work in a greenfield environment might not be thinking about. It can be unfortunate but there it is.
If it can help you: the trick I use is to install a normal Python 2.7 interpreter with pip. Then you can use it to install software into any directory, including the one belonging to the other application. There are flags to specify what to install, from where to where, internet or not; something like:

pip install packagename --target=/to/app/lib

Considering all the stuff that is written in Py2, I really don't see it being out-and-out abandoned. That wouldn't really make any sense. With computer languages, stuff never goes away.
$ python3
Python 3.7.4 (default, Sep 7 2019, 18:27:02)
[Clang 10.0.1 (clang-1001.0.46.4)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy
>>>
[2]+ Stopped python3
$ lsof -c Python | sed -n "/fortran/s/$USER/<redacted>/gp"
Python 35190 <redacted> txt REG 1,4 1550456 12887664541 /Users/<redacted>/Library/Python/3.7/lib/python/site-packages/numpy/.dylibs/libgfortran.3.dylib

It was good in its time, and great things were done in it that are still around... but let's move on to F90 already.
I say this in part because comedy, but also because it was anticipated to be a long project. It was originally called "Python 3000".
And you can't do it gradually, so it's all-or-nothing. (yes, "six" exists, but you still execute one way or another)
And you'll have to change the versions of all your libraries, which is not usually a smooth experience in the Python ecosystem. (this is another place where it's "all or nothing", since six can't help you if your dependencies don't all use it + use it correctly)
---
It's a huge risk with huge cost for already-working, running code. For new stuff, sure, write it in 3, but 2.7 works fine and has the added benefit of being very well understood by this point.
I still haven't forgiven them for killing the print statement, which could have peacefully coexisted with a print() function.
The migration is financially negative in the short term, and very clearly so. It might be financially positive over the long term (due to easier maintenance and higher performance), but that is a definite maybe. Especially for an app that is otherwise very stable.
If you have a hole it's hard to dig yourself out of it. This is why I prefer modular apps instead of monolithic codebases. You can upgrade piece by piece. Otherwise it's all or nothing and dangerous
I think it is true (as of pretty recently) that Red Hat is the only company employing a Python core dev to work on Python core dev stuff full time (see https://discuss.python.org/t/official-list-of-core-developer...). But the core dev team is focused on Python 3, so that isn't a sign of Red Hat's Python 2 commitment either.
That means they'll patch Python 2 should vulnerabilities be found on their OS.
https://en.wikipedia.org/wiki/Red_Hat_Enterprise_Linux#Versi...
RHEL 6/7 and Centos 6/7 will support Python 2 until at least mid-2024.
However, barring speed improvements, there isn't much to offer apart from unicode, f-strings, and annotations.
If python 3 had proper multithreading, that might have been worth breaking backwards compatibility for.
I have a lot of Python 2.7 code that I wrote years ago which has been running smoothly and my team is generally going to rewrite rather than "convert" because I really don't trust conversions. I'd rather see all bugs upfront rather than hidden in the fog.
A lot of my code is performance critical, and, for example, I'm still salty about dictionary operations taking O(log(n)). But the proliferation of active minor versions makes it very difficult to write portable, performant code.
It's become a sticky wicket. I want to migrate to Python 3 (and, by and large, I have in most of my projects). But what version do I target? Will my dependencies make the same choice? Or does "migration" turn into a sisyphean task? It's becoming burdensome enough that I'm contemplating abandoning the language for something more stable.
Current version is 3.7. If you expect your migration work to take a year, you should consider going for 3.7 and above only, because the previous minor versions will be dropped by the time you're done.
And fwiw "3.7 is the current version" doesn't help my users.
Migration in interpreted languages that implement major breaking changes is really tedious.
That’s the reason I am so upset with today’s JavaScript ecosystem - things move so fast that good technology is being deprecated and changed constantly which breaks all kinds of things in other places.
How can we expect Python 3 to become the default if Python 2 still asserts such dominance?
In my archlinux installation, python resolves to 3, and I have to use python2 if I want 2
I've been meaning to dig into Maya, Houdini, Nuke's Python 3 transition plans. I know Houdini will offer a Python 3 option with Houdini 18 (shipping in the next month or so).
I don't think the reason was because of downstream users. Python 3 was an inevitable change. Previously, they swapped out PyQt for PySide which wasn't a forced change, but required everyone to update their Python scripts.
So much effort wasted doing this in a large codebase. And what do you get for it? It’s just not worth it. Nobody actually needs Python 3, it was foisted on them by the developers. What everyone really wanted was Python 2.8.
I think many people underestimate the challenge that the 2 to 3 migration presents for large enterprises. The core issue is that even though the migration for any given module is normally really easy, the total effort required to migrate is still essentially O(n) in module count/file count, because even with current tooling you still need to have an engineer look at every module to do the change safely. Even if it only takes ~5 minutes per module to make the changes and validate that it works correctly, this becomes a giant undertaking when you have tens of thousands of files to migrate.
The fact that it takes a long time also creates other problems. Your business isn't going to hit "pause" on other development, so there will be changes constantly introduced into modules you've already "swept". It's going to be hard to make sure 100% of your engineers and code reviewers are knowledgeable about the specific requirements to make sure the code works in both 2 and 3, so you would really like some automated safeguards to make sure they don't introduce anything that won't work in 3. Pylint helps with this, but won't catch everything. Unit tests are obviously essential, but:
1. Even a well-tested project won't have tests that cover 100% of code paths and behavior.
2. You're stuck running the tests on both python2 and python3 for the duration of the migration, which doubles the resource (compute, memory, etc.) cost of your Python CI and regression testing infrastructure for the duration of the migration.
Most big companies have passionate Python advocates who really want to be on Python 3, but the scale of the problem and the lack of tooling to tackle it with a sub-O(n) amount of effort make the overall project risky and expensive for the business.
The unicode switch is a nightmare in terms of having to go through and double/triple check everything and still get it wrong half the time. Particularly when it comes to moving data over the network.
The big selling point for Python3 finally came with the built-in async support, but we've been using Twisted for a decade, which works nearly identically, so even that wasn't a huge draw for us.
Further, many of our dependencies were python2-only up until the last year or two.
Really the only reason we're going through the effort right now is that Python2 is rapidly approaching End of life.
This doesn't require parallel testing. These all improve the quality of 2.x code even if you never make the leap to 3.x.
Once this is done you can use 2to3 to mechanically fix the remaining differences. Anything else that remains broken can be special-cased in the 2.7 code until 2to3 works without intervention.
That's why six and manual changes are always needed...
It's a great comment otherwise.
The problem we have where I work is some very clever 2.7 code that isn't easy to redo in Python 3. For any new project I do, I use Python 3.
No, it could not. Python itself is a C executable, which makes the distinction moot.
For some reason, a lot of people seem to be laboring under the impression that Python 2 code is just going to stop working in 2020. The only thing stopping is the Python core team's bug-fix releases. Python 2 itself will continue to exist. Existing installations will keep working. Linux distributions _can_ choose to keep Python 2 in their repositories and maintain it separately going forward, although they are not likely to. Ubuntu, Red Hat, and other OS providers all have operating systems which include Python 2 that they are contractually obligated to support and patch for years in the future. And of course, the source code for Python 2 will never just up and disappear within our lifetimes unless human civilization does as well.
As for businesses, if your application is mission-critical and you want to keep it going, then you get to decide whether to invest in keeping your application current with the state of the art, or invest in keeping the application's environment static. This means having a reliable source of the required hardware, archived copies of the OS, all dependencies and libraries, and the application itself. And presumably you still need someone knowledgeable enough to fix bugs in the stack from time to time.
EDIT: Personally, while I find older Windows interfaces ugly, for example, they were very consistent and functional. In modern designs I sometimes can hardly find what is clickable/actionable. That's not the interface working for me, but the other way around.
No, users most definitely do not care about Material Design. They only care about being able to quickly do the task the app or web site claims to allow them to do.