Comprehensions in Python the Jedi way (opens in new tab)

(gist.github.com)

208 pointsbearfrieze10y ago85 comments

85 comments

69 comments · 17 top-level

vadim4110y ago· 13 in thread

Cute oneliners that are close to unreadable and definitely unmaintainable.

If I see this kind of "cute" code in code reviews, there is some serious scolding to be done.

spamizbad10y ago

Eh, while comprehensions with multiple for-ins push the limits of good taste, I don't think:

    planets_set = {
        planet for episode in episodes.values() 
        for planet in episode['planets']
    }

is less maintainable in a Python shop than say...

    planets = set()

    for episode in episodes.values():
        planets.update(episode['planets'])

Although the latter will likely make perfect sense to most non-python developers. The former is faster and has a smaller memory footprint and might be preferred when dealing with a larger or more irregular data sets.

The former also has the advantage of not leaking the "episode" variable into the function/method scope, which could introduce a subtle bug if that variable gets conditionally reused. So while it's harder to understand for a less-experienced python developer, the set comprehension solution is inherently safer due to python's design.

AnkhMorporkian10y ago

Additionally, generator comprehensions can be far more efficient than any simple for loop. If you had a generator with a billion star coordinates being read from some file, you'd never be able to load it all into memory. So, instead of manually making a generator function, you could just do

    coordinates = (star.x, star.y, star.z for star in star_map)

bpicolo10y ago

Though in py2 list comprehensions do leak scope. (dict comprehensions don't, you are correct on that :).

andybak10y ago

Gosh. If I was told off for using list comprehensions I'd start considering my employment options fairly quickly. Whilst they can be abused and pushed beyond the limits of reasonable use - in most cases I've seen in the wild they are the clearest and most Pythonic way to do it.

Niksko10y ago

That seems a little harsh.

Nested list comprehensions with multiple filtering if statements? Scald away.

But the simpler examples there are perfectly readable, understandable and maintainable if you understand list comprehensions. And if you need to alter it to the point where it needs to be a little more verbose in order to convey what you're doing, then sure, you can break it out into some other structure. But just because the complexity might increase later on doesn't mean that you shouldn't use a list comprehension.

matt_wulfeck10y ago

But what do you gain from using nested list comprehension like that? There's no performance gain (or very, very little). The only tangible benefit I see is saving a few lines of code.

And for those few lines of code you've traded the ability for new programmers to understand it easily. To me that's a net loss.

4 more replies

rcarmo10y ago

Actually, these are pretty idiomatic and understandable.

List comprehensions (and inline generators, which you get by using parenthesis) make it trivial to handle fairly complex sequence processing, and are one of the reasons Python gets a lot of love from the FP community.

Sure, it's not really an FP language, but these give it an almost LISPy feel.

Regardless,it would be less Pythonic to build a loop and iterate.

Might be easier for complete novices, but I don't suppose people are just hiring novices these days, right?

(edit: Mobile-sponsored typo)

bpicolo10y ago

List comprehensions are both insanely readable and easy to maintain. They're one of the things python gets very, very right

golergka10y ago

I just learned about this, so they are unreadable to me.

But it's not relevant. What relevant is, do they create a readability problem for a python developer who spent significant time with them? I honestly don't know.

andybak10y ago

I 'got' list comprehensions fairly immediately and find them clearer in most cases than map/filter for an explicit loop. I'm cautious when nesting them (as long as your code formatting is clear - a single nested comprehension is pretty acceptable) and I never use the multiple 'for' form as it's just not intuitive to me.

I am rather fond of dictionary comprehensions as mentioned in the article:

    colors = [jedi['lightsaber_color'] for jedi in jedis]
    frequencies = {color: colors.count(color) for color in set(colors)}

    print(frequencies)
    # {'green': 6, 'red': 5, 'blue': 6}

1 more reply

ardfard10y ago

Really? I am pretty much mind-blown when the first time I learned about this. Given that my background is Mathematics, I thought it was pretty elegant and also looked very intuitive.

kelseyfrancis10y ago

I think I'd resign if someone in a position to be reviewing Python code scolded me for using a list/dict/set/generator comprehension.

jqm10y ago

That's not really "cute" code though. Short list comprehension isn't any less readable (and certainly not less maintainable WTF?) than a multi-line "for" loop. In fact, I find short list comprehension more readable. I don't have to mentally keep track of place in a multistep loop.

ambicapter10y ago· 8 in thread

> planets_flat = [planet for episode in episodes.values() for planet in episode['planets']]

Can somebody explain this one to me (I understand list comprehensions)? I'm having trouble understanding how the second part uses something defined in the first part, but the first part can't stand on its own, so

> [planet for episode in episodes.values()]

returns an error.

thomasahle10y ago

It's equivalent to

  planets_flat = []
  for episode in episodes.values():
    for planet in episode['planets']:
      plants_flat.append(planet)

Notice how the for loops in the comprehension goes in the same order as in the imperative code.

hobarrera10y ago

> Notice how the for loops in the comprehension goes in the same order as in the imperative code.

Thanks, that's a sane way to explain the order. Up to right now, it was always "the opposite of what you'd expect", which was a memory rule that always failed me.

3 more replies

andreasvc10y ago

Look at the second example in https://docs.python.org/3/tutorial/datastructures.html#list-...

Once you know how it translates to regular for loops, this syntax is pretty natural. I always put each for loop on its own line, which makes things more readable.

bpicolo10y ago

That's admittedly a pretty confusing example. I don't recommend ever iterating over multiple lists in a single comprehension. A mental breakdown is:

For episodes in episodes

for planet in episode planets

give me planet

the binding of `planet` isn't evaluated until the end (left to right evaluation for the looping bits)

lmm10y ago

Python for comprehensions are middle-ended for some reason: part of it goes backwards and part of it goes forwards. Read it as:

    planets_flat = [planet for planet in episode['planets'] for episode in episodes.values()]

and it makes a lot more sense. In most languages with list comprehensions you'd write the thing you were iterating over first, e.g. (Scala):

    val planetsFlat = for { episode <- episodes.values; planet <- episode(planets) } yield planet

lisper10y ago

It's a list comprehension with an implicit nested loop:

    >>> [y for z in [[1,2,3],[4,5,6]] for y in z]
    [1, 2, 3, 4, 5, 6]

The error you get is because "planet" is bound by the second FOR clause, so when you leave it out, planet becomes unbound.

andreasvc10y ago

The term nested list comprehension is used differently here: https://docs.python.org/3/tutorial/datastructures.html#neste...

Don't know what a better term might be, but it's good to be aware of the distinction.

1 more reply

andybak10y ago

> Can somebody explain this one to me?

Nope. Well - I could if I tried but that particular form is fairly abhorrent to me. My brain hurts just reading it so this is the stage I would find a clearer way to express the algorithm.

a_bonobo10y ago· 7 in thread

One interesting side-thing with list comprehensions in Python 2 vs. Python 3:

Python 2.7:

>list_of_numbers = [1,2,3]

>[x/2 for x in list_of_numbers]

>print(x)

Python 3:

>list_of_numbers = [1,2,3]

>[x/2 for x in list_of_numbers]

>print(x)

NameError: name 'x' is not defined

They "leak" their variables in Python 2, if you're someone who reuses variables this can lead to an enormous headache!

Cyph0n10y ago

Wow, I never noticed that! Who the hell thought that was a good idea?

raymondh10y ago

Barry Warsaw proprosed List Comprehensions 16 years ago in PEP 202.

The original concept was that it provided a more concise way to write loops so that:

   t = [expr(x, y) for x in s1 for y in s2]

was just a short way to write:

   _t = []
   for x in s1:
       for y in s2
           _t.append(expr(x, y))
   t = _t
   del _t

The original implementation reflected that design. I later added the LIST_APPEND opcode to give the list comprehensions a speed advantage over the unrolled code.

The question of whether to expose the loop induction variable didn't get much discussion until I proposed Generator Expressions in PEP 279 and decided to give them the behavior of hiding the loop induction variables so that the behavior would match that of a normal unrolled generator function.

When set and dict comprehensions (displays) came along afterwards, they were given the latter behavior because there was precedent, because there was a mechanism to implement that precedent, and to provide a short-cut for then common practice of creating dicts and sets with generator comprehensions:

    s = {expr(x) for x in t}

was a short-form for:

    s = set(expr(x) for x in t)

The advent of Python 3 gave us an opportunity to make a four forms (listcomps, genexps, set and dict displays) consistent about hiding the loop induction variable.

The current state in Python 3 has the advantage of being consistent between all four variants and matches how mathematicians treat bound and free variables.

There are some disadvantages as well. List comprehensions can no longer be cleanly explained as being equivalent to the unrolled version. The disassembly is harder to explore be cause you need to drill into the internal code object. Tracing the execution with PDB is no fun because you go up and down the stack. It is more difficult to explain scoping -- formerly, all you had was locals/globals/builtins, but now we have locals/nonlocals/globals/builtins plus variables bound in list comps, genexps, set/dict displays plus exception instances that are only visible inside the except-block.

2 more replies

maxerickson10y ago

It was largely an implementation detail. This mentions it some:

http://python-history.blogspot.com/2010/06/from-list-compreh...

The PEP is silent on the issue:

https://www.python.org/dev/peps/pep-0202/

I guess mail list spelunking would probably reveal some conversations about how to handle it.

bpicolo10y ago

Pretty sure it was a bug that they kept for backwards compat, not intentional, though I may be wrong. Py3 let them shed it off though.

1 more reply

chishaku10y ago

> One interesting side-thing with list comprehensions

Just to clarify, this is the behavior of py2 for loops in general, whether in a comprehension or not.

prlin10y ago

Except dictionary comprehensions don't leak their variables.

1 more reply

bearfriezeOP10y ago

Nice bonus info. I didn't know about this (haven't spent much time with Python 2.x).

unoti10y ago· 6 in thread

Comprehensions are great, but they can hurt readability and maintainability when taken too far. Most of the time you shouldn't be playing code golf with your code, iteratively seeking to pack more and more work into a single line of code. That makes the code harder to understand and harder to re-use.

While many of the examples in this helpful article are good, the first example with the octets is an excellent example of how not to do it. Look at the octet parsing code we ended up with in the article:

  # Snippet 1
  octets = [bbs[i:i+8] for i in range(0, len(bbs), 8)]

It's nice and tight. What does it do? I'd need to peer at it a moment and decode it, executing it in my head. This is subjective, but I think code should be self-explanatory; it's up to the computer to execute code, not people in their heads. Is there an off-by-one error in there? Here's another way to do it that'd be better.

  octets = chunks(bbs, 8)

That function chunks() is something that I keep in an iterutils package which I end up using all the time. It's intuitive, and it has a doctest that shows that we definitely don't have an off-by-one error. It's also easier to re-use than the first one. Maybe it also bears mentioning that chunks() works on an iterator, while the first solution needs to keep the whole thing in memory at once. Here's the chunks method I use:

    def chunks(collection, chunk_size):
        """Divides list l into chunks of up to n elements each.
            >>> l = range(75)
            >>> chunks(l,10)
            [[0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
            [10, 11, 12, 13, 14, 15, 16, 17, 18, 19],
            [20, 21, 22, 23, 24, 25, 26, 27, 28, 29],
            [30, 31, 32, 33, 34, 35, 36, 37, 38, 39],
            [40, 41, 42, 43, 44, 45, 46, 47, 48, 49],
            [50, 51, 52, 53, 54, 55, 56, 57, 58, 59],
            [60, 61, 62, 63, 64, 65, 66, 67, 68, 69],
            [70, 71, 72, 73, 74]]
        """
        for i in xrange(0, len(collection), chunk_size):
            yield collection[i : i + chunk_size]

When you catch yourself playing code golf and trying to pack more and more meaning into a single line, look for ways that you can break the problem down into multiple components that use each other. This kind of functional decomposition is one of the things that makes functional programming so wonderful. Lots of times the intermediate steps in a complex expression have meaning and are useful on their own.

kilburn10y ago

I totally agree with you on the core issue.

However, I would reject your `chunks` function in a code review and tell you to use `grouper` form the itertools recipes [1].

More generally,any time I've ended up with a long list comprehension, the answer has been to "check itertools and see how you would describe this ugly comprehension in those terms"

[1] https://docs.python.org/3/library/itertools.html#itertools-r...

unoti10y ago

Grouper is a recipe, it's not part of itertools. It probably should be!

1 more reply

lalaithion10y ago

(The article was written about python3 so I'll nitpick about this)

There is no xrange function in python 3. Now range has the functionality of xrange.

im3w1l10y ago

That doesn't work on an iterator.

unoti10y ago

Cool! When you need it to work on an input that's an iterator, please submit a patch with a doctest. That's one of the important benefits of reusable code.

mfgs10y ago

In terms of pure readability I find breaking the list comprehensions helps in more complex cases (and is still valid Python), e.g.:

    octets = [bbs[i:i+8]
              for i in range(0, len(bbs), 8)
              if i%2 != 0]

bpicolo10y ago· 4 in thread

List comprehensions are awesome. Not only that - python does them insanely beautifully. Clojure and ES6 are examples that I think aren't as readable, though equally powerful more or less. But in python they don't have any clunkiness. Simple, expressive. Love them.

Nesting them can get ugly, but it's easy to avoid: Just use generator expressions and chain them. No real runtime overhead that way.

nilliams10y ago

Well with regards to clunkiness and awesomeness, are they not still less 'naturally' composable and (as a result) less readable in composition than collection pipelines? I don't see a good reason to prefer them over:

    collection
        .map(x => x * 2)
        ...
        .filter(isOdd)
        .reduce(blargh)

... style syntax that most other modern, C-style languages offer now (ES6, Rust, Ruby, C# ...)

thomasahle10y ago

The comprehension approach, while requiring more syntax, seems to often produce a more 'natural' order of operations. For example, compare

  [p for p in range(2,100) if all(p%q!=0 for q in range(2,p))]

with

  range(2,100).filter(p => range(2,p).map(q => p%q!=0).all())

It might be a small thing, but in the first one I feel the prime 'p' becomes the center piece, whereas in the second, 'p' is burrowed somewhat in the expression.

2 more replies

bpicolo10y ago

I think there are values in both styles. I've written a lot of both and for simpler things (which the vast majority if not all of code should be), I definitely prefer list comprehensions. They feel even more functional because they don't depend on map/filter/reduce defined on your objects.

I think chaining syntax from e.g. Elixir gives you more flexibility than map/filter/reduce defined on objects too. Big fan of that, though it does interact oddly with elixirs optional parentheses for function calls (which is a mistake of the language imo)

p4wnc610y ago

This is nothing new at all, it is called the "Fluent Interface" design. Many Python libraries implement this design, for instance the Pandas library is a popular example.

I hate this design approach deep in my soul. It makes for very brittle code that creates lots of backward compatibility issues. If you're working on some legacy code that has some nonsense like

    foo.get_status().dispatch_handler().log_error().close()

it is maddening! You have to untangle just what exactly gets returned by every step of the chain, so that you can ensure you're in the right context to know exactly what the next call of the chain is doing.

In that example, say someone changes `foo.get_status()` to return some new kind of "status" object, and it alters the `dispatch_handler` and so on. Of course one can implement this in a way where the chain of downstream calls doesn't break, but the point isn't so much that, through huge engineering effort it is possible, but rather that it is extremely brittle and adds a layer of complexity that's not needed.

It's just so much better to write something like:

    dispatch_result = run_dispatcher(foo.get_status())
    log_error(dispatch_result)

When the intermediate points of the chain are just functions, instead of member functions of a class, it means you can easily experiment with them and figure out what's going on without needing to recreate the entire set of context along the whole chain.

`run_dispatcher` in my example would be a hell of a lot easier to unit test and throw some mocked example class into for debugging or refactoring than if it is `some_class.run_dispatcher` ... and then if `some_class` has child classes that specialize the behavior, you're just hosed.

The problem is composability. People think that the fluent interface makes things composable because from some arbitrary point in the middle of the chain of calls, they have easy attribute-like access to the next operation they want to do. This artificially feels easy and convenient.

But contrast this to a functional language like Haskell, where none of these things need to be member functions of an object, and hence the context of the object doesn't have to be created at any point in the fluent chain. Then you can write something even better:

    (close . logError . dispatchHandler . getStatus) foo

We can even easily refer to this whole chain of events with a single function name:

    let statusDispatchLog = (close . logError . dispatchHandler . getStatus)

(And, of course, we get lots of nice type checking in statically typed languages to ensure that the composition actually makes sense -- which not only protects you at run time, but is also a huge help to clue you in to your design flaws. If you're trying to shoehorn some stuff into a fluent interface and it's not working, it probably means you have thought clearly about how the methods should "flow" in the call chain.)

To do the same thing in a fluent interface, we need a horrible lambda or a whole new function definition, exactly because the fluent interface is only sweeping the composability issues under the rug.

    statusDispatchLog = lambda x: x.get_status().dispatch_handler().log_error().close()

The difference is subtle, but important. Instead of making a new function that is explicitly the composition of other functions, you are making a function that just happens to access other functions as attributes, and if you set it up correctly then it acts as a sequence of composition.

In Python this is particularly a shame because functions are first class objects. Of course, you can write helper functions / decorators that sort of do function composition (if you're willing to throw away useful argument signatures), or you can use flaky hacks like the common Infix pattern in Python, and then live with ugly "<< . >>" or "|.|" misleading syntax.

It always makes me sad that Python lacks an extremely short function composition infix operator that provides some information about the function signatures of the functions being composed.

Because not even a comprehension can help you when you need to do the fluent interface stuff in Python.

    [x.h().f().g() for x in some_iterator]

This is so much worse than

    map(g.f.h, someIterator)

    [(g.f.h) x | x <- someIterator]

or even

    [g(f(h(x))) for x in some_iterator]

cgriswald10y ago· 4 in thread

Pedantic SW points:

1. Rey is not a Jedi. A better dict key would be "fav_force_user".

2. For the planets, Episode II is missing Coruscant, Episode VI is missing Dagobah.

Edit: formatting

jordigh10y ago

Hah, like there is any doubt that she'll become a Jedi. It's basically Episode IV all over again. Han Solo instead of Obi Wan, Rey instead of Luke, First Order instead of the Empire, Kylo Ren instead of Darth Vader.

We like hearing a story we already know. Rey is well on her way to become a Jedi, probably in two-movies' time.

zo110y ago

And don't forget, old Luke is probably going to act as the Yoda-equivalent in the next "episode".

bearfriezeOP10y ago

1: Seems pretty clear that The Force Awakens is an "...origin story of a female Jedi." [1]

2: At some point while writing this I realised I would spend all evening if I had to round up all the planets, and chose to note that the lists are "non-exhaustive" instead :)

[1]: http://www.wga.org/content/default.aspx?id=6130

bliti10y ago

Thanks, I always get my trek wars trivia mixed. Which movie is the one where Chewbacca goes into time and rescues some whales?

//Just some Sunday fun. :)

undershirt10y ago· 3 in thread

I don't think the author realizes how appropriate the Jedi tone for this article is. Comprehensions are the gateway drug to the dark side, away from imperative programming and toward languages that treat everything as expressions which snap together more freely. It hints at an idea of making a `for` loop and an `if` statement return a value (see CoffeeScript). But it also hints at the idea that useful idioms like the List Comprehension can merge/simplify existing constructs into a new, easier syntax-- and that there are languages that allow you to do this freely (see Lisp).

Python showed me the Force, but I'm with the Dark Side now.

hellofunk10y ago

One of the most extraordinary APIs for comprehensions can be found in Clojure, where the "for" expression really changed how I view data. A single expression can generate many complex data sequences with such elegant beauty.

bearfriezeOP10y ago

Author here. Thanks for this brilliant comment. May the force be with you.

seivan10y ago

Sorta like signals using RxJS/RxSwift.

d0mine10y ago· 3 in thread

There is an easier way to convert bits to ASCII, filter spaces and reverse the string:

  >>> n = int(bbs, 2)
  >>> n.to_bytes((n.bit_length() + 7) // 8, 'big').decode()
  's no  i sn e  h  e r  pm o c'
  >>> _.replace(' ', '')[::-1]
  'comprehensions'

http://stackoverflow.com/questions/7396849/convert-binary-to...

bearfriezeOP10y ago

I like the replace method. It's a great way of doing the same thing.

I considered using the [::-1] syntax to reverse the list, but decided that there was enough "cute" stuff in the examples already.

d0mine10y ago

[::-1] is an idiomatic way to reverse a string in Python that is the obvious way for a habitual user of the language to do it e.g.:

  def palindrome(s):
      return s == s[::-1]

It could be discussed whether ''.join(reversed(s)) is more readable for a novice programmer learning Python. In general, Python prefers words over punctuation.

Also, there are objects that can be reversed() that are not sequences.

http://stackoverflow.com/questions/931092/reverse-a-string-i...

tyingq10y ago

Another alternative, works on both py2/3, assuming you replace bbs='000... with bbs=b'000...

  import struct

  result=''.join([chr(int(bits,2)) for bits in struct.unpack('8p'*(int(len(bbs)/8)),bbs)])

  print(result.replace(' ','')[::-1])

matt_wulfeck10y ago· 2 in thread

I don't want to be a jedi Python programmer and exploit every neat trick of the language. I want to be a very good programmer and write code that's easy for others to read, debug, and maintain.

santaclaus10y ago

I wouldn't call list comprehensions a Jedi feature of Python -- they are pretty darn idiomatic and common.

JustSomeNobody10y ago

Comprehensions are not a neat trick. If you know and use Python, writing comprehensions is writing good code that can be understood by others.

MatthewWilkes10y ago· 1 in thread

is should not be used to check string equality, as it is in the space filter. This only works as CPython interns short strings automatically.

bearfriezeOP10y ago

Thanks for pointing this out along with some folks over in the comments on GitHub. I've updated the Gist.

distracteddev9010y ago· 1 in thread

One thing I find interesting is that everyone loves to hate CoffeeScript, but its individual features/syntax are consistently lauded in conversations about other languages.

(Not to mention half of ES6 existed in CoffeeScript first, but that's a gripe for another day)

stuartaxelowen10y ago

The composition of features is more important than just the individual features - CoffeeScript is a great example of when feature composition goes wrong.

rebootthesystem10y ago

List comprehension in Python is great. However, that's Padawan territory. If you want to be a Jedi then APL is the only way. It's not "list comprehension" it's the way of the force when you use APL.

Negative110y ago

It's fun switching from Python to Scala where for/yield comprehensions are idiomatic and very natural (nested comprehensions being a good example). Oh, and type safe and fast. It's especially powerful to build off those constructs with currying, partial application, pattern matching, etc...

I love Python and write code in it daily but as a (pseudo-)functional language it feels very awkward to me.

alexandercrohde10y ago

Or, there's a 1-million times better way to do this with a good library. In javascript for example

FA('0...11'.split('')).chunk(8) .map(x => parseInt(x.join(''), 2)) .map(x=>String.fromCharCode(x)) .filter(x=>x!=' ') .reverse()

https://github.com/anfurny/Fancy

RubyPinch10y ago

I just wish that python could stop being such a butt about not being like the rest of the languages

list of items --> filter list --> operate on list

becomes

operate on list <-- (list of items --> filter list)

And because this is how it was decided to tackle map/filter problems, we'll always have a weird gimped anon-function operator instead, to discourage the map/filter patterns of every other language

catnaroek10y ago

List comprehensions are just syntactic sugar over some common higher-order functions. Are we really getting excited over syntactic sugar? How about making Python's higher-order functions not suck instead?

jonesb610y ago

Yes let's make the reputation of Python more cryptic and culty. Because cryptic and culty things are better right?

Han Solo said it best "Hokey religions and ancient weapons are no match for a good blaster at your side, kid."

Lets keep it explicit alright? It's better then implicit.

Edit: read the article, it neither makes python cryptic or culty. The Jedi thing is just a cool SW reference. That said list comprehensions are a little cryptic to me since I haven't written python in awhile. Personally I think an important attribute of elegance in programming is how little you need to read the docs to understand something, and the more one-liners we do the more times we are likely to have to look at the documentation before reading it (not necessarily a bad thing, but sometimes a time sink and sometimes people won't look up the docs when they should!).

j / k navigate · click thread line to collapse

85 comments

69 comments · 17 top-level

vadim4110y ago· 13 in thread

Cute oneliners that are close to unreadable and definitely unmaintainable.

If I see this kind of "cute" code in code reviews, there is some serious scolding to be done.

spamizbad10y ago

Eh, while comprehensions with multiple for-ins push the limits of good taste, I don't think:

    planets_set = {
        planet for episode in episodes.values() 
        for planet in episode['planets']
    }

is less maintainable in a Python shop than say...

    planets = set()

    for episode in episodes.values():
        planets.update(episode['planets'])

AnkhMorporkian10y ago

    coordinates = (star.x, star.y, star.z for star in star_map)

bpicolo10y ago

Though in py2 list comprehensions do leak scope. (dict comprehensions don't, you are correct on that :).

andybak10y ago

Niksko10y ago

That seems a little harsh.

Nested list comprehensions with multiple filtering if statements? Scald away.

matt_wulfeck10y ago

But what do you gain from using nested list comprehension like that? There's no performance gain (or very, very little). The only tangible benefit I see is saving a few lines of code.

And for those few lines of code you've traded the ability for new programmers to understand it easily. To me that's a net loss.

4 more replies

rcarmo10y ago

Actually, these are pretty idiomatic and understandable.

Sure, it's not really an FP language, but these give it an almost LISPy feel.

Regardless,it would be less Pythonic to build a loop and iterate.

Might be easier for complete novices, but I don't suppose people are just hiring novices these days, right?

(edit: Mobile-sponsored typo)

bpicolo10y ago

List comprehensions are both insanely readable and easy to maintain. They're one of the things python gets very, very right

golergka10y ago

I just learned about this, so they are unreadable to me.

But it's not relevant. What relevant is, do they create a readability problem for a python developer who spent significant time with them? I honestly don't know.

andybak10y ago

I am rather fond of dictionary comprehensions as mentioned in the article:

    colors = [jedi['lightsaber_color'] for jedi in jedis]
    frequencies = {color: colors.count(color) for color in set(colors)}

    print(frequencies)
    # {'green': 6, 'red': 5, 'blue': 6}

1 more reply

ardfard10y ago

Really? I am pretty much mind-blown when the first time I learned about this. Given that my background is Mathematics, I thought it was pretty elegant and also looked very intuitive.

kelseyfrancis10y ago

I think I'd resign if someone in a position to be reviewing Python code scolded me for using a list/dict/set/generator comprehension.

jqm10y ago

ambicapter10y ago· 8 in thread

> planets_flat = [planet for episode in episodes.values() for planet in episode['planets']]

> [planet for episode in episodes.values()]

returns an error.

thomasahle10y ago

It's equivalent to

  planets_flat = []
  for episode in episodes.values():
    for planet in episode['planets']:
      plants_flat.append(planet)

Notice how the for loops in the comprehension goes in the same order as in the imperative code.

hobarrera10y ago

> Notice how the for loops in the comprehension goes in the same order as in the imperative code.

Thanks, that's a sane way to explain the order. Up to right now, it was always "the opposite of what you'd expect", which was a memory rule that always failed me.

3 more replies

andreasvc10y ago

Look at the second example in https://docs.python.org/3/tutorial/datastructures.html#list-...

Once you know how it translates to regular for loops, this syntax is pretty natural. I always put each for loop on its own line, which makes things more readable.

bpicolo10y ago

That's admittedly a pretty confusing example. I don't recommend ever iterating over multiple lists in a single comprehension. A mental breakdown is:

For episodes in episodes

for planet in episode planets

give me planet

the binding of `planet` isn't evaluated until the end (left to right evaluation for the looping bits)

lmm10y ago

Python for comprehensions are middle-ended for some reason: part of it goes backwards and part of it goes forwards. Read it as:

    planets_flat = [planet for planet in episode['planets'] for episode in episodes.values()]

and it makes a lot more sense. In most languages with list comprehensions you'd write the thing you were iterating over first, e.g. (Scala):

    val planetsFlat = for { episode <- episodes.values; planet <- episode(planets) } yield planet

lisper10y ago

It's a list comprehension with an implicit nested loop:

    >>> [y for z in [[1,2,3],[4,5,6]] for y in z]
    [1, 2, 3, 4, 5, 6]

The error you get is because "planet" is bound by the second FOR clause, so when you leave it out, planet becomes unbound.

andreasvc10y ago

The term nested list comprehension is used differently here: https://docs.python.org/3/tutorial/datastructures.html#neste...

Don't know what a better term might be, but it's good to be aware of the distinction.

1 more reply

andybak10y ago

> Can somebody explain this one to me?

Nope. Well - I could if I tried but that particular form is fairly abhorrent to me. My brain hurts just reading it so this is the stage I would find a clearer way to express the algorithm.

a_bonobo10y ago· 7 in thread

One interesting side-thing with list comprehensions in Python 2 vs. Python 3:

Python 2.7:

>list_of_numbers = [1,2,3]

>[x/2 for x in list_of_numbers]

>print(x)

Python 3:

>list_of_numbers = [1,2,3]

>[x/2 for x in list_of_numbers]

>print(x)

NameError: name 'x' is not defined

They "leak" their variables in Python 2, if you're someone who reuses variables this can lead to an enormous headache!

Cyph0n10y ago

Wow, I never noticed that! Who the hell thought that was a good idea?

raymondh10y ago

Barry Warsaw proprosed List Comprehensions 16 years ago in PEP 202.

The original concept was that it provided a more concise way to write loops so that:

   t = [expr(x, y) for x in s1 for y in s2]

was just a short way to write:

   _t = []
   for x in s1:
       for y in s2
           _t.append(expr(x, y))
   t = _t
   del _t

The original implementation reflected that design. I later added the LIST_APPEND opcode to give the list comprehensions a speed advantage over the unrolled code.

    s = {expr(x) for x in t}

was a short-form for:

    s = set(expr(x) for x in t)

The advent of Python 3 gave us an opportunity to make a four forms (listcomps, genexps, set and dict displays) consistent about hiding the loop induction variable.

The current state in Python 3 has the advantage of being consistent between all four variants and matches how mathematicians treat bound and free variables.

2 more replies

maxerickson10y ago

It was largely an implementation detail. This mentions it some:

http://python-history.blogspot.com/2010/06/from-list-compreh...

The PEP is silent on the issue:

https://www.python.org/dev/peps/pep-0202/

I guess mail list spelunking would probably reveal some conversations about how to handle it.

bpicolo10y ago

Pretty sure it was a bug that they kept for backwards compat, not intentional, though I may be wrong. Py3 let them shed it off though.

1 more reply

chishaku10y ago

> One interesting side-thing with list comprehensions

Just to clarify, this is the behavior of py2 for loops in general, whether in a comprehension or not.

prlin10y ago

Except dictionary comprehensions don't leak their variables.

1 more reply

bearfriezeOP10y ago

Nice bonus info. I didn't know about this (haven't spent much time with Python 2.x).

unoti10y ago· 6 in thread

  # Snippet 1
  octets = [bbs[i:i+8] for i in range(0, len(bbs), 8)]

  octets = chunks(bbs, 8)

    def chunks(collection, chunk_size):
        """Divides list l into chunks of up to n elements each.
            >>> l = range(75)
            >>> chunks(l,10)
            [[0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
            [10, 11, 12, 13, 14, 15, 16, 17, 18, 19],
            [20, 21, 22, 23, 24, 25, 26, 27, 28, 29],
            [30, 31, 32, 33, 34, 35, 36, 37, 38, 39],
            [40, 41, 42, 43, 44, 45, 46, 47, 48, 49],
            [50, 51, 52, 53, 54, 55, 56, 57, 58, 59],
            [60, 61, 62, 63, 64, 65, 66, 67, 68, 69],
            [70, 71, 72, 73, 74]]
        """
        for i in xrange(0, len(collection), chunk_size):
            yield collection[i : i + chunk_size]

kilburn10y ago

I totally agree with you on the core issue.

However, I would reject your `chunks` function in a code review and tell you to use `grouper` form the itertools recipes [1].

More generally,any time I've ended up with a long list comprehension, the answer has been to "check itertools and see how you would describe this ugly comprehension in those terms"

[1] https://docs.python.org/3/library/itertools.html#itertools-r...

unoti10y ago

Grouper is a recipe, it's not part of itertools. It probably should be!

1 more reply

lalaithion10y ago

(The article was written about python3 so I'll nitpick about this)

There is no xrange function in python 3. Now range has the functionality of xrange.

im3w1l10y ago

That doesn't work on an iterator.

unoti10y ago

Cool! When you need it to work on an input that's an iterator, please submit a patch with a doctest. That's one of the important benefits of reusable code.

mfgs10y ago

In terms of pure readability I find breaking the list comprehensions helps in more complex cases (and is still valid Python), e.g.:

    octets = [bbs[i:i+8]
              for i in range(0, len(bbs), 8)
              if i%2 != 0]

bpicolo10y ago· 4 in thread

Nesting them can get ugly, but it's easy to avoid: Just use generator expressions and chain them. No real runtime overhead that way.

nilliams10y ago

    collection
        .map(x => x * 2)
        ...
        .filter(isOdd)
        .reduce(blargh)

... style syntax that most other modern, C-style languages offer now (ES6, Rust, Ruby, C# ...)

thomasahle10y ago

The comprehension approach, while requiring more syntax, seems to often produce a more 'natural' order of operations. For example, compare

  [p for p in range(2,100) if all(p%q!=0 for q in range(2,p))]

with

  range(2,100).filter(p => range(2,p).map(q => p%q!=0).all())

It might be a small thing, but in the first one I feel the prime 'p' becomes the center piece, whereas in the second, 'p' is burrowed somewhat in the expression.

2 more replies

bpicolo10y ago

p4wnc610y ago

This is nothing new at all, it is called the "Fluent Interface" design. Many Python libraries implement this design, for instance the Pandas library is a popular example.

I hate this design approach deep in my soul. It makes for very brittle code that creates lots of backward compatibility issues. If you're working on some legacy code that has some nonsense like

    foo.get_status().dispatch_handler().log_error().close()

It's just so much better to write something like:

    dispatch_result = run_dispatcher(foo.get_status())
    log_error(dispatch_result)

    (close . logError . dispatchHandler . getStatus) foo

We can even easily refer to this whole chain of events with a single function name:

    let statusDispatchLog = (close . logError . dispatchHandler . getStatus)

To do the same thing in a fluent interface, we need a horrible lambda or a whole new function definition, exactly because the fluent interface is only sweeping the composability issues under the rug.

    statusDispatchLog = lambda x: x.get_status().dispatch_handler().log_error().close()

It always makes me sad that Python lacks an extremely short function composition infix operator that provides some information about the function signatures of the functions being composed.

Because not even a comprehension can help you when you need to do the fluent interface stuff in Python.

    [x.h().f().g() for x in some_iterator]

This is so much worse than

    map(g.f.h, someIterator)

    [(g.f.h) x | x <- someIterator]

or even

    [g(f(h(x))) for x in some_iterator]

cgriswald10y ago· 4 in thread

Pedantic SW points:

1. Rey is not a Jedi. A better dict key would be "fav_force_user".

2. For the planets, Episode II is missing Coruscant, Episode VI is missing Dagobah.

Edit: formatting

jordigh10y ago

We like hearing a story we already know. Rey is well on her way to become a Jedi, probably in two-movies' time.

zo110y ago

And don't forget, old Luke is probably going to act as the Yoda-equivalent in the next "episode".

bearfriezeOP10y ago

1: Seems pretty clear that The Force Awakens is an "...origin story of a female Jedi." [1]

2: At some point while writing this I realised I would spend all evening if I had to round up all the planets, and chose to note that the lists are "non-exhaustive" instead :)

[1]: http://www.wga.org/content/default.aspx?id=6130

bliti10y ago

Thanks, I always get my trek wars trivia mixed. Which movie is the one where Chewbacca goes into time and rescues some whales?

//Just some Sunday fun. :)

undershirt10y ago· 3 in thread

Python showed me the Force, but I'm with the Dark Side now.

hellofunk10y ago

bearfriezeOP10y ago

Author here. Thanks for this brilliant comment. May the force be with you.

seivan10y ago

Sorta like signals using RxJS/RxSwift.

d0mine10y ago· 3 in thread

There is an easier way to convert bits to ASCII, filter spaces and reverse the string:

  >>> n = int(bbs, 2)
  >>> n.to_bytes((n.bit_length() + 7) // 8, 'big').decode()
  's no  i sn e  h  e r  pm o c'
  >>> _.replace(' ', '')[::-1]
  'comprehensions'

http://stackoverflow.com/questions/7396849/convert-binary-to...

bearfriezeOP10y ago

I like the replace method. It's a great way of doing the same thing.

I considered using the [::-1] syntax to reverse the list, but decided that there was enough "cute" stuff in the examples already.

d0mine10y ago

[::-1] is an idiomatic way to reverse a string in Python that is the obvious way for a habitual user of the language to do it e.g.:

  def palindrome(s):
      return s == s[::-1]

It could be discussed whether ''.join(reversed(s)) is more readable for a novice programmer learning Python. In general, Python prefers words over punctuation.

Also, there are objects that can be reversed() that are not sequences.

http://stackoverflow.com/questions/931092/reverse-a-string-i...

tyingq10y ago

Another alternative, works on both py2/3, assuming you replace bbs='000... with bbs=b'000...

  import struct

  result=''.join([chr(int(bits,2)) for bits in struct.unpack('8p'*(int(len(bbs)/8)),bbs)])

  print(result.replace(' ','')[::-1])

matt_wulfeck10y ago· 2 in thread

I don't want to be a jedi Python programmer and exploit every neat trick of the language. I want to be a very good programmer and write code that's easy for others to read, debug, and maintain.

santaclaus10y ago

I wouldn't call list comprehensions a Jedi feature of Python -- they are pretty darn idiomatic and common.

JustSomeNobody10y ago

Comprehensions are not a neat trick. If you know and use Python, writing comprehensions is writing good code that can be understood by others.

MatthewWilkes10y ago· 1 in thread

is should not be used to check string equality, as it is in the space filter. This only works as CPython interns short strings automatically.

bearfriezeOP10y ago

Thanks for pointing this out along with some folks over in the comments on GitHub. I've updated the Gist.

distracteddev9010y ago· 1 in thread

One thing I find interesting is that everyone loves to hate CoffeeScript, but its individual features/syntax are consistently lauded in conversations about other languages.

(Not to mention half of ES6 existed in CoffeeScript first, but that's a gripe for another day)

stuartaxelowen10y ago

The composition of features is more important than just the individual features - CoffeeScript is a great example of when feature composition goes wrong.

rebootthesystem10y ago

List comprehension in Python is great. However, that's Padawan territory. If you want to be a Jedi then APL is the only way. It's not "list comprehension" it's the way of the force when you use APL.

Negative110y ago

I love Python and write code in it daily but as a (pseudo-)functional language it feels very awkward to me.

alexandercrohde10y ago

Or, there's a 1-million times better way to do this with a good library. In javascript for example

FA('0...11'.split('')).chunk(8) .map(x => parseInt(x.join(''), 2)) .map(x=>String.fromCharCode(x)) .filter(x=>x!=' ') .reverse()

https://github.com/anfurny/Fancy

RubyPinch10y ago

I just wish that python could stop being such a butt about not being like the rest of the languages

list of items --> filter list --> operate on list

becomes

operate on list <-- (list of items --> filter list)

And because this is how it was decided to tackle map/filter problems, we'll always have a weird gimped anon-function operator instead, to discourage the map/filter patterns of every other language

catnaroek10y ago

jonesb610y ago

Yes let's make the reputation of Python more cryptic and culty. Because cryptic and culty things are better right?

Han Solo said it best "Hokey religions and ancient weapons are no match for a good blaster at your side, kid."

Lets keep it explicit alright? It's better then implicit.

j / k navigate · click thread line to collapse