Understanding Python through its builtins

(sadh.life)

544 points · tusharsadhwani · 4y ago · 173 comments

submeta4y ago· 12 in thread

Nicely written article! - Slightly off topic: I love seeing and reading Python code. Used to see many flaws in the language (things like `.append` changing the object, returning `None` instead of creating a copy and returning that), but after ten years of working with Python I really appreciate its versatility, its ubiquitous availability, the large number of libraries and the community. There's nothing I can't solve with it. It's the Swiss army knife in my pocket, available whenever I need to solve something by coding.

radarsat14y ago

> (things like `.append` changing the object, returning `None` instead of creating a copy and returning that)

This would be horrendously inefficient without immutable data structures like Clojure's. Very few languages have that, so it's a strange assumption to make, especially for a language as old as Python.

Although it is a very nice feature of Clojure.

Blikkentrekker4y ago

It should also be worth noting that Clojure sacrificed quite a bit to make this as efficient as possible.

“persistent vectors” are certainly an interesting data structure that strike a compromise between fast indexing and being able to relatively quickly create a copy where only one element changes, but it's a compromise and indexing is made slower to allow for the latter. — They also take up more memory on their own but are allowed to share memory with their copies.

I will say that my ideal language contains them in the standard library alongside standard vectors that index in constant time.

Further, it should be noted that much of the performance talk is on the assumption that accessing from memory is truly random access; — with the existence of c.p.u. caches that assumption is not entirely accurate and accessing from contiguous rather than scattered memory in practice is considerably cheaper so one also pays the price for their being scattered more in memory.

1 more reply

l33tc0der4y ago

One of the many reasons why I love Clojure so much!

Rich implemented his own brand of persistent data structures which makes Clojure's immutability a lot more efficient.

1 more reply

tusharsadhwaniOP4y ago

Thank you!

Things like `list.append` modifying in-place might feel like a flaw to some, but I think Python is really consistent when it comes to its behaviour. If you ask a person who comes from an object-oriented world, they'll say it only makes sense for a method on an object to modify that object's data directly.

There's always ways to do things the other way, for example you can use x = [*x, item] to append and create a new copy, while being quite a bit more explicit that a new list is being created.
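
A minimal sketch of the two styles side by side, using made-up variable names: `append` mutates the list and hands back `None`, while unpacking into a new list literal leaves the original alone.

```python
original = [1, 2]
alias = original            # second name for the same list object

# In-place: append mutates the list and returns None.
result = original.append(3)

# Copying: unpacking builds a brand-new list; the old one is not modified.
extended = [*original, 4]

print(result)      # None
print(alias)       # [1, 2, 3]  (the alias sees the in-place change)
print(extended)    # [1, 2, 3, 4]
```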

int_19h4y ago

One related area where Python is not consistent is operators like +=.

In pretty much all other languages that have them, the expected behavior of A+=B is exactly the same as A=A+B, except that A is only evaluated once. Now let's look at lists in Python:

   xs = [1, 2]
   ys = xs
   ys = ys + [3]
   print(xs, ys)

This prints [1, 2] [1, 2, 3], because the third line created a new list, and made ys reference that. On the other hand, this:

   xs = [1, 2]
   ys = xs
   ys += [3]
   print(xs, ys)

prints [1, 2, 3] [1, 2, 3], because += changes the list itself, and both xs and ys refer to that same list.

(Note that this is not the same as C++, because in the latter, the variables store values directly, while in Python, all variables are references to values.)

The worst part of it is that Python isn't even self-consistent here. If you only define __add__ in your custom class, you can use both + and += with its instances, with the latter behaving normally. But if you define __iadd__, as list does, then you can do whatever you want - and the idiomatic behavior is to modify the instance!

For comparison, C# lets you overload + but not +=, and automatically synthesizes the latter from the former to enforce the correct behavior.
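
To make the `__add__` versus `__iadd__` difference concrete, here is a small sketch. `OnlyAdd` and `WithIAdd` are hypothetical classes invented for illustration; the second one mimics what `list` does.

```python
class OnlyAdd:
    """Hypothetical class defining __add__ but not __iadd__."""

    def __init__(self, items):
        self.items = list(items)

    def __add__(self, other):
        return OnlyAdd(self.items + other.items)


class WithIAdd(OnlyAdd):
    """Same, plus an in-place __iadd__ in the style of list."""

    def __iadd__(self, other):
        self.items.extend(other.items)
        return self


a = OnlyAdd([1, 2])
a_alias = a
a += OnlyAdd([3])       # falls back to a = a + ..., so a is rebound

b = WithIAdd([1, 2])
b_alias = b
b += WithIAdd([3])      # mutates the object that b_alias also refers to

print(a_alias.items, a.items)   # [1, 2] [1, 2, 3]
print(b_alias.items, b.items)   # [1, 2, 3] [1, 2, 3]
```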

1 more reply

tyingq4y ago

>Python is really consistent when it comes to its behaviour

True, though you end up with things like:

  ' '.join(thelist)

Instead of

  thelist.join(' ')

Because of the somewhat aggressive mantra to be consistent.

4 more replies

pokepim4y ago

Huh I never even thought we would need to create a copy of an object when adding a new item to it (like a new item to a list for example). Is there any drawback to doing that in the standard pythonic way? I actually learned to program using Python and it was my first language. Since then I only used JS. In both I like using functions a lot and rarely dabble in OOP since it is more convenient to me.

3 more replies

Alex39174y ago

> things like `.append` changing the object, returning `None` instead of creating a copy and returning that

The obvious question is why it can't return a reference to the list instead of returning None. I feel like if I've been using the language on an almost daily basis for ten years now and I still get burned by that all the time, then it's just a poorly designed feature.

canjobear4y ago

The advantage of mutating operations always returning None is that you can easily tell whether a mutation is happening by looking at the code. If you see y = f(x) that means x is unchanged, whereas if you see just f(x) on a line that means something stateful is happening.

3 more replies

aftbit4y ago

random.shuffle() has bitten me that way a few times too:

    array = random.shuffle(array)

because I expected it to return a copy or reference, instead making my array None.

It would also enable chaining operations:

    array = array.append(A).append(B).sort()

In-place vs immutable copy is a language design choice with tradeoffs on both sides, but there's no reason that I can see to not return a reference to the list.

Perhaps recognizing this is really the job of an external linter. Sometimes I wonder if the future of enforcing canonical formatting on save like "gofmt" or "black" will extend to auto-correcting certain goofy errors on each save.

mypy would yell at you about this, but afaik type-checked python still isn't the norm.
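
For anyone hitting the same trap: the standard library does have copy-returning counterparts for some of these. A small sketch, with a made-up `array`, of getting a shuffled or sorted copy without losing the original:

```python
import random

array = [3, 1, 2]

# New lists; `array` itself is left untouched by these two calls.
shuffled_copy = random.sample(array, k=len(array))
sorted_copy = sorted(array)

# In place: shuffles `array` and returns None, so don't assign the result.
returned = random.shuffle(array)

print(returned)       # None
print(sorted_copy)    # [1, 2, 3]
```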

1 more reply

brundolf4y ago

> Used to see many flaws in the language (things like `.append` changing the object, returning `None` instead of creating a copy and returning that)

I think it's pretty off-base to call this a "flaw". Immutable structures have their place and can be very helpful where appropriate, but making its core primitives work this way is far outside the scope or the philosophy of Python. If you want otherwise, you're really wanting an entirely different language. And there's nothing wrong with that! But I think it would be a "flaw" for Python to make these operations immutable, even though I love immutability personally.

listenallyall4y ago

And also like a Swiss army knife, it's not particularly great at anything, can be awkward even when functional, and there's always a better tool for any specific job.

cinntaile4y ago· 11 in thread

Are there any good books that deal with writing pythonic code? As well as being focused on more intermediate or advanced features like this? If the book is project focused that's a bonus. Performance trade-offs another bonus.

tusharsadhwaniOP4y ago

I can personally recommend Fluent Python (its 2nd edition is about to come out in a couple months) for learning these intermediate/advanced concepts, and Python Cookbook for code examples using many of these features.

I don't know any books for projects per-se, maybe HN will know!

cinntaile4y ago

To me it looks like a lot of this knowledge is spread out over many different excellent technical blogs like yours. While the content is good, it's hard to get something that resembles a more complete picture compared to just another piece of a big puzzle.

tracyhenry4y ago

Searching python on HackerNews readings (a site I built): https://hacker-recommended-books.vercel.app/category/0/all-t... The results include Fluent Python recommended by tusharsadhwani

disgruntledphd24y ago

Fluent Python is definitely a good book, I knew a whole bunch of the stuff in this article because of it. I only got to Chapter 9, but it legitimately made my Python much, much better.

asdfgeoff4y ago

Not a book, but I have found Trey Hunner's https://www.pythonmorsels.com/ exercises very useful on this front.

The solutions presented typically include both a "basic" approach, a "as pythonic as possible" approach, and a brief discussion of the trade-offs between elegance and readability, etc.

usrme4y ago

I would recommend "Robust Python" by Patrick Viafore. It teaches you a lot about type annotations (among other things) and gave me personally a whole new way of looking at the code that I write.

cinntaile4y ago

Thanks for the different suggestions, I went with this one. Fluent Python also looked promising but I can't buy the 2nd ed yet.

1 more reply

dehrmann4y ago

In a way, learning Python is harder for people experienced with another language because so much of the content you find is for first-time programmers.

That said, I think I had good luck with Writing Idiomatic Python.

matsemann4y ago

Yes!

When I learned Kotlin, I just read through the docs, and then knew of basically all the different concepts in the language.

For Python, the docs were comparably very bad. For instance, Decorators aren't mentioned even once in the "The Python Tutorial". In "The Python Language Reference" (if one even bothers to read such a dry document) it's barely mentioned in passing. How should a new user know it's a concept and how to apply it? And the language reference links only to a glossary item, and none of them specify how parameters in a decorator are supposed to work.

Pretty frustrating experience, put me a bit off the language from the get-go.

1 more reply

mattficke4y ago

Effective Python [0] is my favorite book in this category.

[0] https://effectivepython.com/

cinntaile4y ago

I'm giving this one a go as well, thanks!

eximius4y ago· 7 in thread

In addition to this, I highly recommend just reading the codebase. I haven't written C since college and it's remarkably readable.

I once tried to catalogue all the stdlib operations which release the GIL, meaning if you use only those (well, only those "heavy" bits, you can still use other small blocking glue bits), you can do "real" multithreading.

It was a fun exercise!

hultner4y ago

There's a really nice (although old now) walk through of the cpython code base on YouTube. I watched it on a long 24 hour flight between Canada and Sweden a couple of years back.

Edit: Found it! You're in for about 9 hours of quality watching. https://youtube.com/playlist?list=PLwyG5wA5gIzgTFj5KgJJ15lxq...

matheusmoreira4y ago

That's amazing. Wish there was something similar for the JVM and V8.

submeta4y ago

Excellent, thanks for sharing!

riazrizvi4y ago

And it helps much more in the actual job of writing software than in say, practicing coding puzzles

matheusmoreira4y ago

> I highly recommend just reading the codebase

Me too. When I used to write Ruby I read a lot of CRuby source code. I achieved a much deeper understanding of the language that way. Even answered some really fun stackoverflow questions.

Now the first thing I do when I see a new language is read its source code.

jamesfinlayson4y ago

Yep, when I was writing PHP I found myself digging into php-src on a regular basis to see what exactly was going on. PHP's documentation is good but the code is much more explicit.

dharmab4y ago

Agreed. CPython makes readability and maintainability a priority.

nneonneo4y ago· 7 in thread

Cute fact about __debug__: it is one of the only ways to get compile-time conditionals in Python. Performing a comparison with `if __debug__:` will output byte code for the ensuing statement if and only if the interpreter is in debug mode - notably, in `-O` mode, it will not even generate a load of __debug__ and a conditional jump, and acts as if the statement didn’t exist at all.
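
One way to see the effect without restarting the interpreter is the `optimize` argument of the built-in `compile()`, where level 1 corresponds to `-O`. This sketch only observes the behaviour (whether the guarded statement runs), not the bytecode itself; `dis.dis(code)` can be used to inspect the latter. The `ran` list and the `<demo>` filename are made up for the example.

```python
source = "if __debug__:\n    ran.append('debug-only code')\n"

results = {}
for level in (0, 1):    # 0 behaves like a normal run, 1 like `python -O`
    ran = []
    code = compile(source, "<demo>", "exec", optimize=level)
    exec(code, {"ran": ran})
    results[level] = ran

print(results)    # {0: ['debug-only code'], 1: []}
```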

Fordec4y ago

Not going to lie, if I could enable stricter compile time conditions without debug mode that would be a very welcome RFC

BiteCode_dev4y ago

Unfortunately you often can't use -O because of the 3rd party libs that didn't get the memo and use assert for error checking.

We still have -X dev and sys.flags but it's runtime only.

robot_no_4194y ago

Unless I'm misunderstanding, they are wrapping their asserts in try/catch blocks? That's... yikes.

1 more reply

alshel4y ago

Care to name any 3rd party libs in particular?

3 more replies

globular-toast4y ago

I knew that `assert` behaved in a similar way. Turns out it's actually equivalent to an `if __debug__:` https://docs.python.org/3/reference/simple_stmts.html#gramma...

tusharsadhwaniOP4y ago

that is indeed interesting. Mind if I add this in the article?

nneonneo4y ago

Sure, go ahead :)

sireat4y ago· 7 in thread

I am a pretty average Python programmer (5 years teaching, 15 years writing).

I still wonder what was the reasoning for allowing creation of local objects with the same name as builtins.

Okay it can be nice to redefine pprint as print I suppose.

Still how many sum, list, min, max, dict(!) have been erroneously redefined in beginner tutorials and beginner code.

From my experience sum and list suffer the most.

Sure there are linters that will warn you but there should be a setting for the interpreter (as in -Werror in GCC) to disallow this silliness.
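
A sketch of the trap being described, plus one escape hatch: the `builtins` module always holds the original objects, even when a local name shadows them. The function names are made up for illustration.

```python
import builtins


def total_badly(values):
    sum = 0                     # shadows the builtin inside this function
    for v in values:
        sum += v
    # Calling sum([1, 2]) here would raise TypeError: 'int' object is not callable
    return sum


def total_with_escape_hatch(values):
    sum = "shadowed"            # the name is taken locally...
    return builtins.sum(values)  # ...but the builtin is still reachable


print(total_badly([1, 2, 3]))              # 6
print(total_with_escape_hatch([1, 2, 3]))  # 6
```

The shadowing is scoped: outside those functions, `sum` is still the builtin.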

wpietri4y ago

No idea what their actual reasoning is, but here's how I think about it:

This is better for novices, because otherwise you create a whole bunch of land mines for people who are desperately trying to get something done. If they aren't aware of the built-in then they aren't trying to use it. Insisting that they become aware of something they don't want right then will be frustrating.

It's also better for experts, in that they're generally aware they're overriding a built-in and are doing it on purpose, and if not they'll have an IDE or linter reminding them.

To me, I see tooling as a spectrum from supportive to controlling. Python is very much on the supportive end. It feels controlling when I get interrupted because some programmer who has never met me programmed a tool to insist I do things their way. That would very much include insisting I respect a bunch of names they decided long ago to put in the global namespace.

nneonneo4y ago

Python itself doesn’t disallow this because there are quite a lot of builtins with useful names - for example, `file`, `id`, and `hash` to name a few. Disallowing setting these would be tantamount to adding a bunch of new keywords to the language, which they’ve been quite loath to do in general.

A good linter will catch these, so in production environments you usually don’t run into issues. I agree that it can be a beginner trap though!

int_19h4y ago

Consider what'd happen whenever a new builtin gets added.

tusharsadhwaniOP4y ago

I think linters are really effective to figure out such issues in professional code.

For students for example, I'll have to agree. Maybe having a flag or environment variable that teachers can set up for it would be a nice idea. You should start a thread on the python-ideas mailing list about this, and it might get somewhere :)

klyrs4y ago

   False = None = True

Is probably the most ridiculous thing for a language to support. And yet...

wmanley4y ago

    $ python2 -c 'False = None = True'
      File "<string>", line 1
    SyntaxError: cannot assign to None

    $ python3 -c 'False = None = True'
      File "<string>", line 1
    SyntaxError: cannot assign to False

anttihaapala4y ago

... and yet, it is a SyntaxError. False = True = None works in Python 2, and it is just because the separate Boolean type was a late addition.

1 more reply

kgm4y ago· 6 in thread

This is a neat article, but it does have some errors.

One subtle point that the post gets wrong:

> So where does that come from? The answer is that Python stores everything inside dictionaries associated with each local scope. Which means that every piece of code has its own defined “local scope” which is accessed using locals() inside that code, that contains the values corresponding to each variable name.

The dictionary returned by `locals()` is not literally a function's local namespace, it's a copy of that namespace. The actual local namespace is an array that is part of the frame object; in this way, references to local variables may happen much more quickly than would be the case if it had to look each variable up in a dictionary every time.

One consequence of this is that you can't mutate the dict returned by `locals()` in order to change the value of a function-local variable.
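
A quick sketch of that consequence, with made-up function names. Inside a function, writing to the dict from `locals()` does not rebind the variable; at module level, where `locals()` is the same dict as `globals()`, the write does stick.

```python
def try_to_rebind():
    x = 1
    locals()["x"] = 2    # mutates a dict, not the function's real local slot
    return x


def try_in_module_scope():
    namespace = {}
    # Code run by exec with a single dict behaves like module-level code,
    # where locals() returns the actual namespace dict.
    exec("y = 1\nlocals()['y'] = 2", namespace)
    return namespace["y"]


print(try_to_rebind())         # 1
print(try_in_module_scope())   # 2
```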

Another, less-subtle error in the post is this:

> int is another widely-used, fundamental primitive data type. It’s also the lowest common denominator of 2 other data types: float and complex. complex is a supertype of float, which, in turn, is a supertype of int.

> What this means is that all ints are valid as a float as well as a complex, but not the other way around. Similarly, all floats are also valid as a complex.

Oh, no no no. Python integers are arbitrary-precision integers. Floats are IEEE 754 double-precision binary floating-point values, and as such only support full integer precision up to 2^53. The int type can represent values beyond that range which the float type cannot.

And while it is true that the complex type is just two floats stuck together, I would very much not call it a supertype. It performs distinct operations.
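
The precision limit is easy to demonstrate. A small sketch (variable names are made up): the first integer past 2^53 does not survive a round trip through `float`, and a large enough int cannot be converted at all.

```python
big = 2**53 + 1              # exact as an int

as_float = float(big)        # rounds to the nearest representable double
round_trip = int(as_float)

try:
    float(10**400)           # far beyond the range of a double
    overflowed = False
except OverflowError:
    overflowed = True

print(big)          # 9007199254740993
print(round_trip)   # 9007199254740992
print(overflowed)   # True
```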

> Accessing an attribute with obj.x calls the __getattr__ method underneath. Similarly setting a new attribute and deleting an attribute calls __setattr__ and __delattr__ respectively.

Attribute lookup in Python is way more complex than this. It's an enormous tar pit, too much so to detail in this comment, but __getattr__ is most often not involved, and the `object` type doesn't even have a __getattr__ method.
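
A tiny illustration of just that last point, not of the whole lookup algorithm. `Fallback` is a hypothetical class; its `__getattr__` only runs when the ordinary lookup (driven by `__getattribute__`) fails.

```python
class Fallback:
    """Hypothetical class; __getattr__ is only a fallback hook."""

    present = "found normally"

    def __getattr__(self, name):
        # Reached only after the normal lookup has failed.
        return f"fallback for {name}"


obj = Fallback()

print(obj.present)                          # found normally
print(obj.missing)                          # fallback for missing
print(hasattr(object, "__getattr__"))       # False
print(hasattr(object, "__getattribute__"))  # True
```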

pansa24y ago

> Attribute lookup in Python is [...] an enormous tar pit

Spot on. Python is widely described as a simple language, but the complexity of attribute lookup is one thing that shows that's not true at all.

Many things in Python are easy, such as adding `@property` above a method definition to turn it into a getter. But `@property` is far from simple - the way it actually works is very complex (for example, properties have to be data descriptors, because non-data descriptors cannot override object attributes of the same name).
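
The data versus non-data distinction can be shown in a few lines. This is a sketch with invented class names: a descriptor that defines only `__get__` loses to an instance attribute of the same name, while one that also defines `__set__` wins, which is the property-like behaviour.

```python
class NonData:
    """Defines only __get__, so instance attributes can shadow it."""

    def __get__(self, instance, owner):
        return "from non-data descriptor"


class Data:
    """Also defines __set__, which makes it a data descriptor."""

    def __get__(self, instance, owner):
        return "from data descriptor"

    def __set__(self, instance, value):
        raise AttributeError("read-only")


class Holder:
    plain = NonData()
    guarded = Data()


h = Holder()
h.__dict__["plain"] = "instance value"     # wins over the non-data descriptor
h.__dict__["guarded"] = "instance value"   # ignored: data descriptors take priority

print(h.plain)     # instance value
print(h.guarded)   # from data descriptor
```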

dotancohen4y ago

  > Python is widely described as a simple language, but the complexity
  > of attribute lookup is one thing that shows that's not true at all.

Python is a simple language to _learn_. My children learned the basics of Python before their seventh birthdays. But Python is not a simple language to _implement_.

1 more reply

btown4y ago

https://blog.peterlamut.com/2018/11/04/python-attribute-look... is a great overview of how this works. Start with the summary at the end!

The really cool thing about this, how descriptors have their __get__ called, is that methods are implemented this way. So when you access instance.method(), it’s a normal lookup for the attribute named “method”, which is (normally) itself a descriptor, so the __get__ magic is called and this binds the method to the instance at the moment it’s needed! Then you can just call it like a normal function. It’s incredibly elegant but extremely obscure. And vital to understand if you want to dive into monkey patching, which is an incredible skill to have!
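
The binding step can be done by hand to see what attribute access does behind the scenes. `Greeter` is a made-up class for the sketch.

```python
class Greeter:
    def greet(self):
        return "hello"


g = Greeter()

# The class stores a plain function...
raw_function = Greeter.__dict__["greet"]

# ...and calling its __get__ is what produces the bound method.
bound_by_hand = raw_function.__get__(g, Greeter)

print(bound_by_hand())              # hello
print(bound_by_hand.__self__ is g)  # True
print(bound_by_hand == g.greet)     # True
```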

1 more reply

tusharsadhwaniOP4y ago

Oh, I didn't know that about locals!

Yeah, calling float and complex "supertypes" probably wasn't the best idea, but I couldn't think of a better explanation that wouldn't take too long to explain. I'll ponder about that one.

the getattr thing seems like a huge rabbit hole, I'm totally going to look into this. Thank you :)

wizzwizz44y ago

The “complex > real > int” thing is true in mathematics. In Python, `bool` inherits from `int`.

kgm4y ago

Yeah, but we're not talking about pure mathematics. We're talking about floats, and I find that it's very important to be clear about the limitations. It's easy to get some nasty bugs if you start assuming that you can cram just any int into a float.

And I have no objections to the article's description of the bool type.

DangitBobby4y ago· 6 in thread

Very well written. Fun little tidbit, Django abuses the fact that bools are ints in its partition util:

https://github.com/django/django/blob/01bf679e59850bb7b3e639...

int_19h4y ago

It does make for some amusing code golf techniques, e.g.:

   print([
      f"{(not x % 3) * 'Fizz'}{(not x % 5) * 'Buzz'}" or x for x in range(1, 20)
   ])

DangitBobby4y ago

Fantastic

np_tedious4y ago

cool!

  >>> l = ['a','b']
  >>> l[False]
  'a'
  >>> l[True]
  'b'
  >>> d = {0: 'a', 1: 'b'}
  >>> d[False]
  'a'
  >>> d[True]
  'b'
  >>> # TIL!

hultner4y ago

Where? I don't see anything in that code relying on that unless I'm Sunday blind.

iezepov4y ago

`results[predicate(item)]` here they get the first and the second elements of a tuple. Essentially it’s `results[False]` and `results[True]`

1 more reply

NegativeLatency4y ago

Results has 2 elements indexed by 0 and 1 (it’s a tuple not a dict)
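
For readers who don't want to follow the link, the trick looks roughly like this. This is a sketch written for this thread, not Django's code verbatim; the explicit `bool(...)` is an addition so that predicates returning other truthy values still index 0 or 1.

```python
def partition(predicate, values):
    """Split values into (items failing the predicate, items passing it)."""
    results = ([], [])
    for item in values:
        # False and True index the tuple as 0 and 1.
        results[bool(predicate(item))].append(item)
    return results


odds, evens = partition(lambda n: n % 2 == 0, range(6))
print(odds)    # [1, 3, 5]
print(evens)   # [0, 2, 4]
```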

matsemann4y ago· 5 in thread

> List comprehensions are basically a more Pythonic, more readable way to write these exact same things

More pythonic maybe, but you can't have more than a single expression in a list comprehension without it becoming completely unintelligible. I also often miss other standard list features. Reduce, flatmap, indexed versions, utils like first of predicate, split, filternonnull etc
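
Several of these do have standard-library spellings, though as free functions rather than list methods. A sketch with made-up data, covering reduce, flattening, "first matching a predicate" and filtering out `None`:

```python
from functools import reduce
from itertools import chain

nested = [[1, 2], [3], [4, 5]]

product = reduce(lambda acc, n: acc * n, [1, 2, 3, 4])     # reduce
flat = list(chain.from_iterable(nested))                    # flatten / flatmap
first_even = next((n for n in flat if n % 2 == 0), None)    # first of predicate
non_null = [n for n in [1, None, 2, None] if n is not None]  # filter non-null

print(product, flat, first_even, non_null)
# 24 [1, 2, 3, 4, 5] 2 [1, 2]
```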

weatherlight4y ago

Anything remotely interesting like that is dumped in itertools.

Python's creator, Guido van Rossum, doesn't like functional/functional-ish programming a lot. That's well-known.

Guido: "I value readability and usefulness for real code. There are some places where map() and filter() make sense, and for other places Python has list comprehensions. I ended up hating reduce() because it was almost exclusively used (a) to implement sum(), or (b) to write unreadable code. So we added built-in sum() at the same time we demoted reduce() from a built-in to something in functools (which is a dumping ground for stuff I don't really care about :-)."

glaucon4y ago

> "I value readability and usefulness for real code"

Amen.

There are plenty of languages where your code ends up looking like an entry in an obfuscation competition without even trying. If you're using Python, and working for me, I expect the code to be readable by anyone.

And, no, I don't give a toss whether the code is three times the length it might have been if it was dangerously, and expensively, obscure.

2 more replies

matsemann4y ago

Yes, lots of them are available. But I also would like to be able to call .sum() on my iterable at the end of a chain, instead of having to mentally unwrap sum(map(filter(filter(map(...)))))

4 more replies

Doxin4y ago

Also definitely learn about using sum on non-numbers, and the key argument to min and max. They can be incredibly handy, but I hardly see them used. Have a contrived example:

    >>> max(['aaa', 'bb', 'c'], key=lambda item: len(item))
    'aaa'
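
Two more small cases along the same lines, again contrived: `key` accepts any callable (so `len` works directly), and `sum` works on lists when given a list as the start value. Note that repeated list concatenation gets slow for large inputs, and that `sum` refuses strings outright in favour of `''.join`.

```python
words = ["aaa", "bb", "c"]

shortest = min(words, key=len)               # key can be any callable
flattened = sum([[1, 2], [3], [4]], [])      # the start value makes sum work on lists

print(shortest)    # c
print(flattened)   # [1, 2, 3, 4]
```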

BerislavLopac4y ago

Are you saying that:

    l = []
    for a in range(10):
        for b in range(10):
            for c in range(10):
                l.append(a + b + c)

is more intelligible than:

    l = [
        a + b + c
        for a in range(10)
        for b in range(10)
        for c in range(10)
    ]

???

thrdbndndn4y ago· 3 in thread

>As a bonus, this also adds support for adding two MyNumber classes together:

Merely having `__add__` (without `__radd__`) is enough to add two MyNumber classes together in your case.

>It mostly exists to support type annotations,

The link for "type annotations" is broken.

tusharsadhwaniOP4y ago

Type annotations link is fixed, thank you!

Just `__add__` doesn't work for me. I get:

    TypeError: unsupported operand type(s) for +: 'int' and 'Number'

Here's the code:

    class Number:
        def __add__(self, x):
            return 42 + x

    num = Number()
    print(num + num)

Edit: Okay. replacing `42 + x` with `x + 42` actually makes it work. But I'll be honest I have no idea what happened there.

thrdbndndn4y ago

It took me quite a long time to understand what happened there too!

(Pseudo code, let's call them num1 and num2 for better readability)

    num1 + num2 
    = num1.__add__(num2) 
    = num2 + 42 (that's why the order is important)
    = num2.__add__(42) 
    = 42 + 42 
    = 84
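
Where `__radd__` does become necessary is when the left operand is a type that knows nothing about yours. A sketch with a hypothetical `Number` wrapper (not the article's class):

```python
class Number:
    """Hypothetical wrapper around a plain value."""

    def __init__(self, value):
        self.value = value

    def __add__(self, other):       # handles Number + something
        other_value = other.value if isinstance(other, Number) else other
        return Number(self.value + other_value)

    def __radd__(self, other):      # handles something + Number
        return self.__add__(other)


left = Number(1) + 41            # uses __add__
right = 41 + Number(1)           # int can't handle it, so Python tries __radd__
both = Number(1) + Number(41)    # __add__ alone is enough here

print(left.value, right.value, both.value)   # 42 42 42
```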

thrdbndndn4y ago

>Type annotations link is fixed, thank you!

I'm not sure about that - it's still `<a href="mypy-guide">type annotations</a>` which jumps to https://sadh.life/post/builtins/mypy-guide and then jumps to your homepage.

ripe4y ago· 2 in thread

Thank you for writing the post. Newbie question about “nonlocal” from your example:

    def outer_function():
        x = 11

        def inner_function():
            nonlocal x
            x = 22
            print('Inner x:', x)

        inner_function()
        print('Outer x:', x)

I get how the example works, but don’t see the point of the declaration? If I just left out the “nonlocal x” line, wouldn’t the example still work the same?

ekimekim4y ago

Python assumes that all assignments assign to the current scope. So by default when you reach "x = 22", it would create a new variable called "x" in the inner_function() scope which overrides the variable "x" in the outer_function() scope. So when you print "Inner x" you would only be printing the inner_function() version of x, not the outer_function() version, which would remain at 11.
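
Running both variants side by side makes the difference visible. The function names here are made up; each returns the inner and outer value of `x`.

```python
def without_nonlocal():
    x = 11

    def inner():
        x = 22          # creates a new local x inside inner
        return x

    inner_x = inner()
    return inner_x, x   # the outer x is unchanged


def with_nonlocal():
    x = 11

    def inner():
        nonlocal x
        x = 22          # rebinds the enclosing function's x
        return x

    inner_x = inner()
    return inner_x, x


print(without_nonlocal())   # (22, 11)
print(with_nonlocal())      # (22, 22)
```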

faho4y ago

This is a consequence of python not having explicit variable definition. Here it'll decide to define a new x instead of seeing the old one.

And that's also usually what you want because otherwise a function would start altering variables in the enclosing scope if they happen to exist!

E.g.

    foo = 42

    def myfunc(bar):
        foo = bar + 1
        print(foo)
    myfunc(6)
    print(foo) # would print "7" if the "foo =" above took the nonlocal foo automatically!

So the trade-off is to require "nonlocal" if you ever need a variable from the enclosing scope.

tyilo4y ago· 2 in thread

> Python has exactly 6 primitive data types (well, actually just 5, but we’ll get to that). 4 of these are numerical in nature, and the other 2 are text-based. Let’s talk about the text-based first, because that’s going to be much simpler.

What is your definition of a primitive data type? All of these have object as a superclass, so I wouldn't call them primitive data types in python.

Maybe there is just 1 primitive type: type? Or none at all?

tusharsadhwaniOP4y ago

In my case I meant it as things that only extend from object. Kinda like how prime numbers only have two factors including themselves.

tyilo4y ago

What about list, dict, set, tuple, range and map then?

collsni4y ago· 2 in thread

You know what helped me with python? Breakpoints inside vscode's module. It all just kinda clicked.

yuy9106164y ago

python actually has a built-in `breakpoint()` function. I think it brings up a repl at the line.

I've been using that instead of print debug, it's been great.

int_19h4y ago

It's actually customizable via the PYTHONBREAKPOINT environment variable and sys.breakpointhook(). The default does pdb.set_trace(), which gives you a built-in debugger prompt (not a REPL) at that location. But it can be set to execute arbitrary code, and most Python IDEs make it behave like "normal" breakpoints.
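
A minimal sketch of the hook mechanism, assuming nothing beyond `sys.breakpointhook` and `sys.__breakpointhook__`. The `recording_hook` function is invented for the example; it records the call instead of opening pdb.

```python
import sys

hits = []


def recording_hook(*args, **kwargs):
    """Hypothetical hook: record the call instead of starting a debugger."""
    hits.append("breakpoint reached")


sys.breakpointhook = recording_hook
breakpoint()                                  # calls recording_hook, not pdb
sys.breakpointhook = sys.__breakpointhook__   # restore the default behaviour

print(hits)   # ['breakpoint reached']
```

Setting the environment variable `PYTHONBREAKPOINT=0` disables `breakpoint()` calls entirely under the default hook, which is handy when one slips into production code.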

faho4y ago· 1 in thread

The list comparison here is also true when the first list is a prefix of the other:

    class list:
        def __eq__(self, other):
            return all(x == y for x, y in zip(self, other))

            # Can also be written as:
            return all(self[i] == other[i] for i in range(len(self)))

run that with `[1,2,3]` and `[1,2,3,4]` and it'll be true because it only checks up to the 3.

It's probably simplest to compare `len(self) == len(other)` before.

Similarly, the set comparison will also be true if the first is a subset of the other.
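
A sketch of the suggested fix, written as a free function (`lists_equal` is a made-up name) rather than as a method on a pretend `list` class:

```python
def lists_equal(first, second):
    """Sketch of list equality: same length, then pairwise-equal items."""
    if len(first) != len(second):
        return False
    return all(x == y for x, y in zip(first, second))


print(lists_equal([1, 2, 3], [1, 2, 3]))      # True
print(lists_equal([1, 2, 3], [1, 2, 3, 4]))   # False
```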

tusharsadhwaniOP4y ago

Yup, I'll fix this, thanks for pointing it out.

tored4y ago· 1 in thread

Thanks, got a better understanding of the Python "philosophy" because of this article, easy to follow even if you haven’t written a single line of Python like me.

tusharsadhwaniOP4y ago

That's great to hear :D hopefully you'll try out Python sometime.

Pearse4y ago· 1 in thread

This is such a good write up for someone like myself still trying to get their head around Python.

I can appreciate this must have taken considerable effort, It reads really well. Thank you!

tusharsadhwaniOP4y ago

I was concerned if my writing style will click with people, this is good to know :)

escanor4y ago· 1 in thread

> for index, item in enumerate(menu):

should be

> for index, item in enumerate(menu, start=1):

for the example to be correct :)
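
The difference in one glance, with a made-up `menu`:

```python
menu = ["eggs", "spam", "bacon"]   # hypothetical menu items

zero_based = [f"{index}: {item}" for index, item in enumerate(menu)]
one_based = [f"{index}: {item}" for index, item in enumerate(menu, start=1)]

print(zero_based)   # ['0: eggs', '1: spam', '2: bacon']
print(one_based)    # ['1: eggs', '2: spam', '3: bacon']
```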

tusharsadhwaniOP4y ago

you're right, thanks!

tyingq4y ago

In a somewhat similar way, this post about hacking the import system to load modules from strings finally helped me understand how imports work:

https://cprohm.de/blog/python-packages-in-a-single-file/

xojoc4y ago

In a similar vein you may like "WTF Python: Exploring and understanding Python through surprising snippets":

https://github.com/satwikkansal/wtfpython

HN thread: https://news.ycombinator.com/item?id=26097732 (163 comments)

PS: found with a site I'm building: https://discussions.xojoc.pw/?q=Understanding+Python+through...

ps1734y ago

I am not even half way through it and I now understand how python actually works under hood. This is great for understanding how a lot of interpreted languages work

tallguytyo4y ago

Your article is really easy to get through and digest. I'd like to keep it as a reference going forward. Please consider adding a floating TOC to the page.

d_burfoot4y ago

Great article. This is what I come to HN to find.

howolduis4y ago

ever heard of line separator?

j / k navigate · click thread line to collapse

173 comments

96 comments · 22 top-level

submeta4y ago· 12 in thread

radarsat14y ago

> (things like `.append` changing the object, returning `None` instead of creating a copy and returning that)

Although it is a very nice feature of Clojure.

Blikkentrekker4y ago

It should also be worth nothing that Clojure sacrificed quite a bit to make this as efficient as possible.

I will say that my ideal language contains them in the standard library alongside standard vectors that index in constant time.

1 more reply

l33tc0der4y ago

One of the many reasons why I love Clojure so much!

Rich implement his own brand of persistent data structures which makes Clojure's immutability a lot more efficient.

1 more reply

tusharsadhwaniOP4y ago

Thank you!

There's always ways to do things the other way, for example you can use x = [*x, item] to append and create a new copy, while being quite a bit more explicit that a new list is being created.

int_19h4y ago

One related area where Python is not consistent is operators like +=.

In pretty much all other languages that have them, the expected behavior of A+=B is exactly the same as A=A+B, except that A is only evaluated once. Now lets look at lists in Python:

   xs = [1, 2]
   ys = xs
   ys = ys + [3]
   print(xs, ys)

This prints [1, 2] [1, 2, 3], because the third line created a new list, and made ys reference that. On the other hand, this:

   xs = [1, 2]
   ys = xs
   ys += [3]
   print(xs, ys)

prints [1, 2, 3] [1, 2, 3], because += changes the list itself, and both xs and ys refer to that same list.

(Note that this is not the same as C++, because in the latter, the variables store values directly, while in Python, all variables are references to values.)

For comparison, C# lets you overload + but not +=, and automatically synthesizes the latter from the former to enforce the correct behavior.
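
A rough sketch of the mechanism behind this: `+=` calls `__iadd__` when the type defines it and otherwise falls back to `__add__` plus rebinding. Tuples define no `__iadd__`, so they behave like the "expected" case. The `Bag` class is a hypothetical toy, not anything from the article.

```python
xs = (1, 2)
ys = xs
ys += (3,)          # tuple has no __iadd__, so this is ys = ys + (3,)
print(xs, ys)       # (1, 2) (1, 2, 3)


class Bag:
    """Toy container (hypothetical) that opts in to in-place addition."""

    def __init__(self, items):
        self.items = list(items)

    def __add__(self, other):
        return Bag(self.items + other.items)   # builds a new object

    def __iadd__(self, other):
        self.items.extend(other.items)         # mutates in place
        return self                            # += rebinds to whatever is returned


a = Bag([1, 2])
b = a
b += Bag([3])
print(a is b, a.items)   # True [1, 2, 3]

c = a + Bag([4])
print(c is a, c.items)   # False [1, 2, 3, 4]
```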


tyingq4y ago

>Python is really consistent when it comes to its behaviour

True, though you end up with things like:

  ' '.join(thelist)

Instead of

  thelist.join(' ')

Because of the somewhat aggressive mantra to be consistent.


pokepim4y ago


Alex39174y ago

> things like `.append` changing the object, returning `None` instead of creating a copy and returning that

canjobear4y ago


aftbit4y ago

random.shuffle() has bitten me that way a few times too:

    array = random.shuffle(array)

because I expected it to return a copy or reference, instead making my array None.

It would also enable chaining operations:

    array = array.append(A).append(B).sort()

In-place vs immutable copy is a language design choice with tradeoffs on both sides, but there's no reason that I can see to not return a reference to the list.

mypy would yell at you about this, but afaik type-checked python still isn't the norm.
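
For anyone hitting the same thing, a small sketch of the standard-library calls that do return a new list (the `array` contents are made up): `random.sample` with `k=len(array)` gives a shuffled copy, and `sorted` is the copying counterpart of `list.sort`.

```python
import random

array = [3, 1, 2]

result = random.shuffle(array)   # shuffles in place
print(result)                    # None

shuffled_copy = random.sample(array, k=len(array))  # new list, original untouched
ordered_copy = sorted(array)                        # new list, unlike list.sort()
print(sorted(shuffled_copy) == ordered_copy)        # True
```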


brundolf4y ago

> Used to see many flaws in the language (things like `.append` changing the object, returning `None` instead of creating a copy and returning that)

listenallyall4y ago

And also like a Swiss army knife, it's not particularly great at anything, can be awkward even when functional, and there's always a better tool for any specific job.

cinntaile4y ago· 11 in thread

tusharsadhwaniOP4y ago

I don't know any books for projects per se, maybe HN will know!

cinntaile4y ago

tracyhenry4y ago

Searching python on HackerNews readings (a site I built): https://hacker-recommended-books.vercel.app/category/0/all-t... The results include Fluent Python recommended by tusharsadhwani

disgruntledphd24y ago

Fluent Python is definitely a good book, I knew a whole bunch of the stuff in this article because of it. I only got to Chapter 9, but it legitimately made my Python much, much better.

asdfgeoff4y ago

Not a book, but I have found Trey Hunner's https://www.pythonmorsels.com/ exercises very useful on this front.

The solutions presented typically include both a "basic" approach, a "as pythonic as possible" approach, and a brief discussion of the trade-offs between elegance and readability, etc.

usrme4y ago

I would recommend "Robust Python" by Patrick Viafore. It teaches you a lot about type annotations (among other things) and gave me personally a whole new way of looking at the code that I write.

cinntaile4y ago

Thanks for the different suggestions, I went with this one. Fluent Python also looked promising but I can't buy the 2nd ed yet.


dehrmann4y ago

In a way, learning Python is harder for people experienced with another language because so much of the content you find is for first-time programmers.

That said, I think I had good luck with Writing Idiomatic Python.

matsemann4y ago

Yes!

When I learned Kotlin, I just read through the docs, and then knew of basically all the different concepts in the language.

Pretty frustrating experience, put me a bit off the language from the get-go.


mattficke4y ago

Effective Python [0] is my favorite book in this category.

[0] https://effectivepython.com/

cinntaile4y ago

I'm giving this one a go as well, thanks!

eximius4y ago· 7 in thread

In addition to this, I highly recommend just reading the codebase. I haven't written C since college and it's remarkably readable.

It was a fun exercise!

hultner4y ago

There's a really nice (although old now) walk through of the cpython code base on YouTube. I watched it on a long 24 hour flight between Canada and Sweden a couple of years back.

Edit: Found it! You're in for about 9 hours of quality watching. https://youtube.com/playlist?list=PLwyG5wA5gIzgTFj5KgJJ15lxq...

matheusmoreira4y ago

That's amazing. Wish there was something similar for the JVM and V8.

submeta4y ago

Excellent, thanks for sharing!

riazrizvi4y ago

And it helps much more in the actual job of writing software than in say, practicing coding puzzles

matheusmoreira4y ago

> I highly recommend just reading the codebase

Me too. When I used to write Ruby I read a lot of CRuby source code. I achieved a much deeper understanding of the language that way. Even answered some really fun stackoverflow questions.

Now the first thing I do when I see a new language is read its source code.

jamesfinlayson4y ago

Yep, when I was writing PHP I found myself digging into php-src on a regular basis to see what exactly was going on. PHP's documentation is good but the code is much more explicit.

dharmab4y ago

Agreed. CPython makes readability and maintainability a priority.

nneonneo4y ago· 7 in thread

Fordec4y ago

Not going to lie, if I could enable stricter compile time conditions without debug mode that would be a very welcome RFC

BiteCode_dev4y ago

Unfortunately you often can't use -O because of the 3rd party libs that didn't get the memo and use assert for error checking.

We still have -X dev and sys.flags but it's runtime only.

robot_no_4194y ago

Unless I'm misunderstanding, they are wrapping their asserts in try/catch blocks? That's... yikes.


alshel4y ago

Care to name any 3rd party libs in particular?


globular-toast4y ago

I knew that `assert` behaved in a similar way. Turns out it's actually equivalent to an `if __debug__:` https://docs.python.org/3/reference/simple_stmts.html#gramma...
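
A sketch of that equivalence, with hypothetical function names. The first two functions should behave the same with and without `python -O`; the third shows the usual alternative for checks that must survive optimization.

```python
def checked_with_assert(n):
    assert n >= 0, "n must be non-negative"   # compiled away under `python -O`
    return n


def checked_explicitly(n):
    # Roughly what the assert above expands to, per the language reference.
    if __debug__:
        if not n >= 0:
            raise AssertionError("n must be non-negative")
    return n


def validated(n):
    # An ordinary exception is not affected by -O.
    if n < 0:
        raise ValueError("n must be non-negative")
    return n


print(checked_with_assert(3), checked_explicitly(3), validated(3))   # 3 3 3
```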

tusharsadhwaniOP4y ago

that is indeed interesting. Mind if I add this in the article?

nneonneo4y ago

Sure, go ahead :)

sireat4y ago· 7 in thread

I am a pretty average Python programmer (5 years teaching, 15 years writing).

I still wonder what was the reasoning for allowing creation of local objects with the same name as builtins.

Okay it can be nice to redefine pprint as print I suppose.

Still how many sum, list, min, max, dict(!) have been erroneously redefined in beginner tutorials and beginner code.

From my experience sum and list suffer the most.

Sure there are linters that will warn you but there should be a setting for the interpreter (as in -Werror in GCC) to disallow this silliness.
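
To make the trap concrete, a small sketch with made-up function names: a local called `list` hides the builtin for the whole function, while the original stays reachable through the `builtins` module.

```python
import builtins


def broken(values):
    list = values            # local name shadows the builtin in this function
    try:
        return list("abc")   # calls the local list object, not the builtin
    except TypeError as exc:
        return f"TypeError: {exc}"


def recovered(values):
    list = values
    return builtins.list("abc")   # the original is still available here


print(broken([1, 2, 3]))     # TypeError: 'list' object is not callable
print(recovered([1, 2, 3]))  # ['a', 'b', 'c']
```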

wpietri4y ago

No idea what their actual reasoning is, but here's how I think about it:

It's also better for experts, in that they're generally aware they're overriding a built-in and are doing it on purpose, and if not they'll have an IDE or linter reminding them.

nneonneo4y ago

A good linter will catch these, so in production environments you usually don’t run into issues. I agree that it can be a beginner trap though!

int_19h4y ago

Consider what'd happen whenever a new builtin gets added.

tusharsadhwaniOP4y ago

I think linters are really effective to figure out such issues in professional code.

klyrs4y ago

   False = None = True

Is probably the most ridiculous thing for a language to support. And yet...

wmanley4y ago

    $ python2 -c 'False = None = True'
      File "<string>", line 1
    SyntaxError: cannot assign to None

    $ python3 -c 'False = None = True'
      File "<string>", line 1
    SyntaxError: cannot assign to False

anttihaapala4y ago

... and yet, it is a SyntaxError. False = True = None works in Python 2, and it is just because the separate Boolean type was a late addition.


kgm4y ago· 6 in thread

This is a neat article, but it does have some errors.

One subtle point that the post gets wrong:

One consequence of this is that you can't mutate the dict returned by `locals()` in order to change the value of a function-local variable.
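
A minimal sketch of that consequence (the function name is made up): inside a function, writing to the dict returned by `locals()` does not rebind the real local variable.

```python
def try_to_mutate():
    x = 1
    locals()["x"] = 2   # writes to a dict, not to the function's local variable
    return x


print(try_to_mutate())   # 1
```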

Another, less-subtle error in the post is this:

> What this means is that all ints are valid as a float as well as a complex, but not the other way around. Similarly, all floats are also valid as a complex.

And while it is true that the complex type is just two floats stuck together, I would very much not call it a supertype. It performs distinct operations.

> Accessing an attribute with obj.x calls the __getattr__ method underneath. Similarly setting a new attribute and deleting an attribute calls __setattr__ and __delattr__ respectively.
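
One documented detail the quoted sentence glosses over, as a small sketch with a hypothetical `Demo` class: `obj.x` goes through `__getattribute__` first, and `__getattr__` is only a fallback for names that normal lookup cannot find.

```python
class Demo:
    def __init__(self):
        self.present = "found normally"

    def __getattr__(self, name):
        # Called only when normal lookup (instance dict, class, descriptors) fails.
        return f"fallback for {name!r}"


d = Demo()
print(d.present)   # found normally
print(d.missing)   # fallback for 'missing'
```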

pansa24y ago

> Attribute lookup in Python is [...] an enormous tar pit

Spot on. Python is widely described as a simple language, but the complexity of attribute lookup is one thing that shows that's not true at all.

dotancohen4y ago

  > Python is widely described as a simple language, but the complexity
  > of attribute lookup is one thing that shows that's not true at all.

Python is a simple language to _learn_. My children learned the basics of Python before their seventh birthdays. But Python is not a simple language to _implement_.


btown4y ago

https://blog.peterlamut.com/2018/11/04/python-attribute-look... is a great overview of how this works. Start with the summary at the end!


tusharsadhwaniOP4y ago

Oh, I didn't know that about locals!

Yeah, calling float and complex "supertypes" probably wasn't the best idea, but I couldn't think of a better explanation that wouldn't take too long to explain. I'll ponder about that one.

the getattr thing seems like a huge rabbit hole, I'm totally going to look into this. Thank you :)

wizzwizz44y ago

The “complex > real > int” thing is true in mathematics. In Python, `bool` inherits from `int`.
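
A quick sketch of how this looks at the interpreter: the concrete types `int`, `float` and `complex` do not inherit from each other, while the abstract classes in the standard `numbers` module do model the mathematical tower.

```python
import numbers

print(issubclass(bool, int))               # True
print(True + True)                         # 2
print(isinstance(1, float))                # False: int does not inherit from float
print(isinstance(1, numbers.Real))         # True: the ABCs model the tower
print(isinstance(1.5, numbers.Complex))    # True
print(isinstance(1.5, numbers.Integral))   # False
print(1 == 1.0 == 1 + 0j)                  # True: values still compare equal
```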

kgm4y ago

And I have no objections to the article's description of the bool type.

DangitBobby4y ago· 6 in thread

Very well written. Fun little tidbit, Django abuses the fact that bools are ints in its partition util:

https://github.com/django/django/blob/01bf679e59850bb7b3e639...

int_19h4y ago

It does make for some amusing code golf techniques, e.g.:

   print([
      f"{(not x % 3) * 'Fizz'}{(not x % 5) * 'Buzz'}" or x for x in range(1, 20)
   ])

DangitBobby4y ago

Fantastic

np_tedious4y ago

cool!

  >>> l = ['a','b']
  >>> l[False]
  'a'
  >>> l[True]
  'b'
  >>> d = {0: 'a', 1: 'b'}
  >>> d[False]
  'a'
  >>> d[True]
  'b'
  >>> # TIL!

hultner4y ago

Where? I don't see anything in that code relying on that unless I'm Sunday blind.

iezepov4y ago

`results[predicate(item)]` here they get the first and the second elements of a tuple. Essentially it’s `results[False]` and `results[True]`


NegativeLatency4y ago

Results has 2 elements indexed by 0 and 1 (it’s a tuple not a dict)
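
A sketch of the trick being described, written from the description in this thread rather than copied from Django: the predicate's `False`/`True` result is used directly as index 0 or 1 into a two-element tuple.

```python
def partition(predicate, values):
    """Split values into (predicate false, predicate true) lists.

    The predicate is assumed to return a real bool (or 0/1).
    """
    results = ([], [])
    for item in values:
        results[predicate(item)].append(item)   # False -> index 0, True -> index 1
    return results


odds, evens = partition(lambda n: n % 2 == 0, range(6))
print(odds, evens)   # [1, 3, 5] [0, 2, 4]
```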

matsemann4y ago· 5 in thread

> List comprehensions are basically a more Pythonic, more readable way to write these exact same things

weatherlight4y ago

Anything remotely interesting like that is dumped in itertools.

Python's creator, Guido van Rossum, doesn't like functional/functional-ish programming a lot. That's well-known.

glaucon4y ago

> "I value readability and usefulness for real code"

Amen.

And, no, I don't give a toss whether the code is three times the length it might have been if it was dangerously, and expensively, obscure.


matsemann4y ago

Yes, lots of them are available. But I also would like to be able to call .sum() on my iterable at the end of a chain, instead of having to mentally unwrap sum(map(filter(filter(map(...))))


Doxin4y ago

Also definitely learn about using sum on non-numbers, and the key argument to min and max. They can be incredibly handy, but I hardly see them used. Have a contrived example:

    >>> max(['aaa', 'bb', 'c'], key=lambda item: len(item))
    'aaa'
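
A few more contrived examples in the same spirit (all data here is made up): `sum` takes a start value as its second argument, which is what makes it work for non-numbers, and `key` can be any callable such as `len`.

```python
from datetime import timedelta

durations = [timedelta(minutes=5), timedelta(minutes=10)]
print(sum(durations, timedelta()))   # 0:15:00

print(sum([[1], [2, 3]], []))        # [1, 2, 3]

words = ["kiwi", "fig", "banana"]
print(min(words, key=len), max(words, key=len))   # fig banana
```

Note that `sum` refuses strings on purpose and points to `''.join(...)` instead.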

BerislavLopac4y ago

Are you saying that:

    l = []
    for a in range(10):
        for b in range(10):
            for c in range(10):
                l.append(a + b + c)

is more intelligible than:

    l = [
        a + b + c
        for a in range(10)
        for b in range(10)
        for c in range(10)
    ]

???

thrdbndndn4y ago· 3 in thread

>As a bonus, this also adds support for adding two MyNumber classes together:

Merely having `__add__` (without `__radd__`) is enough to add two MyNumber classes together in your case.

>It mostly exists to support type annotations,

The link for "type annotations" is broken.

tusharsadhwaniOP4y ago

Type annotations link is fixed, thank you!

Just `__add__` doesn't work for me. I get:

    TypeError: unsupported operand type(s) for +: 'int' and 'Number'

Here's the code:

    class Number:
        def __add__(self, x):
            return 42 + x

    num = Number()
    print(num + num)

Edit: Okay. replacing `42 + x` with `x + 42` actually makes it work. But I'll be honest I have no idea what happened there.

thrdbndndn4y ago

It took me quite a long time to understand what happened there too!

(Pseudo code, let's call them num1 and num2 for better readability)

    num1 + num2 
    = num1.__add__(num2) 
    = num2 + 42 (that's why the order is important)
    = num2.__add__(42) 
    = 42 + 42 
    = 84
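
A sketch of where `__radd__` does matter, using a variation of the `Number` class above (the `value` attribute is my addition): when the left operand is a plain `int`, `int.__add__` returns `NotImplemented` and Python then tries the right operand's `__radd__`.

```python
class Number:
    def __init__(self, value=42):
        self.value = value

    def __add__(self, other):
        # Handles Number + Number and Number + int.
        if isinstance(other, Number):
            return self.value + other.value
        if isinstance(other, int):
            return self.value + other
        return NotImplemented   # lets Python try the other operand's __radd__

    def __radd__(self, other):
        # Reached for int + Number, after int.__add__ returns NotImplemented.
        return self.__add__(other)


num = Number()
print(num + num)   # 84
print(num + 1)     # 43
print(1 + num)     # 43
```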

thrdbndndn4y ago

>Type annotations link is fixed, thank you!

I'm not sure about that - it's still `<a href="mypy-guide">type annotations</a>` which jumps to https://sadh.life/post/builtins/mypy-guide and then jumps to your homepage.

ripe4y ago· 2 in thread

Thank you for writing the post. Newbie question about “nonlocal” from your example:

    def outer_function():
        x = 11

        def inner_function():
            nonlocal x
            x = 22
            print('Inner x:', x)

        inner_function()
        print('Outer x:', x)

I get how the example works, but don’t see the point of the declaration? If I just left out the “nonlocal x” line, wouldn’t the example still work the same?

ekimekim4y ago

faho4y ago

This is a consequence of python not having explicit variable definition. Here it'll decide to define a new x instead of seeing the old one.

And that's also usually what you want because otherwise a function would start altering variables in the enclosing scope if they happen to exist!

E.g.

    foo = 42

    def myfunc(bar):
        foo = bar + 1
        print(foo)
    myfunc(6)
    print(foo) # would print "7" if the "foo =" above took the nonlocal foo automatically!

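So the trade-off is to require "nonlocal" (or "global" for module-level names like foo above) whenever you need to assign to a variable from an enclosing scope. Just reading one works without it.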
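
To answer the original question directly, a small side-by-side sketch (function names are made up): without the declaration, the assignment creates a new local in the inner function and the outer `x` keeps its value.

```python
def without_nonlocal():
    x = 11

    def inner():
        x = 22          # creates a new local x inside inner

    inner()
    return x            # outer x is unchanged


def with_nonlocal():
    x = 11

    def inner():
        nonlocal x
        x = 22          # rebinds the x of the enclosing function

    inner()
    return x


print(without_nonlocal())   # 11
print(with_nonlocal())      # 22
```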

tyilo4y ago· 2 in thread

What is your definition of a primitive data type? All of these have object as a superclass, so I wouldn't call them primitive data types in python.

Maybe there is just 1 primitive type: type? Or none at all?

tusharsadhwaniOP4y ago

In my case I meant it as things that only extend from object. Kinda like how prime numbers only have two factors including themselves.

tyilo4y ago

What about list, dict, set, tuple, range and map then?

collsni4y ago· 2 in thread

You know what helped me with python? Breakpoints inside vscode's module. It all just kinda clicked.

yuy9106164y ago

Python actually has a built-in `breakpoint()` function. I think it brings up a REPL at the line.

I've been using that instead of print debug, it's been great.
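
For reference, by default `breakpoint()` drops into the `pdb` debugger by calling `sys.breakpointhook`, and setting the environment variable `PYTHONBREAKPOINT=0` turns it into a no-op. The sketch below swaps in a hypothetical recording hook so the delegation can be seen without an interactive session.

```python
import sys

calls = []


def recording_hook(*args, **kwargs):
    # Hypothetical stand-in for a debugger: just record that we were called.
    calls.append((args, kwargs))


original_hook = sys.breakpointhook
sys.breakpointhook = recording_hook   # breakpoint() delegates to this hook
try:
    breakpoint()                      # would normally start pdb here
finally:
    sys.breakpointhook = original_hook   # always restore the previous hook

print(len(calls))   # 1
```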

int_19h4y ago

faho4y ago· 1 in thread

The list comparison here is also true when the first list is a prefix of the other:

    class list:
        def __eq__(self, other):
            return all(x == y for x, y in zip(self, other))

            # Can also be written as:
            return all(self[i] == other[i] for i in range(len(self)))

run that with `[1,2,3]` and `[1,2,3,4]` and it'll be true because it only checks up to the 3.

It's probably simplest to compare `len(self) == len(other)` before.

Similarly, the set comparison will also be true if the first is a subset of the other.
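
A sketch of the suggested fix, written as standalone helper functions with made-up names rather than as the article's classes: check the lengths first, then compare element by element.

```python
def lists_equal(a, b):
    """Element-wise equality that also requires equal lengths."""
    return len(a) == len(b) and all(x == y for x, y in zip(a, b))


def sets_equal(a, b):
    """Same size, and every member of a is also in b."""
    return len(a) == len(b) and all(item in b for item in a)


print(lists_equal([1, 2, 3], [1, 2, 3, 4]))   # False
print(lists_equal([1, 2, 3], [1, 2, 3]))      # True
print(sets_equal({1, 2}, {1, 2, 3}))          # False
```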

tusharsadhwaniOP4y ago

Yup, I'll fix this, thanks for pointing it out.

tored4y ago· 1 in thread

Thanks, got a better understanding of the Python "philosophy" because of this article, easy to follow even if you haven’t written a single line of Python like me.

tusharsadhwaniOP4y ago

That's great to hear :D hopefully you'll try out Python sometime.

Pearse4y ago· 1 in thread

This is such a good write up for someone like myself still trying to get their head around Python.

I can appreciate this must have taken considerable effort, It reads really well. Thank you!

tusharsadhwaniOP4y ago

I was concerned if my writing style will click with people, this is good to know :)

escanor4y ago· 1 in thread

> for index, item in enumerate(menu):

should be

> for index, item in enumerate(menu, start=1):

for the example to be correct :)
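
A quick illustration with a made-up `menu` list (the article's actual list is not shown in this thread): `start=1` only changes the counter, not which items are visited.

```python
menu = ["eggs", "spam", "bacon"]   # hypothetical data

for index, item in enumerate(menu, start=1):
    print(f"{index}: {item}")
# 1: eggs
# 2: spam
# 3: bacon
```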

tusharsadhwaniOP4y ago

you're right, thanks!

tyingq4y ago

In a somewhat similar way, this post about hacking the import system to load modules from strings finally helped me understand how imports work:

https://cprohm.de/blog/python-packages-in-a-single-file/

xojoc4y ago

In a similar vein you may like "WTF Python: Exploring and understanding Python through surprising snippets":

https://github.com/satwikkansal/wtfpython

HN thread: https://news.ycombinator.com/item?id=26097732 (163 comments)

PS: found with a site I'm building: https://discussions.xojoc.pw/?q=Understanding+Python+through...

