Lessons learned defying Joel Spolsky with Django (opens in new tab)

(speakerdeck.com)

138 pointsbenregn13y ago149 comments

149 comments

80 comments · 19 top-level

zzzeek13y ago· 21 in thread

Don't use just one ORM and then declare "ORM's are stupid". The "object = None" / "object_id = None" issue illustrated here is certainly not a mistake every ORM makes.

k4st13y ago

The SQL generated appears correct, unless the underlying database can guarantee that all foreign key constraints are met. That is, I consider this a failure of the user of the ORM to appreciate that the two invocations of filter are not identical.

In the former case, the query is verifying that the object_id field cannot be used to find a foreign object--regardless of the value of object_id. This is exactly what it is asked to do.

In the latter case, the query is simply verifying that object_id is NULL, which is exactly what it's asked to do.

zzzeek13y ago

"other_id" and "other" here refer to two different ways of referring to a many-to-one reference between "somemodel" and "othermodel". Asking for rows of "somemodel" where "object_id" is None is the exact same thing as asking for rows from "somemodel" where its reference to "object" is None. Both should produce the same query, the simple one without the JOIN. The query with the JOIN is completely wasteful and not at all correct - it does a LEFT OUTER JOIN to the remote table, only to filter on those rows where the remote table has no match; but this is already obvious from whether or not "somemodel.other_id" is NULL.

2 more replies

mwsherman13y ago

Correct, these are semantically different. You’ve described it well. The ORM would be incorrect if it did not generate the SQL that it did.

Part of the issue is confusing object data with metadata. The id is an implementation detail of the data store – metadata. What the user is trying to do here is “talk database” and “talk model” at the same time.

Which is a perfectly good argument against an ORM. But one should commit to a metaphor, or not: http://clipperhouse.com/2012/02/29/suspension-of-disbelief/

1 more reply

marcosdumay13y ago

Yes, the first querry is correct, and the second one will return incorrect results if you don't keep the consistency of your database in check.

No, the Django ORM is not correct in unsing the correct query by default. At least not on every database backend. That's because on any database that enforeces constraints, it already created the necessary checks, and altough functionaly they are equivalent, that line can mark the difference between a 5ms or a 45 minutes runtime (that's what happens with my data, not a hypotetical case). There is more to a framework than mathematical correctness.

But then, it should certainly have an option to always use the correct query, and it is a lot of implementation work that I can understand quite well that Django developers don't want to do now. They probably have other priorities.

And by the way, as I said, this is a problem to me (but no, not important enough that makes me fix it now, maybe later), but did I stop using an ORM just because of an abstraction leakage? Of course not, I encapsulated a solution to this problem and gone on, earning lots of man-hours at the 99% of my code where Django's ORM doesn't leak. That's my main disagreement with the article. Yes, ORMs are stupid, but not using one just because of that is stupid.

And yes, if you let Django templates go, you can easily cut 9ms of your response time. That's great! I wonder how much you'll cut if you rewrite it all on assembly.

tmarthal13y ago

Obviously there are subtleties to things that may not be apparent, but his example makes sense to me and so does the SQL queries that are generated. The first query is operating at the "object level" and the second is operating at the "attribute level".

The first is retrieving the object and checking if it exists, and the second is just checking the parent's foreign key. Not sure that they are really the same query, if you have unenforced foreign keys.

This is not "dumb", this is analogous to checking if a pointer is null or that the contents of the pointer are null (which is a distinction that some people want to make).

Aqueous13y ago

Why not just write the SQL?

PommeDeTerre13y ago

When ORMs first started becoming popular during the 2000s, especially within the Java world, their proponents drummed up a lot of animosity toward SQL.

While a lot of us who had started working with SQL in the 1980s, if not earlier, were perfectly fine with using it, many younger developers were scared away from it by these claims.

So we've had a generation of software developers who were essentially raised to hate SQL, and to embrace ORMs, even after it became clear that ORMs do come with some pretty serious trade-offs, and do not necessarily increase productivity.

Not having a solid grasp of SQL, a lot of these developers just don't realize what they're missing out on. I've seen this first-hand many times before. These developers will spent hours upon hours trying to get their ORM to perform a moderately complex query that could be easily written by hand within a few minutes, including any code necessary to perform the query and to retrieve the result. The time and effort expended on these sorts of queries will very quickly negate any time and effort the ORM may have saved for simpler queries. And these moderately-complex or complex queries always arise in real-world software.

I think that education is the only way to really solve this problem, but a lot of developers are quite set against this. Learning SQL isn't that much of an investment, but the returns it offers can be huge.

4 more replies

drdaeman13y ago

The only problem I see, is we can't rewrite SQL.

I.e., in pseudocode:

    query = SQL("SELECT * FROM entities WHERE owner = ?", owner=me)
    ...
    if some_condition:
        query = query + SQL.WHERE("OR public = TRUE")
    ...
    if other_condition:
        query = query + SQL("LEFT JOIN things AS t"
                            " ON t.entity_id = entities.id") \
                      + SQL.WHERE("things.value > 0")
    ...
    my_nice_list_of_results = run(query + SQL("LIMIT ?", count))

This should be technically possible, but I haven't seen any library that does it.

7 more replies

jl613y ago

Just my opinion of course, but it seems that designers who think primarily in terms of the application will prefer to use an ORM to abstract away details of the storage layer; whereas a designer who thinks primarily in terms of data will prefer to write SQL, with the application being just one of many possible applications of that data.

simonw13y ago

Because (at least for the kind of queries common to most web applications) you can develop faster using an ORM. Less boilerplate, less repeated code, less time thinking about how to convert between SQL and application-level representations of your data - and most importantly, no time at all spent thinking about how to dynamically generate SQL queries using string concatenation (a sure-fire way to introduce bugs and security vulnerabilities in to your code).

lmm13y ago

Poor tool integration, nonexistent library ecosystem, and the advantages of a single-language codebase.

antihero13y ago

Because you end up writing the same basic SQL over and over and over again?

InclinedPlane13y ago

Do you have an example of an ORM which is not in some fundamental way "stupid"? I haven't found one yet, but I'd love to know one existed somewhere.

craigkerstiens13y ago

You should take a look at the ORMs that are considered a bit more at the top of their class. In the Python world SQL Alchemy in the Ruby world Sequel. While you can't judge all ORMs by looking at a few these do offer a better impression of what an ORM can do for you.

zzzeek13y ago

Until we've attained AI, all software can be said to be "stupid". Google is stupid. Relational databases are stupid, SQL is stupid. None of it works without human intelligence actively directing it to do work for us. Why single out ORMs?

4 more replies

mythz13y ago

There are several Micro ORMs for .NET: http://www.servicestack.net/benchmarks/#dapper-benchmarks

That don't try to handle hidden-magic-state and lets you easily access via Raw SQL if you need to do complex queries. Many don't try to abstract anything and are simply extension methods over the underlying IDbConnection (so you never lose any flexibility), i.e. they simply exist to remove the tedium boilerplate of mapping RDBMS results back into POCOs.

1 more reply

lucian190013y ago

SQLAlchemy has several explicit layers of abstraction, the first of which is an AST for all of SQL. The ORM is entirely optional.

avenger12313y ago

Definitely agree.

Using an ORM should not exclude also using direct SQL. It should be both.

I believe this brings the best combination. Anytime there is major complexity just drop to normal SQL.

The best of both.

Daishiman13y ago

I think it's more of a case of the developer not having read the ORM documentation; this is a very newbie mistake (although very understandable, true).

Now obviously, some people would complain that it doesn't make sense to do the extra join, but then people would be complaining about magical or exceptional behavior. ORM behavior is very predictable about which fields are being queried

JshWright13y ago

I haven't checked this one either way, but I'm curious what SQL would be if they used the 'right' ORM incantation.

filter(object__isnull=True)

https://docs.djangoproject.com/en/dev/ref/models/querysets/#...

macspoofing13y ago

True, but every ORM will make that kind of mistake. You can't just build an object model without paying attention to the SQL layer.

nnq13y ago· 13 in thread

Bumping into ORM limitations + moving to Jinja for templates --> one word: Flask

...really, what advantage does Django provide at this point in this project anymore?

SoftwareMaven13y ago

Completely replacing the template system with Jinja is silly (no idea if this is what they did) since it significantly reduces the value of the Django ecosystem. Far better it to use jinja for your work, but leave the Django templates for Django and all the other apps you integrate to use. I do with Django would reduce their stance on no expressions in the templates. Sometimes it saves a lot of work to just be able to call a function or add two numbers together without needing to build a filter or tag every time.

I've been happy with django-jinja[1] for that purpose. It replaces the context processors so it will load jinja templates if they have a .jinja extension and Django if they are .html. It also includes the django filters in jinja-land.

The ORM is more problematic. I just started a project a couple months ago and thought a lot about ditching the default ORM in favor of SqlAlchemy. I decided not to, for the reason of expedience and, as the TFA mentions, it already leaks. So, I stuck with the Django ORM and will drop to SQL directly if need be (need being defined by the ORM making the code confusing or there being performance hotspots).

1. https://github.com/niwibe/django-jinja

megaman82113y ago

There are still a lot of Django apps that work fine in this scenario. Plus I don't think the slides imply that they are just blindly replacing ORM calls with raw SQL, just that they have profiled and replaced the hot spots.

Also Flask is a micro-framework and Django is a full-stack framework. Flask can be used as the backbone of a full-stack framework but figuring out a good project structure and finding out what third-party apps to use can be daunting for a person without Flask experience. If you really want to get people to switch, package up a Flask-based framework with the features of Django.

pyriku13y ago

Middlewares, context processors, forms, (class based) views, and tons of third party applications.

The CTO of a startup where some friends are working thought the same you did, and 2 years ago rewrote everything to Flask. Now they're going back to Django.

Django is much more that an ORM and templates.

jokull13y ago

Django middlewares is a poorly designed system. Flask has all of the things you mentioned except for forms. For that you can use Flask-WTF, but these libraries are becoming outdated anyway because they’re HTML based and don’t work so well for JSON validation.

4 more replies

Alex391713y ago

"what advantage does Django provide at this point in this project anymore?"

Documentation. I use Django but without any ORM and with Jinja2, so it's basically just Flask but with more stack overflow threads and more third-party software.

Is there any actual advantage that I would be getting by using Flask instead?

jokull13y ago

Flask’s documentation is great, and the codebase is readable and small. The third party community is not as big as Django’s, but the 3rd party work is verified and approved by Armin (Flask and Jinja2 author). These guidelines encourage authors to publish higher quality code and better documentation. You couldn’t verify my claims, but this is my experience coming from Django a couple of years back. Django’s extensions are often very poor quality.

I’d also like to point out that Flask-Admin has come a long way with adapters for SQLAlchemy and at least one other ORM library. Very usable and extendible.

hcarvalhoalves13y ago

The advantages aren't really in using Flask, but being able to use a better ORM (or no ORM at all, just SQL abstraction) and template engine.

After working with Django ORM for a large project, I started to rethink the usefulness of models. It turns out just treating data as sets lends to a functional style and is simpler down the road.

1 more reply

SPSteinbeck13y ago

I also use Django without any ORM and with Jinja2 and I'm taking baby steps towards completely abandoning Django and moving to Flask. For me, the primary benefit is that Flask being so much smaller and simpler than Django means you can understand the entire codebase without an extraordinary amount of effort.

I'm curious as to why you stick with Django (other than having projects already begun relying on it)? Without the ORM and with the templates, there's is not much I get out of Django.

1 more reply

andybak13y ago

In addition to the other replies I have to add - the goddamn Admin. The amount of time it saves me is beyond belief.

SoftwareMaven13y ago

If you aren't using the ORM, what does the admin bring to the table? I agree with the value of the admin, and is one of the reasons in my latest project I kept the Django ORM, even though some coworkers felt it was limiting (it is, but it isn't all about a single component of the system).

naithemilkman13y ago

In addition to what everyone else has mentioned, the amount of libraries written for Django is staggering. You almost never need to roll your own app -- someone has already done it!

Also, geodjango.

antihero13y ago

It has an entire ecosystem and tool chain to sort building websites/webapps. When I used to use Flask, before moving to Django, I found myself essentially creating a whole bunch of stuff that Django already has, and is better written.

bmelton13y ago

To throw a completely different wrench into the mix, I mostly just use Django to provide an API anymore, and for session handling.

I love and use Django, and have built large projects where I haven't really run into any limitations with it[1], but for the most part, nowadays my workflow is to pip install django, south, tastypie, then load in a template with Backbone and Marionette, then get to town.

Templates are either Mustache or Underscore, depending.

[1] - Yeah, it could be faster, but if you have a large, confusing database schema that you inherited, the Django ORM is great for getting things stood up, and then just tune the queries after the fact. It's still a huge timesaver vs. writing every query by hand.

jroseattle13y ago· 7 in thread

I was expecting this slidedeck to be a bit more focused on defying Spolsky, and whether or not that was a good decision.

FWIW, I've never bought into Spolsky's vision that re-writing code is poor strategy. Steve Jobs never thought twice about ripping something apart and starting over. If anything, code re-write can be an advantageous position -- you often have a greater understanding of the problems you're intending to solve. When well-executed, it can take the form of heavy refactoring, even when switching languages/platforms.

LockeWatts13y ago

I'm not really sure why Steve Jobs is your model for code design.

widdershins13y ago

Because he oversaw the creation of 3 of the most successful operating systems of all time? I know he didn't design them, but he was involved in managing the projects. Just playing devil's advocate.

1 more reply

jroseattle13y ago

Apple has/had a history of re-writing code over the years. The first iPods and their iterations were code re-writes, IIRC. Many of the onboard applications were completely re-written for the iPad.

1 more reply

grey-area13y ago

Steve Jobs never thought twice about ripping something apart and starting over. If anything, code re-write can be an advantageous position -- you often have a greater understanding of the problems you're intending to solve.

It's interesting that you take Jobs (and by extension Apple) as an example here, as many new projects from them which might appear to be complete rewrites from the outside are in fact heavily derivative or dependent on other projects. iOS for example is an incremental revision of OS X, removing some of the UI layer and replacing it, but leaving almost all of the underlying OS intact, using the same dev language any many of the same APIs - it is by no means a clean-room rewrite. OS X itself was heavily based on NextStep, which of course was based on Mach/BSD, so none of these 'new' platforms started from a clean slate like BeOS for example, and the same tends to happen with APIs, though sometimes these have been rewritten (Quicktime comes to mind, and arguably UIKit is a significant rewrite of AppKit, though the two still exist in parallel just now).

Sometimes rewriting is the best solution, but it does tend to take a lot longer than expected, doesn't always leave you with a satisfactory replacement, and ends in failure more often than it ends in success, particularly on very large projects or ones with fuzzy scope. I think Spolsky was talking about projects on the level of Netscape and Excel, where a rewrite would be a significant challenge very likely to fail or be delayed so long that it falls short of its initial goals. The smaller the project, the more viable a rewrite becomes, and sometimes it is the best option if the existing product is not delivering and is difficult to extend/support.

macspoofing13y ago

>I've never bought into Spolsky's vision that re-writing code is poor strategy.

I think it's something that's true in general but false in some specific cases. Rewriting involves spending enormous time and resources to at best standstill, and at worst move backwards ( chances are your re-written product will be poorer in features, and initially buggier than your old, stable, battle-tested version). For smaller companies, it is a death knell.

jroseattle13y ago

Yes, very true. Although I'd like to add some context.

> Rewriting involves spending enormous time and resources to at best standstill, and at worst move backwards ( chances are your re-written product will be poorer in features, and initially buggier than your old, stable, battle-tested version).

There is plenty of evidence that this has happened in many places and with many companies. A very real scenario that's played out before.

I'd say those scenarios were not well-executed. If the outcome of a re-write is "standstill", the re-write is pointless. There is no justification for proceeding with it.

However, if "standstill" equates only to user-facing features, chances are the re-write is to address critical issues elsewhere (I get the impression that was the situation with the OP.) In that case, "standstill" doesn't apply. It's simply a matter of deciding whether or not the effort and risk justifies the reward.

To my main point, Spolsky's hard-line essentially says re-building your application from scratch is bad strategy. I think it is short-sighted to draw that line. I prefer to exercise judgment and draw on the resources at my disposal for the given situation. I presume many others do as well.

2 more replies

yuhong13y ago

My personal favorite is the MS OS/2 2.0 fiasco, where the project was abandoned by MS after the first SDK was already sent to developers: http://yuhongbao.blogspot.ca/2012/12/about-ms-os2-20-fiasco-...

Kiro13y ago· 6 in thread

I didn't understand why ORMs are stupid. Can someone enlighten me?

cwbrandsma13y ago

The real issue with ORMs isn't the ORM itself, but over reliance on the ORM to do everything the right way and not validating the ORM is doing things the right way. ORMs are also often heavily leaned on by people who don't understand SQL and relational databases well enough and just want a data dumping ground (would have been better off with a document database).

But, if properly validated, and knowing when to NOT use the ORM, a good ORM can help you get a lot of work done very efficiently. But I've also seen improperly used ORMs turn into MASSIVE time sinks where devs spends days just configuring the stupid thing (hello Hibernate/nhibernate).

kybernetyk13y ago

I guess his point is that they sometimes create huge ugly and inefficient SQL queries. (Reminds me of the 90s sentiment of C compilers creating ugly assembly.)

mattchew13y ago

If you know SQL, they're often frustrating. You know what you want to do, but there's some ridiculous arcane approach to get those results via the ORM. If you can do it at all.

ORMs make it easy to do things like run queries inside a loop without realizing it. I worked on a site where the front page ran something like 200 queries every time it was accessed thanks to ORM magic.

PeterisP13y ago

All abstractions tend to be leaky.

ORM can abstract DB/SQL for you, but if you really ignore DB/SQL, then it can happily make some queries an order (or two ) of magnitude slower than they should be.

So, you must always think in explicit SQL-query-terms anyway; and then it's just a balance for ease of coding - does the ease of ORM syntactic sugar outweigh the effort for you to double-check if any ORM-built queries don't accidentally do something stupidly slow.

macspoofing13y ago

ORMs pretend to give you full abstraction, but in reality you have to be aware of the underlying SQL layer when you build your object model.

antihero13y ago

I find that if you are aware of the SQL underneath, they are good for saving time.

Do you really want to write SQL to retrieve data and code to populate an object for every damn thing in your system?

1 more reply

speg13y ago· 3 in thread

Does anyone know what tool was used to profile the django app? Looks cool.

naithemilkman13y ago

If you're talking about the slide that says '87.91% of time spent rendering' then yes, I would like to know too.

Annoying that it doesnt have slide numbers to refer to.

cdavid13y ago

it looks like kcachegrind or pycallgraph (https://github.com/gak/pycallgraph).

bdarnell13y ago

Looks like gprof2dot: https://code.google.com/p/jrfonseca/wiki/Gprof2Dot

bsaul13y ago· 3 in thread

Did someone understand the last slides about the differences between transactions in tasks with and without celery ? I'm using celery, and i have been using django in the past, and I really didn't get the point.

mbell13y ago

He's trying to avoid the situation where you enqueue a background job inside a transaction and the worker gets started on that job before the transaction is committed. If the job needs to hit the database for any reason your likely to get errors as the new data hasn't been committed yet.

It looks like the work around he used is to cache the job queue locally and only flush it to the real job queue after the database commit, so your guaranteed whatever data may be needed for the job has been committed to the database.

ashchristopher13y ago

Basically if you modify data in a transaction, add a task to the queue that relies on that data, and the worker pulls the task off the queue BEFORE the transaction is committed, then the task tries to access data that doesn't exist in the database yet.

spitfire13y ago

He's trying to fix/avoid a race condition caused by data created within a transaction not being committed yet.

emperorcezar13y ago· 2 in thread

While I love seeing slides, this deck obviously could use the audio or transcript along with it.

rattray13y ago

Agreed, I for one was felt feeling rather in the dark for the last few slides especially...

iconfinder13y ago

We are posting a version with audio on our blog on Monday: http://blog.iconfinder.com

1 more reply

Uchikoma13y ago· 2 in thread

Iconfinder. They are profitable? They have business requirements? Joel wrote about the problems of a rewrite when you need to earn money, implement money making features, heavy competition while at the same time maintaining two plattform where one is a moving target. I don't think Iconfinder fits in any of these constraints.

nickbruun13y ago

Incorrect. We fit in all of those constraints at the point where a rewrite was decided ;)

Uchikoma13y ago

Thanks for your reply, from looking at Iconfinder it did not look like a large piece of software with lots of business requirements.

lstamour13y ago· 1 in thread

I found I could follow along with the slides, though some of the icons and messages around third-party tools were lost on me. I'd definitely appreciate a video.

Edit: 9 minutes ago, Nick posted to Twitter: "we have the recording - just need the sound cleaned up. Expect it early next week ;)"

lstamour13y ago

Video now live: http://vimeo.com/65057265 [Slides in sync with audio]

gingerlime13y ago· 1 in thread

> 20ms with Jinja2 without auto-escaping

could this performance improvement back-fire if you end up with a security issue?

I'm not saying that it definitely would. If you know what you're doing / trust your data sources or sanitize them elsewhere, you should be fine. I'd be careful turning off such a feature completely though...

Daishiman13y ago

Of course. This is merely a tradeoff between performance and developer time. 99% of projects will never have HTML autoescaping as a performance pain point. Then again, you're going to need tens of hours to review all templates to make sure you're escaping everything. If your hardware budget is greated than what it costs to audit the code, it's the proper decision.

tachion13y ago· 1 in thread

I wonder, is there a video of the talk available? The slides, unfortunately, alone are rarely very informative.

iconfinder13y ago

We will post video version soon (Monday) on http://blog.iconfinder.com

1 more reply

level0913y ago· 1 in thread

I found these tools mentioned by the author really helpful, I compiled a list of them:

http://jinja.pocoo.org/ https://www.getsentry.com/welcome/ http://graphite.wikidot.com/start https://opbeat.com/

is there a way though to use/test opbeat ?

iconfinder13y ago

Just send them a message via twitter or email. I'm sure they will help you get started.

hasenj13y ago

You should give SQLAlchemy a try:

http://lucumr.pocoo.org/2011/7/19/sqlachemy-and-you/

It gives you more control and requires you to be more explicit about your queries and relationships.

claudiusd13y ago

I don't think Spolsky's argument applies to startups... His argument basically boils down to "You think the code in front of you is a mess, but the reality is you're just having trouble reading somebody else's code which is probably good enough". But what if you wrote the code yourself? In that case, it's probably just a mess.

scott_meade13y ago

ORMs may be "stupid", yet Basecamp manages to handle 400-500 request per second with Rails and MySQL just fine. https://twitter.com/dhh/status/287221705443774465

iconfinder13y ago

We have posted a video version of the talk (slides + audio): http://blog.iconfinder.com/staying-sane-while-defying-joel-s...

dmishe13y ago

Interesting point on celery and transaction, though I've never seen that happen in real life, interesting.

Uchikoma13y ago

+100 for using "leaking abstraction" in a slide deck "defying Joel Spolsky"

beambot13y ago

Is there a (publicly available) video that accompanies this slide deck?

1 more reply

j / k navigate · click thread line to collapse

149 comments

80 comments · 19 top-level

zzzeek13y ago· 21 in thread

Don't use just one ORM and then declare "ORM's are stupid". The "object = None" / "object_id = None" issue illustrated here is certainly not a mistake every ORM makes.

k4st13y ago

In the former case, the query is verifying that the object_id field cannot be used to find a foreign object--regardless of the value of object_id. This is exactly what it is asked to do.

In the latter case, the query is simply verifying that object_id is NULL, which is exactly what it's asked to do.

zzzeek13y ago

2 more replies

mwsherman13y ago

Correct, these are semantically different. You’ve described it well. The ORM would be incorrect if it did not generate the SQL that it did.

Which is a perfectly good argument against an ORM. But one should commit to a metaphor, or not: http://clipperhouse.com/2012/02/29/suspension-of-disbelief/

1 more reply

marcosdumay13y ago

Yes, the first querry is correct, and the second one will return incorrect results if you don't keep the consistency of your database in check.

And yes, if you let Django templates go, you can easily cut 9ms of your response time. That's great! I wonder how much you'll cut if you rewrite it all on assembly.

tmarthal13y ago

This is not "dumb", this is analogous to checking if a pointer is null or that the contents of the pointer are null (which is a distinction that some people want to make).

Aqueous13y ago

Why not just write the SQL?

PommeDeTerre13y ago

When ORMs first started becoming popular during the 2000s, especially within the Java world, their proponents drummed up a lot of animosity toward SQL.

While a lot of us who had started working with SQL in the 1980s, if not earlier, were perfectly fine with using it, many younger developers were scared away from it by these claims.

4 more replies

drdaeman13y ago

The only problem I see, is we can't rewrite SQL.

I.e., in pseudocode:

    query = SQL("SELECT * FROM entities WHERE owner = ?", owner=me)
    ...
    if some_condition:
        query = query + SQL.WHERE("OR public = TRUE")
    ...
    if other_condition:
        query = query + SQL("LEFT JOIN things AS t"
                            " ON t.entity_id = entities.id") \
                      + SQL.WHERE("things.value > 0")
    ...
    my_nice_list_of_results = run(query + SQL("LIMIT ?", count))

This should be technically possible, but I haven't seen any library that does it.

7 more replies

jl613y ago

simonw13y ago

lmm13y ago

Poor tool integration, nonexistent library ecosystem, and the advantages of a single-language codebase.

antihero13y ago

Because you end up writing the same basic SQL over and over and over again?

InclinedPlane13y ago

Do you have an example of an ORM which is not in some fundamental way "stupid"? I haven't found one yet, but I'd love to know one existed somewhere.

craigkerstiens13y ago

zzzeek13y ago

4 more replies

mythz13y ago

There are several Micro ORMs for .NET: http://www.servicestack.net/benchmarks/#dapper-benchmarks

1 more reply

lucian190013y ago

SQLAlchemy has several explicit layers of abstraction, the first of which is an AST for all of SQL. The ORM is entirely optional.

avenger12313y ago

Definitely agree.

Using an ORM should not exclude also using direct SQL. It should be both.

I believe this brings the best combination. Anytime there is major complexity just drop to normal SQL.

The best of both.

Daishiman13y ago

I think it's more of a case of the developer not having read the ORM documentation; this is a very newbie mistake (although very understandable, true).

JshWright13y ago

I haven't checked this one either way, but I'm curious what SQL would be if they used the 'right' ORM incantation.

filter(object__isnull=True)

https://docs.djangoproject.com/en/dev/ref/models/querysets/#...

macspoofing13y ago

True, but every ORM will make that kind of mistake. You can't just build an object model without paying attention to the SQL layer.

nnq13y ago· 13 in thread

Bumping into ORM limitations + moving to Jinja for templates --> one word: Flask

...really, what advantage does Django provide at this point in this project anymore?

SoftwareMaven13y ago

1. https://github.com/niwibe/django-jinja

megaman82113y ago

pyriku13y ago

Middlewares, context processors, forms, (class based) views, and tons of third party applications.

The CTO of a startup where some friends are working thought the same you did, and 2 years ago rewrote everything to Flask. Now they're going back to Django.

Django is much more that an ORM and templates.

jokull13y ago

4 more replies

Alex391713y ago

"what advantage does Django provide at this point in this project anymore?"

Documentation. I use Django but without any ORM and with Jinja2, so it's basically just Flask but with more stack overflow threads and more third-party software.

Is there any actual advantage that I would be getting by using Flask instead?

jokull13y ago

I’d also like to point out that Flask-Admin has come a long way with adapters for SQLAlchemy and at least one other ORM library. Very usable and extendible.

hcarvalhoalves13y ago

The advantages aren't really in using Flask, but being able to use a better ORM (or no ORM at all, just SQL abstraction) and template engine.

After working with Django ORM for a large project, I started to rethink the usefulness of models. It turns out just treating data as sets lends to a functional style and is simpler down the road.

1 more reply

SPSteinbeck13y ago

I'm curious as to why you stick with Django (other than having projects already begun relying on it)? Without the ORM and with the templates, there's is not much I get out of Django.

1 more reply

andybak13y ago

In addition to the other replies I have to add - the goddamn Admin. The amount of time it saves me is beyond belief.

SoftwareMaven13y ago

naithemilkman13y ago

In addition to what everyone else has mentioned, the amount of libraries written for Django is staggering. You almost never need to roll your own app -- someone has already done it!

Also, geodjango.

antihero13y ago

bmelton13y ago

To throw a completely different wrench into the mix, I mostly just use Django to provide an API anymore, and for session handling.

Templates are either Mustache or Underscore, depending.

jroseattle13y ago· 7 in thread

I was expecting this slidedeck to be a bit more focused on defying Spolsky, and whether or not that was a good decision.

LockeWatts13y ago

I'm not really sure why Steve Jobs is your model for code design.

widdershins13y ago

Because he oversaw the creation of 3 of the most successful operating systems of all time? I know he didn't design them, but he was involved in managing the projects. Just playing devil's advocate.

1 more reply

jroseattle13y ago

Apple has/had a history of re-writing code over the years. The first iPods and their iterations were code re-writes, IIRC. Many of the onboard applications were completely re-written for the iPad.

1 more reply

grey-area13y ago

macspoofing13y ago

>I've never bought into Spolsky's vision that re-writing code is poor strategy.

jroseattle13y ago

Yes, very true. Although I'd like to add some context.

There is plenty of evidence that this has happened in many places and with many companies. A very real scenario that's played out before.

I'd say those scenarios were not well-executed. If the outcome of a re-write is "standstill", the re-write is pointless. There is no justification for proceeding with it.

2 more replies

yuhong13y ago

Kiro13y ago· 6 in thread

I didn't understand why ORMs are stupid. Can someone enlighten me?

cwbrandsma13y ago

kybernetyk13y ago

I guess his point is that they sometimes create huge ugly and inefficient SQL queries. (Reminds me of the 90s sentiment of C compilers creating ugly assembly.)

mattchew13y ago

If you know SQL, they're often frustrating. You know what you want to do, but there's some ridiculous arcane approach to get those results via the ORM. If you can do it at all.

PeterisP13y ago

All abstractions tend to be leaky.

ORM can abstract DB/SQL for you, but if you really ignore DB/SQL, then it can happily make some queries an order (or two ) of magnitude slower than they should be.

macspoofing13y ago

ORMs pretend to give you full abstraction, but in reality you have to be aware of the underlying SQL layer when you build your object model.

antihero13y ago

I find that if you are aware of the SQL underneath, they are good for saving time.

Do you really want to write SQL to retrieve data and code to populate an object for every damn thing in your system?

1 more reply

speg13y ago· 3 in thread

Does anyone know what tool was used to profile the django app? Looks cool.

naithemilkman13y ago

If you're talking about the slide that says '87.91% of time spent rendering' then yes, I would like to know too.

Annoying that it doesnt have slide numbers to refer to.

cdavid13y ago

it looks like kcachegrind or pycallgraph (https://github.com/gak/pycallgraph).

bdarnell13y ago

Looks like gprof2dot: https://code.google.com/p/jrfonseca/wiki/Gprof2Dot

bsaul13y ago· 3 in thread

mbell13y ago

ashchristopher13y ago

spitfire13y ago

He's trying to fix/avoid a race condition caused by data created within a transaction not being committed yet.

emperorcezar13y ago· 2 in thread

While I love seeing slides, this deck obviously could use the audio or transcript along with it.

rattray13y ago

Agreed, I for one was felt feeling rather in the dark for the last few slides especially...

iconfinder13y ago

We are posting a version with audio on our blog on Monday: http://blog.iconfinder.com

1 more reply

Uchikoma13y ago· 2 in thread

nickbruun13y ago

Incorrect. We fit in all of those constraints at the point where a rewrite was decided ;)

Uchikoma13y ago

Thanks for your reply, from looking at Iconfinder it did not look like a large piece of software with lots of business requirements.

lstamour13y ago· 1 in thread

I found I could follow along with the slides, though some of the icons and messages around third-party tools were lost on me. I'd definitely appreciate a video.

Edit: 9 minutes ago, Nick posted to Twitter: "we have the recording - just need the sound cleaned up. Expect it early next week ;)"

lstamour13y ago

Video now live: http://vimeo.com/65057265 [Slides in sync with audio]

gingerlime13y ago· 1 in thread

> 20ms with Jinja2 without auto-escaping

could this performance improvement back-fire if you end up with a security issue?

Daishiman13y ago

tachion13y ago· 1 in thread

I wonder, is there a video of the talk available? The slides, unfortunately, alone are rarely very informative.

iconfinder13y ago

We will post video version soon (Monday) on http://blog.iconfinder.com

1 more reply

level0913y ago· 1 in thread

I found these tools mentioned by the author really helpful, I compiled a list of them:

http://jinja.pocoo.org/ https://www.getsentry.com/welcome/ http://graphite.wikidot.com/start https://opbeat.com/

is there a way though to use/test opbeat ?

iconfinder13y ago

Just send them a message via twitter or email. I'm sure they will help you get started.

hasenj13y ago

You should give SQLAlchemy a try:

http://lucumr.pocoo.org/2011/7/19/sqlachemy-and-you/

It gives you more control and requires you to be more explicit about your queries and relationships.

claudiusd13y ago

scott_meade13y ago

ORMs may be "stupid", yet Basecamp manages to handle 400-500 request per second with Rails and MySQL just fine. https://twitter.com/dhh/status/287221705443774465

iconfinder13y ago

We have posted a video version of the talk (slides + audio): http://blog.iconfinder.com/staying-sane-while-defying-joel-s...

dmishe13y ago

Interesting point on celery and transaction, though I've never seen that happen in real life, interesting.

Uchikoma13y ago

+100 for using "leaking abstraction" in a slide deck "defying Joel Spolsky"

beambot13y ago

Is there a (publicly available) video that accompanies this slide deck?

1 more reply

j / k navigate · click thread line to collapse