Null Values in SQL Queries (opens in new tab)

SigmundA6y ago

Empty string isn't enough for this situation since it doesn't work for data types other than string, I actually prefer null vs undefined in javascript / json here, xml also has the concept of null vs undefined.

Emtpy string vs null is not that useful if you have null and undefined.

tabtab6y ago

I've never needed such in 3 decades of systems design. I'd like to hear the details. An explicit flag or time-stamp should be used to indicate when or if a record as been updated. To be frank, heavily reliance on Null strings usually means somebody is doing something wrong or awkward in my opinion. Null strings cause 10 problems for every 1 they solve. I stand by that and will and have defended it for hours in debates. Bring it on! (Granted, most RDBMS don't offer enough tools to easily do it correctly.)

talaketu6y ago

I'm sceptical. Do you have examples?

Take the middle name example: "What was U.S President Theodore Roosevelt’s middle name?" when you know he didn't have a middle name.

Are you suggesting that a blank is the correct choice here?

I don't think it's accurate to say they have a blank middle name. I think it's better to say they don't have a middle name.

https://modern-sql.com/feature/is-distinct-from

juped6y ago

Yeah. That's what null is for (more or less).

johannes12343216y ago

Right, I am a fan of SQL NULl as well. It is nicely consistent - anything NULL in - you get NULL out. Clearly telling that you get undefined data. Silently converting to empty string, zero, or therelike would eventually return garbage for harder to debug reasons.

That Oracle behavior annoys me each time, though.

tabtab6y ago

Re: Silently converting to empty string, zero, or therelike would eventually return garbage for harder to debug reasons.

It's never been a problem with strings in my many decades of experience, unless somebody does something which I consider poor system engineering. Nearby I invited a solid use-case illustrating a real string need.

SigmundA6y ago

No I am tired of this WHERE (a = b) or (a IS NULL AND b IS NULL)

Null should equal null like in every other programming language even SQL group by do null equals null which is even more inconsistent.

jeltz6y ago

It is quite wordy but the SQL solution for this is "WHERE a IS NOT DISTINCT FROM b".

progval6y ago

> Null should equal null like in every other programming language

Think of SQL's NULL like it's a NaN.

WorldMaker6y ago

Something to petition ANSI about.

(Microsoft's SQL Server still defaults to the non-ANSI NULL behavior where a = b when both are NULL, and that's something that still pings on checklists of SQL Server if it follows ANSI standards. SQL Server is kind enough to let you enable/disable the behavior, and likely that would persist even after the default switches to meet the standard as the docs assure will happen "in some future version".)

Tactic6y ago

My preference would be a flag that you can set per connection.

NULL = Undefined or No Data. Whereas a blank field can, in and of itself, be data. It may indicated something is intentionally left blank.

But for those times where you want to consider them the same, it would be nice to have a setting.

(Note that I admit the possibility that this may exist already, like most my great ideas.)

bloomer6y ago

NULL is similar to floating point NaN (aka not a number) which also has the same comparison operation NaN != NaN.

stetrain6y ago

Sounds like a disaster when trying to correlate Ids/keys between tables.

mumblemumble6y ago

Letting null equal null might work okay for a WHERE clause, but it turn JOINing into a terrible muddle. And I don't want null to behave different ways in different clauses.

trts6y ago

where coalesce(a, '') = coalesce(b, '')

https://www.postgresql.org/docs/current/functions-conditiona...

Starwatcher20016y ago

I like nulls too and think they make perfect sense, especially with numeric fields.

Suppose we have an "age" field, but don't actually know the age of the person, null makes perfect sense.

Otherwise we'd have do do something like using a "magic number" like 0, -1, or a separate field altogether to indicate an unknown value.

Granted, they do need some handling in queries.

ilogik6y ago

except that age = 0 is valid if your database has toddlers. Hell, even -1 makes sense if you're running a query to get the age of someone at a specific time. this is why you should try to use NULLs whenever possible

marcus_holmes6y ago

I also am a fan of nulls. But I also make very sure that I have defaults set up on every column for which a null value would make no sense. Which means pretty much every column that isn't an optional foreign key.

I'm writing in Go, so my structs all have empty values unless specifically initialised. This does sometimes mean that I get null uuid's (0000-000000-00000-0000) inserted into tables, which Postgres doesn't understand as null and cheerfully returns as a valid uuid. This has been my only real pain with using nulls.

I've contemplated modding the database driver to interpret nil-value uuid's as null, but that seems a little drastic. Anyone got any better ideas?

gnosek6y ago

Not sure how this integrates with Go's database drivers but from the Postgres side:

1. NULLIF(the_uuid, '0000-000000-00000-0000')

2. on insert/update rules to rewrite the uuid (probably using NULLIF)

https://www.postgresql.org/docs/current/sql-createrule.html

kaslai6y ago

My typical pattern is to use pointer types as a pseudo-optional type. There is the additional cost of dealing with a pointer on the Go side though, which can get cumbersome at times.

`database/sql` does have an interface which lets you define your own marshaling code though. Using it is very simple and would let you marshal an all-zero UUID into NULL easily enough.

uncanny6y ago

You can use a type that marshals the Go zero value to database null and vice versa. https://godoc.org/github.com/jackc/pgtype/zeronull provides a UUID type with that behavior.

dx0346y ago

pgx allows to pass pointers to indicate which values are null. That works with any arbitrary types and prevents from accidentially storing an empty values as null.

chias6y ago

I've recently gone and removed the "nullable" attribute from a bunch of SQL columns that previously had them (and arguably should have them), and the result has actually been rather pleasant.

One interesting problem that arises when you use nulls is that it can be difficult to ensure people actually use them when it's appropriate to do so. Case in point I have a field that is essentially an optional positive integer, so obviously I made it a nullable unsigned int. A few years later there's a pretty even spread of nulls and 0s in there to indicate the same thing -- to a lot of consumers, they behave the same because their business logic basically says "if (foo->field) do stuff" which works either way. In the end I changed it to a non-nullable field using 0 as the null stand-in, which is semantically worse, but ended up making interesting searches over this data set a lot easier.

On the one hand, perhaps the more correct answer would have been to yell at people putting 0s in when they should have been putting nulls in. On the other hand, we put constraints on fields for a reason...

ak396y ago

AFAIK, this Oracle "feature" is only true for columns marked as "not nullable". So if you attempt to insert an empty sting ("") into a not-nullable column, it will fail.

All other relational databases differentiate between empty strings and NULL.

Sean17086y ago

Nah, it's true for all columns:

  SQL*Plus: Release 11.2.0.4.0 Production on Mon Feb 3 15:56:30 2020
  
  Copyright (c) 1982, 2013, Oracle.  All rights reserved.
  
  
  Connected to:
  Oracle Database 11g Express Edition Release 11.2.0.2.0 - 64bit Production
  
  SQL> create table foo(fook varchar2(10) not null);
  
  Table created.
  
  SQL> insert into foo values ('');
  insert into foo values ('')
                          *
  ERROR at line 1:
  ORA-01400: cannot insert NULL into ("SPORTSBOOK_DOCK"."FOO"."FOOK")
  
  
  SQL> create table bar(bark varchar2(10));
  
  Table created.
  
  SQL> insert into bar values ('');
  
  1 row created.
  
  SQL> insert into bar values ('a');
  
  1 row created.
  
  SQL> insert into bar values (null);
  
  1 row created.
  
  SQL> select * from bar;
  
  BARK
  ----------
  
  a
  
  
  SQL> select * from bar where bark = '';
  
  no rows selected

shawnz6y ago

It is also true for nullable columns.

thenewnewguy6y ago

> if you attempt to insert an empty sting ("") into a not-nullable column, it will fail

Yes, because it will convert the empty string into a NULL, and fail to insert that NULL into a non-NULLable column.

beefield6y ago

You are not the only one. I get that there are some exotic and theoretical corner cases where nulls actually are problematic, but for the vast, vast majority of practical cases, nulls and three state logic are very useful. In my humble opinion.

JMTQp8lwXL6y ago

>For example, Oracle database won’t allow you to have an empty string.

It's also true for AWS' DynamoDB offering.

shotashota6y ago

>I might be the only person who likes SQL nulls From my understanding, null values are bad because its a sign that the database design is flawed (See database normalizations) Perhaps you have a more practical experience?

kstrauser6y ago

That's not right. "Null" means that the value is not known. Suppose you have a table for employees, and you want to record the last time they were paid. What do you put in that column for people who just started this morning? The alternatives are to use null, indicating that they haven't been, or to formulate a codebase-wide sentinel value like "0000-01-01" and then accounting for that in every single database operation everywhere.

Further suppose that you have an external function in your codebase to estimate how many paychecks you've paid to someone, but the author doesn't know about any "0000-01-01" conventions your office uses. Without that, you'd see that Joe New Guy has worked here about 2,020 years, so we've probably issued him about 48,000 checks. If only you'd used null, then that function would have calculated "today() - null", which in any sane language would raise a type exception and alert you to the problem.

Nulls are beautiful. They have meaning. Lots of people misuse them, but that doesn't mean they're not valid and useful.

giornogiovanna6y ago

1. NOT NULL is very frequently used in schemas, because NULL is often an undesirable value (e.g. for a mandatory field). That doesn't make NULLs bad, and NULLs are still frequently used when you have optional fields or fields where NULL has some other special meaning.

2. NULLs are used by SQL functions and operators as an "unknown" value. So, for example, "NULL AND TRUE" is NULL, because we could substitute NULL with TRUE or with FALSE to get different results, but "NULL AND FALSE" is FALSE, because no matter what we substitute NULL with, the result will always be FALSE.

3. Clearly all these valid uses of NULL do not indicate "flaws" in the database design.

4. Database normalization isn't always a good thing, and beyond a certain level it's almost always a bad thing, so using normalization methods as a standard for whether something is "flawed" is probablyn ot the best idea.

5. No database normalization method, as far as I know, actually tries to eliminate NULLs, so I don't know what "(See database normalizations)" refers to. Can you clarify?

RHSeeger6y ago

The fact that a database is not fully normalized is not a sign that it's flawed. In fact, there are cases where some tables being fully denormalized makes sense (although that's less common in my experience).

wefarrell6y ago· 12 in thread

Null values and inequality are extremely counterintuitive (in postgres at least). If you run the query:

  SELECT * FROM my_table WHERE my_column != 5

You would expect it to return rows that have a null value for my_column, since null is not 5. However that is not the case.

grzm6y ago

NULL in SQL is often interpreted in many different ways. The most helpful I’ve found is to think of it as unknown. Postgres has the IS DISTINCT FROM operator to capture what you’ve intended above:

    ... WHERE my_column IS DISTINCT FROM 5

wefarrell6y ago

I wasn't aware, thanks for the tip. Is there any equivalent for sets of values? For example:

  SELECT * FROM my_table WHERE my_column NOT IN (5, 6)

magicalhippo6y ago

I think the issue here is that SQL should have more "NULL variants" to express why there is no concrete value.

A NULL value technically means it's unknown. An unknown value might be 5, hence why it's not in the result set. Some abuse NULL to mean "value doesn't exist". But a value that doesn't exist can't be 3, or 42, or any other value that's different from 5, so in that regard shouldn't be part of the result set either.

Others again abuse NULL to mean "doesn't apply". And in that case I think it makes sense to include the row in the result set. For example, if I write a query to get all people who's middle name is not "William", I'd most likely want people without middle names included.

Maybe we should have introduced NEX (non-existing) and NAP (non-applicable) as possible values in addition to NULL?

scottlamb6y ago

> Maybe we should have introduced NEX (non-existing) and NAP (non-applicable) as possible values in addition to NULL?

Codd (the inventor of relational algebra) actually suggested this. I think the primary source is a book that may not be on the web. There's some discussion here (mostly saying why they think it didn't happen and wouldn't work out): https://arxiv.org/html/1606.00740v1/

tabtab6y ago

Re: I think the issue here is that SQL should have more "NULL variants" to express why there is no concrete value.

No, that would muddy things in my opinion, like it did to JavaScript. Instead, have more operations/functions for dealing with them in a more "normal" way, so that we can say "WHERE x <> 5" and get results one expects. I'm not sure the syntax, and my drafts would take a lot of time to explain. To give a taste, maybe have something like "WHERE ~x <> 5" in which the tilde converts x's value to the type's default, such as a blank in the case of strings.

If the different reasons for "emptiness" matter, then usually it suggests the need for a "status" column of some kind so that queries can be done on the reasons. I'd need to study domain specifics to recommend something specific.

https://docs.microsoft.com/en-us/sql/t-sql/queries/is-null-t...

at_a_remove6y ago

Agreed. I have, off and on, labored on a still-incomplete and largely incoherent essay on this topic. NULL is overloaded to the point of some confusion.

goto116y ago

NULL in SQL means "unknown value". This is different from most programming languages where null is a special value which typically indicate "nothing".

If a value is unknown, you don't know if it is different from 5, so it would be incorrect to return in the query.

RmDen6y ago

Same in SQL Server.. this is documented behavior

Also a null is no equal to anything.. not even another null

This will print false in SQL Server

if null = null print 'true' else print 'false'

magicalhippo6y ago

> Also a null is no equal to anything.

Wrong. It is equal to UNKNOWN:

piyh6y ago

where coalesce(null_column,'') = ''

shortcuts "OR is null", works within functions.

goatlover6y ago

I wouldn't expect null row values since you're doing a numeric comparison for my_column, and null isn't a number.

gfody6y ago

the idea is you don’t know if the null != 5 because null isn’t a value it just marks the absence of a value

salzig6y ago· 4 in thread

there is something "missing". The SQL spec specifies `null = null` to be "unknown", where i sometimes expect "true". For MSSQL this can be configured using `SET ANSI_NULLS { ON | OFF }`. AFAIK MySQL can't be configured. Don't know about Postgres.

himinlomax6y ago

The standard makes sense if you go back to the theoretical basis of SQL. It seems somewhat counter-intuitive only when you think of NULL as a value you set in a cell.

When it's the result of a relational operation (such as a LEFT JOIN) however, the default makes sense while considering NULLs as equal to each other is typically not useful.

hobs6y ago

For what its worth, don't do this - pretty much all db code and practitioners expect three valued logic, not two.

salzig6y ago

until you have to work with a database created by an insane guy. Never needed it outside of that one project. (edit: small hint: composite primary key where parts can be null)

quietbritishjim6y ago

For postgres you can just use the separate operator IS NOT DISTINCT FROM to explicitly request this behaviour. In SQLite I think it's just IS. I assume most SQL databases have something similar, and that's a far better solution than applying a global config.

mwexler6y ago· 2 in thread

Null values are so important in representing data. But they cause so much confusion in a) unexpected behaviors in queries and b) inconsistent handling across various engines... I sometimes wish <whisper> that they hadn't been included in the spec at all </whisper>. But then I come to my senses again, and go fix yet another bugged query for an analyst who didn't account for nulls in the data.

Does it make sense to coalesce them away in a view? I thought most analysts are given star schemas implemented by views or ETL'd data anyway.

mwexler6y ago

Depends on the level of sophistication of the analyst, and if nulls have a meaning or value to the result.

Also, at a certain point, knowing that nulls are present gives you yet another measure of dq: not knowing if they are present and hidden vs. visible and countable can be the difference between a wrong answer vs. just an uncertain one.

altitudinous6y ago· 2 in thread

I miss my past Oracle career, I've diagnosed this "= NULL" rather than "IS NULL" in so many broken queries, slow queries because of the way Oracle indexing handles NULL.

There is a lot of discussion in this thread about whether this implementation of null checking in Oracle is appropriate, analysing it, but the current implementation is just fine, it has been tested by time.

The internet does tend to rehash the same arguments over and over!!! The internet forgets. I remember these arguments 20 years ago.

lisper6y ago

[Ignore this comment. It was posted by mistake. I'm only leaving it here for the historical record.]

> the current implementation is just fine, it has been tested by time.

No, it isn't "just fine". It is broken. Just because something has been broken for a very long time and has spawned an entire industry devoted to dealing with the fact that it is broken does not change the fact that it is broken.

altitudinous6y ago

Do you have substantial experience with Oracle? or are you just blindly going on what everyone else says?

There is no mention of outer joins in this thread, no mentions of the ability to minus results of one query from another which are basic constructs which handle many of the issues that are discussed here. It says that the people here are inexperienced with Oracle. Everyone here trying to resolve issues using inner joins. Inexperience.

If people here had experience, not only would these topics have been discussed, but the real issues with NULL would have been discussed, one of which I mention in my previous post.

michannne6y ago· 2 in thread

Another one is MIN and MAX ignore NULL values, which make for some interesting rollback scenarios.

I also swear I have seen a gotcha involving UPDATE WHERE IN and not throwing an error where it should have, which is why I always quadruple check my update statements, but I wasn't able to reproduce it and couldn't find any information online. I haven't seen the issue in so long I forgot what it was, but it would update all rows in your table even if your WHERE clause was proper.

robocat6y ago

Also OR/AND can return non-null results even if NULL is one side of the operator:

    (NULL AND 0) gives 0
    (0 AND NULL) gives 0
    (NULL AND 1) gives NULL
    (1 AND NULL) gives NULL
    (NULL AND NULL) gives NULL
    (NULL OR 0) gives NULL
    (0 OR NULL) gives NULL
    (NULL OR 1) gives 1
    (1 OR NULL) gives 1
    (NULL OR NULL) gives NULL

oarabbus_6y ago

What is a scenario where min or max should consider NULL values?

xivzgrev6y ago· 1 in thread

Not sure what big deal is. You learn somewhere along the way that you check for null values with “is” vs “=“. Done, write it on a sticky note if you need, and move on.

“Why isn’t it consistent??” - well a lot of systems have a lot of bat shit crazy inconsistencies, some times there for good reason. You learn to keep them straight and get your shit done.

If you want to learn the “why” every time you encounter a system design quirk, be my guest but you may be going down a time intensive rabbit hole with little pay off for yourself.

Jaxan6y ago

Sometimes it’s easier to remember the why than the what. Then it does make sense to learn about it!

tabtab6y ago· 1 in thread

A pet peeve of mine is concatenating null strings. It's like a poison pill that nulls the whole result. 99.99% of the time that's NOT what one wants domain-wise. Maybe the standard should make another concatenation operator that treats null strings as zero length strings. Sure, one can de-null each string in the expression, but that's ugly anti-DRY code. Please fix it, I haaate that.

Agreed that it makes the pipe concat operator a lot less useful. Now PostgreSQL and MySQL both have CONCAT_WS which does replace NULLs with empty strings. It's also nice when you do need a common separator between all elements.

sashavingardt26y ago

Now here's a blast from the past! 20 years ago this was common knowledge. Now it's making headline news on HN. SQL is back with the vengeance!

irrational6y ago

We recently moved from Oracle to Postgres. We had thousands of queries written based on the way Oracle handles NULLs and empty strings. It took us the better part of a year to rewrite all of them to the Postgres way. I am so glad to be off of Oracle.

pjdorrell6y ago

Theoretically NULL means "unknown" value. As it happens, most business applications do not have any requirement to deal with "unknown" values. These applications are only interested in acting on requests where all the required data are provided by the person responsible for entering the data. For example, when I transfer money from one bank account to another, the amount of the transfer can't be "unknown", the sending account can't be "unknown", and the receiving account can't be "unknown".

These same applications do have requirements to deal with empty values. Sometimes an empty value means "I haven't yet entered this value in the to the UI". But in that case the UI won't let you submit the form until you have supplied a valid value.

In other cases an empty value is a valid value. For example, "who is your spouse?" and the answer is "I'm not married".

Sometimes NULL represents "irrelevant", like for "who is your spouse?", where some of the records in the table represent people who can have spouses, and some of the records represent other person-like entities that aren't actually people and therefore they can't have spouses.

Given that NULL is _not_ being used to represent "unknown" values, and there is a requirement to represent empty values, and you don't want to have a whole extra column just to represent "emptiness", the most straightforward way to implement empty values is to use NULL. So that is what happens.

And you have to remember to use "is" instead of "=" when you want to test your empty NULL values for equality with other empty NULL values - because your SQL database is pretending that NULL really means "unknown", and it doesn't want to say that one unknown value is equal to another unknown value, because that would be theoretically incorrect.

hanche6y ago

NULLs in subselects do bite me with distressing regularity: Writing

  SELECT ... FROM ... WHERE blah NOT IN (SELECT foo FROM bar);

getting no hits until I slap my forehead and add WHERE foo IS NOT NULL to the subselect.

kords6y ago

DynamoDB, which is NoSql, also doesn't accept empty strings. But at least, Oracle automatically converts the empty string into NULL, comparing with DynamoDB which would actually fail the query.

With some columnar databases NULLs are 'free' because they are a default, absent state or compressed away. Can be another reason to prefer them with very large datasets.

lesserknowndan6y ago

In MySQL, NULL values are useful when using CONCAT_WS (concatenation with separator) or GROUP_CONCAT because NULL values will be ignored - so you don’t get e.g., “one,,two”.

Andromeda886y ago

I was dealing with NULLs whole day on MySQL workbench. It wasn't considering int as NULL value. Needed to make all empty cells 0 to be able to import data properly.

j / k navigate · click thread line to collapse

152 comments

75 comments · 16 top-level

juped6y ago· 35 in thread

>For example, Oracle database won’t allow you to have an empty string. Anytime Oracle database sees an empty string, it automatically converts the empty string into a NULL value.

Damn. This is how you do enterprise.

I might be the only person who likes SQL nulls. If you learn how they work up front, they're useful and not really that confusing. But if I ran into weird behaviors like this, I might hate them too.

danso6y ago

sammorrowdrums6y ago

SQL suffers this exact same problem. I wonder what SQL would look like without Null.

    Select id, Option (key)
    From table
 

    Insert... Some(5), None::Int

Interesting

SigmundA6y ago

Emtpy string vs null is not that useful if you have null and undefined.

tabtab6y ago

talaketu6y ago

I'm sceptical. Do you have examples?

Take the middle name example: "What was U.S President Theodore Roosevelt’s middle name?" when you know he didn't have a middle name.

Are you suggesting that a blank is the correct choice here?

I don't think it's accurate to say they have a blank middle name. I think it's better to say they don't have a middle name.

https://modern-sql.com/feature/is-distinct-from

juped6y ago

Yeah. That's what null is for (more or less).

johannes12343216y ago

That Oracle behavior annoys me each time, though.

tabtab6y ago

Re: Silently converting to empty string, zero, or therelike would eventually return garbage for harder to debug reasons.

SigmundA6y ago

No I am tired of this WHERE (a = b) or (a IS NULL AND b IS NULL)

Null should equal null like in every other programming language even SQL group by do null equals null which is even more inconsistent.

jeltz6y ago

It is quite wordy but the SQL solution for this is "WHERE a IS NOT DISTINCT FROM b".

progval6y ago

> Null should equal null like in every other programming language

Think of SQL's NULL like it's a NaN.

WorldMaker6y ago

Something to petition ANSI about.

Tactic6y ago

My preference would be a flag that you can set per connection.

NULL = Undefined or No Data. Whereas a blank field can, in and of itself, be data. It may indicated something is intentionally left blank.

But for those times where you want to consider them the same, it would be nice to have a setting.

(Note that I admit the possibility that this may exist already, like most my great ideas.)

bloomer6y ago

NULL is similar to floating point NaN (aka not a number) which also has the same comparison operation NaN != NaN.

stetrain6y ago

Sounds like a disaster when trying to correlate Ids/keys between tables.

mumblemumble6y ago

Letting null equal null might work okay for a WHERE clause, but it turn JOINing into a terrible muddle. And I don't want null to behave different ways in different clauses.

trts6y ago

where coalesce(a, '') = coalesce(b, '')

https://www.postgresql.org/docs/current/functions-conditiona...

Starwatcher20016y ago

I like nulls too and think they make perfect sense, especially with numeric fields.

Suppose we have an "age" field, but don't actually know the age of the person, null makes perfect sense.

Otherwise we'd have do do something like using a "magic number" like 0, -1, or a separate field altogether to indicate an unknown value.

Granted, they do need some handling in queries.

ilogik6y ago

marcus_holmes6y ago

I've contemplated modding the database driver to interpret nil-value uuid's as null, but that seems a little drastic. Anyone got any better ideas?

gnosek6y ago

Not sure how this integrates with Go's database drivers but from the Postgres side:

1. NULLIF(the_uuid, '0000-000000-00000-0000')

2. on insert/update rules to rewrite the uuid (probably using NULLIF)

https://www.postgresql.org/docs/current/sql-createrule.html

kaslai6y ago

My typical pattern is to use pointer types as a pseudo-optional type. There is the additional cost of dealing with a pointer on the Go side though, which can get cumbersome at times.

`database/sql` does have an interface which lets you define your own marshaling code though. Using it is very simple and would let you marshal an all-zero UUID into NULL easily enough.

uncanny6y ago

You can use a type that marshals the Go zero value to database null and vice versa. https://godoc.org/github.com/jackc/pgtype/zeronull provides a UUID type with that behavior.

dx0346y ago

pgx allows to pass pointers to indicate which values are null. That works with any arbitrary types and prevents from accidentially storing an empty values as null.

chias6y ago

I've recently gone and removed the "nullable" attribute from a bunch of SQL columns that previously had them (and arguably should have them), and the result has actually been rather pleasant.

ak396y ago

AFAIK, this Oracle "feature" is only true for columns marked as "not nullable". So if you attempt to insert an empty sting ("") into a not-nullable column, it will fail.

All other relational databases differentiate between empty strings and NULL.

Sean17086y ago

Nah, it's true for all columns:

  SQL*Plus: Release 11.2.0.4.0 Production on Mon Feb 3 15:56:30 2020
  
  Copyright (c) 1982, 2013, Oracle.  All rights reserved.
  
  
  Connected to:
  Oracle Database 11g Express Edition Release 11.2.0.2.0 - 64bit Production
  
  SQL> create table foo(fook varchar2(10) not null);
  
  Table created.
  
  SQL> insert into foo values ('');
  insert into foo values ('')
                          *
  ERROR at line 1:
  ORA-01400: cannot insert NULL into ("SPORTSBOOK_DOCK"."FOO"."FOOK")
  
  
  SQL> create table bar(bark varchar2(10));
  
  Table created.
  
  SQL> insert into bar values ('');
  
  1 row created.
  
  SQL> insert into bar values ('a');
  
  1 row created.
  
  SQL> insert into bar values (null);
  
  1 row created.
  
  SQL> select * from bar;
  
  BARK
  ----------
  
  a
  
  
  SQL> select * from bar where bark = '';
  
  no rows selected

shawnz6y ago

It is also true for nullable columns.

thenewnewguy6y ago

> if you attempt to insert an empty sting ("") into a not-nullable column, it will fail

Yes, because it will convert the empty string into a NULL, and fail to insert that NULL into a non-NULLable column.

beefield6y ago

JMTQp8lwXL6y ago

>For example, Oracle database won’t allow you to have an empty string.

It's also true for AWS' DynamoDB offering.

shotashota6y ago

kstrauser6y ago

Nulls are beautiful. They have meaning. Lots of people misuse them, but that doesn't mean they're not valid and useful.

giornogiovanna6y ago

3. Clearly all these valid uses of NULL do not indicate "flaws" in the database design.

5. No database normalization method, as far as I know, actually tries to eliminate NULLs, so I don't know what "(See database normalizations)" refers to. Can you clarify?

RHSeeger6y ago

wefarrell6y ago· 12 in thread

Null values and inequality are extremely counterintuitive (in postgres at least). If you run the query:

  SELECT * FROM my_table WHERE my_column != 5

You would expect it to return rows that have a null value for my_column, since null is not 5. However that is not the case.

grzm6y ago

    ... WHERE my_column IS DISTINCT FROM 5

wefarrell6y ago

I wasn't aware, thanks for the tip. Is there any equivalent for sets of values? For example:

  SELECT * FROM my_table WHERE my_column NOT IN (5, 6)

magicalhippo6y ago

I think the issue here is that SQL should have more "NULL variants" to express why there is no concrete value.

Maybe we should have introduced NEX (non-existing) and NAP (non-applicable) as possible values in addition to NULL?

scottlamb6y ago

> Maybe we should have introduced NEX (non-existing) and NAP (non-applicable) as possible values in addition to NULL?

tabtab6y ago

Re: I think the issue here is that SQL should have more "NULL variants" to express why there is no concrete value.

https://docs.microsoft.com/en-us/sql/t-sql/queries/is-null-t...

at_a_remove6y ago

Agreed. I have, off and on, labored on a still-incomplete and largely incoherent essay on this topic. NULL is overloaded to the point of some confusion.

goto116y ago

NULL in SQL means "unknown value". This is different from most programming languages where null is a special value which typically indicate "nothing".

If a value is unknown, you don't know if it is different from 5, so it would be incorrect to return in the query.

RmDen6y ago

Same in SQL Server.. this is documented behavior

Also a null is no equal to anything.. not even another null

This will print false in SQL Server

if null = null print 'true' else print 'false'

magicalhippo6y ago

> Also a null is no equal to anything.

Wrong. It is equal to UNKNOWN:

piyh6y ago

where coalesce(null_column,'') = ''

shortcuts "OR is null", works within functions.

goatlover6y ago

I wouldn't expect null row values since you're doing a numeric comparison for my_column, and null isn't a number.

gfody6y ago

the idea is you don’t know if the null != 5 because null isn’t a value it just marks the absence of a value

salzig6y ago· 4 in thread

himinlomax6y ago

The standard makes sense if you go back to the theoretical basis of SQL. It seems somewhat counter-intuitive only when you think of NULL as a value you set in a cell.

When it's the result of a relational operation (such as a LEFT JOIN) however, the default makes sense while considering NULLs as equal to each other is typically not useful.

hobs6y ago

For what its worth, don't do this - pretty much all db code and practitioners expect three valued logic, not two.

salzig6y ago

until you have to work with a database created by an insane guy. Never needed it outside of that one project. (edit: small hint: composite primary key where parts can be null)

quietbritishjim6y ago

mwexler6y ago· 2 in thread

Does it make sense to coalesce them away in a view? I thought most analysts are given star schemas implemented by views or ETL'd data anyway.

mwexler6y ago

Depends on the level of sophistication of the analyst, and if nulls have a meaning or value to the result.

altitudinous6y ago· 2 in thread

I miss my past Oracle career, I've diagnosed this "= NULL" rather than "IS NULL" in so many broken queries, slow queries because of the way Oracle indexing handles NULL.

The internet does tend to rehash the same arguments over and over!!! The internet forgets. I remember these arguments 20 years ago.

lisper6y ago

[Ignore this comment. It was posted by mistake. I'm only leaving it here for the historical record.]

> the current implementation is just fine, it has been tested by time.

altitudinous6y ago

Do you have substantial experience with Oracle? or are you just blindly going on what everyone else says?

If people here had experience, not only would these topics have been discussed, but the real issues with NULL would have been discussed, one of which I mention in my previous post.

michannne6y ago· 2 in thread

Another one is MIN and MAX ignore NULL values, which make for some interesting rollback scenarios.

robocat6y ago

Also OR/AND can return non-null results even if NULL is one side of the operator:

    (NULL AND 0) gives 0
    (0 AND NULL) gives 0
    (NULL AND 1) gives NULL
    (1 AND NULL) gives NULL
    (NULL AND NULL) gives NULL
    (NULL OR 0) gives NULL
    (0 OR NULL) gives NULL
    (NULL OR 1) gives 1
    (1 OR NULL) gives 1
    (NULL OR NULL) gives NULL

oarabbus_6y ago

What is a scenario where min or max should consider NULL values?

xivzgrev6y ago· 1 in thread

Not sure what big deal is. You learn somewhere along the way that you check for null values with “is” vs “=“. Done, write it on a sticky note if you need, and move on.

“Why isn’t it consistent??” - well a lot of systems have a lot of bat shit crazy inconsistencies, some times there for good reason. You learn to keep them straight and get your shit done.

If you want to learn the “why” every time you encounter a system design quirk, be my guest but you may be going down a time intensive rabbit hole with little pay off for yourself.

Jaxan6y ago

Sometimes it’s easier to remember the why than the what. Then it does make sense to learn about it!

tabtab6y ago· 1 in thread

sashavingardt26y ago

Now here's a blast from the past! 20 years ago this was common knowledge. Now it's making headline news on HN. SQL is back with the vengeance!

irrational6y ago

pjdorrell6y ago

In other cases an empty value is a valid value. For example, "who is your spouse?" and the answer is "I'm not married".

hanche6y ago

NULLs in subselects do bite me with distressing regularity: Writing

  SELECT ... FROM ... WHERE blah NOT IN (SELECT foo FROM bar);

getting no hits until I slap my forehead and add WHERE foo IS NOT NULL to the subselect.

kords6y ago

DynamoDB, which is NoSql, also doesn't accept empty strings. But at least, Oracle automatically converts the empty string into NULL, comparing with DynamoDB which would actually fail the query.