Edit: Just to clarify, what you get is ptrdiff_t instead of size_t. So if array size is greater than PTRDIFF_MAX, you get undefined behavior [1].
http://trust-in-soft.com/objects-larger-than-ptrdiff_max-byt...
Software is unreliable enough as it is due to problems beneath our notice. It seems reckless to avoid fixing problems that we do notice. Sure, you could argue that rare problems are rare and that users probably won't notice them --- this attitude is penny-wise and pound-foolish, because you can't meaningfully reason about a system that's only probably correct.
I think the article is well-presented and educational.
I don't think so. Anyone with a solid understanding of C understands pointer arithmetic. I think the article isn't obvious only to those who have a weak understanding of the language.
rather than calling out that pointer arithmetic implicitly relies on 'sizeof' in order to be useful, it's treated like some kind of magic. i.e. i don't think it points out the not-subtle but rather obvious connection, and instead distracts from it...
>rather than calling out that pointer arithmetic implicitly relies on 'sizeof'
Article:
>arr has the type int *, whereas &arr has the type int (*)[size].
For me this is calling out the implicit use of sizeof by pointing out the type.
There's a reason this meme exists: http://i.imgur.com/Z6pFTjj.jpg
Instead of writing either of these:
size_t length = sizeof array / sizeof array[0];
size_t length = (&array)[1] - array;
Define this macro instead: #define countof( array ) ( sizeof(array) / sizeof((array)[0]) )
Or if you must: #define countof( array ) ( (&(array))[1] - (array) )
And then you can just say: size_t length = countof(array);
Edit: I used to call this macro 'elementsof', but it seems that 'countof' is a more common name for it and is a bit more clear too - so I'm going to run with that name in the future.
C++11 also comes with a cleaner way to do _countof using a template.
You can also use the template technique to pass a fixed size array to a function, and have the function determine the array size (without needing a 2nd length param, or null terminator element). Similar to strcpy_s(): http://stackoverflow.com/questions/23307268/how-does-strcpy-...
MSVC has a built in _countof: http://stackoverflow.com/questions/4415530/equivalents-to-ms...
While we're talking macros, anyone who reads the g-truc.net article should feel itchy after seeing the countof macro in their example:
#define countof(arr) sizeof(arr) / sizeof(arr[0])
Two problems here:
1. The last use of 'arr' doesn't have 'arr' wrapped in parentheses.
2. The entire expression is not wrapped in parentheses either.
If you write a macro that does any calculation like this, play it safe and put parens around every macro argument and parens around the entire expression too. Otherwise you never know what operator precedence will do to you.
Why?
When reading such code, it means I would have to go and lookup a macro definition. So, there's a clear drawback. What's the benefit that makes it worthwhile?
I mean, you don't go look up the definitions of every function that gets called, every time they are called, right?
But my point with suggesting the macro applies equally to the more traditional sizeof division. I have seen code that divides the two sizeofs every time an array length is needed. I think it's better to put that calculation in a macro so you only do it in one place.
int (*p)[10];
This all is much more interesting in C++, because there, in conjunction with references, this lets you write functions that take arrays as arguments and know their length. Like so:

    template<size_t N>
    void foo(const int (&a)[N]) {
        for (size_t i = 0; i < N; ++i)
            cout << a[i];
    }

    int a[10];
    foo(a);

Same constraints apply (pointer arith).
I am not sure why this method, applied to ordinary arrays, would be preferred to sizeof (), but since we're shedding light here...
EDIT: pointer arith constraints only apply if we compute the difference (end - beg) in the C code. We could also do that in the linker script itself, and I don't recall whether or not C semantics of ptrdiff_t would be preserved in that case. Such preservation doesn't seem very probable to me, so potentially this method might allow to avoid overflows (or to move them much higher) -- to be checked in the 'ld' doc!
C11 6.5.6/8:
If the result points one past the last element of the array object, it shall not be used as the operand of a unary * operator that is evaluated
He doesn't use the * operator on it, he just calculates its position. If he were to access it (i.e., use it with *) then that would be breaking the rule.
Shows how difficult it is to get a spec right.
So, IMO, you are right, the code in the article is illegal (strictly speaking).
But I think it is likely that most compilers would still allow it, because that clause in the spec essentially exempts the compiler from adding an explicit bounds check.
So if p is a pointer, then p+1 refers to the next element after p, regardless of the size of the pointee. And so (p+1) - p is 1, again regardless of the size of the pointee.
In this case, &arr is a pointer to array, and &arr + 1 would point to the next array following the first one. But we wanted to calculate the number of elements in the array, not the fact that we have one array. So we dereference the pointer, thus getting an array type, which in turns "decays" to a pointer to the first element of the array, which has the right type for counting the elements using pointer arithmetic.
that's the entire mystery opened and closed afaik. sure you can use some obscure notation if you like, but why not just use sizeof?
Basically ptr + integer requires the compiler to determine the sizeof ptr's type.
No. From 6.5.6 Additive operators:
7 For the purposes of these operators, a pointer to an object that is not an element of an array behaves the same as a pointer to the first element of an array of length one with the type of the object as its element type.
8 [...] if the expression P points to the last element of an array object, the expression (P)+1 points one past the last element of the array object [...] If both the pointer operand and the result point to elements of the same array object, or one past the last element of the array object, the evaluation shall not produce an overflow; otherwise, the behavior is undefined. If the result points one past the last element of the array object, it shall not be used as the operand of a unary * operator that is evaluated.
So &arr + 2 can overflow, and &arr + 1 cannot be dereferenced, but &arr + 1 shall not overflow and is not undefined behaviour.
Edit: Also now that I think about it, I've written code that relied on that behavior...not sure if I'd heard it before and internalized and forgot it, or just was being foolish.
Quoting http://stackoverflow.com/a/16019052/1470607
Note that this trick will only work in places where `sizeof` would have worked anyway.
Unless you're writing a buffer overflow exploit, in which case you need to know exactly what's on the stack and where, this isn't a good way to program.
Update: misread the article; thought he was differencing with the beginning of the next array.
$ cat test.c
#include <stdio.h>
int arr[5];
int main(int argc, char *argv[]) {
printf("%lu, %ld\n", sizeof(arr) / sizeof(*arr), (&arr)[1] - arr);
}
$ gcc test.c && ./a.out
5, 5
Not saying it's "a good way to program" - it's needlessly obfuscated compared to the standard sizeof alternative. But it doesn't rely on anything tricky.
I am not sure it is the case here. The code uses only one array; how can it assume the order of arrays?
in c++, a compile-time equivalent to sizeof would be:
template<typename T, size_t N> size_t sz(T(&)[N]) { return N; }For example, the shift operators have higher precedence than bitwise masking (and/or/xor) since this way the expressions setting/clearing ranges of bits won't require parentheses (so increased readability) and the masking constants in them will be the narrowest. Loading a wide immediate value into a register sometimes takes several instructions, so such precedence also brings in the least cost as well (nowadays compilers take care of that to some extent).
But people frequently mess up this aspect, use lots of parens (and ending up with wide masks) saying this rule is not intuitive. It is.
http://stackoverflow.com/questions/671790/how-does-sizeofarr...
I think this is effectively doing the same thing, but in a non-standard way; i.e. I think `int n = (&arr)[1] - arr;` is substituted with the actual number by the compiler the same way sizeof() would be, only no one will know wtf is going on.
Disclaimer: I didn't look at the generated code to confirm; I guess it could even be compiler/runtime dependent.