Gangnam Style breaks YouTube viewer count (opens in new tab)

(plus.google.com)

530 pointsstephenheron11y ago226 comments

226 comments

104 comments · 21 top-level

gavinpc11y ago· 16 in thread

Can't quite tell if this is a joke, but here's a related "story about a bug" from Doug Crockford [0]:

    I made a bug once, and I need to tell you about it.  So, in 2001, I wrote a
    reference library for JSON, in Java, and in it, I had this line
    
        private int index
    
    that created a variable called "index" which counted the number of characters in
    the JSON text that we were parsing, and it was used to produce an error message.
    Last year, I got a bug report from somebody.  It turns out that they had a JSON
    text which was several gigabytes in size, and they had a syntax error past two
    gigabytes, and my JSON library did not properly report where the error was — it
    was off by two gigabytes, which, that's kind of a big error, isn't it?  And the
    reason was, I used an int.
    
    Now, I can justify my choice in doing that.  At the time that I did it, two
    gigabytes was a really big disk drive, and my use of JSON still is very small
    messages.  My JSON messages are rarely bigger than a couple of K.  And — a
    couple gigs, yeah that's about a thousand times bigger than I need, I should be
    all right.  No, turns out it wasn't enough.
    
    You might think well, one bug in 12 years you're doing pretty good.  And I'm
    saying no, that's not good enough.  I want my programs to be perfect.  I don't
    want anything to go wrong.  And in this case it went wrong simply because *Java
    gave me a choice that I didn't need, and I made the wrong choice*.

[0] https://www.youtube.com/watch?v=bo36MrBfTk4&t=38m

EDIT: is there a reference for formatting comments? I've never been able to find one.

danbruc11y ago

He did not need the choice but others do. And he is wrong when he says it makes no difference whether you use a byte or eight of them. Yes, it will take the same amount of time to add two of them but it will also cost eight times more cache space and memory bandwidth to move them around. It may not be an issue if you have a single number or ten of them but it certainly becomes one if you have an array with millions or billions of them.

rtpg11y ago

There are use cases where you need the choice, but most people do not need the choice.

Most programs written in the real world (enterprise-y Java apps) do not need strong control on GC, choice of integer types, or many other things offered to them. Reducing choice will increase code/tool quality.

I think that we should make the uncommon choice reallllly hard to put into place. Make it a pain to configure the GC, give specific integer types really long names. Just stop people from premature optimization and leave these tools to people who know what they're doing.

2 more replies

philosophus11y ago

I don't believe he said it makes no difference.

1 more reply

gioele11y ago

Interesting, this is similar to the discussion going on for "int" in Rust (or the exact opposite, depending on how you view it). [1, 2]

One the one hand "implicit int" are being phased out in favour of explicit int size. Your variables cannot be `int` anymore, you have to sit down, think and choose: u8? u16? u32? i32? u64? i64? This avoids all the pains of programs behaving differently or crashing when compiled on different architectures.

On the other hand, a new "native integer for sizes that do not matter, minimum 32 bits" is being brewed, for example for pointer offsets or collection sizes. The idea is that you will not be able to have a collection with more of 2^32 elements in a 32-bit architecture nor more than 2^64 in a 64-bit architecture.

After this discussion, my hope is to see the introduction of a fast-ish dynamic bigint (that starts native and grow up to 256 or 512 bits) that can be used in all the cases where you do not care about the exact size type, yet you want to be future-proof (this `private int index` fits this case, IMO).

[1] http://discuss.rust-lang.org/t/if-int-has-the-wrong-size/454... [2] https://github.com/rust-lang/rfcs/pull/464

Narishma11y ago

Wouldn't that limit rust to 32-bit or more architectures?

2 more replies

robert_tweed11y ago

I'm guessing (since this is Doug Crockford talking about JSON) that this was in reference to how JavaScript does things differently, in that it just stores everything as floats, which are quite capable of representing integers within the 32-bit range anyway.

However, an overflow to floating point isn't necessarily an improvement because, while a float will hold bigger numbers, it does so with limited precision and sometimes that lack of precision will cause bugs too. Probably more often, in fact.

In the example given it wouldn't be so bad, but you'd only get an approximate indication of where the error occurred rather than a specific line/character. So of course, whatever is reporting the error would now need to understand and handle the much more complex scenario of "fuzzy" location information instead of a simple unique index to a specific character. Depending on what it then needs to do with that information, the complexity could spiral from there.

If you want to just have things work no matter what, you have no choice but to use bignums. I was wondering about this recently, so did some benchmarks in Clojure. The performance was horrible, so frankly this is still not a viable alternative. Maybe in 10 years time, if every CPU has a bignum coprocessor by then.

Also, there are times, particularly in low-level graphics programming or cryptography, where you actually want integer modulo arithmetic, or to be able to do bitwise booleans predictably. In those cases, JavaScript-style loose typing can be a huge pain.

BTW, I've been a big advocate of JavaScript for about as long as Doug Crockford, so my point isn't that JavaScript-style type handling is bad: just that it's very far from a silver bullet.

cstavish11y ago

>> Java gave me a choice that I didn't need, and I made the wrong choice

What if Java gave an arguably more useful choice--whether to use a signed or unsigned integer?

whyever11y ago

If you use an unsigned 32 bit integer instead of a signed one then you run into problems at 4 instead 2 gigabytes.

2 more replies

tdsamardzhiev11y ago

If you are working with files bigger than 2GB, hoping they're smaller than 4GB is NOT a good habit.

And I certainly don't believe there are programmers making only 1 mistake for 12 years. I believe he's just making a joke, or using the example as a means to an end.

talles11y ago

I loved how he ended

  *Java gave me a choice that I didn't need, and I made the wrong choice*

lerchmo11y ago

What if someone else needs that choice? like anyone doing compression?

3 more replies

batuhanicoz11y ago

Reference for comment formatting can be found here: https://news.ycombinator.com/formatdoc

Tepix11y ago

He's pushing for DEC64 now (http://DEC64.com)

peterashford11y ago

I'm constantly amused how Java is supposedly stupid for protecting programmers from things they shouldn't do and yet also stupid for not protecting programmers from things they shouldn't do.

IMHO Java makes some choices about safety. If you don't agree with those choices, use a different tool. It doesn't make Java wrong for having a different opinion. Likewise I wouldn't berate C for being too low level or Ruby for favouring readability over performance.

lmm11y ago

So for that kind of software you use Python. Doesn't everyone know that? Java gives you that choice because it's for the kind of software where you need that choice.

ColinWright11y ago

This is a FAQ. Oddly enough, the "FAQ" link at the bottom of the page takes you to the FAQ, in which it says:

========

What kind of formatting can you use in comments?

http://news.ycombinator.com/formatdoc

========

Was that what you were looking for?

ChuckMcM11y ago· 14 in thread

The interesting meta-point though is that an audience of 20 million viewers is a big hit [1] so a billion views is 20M people watching it 50 times or, 200M people watching it 5 times. And 2 billion views is double that.

Put in perspective that is probably in excess the number of times the most favored "I Love Lucy" show has been seen. Or put another way, you've got a music video with the same eyeball impact as the highest rated television show ever.

That says to me that either advertising on Youtube is a bargain or advertising on TV is way over priced :-)

[1] http://tvbythenumbers.zap2it.com/2014/02/10/the-walking-dead...

[2] http://en.wikipedia.org/wiki/I_Love_Lucy

derefr11y ago

Or advertising on TV seriously under-represents the total number of impressions over time through alternate consumption streams. Right now, supposedly "unpopular" shows are cancelled, and then immediately get a successful Kickstarter from what turns out to be millions of fans who happened to be watching only through Netflix, or iTunes, or DVD box sets.

(Of course, none of these streams show the same ads the original broadcast does—but if you're a clever ad agency, you're already doing product-placement instead of interstitials most of the time anyway.)

hsod11y ago

> Right now, supposedly "unpopular" shows are cancelled, and then immediately get a successful Kickstarter from what turns out to be millions of fans who happened to be watching only through Netflix, or iTunes, or DVD box sets.

Can you name any examples of this?

The closest thing I can think of is Veronica Mars which was Kickstarted many years later and raised ~5 million dollars from 91,000 backers to make a single movie.

I think perhaps the "alternate consumption streams" viewers are not as lucrative as you think.

2 more replies

ethbro11y ago

> (Of course, none of these streams show the same ads the original broadcast does—but if you're a clever ad agency, you're already doing product-placement instead of interstitials most of the time anyway.)

I think you just backdoored into the most interesting ad campaign ever: 1) Find a show with a directory / writer / production team known for producing content that "stands the test of time" (e.g. likely to have a high total_views_over_time:broadcast_views ratio) 2) Include product placement for a non-existent product by a currently-existing company with strong brand recognition 3) Test response to non-existent product by initial viewers 4) Start viral campaign around non-existent product (this likely favors "Hunh?" shows a la Lost or Fringe) 5) Trigger view bump in show (win award, produce new episodes in partnership with Netflix, produce new movie, etc.) 6) Launch real-product multiple years after initial product placement

tedunangst11y ago

I think you're missing a unit. You should be measuring eyeball-minutes. An episode of The Walking Dead might be 20M x 45min = 900 megaeyeball-minutes. Gangnam Style is 2B x 3min = 6000 megaeyeball-minutes. Disregarding target demographics for the moment, that says the advertising spend for a first run episode of Walking Dead should be about equal to 15% of the lifetime spend for Gangnam Style.

MichaelApproved11y ago

You're equating two things that have different lengths of attention which require different attention spans. They're also different in how the audience viewed that content. That gives the advertiser a different experience with the viewer.

For example, with I Love Lucy, the audience member likely sat and watched the entire commercial. With a YouTube video, the audience member can skip the ad or move on to other content.

TV = 22 minutes of content.

YouTube Video = 3 minutes of content.

Plus, the metrics that constitute views between the two media formats are completely different.

TeMPOraL11y ago

Advertising on YouTube is even more stupidly annoying than on TV. Fortunately, we have AdBlock :).

bjz_11y ago

I watch lots of independent channels, and I purposefully turn AdBlock off for Youtube. :)

1 more reply

eitally11y ago

And god knows how many times the top music videos on YT are played at parties & other semi-public events! Heck, as the parent of young children, I've probably watched things like Gangnam Style >20 times just within my house.

davedx11y ago

Indeed! There's a cartoon rabbit for small kids here in the Netherlands called "Nijntje", and there are a few "official Nijntje songs" on YouTube. Our 1 year old daughter's favourite is this: https://www.youtube.com/watch?v=20J8DUJMgA4&app=desktop "Nijntje dansles" - it has 12 million views, and there are only 20 million people in the Netherlands, total!

This song has been played many times by a relatively small section of the Dutch population :)

sliverstorm11y ago

That assumes YouTube eyeball count is of equal value to TV eyeball count though, right? Which doesn't seem like something we can assume- YouTube's targeting doesn't seem great, and there are plenty of other things to do on a computer while you wait through the ad.

cloudwalking11y ago

YouTube targeting is a lot more accurate than TV targeting...

spyder11y ago

Maybe YouTube ads aren't the best, but the ads on the Internet can have much better targeting and performance tracking than TV ads.

>there are plenty of other things to do on a computer while you wait through the ad.

Yea, for example you can buy the advertised product with a few clicks. If you are quick enough then you can finish the buying even before the video ad finishes (sure it's not the most realistic scenario but it's possible). Or with a quick search you can learn more about the product to check how honest is the ad. TV ads cannot compete with this efficiency. The only thing TV ads can do better is reaching bigger and the less tech interested audience.

snowwrestler11y ago

Advertising on YouTube is a bargain. Please don't tell anyone!

jickmagger11y ago

> That says to me that either advertising on Youtube is a bargain or advertising on TV is way over priced :-)

NEITHER. Advertising on YT is worthless and yes it used to be way overpriced on TV.

xanderjanz11y ago· 12 in thread

Should have gone with unsigned ints, YouTube!

EDIT: Which is the solution they apparently implemented, converting signed to unsigned at some higher layer.

timothya11y ago

From the Google C++ Style Guide:

"You should not use the unsigned integer types such as uint32_t, unless there is a valid reason such as representing a bit pattern rather than a number, or you need defined overflow modulo 2^N. In particular, do not use unsigned types to say a number will never be negative. Instead, use assertions for this." [0]

[0]: http://google-styleguide.googlecode.com/svn/trunk/cppguide.h...

nly11y ago

Which is a completely birdbrained policy given that signed integer under and overflow is completely undefined. If you want to catch implicit signed -> unsigned conversions then enable that warning on your compiler.... what they'd advocating is just dangerous.

3 more replies

coolgeek11y ago

From the coolgeek style guide:

"Never use a signed type for a number that can never be negative"

One of my pet peeves is developers using int (instead of unsigned ints) for primary keys in database tables.

8 more replies

mohawk11y ago

That seems like bad advice to me. A possible infinite loop is given as justification in case of wrongly implemented reverse iteration (counting down an unsigned loop variable). Well, i claim that an infinite loop is a much more noticeable bug than undefined overflow behaviour, negative view counts, etc. Unsigned ints will make bugs impossible that with signed ints will (hopefully, famous last words) trigger assertions, if they are enabled...

Buge11y ago

One problem with this is that the sizes of STL containers are returned unsigned, and with high warning levels, compilers will warn about comparing a signed int with one of these sizes.

1 more reply

blahedo11y ago

Surely only temporarily, though. I mean, this is an exponential process---adding one bit only doubles the space, and it will not take another nine years before some video passes 4 billion.

sillysaurus311y ago

By that line of reasoning, 15.75 years from now there will be a viewcount greater than 8 billion.

... and now I realize you may be correct, and that it's probably inevitable that a viewcount will not only exceed the total number of people alive, but will double or even quadruple it. Our total population is actually about 100 billion, but only ~7% of us are still alive.

The shadows of the dead will be forever enshrined as YouTube view counts. Our shadows.

3 more replies

lazaroclapp11y ago

Why not just use some sort of unlimited BigNum implementation? Yeah, for small numbers it's still ~2x the size of just storing an int, depending on implementation (or it can be: "int unless MAX_VALUE, in which case bignum is stored somewhere else") and it might be slower to operate on... but, on the other hand, you are already storing and processing a full video for every such counter!

Edit: Now I realize that would mean Google couldn't have made this joke. But I am still not sure this was foreseen by Youtube devs from day one.

xanderjanz11y ago

Yea according to a reddit post from a Googler this was more of a staged easter egg then a real bug. Google coding styles actually prohibit the use of unsigned integers in C++ code.

2 more replies

sytelus11y ago

I suggest someone write a browser add-in that re-plays this 100s of time when machines are idle to do massive distributed viewcount attack and force YouTube upgrade to 64-bit unsigned int now!

tdsamardzhiev11y ago

Counting on the difference between signed and unsigned is asking for trouble. 64-bit ints would be a better option.

31reasons11y ago

Hire people who know the difference Google. Math puzzles doesn't solve everything :)

rkachowski11y ago· 12 in thread

I saw this a few days ago, at first I thought it was an easter egg on youtube's part - saying "so many views we overflow!"

But it's real?! It seems incredibly absurd that it could actually overflow, how are signed values useful for a count of views? How are you going to have negative views?

IvyMike11y ago

In C/C++, unsigned ints can result in a few very subtle bugs--more details here:

http://stackoverflow.com/a/1555186/67591

TL;DR: Google's C++ coding standard says: "Document that a variable is non-negative using assertions. Don't use an unsigned type."

As someone who has written a lot of C++ code to interact with hardware: I view an unsigned as a bucket of bits, and a signed as a number.

hk__211y ago

It is an easter egg: http://www.reddit.com/r/compsci/comments/2nrjc7/gangnam_styl...

rkachowski11y ago

Ah i see, i didn't expect Google to back up an easter egg with an associated G+ post

1 more reply

spb11y ago

Say you're comparing the number of views between two different videos as (video_A.views - video_B.views). How do you represent that the second video has more views than the first?

nly11y ago

    video_B.views > video_A.views

1 more reply

xxxyy11y ago

Legacy code I guess (from some really ancient times). Also tiny bugs like this make great content for blog posts, just like the 301+ views thing.

lmm11y ago

Lots of languages don't bother with a separate unsigned type. Maybe the backend is written in Java.

divegeek11y ago

Java doesn't have unsigned integer types. Google is mostly a Java shop.

hobo_mark11y ago

That is mostly false.

fishnchips11y ago

Java is mostly confined to Android and the frontend (think GWT) which do not constitute a large proportion of the Google internal code as far as I can recall.

exo76211y ago

If anything, google is mostly c++ shop.

Someone11y ago

Java's char is unsigned.

Aldo_MX11y ago· 5 in thread

Next milestone: 19th January 2038 03:14:07 GMT

ravenkat11y ago

Ah, So many systems going to fail on that day for using epoch with 32 bit.

Someone123411y ago

Aren't most Linux servers already 64 bit? And we aren't even close to 2020.

I'm sure some software will need to be re-written between now and 2038, but I don't think it will be quite as bad as Y2K just because that was only a 15 year gap (Sometimes less), whereas this is over 24 years.

I just think a lot of software will be naturally replaced between now and then. And while there will be a slight mad scramble to fix stuff at the last minute, I don't think it is Y2K-2.

2 more replies

0x011y ago

I set the clock to one minute before time_t overflow on an iMac once. Recovering from that and just getting the machine to boot afterwards was no joke.

1 more reply

codezero11y ago

I did tech support when the 99->00 switch happened, got paid 3x overtime. I got one call, and it was actually legitimate, but was a third party piece of software so after that we left and went to a party :)

I doubt this will be a real problem in 2038, then again the prevalence of computing devices is much larger now and will continue to grow by 2038, but so will technical aptitude, so hopefully they'll cancel out and this will still not be a problem.

1 more reply

ge0rg11y ago

Time to print our "Y2K38 consultant" business cards! :)

DigitalSea11y ago· 5 in thread

Wow, this is cool. One video was able to exceed a 32 bit integer thus requiring a change to a 64 bit integer, all caused by one man and one video.

return011y ago

Not sure if he did everything all by himself.

Regardless, it was bound to happen sooner or later as youtube is getting older.

acqq11y ago

It's just a 2 billion limit crossed, 32 bits can count up to 4 billion. Afterwards, they certainly don't have to change to 64-bits, just add a few bits more.

srtjstjsj11y ago

there's no reason to pick any number between 33 and 63

1 more reply

abc_lisper11y ago

And don't forget there are multiple copies of the same video there.

DigitalSea11y ago

My understanding is that the original is the only video that has broken the 32 bit integer barrier though, right?

1 more reply

leephillips11y ago· 4 in thread

The interesting question to me is why this particular video is so wildly popular. I don't generally go in for music videos, but I find this one fascinating and have watched it a dozen times. I read an article that tried to explain to non-Koreans like me the meaning of it all, and apparently there are several layers of parody and social satire. I think I love it for its combination of attitude, surrealism, bizarre humor, and self-mockery, plus the music that seems to fit magically.

prawn11y ago

The explanations of Korean parody/satire are largely irrelevant to its success given its popularity elsewhere, surely? I think it's the bizarre visuals that had it spread (why I tweeted it when it first emerged), then catchiness plus a repeatable dance move. It's the Macarena of its time in that regard.

Being Korean might've given it crossover appeal into much of Asia? Just a guess.

lmm11y ago

> The explanations of Korean parody/satire are largely irrelevant to its success given its popularity elsewhere, surely?

I think they gave people who would otherwise have looked down on a silly craze an excuse to enjoy the video.

orblivion11y ago

I love it because, in a world of fake pop musicians, this guy comes off as such a genuine goof. I can't help but like the guy, I'm very happy for him for this level of success on YouTube. And the song is super catchy. He's one of the very few pop musicians I appreciate (though so far, this is probably the only song of his I care for). The political satire makes it all the more compelling. I love the horse riding on top of a sky scraper.

Shivetya11y ago

in the words of my niece, its fun. She likes the silly man and while the sexual connotations of some things he does might make parents wince, they fly right over her head. She still has that innocence of youth. So why we can enjoy he irreverent humor, the sexual innuendo, she enjoys the silliness at her level. (plus she can do his horse stepping dance)

dogma113811y ago· 4 in thread

Every time i check the most viewed videos on YouTube i get depressed and lose all faith in humanity. Landing on a comet gets you 250K views, anouncing the discovery of the higgs gets you less than 100K, latest twerking video or PewDiePie 2M at least...

tedunangst11y ago

Have you considered that the repeat viewing value of the Higgs boson announcement diminishes rapidly?

Retra11y ago

Humor and music are basic human social functions -- even children understand them. Quantum physics, not so much.

throwaway197911y ago

Not sure why the pessimistic poster is being downvoted. Factoring for repeat viewers, it is sad how little society values scientific achievements. That said, I was up in the wee hours of the morning watching the LHSC start up and probably put more than 20 views on that song. Have ye hope :D

4 more replies

1009811y ago

You mean you're sad that the majority of people on earth don't know or care about higgs boson? That's a weird thing to be sad about.

jmount11y ago· 3 in thread

Nifty example. Billionaires, trillion dollar budges, billion-view celebrities, fast CPUs, and large memories: all reasons I am done with 32 bit architectures (old article of mine, but only on large memories http://www.win-vector.com/blog/2012/09/i-am-done-with-32-bit... ).

diego11y ago

32-bit architectures have nothing to do with the size of different data types that have existed forever. We had 64-bit longs in 8-bit cpus.

Also, there are perfectly valid applications that require numbers of 8, 16, 32 or 64 bits (or variable encodings with arbitrary precision). Petabytes, embedded microcontrollers, etc.

jmount11y ago

Sorry I was unclear. 32 bits architecture can mean a lot of different things (buss sizes, address word sizes, and so on). Mostly I am done with small pointers (having to use segments to address all of your memory, or not being able to memory-map a disk sucks) and small counters (only being able to put signed 32 bit integers into a collection sucks).

agumonkey11y ago

True, the HP48 pocket calculator was a "4bit" cpu with 64bit fp-able registers.

IgorPartola11y ago· 3 in thread

uint_32 strikes again! And one day we'll stop using it in favor of int_64, and all unique identifiers will be string, and all will be well.

I remember when Twitter had rolled over their tweet ID's because they were using an int type that was too short. Should have gone with variable length strings to avoid that problem.

Someone123411y ago

> uint_32 strikes again!

It isn't a uint32, it is an int32.

Using strings avoids one problem but introduce a bunch of others (e.g. a string is harder to verify, therefore less secure, and therefore needs to be handled with the kiddie gloves). Checking that every character is between 0-9 and dropping all other characters is easy, cheap, and effective. Then just check it is between uint64.Min and uint64.Max, and you're done.

uint32 gives them twice as much capacity (which isn't enough at this stage), they'll likely want to go with a uint64.

rinon11y ago

Var-length strings are often too slow to manipulate and store. Better choice to use a fixed but large (64-bit) integer.

michaelgrosner211y ago

Or just use a GUID.

3 more replies

jawedkarim11y ago· 2 in thread

When youtube launched in April, 2005, the initial source code was based on another completely unrelated website that I had worked on before, written in PHP and running on Apache and MySQL. It’s always fascinating how implementations of complex systems evolve.

diroussel11y ago

What was the original site for?

pavel_lishin11y ago

Maybe some sort of exchange site for Magic the Gathering cards?

antimora11y ago· 1 in thread

It looks like it also broke the formatting on the number of the viewers: "2151501252". This string does not have thousands separators.

Direct link to the video: https://www.youtube.com/watch?v=9bZkp7q19f0

lstamour11y ago

It's a joke. Hover your mouse over it to see why ;-)

thibauts11y ago· 1 in thread

Why the hell would you want to store a counter as a signed int in the first place ?

maaku11y ago

Java?

tn1311y ago· 1 in thread

Am I the only one who thinks that Google is posting this bug(!) just to make the Google plus post popular ?

prawn11y ago

Probably not the only reason, but it'd be a bonus viral thing for Google+ and YouTube.

SapphireSun11y ago

I love that they added an easter egg to the actual video. If you hover over the counter, it briefly shows you the negative overflow value.

https://www.youtube.com/watch?v=9bZkp7q19f0

EDIT: I just realized that YouTube also posted a comment to that effect just below the video. :P

rodgort11y ago

Mea culpa. I can't remember why I didn't fix that when I reloaded the entire schema. At least I widened the video ids.

Animats11y ago

This is a minor problem. In the 1980s, the number of tradable things with ticker symbols in US markets passed 32767, and some new issues had to be delayed until it was fixed.

ecesena11y ago

I'd be curious to know how they discovered it. Were they monitoring it? Did someone report it? Did an alarm trigger? ...

adad9511y ago

There is Easter Egg in the video counter. Hover with your mouse. https://www.youtube.com/watch?v=9bZkp7q19f0

alejandc11y ago

unbelievable

jfmercer11y ago

I will always upvote anything related to Gangnam Style. Always.

j / k navigate · click thread line to collapse

226 comments

104 comments · 21 top-level

gavinpc11y ago· 16 in thread

Can't quite tell if this is a joke, but here's a related "story about a bug" from Doug Crockford [0]:

    I made a bug once, and I need to tell you about it.  So, in 2001, I wrote a
    reference library for JSON, in Java, and in it, I had this line
    
        private int index
    
    that created a variable called "index" which counted the number of characters in
    the JSON text that we were parsing, and it was used to produce an error message.
    Last year, I got a bug report from somebody.  It turns out that they had a JSON
    text which was several gigabytes in size, and they had a syntax error past two
    gigabytes, and my JSON library did not properly report where the error was — it
    was off by two gigabytes, which, that's kind of a big error, isn't it?  And the
    reason was, I used an int.
    
    Now, I can justify my choice in doing that.  At the time that I did it, two
    gigabytes was a really big disk drive, and my use of JSON still is very small
    messages.  My JSON messages are rarely bigger than a couple of K.  And — a
    couple gigs, yeah that's about a thousand times bigger than I need, I should be
    all right.  No, turns out it wasn't enough.
    
    You might think well, one bug in 12 years you're doing pretty good.  And I'm
    saying no, that's not good enough.  I want my programs to be perfect.  I don't
    want anything to go wrong.  And in this case it went wrong simply because *Java
    gave me a choice that I didn't need, and I made the wrong choice*.

[0] https://www.youtube.com/watch?v=bo36MrBfTk4&t=38m

EDIT: is there a reference for formatting comments? I've never been able to find one.

danbruc11y ago

rtpg11y ago

There are use cases where you need the choice, but most people do not need the choice.

2 more replies

philosophus11y ago

I don't believe he said it makes no difference.

1 more reply

gioele11y ago

Interesting, this is similar to the discussion going on for "int" in Rust (or the exact opposite, depending on how you view it). [1, 2]

[1] http://discuss.rust-lang.org/t/if-int-has-the-wrong-size/454... [2] https://github.com/rust-lang/rfcs/pull/464

Narishma11y ago

Wouldn't that limit rust to 32-bit or more architectures?

2 more replies

robert_tweed11y ago

BTW, I've been a big advocate of JavaScript for about as long as Doug Crockford, so my point isn't that JavaScript-style type handling is bad: just that it's very far from a silver bullet.

cstavish11y ago

>> Java gave me a choice that I didn't need, and I made the wrong choice

What if Java gave an arguably more useful choice--whether to use a signed or unsigned integer?

whyever11y ago

If you use an unsigned 32 bit integer instead of a signed one then you run into problems at 4 instead 2 gigabytes.

2 more replies

tdsamardzhiev11y ago

If you are working with files bigger than 2GB, hoping they're smaller than 4GB is NOT a good habit.

And I certainly don't believe there are programmers making only 1 mistake for 12 years. I believe he's just making a joke, or using the example as a means to an end.

talles11y ago

I loved how he ended

  *Java gave me a choice that I didn't need, and I made the wrong choice*

lerchmo11y ago

What if someone else needs that choice? like anyone doing compression?

3 more replies

batuhanicoz11y ago

Reference for comment formatting can be found here: https://news.ycombinator.com/formatdoc

Tepix11y ago

He's pushing for DEC64 now (http://DEC64.com)

peterashford11y ago

I'm constantly amused how Java is supposedly stupid for protecting programmers from things they shouldn't do and yet also stupid for not protecting programmers from things they shouldn't do.

lmm11y ago

So for that kind of software you use Python. Doesn't everyone know that? Java gives you that choice because it's for the kind of software where you need that choice.

ColinWright11y ago

This is a FAQ. Oddly enough, the "FAQ" link at the bottom of the page takes you to the FAQ, in which it says:

========

What kind of formatting can you use in comments?

http://news.ycombinator.com/formatdoc

========

Was that what you were looking for?

ChuckMcM11y ago· 14 in thread

That says to me that either advertising on Youtube is a bargain or advertising on TV is way over priced :-)

[1] http://tvbythenumbers.zap2it.com/2014/02/10/the-walking-dead...

[2] http://en.wikipedia.org/wiki/I_Love_Lucy

derefr11y ago

hsod11y ago

Can you name any examples of this?

The closest thing I can think of is Veronica Mars which was Kickstarted many years later and raised ~5 million dollars from 91,000 backers to make a single movie.

I think perhaps the "alternate consumption streams" viewers are not as lucrative as you think.

2 more replies

ethbro11y ago

tedunangst11y ago

MichaelApproved11y ago

For example, with I Love Lucy, the audience member likely sat and watched the entire commercial. With a YouTube video, the audience member can skip the ad or move on to other content.

TV = 22 minutes of content.

YouTube Video = 3 minutes of content.

Plus, the metrics that constitute views between the two media formats are completely different.

TeMPOraL11y ago

Advertising on YouTube is even more stupidly annoying than on TV. Fortunately, we have AdBlock :).

bjz_11y ago

I watch lots of independent channels, and I purposefully turn AdBlock off for Youtube. :)

1 more reply

eitally11y ago

davedx11y ago

This song has been played many times by a relatively small section of the Dutch population :)

sliverstorm11y ago

cloudwalking11y ago

YouTube targeting is a lot more accurate than TV targeting...

spyder11y ago

Maybe YouTube ads aren't the best, but the ads on the Internet can have much better targeting and performance tracking than TV ads.

>there are plenty of other things to do on a computer while you wait through the ad.

snowwrestler11y ago

Advertising on YouTube is a bargain. Please don't tell anyone!

jickmagger11y ago

> That says to me that either advertising on Youtube is a bargain or advertising on TV is way over priced :-)

NEITHER. Advertising on YT is worthless and yes it used to be way overpriced on TV.

xanderjanz11y ago· 12 in thread

Should have gone with unsigned ints, YouTube!

EDIT: Which is the solution they apparently implemented, converting signed to unsigned at some higher layer.

timothya11y ago

From the Google C++ Style Guide:

[0]: http://google-styleguide.googlecode.com/svn/trunk/cppguide.h...

nly11y ago

3 more replies

coolgeek11y ago

From the coolgeek style guide:

"Never use a signed type for a number that can never be negative"

One of my pet peeves is developers using int (instead of unsigned ints) for primary keys in database tables.

8 more replies

mohawk11y ago

Buge11y ago

One problem with this is that the sizes of STL containers are returned unsigned, and with high warning levels, compilers will warn about comparing a signed int with one of these sizes.

1 more reply

blahedo11y ago

Surely only temporarily, though. I mean, this is an exponential process---adding one bit only doubles the space, and it will not take another nine years before some video passes 4 billion.

sillysaurus311y ago

By that line of reasoning, 15.75 years from now there will be a viewcount greater than 8 billion.

The shadows of the dead will be forever enshrined as YouTube view counts. Our shadows.

3 more replies

lazaroclapp11y ago

Edit: Now I realize that would mean Google couldn't have made this joke. But I am still not sure this was foreseen by Youtube devs from day one.

xanderjanz11y ago

Yea according to a reddit post from a Googler this was more of a staged easter egg then a real bug. Google coding styles actually prohibit the use of unsigned integers in C++ code.

2 more replies

sytelus11y ago

I suggest someone write a browser add-in that re-plays this 100s of time when machines are idle to do massive distributed viewcount attack and force YouTube upgrade to 64-bit unsigned int now!

tdsamardzhiev11y ago

Counting on the difference between signed and unsigned is asking for trouble. 64-bit ints would be a better option.

31reasons11y ago

Hire people who know the difference Google. Math puzzles doesn't solve everything :)

rkachowski11y ago· 12 in thread

I saw this a few days ago, at first I thought it was an easter egg on youtube's part - saying "so many views we overflow!"

But it's real?! It seems incredibly absurd that it could actually overflow, how are signed values useful for a count of views? How are you going to have negative views?

IvyMike11y ago

In C/C++, unsigned ints can result in a few very subtle bugs--more details here:

http://stackoverflow.com/a/1555186/67591

TL;DR: Google's C++ coding standard says: "Document that a variable is non-negative using assertions. Don't use an unsigned type."

As someone who has written a lot of C++ code to interact with hardware: I view an unsigned as a bucket of bits, and a signed as a number.

hk__211y ago

It is an easter egg: http://www.reddit.com/r/compsci/comments/2nrjc7/gangnam_styl...

rkachowski11y ago

Ah i see, i didn't expect Google to back up an easter egg with an associated G+ post

1 more reply

spb11y ago

Say you're comparing the number of views between two different videos as (video_A.views - video_B.views). How do you represent that the second video has more views than the first?

nly11y ago

    video_B.views > video_A.views

1 more reply

xxxyy11y ago

Legacy code I guess (from some really ancient times). Also tiny bugs like this make great content for blog posts, just like the 301+ views thing.

lmm11y ago

Lots of languages don't bother with a separate unsigned type. Maybe the backend is written in Java.

divegeek11y ago

Java doesn't have unsigned integer types. Google is mostly a Java shop.

hobo_mark11y ago

That is mostly false.

fishnchips11y ago

Java is mostly confined to Android and the frontend (think GWT) which do not constitute a large proportion of the Google internal code as far as I can recall.

exo76211y ago

If anything, google is mostly c++ shop.

Someone11y ago

Java's char is unsigned.

Aldo_MX11y ago· 5 in thread

Next milestone: 19th January 2038 03:14:07 GMT

ravenkat11y ago

Ah, So many systems going to fail on that day for using epoch with 32 bit.

Someone123411y ago

Aren't most Linux servers already 64 bit? And we aren't even close to 2020.

I just think a lot of software will be naturally replaced between now and then. And while there will be a slight mad scramble to fix stuff at the last minute, I don't think it is Y2K-2.

2 more replies

0x011y ago

I set the clock to one minute before time_t overflow on an iMac once. Recovering from that and just getting the machine to boot afterwards was no joke.

1 more reply

codezero11y ago

1 more reply

ge0rg11y ago

Time to print our "Y2K38 consultant" business cards! :)

DigitalSea11y ago· 5 in thread

Wow, this is cool. One video was able to exceed a 32 bit integer thus requiring a change to a 64 bit integer, all caused by one man and one video.

return011y ago

Not sure if he did everything all by himself.

Regardless, it was bound to happen sooner or later as youtube is getting older.

acqq11y ago

It's just a 2 billion limit crossed, 32 bits can count up to 4 billion. Afterwards, they certainly don't have to change to 64-bits, just add a few bits more.

srtjstjsj11y ago

there's no reason to pick any number between 33 and 63

1 more reply

abc_lisper11y ago

And don't forget there are multiple copies of the same video there.

DigitalSea11y ago

My understanding is that the original is the only video that has broken the 32 bit integer barrier though, right?

1 more reply

leephillips11y ago· 4 in thread

prawn11y ago

Being Korean might've given it crossover appeal into much of Asia? Just a guess.

lmm11y ago

> The explanations of Korean parody/satire are largely irrelevant to its success given its popularity elsewhere, surely?

I think they gave people who would otherwise have looked down on a silly craze an excuse to enjoy the video.

orblivion11y ago

Shivetya11y ago

dogma113811y ago· 4 in thread

tedunangst11y ago

Have you considered that the repeat viewing value of the Higgs boson announcement diminishes rapidly?

Retra11y ago

Humor and music are basic human social functions -- even children understand them. Quantum physics, not so much.

throwaway197911y ago

4 more replies

1009811y ago

You mean you're sad that the majority of people on earth don't know or care about higgs boson? That's a weird thing to be sad about.

jmount11y ago· 3 in thread

diego11y ago

32-bit architectures have nothing to do with the size of different data types that have existed forever. We had 64-bit longs in 8-bit cpus.

Also, there are perfectly valid applications that require numbers of 8, 16, 32 or 64 bits (or variable encodings with arbitrary precision). Petabytes, embedded microcontrollers, etc.

jmount11y ago

agumonkey11y ago

True, the HP48 pocket calculator was a "4bit" cpu with 64bit fp-able registers.

IgorPartola11y ago· 3 in thread

uint_32 strikes again! And one day we'll stop using it in favor of int_64, and all unique identifiers will be string, and all will be well.

I remember when Twitter had rolled over their tweet ID's because they were using an int type that was too short. Should have gone with variable length strings to avoid that problem.

Someone123411y ago

> uint_32 strikes again!

It isn't a uint32, it is an int32.

uint32 gives them twice as much capacity (which isn't enough at this stage), they'll likely want to go with a uint64.

rinon11y ago

Var-length strings are often too slow to manipulate and store. Better choice to use a fixed but large (64-bit) integer.

michaelgrosner211y ago

Or just use a GUID.

3 more replies

jawedkarim11y ago· 2 in thread

diroussel11y ago

What was the original site for?

pavel_lishin11y ago

Maybe some sort of exchange site for Magic the Gathering cards?

antimora11y ago· 1 in thread

It looks like it also broke the formatting on the number of the viewers: "2151501252". This string does not have thousands separators.

Direct link to the video: https://www.youtube.com/watch?v=9bZkp7q19f0

lstamour11y ago

It's a joke. Hover your mouse over it to see why ;-)

thibauts11y ago· 1 in thread

Why the hell would you want to store a counter as a signed int in the first place ?

maaku11y ago

Java?

tn1311y ago· 1 in thread

Am I the only one who thinks that Google is posting this bug(!) just to make the Google plus post popular ?

prawn11y ago

Probably not the only reason, but it'd be a bonus viral thing for Google+ and YouTube.

SapphireSun11y ago

I love that they added an easter egg to the actual video. If you hover over the counter, it briefly shows you the negative overflow value.

https://www.youtube.com/watch?v=9bZkp7q19f0

EDIT: I just realized that YouTube also posted a comment to that effect just below the video. :P

rodgort11y ago

Mea culpa. I can't remember why I didn't fix that when I reloaded the entire schema. At least I widened the video ids.

Animats11y ago

This is a minor problem. In the 1980s, the number of tradable things with ticker symbols in US markets passed 32767, and some new issues had to be delayed until it was fixed.

ecesena11y ago

I'd be curious to know how they discovered it. Were they monitoring it? Did someone report it? Did an alarm trigger? ...

adad9511y ago

There is Easter Egg in the video counter. Hover with your mouse. https://www.youtube.com/watch?v=9bZkp7q19f0

alejandc11y ago

unbelievable

jfmercer11y ago

I will always upvote anything related to Gangnam Style. Always.

j / k navigate · click thread line to collapse