A Quick Puzzle to Test Your Problem Solving (opens in new tab)

(nytimes.com)

533 pointsgranfalloon10y ago281 comments

281 comments

209 comments · 74 top-level

ddlatham10y ago· 30 in thread

I'm curious to see more about the distribution of questions and answers people had, and how the HN population may differ from the NYT's. There will certainly be self selection bias here, but if you're willing to share how you did with others, please enter it here: https://docs.google.com/forms/d/17e5BIL0lH8OHsGj89Zdtdl8GeCV...

The result summary is visible here: https://docs.google.com/forms/d/17e5BIL0lH8OHsGj89Zdtdl8GeCV...

The raw answers are visible here: https://docs.google.com/spreadsheets/d/1ZxR2_eOUtNLXwgKfLO1J...

veli_joza10y ago

People familiar with unit testing and test driven development will feel at home with this kind of puzzle. That doesn't mean that they will be less biased in social/political decisions, it just means that this test will fail to prove a point.

ttctciyf10y ago

Hah :)

Seriously, though, it seems a bit of leap from the existence of confirmation bias to explaining away the public outpourings of US Politicians about their financial crises and foreign policy disasters - in the absence of better data as to just why the given statements were made ascribing this to confirmation bias seems itself open to accusations of confirmation bias! :)

tikhonj10y ago

I mean, it also illustrates how training and systematic reasoning can improve these things. Whether explicitly or implicitly, I picked up certain skills and procedures for problem solving (from programming and math contests) that I now use by default. Trying a bunch of examples, coming up with a hypothesis, trying to disprove it, testing edge cases…

This doesn't mean I always use these—at the very least, I have to explicitly jump into "problem solving" mode—but it means they can be useful.

It's still a meaningful difference, and could very well apply to lots of things beyond this kind of puzzle.

diminishedprime10y ago

That's exactly what I was thinking. I (sometimes) follow TDD, and I applied it to this problem. I made sure to include negatives, 0, positives, and include primes here or there to help avoid issues with multiplication/exponentiation. After a few of these, I felt pretty confident that the rule was simple.

3 more replies

aptwebapps10y ago

Exactly. I'm just as 'No' averse as the next person but the 'No' I'm averse to is the one where you make wrong assumptions and it comes back to haunt you afterwards.

benihana10y ago

I was just writing this out! I believe practice in unit testing is what let me get this question right. I almost submitted n^1, n^2, n^3 before I realized I didn't get any wrong answers and something didn't feel right. I credit this 'instinct' to having written thousands of tests, and trying to make them fail to make sure my tests weren't lying to me.

2 more replies

jerf10y ago

This is now the fourth time I've seen this exact rule. Another example: https://www.youtube.com/watch?v=vKA4w2O61Xo

It's probably not "fair" to say I got it in zero... but I did. :)

Now that the NYT has done it, this puzzle has probably attained enough popularity now that you really ought to change it up a bit now if you're going to run it yourself. Granted, the space of hypotheses as simple as "increasing/decreasing" is pretty small, but your ability to fool people with the first sample run is almost unbounded, so that helps.

Verdex10y ago

Similar story here. I got it in zero because this problem shows up at the early part of HPMOR.

I suspect that the basic idea behind it is about right (people who insist on failures before committing to a theory will probably do "better"). But it seems to me that this test will be best at selecting people who've seen it before and can pretend they didn't (or even remember to ask negative questions when someone asks you to guess three numbers to get the job).

1 more reply

larkspur10y ago

If you look at the raw answers, a lot of people who guessed the wrong answer according to: "What answer did you give?" said they got it right:

Right? What answer did you give?

Yes, double the previous number

Yes, Each number much larger than the previous

Yes, sequence must always increase by 1

Yes, Powers of 2

Yes, h = 2n, i = 2(n+1), j = 2(n+2) . n is an integer

Yes, Powers of 2

abandonliberty10y ago

There's a selection bias - those of us who got it right are more likely to fill that out (:

As a result we can't really rely on overall accuracy, but we can break it out by yes/no to account for the selection bias to get a profile for how a HN correct and incorrect differ.

nzealand10y ago

This crowd needs one more question:

How many "I know the answer, but can I find a flaw in their code" questions did you ask?

stillsut10y ago

to be pedantic: "monotonically increasing" is incorrect. It should be "strictly increasing"

fenomas10y ago

Cool idea, thanks!

Personally, I spent most of my answers playing with the inputs. The form was happy to report that "1e1, 15, 0x10" was a valid sequence. :D

elevensies10y ago

This is a good idea. I submitted the survey but here are my tests: http://imgur.com/hYKudyl

FWIW, I've seen this kind of game before and I was expecting it to be something simple.

Jugurtha10y ago

My answer was: "The sequence is of an increasing real variable, where each subsequent value is greater than the preceding value. It's monotonically varying."

When I clicked "I think I know it", nothing happened. I don't want to click their "I don't want to play; just tell me the answer". But it seems like the right answer. I can't answer your form question if it is the right anwer, since I haven't clicked on their link and don't know for a fact whether it is or not.

Although I used the wrong term, it's strictly increasing.

wzdd10y ago

Your last question is "have you seen a test about confirmation bias before?" But the text of this test says that it's about "why no-one likes to be wrong", which means it's pretty obviously about confirmation bias (and therefore that the test-taker should be wary about just confirming their first intuition).

scuba_man_spiff10y ago

[ 456 , 42 , 48 , No ] [ 1 , 2 , 3 , Yes ] [ 3 , 2 , 1 , No ] [ 45 , 46 , 47 , Yes ] [ -1 , 999 ,1238978 , Yes ] [ 1+1 , 1+2 , 1+3 , Fail ] [ 7 , 654 , 653 , No ] [ 7 , 654 , 655 , Yes ] [ 1.1 , 1.2 , 1.3 , Yes ] [ 1 , e , pi , Fail ] [ 1 , e , 42 , Fail ] [ -1.1 , -1.0 , -0.9 , Yes ] [ 2i , 3i , 4i , Fail ] [ 1 , e , pi , Fail ] [ 0 , 1 , 2 , Yes ] [ -1 , 0 , 1 , Yes ]

If anyone's tallying

pitchups10y ago

Intersting that the split of correct/wrong answers from the HN crowd is 78%/22% - the exact opposite of the general population : 22% / 78%! The HN community does think different :)

teahat10y ago

The article doesn't actually specify that 78% guessed correctly or incorrectly, just that 78% guessed without ever entering an invalid sequence.

philh10y ago

If you're taking that from the article, I think you misread it.

> Remarkably, 78 percent of people who have played this game so far have guessed the answer without first hearing a single no.

Some of those 78% probably got it right, and some of the remainder would have got it wrong.

1 more reply

snowwrestler10y ago

Or at least, the portion of the HN community who reported their results, reported that they think different. :-)

1 more reply

Yizahi10y ago

It just means that this test was described at least several times in different "computer" media over last few years and almost everyone has read about it. Same thing with all "logic" puzzles.

maxerickson10y ago

I wanted to answer "Probably" to the last question about having seen a similar question before.

I don't specifically recall seeing one, but it is likely that I have.

hammock10y ago

Guessed after just one "No"

  3  9 27  yes  (is it exponential series?)
  4 16 64  yes  (is it only odd numbers?)
  5  7  9  yes  (is it any numbers of the same parity?)
  6  7  8  yes  (is it any set of increasing numbers?)
  6  7  6  no   (just to confirm that it's x<y<z, and not something like x<=y<=z)

Retric10y ago

6,7,7 or 1,1,1 satifies x<=y<=z not 6, 7, 6

Falcon910y ago

-2 -4 -8 (is it increasing distance from zero?)

1 more reply

Myrmornis10y ago

It would be relevant to include a histogram of #yes_answers - #no_answers in the summary, to test whether people are biased towards positive rather than negative tests. I think the raw data suggests that it does although I totally failed to create a histogram in google spreadsheets within 5 minutes.

beatbrokedown10y ago

My fuzzing went:

8 4 2 (no) 1 2 3 (yes) 1 1 1 (no) 1 100 123 (yes) 1.0 1.1 1.2 (yes)

answer: incrementing numbers

eru10y ago

Thanks for setting up the survey!

Could you perhaps move to a bar chart instead of pie charts?

ddlatham10y ago

Don't think that's an option for Google docs - but feel free to take the data and share one.

nothrabannosir10y ago· 14 in thread

My mathematical logic is rusty, but if I recall correctly, Gödel's incompleteness theorem basically states that it is impossible to solve this kind of question. No matter how many tests you run, there will always be an uncertainty.

An incredibly stupid example is that the rule could be "yes for strictly increasing, OR if one of the numbers is -18273192783127897981." You'll never know.

I understand this is contrived, especially when the test subject doesn't know. But if you do realize this while doing it, it makes the test a little frustrating..

EDIT: I see people are making a connection with unit testing, and the irony is poetic. This is precisely the problem Dijkstra was talking about when he said that "Testing shows the presence, not the absence of bugs."

monjaro10y ago

This has nothing to do with Gödel's incompleteness theorem. It's much simpler than that: https://en.wikipedia.org/wiki/Wittgenstein_on_Rules_and_Priv...

GhotiFish10y ago

Thanks for the link! That was an interesting read, but I don't think I understand the premise of the argument.

From the article

  ... It is perfectly consistent with your previous use of 
  'plus' that you actually meant it to mean the 'quus' 
  function, ...

That may be true, but only if you assume I'm not referring to the plus derived from the axioms of principa mathematica. I am. Is it then the question that when I refer to the principa mathematica that I'm actually referring to principa quus? If I then describe axiom 1 can you not know my words are referring to axiom quus?

It seems the only power of this assertion is that language provides no absolute common ground.

Am I understanding this correctly?

1 more reply

xenophon10y ago

I think the core idea is much simpler than that. It's the notion that falsifiability is the most powerful tool in our arsenal when we try to conceive of theories that explain a certain state of affairs.

The idea you're getting at is that no number of confirming observations can verify a universal generalization -- the reason why we continue to call generally accepted "truths" theories. But we can increase our certainty to the greatest degree possible by trying to test our hypothesis to the greatest degree we can.

scarmig10y ago

Remotely related: I've been interested for awhile in how the same initial terms of a sequence could possibly be generated by multiple rules.

For example, you might have

2,3...

And the rest of the sequence might look like either

2,3,4,5,6...

2,3,5,8,13...

2,3,5,7,11...

or even

2,3,5,10,20...

Clearly, on some level those sequences are all much less complicated than one defined as "The first term is 2, the second term is 3, the third term is 919243, the fourth term is -1234..."

It's unclear to me how one might rank them in complexity, though. The maximum amount of memory necessary to get an arbitrary nth term? The number of operations necessary to get to the next term?

Another interesting question to me: if there is an ordering of ways to generate a sequence of numbers, given the first couple terms of a sequence of numbers, what are the simplest N ways to generate the full sequence?

drewcrawford10y ago

Fun fact: Mathemetica has a method FindSequenceFunction which does exactly what you describe. In my experience, however, it generally requires at least 4 terms.

I'm not totally sure how the math behind it works (maybe it's similar to Eureqa?) but the results speak for themselves and are rather incredible.

For example, if I run FindSequenceFunction on this input:

    {0, 1, 3, 8, 19, 43, 94, 201, 423, 880}

Which is the number of 0,1 sequences of length n that contain two adjacent 1s

Mathematica produces the result:

    1/10 (5 2^(1 + x) - 5 (1/2 - Sqrt[5]/2)^x + 
    3 Sqrt[5] (1/2 - Sqrt[5]/2)^x - 5 (1/2 + Sqrt[5]/2)^x - 
    3 Sqrt[5] (1/2 + Sqrt[5]/2)^x)

Which, astonishingly, is correct for all the values I've tried. So apparently Mathematica understands more about this sequence than I do, and I know its definition.

Another party trick is to use the input

    {-(1/6), 2/15, -(13/140), 23/315, -(83/1386), 305/6006, -(2269/
     51480), 4259/109395, -(16103/461890), 30616/969969}

Which is the integral x^n (1 - 2 x)^n for x from 0 to 1, for n = 0..<10. Here it seems 10 numbers are required. This yields the solution

    (2^(-2 - 3 x)
    x! (Sqrt[\[Pi]] (1 + x)! + 
    3 (-1)^x 2^(
    2 + 3 x) (1/2 (1 + 2 x))! Hypergeometric2F1[1, 3/2 + x, 
    2 + x, -8]))/((1/2 + x)! (1 + x)!)

Which as far as I can tell, is a closed-form solution (!) to the integral. A solution it worked out to an integral it has never seen, but only the first 10 elements in the sequence.

So it's safe to say Mathematica knows a lot more about math than I do.

2 more replies

nandemo10y ago

How about this?

Choose a programming language. Choose a sequence prefix (in your example: 2, 3). Then consider all the programs that accept n as input and output a sequence of n numbers, such that the first numbers are always 2, 3. Now take the shortest of those programs. The sequence it produces is the "simplest".

If this sounds tedious to code, you could easily outsource via Odesk or something.

1 more reply

liadmat10y ago

Take a look at Kolmogorov complexity. It's uncomputable.

nsrivast10y ago

I think you might enjoy the book Fluid Concepts and Creative Analogies, by Douglas Hofstadter [1]. The first chapter is on (what Hofstadter argues is) the fundamental nature of recognizing patterns in number sequences.

[1] http://www.amazon.com/Fluid-Concepts-And-Creative-Analogies/...

elnion10y ago

This might be helpful: https://en.wikipedia.org/wiki/Generating_function

Generating functions provide a general framework for describing sequences, solving recurrence relations, etc.

the_af10y ago

It's not ironic. Dijkstra's assertion is, in a convoluted way, related to the main point of the quiz and the article :)

yvsong10y ago

With an equal protection clause, i.e., no particular number or a group of numbers can appear in the rule, is the problem solvable?

eru10y ago

Depends on what rules you allow.

What you want to forbid is not so much mentioning specific numbers, but you want to only allow rules that have certain symmetries. Eg you can require tranlation invariance

    rule(x, y, z) = rule(x+offset, y+offset, z+offset)

to restrict the set of rules.

odonnellryan10y ago

Tests like this are frustrating even if you don't recognize the logical fallacy! Just because it's always possible you're wrong. You're either right because you're lucky, or wrong because you messed up!

nightcracker10y ago

This does not hold - the test uses Javascript's numbers, which are finite. So it's theoretically possible to test every combination of inputs and convincingly answer what a possible rule is.

bediger400010y ago· 13 in thread

The official answer to this puzzle makes a huge assumption: that there is one correct answer. There is not one correct answer. (x, 2x, 4x) gives you a "yes" every time, therefore it is a correct answer, at least as automatically checkable, and there's an infinite number of such tuples. To find a "no" you're reduced to random guessing. That's not a puzzle, that's crap. The confirmation bias material might be true, but the puzzle does not illustrate it.

myNXTact10y ago

It's amusing that you went to a website on confirmation bias, did the puzzle incorrectly, presumably read the material on confirmation bias, but still suffer from the effects of confirmation bias.

harperlee10y ago

Amusingly human, if I may add. Reading "Thinking, fast and slow", by Daniel Kahneman, one key idea that I got was that even knowing against biases you are very, very likely to suffer from those biases. Disheartening results were gathered from studies done on well-trained psychologists and people prepared for the experiment, to no avail. Can't remember the details right now, but just read the book, it's awesome. Another good one was "Influence" by Cialdini, but they gave you tips on trying to avoid those biases that, upon reading Kahneman, I don't think anymore that are very useful.

1 more reply

ddlatham10y ago

The puzzle is not "Find a rule that matches everything you tested". It's figure out what rule they are using. It may be true that there are multiple ways to state the rule, such as "x_2 >= x_1 + 1 and x_3 is >= x_1 + 2" but they are equivalent.

Try the sequence: 0,0,0. It would give "yes" for (x,2x,4x), but the actual rule gives it "no".

1 more reply

upquark10y ago

Your task is to learn a class of objects by example. Without seeing any negative examples, you're unlikely to learn the right class (which is unique, it is precisely the class of increasing sequences of length 3). (x, 2x, 4x) is not "a correct answer", as it does not describe the class you were supposed to learn to distinguish.

snowwrestler10y ago

It's not an assumption; the article tells you up front that there is one correct answer.

It is an analog for how science works. When it comes to a natural phenomenon, humans can come up with multiple explanations that fit a given set of observations, but presumably (I mean, this is a basic tenet of science) nature only works in one consistent way.

Thus, the importance of a falsifying test. You form a hypothesis based on the initial observations (in this case, the number sequence 2, 4, 8), and then you propose a test that could falsify your hypothesis.

The trick is that a hypothesis can fail in several ways. It can be outright wrong, like saying "the rule is that the numbers decrease from left to right." That's obviously just wrong.

But it can also be too specific, like saying "the rule is that the exponent increments by one with each step to the right." That matches the given evidence, and tests with other base numbers will succeed too. But it's over-fitting.

Here's a concrete example: a man wearing a red shirt drops a weight and measures gravitational acceleration as 9.8 m/s^2. So he formulates a hypothesis that gravity always produces an acceleration of 9.8 m/s^2 in the presence of a red shirt.

And if he always wears a red shirt, and always tests gravity on the surface of the Earth, he'll always find supporting evidence for that hypothesis.

But of course we know that gravitational acceleration varies depending on mass and distance, and that it's the same no matter what color your shirt is. But he would only find that out if he varied his experiment beyond what his hypothesis predicts.

jpollock10y ago

Well, yes, in this situation random guessing would be very likely to provide an immediate check on the solution.

(1x, 2x, 4x), as indicated in the video below, is not sufficient. It represents a subset of the values that are valid.

Think of it this way. When asked to write a unit test, do you only test the positive outcomes? No, you test to make sure the failures are as you expect as well. Otherwise, you are likely to have what you think is a failure end up as a success.

pdpi10y ago

You got the problem completely wrong.

The idea isn't to come up with tuples that satisfy the predicate. The idea is to figure out what the predicate is in the first place.

Also, you're not in any way, shape, or form reduced to random guessing. If you have an idea of what the rule might be, you build a counterexample. There's a ton of value in _trying_ to get a no but getting a yes instead.

asQuirreL10y ago

From a Computational Learning Theory perspective, we are faced with an infinite hypothesis space, with an infinite VC dimension. So yes, there's not much strategy that can be employed here.

But, the fact that there is one correct answer is not really an assumption that the puzzle makes, it is information we have been given:

> We've chosen a rule that some sequences of three numbers obey -- and some do not.

This simply means that the solution is realisable. No matter how many ways (in English) we have to describe that solution it is still the same solution.

benkant10y ago

Consider the sequence

1, 3, 5, 7

what comes next? 9 right? Or is the sequence generated by 2n − 1 + (n − 1)(n − 2)(n − 3)(n − 4) for n ∈ N. Then we've got 33.

"among all hypotheses consistent with the observations, the simplest is the most likely"

33 is correct, but it's less likely to be the basis for the generation of the sequence.

Your answer of (x, 2x, 4x) proves the puzzle illustrates the confirmation bias, at least in your case.

Does the unit test that confirms your function returns the expected result given one set of arguments prove it correct?

nilkn10y ago

(x,2x,4x) does not actually give a "yes" every time. In particular, it won't work if x is negative. But, in that case, (4x,2x,x) will work.

ttctciyf10y ago

> (x, 2x, 4x) gives you a "yes" every time, therefore it is a correct answer

I think this logic is a bit wonky - if there are sequences that get a "yes", but don't match (x, 2x, 4x) then the correct rule cannot be (x, 2x, 4x), can it?

drostie10y ago

I've read this comment three times and I still can't figure out what you're trying to say. It sounds like you're arguing that all "puzzles" should exist on finite domains so that a brute force solution exists.

"(x, 2x, 4x) gives you a "yes" every time, therefore it is a correct answer, at least as automatically checkable."

Well, no: you can type 1, 2, 3 into the system and it will tell you "yes", but your rule says that it should tell you "no".

It is crucial to the definition of "correct answer" here that your rule should not just say "yes" only for tuples which the widget also says yes to, but also your rule should say "no" only for tuples which the widget also says no to. That is what the puzzle means when it's asking, "can you guess the rule that we've created?"

This makes it very, very different from what I think you're thinking about, which is situations where someone tells you, "what is the next number in this sequence? 4, 7, 13, 25, ...?" where technically there are an infinite number of rules which will generate those 4 numbers first and an arbitrary number afterwards. Technically one of them is "simplest" in the sense that it can be expressed in 7 symbols, but in general it's a complicated problem and there is no best solution.

"To find a 'no' you're reduced to random guessing. That's not a puzzle, that's crap."

In many ways it still is a puzzle but the space that it lives in is richer. If you think about typical "puzzles" they're things like: "here's a grid with some spaces filled in with numbers,

    2 . . 2 . 2 .
    . . . . . . .
    1 . 3 . . 2 .
    . . . . . . .
    3 . . . 2 . 3
    . . 2 . . . .
    . . . . . . .

Each number is a block in a block wall. We want you to turn this into a block maze so that each 'block wall' (set of blocks connected by adjacency) contains exactly one numbered-block whose number says how many total blocks are in the wall. Furthermore the path (non-block space) of the block maze should be connected and should not contain any 'rooms' -- that is, any 2x2 or larger segments of open space."

This 7x7 grid has 10 spaces which are known to be blocks and exactly 12 more blocks scattered in the remaining 39 spaces, so just by those factors alone we're searching only (39 choose 12) ~= 3.91 billion possibilities; we can also use a quick heuristic to identify 6 places which must be "space" to break apart adjacent numbered walls, removing 91% of that search space.

The puzzles, "I have a set of integers where inclusion in the set is governed by a short rule, you can ask me any integer and I will tell you whether it is in my set", by contrast, have an infinite search space. This means that any solution is going to be more interesting, as will the means for checking that solution's validity. You could require, for instance, a Haskell expression of 140 characters or fewer which turns a nonnegative Int named `n` into a Bool, to be judged as "valid" or "invalid" if it properly filters `[1..10000000] :: [Int]`. You could even give the first 100 numbers in the set, e.g.:

    ghci> take 100 $ filter trueFn [0..]
    [2,5,8,9,13,14,18,19,20,25,26,27,32,33,34,35,41,42,43,44,50,51,52,53,54,61,62,63,64,65,
    72,73,74,75,76,77,85,86,87,88,89,90,98,99,100,101,102,103,104,113,114,115,116,117,118,
    119,128,129,130,131,132,133,134,135,145,146,147,148,149,150,151,152,162,163,164,165,
    166,167,168,169,170,181,182,183,184,185,186,187,188,189,200,201,202,203,204,205,206,
    207,208,209]

In this case that's pretty much enough to see the general pattern; the verification covers 10 million bits while the 140-character limit probably limits your search space to 1000 bits or so, so it's going to be hard to get an "incorrect" answer which agrees on that subspace of the whole.

drjesusphd10y ago

> There is not one correct answer. (x, 2x, 4x) gives you a "yes" every time

Not true. What if x is negative?

1 more reply

tmd10y ago· 12 in thread

It responds "No" to (10000000000000000, 10000000000000001, 10000000000000002) so the rule is not so simple after all :)

kittenfluff10y ago

Responds "Yes" to

  9007199254740990, 9007199254740991, 9007199254740992

but "No" to

  9007199254740991, 9007199254740992, 9007199254740993

Presumably this is due to how Javascript handles integers, i.e. it uses the integer part of a float64, to wit

  > parseInt('9007199254740992')
  9007199254740992
  > parseInt('9007199254740993')
  9007199254740992

Edit: I think this is the code that actually reads the numbers the user enters, see [0]

  function l(){
      var a=h.exec(m[1]),f=null,g=null,n=null;
      return a&&(null!==a[1]&&a[1]&&(f=parseInt(a[1],10)),
          null!==a[2]&&a[2]&&(g=parseInt(a[2],10)),
          null!==a[3]&&a[3]&&(n=parseInt(a[3],10))),
      new e(f,g,n)
  }

Edit(2): Actually, I'm not so sure that's the correct code at all. They NYT game is capable of parsing floats correctly (e.g. it accepts 1.1, 1.2, 1.3 as a "Yes") so it's not just using parseInt.

[0] http://a1.nyt.com/assets/interactive/20150612-151638/js/foun...

janka10210y ago

The actual code seems to be from here [0]

on line 588 is the comparison

    var rightWrong = (inputData[0] < inputData[1]) & (inputData[1] < inputData[2]) ? right : wrong;

With a variable declaration on line 545 being

    var inputData = [NaN, NaN, NaN],
        revealed = false,
        right = "<p class = 'g-answer g-yes'>Yes!</p>",
        wrong = "<p class = 'g-answer g-no'>No.</p>";

And `inputData` is changed on text input on line 662

    $("#g-input input").each(function(i) {
        var val = $(this).val();
        inputData[i] = $.isNumeric(val) ? Number(val) : NaN;
    });

It uses the `Number()` function to convert from the input text to an actual number, so it can convert any number format defined by ES5[1] or ES6[2]. So in ES6 you can use binary (0b, 0B) and octal (0o, 0O) formatting along with exponential (1e-2) and hex (0x, 0X). Binary and octal works for me currently on Chrome 43 OS X.

[0] http://graphics8.nytimes.com/newsgraphics/2015/06/16/puzzle/...

[1] http://www.ecma-international.org/ecma-262/5.1/#sec-9.3.1

[2] http://www.ecma-international.org/ecma-262/6.0/#sec-7.1.3.1

mordrax10y ago

yeah alright.. i think someone didn't actually get the point of the article hehe

taigeair10y ago

you broke it.

itaibn10y ago

Further subtleties:

* The number may have optional sign and digits after a decimal point, and may use exponential notation. Example: (-1.2e1, .0E+0, 1.e-3) => "Yes". As seen in the second and third number here, there may be no digits before or after the decimal point, but both at the same time (i.e., ".0" and "0." parse but not ".").

* If the number begins with "0x" or "0X" it is read in hexadecimal, where the digits a-f may be in either case. Hexadecimal notation must not be accompanied by decimal point, sign, or exponential notation.

* No whitespace is permitted within the numeral, even between the sign and the digits as in "+ 11", but both tabs and spaces may be used before and after the numeral without changing its value. In particular, by using a input of the form "1 " it is possible to make rectangular display empty while still parsing it as number. Note that pressing "Check" leads to the numbers being displayed in the rectangle in exactly the same way as they were displayed in the text box, which may depend on the position of the cursor in the text box.

ETA: Also, you mentioned rounding, but there is also exponent overflow and underflow. The application refuses to parse numbers greater or equal to 1.7976932e308. It parses arbitrary negative exponents fine, but it does not recognize that 1e-324 is greater than 0.

thrownaway242410y ago

A test engineer walks into a bar. He orders a beer. He orders two beers. He orders 999999999 beers. He orders 1.00001 beers. He orders -42 beers. He orders 1048576 beers...

Natsu10y ago

Yes, but did he think to order NaN beers?

oldboyFX10y ago

Only on hackernews :O

Kenji10y ago

Good old IEEE 64bit floating point numbers =)

That also means that for(i=0;i<j;++i){} doesn't necessarily terminate for an arbitrary j smaller than infinity, which I find hilarious.

mikeash10y ago

I'd prefer an example along the lines of:

    for(i = j; i < j + 2; i++) {}

With your example it's easy to lose the distinction between "would eventually terminate if you had a fast computer and a lot of time" and "never terminates even in theory." Here, it definitely looks like it should always run two iterations to matter what the numbers are (as long as they're finite), but it doesn't.

Either way, though, it is definitely hilarious!

1 more reply

mark-r10y ago

It terminates for an arbitrary j smaller than 2^53, which is enough for most people.

TheSlowSmoker10y ago

Any idea why? Could it be an error from the input being too large? Or is there some other magic at work here...

Nedit: One of the other responses nailed it. IEEE standards on 64 bit fp ops

yequalsx10y ago· 9 in thread

Math person here. I'm curious to know if anyone used decimal numbers in their tests and if negative numbers were used. The rule is increasing real numbers and one can guess that the rule is increasing numbers without realizing this includes all real numbers and not just integers.

In addition to getting it right did you use an exhaustive set of tests?

MalcolmPF10y ago

I tried a negative series and a decimal series, just in case the rule was increasing natural numbers. I tried very large numbers to see if there was a limit to the rule, and a series that had large contrast in between each element. For fun I tried to see if the app would recognize "pi", "i", or "e", but, perhaps unsurprisingly, it did not.

yequalsx10y ago

I would have checked complex numbers for fun but there is no reasonable ordering of the complex numbers in the way there is for real numbers. It's too bad it didn't recognize pi or e.

Infernal10y ago

I did test with negatives, but I did not think to test with decimals. 15 yes's, 6 no's, and I successfully determined the rule.

tjohns10y ago

I tested with negative integers, but didn't think to test with real numbers unfortunately. (Still got it right, but I should've tested that.)

That said, I did make sure to test all the edge cases I could think of. I was actually going to guess (n^1, n^2, n^3) at first, until it failed for (1, 1, 1).

10 "yes" tests, 6 "no" tests.

dorgo10y ago

I wonder if somebody tested the rule to be deterministic by entering same number over and over. But I think the rule a<b<c is too simple to invite to use creative tests.

perlgeek10y ago

Yes and yes. It wouldn't activate the "check" button with complex numbers.

yequalsx10y ago

There isn't a way to check order for complex numbers in any reasonable way.

1 more reply

pbnjay10y ago

I got it right, using decimals - I was probably a little too exhaustive...

http://imgur.com/6adtvqz

ddlutz10y ago

I tested with real numbers to be curious, just tried 1.1, 1.2, and 1.3.

cabirum10y ago· 6 in thread

function judgeSentence(sentence, numNo)

var probablyWrong = ["doubl", "expon", "multipl", "^", "", "power", "two", "2", "twice", "as big", "nth", "rais"];

var seemsRight = ["larger", "increas", "greater", "small", "less", "big", ">", "<", "go up", "ascending"], weaselWords = ['but ', 'not ', 'odd'];

Been expecting something more interesting than that

10987610y ago

Full method just for fun:

    function judgeSentence(sentence, numNo) {

        sentence = sentence.toLowerCase();

        // no nos -> wrong.
        if (numNo === 0 || sentence == "") {
            return false;
        }

        // if have any fancy words -> wrong.
        var probablyWrong = ["doubl", "expon", "multipl", "^", "**", "power", "two", "2", "twice", "as big", "nth", "rais"];
        if (hasAny(probablyWrong, sentence)) {
            return false;
        }

        // if you have the right words, and no buts.
        var seemsRight = ["larger", "increas", "greater", "small", "less", "big", ">", "<", "go up", "ascending"],
            weaselWords = ['but ', 'not ', 'odd'];
        if (hasAny(seemsRight, sentence) & !hasAny(weaselWords, sentence)) {
            return true;
        }

        // // no nouns, verbs or adjectives in your sentence -> wrong.
        // var s = nlp.pos(sentence).sentences[0],
        //     verbs = s.verbs().map(getWords),
        //     nouns = s.nouns().map(getWords),
        //     adj = s.adjectives().map(getWords),
        //     numWords = verbs.length + nouns.length + adj.length;
        // if (numWords === 0) {
        //     return false;
        // }


        return false;
    }

TeMPOraL10y ago

It's surprising how far you can go by such heuristics. I run an IRC bot that uses matches like this (I'm working on a proper solution right now though) to parse natural language queries and I managed to trick few people into thinking they're talking with human. As long as it's ok for 90% of most common cases, people often won't notice.

tomerico10y ago

Which is another case of confirmation bias

tripzilch10y ago

I've made a robot that screams[0]. That is, just outputs a random string of "AAAAaaa" when its name is mentioned or when somebody else screams (four or more A's).

What is surprising is how basically 15 lines of python implementing these rules, invokes a very real emotional response in a lot of people :-)

I named it "Wilhelm".

[0] inspired by this comic http://gunshowcomic.com/513

stillsut10y ago

include "monoton" in probablyWrong.

The sequence is not monotonically increasing.

thekingofspain10y ago

Interestingly, it would appear that monotonically increasing in general means "strictly increasing": http://mathworld.wolfram.com/MonotoneIncreasing.htm

It only means non-decreasing in the context of a monotonic function, where the definition, I believe, is that the derivative of the function is never <0.

Kenji10y ago· 6 in thread

"We’ve chosen a rule that some sequences of three numbers obey — and some do not. Your job is to guess what the rule is."

My mistake was to assume that choosing the first number uniquely defines the next ones in the sequence. Since, you know, like all the sequence puzzles I've seen before worked like that, and I didn't read it rigorously enough. Oh, by the way, the doubling thing is wrong if you use negative numbers (wrong as in it gives false positives, instead of just false negatives). But the problem definition doesn't even tell what set of numbers we're operating on.

Finding the rule the sequences obey is impossible since it could be that all cases follow a simple rule except for one triplet which you're unlikely to find. It's trivially easy to fool the user into finding a wrong rule.

mikehawkins10y ago

Same here - I was also guilty of feeling a bit clever and avoiding the obvious "oh, the numbers double every time" answer. So, when I found out that if the spelling of each number was one letter longer (4 = four letters, 8 = five letters, 11 = six) I got smug, and didn't bother testing further.

Good article - and a humbling experience. :)

TylerE10y ago

I went for the slightly more clever "series of powers" after 3/9/27 validated.

1 more reply

TheOtherHobbes10y ago

I think it's a reasonable mistake.

Most people will make assumptions about what's required based on previous experience. And hardly anyone will have previous experience where a question formatted in this way isn't asking you to find a series rule.

That's not quite the same thing as confirmation bias. With a bias you're just as likely to discount significant evidence as you are to mismodel the problem space.

DougBTX10y ago

> It's trivially easy to fool the user into finding a wrong rule.

While that's true, the real thing being tested is the user's readiness to prove themselves wrong.

the_af10y ago

It is true that, in general, this quiz is impossible to guess, for the reasons you explain.

But in this case -- which is the main point of the article -- it was actually a trivial rule. No tricks, no special cases. The real purpose wasn't for you to find the actual rule, but to learn about your biases. The only trick here lies in the human mind, and its tendency to validate patterns (and claim an early victory) instead of trying to refute them.

IshKebab10y ago

Yeah I thought it would be something like "Numbers must strictly increase unless the first number is 15326". Then the point of the article would be that some government rules are not well defined.

binarymax10y ago· 4 in thread

The puzzle is not nearly as interesting as the code being able to understand my answer!

My answer: The numbers increase from left to right

Application response: As you seem to have guessed, the answer was extremely basic

philh10y ago

Yes, I'm curious what heuristics it used there.

Playing the confirmation bias game on "playthroughs of the confirmation bias game": it looks like you get that message if you have a 'no' answer, and the word 'increasing' in your guess. ("Increasing by the same amount" is still accepted.) I wouldn't be surprised if other words also count.

Edit: the words '<', '>', 'increas', 'big' and 'larger' seem to count. Looks like they're accepted as substrings, not just words - so 'increase' and 'increasing' are accepted. I could look for a long time, so I'm giving up now.

arjunnarayan10y ago

I simply said: "a < b < c", with no other words, and it gave me the same answer as yours.

Retr0spectrum10y ago

I said "The numbers are in ascending order", which it didn't seem to recognise.

matchu10y ago

"ascending" is one of the words it checks for. Might've typoed it? :/

1 more reply

abecedarius10y ago· 3 in thread

I ran this experiment for a while (code at https://github.com/darius/wason, derived from http://lesswrong.com/lw/g2/positive_bias_test_c_program/).

In my logs most people seem to have gotten it right, though presumably that's because it was linked from LessWrong.

For an actually-fun game like this, see https://en.wikipedia.org/wiki/Zendo_%28game%29.

TeMPOraL10y ago

> though presumably that's because it was linked from LessWrong

No surprise there, given that this experiment is discussed in Sequences, the link to the post present in the article you link to :).

harryjo10y ago

Thank you! I forgot the name of the game, and it is totally unsearchable.

roccaturi10y ago

Zendo is fantastic. Watching people unaccustomed to its sort of problem-solving struggle with and improve their methodologies for being good at the game is really enlightening.

phantarch10y ago· 3 in thread

Funnily enough, I notice confirmation bias quite a bit in a D&D game I am currently DM of. I'm playing with a group of friends who are big into video games, and as a result they consistently seek resolutions to conflicts in D&D by way of what they know from shooters: kill everything in sight. Yes, it's at times a valid answer, but it's not the only one and it's certainly not the most interesting one. The best way that I've seen the confirmation bias dissipate from their thinking is to put them into situations where their bias just doesn't help at all.

pizza10y ago

Maybe some positive/negative feedback built into the campaign could help; e.g. for each act of benevolence/violence, add/subtract a 'karma' point from some running total, and alter the gameplay as needed.

eru10y ago

Just don't tell them explicitly about it. (That'd be taking away all the fun.)

mikeash10y ago

You must be awfully tempted to constantly give them powerful adversaries who are willing to become friends if given the smallest chance.

qznc10y ago· 3 in thread

> A mere 8 percent heard at least three nos

I guessed correctly with only 2 nos. Since there is no penalty for guessing incorrectly here, I felt safe enough with my theory. I might have checked for more nos, if I had to announce my theory publicly (Twitter, comment, etc). However, I also knew about Confirmation Bias beforehand.

rockdoe10y ago

I guessed correctly with only one no.

You can enter all kinds of crazy random sequences which only have The Rule in common and get a yes, which seemed to be enough assurance. If you're trying to get it to say "no" but failing, is that still confirmation bias? Doesn't sound like it.

reagency10y ago

That highlights a great point: on this simple decision rule, using random test data will draw the solution more quickly than human cleverness.

dendriteseeker10y ago

I also guessed correctly with only 2 nos -- my real confirmation came from the fact that "-1, 0, 300000000" (or some number of 0s) was correct, meaning it couldn't be any really meaningful sequence.

dude_abides10y ago· 3 in thread

I'm a data scientist, and it relieved me no end that I got this one right: http://i.imgur.com/V5oJ4i4.png I would have had second thoughts about my career choice if I got this wrong :)

The correct approach for any data modeling problem is to think in terms of entropy. Each subsequent approach should minimize entropy, until you reach diminishing returns.

Sadgrinner10y ago

Isn't your answer technically wrong, though?

The sequence is not monotonically increasing. It's strictly increasing. If you test [1, 1, 2] or [1, 1, 1] or [1, 2, 2], you'll get "No" answers even though those sequences are monotonically increasing.

thaumasiotes10y ago

Not so.

http://mathworld.wolfram.com/MonotoneIncreasing.html

1 more reply

taigeair10y ago

I got so many nos. I can't believe that "Remarkably, 77 percent of people who have played this game so far have guessed the answer without first hearing a single no." That's crazy.

brianwillis10y ago· 3 in thread

This doesn't seem to work right for negative numbers. The article says the rule is "each number must be larger than the one before it", but if you try -2 -4 -6 it says that pattern doesn't match the rule.

Maybe I'm just being pedantic here, but last I checked -6 was larger than -4.

StevenXC10y ago

"Larger" in this context means "greater than" in the usual ordering of the real numbers.

Tloewald10y ago

It works for floating point numbers -- 0.01 0.02 0.04 for example. So it's a geometric series that has to start with a positive number and doubles. (The submit button and the show the answer button were broken for me.)

eru10y ago

That's unintentionally hilarious.

1 more reply

noreasonw10y ago· 2 in thread

I think that the "quick" adjective in the title is purposeful misleading. You are supposed to learn quickly the most general rule, but that is not so easy because there are many possible rules that could fit such a pattern. It seems that you should be rewarded for solving the puzzle quickly and then you fall in the trap. I propose to change the title to "A puzzle to test your Generalization Abilities", and state clearly that you should try to find the most general rule that satisfies all patterns you can think of. In that case, I would expect the conclusion and results of the experiment to be completely different. So to summarize: the so "quick" adjective in the title has a very strong anchor effect.

Edit: changed for grammar and to express more clearly what I think.

Retra10y ago

Maybe it should be "a puzzle many people already know the answer to" in which case the conclusion and results are already obviously biased.

I was able to solve the puzzle without testing any numbers at all. Which really skews the relevance of "only nine percent of people saw three 'no's before answering."

crgt10y ago

You mean you correctly guessed without testing at all - and got lucky. I'm not sure this is the same thing as 'solving' it.

1 more reply

damoncali10y ago· 2 in thread

Cool. The funny thing is I inserted a constraint of my own invention without even realizing it: "Use the fewest number of examples possible." Of course, this meant failing miserably, and was nowhere in the problem statement.

Perhaps that's an additional factor - not exactly confirmation bias, but not unrelated.

Nimitz1410y ago

Eliminating assumptions baby.

https://www.youtube.com/watch?v=9-9VLVkm8R4

aroman10y ago

this is exactly what I did. I wonder what the psychology/cognitive science explanation of this might be.

smilefreak10y ago· 2 in thread

Does laziness have anything to do with the responses? You get a rule that seems to work and so you seek the reward early. It takes effort to prove yourself wrong.

I was trapped by this and guessed it was exponential series n^1,n^2 etc for n starting at greater than 2. While technically true this was not the rule they had in mind.

eru10y ago

Why is it technically true?

smilefreak10y ago

As in every number in that set is a subset of the larger set of x < y < z. Poor language choice it's not true, yes I was wrong. I am just curious as to how much laziness and not necessarily confirmation bias has to do with the result. If getting it wrong had some kind of penalty or getting it right had some kind of reward ( money etc. ..), how much better would people do then?

1 more reply

sp33210y ago· 2 in thread

I got 7 yes, 5 no, and was still wrong. Maybe I'm just dumb.

EvanKelly10y ago

I'm curious about how you came to your answer and what led you there?

Were you testing a pre-supposed hypothesis that confirmed itself?

sp33210y ago

I got the part about increasing eventually, but I thought the third number also had to be the sum of the other two. I came up with the sum idea after trying (3 6 9), so only 2 tests. The idea that they had to be increasing came later. I don't have it open but I'm pretty sure one of my tests was (1 2 5) which should have tipped me off... in conclusion yes, I'm probably dumb.

3 more replies

afitnerd10y ago· 2 in thread

Seems busted now - clicking the "I think I know" button does nothing. I thought the answer was:

Let the first number be x. If x is 0, then the second number is 1. Otherwise, the second number is two times the absolute value of x. The third number is 2 times the value of the second number.

gervase10y ago

The answer they were looking for was: Let the numbers be x, y, and z; x < y < z must be true.

Their hypothesis was that most people would guess as you did (they mentioned 78% of people did so).

lowmagnet10y ago

I had to turn off µBlock to make it work.

jasallen10y ago· 2 in thread

There is no penalty to being told "no". That also applies to "I think I know the answer". There is no penalty for being told "no" there as well, so once you have a reasonable guess why not check it? Disagree with their analysis.

ddlatham10y ago

There is an implied penalty because it says about the answer box "Make sure you’re right; you won’t get a second chance" in contrast to "You can test as many sequences as you want" about the sequences.

Falcon910y ago

This is absolutely correct. Perhaps Hacker News culture is such that they realize such a warning can be overcome through a refresh, or possibly an incognito browser window if a cookie is preventing a second guess. Still though, within the rules of the game, there is no drawback of testing negative sequences, but there's a clearly defined drawback of entering an incorrect answer.

Geee10y ago· 2 in thread

Anyone tried entering letters in the boxes? I didn't. I think that would have been an unexpected twist.

So, after all, I think I fall in the trap of confirmation bias that the sequence must consist of numbers only.

dEnigma10y ago

Well, after all they tell you that the sequence consists of numbers:

>We’ve chosen a rule that some sequences of three numbers obey

>Now it’s your turn. Enter a number sequence in the boxes below, and we’ll tell you whether it satisfies the rule or not.

jazzyb10y ago

I entered A, B, C. The form didn't let me submit the sequence, so they are doing some verification that the entries are numbers. Rational numbers work, though: 1.1, 1.2, 1.3.

72568610y ago· 2 in thread

That page prints the nyt logo in the javacript console (will probably look messed up here):

       0000000                         000        0000000
     111111111      11111111100          000      111111111
     00000        111111111111111111      00000      000000
     000        1111111111111111111111111100000         000
     000        1111       1111111111111111100          000
     000         11       0     1111111100              000
     000          1      00             1               000
     000               00      00       1               000
     000             000    00000       1               000
  00000            0000  00000000       1                00000
  11111            000 00    000000      000                 11111
  00000          0000      000000     00000              00000
     000        10000      000000      000              0000
     000        00000      000000       1               000
     000        000000     10000        1     0         000
     000        1000000 00              1    00         000
     000         1111111                1 0000          000
     000          1111111100           000000           000
     0000          111111111111111110000000            0000
     111111111        111111111111100000          111111111
       0000000              00000000              0000000

NYTimes.com: All the code that's fit to printf() We're hiring: developers.nytimes.com/careers

heartbreak10y ago

Every nytimes.com page displays that in the JS console.

stephengillie10y ago

Putting your engineer hiring notices under the hood is becoming a common "Easter-egg" practice. It's also a form of targeted recruitment advertising - the only people who see it are your target audience.

phaemon10y ago· 2 in thread

I got this wrong, but oddly enough my first attempt was a No. I tried, "6, 1, 8"

After that, I went with doubling and it worked 3 times with various sizes of number, so I went with that.

Double bonus points if you can guess what I was testing with the first sequence (which the given numbers do satisfy).

o0-0o10y ago

Were you guessing random area codes of midwestern states? :)

phaemon10y ago

Hah! No, I'm not American :)

But, in case no-one guesses, the answer is (rot13):

erirefr nycunorgvpny beqre

1 more reply

pbnjay10y ago· 1 in thread

If anyone's curious about the raw numbers, I found the counts here:

http://int.nyt.com/newsgraphics/2015/2015-06-26-rule-guessin...

    {"count": 27723, "numNo":     0, "share": 0.7716051100782098}
    {"count":  2921, "numNo":     1, "share": 0.08129922903504133}
    {"count":  1883, "numNo":     2, "share": 0.05240891758746417}
    {"count":  1285, "numNo":     3, "share": 0.035764980934621056}
    {"count":   880, "numNo":     4, "share": 0.024492749589468118}
    {"count":   525, "numNo":     5, "share": 0.014612151743716774}
    {"count":   288, "numNo":     6, "share": 0.008015808956553202}
    {"count":   156, "numNo":     7, "share": 0.004341896518132985}
    {"count":   108, "numNo":     8, "share": 0.0030059283587074506}
    {"count":    56, "numNo":     9, "share": 0.0015586295193297892}
    {"count":    45, "numNo":    10, "share": 0.001252470149461438}
    {"count":    11, "numNo":    11, "share": 0.00030615936986835146}
    {"count":    22, "numNo":    12, "share": 0.0006123187397367029}
    {"count":     8, "numNo":    13, "share": 0.00022266135990425562}
    {"count":     4, "numNo":    14, "share": 0.00011133067995212781}
    {"count":     5, "numNo":    15, "share": 0.00013916334994015976}
    {"count":     3, "numNo":    17, "share": 0.00008349800996409585}
    {"count":     1, "numNo":    18, "share": 0.000027832669988031953}
    {"count":     3, "numNo":    20, "share": 0.00008349800996409585}
    {"count":     1, "numNo":    21, "share": 0.000027832669988031953}
    {"count":     2, "numNo":  null, "share": 0.000055665339976063906}

buckbova10y ago

Interesting totals. I had 4 No's and would have had at least 5 had I considered negative digits.

stygiansonic10y ago· 1 in thread

Neat - as others have pointed out, I feel that being familiar with unit testing would help in this situation. Having negative test cases is just as important, if not moreso, than having "happy path" tests.

david_shaw10y ago

That's a great point. Perhaps we should show something like this to new developers who don't understand the value.

rtkwe10y ago· 1 in thread

Wonder how many people here immediately knew what the rule was?

This is the standard example of to right way to test a hypothesis/theory and the power of Confirmation Bias, testing sequences that are invalid under the theory instead of testing what you think is correct.

rockdoe10y ago

I guessed that immediately (but still verified it). It was obvious it was going to be a trick question, so...

FilterJoe10y ago· 1 in thread

While the HN crowd mostly gets this right when framed as a math puzzle, my guess is that confirmation bias is alive and well in high tech just like in any other field. One example:

Young 20s entrepreneur vs. early 50s entrepreneur. Without knowing anything about either person, which startup is more likely to succeed? Even if you have the business plans for both, and you meet both - which one are you going to be more skeptical about as you evaluate which one of them gets funding?

eru10y ago

I'd trust the older guy.

dhruvbird10y ago· 1 in thread

People try to make the fit be as tight as possible to the sample data -- the explanation is that simple. I don't buy the explanation provided in the article.

mpu10y ago

Pretty good point. That's how I felt.

Additionally, this setting is probably too close to usual situations you get in school where there is little to no interaction and negative answers from the teacher are seen as failures by students. (Speaking about education in my country only.)

danielvinson10y ago· 1 in thread

I remembered this puzzle from HPMOR, but either way this is just writing a unit test.

Tyr4210y ago

Having just re-read HPMOR a few weeks ago, I could answer it right away, but there was actually a difference from HPMOR's version, which required three positive increasing numbers, while this allowed negatives.

jjuhl10y ago· 1 in thread

The same experiment: https://m.youtube.com/watch?v=vKA4w2O61Xo

zeidrich10y ago

What's funny is that I felt smug and self satisfied for getting the correct answer even though I watched that video when it came out.

ernsheong10y ago· 1 in thread

We should all be doing good on this one if we have been practicing making our test cases are red first before turning them green :D

lugg10y ago

Looking at my result there, seems to be something I need to start doing. TIL :P

To be fair, I did check 3/6/12 - just to make sure it was double and not powers of two. Guess that's the articles point though isn't it!

m4r71n10y ago

Derek Muller of Veritasium has done this test over a year ago on people in one of his videos: https://www.youtube.com/watch?v=vKA4w2O61Xo

I've given the test to various people since then and never once came across someone who'd guess it right away, or within a short period of time. The breaking time came usually after a few minutes when they gave up and started throwing out random numbers that coincidentally did not meet the rule. Once you hit the first "No", it took a very short time to figure out the rule for almost everyone.

aaronbrethorst10y ago

More or less related: when I'm looking for constructive criticism from someone, I'll ask them "what do you dislike about this?" or "what's wrong with this?" instead of "what do you think?"

I tend to get much more interesting and useful feedback this way.

MarkMc10y ago

The comments here suggest people are missing the full significance of this problem. It's not just a cute number puzzle - it demonstrates a profound human weakness that has a deep impact in everything we do.

1. People that think having a gun in the house makes it safer will not try to design an experiment designed to demonstrate the opposite.

2. People who think organic food is better for you than regular food will not try to look for evidence that the two types of foods are equally healthy.

3. An Israeli who believes the area where he lives was uninhabited before 1948 is not going to think about what kind of evidence would contradict that belief.

I'm not saying the views above are incorrect. It's just that we are all guilty of falling in love with our beliefs when they should be mere acquaintances. Hence the quote, "People don't change their minds. They die, and are replaced by people with different opinions." [1]

[1] http://www.paulgraham.com/quo.html

shmageggy10y ago

This phenomenon has had a profound effect on the history and philosophy of science. There have been entire schools of thought based on verification of hypotheses, and entire movements based on refuting those schools. The most effective strategy in this puzzle(and the one that is unintuitive for many) is to systematically generate alternative hypotheses and falsify them. Karl Popper claimed that this method is actually at the core of how we gain scientific knowledge, and his brand of philosophy of science is the most popular and arguably the most successful today.

harryh10y ago

Reminds me of the game mastermind which I loved playing as a kid.

Here's a javscript version: http://www.archimedes-lab.org/mastermind.html

moo10y ago

I've heard of intelligence failures to explain the Iraq war, now this author says it was confirmation bias which caused a completely erroneous justification for the invasion of Iraq. I think the author has confirmation bias in too easily using the term to explain government and corporate policy choices which have been based on false justifications.

sambe10y ago

The "Check" buttons weren't enough of a hint for me and I jumped into "This is a numerical reasoning problem" mode. I'd argue that this kind of situational bias is as much a factor here as confirmation bias.

cristianpascu10y ago

I understand the power of confirmation bias. I believe it to be natural for anyone. It's perfectly normal to seek an explanation that fits the already built cognitive structures, developed through experience. It's unreasonable to simply jump into new paradigms everytime we encounter a new fact. It takes time to prove that it doesn't fit, and then we start looking for new explanations.

However, the test simply required a possible solution. There are plenty solutions and it's absurd to think they have the simplest one. The simplicity of the rule is subjective, in that is evaluated differently by different people. The famous 'simple but no simpler' is relevant here. As long as we were not told to look for the simplest solution, ALL solutions are equally probable. That being the case, I started to with the first solution that popped into my mind. I sticked with that because of my psychological state. Some searched for other solutions.

I don't think that getting an YES was the main driving force. Of course it feels good to get an yes. This is fundamental in human relations. But it's not the whole story. People do not disbelief globar warming because they want to get an YES. The reason is much deeper. Just as many, so many people go to the wave of climate change because it's fashionable, it makes them feel good, accepted , part of the mainstream. Being a climate change denier is being a disident this day (not my flavor of disidentism), and being a disident is not for everyone. And perhaps disidents picking their fight have complicated reasons for doing so.

I went with the doubles.

mc80810y ago

This looked like a puzzle I had seen before, so I assumed this was the case (testing a few sequences just to verify) and turned out right.

I guess the conclusion is that if a problem looks suspiciously like one you've encountered before, there's a good chance that they are the same or similar. The world is self-organizing, not completely random where you must obsessively second-guess your accumulated wisdom.

Gravityloss10y ago

Anyone played Monkey Island 2?

There's a puzzle with a doorman, there's a few distracting clues where the answer is actually very simple.

It doesn't offer confirmation bias though, and it took some time to figure it out.

I consider it a very similar test.

So one could actually construct such a test without the confirmation bias part, and then look at how long it takes for people to realize the simple model.

js210y ago

For a more extensive version of this article, see "Mistakes Were Made (But Not by Me)":

http://www.amazon.com/Mistakes-Were-Made-But-Not/dp/14915141...

harryjo10y ago

Veritasium covered this classic puzzle / bias test nicely, last year. https://www.youtube.com/watch?v=vKA4w2O61Xo

nsxwolf10y ago

So, am I supposed to feel bad if I assumed it was some tricky, hard to figure out function? It just reminded me of those questions on the ACT or whatever and I froze up and got frustrated.

Am I a dim bulb?

pbreit10y ago

I guess I'm the opposite: I got more No's than Yes's. My immediate sense was to find No's. I guess that's why I like 1 star reviews.

sushirain10y ago

My explanation to the scarcity of "No"s: people are used to seeing in these puzzles mostly sub-types of increasing series: exponential, linear, etc. By the time they had ruled out these sub-types, and had resorted to guessing "ascending", they wouldn't have encountered even a single No.

I predict that if the rule was narrower, like "exponential", much more guesses would have yielded No's.

Lorenzo4510y ago

Wish I could have done this with no previous knowledge of the puzzle, I knew it right away because I saw the exact same problem in a Veritasium video.

yellowapple10y ago

I think the first-known matching pattern plays a huge role. The original 2,4,8 sequence, for example, locked me in immediately to doubles of the previous number (causing me to test 1,2,4 and 7,14,28 and such). Had it been a different starting sequence (like 3,9,27), I might've based my guesses differently. Same for 1,2,3.

In other words, first impressions really are important.

mordrax10y ago

They don’t want to hear the answer “no.” In fact, it may not occur to them to ask a question that may yield a no.

So the author's obviously never heard of sanity checking, in fact that's the second thing that I always do once I confirm a solution is to confirm it's not a fallacy.

Having said that, my solutions were -10 -20 -40 -10 -8 -4 1024 1026 1030

and I said it was +2, +4 and got it wrong!

__z10y ago

Veritasium video on the same thing

https://youtu.be/vKA4w2O61Xo

nerdo10y ago

Reminiscent of the folding table libertarians with their questionnaire and that political cartesian coordinate chart. "You answered that 'it's wrong to steal', [...psychobabble...], on this science graph it appears you've always secretly been a libertarian, we meet at the cinnabon on sundays".

Gabriel_Martin10y ago

My process was [1, 2, 3], and then I guessed each number is greater than the last.

It was totally a possibility that they wouldn't apply the simplest rule, but I felt it highly unlikely. This "rule" is a meme of the rationality community, especially given the example, so it seemed pretty likely that it was sequential numbers.

ljk10y ago

did anyone else only try even numbers? guess "increasing even numbers"

pretty interesting how in-the-box my thought process was

SZJX10y ago

Umm... Not sure what's so special about it. Isn't that what all programmers and science people do all day?

drhdylan8810y ago

Veritasium, a pretty interesting youtube channel, posted a video on this experiment a while back. I found the discussion afterwards to be more thought-provoking than this article.

https://www.youtube.com/watch?v=vKA4w2O61Xo

decisiveness10y ago

Constraints allow for only integers <= 9999999999999999 and >= -9999999999999999 which is interesting considering (-)10000000000000000 through (-)9223372036854775807 are also within the bounds of a 64 bit integer.

It's ironic that these facts are not mentioned considering the article is about confirmation bias.

progmanal10y ago

The attached reading material is interesting, but this question is too similar to problems where you guess the next one in the sequence and none are missing.

A rule where the numbers are increasing does not explain why 3 or 5 or 6 is missing from the sequence in that version of the question that is much more common.

pmelendez10y ago

I failed it again... Every time I take a test like this I ended being fooled by the confirmation bias.

theVirginian10y ago

I don't seem to be able to submit my answer or follow the link to see the answer for "just tell me the answer" is this supposed to be a trick or is my browser just not capable of properly following those links? Using Chromebook.

gesman10y ago

After total of 13 answers including 6 "no", I guessed it right :)

arikrak10y ago

the questions is somewhat ambiguous. it would be more interesting to see what happens of people really understood the question and if they had a real motivation to get it correct.

Kluny10y ago

Decimals and negative numbers still get you a "yes" as long as you obey the rule. However, the button to submit your answer didn't work for me.

bluker10y ago

My first assumption was x,2x,2(2x) - then I tested 0,0,0 and got a NO which disproved my first assumption... the answer is obvious after.

Did anyone else test 0,0,0?

adamc10y ago

I got it, but it took a bunch of guesses before I had the pattern. It's a good example of how our preconceptions shape our answers.

0xdeadbeefbabe10y ago

Confirmation bias affects Corporate America and Government Policy, but not science? Talk about confirmation bias :)

jessaustin10y ago

Have large numbers of HN people really not seen this riddle before? Maybe I read weird things.

elwell10y ago

After programming daily for more than a decade, I question my assumptions annoyingly often.

ammaar10y ago

for anyone who hasn't seen it Veritasium did a video on it.

https://www.youtube.com/watch?v=vKA4w2O61Xo

vincston10y ago

Sadly ive already seen Veritasiums test. so i was biased.

PSeitz10y ago

Nice game, got it right after 7 yes and 7 no

_lce010y ago

I thought..

    F(n + 1) = n * 2

limeyx10y ago

Got it right with 7 nos.

flint10y ago

Yup - Dick Cheney!

mastre_10y ago

-10, 0, 0.01

elektromekatron10y ago

I liked this puzzle. It got me. I am thankful for it.

j / k navigate · click thread line to collapse

281 comments

209 comments · 74 top-level

ddlatham10y ago· 30 in thread

The result summary is visible here: https://docs.google.com/forms/d/17e5BIL0lH8OHsGj89Zdtdl8GeCV...

The raw answers are visible here: https://docs.google.com/spreadsheets/d/1ZxR2_eOUtNLXwgKfLO1J...

veli_joza10y ago

ttctciyf10y ago

Hah :)

tikhonj10y ago

This doesn't mean I always use these—at the very least, I have to explicitly jump into "problem solving" mode—but it means they can be useful.

It's still a meaningful difference, and could very well apply to lots of things beyond this kind of puzzle.

diminishedprime10y ago

3 more replies

aptwebapps10y ago

Exactly. I'm just as 'No' averse as the next person but the 'No' I'm averse to is the one where you make wrong assumptions and it comes back to haunt you afterwards.

benihana10y ago

2 more replies

jerf10y ago

This is now the fourth time I've seen this exact rule. Another example: https://www.youtube.com/watch?v=vKA4w2O61Xo

It's probably not "fair" to say I got it in zero... but I did. :)

Verdex10y ago

Similar story here. I got it in zero because this problem shows up at the early part of HPMOR.

1 more reply

larkspur10y ago

If you look at the raw answers, a lot of people who guessed the wrong answer according to: "What answer did you give?" said they got it right:

Right? What answer did you give?

Yes, double the previous number

Yes, Each number much larger than the previous

Yes, sequence must always increase by 1

Yes, Powers of 2

Yes, h = 2n, i = 2(n+1), j = 2(n+2) . n is an integer

Yes, Powers of 2

abandonliberty10y ago

There's a selection bias - those of us who got it right are more likely to fill that out (:

As a result we can't really rely on overall accuracy, but we can break it out by yes/no to account for the selection bias to get a profile for how a HN correct and incorrect differ.

nzealand10y ago

This crowd needs one more question:

How many "I know the answer, but can I find a flaw in their code" questions did you ask?

stillsut10y ago

to be pedantic: "monotonically increasing" is incorrect. It should be "strictly increasing"

fenomas10y ago

Cool idea, thanks!

Personally, I spent most of my answers playing with the inputs. The form was happy to report that "1e1, 15, 0x10" was a valid sequence. :D

elevensies10y ago

This is a good idea. I submitted the survey but here are my tests: http://imgur.com/hYKudyl

FWIW, I've seen this kind of game before and I was expecting it to be something simple.

Jugurtha10y ago

My answer was: "The sequence is of an increasing real variable, where each subsequent value is greater than the preceding value. It's monotonically varying."

Although I used the wrong term, it's strictly increasing.

wzdd10y ago

scuba_man_spiff10y ago

If anyone's tallying

pitchups10y ago

Intersting that the split of correct/wrong answers from the HN crowd is 78%/22% - the exact opposite of the general population : 22% / 78%! The HN community does think different :)

teahat10y ago

The article doesn't actually specify that 78% guessed correctly or incorrectly, just that 78% guessed without ever entering an invalid sequence.

philh10y ago

If you're taking that from the article, I think you misread it.

> Remarkably, 78 percent of people who have played this game so far have guessed the answer without first hearing a single no.

Some of those 78% probably got it right, and some of the remainder would have got it wrong.

1 more reply

snowwrestler10y ago

Or at least, the portion of the HN community who reported their results, reported that they think different. :-)

1 more reply

Yizahi10y ago

It just means that this test was described at least several times in different "computer" media over last few years and almost everyone has read about it. Same thing with all "logic" puzzles.

maxerickson10y ago

I wanted to answer "Probably" to the last question about having seen a similar question before.

I don't specifically recall seeing one, but it is likely that I have.

hammock10y ago

Guessed after just one "No"

  3  9 27  yes  (is it exponential series?)
  4 16 64  yes  (is it only odd numbers?)
  5  7  9  yes  (is it any numbers of the same parity?)
  6  7  8  yes  (is it any set of increasing numbers?)
  6  7  6  no   (just to confirm that it's x<y<z, and not something like x<=y<=z)

Retric10y ago

6,7,7 or 1,1,1 satifies x<=y<=z not 6, 7, 6

Falcon910y ago

-2 -4 -8 (is it increasing distance from zero?)

1 more reply

Myrmornis10y ago

beatbrokedown10y ago

My fuzzing went:

8 4 2 (no) 1 2 3 (yes) 1 1 1 (no) 1 100 123 (yes) 1.0 1.1 1.2 (yes)

answer: incrementing numbers

eru10y ago

Thanks for setting up the survey!

Could you perhaps move to a bar chart instead of pie charts?

ddlatham10y ago

Don't think that's an option for Google docs - but feel free to take the data and share one.

nothrabannosir10y ago· 14 in thread

An incredibly stupid example is that the rule could be "yes for strictly increasing, OR if one of the numbers is -18273192783127897981." You'll never know.

I understand this is contrived, especially when the test subject doesn't know. But if you do realize this while doing it, it makes the test a little frustrating..

monjaro10y ago

This has nothing to do with Gödel's incompleteness theorem. It's much simpler than that: https://en.wikipedia.org/wiki/Wittgenstein_on_Rules_and_Priv...

GhotiFish10y ago

Thanks for the link! That was an interesting read, but I don't think I understand the premise of the argument.

From the article

  ... It is perfectly consistent with your previous use of 
  'plus' that you actually meant it to mean the 'quus' 
  function, ...

It seems the only power of this assertion is that language provides no absolute common ground.

Am I understanding this correctly?

1 more reply

xenophon10y ago

scarmig10y ago

Remotely related: I've been interested for awhile in how the same initial terms of a sequence could possibly be generated by multiple rules.

For example, you might have

2,3...

And the rest of the sequence might look like either

2,3,4,5,6...

2,3,5,8,13...

2,3,5,7,11...

or even

2,3,5,10,20...

Clearly, on some level those sequences are all much less complicated than one defined as "The first term is 2, the second term is 3, the third term is 919243, the fourth term is -1234..."

It's unclear to me how one might rank them in complexity, though. The maximum amount of memory necessary to get an arbitrary nth term? The number of operations necessary to get to the next term?

drewcrawford10y ago

Fun fact: Mathemetica has a method FindSequenceFunction which does exactly what you describe. In my experience, however, it generally requires at least 4 terms.

I'm not totally sure how the math behind it works (maybe it's similar to Eureqa?) but the results speak for themselves and are rather incredible.

For example, if I run FindSequenceFunction on this input:

    {0, 1, 3, 8, 19, 43, 94, 201, 423, 880}

Which is the number of 0,1 sequences of length n that contain two adjacent 1s

Mathematica produces the result:

    1/10 (5 2^(1 + x) - 5 (1/2 - Sqrt[5]/2)^x + 
    3 Sqrt[5] (1/2 - Sqrt[5]/2)^x - 5 (1/2 + Sqrt[5]/2)^x - 
    3 Sqrt[5] (1/2 + Sqrt[5]/2)^x)

Which, astonishingly, is correct for all the values I've tried. So apparently Mathematica understands more about this sequence than I do, and I know its definition.

Another party trick is to use the input

    {-(1/6), 2/15, -(13/140), 23/315, -(83/1386), 305/6006, -(2269/
     51480), 4259/109395, -(16103/461890), 30616/969969}

Which is the integral x^n (1 - 2 x)^n for x from 0 to 1, for n = 0..<10. Here it seems 10 numbers are required. This yields the solution

    (2^(-2 - 3 x)
    x! (Sqrt[\[Pi]] (1 + x)! + 
    3 (-1)^x 2^(
    2 + 3 x) (1/2 (1 + 2 x))! Hypergeometric2F1[1, 3/2 + x, 
    2 + x, -8]))/((1/2 + x)! (1 + x)!)

Which as far as I can tell, is a closed-form solution (!) to the integral. A solution it worked out to an integral it has never seen, but only the first 10 elements in the sequence.

So it's safe to say Mathematica knows a lot more about math than I do.

2 more replies

nandemo10y ago

How about this?

If this sounds tedious to code, you could easily outsource via Odesk or something.

1 more reply

liadmat10y ago

Take a look at Kolmogorov complexity. It's uncomputable.

nsrivast10y ago

[1] http://www.amazon.com/Fluid-Concepts-And-Creative-Analogies/...

elnion10y ago

This might be helpful: https://en.wikipedia.org/wiki/Generating_function

Generating functions provide a general framework for describing sequences, solving recurrence relations, etc.

the_af10y ago

It's not ironic. Dijkstra's assertion is, in a convoluted way, related to the main point of the quiz and the article :)

yvsong10y ago

With an equal protection clause, i.e., no particular number or a group of numbers can appear in the rule, is the problem solvable?

eru10y ago

Depends on what rules you allow.

What you want to forbid is not so much mentioning specific numbers, but you want to only allow rules that have certain symmetries. Eg you can require tranlation invariance

    rule(x, y, z) = rule(x+offset, y+offset, z+offset)

to restrict the set of rules.

odonnellryan10y ago

nightcracker10y ago

This does not hold - the test uses Javascript's numbers, which are finite. So it's theoretically possible to test every combination of inputs and convincingly answer what a possible rule is.

bediger400010y ago· 13 in thread

myNXTact10y ago

It's amusing that you went to a website on confirmation bias, did the puzzle incorrectly, presumably read the material on confirmation bias, but still suffer from the effects of confirmation bias.

harperlee10y ago

1 more reply

ddlatham10y ago

Try the sequence: 0,0,0. It would give "yes" for (x,2x,4x), but the actual rule gives it "no".

1 more reply

upquark10y ago

snowwrestler10y ago

It's not an assumption; the article tells you up front that there is one correct answer.

The trick is that a hypothesis can fail in several ways. It can be outright wrong, like saying "the rule is that the numbers decrease from left to right." That's obviously just wrong.

And if he always wears a red shirt, and always tests gravity on the surface of the Earth, he'll always find supporting evidence for that hypothesis.

jpollock10y ago

Well, yes, in this situation random guessing would be very likely to provide an immediate check on the solution.

(1x, 2x, 4x), as indicated in the video below, is not sufficient. It represents a subset of the values that are valid.

pdpi10y ago

You got the problem completely wrong.

The idea isn't to come up with tuples that satisfy the predicate. The idea is to figure out what the predicate is in the first place.

asQuirreL10y ago

From a Computational Learning Theory perspective, we are faced with an infinite hypothesis space, with an infinite VC dimension. So yes, there's not much strategy that can be employed here.

But, the fact that there is one correct answer is not really an assumption that the puzzle makes, it is information we have been given:

> We've chosen a rule that some sequences of three numbers obey -- and some do not.

This simply means that the solution is realisable. No matter how many ways (in English) we have to describe that solution it is still the same solution.

benkant10y ago

Consider the sequence

1, 3, 5, 7

what comes next? 9 right? Or is the sequence generated by 2n − 1 + (n − 1)(n − 2)(n − 3)(n − 4) for n ∈ N. Then we've got 33.

"among all hypotheses consistent with the observations, the simplest is the most likely"

33 is correct, but it's less likely to be the basis for the generation of the sequence.

Your answer of (x, 2x, 4x) proves the puzzle illustrates the confirmation bias, at least in your case.

Does the unit test that confirms your function returns the expected result given one set of arguments prove it correct?

nilkn10y ago

(x,2x,4x) does not actually give a "yes" every time. In particular, it won't work if x is negative. But, in that case, (4x,2x,x) will work.

ttctciyf10y ago

> (x, 2x, 4x) gives you a "yes" every time, therefore it is a correct answer

I think this logic is a bit wonky - if there are sequences that get a "yes", but don't match (x, 2x, 4x) then the correct rule cannot be (x, 2x, 4x), can it?

drostie10y ago

"(x, 2x, 4x) gives you a "yes" every time, therefore it is a correct answer, at least as automatically checkable."

Well, no: you can type 1, 2, 3 into the system and it will tell you "yes", but your rule says that it should tell you "no".

"To find a 'no' you're reduced to random guessing. That's not a puzzle, that's crap."

In many ways it still is a puzzle but the space that it lives in is richer. If you think about typical "puzzles" they're things like: "here's a grid with some spaces filled in with numbers,

    2 . . 2 . 2 .
    . . . . . . .
    1 . 3 . . 2 .
    . . . . . . .
    3 . . . 2 . 3
    . . 2 . . . .
    . . . . . . .

    ghci> take 100 $ filter trueFn [0..]
    [2,5,8,9,13,14,18,19,20,25,26,27,32,33,34,35,41,42,43,44,50,51,52,53,54,61,62,63,64,65,
    72,73,74,75,76,77,85,86,87,88,89,90,98,99,100,101,102,103,104,113,114,115,116,117,118,
    119,128,129,130,131,132,133,134,135,145,146,147,148,149,150,151,152,162,163,164,165,
    166,167,168,169,170,181,182,183,184,185,186,187,188,189,200,201,202,203,204,205,206,
    207,208,209]

drjesusphd10y ago

> There is not one correct answer. (x, 2x, 4x) gives you a "yes" every time

Not true. What if x is negative?

1 more reply

tmd10y ago· 12 in thread

It responds "No" to (10000000000000000, 10000000000000001, 10000000000000002) so the rule is not so simple after all :)

kittenfluff10y ago

Responds "Yes" to

  9007199254740990, 9007199254740991, 9007199254740992

but "No" to

  9007199254740991, 9007199254740992, 9007199254740993

Presumably this is due to how Javascript handles integers, i.e. it uses the integer part of a float64, to wit

  > parseInt('9007199254740992')
  9007199254740992
  > parseInt('9007199254740993')
  9007199254740992

Edit: I think this is the code that actually reads the numbers the user enters, see [0]

  function l(){
      var a=h.exec(m[1]),f=null,g=null,n=null;
      return a&&(null!==a[1]&&a[1]&&(f=parseInt(a[1],10)),
          null!==a[2]&&a[2]&&(g=parseInt(a[2],10)),
          null!==a[3]&&a[3]&&(n=parseInt(a[3],10))),
      new e(f,g,n)
  }

Edit(2): Actually, I'm not so sure that's the correct code at all. They NYT game is capable of parsing floats correctly (e.g. it accepts 1.1, 1.2, 1.3 as a "Yes") so it's not just using parseInt.

[0] http://a1.nyt.com/assets/interactive/20150612-151638/js/foun...

janka10210y ago

The actual code seems to be from here [0]

on line 588 is the comparison

    var rightWrong = (inputData[0] < inputData[1]) & (inputData[1] < inputData[2]) ? right : wrong;

With a variable declaration on line 545 being

    var inputData = [NaN, NaN, NaN],
        revealed = false,
        right = "<p class = 'g-answer g-yes'>Yes!</p>",
        wrong = "<p class = 'g-answer g-no'>No.</p>";

And `inputData` is changed on text input on line 662

    $("#g-input input").each(function(i) {
        var val = $(this).val();
        inputData[i] = $.isNumeric(val) ? Number(val) : NaN;
    });

[0] http://graphics8.nytimes.com/newsgraphics/2015/06/16/puzzle/...

[1] http://www.ecma-international.org/ecma-262/5.1/#sec-9.3.1

[2] http://www.ecma-international.org/ecma-262/6.0/#sec-7.1.3.1

mordrax10y ago

yeah alright.. i think someone didn't actually get the point of the article hehe

taigeair10y ago

you broke it.

itaibn10y ago

Further subtleties:

thrownaway242410y ago

A test engineer walks into a bar. He orders a beer. He orders two beers. He orders 999999999 beers. He orders 1.00001 beers. He orders -42 beers. He orders 1048576 beers...

Natsu10y ago

Yes, but did he think to order NaN beers?

oldboyFX10y ago

Only on hackernews :O

Kenji10y ago

Good old IEEE 64bit floating point numbers =)

That also means that for(i=0;i<j;++i){} doesn't necessarily terminate for an arbitrary j smaller than infinity, which I find hilarious.

mikeash10y ago

I'd prefer an example along the lines of:

    for(i = j; i < j + 2; i++) {}

Either way, though, it is definitely hilarious!

1 more reply

mark-r10y ago

It terminates for an arbitrary j smaller than 2^53, which is enough for most people.

TheSlowSmoker10y ago

Any idea why? Could it be an error from the input being too large? Or is there some other magic at work here...

Nedit: One of the other responses nailed it. IEEE standards on 64 bit fp ops

yequalsx10y ago· 9 in thread

In addition to getting it right did you use an exhaustive set of tests?

MalcolmPF10y ago

yequalsx10y ago

I would have checked complex numbers for fun but there is no reasonable ordering of the complex numbers in the way there is for real numbers. It's too bad it didn't recognize pi or e.

Infernal10y ago

I did test with negatives, but I did not think to test with decimals. 15 yes's, 6 no's, and I successfully determined the rule.

tjohns10y ago

I tested with negative integers, but didn't think to test with real numbers unfortunately. (Still got it right, but I should've tested that.)

That said, I did make sure to test all the edge cases I could think of. I was actually going to guess (n^1, n^2, n^3) at first, until it failed for (1, 1, 1).

10 "yes" tests, 6 "no" tests.

dorgo10y ago

I wonder if somebody tested the rule to be deterministic by entering same number over and over. But I think the rule a<b<c is too simple to invite to use creative tests.

perlgeek10y ago

Yes and yes. It wouldn't activate the "check" button with complex numbers.

yequalsx10y ago

There isn't a way to check order for complex numbers in any reasonable way.

1 more reply

pbnjay10y ago

I got it right, using decimals - I was probably a little too exhaustive...

http://imgur.com/6adtvqz

ddlutz10y ago

I tested with real numbers to be curious, just tried 1.1, 1.2, and 1.3.

cabirum10y ago· 6 in thread

function judgeSentence(sentence, numNo)

var probablyWrong = ["doubl", "expon", "multipl", "^", "", "power", "two", "2", "twice", "as big", "nth", "rais"];

var seemsRight = ["larger", "increas", "greater", "small", "less", "big", ">", "<", "go up", "ascending"], weaselWords = ['but ', 'not ', 'odd'];

Been expecting something more interesting than that

10987610y ago

Full method just for fun:

    function judgeSentence(sentence, numNo) {

        sentence = sentence.toLowerCase();

        // no nos -> wrong.
        if (numNo === 0 || sentence == "") {
            return false;
        }

        // if have any fancy words -> wrong.
        var probablyWrong = ["doubl", "expon", "multipl", "^", "**", "power", "two", "2", "twice", "as big", "nth", "rais"];
        if (hasAny(probablyWrong, sentence)) {
            return false;
        }

        // if you have the right words, and no buts.
        var seemsRight = ["larger", "increas", "greater", "small", "less", "big", ">", "<", "go up", "ascending"],
            weaselWords = ['but ', 'not ', 'odd'];
        if (hasAny(seemsRight, sentence) & !hasAny(weaselWords, sentence)) {
            return true;
        }

        // // no nouns, verbs or adjectives in your sentence -> wrong.
        // var s = nlp.pos(sentence).sentences[0],
        //     verbs = s.verbs().map(getWords),
        //     nouns = s.nouns().map(getWords),
        //     adj = s.adjectives().map(getWords),
        //     numWords = verbs.length + nouns.length + adj.length;
        // if (numWords === 0) {
        //     return false;
        // }


        return false;
    }

TeMPOraL10y ago

tomerico10y ago

Which is another case of confirmation bias

tripzilch10y ago

I've made a robot that screams[0]. That is, just outputs a random string of "AAAAaaa" when its name is mentioned or when somebody else screams (four or more A's).

What is surprising is how basically 15 lines of python implementing these rules, invokes a very real emotional response in a lot of people :-)

I named it "Wilhelm".

[0] inspired by this comic http://gunshowcomic.com/513

stillsut10y ago

include "monoton" in probablyWrong.

The sequence is not monotonically increasing.

thekingofspain10y ago

Interestingly, it would appear that monotonically increasing in general means "strictly increasing": http://mathworld.wolfram.com/MonotoneIncreasing.htm

It only means non-decreasing in the context of a monotonic function, where the definition, I believe, is that the derivative of the function is never <0.

Kenji10y ago· 6 in thread

"We’ve chosen a rule that some sequences of three numbers obey — and some do not. Your job is to guess what the rule is."

mikehawkins10y ago

Good article - and a humbling experience. :)

TylerE10y ago

I went for the slightly more clever "series of powers" after 3/9/27 validated.

1 more reply

TheOtherHobbes10y ago

I think it's a reasonable mistake.

That's not quite the same thing as confirmation bias. With a bias you're just as likely to discount significant evidence as you are to mismodel the problem space.

DougBTX10y ago

> It's trivially easy to fool the user into finding a wrong rule.

While that's true, the real thing being tested is the user's readiness to prove themselves wrong.

the_af10y ago

It is true that, in general, this quiz is impossible to guess, for the reasons you explain.

IshKebab10y ago

Yeah I thought it would be something like "Numbers must strictly increase unless the first number is 15326". Then the point of the article would be that some government rules are not well defined.

binarymax10y ago· 4 in thread

The puzzle is not nearly as interesting as the code being able to understand my answer!

My answer: The numbers increase from left to right

Application response: As you seem to have guessed, the answer was extremely basic

philh10y ago

Yes, I'm curious what heuristics it used there.

arjunnarayan10y ago

I simply said: "a < b < c", with no other words, and it gave me the same answer as yours.

Retr0spectrum10y ago

I said "The numbers are in ascending order", which it didn't seem to recognise.

matchu10y ago

"ascending" is one of the words it checks for. Might've typoed it? :/

1 more reply

abecedarius10y ago· 3 in thread

I ran this experiment for a while (code at https://github.com/darius/wason, derived from http://lesswrong.com/lw/g2/positive_bias_test_c_program/).

In my logs most people seem to have gotten it right, though presumably that's because it was linked from LessWrong.

For an actually-fun game like this, see https://en.wikipedia.org/wiki/Zendo_%28game%29.

TeMPOraL10y ago

> though presumably that's because it was linked from LessWrong

No surprise there, given that this experiment is discussed in Sequences, the link to the post present in the article you link to :).

harryjo10y ago

Thank you! I forgot the name of the game, and it is totally unsearchable.

roccaturi10y ago

Zendo is fantastic. Watching people unaccustomed to its sort of problem-solving struggle with and improve their methodologies for being good at the game is really enlightening.

phantarch10y ago· 3 in thread

pizza10y ago

eru10y ago

Just don't tell them explicitly about it. (That'd be taking away all the fun.)

mikeash10y ago

You must be awfully tempted to constantly give them powerful adversaries who are willing to become friends if given the smallest chance.

qznc10y ago· 3 in thread

> A mere 8 percent heard at least three nos

rockdoe10y ago

I guessed correctly with only one no.

reagency10y ago

That highlights a great point: on this simple decision rule, using random test data will draw the solution more quickly than human cleverness.

dendriteseeker10y ago

I also guessed correctly with only 2 nos -- my real confirmation came from the fact that "-1, 0, 300000000" (or some number of 0s) was correct, meaning it couldn't be any really meaningful sequence.

dude_abides10y ago· 3 in thread

I'm a data scientist, and it relieved me no end that I got this one right: http://i.imgur.com/V5oJ4i4.png I would have had second thoughts about my career choice if I got this wrong :)

The correct approach for any data modeling problem is to think in terms of entropy. Each subsequent approach should minimize entropy, until you reach diminishing returns.

Sadgrinner10y ago

Isn't your answer technically wrong, though?

thaumasiotes10y ago

Not so.

http://mathworld.wolfram.com/MonotoneIncreasing.html

1 more reply

taigeair10y ago

I got so many nos. I can't believe that "Remarkably, 77 percent of people who have played this game so far have guessed the answer without first hearing a single no." That's crazy.

brianwillis10y ago· 3 in thread

Maybe I'm just being pedantic here, but last I checked -6 was larger than -4.

StevenXC10y ago

"Larger" in this context means "greater than" in the usual ordering of the real numbers.

Tloewald10y ago

eru10y ago

That's unintentionally hilarious.

1 more reply

noreasonw10y ago· 2 in thread

Edit: changed for grammar and to express more clearly what I think.

Retra10y ago

Maybe it should be "a puzzle many people already know the answer to" in which case the conclusion and results are already obviously biased.

I was able to solve the puzzle without testing any numbers at all. Which really skews the relevance of "only nine percent of people saw three 'no's before answering."

crgt10y ago

You mean you correctly guessed without testing at all - and got lucky. I'm not sure this is the same thing as 'solving' it.

1 more reply

damoncali10y ago· 2 in thread

Perhaps that's an additional factor - not exactly confirmation bias, but not unrelated.

Nimitz1410y ago

Eliminating assumptions baby.

https://www.youtube.com/watch?v=9-9VLVkm8R4

aroman10y ago

this is exactly what I did. I wonder what the psychology/cognitive science explanation of this might be.

smilefreak10y ago· 2 in thread

Does laziness have anything to do with the responses? You get a rule that seems to work and so you seek the reward early. It takes effort to prove yourself wrong.

I was trapped by this and guessed it was exponential series n^1,n^2 etc for n starting at greater than 2. While technically true this was not the rule they had in mind.

eru10y ago

Why is it technically true?

smilefreak10y ago

1 more reply

sp33210y ago· 2 in thread

I got 7 yes, 5 no, and was still wrong. Maybe I'm just dumb.

EvanKelly10y ago

I'm curious about how you came to your answer and what led you there?

Were you testing a pre-supposed hypothesis that confirmed itself?

sp33210y ago

3 more replies

afitnerd10y ago· 2 in thread

Seems busted now - clicking the "I think I know" button does nothing. I thought the answer was:

Let the first number be x. If x is 0, then the second number is 1. Otherwise, the second number is two times the absolute value of x. The third number is 2 times the value of the second number.

gervase10y ago

The answer they were looking for was: Let the numbers be x, y, and z; x < y < z must be true.

Their hypothesis was that most people would guess as you did (they mentioned 78% of people did so).

lowmagnet10y ago

I had to turn off µBlock to make it work.

jasallen10y ago· 2 in thread

ddlatham10y ago

Falcon910y ago

Geee10y ago· 2 in thread

Anyone tried entering letters in the boxes? I didn't. I think that would have been an unexpected twist.

So, after all, I think I fall in the trap of confirmation bias that the sequence must consist of numbers only.

dEnigma10y ago

Well, after all they tell you that the sequence consists of numbers:

>We’ve chosen a rule that some sequences of three numbers obey

>Now it’s your turn. Enter a number sequence in the boxes below, and we’ll tell you whether it satisfies the rule or not.

jazzyb10y ago

I entered A, B, C. The form didn't let me submit the sequence, so they are doing some verification that the entries are numbers. Rational numbers work, though: 1.1, 1.2, 1.3.

72568610y ago· 2 in thread

That page prints the nyt logo in the javacript console (will probably look messed up here):

       0000000                         000        0000000
     111111111      11111111100          000      111111111
     00000        111111111111111111      00000      000000
     000        1111111111111111111111111100000         000
     000        1111       1111111111111111100          000
     000         11       0     1111111100              000
     000          1      00             1               000
     000               00      00       1               000
     000             000    00000       1               000
  00000            0000  00000000       1                00000
  11111            000 00    000000      000                 11111
  00000          0000      000000     00000              00000
     000        10000      000000      000              0000
     000        00000      000000       1               000
     000        000000     10000        1     0         000
     000        1000000 00              1    00         000
     000         1111111                1 0000          000
     000          1111111100           000000           000
     0000          111111111111111110000000            0000
     111111111        111111111111100000          111111111
       0000000              00000000              0000000

NYTimes.com: All the code that's fit to printf() We're hiring: developers.nytimes.com/careers

heartbreak10y ago

Every nytimes.com page displays that in the JS console.

stephengillie10y ago

phaemon10y ago· 2 in thread

I got this wrong, but oddly enough my first attempt was a No. I tried, "6, 1, 8"

After that, I went with doubling and it worked 3 times with various sizes of number, so I went with that.

Double bonus points if you can guess what I was testing with the first sequence (which the given numbers do satisfy).

o0-0o10y ago

Were you guessing random area codes of midwestern states? :)

phaemon10y ago

Hah! No, I'm not American :)

But, in case no-one guesses, the answer is (rot13):

erirefr nycunorgvpny beqre

1 more reply

pbnjay10y ago· 1 in thread

If anyone's curious about the raw numbers, I found the counts here:

http://int.nyt.com/newsgraphics/2015/2015-06-26-rule-guessin...

    {"count": 27723, "numNo":     0, "share": 0.7716051100782098}
    {"count":  2921, "numNo":     1, "share": 0.08129922903504133}
    {"count":  1883, "numNo":     2, "share": 0.05240891758746417}
    {"count":  1285, "numNo":     3, "share": 0.035764980934621056}
    {"count":   880, "numNo":     4, "share": 0.024492749589468118}
    {"count":   525, "numNo":     5, "share": 0.014612151743716774}
    {"count":   288, "numNo":     6, "share": 0.008015808956553202}
    {"count":   156, "numNo":     7, "share": 0.004341896518132985}
    {"count":   108, "numNo":     8, "share": 0.0030059283587074506}
    {"count":    56, "numNo":     9, "share": 0.0015586295193297892}
    {"count":    45, "numNo":    10, "share": 0.001252470149461438}
    {"count":    11, "numNo":    11, "share": 0.00030615936986835146}
    {"count":    22, "numNo":    12, "share": 0.0006123187397367029}
    {"count":     8, "numNo":    13, "share": 0.00022266135990425562}
    {"count":     4, "numNo":    14, "share": 0.00011133067995212781}
    {"count":     5, "numNo":    15, "share": 0.00013916334994015976}
    {"count":     3, "numNo":    17, "share": 0.00008349800996409585}
    {"count":     1, "numNo":    18, "share": 0.000027832669988031953}
    {"count":     3, "numNo":    20, "share": 0.00008349800996409585}
    {"count":     1, "numNo":    21, "share": 0.000027832669988031953}
    {"count":     2, "numNo":  null, "share": 0.000055665339976063906}

buckbova10y ago

Interesting totals. I had 4 No's and would have had at least 5 had I considered negative digits.

stygiansonic10y ago· 1 in thread

david_shaw10y ago

That's a great point. Perhaps we should show something like this to new developers who don't understand the value.

rtkwe10y ago· 1 in thread

Wonder how many people here immediately knew what the rule was?

rockdoe10y ago

I guessed that immediately (but still verified it). It was obvious it was going to be a trick question, so...

FilterJoe10y ago· 1 in thread

While the HN crowd mostly gets this right when framed as a math puzzle, my guess is that confirmation bias is alive and well in high tech just like in any other field. One example:

eru10y ago

I'd trust the older guy.

dhruvbird10y ago· 1 in thread

People try to make the fit be as tight as possible to the sample data -- the explanation is that simple. I don't buy the explanation provided in the article.

mpu10y ago

Pretty good point. That's how I felt.

danielvinson10y ago· 1 in thread

I remembered this puzzle from HPMOR, but either way this is just writing a unit test.

Tyr4210y ago

jjuhl10y ago· 1 in thread

The same experiment: https://m.youtube.com/watch?v=vKA4w2O61Xo

zeidrich10y ago

What's funny is that I felt smug and self satisfied for getting the correct answer even though I watched that video when it came out.

ernsheong10y ago· 1 in thread

We should all be doing good on this one if we have been practicing making our test cases are red first before turning them green :D

lugg10y ago

Looking at my result there, seems to be something I need to start doing. TIL :P

To be fair, I did check 3/6/12 - just to make sure it was double and not powers of two. Guess that's the articles point though isn't it!

m4r71n10y ago

Derek Muller of Veritasium has done this test over a year ago on people in one of his videos: https://www.youtube.com/watch?v=vKA4w2O61Xo

aaronbrethorst10y ago

More or less related: when I'm looking for constructive criticism from someone, I'll ask them "what do you dislike about this?" or "what's wrong with this?" instead of "what do you think?"

I tend to get much more interesting and useful feedback this way.

MarkMc10y ago

1. People that think having a gun in the house makes it safer will not try to design an experiment designed to demonstrate the opposite.

2. People who think organic food is better for you than regular food will not try to look for evidence that the two types of foods are equally healthy.

3. An Israeli who believes the area where he lives was uninhabited before 1948 is not going to think about what kind of evidence would contradict that belief.

[1] http://www.paulgraham.com/quo.html

shmageggy10y ago

harryh10y ago

Reminds me of the game mastermind which I loved playing as a kid.

Here's a javscript version: http://www.archimedes-lab.org/mastermind.html

moo10y ago

sambe10y ago

cristianpascu10y ago

I went with the doubles.

mc80810y ago

This looked like a puzzle I had seen before, so I assumed this was the case (testing a few sequences just to verify) and turned out right.

Gravityloss10y ago

Anyone played Monkey Island 2?

There's a puzzle with a doorman, there's a few distracting clues where the answer is actually very simple.

It doesn't offer confirmation bias though, and it took some time to figure it out.

I consider it a very similar test.

So one could actually construct such a test without the confirmation bias part, and then look at how long it takes for people to realize the simple model.

js210y ago

For a more extensive version of this article, see "Mistakes Were Made (But Not by Me)":

http://www.amazon.com/Mistakes-Were-Made-But-Not/dp/14915141...

harryjo10y ago

Veritasium covered this classic puzzle / bias test nicely, last year. https://www.youtube.com/watch?v=vKA4w2O61Xo

nsxwolf10y ago

So, am I supposed to feel bad if I assumed it was some tricky, hard to figure out function? It just reminded me of those questions on the ACT or whatever and I froze up and got frustrated.

Am I a dim bulb?

pbreit10y ago

I guess I'm the opposite: I got more No's than Yes's. My immediate sense was to find No's. I guess that's why I like 1 star reviews.

sushirain10y ago

I predict that if the rule was narrower, like "exponential", much more guesses would have yielded No's.

Lorenzo4510y ago

Wish I could have done this with no previous knowledge of the puzzle, I knew it right away because I saw the exact same problem in a Veritasium video.

yellowapple10y ago

In other words, first impressions really are important.

mordrax10y ago

They don’t want to hear the answer “no.” In fact, it may not occur to them to ask a question that may yield a no.

So the author's obviously never heard of sanity checking, in fact that's the second thing that I always do once I confirm a solution is to confirm it's not a fallacy.

Having said that, my solutions were -10 -20 -40 -10 -8 -4 1024 1026 1030

and I said it was +2, +4 and got it wrong!

__z10y ago

Veritasium video on the same thing

https://youtu.be/vKA4w2O61Xo

nerdo10y ago

Gabriel_Martin10y ago

My process was [1, 2, 3], and then I guessed each number is greater than the last.

ljk10y ago

did anyone else only try even numbers? guess "increasing even numbers"

pretty interesting how in-the-box my thought process was

SZJX10y ago

Umm... Not sure what's so special about it. Isn't that what all programmers and science people do all day?

drhdylan8810y ago

Veritasium, a pretty interesting youtube channel, posted a video on this experiment a while back. I found the discussion afterwards to be more thought-provoking than this article.

https://www.youtube.com/watch?v=vKA4w2O61Xo

decisiveness10y ago

It's ironic that these facts are not mentioned considering the article is about confirmation bias.

progmanal10y ago

The attached reading material is interesting, but this question is too similar to problems where you guess the next one in the sequence and none are missing.

A rule where the numbers are increasing does not explain why 3 or 5 or 6 is missing from the sequence in that version of the question that is much more common.

pmelendez10y ago

I failed it again... Every time I take a test like this I ended being fooled by the confirmation bias.

theVirginian10y ago

gesman10y ago

After total of 13 answers including 6 "no", I guessed it right :)

arikrak10y ago

the questions is somewhat ambiguous. it would be more interesting to see what happens of people really understood the question and if they had a real motivation to get it correct.

Kluny10y ago

Decimals and negative numbers still get you a "yes" as long as you obey the rule. However, the button to submit your answer didn't work for me.

bluker10y ago

My first assumption was x,2x,2(2x) - then I tested 0,0,0 and got a NO which disproved my first assumption... the answer is obvious after.

Did anyone else test 0,0,0?

adamc10y ago

I got it, but it took a bunch of guesses before I had the pattern. It's a good example of how our preconceptions shape our answers.

0xdeadbeefbabe10y ago

Confirmation bias affects Corporate America and Government Policy, but not science? Talk about confirmation bias :)

jessaustin10y ago

Have large numbers of HN people really not seen this riddle before? Maybe I read weird things.

elwell10y ago

After programming daily for more than a decade, I question my assumptions annoyingly often.

ammaar10y ago

for anyone who hasn't seen it Veritasium did a video on it.

https://www.youtube.com/watch?v=vKA4w2O61Xo

vincston10y ago

Sadly ive already seen Veritasiums test. so i was biased.

PSeitz10y ago

Nice game, got it right after 7 yes and 7 no

_lce010y ago

I thought..

    F(n + 1) = n * 2

limeyx10y ago

Got it right with 7 nos.

flint10y ago

Yup - Dick Cheney!

mastre_10y ago

-10, 0, 0.01

elektromekatron10y ago

I liked this puzzle. It got me. I am thankful for it.

j / k navigate · click thread line to collapse