The result summary is visible here: https://docs.google.com/forms/d/17e5BIL0lH8OHsGj89Zdtdl8GeCV...
The raw answers are visible here: https://docs.google.com/spreadsheets/d/1ZxR2_eOUtNLXwgKfLO1J...
Seriously, though, it seems a bit of leap from the existence of confirmation bias to explaining away the public outpourings of US Politicians about their financial crises and foreign policy disasters - in the absence of better data as to just why the given statements were made ascribing this to confirmation bias seems itself open to accusations of confirmation bias! :)
This doesn't mean I always use these—at the very least, I have to explicitly jump into "problem solving" mode—but it means they can be useful.
It's still a meaningful difference, and could very well apply to lots of things beyond this kind of puzzle.
It's probably not "fair" to say I got it in zero... but I did. :)
Now that the NYT has done it, this puzzle has probably attained enough popularity now that you really ought to change it up a bit now if you're going to run it yourself. Granted, the space of hypotheses as simple as "increasing/decreasing" is pretty small, but your ability to fool people with the first sample run is almost unbounded, so that helps.
I suspect that the basic idea behind it is about right (people who insist on failures before committing to a theory will probably do "better"). But it seems to me that this test will be best at selecting people who've seen it before and can pretend they didn't (or even remember to ask negative questions when someone asks you to guess three numbers to get the job).
Right? What answer did you give?
Yes, double the previous number
Yes, Each number much larger than the previous
Yes, sequence must always increase by 1
Yes, Powers of 2
Yes, h = 2n, i = 2(n+1), j = 2(n+2) . n is an integer
Yes, Powers of 2
As a result we can't really rely on overall accuracy, but we can break it out by yes/no to account for the selection bias to get a profile for how a HN correct and incorrect differ.
How many "I know the answer, but can I find a flaw in their code" questions did you ask?
Personally, I spent most of my answers playing with the inputs. The form was happy to report that "1e1, 15, 0x10" was a valid sequence. :D
FWIW, I've seen this kind of game before and I was expecting it to be something simple.
When I clicked "I think I know it", nothing happened. I don't want to click their "I don't want to play; just tell me the answer". But it seems like the right answer. I can't answer your form question if it is the right anwer, since I haven't clicked on their link and don't know for a fact whether it is or not.
Although I used the wrong term, it's strictly increasing.
If anyone's tallying
> Remarkably, 78 percent of people who have played this game so far have guessed the answer without first hearing a single no.
Some of those 78% probably got it right, and some of the remainder would have got it wrong.
I don't specifically recall seeing one, but it is likely that I have.
3 9 27 yes (is it exponential series?)
4 16 64 yes (is it only odd numbers?)
5 7 9 yes (is it any numbers of the same parity?)
6 7 8 yes (is it any set of increasing numbers?)
6 7 6 no (just to confirm that it's x<y<z, and not something like x<=y<=z)8 4 2 (no) 1 2 3 (yes) 1 1 1 (no) 1 100 123 (yes) 1.0 1.1 1.2 (yes)
answer: incrementing numbers
An incredibly stupid example is that the rule could be "yes for strictly increasing, OR if one of the numbers is -18273192783127897981." You'll never know.
I understand this is contrived, especially when the test subject doesn't know. But if you do realize this while doing it, it makes the test a little frustrating..
EDIT: I see people are making a connection with unit testing, and the irony is poetic. This is precisely the problem Dijkstra was talking about when he said that "Testing shows the presence, not the absence of bugs."
From the article
... It is perfectly consistent with your previous use of
'plus' that you actually meant it to mean the 'quus'
function, ...
That may be true, but only if you assume I'm not referring to the plus derived from the axioms of principa mathematica. I am. Is it then the question that when I refer to the principa mathematica that I'm actually referring to principa quus? If I then describe axiom 1 can you not know my words are referring to axiom quus?It seems the only power of this assertion is that language provides no absolute common ground.
Am I understanding this correctly?
The idea you're getting at is that no number of confirming observations can verify a universal generalization -- the reason why we continue to call generally accepted "truths" theories. But we can increase our certainty to the greatest degree possible by trying to test our hypothesis to the greatest degree we can.
For example, you might have
2,3...
And the rest of the sequence might look like either
2,3,4,5,6...
or
2,3,5,8,13...
or
2,3,5,7,11...
or even
2,3,5,10,20...
Clearly, on some level those sequences are all much less complicated than one defined as "The first term is 2, the second term is 3, the third term is 919243, the fourth term is -1234..."
It's unclear to me how one might rank them in complexity, though. The maximum amount of memory necessary to get an arbitrary nth term? The number of operations necessary to get to the next term?
Another interesting question to me: if there is an ordering of ways to generate a sequence of numbers, given the first couple terms of a sequence of numbers, what are the simplest N ways to generate the full sequence?
I'm not totally sure how the math behind it works (maybe it's similar to Eureqa?) but the results speak for themselves and are rather incredible.
For example, if I run FindSequenceFunction on this input:
{0, 1, 3, 8, 19, 43, 94, 201, 423, 880}
Which is the number of 0,1 sequences of length n that contain two adjacent 1sMathematica produces the result:
1/10 (5 2^(1 + x) - 5 (1/2 - Sqrt[5]/2)^x +
3 Sqrt[5] (1/2 - Sqrt[5]/2)^x - 5 (1/2 + Sqrt[5]/2)^x -
3 Sqrt[5] (1/2 + Sqrt[5]/2)^x)
Which, astonishingly, is correct for all the values I've tried. So apparently Mathematica understands more about this sequence than I do, and I know its definition.Another party trick is to use the input
{-(1/6), 2/15, -(13/140), 23/315, -(83/1386), 305/6006, -(2269/
51480), 4259/109395, -(16103/461890), 30616/969969}
Which is the integral x^n (1 - 2 x)^n for x from 0 to 1, for n = 0..<10. Here it seems 10 numbers are required. This yields the solution (2^(-2 - 3 x)
x! (Sqrt[\[Pi]] (1 + x)! +
3 (-1)^x 2^(
2 + 3 x) (1/2 (1 + 2 x))! Hypergeometric2F1[1, 3/2 + x,
2 + x, -8]))/((1/2 + x)! (1 + x)!)
Which as far as I can tell, is a closed-form solution (!) to the integral. A solution it worked out to an integral it has never seen, but only the first 10 elements in the sequence.So it's safe to say Mathematica knows a lot more about math than I do.
Choose a programming language. Choose a sequence prefix (in your example: 2, 3). Then consider all the programs that accept n as input and output a sequence of n numbers, such that the first numbers are always 2, 3. Now take the shortest of those programs. The sequence it produces is the "simplest".
If this sounds tedious to code, you could easily outsource via Odesk or something.
[1] http://www.amazon.com/Fluid-Concepts-And-Creative-Analogies/...
Generating functions provide a general framework for describing sequences, solving recurrence relations, etc.
What you want to forbid is not so much mentioning specific numbers, but you want to only allow rules that have certain symmetries. Eg you can require tranlation invariance
rule(x, y, z) = rule(x+offset, y+offset, z+offset)
to restrict the set of rules.Try the sequence: 0,0,0. It would give "yes" for (x,2x,4x), but the actual rule gives it "no".
It is an analog for how science works. When it comes to a natural phenomenon, humans can come up with multiple explanations that fit a given set of observations, but presumably (I mean, this is a basic tenet of science) nature only works in one consistent way.
Thus, the importance of a falsifying test. You form a hypothesis based on the initial observations (in this case, the number sequence 2, 4, 8), and then you propose a test that could falsify your hypothesis.
The trick is that a hypothesis can fail in several ways. It can be outright wrong, like saying "the rule is that the numbers decrease from left to right." That's obviously just wrong.
But it can also be too specific, like saying "the rule is that the exponent increments by one with each step to the right." That matches the given evidence, and tests with other base numbers will succeed too. But it's over-fitting.
Here's a concrete example: a man wearing a red shirt drops a weight and measures gravitational acceleration as 9.8 m/s^2. So he formulates a hypothesis that gravity always produces an acceleration of 9.8 m/s^2 in the presence of a red shirt.
And if he always wears a red shirt, and always tests gravity on the surface of the Earth, he'll always find supporting evidence for that hypothesis.
But of course we know that gravitational acceleration varies depending on mass and distance, and that it's the same no matter what color your shirt is. But he would only find that out if he varied his experiment beyond what his hypothesis predicts.
(1x, 2x, 4x), as indicated in the video below, is not sufficient. It represents a subset of the values that are valid.
Think of it this way. When asked to write a unit test, do you only test the positive outcomes? No, you test to make sure the failures are as you expect as well. Otherwise, you are likely to have what you think is a failure end up as a success.
The idea isn't to come up with tuples that satisfy the predicate. The idea is to figure out what the predicate is in the first place.
Also, you're not in any way, shape, or form reduced to random guessing. If you have an idea of what the rule might be, you build a counterexample. There's a ton of value in _trying_ to get a no but getting a yes instead.
But, the fact that there is one correct answer is not really an assumption that the puzzle makes, it is information we have been given:
> We've chosen a rule that some sequences of three numbers obey -- and some do not.
This simply means that the solution is realisable. No matter how many ways (in English) we have to describe that solution it is still the same solution.
1, 3, 5, 7
what comes next? 9 right? Or is the sequence generated by 2n − 1 + (n − 1)(n − 2)(n − 3)(n − 4) for n ∈ N. Then we've got 33.
"among all hypotheses consistent with the observations, the simplest is the most likely"
33 is correct, but it's less likely to be the basis for the generation of the sequence.
Your answer of (x, 2x, 4x) proves the puzzle illustrates the confirmation bias, at least in your case.
Does the unit test that confirms your function returns the expected result given one set of arguments prove it correct?
I think this logic is a bit wonky - if there are sequences that get a "yes", but don't match (x, 2x, 4x) then the correct rule cannot be (x, 2x, 4x), can it?
"(x, 2x, 4x) gives you a "yes" every time, therefore it is a correct answer, at least as automatically checkable."
Well, no: you can type 1, 2, 3 into the system and it will tell you "yes", but your rule says that it should tell you "no".
It is crucial to the definition of "correct answer" here that your rule should not just say "yes" only for tuples which the widget also says yes to, but also your rule should say "no" only for tuples which the widget also says no to. That is what the puzzle means when it's asking, "can you guess the rule that we've created?"
This makes it very, very different from what I think you're thinking about, which is situations where someone tells you, "what is the next number in this sequence? 4, 7, 13, 25, ...?" where technically there are an infinite number of rules which will generate those 4 numbers first and an arbitrary number afterwards. Technically one of them is "simplest" in the sense that it can be expressed in 7 symbols, but in general it's a complicated problem and there is no best solution.
"To find a 'no' you're reduced to random guessing. That's not a puzzle, that's crap."
In many ways it still is a puzzle but the space that it lives in is richer. If you think about typical "puzzles" they're things like: "here's a grid with some spaces filled in with numbers,
2 . . 2 . 2 .
. . . . . . .
1 . 3 . . 2 .
. . . . . . .
3 . . . 2 . 3
. . 2 . . . .
. . . . . . .
Each number is a block in a block wall. We want you to turn this into a block maze so that each 'block wall' (set of blocks connected by adjacency) contains exactly one numbered-block whose number says how many total blocks are in the wall. Furthermore the path (non-block space) of the block maze should be connected and should not contain any 'rooms' -- that is, any 2x2 or larger segments of open space."This 7x7 grid has 10 spaces which are known to be blocks and exactly 12 more blocks scattered in the remaining 39 spaces, so just by those factors alone we're searching only (39 choose 12) ~= 3.91 billion possibilities; we can also use a quick heuristic to identify 6 places which must be "space" to break apart adjacent numbered walls, removing 91% of that search space.
The puzzles, "I have a set of integers where inclusion in the set is governed by a short rule, you can ask me any integer and I will tell you whether it is in my set", by contrast, have an infinite search space. This means that any solution is going to be more interesting, as will the means for checking that solution's validity. You could require, for instance, a Haskell expression of 140 characters or fewer which turns a nonnegative Int named `n` into a Bool, to be judged as "valid" or "invalid" if it properly filters `[1..10000000] :: [Int]`. You could even give the first 100 numbers in the set, e.g.:
ghci> take 100 $ filter trueFn [0..]
[2,5,8,9,13,14,18,19,20,25,26,27,32,33,34,35,41,42,43,44,50,51,52,53,54,61,62,63,64,65,
72,73,74,75,76,77,85,86,87,88,89,90,98,99,100,101,102,103,104,113,114,115,116,117,118,
119,128,129,130,131,132,133,134,135,145,146,147,148,149,150,151,152,162,163,164,165,
166,167,168,169,170,181,182,183,184,185,186,187,188,189,200,201,202,203,204,205,206,
207,208,209]
In this case that's pretty much enough to see the general pattern; the verification covers 10 million bits while the 140-character limit probably limits your search space to 1000 bits or so, so it's going to be hard to get an "incorrect" answer which agrees on that subspace of the whole.Not true. What if x is negative?
9007199254740990, 9007199254740991, 9007199254740992
but "No" to 9007199254740991, 9007199254740992, 9007199254740993
Presumably this is due to how Javascript handles integers, i.e. it uses the integer part of a float64, to wit > parseInt('9007199254740992')
9007199254740992
> parseInt('9007199254740993')
9007199254740992
Edit: I think this is the code that actually reads the numbers the user enters, see [0] function l(){
var a=h.exec(m[1]),f=null,g=null,n=null;
return a&&(null!==a[1]&&a[1]&&(f=parseInt(a[1],10)),
null!==a[2]&&a[2]&&(g=parseInt(a[2],10)),
null!==a[3]&&a[3]&&(n=parseInt(a[3],10))),
new e(f,g,n)
}
Edit(2): Actually, I'm not so sure that's the correct code at all. They NYT game is capable of parsing floats correctly (e.g. it accepts 1.1, 1.2, 1.3 as a "Yes") so it's not just using parseInt.[0] http://a1.nyt.com/assets/interactive/20150612-151638/js/foun...
on line 588 is the comparison
var rightWrong = (inputData[0] < inputData[1]) & (inputData[1] < inputData[2]) ? right : wrong;
With a variable declaration on line 545 being var inputData = [NaN, NaN, NaN],
revealed = false,
right = "<p class = 'g-answer g-yes'>Yes!</p>",
wrong = "<p class = 'g-answer g-no'>No.</p>";
And `inputData` is changed on text input on line 662 $("#g-input input").each(function(i) {
var val = $(this).val();
inputData[i] = $.isNumeric(val) ? Number(val) : NaN;
});
It uses the `Number()` function to convert from the input text to an actual number, so it can convert any number format defined by ES5[1] or ES6[2]. So in ES6 you can use binary (0b, 0B) and octal (0o, 0O) formatting along with exponential (1e-2) and hex (0x, 0X). Binary and octal works for me currently on Chrome 43 OS X.[0] http://graphics8.nytimes.com/newsgraphics/2015/06/16/puzzle/...
[1] http://www.ecma-international.org/ecma-262/5.1/#sec-9.3.1
[2] http://www.ecma-international.org/ecma-262/6.0/#sec-7.1.3.1
* The number may have optional sign and digits after a decimal point, and may use exponential notation. Example: (-1.2e1, .0E+0, 1.e-3) => "Yes". As seen in the second and third number here, there may be no digits before or after the decimal point, but both at the same time (i.e., ".0" and "0." parse but not ".").
* If the number begins with "0x" or "0X" it is read in hexadecimal, where the digits a-f may be in either case. Hexadecimal notation must not be accompanied by decimal point, sign, or exponential notation.
* No whitespace is permitted within the numeral, even between the sign and the digits as in "+ 11", but both tabs and spaces may be used before and after the numeral without changing its value. In particular, by using a input of the form "1 " it is possible to make rectangular display empty while still parsing it as number. Note that pressing "Check" leads to the numbers being displayed in the rectangle in exactly the same way as they were displayed in the text box, which may depend on the position of the cursor in the text box.
ETA: Also, you mentioned rounding, but there is also exponent overflow and underflow. The application refuses to parse numbers greater or equal to 1.7976932e308. It parses arbitrary negative exponents fine, but it does not recognize that 1e-324 is greater than 0.
That also means that for(i=0;i<j;++i){} doesn't necessarily terminate for an arbitrary j smaller than infinity, which I find hilarious.
for(i = j; i < j + 2; i++) {}
With your example it's easy to lose the distinction between "would eventually terminate if you had a fast computer and a lot of time" and "never terminates even in theory." Here, it definitely looks like it should always run two iterations to matter what the numbers are (as long as they're finite), but it doesn't.Either way, though, it is definitely hilarious!
Nedit: One of the other responses nailed it. IEEE standards on 64 bit fp ops
In addition to getting it right did you use an exhaustive set of tests?
That said, I did make sure to test all the edge cases I could think of. I was actually going to guess (n^1, n^2, n^3) at first, until it failed for (1, 1, 1).
10 "yes" tests, 6 "no" tests.
var probablyWrong = ["doubl", "expon", "multipl", "^", "", "power", "two", "2", "twice", "as big", "nth", "rais"];
var seemsRight = ["larger", "increas", "greater", "small", "less", "big", ">", "<", "go up", "ascending"], weaselWords = ['but ', 'not ', 'odd'];
Been expecting something more interesting than that
function judgeSentence(sentence, numNo) {
sentence = sentence.toLowerCase();
// no nos -> wrong.
if (numNo === 0 || sentence == "") {
return false;
}
// if have any fancy words -> wrong.
var probablyWrong = ["doubl", "expon", "multipl", "^", "**", "power", "two", "2", "twice", "as big", "nth", "rais"];
if (hasAny(probablyWrong, sentence)) {
return false;
}
// if you have the right words, and no buts.
var seemsRight = ["larger", "increas", "greater", "small", "less", "big", ">", "<", "go up", "ascending"],
weaselWords = ['but ', 'not ', 'odd'];
if (hasAny(seemsRight, sentence) & !hasAny(weaselWords, sentence)) {
return true;
}
// // no nouns, verbs or adjectives in your sentence -> wrong.
// var s = nlp.pos(sentence).sentences[0],
// verbs = s.verbs().map(getWords),
// nouns = s.nouns().map(getWords),
// adj = s.adjectives().map(getWords),
// numWords = verbs.length + nouns.length + adj.length;
// if (numWords === 0) {
// return false;
// }
return false;
}What is surprising is how basically 15 lines of python implementing these rules, invokes a very real emotional response in a lot of people :-)
I named it "Wilhelm".
[0] inspired by this comic http://gunshowcomic.com/513
The sequence is not monotonically increasing.
It only means non-decreasing in the context of a monotonic function, where the definition, I believe, is that the derivative of the function is never <0.
My mistake was to assume that choosing the first number uniquely defines the next ones in the sequence. Since, you know, like all the sequence puzzles I've seen before worked like that, and I didn't read it rigorously enough. Oh, by the way, the doubling thing is wrong if you use negative numbers (wrong as in it gives false positives, instead of just false negatives). But the problem definition doesn't even tell what set of numbers we're operating on.
Finding the rule the sequences obey is impossible since it could be that all cases follow a simple rule except for one triplet which you're unlikely to find. It's trivially easy to fool the user into finding a wrong rule.
Good article - and a humbling experience. :)
Most people will make assumptions about what's required based on previous experience. And hardly anyone will have previous experience where a question formatted in this way isn't asking you to find a series rule.
That's not quite the same thing as confirmation bias. With a bias you're just as likely to discount significant evidence as you are to mismodel the problem space.
While that's true, the real thing being tested is the user's readiness to prove themselves wrong.
But in this case -- which is the main point of the article -- it was actually a trivial rule. No tricks, no special cases. The real purpose wasn't for you to find the actual rule, but to learn about your biases. The only trick here lies in the human mind, and its tendency to validate patterns (and claim an early victory) instead of trying to refute them.
My answer: The numbers increase from left to right
Application response: As you seem to have guessed, the answer was extremely basic
Playing the confirmation bias game on "playthroughs of the confirmation bias game": it looks like you get that message if you have a 'no' answer, and the word 'increasing' in your guess. ("Increasing by the same amount" is still accepted.) I wouldn't be surprised if other words also count.
Edit: the words '<', '>', 'increas', 'big' and 'larger' seem to count. Looks like they're accepted as substrings, not just words - so 'increase' and 'increasing' are accepted. I could look for a long time, so I'm giving up now.
In my logs most people seem to have gotten it right, though presumably that's because it was linked from LessWrong.
For an actually-fun game like this, see https://en.wikipedia.org/wiki/Zendo_%28game%29.
No surprise there, given that this experiment is discussed in Sequences, the link to the post present in the article you link to :).
I guessed correctly with only 2 nos. Since there is no penalty for guessing incorrectly here, I felt safe enough with my theory. I might have checked for more nos, if I had to announce my theory publicly (Twitter, comment, etc). However, I also knew about Confirmation Bias beforehand.
You can enter all kinds of crazy random sequences which only have The Rule in common and get a yes, which seemed to be enough assurance. If you're trying to get it to say "no" but failing, is that still confirmation bias? Doesn't sound like it.
The correct approach for any data modeling problem is to think in terms of entropy. Each subsequent approach should minimize entropy, until you reach diminishing returns.
The sequence is not monotonically increasing. It's strictly increasing. If you test [1, 1, 2] or [1, 1, 1] or [1, 2, 2], you'll get "No" answers even though those sequences are monotonically increasing.
Maybe I'm just being pedantic here, but last I checked -6 was larger than -4.
Edit: changed for grammar and to express more clearly what I think.
I was able to solve the puzzle without testing any numbers at all. Which really skews the relevance of "only nine percent of people saw three 'no's before answering."
Perhaps that's an additional factor - not exactly confirmation bias, but not unrelated.
I was trapped by this and guessed it was exponential series n^1,n^2 etc for n starting at greater than 2. While technically true this was not the rule they had in mind.
Were you testing a pre-supposed hypothesis that confirmed itself?
Let the first number be x. If x is 0, then the second number is 1. Otherwise, the second number is two times the absolute value of x. The third number is 2 times the value of the second number.
So, after all, I think I fall in the trap of confirmation bias that the sequence must consist of numbers only.
>We’ve chosen a rule that some sequences of three numbers obey
>Now it’s your turn. Enter a number sequence in the boxes below, and we’ll tell you whether it satisfies the rule or not.
0000000 000 0000000
111111111 11111111100 000 111111111
00000 111111111111111111 00000 000000
000 1111111111111111111111111100000 000
000 1111 1111111111111111100 000
000 11 0 1111111100 000
000 1 00 1 000
000 00 00 1 000
000 000 00000 1 000
00000 0000 00000000 1 00000
11111 000 00 000000 000 11111
00000 0000 000000 00000 00000
000 10000 000000 000 0000
000 00000 000000 1 000
000 000000 10000 1 0 000
000 1000000 00 1 00 000
000 1111111 1 0000 000
000 1111111100 000000 000
0000 111111111111111110000000 0000
111111111 111111111111100000 111111111
0000000 00000000 0000000
NYTimes.com: All the code that's fit to printf()
We're hiring: developers.nytimes.com/careersAfter that, I went with doubling and it worked 3 times with various sizes of number, so I went with that.
Double bonus points if you can guess what I was testing with the first sequence (which the given numbers do satisfy).
http://int.nyt.com/newsgraphics/2015/2015-06-26-rule-guessin...
{"count": 27723, "numNo": 0, "share": 0.7716051100782098}
{"count": 2921, "numNo": 1, "share": 0.08129922903504133}
{"count": 1883, "numNo": 2, "share": 0.05240891758746417}
{"count": 1285, "numNo": 3, "share": 0.035764980934621056}
{"count": 880, "numNo": 4, "share": 0.024492749589468118}
{"count": 525, "numNo": 5, "share": 0.014612151743716774}
{"count": 288, "numNo": 6, "share": 0.008015808956553202}
{"count": 156, "numNo": 7, "share": 0.004341896518132985}
{"count": 108, "numNo": 8, "share": 0.0030059283587074506}
{"count": 56, "numNo": 9, "share": 0.0015586295193297892}
{"count": 45, "numNo": 10, "share": 0.001252470149461438}
{"count": 11, "numNo": 11, "share": 0.00030615936986835146}
{"count": 22, "numNo": 12, "share": 0.0006123187397367029}
{"count": 8, "numNo": 13, "share": 0.00022266135990425562}
{"count": 4, "numNo": 14, "share": 0.00011133067995212781}
{"count": 5, "numNo": 15, "share": 0.00013916334994015976}
{"count": 3, "numNo": 17, "share": 0.00008349800996409585}
{"count": 1, "numNo": 18, "share": 0.000027832669988031953}
{"count": 3, "numNo": 20, "share": 0.00008349800996409585}
{"count": 1, "numNo": 21, "share": 0.000027832669988031953}
{"count": 2, "numNo": null, "share": 0.000055665339976063906}This is the standard example of to right way to test a hypothesis/theory and the power of Confirmation Bias, testing sequences that are invalid under the theory instead of testing what you think is correct.
Young 20s entrepreneur vs. early 50s entrepreneur. Without knowing anything about either person, which startup is more likely to succeed? Even if you have the business plans for both, and you meet both - which one are you going to be more skeptical about as you evaluate which one of them gets funding?
Additionally, this setting is probably too close to usual situations you get in school where there is little to no interaction and negative answers from the teacher are seen as failures by students. (Speaking about education in my country only.)
To be fair, I did check 3/6/12 - just to make sure it was double and not powers of two. Guess that's the articles point though isn't it!
I've given the test to various people since then and never once came across someone who'd guess it right away, or within a short period of time. The breaking time came usually after a few minutes when they gave up and started throwing out random numbers that coincidentally did not meet the rule. Once you hit the first "No", it took a very short time to figure out the rule for almost everyone.
I tend to get much more interesting and useful feedback this way.
1. People that think having a gun in the house makes it safer will not try to design an experiment designed to demonstrate the opposite.
2. People who think organic food is better for you than regular food will not try to look for evidence that the two types of foods are equally healthy.
3. An Israeli who believes the area where he lives was uninhabited before 1948 is not going to think about what kind of evidence would contradict that belief.
I'm not saying the views above are incorrect. It's just that we are all guilty of falling in love with our beliefs when they should be mere acquaintances. Hence the quote, "People don't change their minds. They die, and are replaced by people with different opinions." [1]
Here's a javscript version: http://www.archimedes-lab.org/mastermind.html
However, the test simply required a possible solution. There are plenty solutions and it's absurd to think they have the simplest one. The simplicity of the rule is subjective, in that is evaluated differently by different people. The famous 'simple but no simpler' is relevant here. As long as we were not told to look for the simplest solution, ALL solutions are equally probable. That being the case, I started to with the first solution that popped into my mind. I sticked with that because of my psychological state. Some searched for other solutions.
I don't think that getting an YES was the main driving force. Of course it feels good to get an yes. This is fundamental in human relations. But it's not the whole story. People do not disbelief globar warming because they want to get an YES. The reason is much deeper. Just as many, so many people go to the wave of climate change because it's fashionable, it makes them feel good, accepted , part of the mainstream. Being a climate change denier is being a disident this day (not my flavor of disidentism), and being a disident is not for everyone. And perhaps disidents picking their fight have complicated reasons for doing so.
I went with the doubles.
I guess the conclusion is that if a problem looks suspiciously like one you've encountered before, there's a good chance that they are the same or similar. The world is self-organizing, not completely random where you must obsessively second-guess your accumulated wisdom.
There's a puzzle with a doorman, there's a few distracting clues where the answer is actually very simple.
It doesn't offer confirmation bias though, and it took some time to figure it out.
I consider it a very similar test.
So one could actually construct such a test without the confirmation bias part, and then look at how long it takes for people to realize the simple model.
http://www.amazon.com/Mistakes-Were-Made-But-Not/dp/14915141...
Am I a dim bulb?
I predict that if the rule was narrower, like "exponential", much more guesses would have yielded No's.
In other words, first impressions really are important.
So the author's obviously never heard of sanity checking, in fact that's the second thing that I always do once I confirm a solution is to confirm it's not a fallacy.
Having said that, my solutions were -10 -20 -40 -10 -8 -4 1024 1026 1030
and I said it was +2, +4 and got it wrong!
It was totally a possibility that they wouldn't apply the simplest rule, but I felt it highly unlikely. This "rule" is a meme of the rationality community, especially given the example, so it seemed pretty likely that it was sequential numbers.
pretty interesting how in-the-box my thought process was
It's ironic that these facts are not mentioned considering the article is about confirmation bias.
A rule where the numbers are increasing does not explain why 3 or 5 or 6 is missing from the sequence in that version of the question that is much more common.
Did anyone else test 0,0,0?
F(n + 1) = n * 2