Jump to content

Talk:Monty Hall problem/Arguments/Archive 8

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by MiszaBot I (talk | contribs) at 11:29, 7 May 2011 (Archiving 2 thread(s) from Talk:Monty Hall problem/Arguments.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
Archive 5Archive 6Archive 7Archive 8Archive 9Archive 10Archive 15

Gerhard's question

Rick and Glopk, one hypothetical question, I'd like to know what you think: In case some remark would have been added to Marylin's answer or even to the well-known statement of the problem that has been published in Marilyn vos Savant's "Ask Marilyn" column in Parade magazine in 1990 (as per Whitaker/vos Savant 1990):
"Suppose you're on a game show, and you're given the choice of three doors: Behind one door is a car; behind the others, goats. You pick a door, say No. 1, and the host, who knows what's behind the doors, opens another door, say No. 3, which has a goat. He then says to you, 'Do you want to pick door No. 2?' Is it to your advantage to switch your choice?"
added with the following or a similar remark (for clarification that this was what she meant to be the basis for her answer):
"Supposed/given the car is initially placed uniformly at random and the host chooses a goat door uniformly at random, if he has the choice from two goat-hiding doors, and he always offers to switch ... '
In case this clarification had originally been added by vos Savant, and if she had said: "No news, he always can open one goat-hiding door and it does not matter which one if he has the choice between two goats, because he will chose uniformly at random, so you should switch, yes it always will be wise to switch", would conditionalists nevertheless say "her answer is numerically right, but it does not reflect the conditional problem that asks for the probability P(win by switching AND player initially picks door Y AND host opens door Z)"? I would be glad if you could give your comment to this (my hypothetical) question. Thank you! Regards, Gerhardvalentin (talk) 12:21, 5 August 2010 (UTC)

Gerhard, I answer mu. I can read your sentences above (with the hypothetical addition) as a restatement in words of the "Mathematical formulation" section already in the article. In which case it is the "conditional" answer to the K&W statement + uniform initial placement. On the other hand, if you parse MVS's sentence as applying "regardless" of the choice of doors, I would say it is correct, but that it is a stronger statement than the original question asks for. I use the term "regardless" to imply a marginalization, so that MVS's statement would mean , where . So it boils down to our usual point of contention: the "simple" solutions can be formulated correctly, but in the sources they normally aren't. The requirement of complete symmetry is at their core, and it is not an obvious one, so omitting it is a logical mistake that makes their conclusions non sequiturs. And in any case the simple solutions do not answer directly the MHP question, because they compute a marginal probability, whereas the question asks for a conditional. glopk (talk) 15:08, 5 August 2010 (UTC)
I don't actually know whether anyone would have made a big deal out of it or not. As Glopk says the simple solution is still formally answering the wrong question. Since the heart of the paradox is whether there is a difference between P(win by switching AND player picks door 1) and P(win by switching AND player picks door 1 AND host opens door 3), it seems having the answer based only on P(win by switching AND player picks door 1) without saying anything at all about P(win by switching AND player picks door 1 AND host opens door 3) is missing the point.
Rick, you say that Falk is wrong?[1] Quite the contrary. — Assuming an unbiased host — the point is NOT whether there is a difference, but that there is NO difference between P(win by switching AND player picks door 1) and P(win by switching AND player picks door 1 AND host opens door 3). But admittedly this is only a sideshow and belongs to a separate section for people interested in maths, believing in a biased host outside of vos Savant's Monty Hall paradox. — Whereas the "real heart" of the MHP is that the "constant-ratio belief" of more than 90 % of people is the assumption that when the host has ruled out one door of three by opening it, the ratio of the probabilities of the two remaining closed doors should be the same as the ratio of their prior probabilities (before 1/3:1/3 and now 1/2:1/2). This constant-ratio belief, seeing no reason to switch, is obviously not valid since it leads to the wrong answer 1/2 : 1/2, whereas the correct answer (assuming an unbiased host) is 1/3 : 2/3. And even if the host (outside of vos Savant's Monty Hall paradox) is extremely biased, it would always be wise for the contestant to switch, and no maths has ever been able to change that. Gerhardvalentin (talk) 14:43, 14 August 2010 (UTC)
Gerhardvalentin, this would make an excellent topic to add to the mediation. It has been the subject of edit warring, it's interpreted in the article in a perversely incorrect manner, and you have a great reference & argument. Please consider adding this to the "Opposition's viewpoint" section on the Mediation page. Thanks. Glkanter (talk) 16:55, 14 August 2010 (UTC)
Thank you, Glkanter, will be going to try to be accepted, but I guess the strong belief that the question was not "Should the contestant stick or switch? (See Shaughnessy & Dick, 1991)", but clinging to answer completely different questions that some participants made their own, as "What's the exact Bayesien probability to win by switching, under some assumed telltale bias", ignoring vos Savants basis that the overall probability applies to each and any game if no additional info/hint about the actual current constellation and distribution of the objects is given by showing that there only can be ONE car, without telltale bias evidence, will be ongoing to prevail. Any treacherous "bias-hint" automatically means the simultaneous opening of more than only one door, presenting a new and closer hint and a new condition, then. We can tell about strange variants and add Bayes, but that belongs to a completely different and very clearly secreted topic. Regards, Gerhardvalentin (talk) 18:19, 14 August 2010 (UTC)
Well, that's what we're trying to highlight in the mediation. My portion calls out the insistence that every mention of simple solutions is accompanied by the nonsense you describe above. Martin calls out the insistence that only conditional solutions are true. 2 of those guys have latched onto 'it's not fair to put the conditional solution last among equals'. Your portion can highlight the bunk that people are confused by something other than the 50/50, that you just described. The arguments are clear, as is the outcome. Your contribution will help to ensure that all those reliably sourced contrivances are dealt with in the manner appropriate for a Wikipedia article, even after the mediation has ended. Glkanter (talk) 18:55, 14 August 2010 (UTC)
IMO, the basic reason the answer is counterintuitive is because the problem nearly forces you to think about the conditional case where there are two closed doors and one open door (and the host opening a door affects the two remaining closed doors unevenly). This is fundamentally conditional because the doors are distinguishable. It's possible to posit an unconditional version as an urn problem (I've described this before), which might be as counterintuitive (or might not). If I were an experimental psychologist looking to publish a paper (I'm not), I might actually run an experiment to see if an unconditional version of the problem is as difficult for people to solve correctly. My guess is it would be difficult - but not as difficult. -- Rick Block (talk) 15:56, 5 August 2010 (UTC)
Rick, are you saying Selvin answered the wrong question in his 1st letter? Glkanter (talk) 16:53, 5 August 2010 (UTC)
Yes - and he realized this and corrected himself in his 2nd letter. -- Rick Block (talk) 18:50, 5 August 2010 (UTC)
Rick, do you have a reliable source that says Selvin repudiated his own simple solution? Glkanter (talk) 02:25, 6 August 2010 (UTC)
Repudiate is too strong. He said in his second letter (you've quoted this) that "the basis to my solution is that Monty Hall knows which box contains the keys and when he can open either of two boxes without exposing the keys, he chooses between them at random". This is what forces the conditional and unconditional answers to be the same. His original solution presents the unconditional answer. By acknowledging the assumptions that force these to be the same, he's acknowledging that the solution of interest is the conditional one. He then goes on to present a conditional solution as an "alternate". Seems pretty clear to me that he's asking the conditional question. -- Rick Block (talk) 03:20, 6 August 2010 (UTC)
So your answer is "No, I do not have a source that says Selvin has repudiated his simple solution to the MHP he posed. I have concluded that he did so based on my interpretation of his 2 letters." That's OR. And your whole argument rests on it. Your statement that the simple solutions, of which Selvin's is one, solve the wrong problem is illogical and unsourced, as far as Wikipedia is concerned. Glkanter (talk) 04:25, 6 August 2010 (UTC)
No, Selvin did not. The only published source that offers a change to it's original position is Morgan, as prompted by Martin's letter pointing out their math error. You refuse to acknowledge that they have admitted a mistake, yet claim Selvin has. Remarkable. Glkanter (talk) 20:36, 5 August 2010 (UTC)
No, he gave an alternative solution, he never said that his first solution was wrong. Martin Hogbin (talk) 19:38, 5 August 2010 (UTC)
All this talk of 'mathematically correct' solutions is nonsense. There are many ways to solve most maths problems. For the standard MHP it is perfectly correct to note that the answer to the conditional must, by symmetry, be equal to the answer to the unconditional problem and then proceed to solve the unconditional problem to get the answer to the conditional problem. This sort of thing is done in mathematics all the time. Martin Hogbin (talk) 18:20, 5 August 2010 (UTC)
Martin - this has been said so many times it seems rather pointless to repeat it. Again, for the 4,000th time, we're not talking about solving the conditional problem indirectly as you're suggesting and I don't think anyone here is saying that doing this would not be a valid approach. What we are talking about is presenting an unconditional solution, without noting the problem must be symmetrical and without saying anything about whether the conditional and unconditional answers must be the same - which is what nearly all sources presenting "simple" solutions do. -- Rick Block (talk) 18:50, 5 August 2010 (UTC)
Martin, we can solve some math problems quite well, thank you. On the other hand, you be careful touching upon mathematical correctness and nonsense - given that in less than a week you have authored two fine examples of nonsense on these very pages (see "simple solution solves the problem of calculating P(C=1|H=2 or H=3)" and "P(Ws| I) = P(Ws | I and S=s and H=h and C=c) for all legal values of h,s,and c"). What makes you think that you are qualified to interject in a conversation with Gerhard in the general area of probability theory? Please do not disrupt. glopk (talk) 20:14, 5 August 2010 (UTC)

A symmetry argument is OK?

Let me get this right Rick, you are saying that the argument that the answer to the conditional must, by symmetry, be equal to the answer to the unconditional problem and then proceed to solve the unconditional problem to get the answer to the conditional problem is perfectly correct and that no one here disagrees with it? (Note that I am talking here about the correctness of the approach, not whether we can put it in the article.)

What about the argument that, by symmetry, the door specific goat opened by the host cannot make any difference even in the conditional case? Martin Hogbin (talk) 19:47, 5 August 2010 (UTC)

Are you saying there's some confusion about this? Really? I think Nijdam has said this literally about 100 times. I've said it at least another 100 times. Kmhkmh has said it maybe 50 times. Glopk has said it probably 20 times. Coffee2theorems has said it about 5 times. Please tell me you're not just now understanding this because if it takes 5 different people telling you something a total 200 times before you understand it this is going to take a VERY long time. -- Rick Block (talk) 00:01, 6 August 2010 (UTC)
Maybe I have misunderstood you all. You are confirming that everyone here accepts that by symmetry, the specific goat door opened by the host cannot make any difference to the probability of winning even in the conditional case? Martin Hogbin (talk) 08:22, 6 August 2010 (UTC)
If you show the problem is symmetrical, i.e. that P(win by switching AND player picks door i AND host opens door j) is the same for all i,j, then all of these are the same as P(win by switching). If you show the problem is symmetrical with respect to which door the host opens after the player picks say door 1, i.e. P(win by switching AND player picks door 1 AND host opens door 2) = P(win by switching AND player picks door 1 AND host opens door 3), then these both must be the same as P(win by switching AND player picks door 1). How about if you paraphrase what you think I've been saying?
If it helps, I'll paraphrase what I think you've been saying. Many published solutions effectively say that since the average player has a 2/3 chance of initially selecting a goat and always ends up with a car if switching, every individual player has a 2/3 chance of winning by switching regardless of what door is initially picked and what door the host opens. Because this can be shown to be a true statement if the problem is fully symmetrical (and the host must open a door showing a goat), the article should primarily present this and similar solutions - as "fact" without any further explanation or comment, in particular without saying anything about symmetry or any justification whatsoever for shifting between talking about the initial average 2/3 chance to the chances faced by every individual player such as one given in the problem statement who has picked door 1 and is deciding whether to switch after seeing the host open door 3.-- Rick Block (talk) 14:23, 6 August 2010 (UTC)

You have nearly got one of the reasons that I support showing the simple sources without a health warning. My reasons are:

  1. The reason you give above, bearing in mind that we define the problem to be fully symmetrical in the article, thus all that is missing is a simple, obvious, and intuitive symmetry argument.
  2. It is far from clear that Whitaker actually wanted to know the answer to the conditional problem. Seymann makes this point.
  3. The door numbers did not form part of the original Whitaker question, they were added by vos Savant. Whitaker did not mention specific doors.
  4. Hundreds of sources, including academic 'psychology' sources give a simple solution. To pick just the ones that you agree with is POV
  5. The MHP is one of the world's hardest simple brain teasers. Some people will never understand it if we force them to read through an essay on conditional probability.

I agree that none of the above is completely bomb-proof. If that were the case I would be pressing to have the complex solutions completely removed as unnecessary verbiage. As it is, we have a somewhat POV editorial at the start of the 'Probabilistic' section followed by several complex and fully conditional solutions. The interested reader is free to read these. Martin Hogbin (talk) 17:17, 6 August 2010 (UTC)

Please paraphrase what you think I've been saying. -- Rick Block (talk) 18:27, 6 August 2010 (UTC)
I must admit that I thought you , and others, had been saying that the argument that the specific goat door opened by the host cannot make any difference to the probability of winning even in the conditional case was incorrect.
Now I guess your argument is this: 'A symmetry argument shows that the simple solution to the unconditional problem must be a solution to the conditional problem. We have sources that give a simple solution to the conditional problem and a source which says that the symmetry argument is impeccable but we have no one source which gives a simple solution and states the symmetry argument therefore we cannot show the simple solution a a valid solution to the conditional problem'. Martin Hogbin (talk) 19:01, 6 August 2010 (UTC)
The statement that the specific goat door opened by the host cannot make any difference to the probability of winning is incorrect unless it is accompanied by an actual argument. I'd rephrase what you said my argument is more like this: A symmetry argument can be made showing that the unconditional probability computed by the simple solutions must be the same as the conditional probability. We have numerous sources that give simple solutions, and a source that says that the symmetry argument is impeccable, but we have no one source which gives a simple solution and actually makes the symmetry argument and therefore we cannot add this argument to the simple solutions in the article (because of .
BTW - there is no "unconditional" and "conditional" problem. There is simply "the problem". In all the time we've been discussing this I don't think you've ever produced any source that claims to be interpreting the problem as asking about the unconditional probability (if you think Seymann does, please read this source again - what he's actually talking about is the assumption that the host pick randomly between two goats which does make the problem symmetrical but doesn't magically change the probability of interest to be the unconditional probability). We have ALL agreed, numerous times, that the problem involves the player picking a door, and then the host opening a door, and only then is the player deciding whether to switch. Not before the initial door selection and not before the host opens a door. And not without the knowledge of which door was initially picked and which door has been opened by the host. Surely you understand by now that this means the probability of interest is the conditional probability. -- Rick Block (talk) 00:26, 7 August 2010 (UTC)
I understand your WP:SYNTH point but think the symmetry argument is so obvious and natural in the standard case that it scarcely needs stating. In any case I strongly believe that this is a case where we should bend the rules a little to make the article as useful as possible to the people who are going to read it. The 'conditional' issue is likely to be of interest only to a very small fraction of readers.
I disagree that there is no unconditional problem. You are still reading Whitaker's question like a question in an examination. It was a question from an interested member of the public who wanted to know the answer to a question. Just ask yourself this: 'Why would someone want to know the probability of winning by switching only in the case that they had chosen door 1 and the host had opened door 3?'. Whitaker was not actually on the show when he wrote in to Parade. He surely wanted to just know 'Is it best to stick or swap on the show?'.
You could argue that Whitaker meant to ask about what is the best thing to do in the case of a player who goes on the show, chooses a door at random then sees the host open a specific door to reveal a goat. This is a more complex question that is not properly addressed by any of the solutions given. To make my point these are the answers to that question
If we know that the host has a non-uniform but undisclosed policy for opening doors and we pick a door uniformly at random to start with, the probability of winning by switching is 2/3.
If we know the host's exact door opening policy for every possibility and we choose door 1 and see the host open door 3, we can calculate the probability of winning by switching from the host's policy parameter for that specific case, we know the answer will be from 1/2 to 1.
For cases in between the two I have quoted above we need to state exactly on what basis we are to answer the question and 'exactly' what assumptions we are to make before the question can be answered. No form of mathematical treatment is to any avail until we have decided on the 'exact' question we want answered. This fact is made clear by a reliable source (Seymann), but unfortunately no source gives us any answers. No solution given in the article addresses this point properly either. Martin Hogbin (talk) 11:16, 7 August 2010 (UTC)

—Preceding unsigned comment added by BradleDean (talkcontribs) 19:44, 17 September 2010 (UTC)

References

  1. ^ Falk: "The constant-ratio belief is the assumption that when some alternatives are ruled out, the ratio of the probabilities of the remaining alternatives should be the same as the ratio of their prior probabilities. This belief is obviously not valid since it leads to the answer l/2, whereas the correct answer (assuming an unbiased host) is l/3."

We should name the goats

We should name the goats

I believe that it is essential to differentiate the goats and treat them as separate entities in order to enumerate all of the equally likely possibilities that face the player. Let’s call one goat Billy and the other one Nanny. The list of possibilities for placement of car and goats is shown below. Designate door 1 as the first choice of the player. The last two columns show what happens for each distribution if the player switches or keeps his original choice. It doesn’t make any difference.

Door 1 Door 2 Door 3 Switch Stay
Car Billy Nanny Nanny Billy
Car Nanny Billy Billy Nanny
Billy Car Nanny Nanny Car
Billy Nanny Car Car Nanny
Nanny Car Billy Billy Car
Nanny Billy Car Car Billy

This analysis follows the same logic structure as the “Marilyn” solution but it notes that the goats are different.

—Preceding unsigned comment added by Beepbeep9 (talkcontribs) 13:53, 10 September 2010 (UTC)

It seems like the switch and stay columns aren't quite right. If the player initially picks door 1, then I think it should look like this (and I've added a Notes column):
Door 1 Door 2 Door 3 Switch Stay Notes
Car Billy Nanny Nanny or Billy Car Host can open door 2 or door 3, which one the host opens determines which goat the switcher ends up with
Car Nanny Billy Nanny or Billy Car Host can open door 2 or door 3, which one the host opens determines which goat the switcher ends up with
Billy Car Nanny Car Billy Host must open door 3 (showing Nanny)
Billy Nanny Car Car Billy Host must open door 2 (showing Nanny)
Nanny Car Billy Car Nanny Host must open door 3 (showing Billy)
Nanny Billy Car Car Nanny Host must open door 2 (showing Billy)
In 4 out of the 6 total cases the switcher ends up with the car. This table makes it difficult to tell what happens if the host opens door 3, but of the last four cases clearly only 2 apply. Both of the first 2 do as well, but since the host has a choice in these cases these cases are not as likely (if the host opens door 3) as the other two. Assuming the host picks evenly between Billy and Nanny in these cases, these two cases are (together) as likely as only one of the other two cases (each case in the table has probability 1/6 - but for the first two cases there are subcases where the host opens door 2 or door 3 making the probability of these 1/12 if we also know the host opens door 3), so even if we're only talking about what happens if the host opens door 3 the switcher has a 2 to 1 advantage. -- Rick Block (talk) 15:06, 10 September 2010 (UTC)


Rick,

Thanks for your comments. I cannot say that they led me directly to agreement, but they did force me to start over and try to justify what I did, It is an interesting process. Eventually I came to the realization that the table of permutations and combinations does not give equally probable events after the host has opened his door.

Beepbeep9 (talk) 00:21, 13 September 2010 (UTC)

Exactly. If you want to look at what happens after the host opens a door, a tree diagram like this one (that shows everything that can happen after the player picks door 1) is the way to go. From this tree it's clear if you've picked door 1 and the host has opened door 3, the probability the car is behind door 2 is twice the probability the car is behind door 1 (because the host must open door 3 if the car is behind door 2, but opens either door 2 or door 3 - presumably each one half the time - if the car is behind door 1). If you care about the names of the goats the first column of this table (labeled "Car location") ends up with 6 equally likely configurations - car/billy/nanny, car/nanny/billy, billy/car/nanny, billy/nanny/car, nanny/car/billy, nanny/billy/car - and from the top two of these there's a fork where the host opens door 2 or door 3 and from the bottom 4 there's no fork (the host only has one choice). You can then look at either all the cases where the host opens door 3, or just the cases where the host opens door 3 revealing billy (or only the cases where the host reveals nanny). -- Rick Block (talk) 00:57, 13 September 2010 (UTC)

Beepbeep9, is there a reliable source that gives the goats these names, and the solution you posted? Glkanter (talk) 13:31, 13 September 2010 (UTC)

Marking the goats as separate entities makes the solution to switch far simpler to grasp, this should be included on the main page near the top. —Preceding unsigned comment added by 75.158.2.195 (talk) 11:05, 11 January 2011 (UTC)
According to Krauss and Wang (cited in the article), viewing it from the host's perspective and ignoring the door 1/door 3 example given in the problem description makes it far simpler to grasp. The "simple solutions" ignore the door 1/door 3 example. Is there some source that says making the goats as separate entities makes it easier to grasp? -- Rick Block (talk) 14:09, 11 January 2011 (UTC)

Richard Gill's POV

Out of the wikipedia discussions and arguments on MHP was born a publication: Gill (2011), html version at my homepage. It's appearing in the first issue of 2011 of Statistica Neerlandica. Richard Gill (talk) 19:58, 15 January 2011 (UTC)

Archive 5Archive 6Archive 7Archive 8Archive 9Archive 10Archive 15

A Fresh Start

I just archived this page so everybody can make a fresh start after the Arbitration. I encourage all involved editors to put away any hard feelings from the past and to make a fresh start. Please remember that this article is now on probation and that editors making disruptive edits may be blocked by an administrator. I would strongly encourage all editors to stop and review WP:NPOV, WP:OR, WP:V, WP:OWN, WP:CIVIL, WP:NPA, and WP:EW. Even if you have read them before, read them again, and treat them as your road map for editing this article, its talk page, any any discussions about this article on your user page. Thanks! Guy Macon (talk) 04:12, 25 March 2011 (UTC)

One of two doors

Here's a quick way to argue against the current article's conclusion. It makes what appear to me to be some of the same subtle errors that I believe occur in the arguments for the currently-presented solution, but it has the virtue of being brief, and ( as it appears to me ) of producing the correct result. Let's ourselves be the contestant here. Assuming as much, then:

  • At the outset of the problem, before we've made any choice at all, consider that the car is behind one of two doors.
Yes, one of two. We just don't know which two, but it certainly is behind one of two. And of those two, based on what we know at this point, the chance that it's behind one is as good as the chance that it's behind the other.
  • Next, just to humor Monty, we "pick" one of the three available doors, and we try to keep his spirits up by pretending that our "choice" at this point matters. But we know it doesn't matter, because the outcome of the ensuing sequence will be the same regardless of which of the three doors we select at this point. After our apparent "choice", nothing relevant to the outcome has changed. The car is still behind one of two doors, and we still don't know which two those are.
  • Then Monty opens one of the unchosen doors, and now something has changed: We now know "which two".
Monty says, "Do you want to switch?" which translates into, "You now have the opportunity to pick between two doors. The car is behind one of them. Which one do you pick?" This is the first real choice we have, the first choice we can make that's relevant to the final outcome.
( There's zero chance that the car is behind the now open door, and no reason to believe that the now-dead chance that it was should be magically and preferentially transferred to any remaining door. There's a very subtle flaw involved here, imo, that I won't try to elaborate on, but that I think is due in part to a semantic confusion and in part to not properly accommodating time and instantiation as factors in this reasoning. Or maybe it would be more fun to observe that there was never any chance that the car was behind the now open door, but that we just weren't aware of that until now. ;-)
Remember that at the outset we knew that the car was behind one of two doors, that we didn't know which two, and that either of the two was as good as the other. Now we know "which two", and choosing one is still as good as choosing the other.

That being the case, there's no advantage to "switching". As I said, this is an abbreviated argument that I know incorporates certain subtle errors, but can anyone refute its conclusion in this same idiom? Apologies if this has been proposed before, especially if I'm wrong in my conclusion. ;-) Many thanks for reading,  – OhioStandard (talk) 07:52, 1 April 2011 (UTC)

OhioStandard your argument above shows exactly why the Monty Hall problem is so famous. Marylyn vos Savant got the answer exactly right, you should switch and you double your chances of winning if you do so. This is not in dispute. Please have a read through the article and see if anything there convinces you otherwise. Do you see anything wrong with any of the solutions given? If not, why are you not convinced by them? We would all be very interetsed to see how this article looks to a new reader. Martin Hogbin (talk) 09:21, 1 April 2011 (UTC)
Thanks for replying, Martin. I didn't know if anyone would. I did read through the article, of course. But I thought it would be quicker to express this positive argument than to try to make a negative one, i.e. than to use the idiom of the current article's examples and show why I suppose they're in error. I'll give that a try, though, and would be pleased to hear what you think about that effort. I'll also post, of course, if I persuade myself that the solution examples presented there are correct.
Likewise, if you feel inclined, I'd be pleased to know where you think I've gone wrong in what I presented above. No need to do so before I present my critique of existing solution examples, of course. But I wonder, for example, whether there would be general agreement with the proposition: "The first 'choice' ( i.e., before the contestant is offered the chance to 'switch' ) has no impact on the final outcome. Only the stay/switch choice is relevant." Best,  – OhioStandard (talk) 13:37, 1 April 2011 (UTC)
I would be happy to explain where you have gone wrong but this should probably be on my or your talk page as these pages are intended for discussions on how to improve the article. If you do not find the solutions in the article convincing then in my opinion the article need improvement. You will see from the article that most people get the answer wrong but one thing you can be sure about is that you have.
Have a look at the section beginning, 'Another way to understand the solution is to consider the two original unchosen doors together' and tell me why you do not agree with it. Martin Hogbin (talk) 20:47, 1 April 2011 (UTC)
What a delightful problem! I read quickly through the article a couple of days ago, and was unconvinced. Then, last night, without re-reading it, but after rapidly scanning through Marilyn Savant's presentation and remaining unconvinced, I posted the above. I was really very sleepy, though, both when I initially read the article and again, last night, when I scanned Savant's explanation and posted the above. ( I find Savant's self-promotion in very poor taste, and I suppose I'm quite prejudiced against her ideas. ) But when I awoke this morning it was the first thing on my mind, and I "grokked" it within the next couple of minutes. But how to express that "grok" concisely? Are you interested in hearing about my mental process to get there? I assume you might be, since you said you were curious about how the article would look to a new reader.
When I awoke, I found I did so with an image of a "shell game" present to mind, but one modified to include a gazillion shells with still just one pea. Obviously, I had combined one of Savant's suggestions with the presentation I vaguely recall from the article ( I'll look at both again when I'm through writing this ) having something to do with a million doors. I imagined Monty sweeping all the other shells but two right off the table, one of those two being the one I'd previously chosen, of course. Then it "hit" me: I'd been ignoring the information value of Monty's removal of ( a gazillion minus 2 ) shells.
I'd been preoccupied with my own subjective knowledge as a contestant, and missed the fact that Monty, transferred from his studio to the setting of my shell game, wasn't making his removal of all but one of the remaining shells at random, that his doing so was informed by intelligence I didn't have access to. I realized that, in effect, by sweeping all the other shells off the table, Monty, also, was making a choice as to which shell the pea was under, being constrained only by the (almost) negligible restriction that he couldn't choose the one I'd already chosen. Like me, he'd manifested a choice as to which shell the pea was under; he just had better information to inform his choice than I'd had.
I'm not certain how to formalize that insight without so-encumbering it with explanation that the "intuitive flash of recognition" it presents gets too submerged in notation or words or tables to be easily grasped. Decision tables don't seem to do it for me; it's one thing to grasp an idea, but another thing entirely to prove or demonstrate it, of course, and it's not often the case that the proof communicates the grasp, communicates that initial "intuitive flash of recognition" very economically, I believe...
I suppose one could explain that in the "gazillion shells" scenario, if my probability of making the correct choice was one-over-a-gazillion, that Monty's probability of doing so would equal 1 minus one-over-a-gazillion, i.e. that his probability wouldn't equal 1 only because he couldn't select from among all the gazillion to make his choice. But I'm not sure this would leave the typical reader very much enlightened, even if he were to accept the truth that the same formula must apply as the number of initial selections is (gradually?) reduced in successive example trials from a gazillion to just three. But it seems a promising avenue to me; is that in the article already? I'll be interested to see if it is, in that form. It wouldn't surprise me at all if I were recollecting it from there and assuming I'd come up with it originally.
I'll be curious to see how re-reading the article, and reading Gerhard's post, below, will affect my view of the problem. I purposely haven't done either yet, because I wanted to be able to express how I came to this with my presentation of it uninfluenced by doing so. Just in closing, I'll mention that one of the things I love about the way I came to see this, to the extent I presently do, is that it depends on the "reformulating the problem according to an extreme case" approach. Using a "gazillion" shells or doors, in other words, demonstrates Polya's statement to the effect that if you can't solve a problem, that there's also a simpler one (or an extreme case, as I'd put it, in this instance) that you can't solve, and that you should find that simpler problem, solve it, and use that knowledge to move on to the more puzzling one.
I see I no longer have time just now to read Gerhard's post with the attention it deserves, or to re-read the article carefully, either, but I'll do so soon, and will reply further. I expect the small degree of vague unease I still feel over the "three door" or "three shell" reductionist case, as opposed to the "gazillion" case, will be dispelled by the understanding I expect I'll gain in doing so. Thanks so much; this has been loads of fun so far. It really is a delightful and fecund problem! Thanks,  – OhioStandard (talk) 04:55, 2 April 2011 (UTC)
Ohiostandard, I am interested in how you came to understand the correct solution as I think i will help improve the article. The way in which you understood the answer also explains another aspect of the problem that some people find troubling, which is that it matters whether Monty knows where the car is and does not just pick an unchosen door at random which happens to reveal a goat. I find your description of the transfer of information from Monty to you an interesting way of explaining why this is so.
If you read the explanations in the article you will probably find them all convincing now you have grokked it, that is why I was so interested in your thoughts earlier as I want to find out how effective this article is for newcomers and how it might be made more so. Martin Hogbin (talk) 08:47, 2 April 2011 (UTC)
Thanks again, Martin, very much, for being patient with yet another newcomer who initially thought he knew better. :-) I can entirely understand why this article has resulted in such a shouting match; a person would need to have the patience of a saint to keep trying to explain, over and over again, for years on end.
I have now re-read the article, and Gerhard's and Rick's very kind explanations, as well. I'm very grateful for the work everyone has put into the article, and especially for Gerhard's and Rick's generosity, and for yours as well. But without the least possible slight toward any preceding explanation, I have to admit that none of those really do it for me. They're fine as proofs or demonstrations, but for me ... well, they don't really nudge me along toward that "flash of insight" (aka "grok") that seems so necessary in this case.
I guess my major criticism of the article in its current state is that it focuses too little on helping mathematically unsophisticated readers get the "intuitive flash" they need. As I'm sure you're all aware, many, many mathematicians have written about the disparity between the insight that motivates a proof and the proof itself, and have observed that the proof isn't always or even usually the most efficient means of communicating that "flash".
I love well expressed proofs dearly, of course, and I understand why it's absolutely crucial that they be painstakingly exact in the ideas they present and in the language and symbols they use to present them. It's just that the insight that motivated them in the first place is often times submerged, especially for the non-mathematician, in the necessary rigor.
With a view toward trying to help address that, I've posted to the article's main talk page with a suggested re-write for the "increasing the number of doors" approach, and would be very pleased to learn what you all think of that. Thanks again, to everyone here, for your generosity and patience, and refusal to bite! Cheers,  – OhioStandard (talk) 21:29, 2 April 2011 (UTC)


@Ohiostandard: Will that table in your opinion help to show that the contestant will win by staying in 1/3 of cases, but will win by switching in 2/3 of cases,
no matter at all which door she should have chosen (1, 2 or 3), and no matter which other door whatsoever the host has opened, showing a goat:

  Supposed door  A  was chosen by contestant:  
 

"Another" door was opened by the host, showing a goat:



  contestant's final decision:  
Initial arrangement behind doors   A,  B,  C
(probability)
 Open Door A  
(probability)
 Open Door B  
(probability)
 Open Door C  
(probability)
Joint
   probability    
Win by
 staying  
Win by
switching
Car     Goat     Goat     (1/3) No Yes
(1/2
)
No 1/3 x 1/2 Yes
(1/6
)
No
No No Yes
(1/2
)
1/3 x 1/2 Yes
(1/6
)
No
Goat     Car     Goat     (1/3) No No Yes
(1
)
1/3 x 1 No
Yes
(1/3)

 
Goat     Goat     Car     (1/3) No Yes
(1
)
No 1/3 x 1 No
Yes
(1/3)

 
Note that the host has no choice when the contestant in 2/3 of all cases fails and
chooses a goat, as in these cases the host is just bound to show the second goat.
Win by
staying
1/3
Win by
switching
2/3


And note: If, in only 1/3 of cases, the contestant luckily should have chosen the car, only then the host has a choice indeed and can open either of his two doors, both containing goats.

Some sources now said that the host eventually could be "biased" to some extent to open just only one special "preferred" door if ever possible, avoiding to open the other door, avoiding his "unwanted" door.

But for the decision asked for, to stay or to switch, the "door number first chosen" by the contestant and "the door number opened by the host" is of no relevance at all, as you do not exactly know about such "bias" and about its "direction", and even if you exactly knew about such bias, its extent and its direction, it would never "help better" to make the right decision. Let us assume even an extremely biased host:

If, in 1/3 of cases, the host got both goats, then he can and will open his preferred door, and switching will loose. Probability to win by switching: Zero.
And if, in another 1/3 of cases, he got the car and one goat, and the goat being behind the door he usually prefers to open, then he can and will also open his preferred door, but in this case switching will win, probability to win by switching: 1.

So, whenever in these 2/3 of cases an extremely biased host opens his preferred door, probability to win by switching will be at least 1/2. And that means staying never can be any better than to switch, so you should switch.

If however, in the last 1/3 of cases, he got one goat and the car, but the car being behind his preferred door, then he exceptionally will be opening his "avoided" door and, by doing so, be showing that the car is very likely to be behind his preferred but now still closed door, and switching is very likely to win the car, and switching is imperative.

You never can nor will know any "assumed closer is better" -probabilities for any "actual game". Conditioning on "just assumptions". Unneeded diligence. Results of "conditional probability theorems" will forever remain within the strict bounds of at least 1/2, but never less, to 1. Irrevocably. You know that already from the outset. So conditional probability calculus is quite irrelevant for the question asked for, even if one should exactly know about "host's bias, its extent and its direction", as staying never can be any better than always to switch, and you know for sure that switching will win the car in exactly 2/3 of cases. Full stop. Irreversible. No further "proof by conditional probability calculus" needed.

Will that table help the readers to see that switching will double the chance to win from 1/3 to 2/3? Regards, Gerhardvalentin (talk) 18:16, 1 April 2011 (UTC)

Gerhard, I'm extremely grateful to you for taking the time to respond so carefully here: It's very generous in you. You may notice the response I top-posted above, after Martin's reply to me. I hope you won't be offended that I (still) haven't read your own reply; I certainly will do so, but it would be disrespectful not to do so with the care it merits, and I have to fly out the door just now. I'll read through it carefully soon, though, and will reply. Again, many thanks for taking the time to post this. Best regards,  – OhioStandard (talk) 04:55, 2 April 2011 (UTC)
Gerhard, I've read carefully through your post, now, and would like to again thank you for it. As I wrote above, though, I'm afraid I have to admit that I'm too dense for tables to be of much help to me. I certainly admit they can demonstrate a solution but, for me, they're not much assistance in understanding why that solution is as it is; they're not much help for me in trying to understand the problem, that is. There's just too much for me to look at, too much to take in ... I think it was Poincare' who wrote that you don't really understand a proof until you see it as a single idea. I don't know if I'd go quite so far as to say that for all problems, but I think it does apply to this one. Sorry to disappoint; I'm sure there are others who would read through your presentation above and at some point in their reading get that "Aha!" moment of understanding. I'm just not one of those, I'm afraid. Thanks very much for your presentation, however; as I said before, I'm very sensible of your generosity in making it. As I mentioned to Marin, up above, I've posted a section to the article's main talk page about a possible re-write based on the "increasing the number of doors" idea. I'd be pleased to know whether you think it has any value. Best regards,  – OhioStandard (talk) 21:29, 2 April 2011 (UTC)
But I am interested to hear from you whether it is plausible that the above table clearly shows that the probability to win by staying "1/6 + 1/6 = 1/3"  is only one half of the probability to win by switching of "1/3 + 1/3 = 2/3".  Regards, Gerhardvalentin (talk) 23:34, 2 April 2011 (UTC)
Maybe it does show that, Gerhard, but I'm afraid I can't say that it does so clearly, for me. Perhaps it would do so for someone who had taken a probability class recently, though, and who was thus already familiar with the interpretation of such tables. For my part, I spent 30 minutes before I understood what the "(1/2)" in the top cell of the third column was intended to mean, and then only did so by reading the text of the Citizendum page that includes basically the same table. And without previous acquaintance with Bayes' rule ( "posterior odds equals prior odds times likelihood ratio", as Citizendum expresses it ), the "Joint probability" column wouldn't make sense to the uninitiated, and the rest of the table would be equally opaque, as well, I believe. To speak very candidly, I think the table would be very likely to confuse most readers, even assuming they'd take the time to try to understand what it's intended to communicate. Sorry to disappoint; perhaps other readers would find it easier to interpret than I do. Best,  – OhioStandard (talk) 02:20, 3 April 2011 (UTC)


@OhioStandard - Responding to your original request (in the same idiom as your original argument), if we follow your argument through it's fairly simple to point out where it goes wrong.

  • At the outset of the problem, before we've made any choice at all, consider that the car is behind one of two doors. We don't know which, and the chances that it is behind one is the same as the chances it is behind the other.
Perfectly true. It's even true that if you pick ANY two doors at this point the chance that the car is behind either one is the same. However, the only way to make this true is to consider the probability that the car is behind any of the three doors is the same, i.e. p(door 1) = p(door 2) = p(door 3), and they must add up to 1, so they must all be 1/3. If you pick any two, the chances are even at 1/3:1/3 which only adds up to 2/3. This reflects the uncertainty that you've picked the right two. The chances that the car is behind a randomly selected two doors at this point is only 2/3.
  • Next, just to humor Monty, we "pick" one of the three available doors, and we try to keep his spirits up by pretending that our "choice" at this point matters. But we know it doesn't matter, because the outcome of the ensuing sequence will be the same regardless of which of the three doors we select at this point. After our apparent "choice", nothing relevant to the outcome has changed. The car is still behind one of two doors, and we still don't know which two those are.
Also perfectly true. The key is we still don't know how to pick two doors guaranteeing the car is behind one of those two. So the probability of each of the three doors is still 1/3. The probability of any two (at this point) is 2/3.
  • Then Monty opens one of the unchosen doors, and now something has changed: We now know "which two".
Yes, now something has changed. But it's more than you think. The probability that the car was behind the two remaining doors that used to be 2/3 is now 1, and the probability the car is behind the open door that used to be 1/3 is now 0. How this is possible is that we're now not talking about the "original" probabilities - but something different. Specifically, we're now talking about the conditional probabilities in the case that the host has opened (say) door 3. This doesn't mean the "original" chances that the car was behind the two remaining doors was 1 or that the original chances that the car was behind the door that has been opened was 0. The original chances were (and still are!) 1/3 for each door, 2/3 for any two doors, and 1 for all three doors. If you pick (say) door 1, when the host opens a door these original chances are split into two cases, one where the host opens door 2 and the other where the host opens door 3. The probabilities in each of these cases are conditional probabilities. The original probabilities of door 1 and door 2 were indeed equal, at 1/3 each. And now, if we're in the case where the host opened door 3, we know its (conditional) probability is 0. We also know the conditional probabilities of the two remaining doors will add up to 1 (just as they do in the case where the host opens door 2). Where you're going wrong is thinking that because the original probabilities of these doors was equal, then the conditional probabilities of these doors must be equal as well. If the car is behind door 2 the host MUST open door 3. However, if the car is behind door 1 the host chooses whether to open door 2 or door 3. Assuming this is a fair choice, this means the host opens door 3 only half the time the car is behind door 1. Half the time behind door 1 and all the time the car is behind door 2 means you have a 2 to 1 advantage if you switch.
If you pick door 1 the original 1/3 + 1/3 + 1/3 = 1 splits (unevenly) into
1/6 + 1/3 + 0 = 1/2 (if the host opens door 3)
and
1/6 + 0 + 1/3 = 1/2 (if the host opens door 2)
Expressing these as conditional probabilities, you get 1/3 + 2/3 + 0 (considering only the case where the host opens door 3), or 1/3 + 0 + 2/3 (considering only the case where the host opens door 2).
The essential confusion is the difference between the original unconditional probabilities and the conditional probabilities in effect after the host opens one of the doors. -- Rick Block (talk) 05:40, 2 April 2011 (UTC)
Thanks so much, Rick, for your generosity in replying like this, "in my idiom", so to speak. This is a really clear explanation of where I went wrong, and it was great fun to read and understand. I agreed with everything you'd written up to this point, but when I read, "Where you're going wrong is thinking that because the original probabilities of these doors was equal, then the conditional probabilities of these doors must be equal as well" I thought to myself, "that's a beautiful and concise explanation!" Before coming back to this I'd read James & James' definition/entry for "Probability", since it's been so long since I've thought about the subject ( and was never any good at computational or applied math, anyway ) and that refreshed my memory re the distinction between mathematical or (same) a priori probability, and conditional probability, an important distinction, of course, and one that the wikilink you provided points out, as well.
Again, I appreciate your kindness and effort in presenting this explanation, Rick, very much. You'll see that I've also responded at some length to Martin, above (sorry for the top post), although my reply there really is meant for you and for Gerhard, as well. As I said (as if) to Martin, I'd also be pleased if you'd care to look at the "Increasing the number of doors rewrite?" section I added to the article's main talk page, and see whether you think it has any value. Perhaps it just represents the way my own peculiar brain came to the "flash" or "grok" or "Aha! moment" of seeing the solution; I'd be pleased to hear objective opinions about whether it might be more universally helpful. Best regards,  – OhioStandard (talk) 21:29, 2 April 2011 (UTC)


Btw, since this is so long a thread, I'll have no objection if regulars here want to collapse it, or parts of it, or just move it manually to archives at some point when everyone has been able to reply. I'll leave that up to all y'all. Thanks again, everyone, for your very generous comments so far!  – OhioStandard (talk) 21:40, 2 April 2011 (UTC)

A different table

OhioStandard - can you comment on this table showing how many times we'd expect the car to be behind each door in a sample of 300 games, and the outcomes when switching (assuming the player picked door 1 initially)?

Situation BEFORE the host opens a door Situation AFTER the host opens a door
Door 1 Door 2 Door 3 total cases host opens Door 2 host opens Door 3
cases result if switching cases result if switching
Car Goat Goat 100 50 Goat 50 Goat
Goat Car Goat 100 0 N/A 100 Car
Goat Goat Car 100 100 Car 0 N/A

The point is to show that by opening either door the host splits the original 300 cases into two subsets. The (conditional) probability of winning by switching (if the host has opened, say, door 3) is evaluated in the context of one of these subsets, not in the context of the entire original sample of 300 shows. -- Rick Block (talk) 16:17, 3 April 2011 (UTC)

Thanks, Rick. I sincerely hope it won't offend anyone if I say so, but this is the first table I've seen that's simple-enough and clear-enough that I could understand it quickly, and see how it demonstrates the 2/3 versus 1/3 advantage re switching. I'm getting the impression that most regular contributors here have their favorite way of illustrating that, but I have to say that I think this is a really good one. It's much (!!!) easier for me to understand than either of the tables currently in the article, or than the decision tree either, for that matter.
I know that someone must have put a great deal of work into the graphics for the illustrated table especially, and I certainly don't intend to disparage that effort: Quite the contrary; I honor it. If I'm to answer candidly, though, I'm afraid I have to say that while I imagine the decision tree has value to the uninitiated, I can't support the same statement for the two tables presently in use in the article, and I'd prefer to see the article simplified by using the one above, instead.
I've top-posted this, above Guymacon's flush-left/outdented comment below, btw (sorry, Guy) because his outdenting made it impossible for me to otherwise indicate which post I was responding to. Thanks Rick, for this table. I think it's extremely helpful, and that it provides the strongest assist I've seen so far to help the average reader who has no training in probability or statistics understand the solution.  – OhioStandard (talk) 15:11, 4 April 2011 (UTC)
Thanks for correcting the outdent mistake. It really is a quite good table. Guy Macon (talk) 22:59, 4 April 2011 (UTC)

( ← outdenting ) Let me ask a question here (there is no ulterior motive or implied criticism: I really do not know). Are we as a group slipping back into the sort of Wikipedia:Truth discussion that led to stalemates and frustration in the past, or are we moving forward with the sort of Wikipedia:Verifiability, not truth discussion that leads to Wikipedia:Consensus? Guy Macon (talk) 14:10, 4 April 2011 (UTC)

This is the "Arguments" page. IMO, here (but not on the main talk page) discussions are explicitly permitted to drift more toward "truth" - the topic is mathematical arguments concerning the MHP as opposed to content of the article. -- Rick Block (talk) 14:25, 4 April 2011 (UTC)
Ah. Of course. (note to self: next time smoke crack after posting to Wikipedia...) Guy Macon (talk)

Talk about notation

The present article contains in the conditional probability section a fully written out proof of Bayes' theorem in the context of MHP. (IMHO, totally superfluous, since also present in the wikipedia article Bayes theorem). But anyway there were discussions about the proper notation to be used in this proof. I have written out an alternative here (on my talk page) and written a general essay about notation here also in my user area. Richard Gill (talk) 09:51, 5 April 2011 (UTC)

Increasing the number of doors rewrite?


I think the "Increasing the number of doors" approach that's currently (permalink) in the article is probably the one that's most likely to be accessible to our widest swath of readers. But I also think it needs to be rewritten to accomplish that. Here's how I'd use the idea to explain the solution to someone who knew only simple algebra:



Look at it from Monty's perspective
To understand this, you need to set aside your natural preoccupation with your own perspective as a potential contestant, and look at the situation from Monty's perspective instead. Specifically, you need to recognize that you're not the only one who expresses a choice about which door the car is behind. It's easier to understand that if you imagine there's some impossibly large number of doors, at first; let's say a "gazillion".
So you recognize that when you first choose one door, your chance of doing so correctly is really small, one-in-a-gazillion, right? Then switch perspectives, and put yourself in Monty's shoes: Ask yourself "What is Monty's chance of choosing the correct door, the one with the car behind it?" "But wait!", you say, "Monty doesn't choose anything!!"
Monty does make a choice, though, and this is the key to understanding the problem: When Monty opens all the other doors but one, he eliminates all those doors because he knows the car isn't behind them. Assuming you didn't guess correctly the first time (a reasonable assumption, given your one-in-a-gazillion chance) then Monty is effectively choosing the door the car is behind, by keeping that one door closed and opening all the others. The rules he has to operate under make it impossible for him to do anything else. Further, his expressing that choice, even though he does so indirectly, discloses information to you that you'd be smart to take advantage of when he eventually asks if you want to switch from your original choice to his.


The mathematics of this insight, and what they show for the three-door case
The mathematics of this become pretty simple to understand if you reduce the number of doors at the outset from a "gazillion" to just 100. In that case, when you first choose a door, it's pretty clear that you have a .01 or (same thing) 1% chance of choosing the right one. But then ask yourself, "What chance does Monty have of choosing the right door?" ( Recall that he has to express his choice not by picking the door directly, of course, but by opening all the other ones, except yours. )
Since Monty knows where the car is, the answer would be that he'd choose correctly 100% of the time if he were not constrained by your having eliminated one of his possible choices when you picked first. So that constraint means that he has only a .99 or (same thing) 99% chance of choosing the door with the car behind it. His chances of picking the correct door are, in this example, 99 times better than your chance of having done so.
To express that insight in a formula, if we let
  • N = the number of doors at the outset,
  • PContestant = the contestant's probability of choosing right the first time
  • PMonty = Monty's probability of choosing the right door, i.e. of leaving closed the door with the car behind it
Then it's clear that
  • N = 100
  • PContestant = 1 / N = 1/100 = .01 = 1%
  • PMonty = 1 - ( 1/N ) = .99 = 99%
  • PMonty = 1 - PContestant
This last statement or formula isn't strictly necessary to solve the problem, of course, but it might help you understand it better: It says that Monty's probability of choosing correctly is 1 minus the contestant's probability of choosing correctly, because the contestant might have got lucky and picked the correct door before he could, thus eliminating Monty's ability to do so.
Now these formulae continue to hold if, instead of starting with a gazillion doors, or even 100 doors, we reduce the number of doors at the outset to 50, or 25, or even to three. So with three doors at the outset, we have:
  • N = 3
  • PContestant = 1/3
  • PMonty = 2/3
It may seem contrary to common sense in this case, when we've reduced the number of doors to just three, but the same reasoning and the same formulae do continue to hold, and if you switch to the door Monty "chose" (by leaving it closed), you do in fact double your chances of winning.



I don't intend to introduce this proposed explanation into the article myself: That would seem the height of presumption to me, given that some of you have been herding cats here for years! ( Thanks for that! ) But I do think it presents the clearest chance that a person who's relatively unsophisticated in math has of getting the "flash of insight" that's necessary to understanding the problem. I don't claim it's the best way to prove the solution, or even that it's a proof at all, though, just that it's likely to be pretty accessible to a typical reader. What does everyone else think about that?  – OhioStandard (talk) 19:50, 2 April 2011 (UTC)

( On re-reading this, I have some misgivings over saying that Monty has only a probability of .99 in choosing the "right" door in the case where we start with 100 doors at the outset, or .66 in the case where we start with three. Obviously his probability rises to unity, to 1, if the contestant's first pick was incorrect. But I won't (yet?) complicate the presentation I made above by correcting it to accommodate cases based on whether or not the contestant's first choice is correct. I know that might be helpful, of course, because Monty's "chances" of picking the correct door drop to zero if the contestant gets lucky on his first pick. But if people think the approach has value, maybe we can work together to clarify the two cases, and dot the "I's" and cross the "T's". Best regards, all, and thanks for your ongoing work on this article, very much. Btw, special thanks to Martin, Gerhard, and Rick, for their patient and generous help on the "Arguments" subpage. Cheers,  – OhioStandard (talk) 22:12, 2 April 2011 UTC )
"Monty is effectively choosing the door the car is behind, by keeping that one door closed [...] impossible for him to do anything else." — Incorrect, because if you should have chosen the winning door, the host will not dispose of the car. So what is written here is just incorrect. And if the host "should be biased to just never open the door he left closed, if any possible", then the chance that the still closed door hides the car could be only 1/2 at least (as per Ruma Falk), even in your gazillion example. And btw: "Then it's clear that ..." is never enough, the content of the article must be sourced. Regards, Gerhardvalentin (talk) 23:10, 2 April 2011 (UTC)
Thanks, Gerhard, but I think you missed the condition that the boldfaced part of the sentence is preceded by, viz. "Assuming you didn't guess correctly the first time (a reasonable assumption, given your one-in-a-gazillion chance)..." Also, I'm afraid I don't understand your objection about the possibility of the host's bias. As I understand the problem, Monty has no opportunity for bias; as I understand it he has no choice but to open all but one door, excluding your own. Could you explain further?
I've moved your comment to just above, btw, from being interleaved with the presentation I made in the "callout box". If everyone did that, then the original intent of that post, and what I'd written versus what everyone else contributed, would become obscure. Also, if we can all agree on an aid to understanding (not a proof) then I don't see any reason why that would have to be be "sourced". Or was that a part of the arbitration requirements?  – OhioStandard (talk) 00:00, 3 April 2011 (UTC)
Good point, @Ohiostandard. Host bias (or not) is irrelevant to the simple solutions. Those who would always switch will win with unconditional probability 2/3. For frequentists, this holds whether or not the host is biased, and for subjectivists this holds whether or not their opinions about possible host bias are symmetric regarding the direction of any bias. Gerhard Valentin refers to Ruma Falk for the host-biased conditional result but it was already in Morgan et al, but more easily derived with Bayes' rule (odds form of Bayes theorem), for instance, cf. Rosenthal's paper and book. But you are talking about an argument for the simple (unconditional) result. It's indeed a valuable aid for understanding why the "naive" argument "no need to switch because Monty has not given you any information about your initial choice" is faulty. It's given by many sources. In the unbiased case, Monty has given you no information about your intitial choice, so the probability that that was right is still 1/3. There is still 2/3 chance left and that stays where it was, with the two other doors, one of which Monty has kindly shown to you does not hide a goat. As Gerhard mentions, in the case of a biased host, the identity of the door which Monty opens can contain information. In the most extreme cases, the chance your initial choice was right can rise to certainty, or fall to 1/2. But it is never unfavourable to switch, since there is no way to improve the 2/3 overall success chance of "always switching". Editor @Lambiam recently found an alternative proof that 2/3 cannot be beaten, and hence that all conditional chances of wining by switching are at least 1/2, by showing that a known host bias can only be advantageous to the player. Yet even with a maximally biased host, it's easy to see that 2/3 (overall) can't be improved. I wrote it out in [1]. Richard Gill (talk) 10:47, 3 April 2011 (UTC)
Thanks, Richard. Your kind response reminds of one of the many things I love about Wikipedia: There are experts here in just about any subject that could interest a curious person, and they're usually willing to share their expertise, even with untrained persons, quite generously and patiently. However it's communicated, I do wish it could be made explicit in the article that Monty is actually making a choice, too, but one that's based on better information, and that his doing so has information value for the contestant.
Also, I don't know whether it can make it into the article or not ( I'll leave that to you who've contributed here for so long ), and without the least wish to disparage any other presentation, all of which I honor, I feel that candor requires me to disclose that the first immediately convincing presentation that has worked for me for the simple version is the one (permalink) that Rick Block posted to the Arguments page under the heading, "A different table". Best regards, all.  – OhioStandard (talk) 17:25, 4 April 2011 (UTC)
OhioStandard, I am a little concerned that you are deferring to other editors based upon them being here longer. Please read WP:NVC WP:OWNERSHIP and WP:ODNT. (I do agree that it is best to discuss changes and and seek consensus rather than just jumping in and changing things, but we are all equal here.) Guy Macon (talk) 07:52, 10 April 2011 (UTC)
That's a fair point, Guy, based on the language I used above. If it'll make you feel better, I don't mind disclosing that my apparent humility is much more formal than real. "Jungle manners" on entering new or disputed territory, as Marie Louise von Franz used to say. ;-) The links are helpful, nevertheless, but please let me assure you that if something comes up that seems important to me, I'm perfectly willing to make any amount of noise or to (figuratively) bloody my knuckles to be heard about it. I genuinely appreciate your remarks, though; thanks.  – OhioStandard (talk) 10:16, 10 April 2011 (UTC)

Probability concepts

IMHO, one can see from any written source about MHP whether the writer is a subjectivist or a frequentist in his or her use of probability. Often of course without even knowing about the distinction. I think that the present page contains quite a few subtle biases to one or the other concept of probability. For instance the just mentioned proof of Bayes' theorem explicitly put randomness in the physical procedure used by game-show team and the host to hide the car and open a door respectively. Probability is in the real world. Ontological probability (about things). Whereas the "usual assumptions" of equal probabilities really (IMHO) only make sense for the subjectivist (aka Bayesian) viewpoint, whereby probability describes our information or lack thereof. It's in our mind. It describes our personal ignorance or knowledge. Epistemological probability (about our knowledge). I wrote a little essay about this on my talk page, here, it was originally a response to a correspondent. Read Probability interpretations on wikipedia.

So far editors of the page have shied away from this issue. Maybe they are scared of opening a can of worms? Or they have a POV and don't know any other? I think the issue can't be avoided in any decent discussion of MHP. You can only draw conclusions by making assumptions and where do those assumptions come from? The man in the street is a subjectivist, the academic literature is biased to the frequentist position. Richard Gill (talk) 10:00, 5 April 2011 (UTC)

I am very happy to discuss the subject with you. It has indeed caused a lot of confusion in recent arguments.
I do not think the distinction is as great as you suggest. The two approaches can always be made to agree by making sure that the set-up of the frequentist approach matches the state of knowledge in the Bayesian approach. If you disagree, please give an example of where the two approaches give different answers. Martin Hogbin (talk) 22:03, 7 April 2011 (UTC)

Better shoes argument?

If it won't annoy people, may I ask whether any sources have tried to give the reader the necessary "flash of insight" by asking him to put himself not in the position of the contestant, as is usual, but in the position of the host, instead? As in the following, for example:

Imagine the producers of the show let the host keep the car for himself personally if the contestant picks the wrong door at the outset and the host then (indirectly) "picks" the correct one, i.e. by choosing to leave that one of the remaining two doors closed. Now suppose the producers, at the start of the event, offer to let you play the role of the host instead of that of the contestant. The contestant gets to pick a door first, but you'd be told in advance which door the car is behind if you played the host. Which role would you prefer, and why?

This seems pleasantly concise, and (to me) very likely to at least give the reader pause, to give him a salutary doubt that his immediate "50/50" assumption re the stay/switch decision is correct. It also has the pleasant consequence, I think, that if people think about it, it will bring them to the conclusion that it's more advantageous to know where the car is than it is to pick a door first. One could then go on to explain that by choosing "switch" one effectively puts himself into the host's shoes, and thus benefits by the host's foreknowledge of where the car is. I'm mostly just curious to know whether this has been argued before. Thanks,  – OhioStandard (talk) 00:34, 8 April 2011 (UTC)

Krauss and Wang (reference in the article) set out to try to get people to arrive at the correct answer more frequently. Asking people to take the perspective of the host is one of the things they tried (experimentally). It indeed does help. -- Rick Block (talk) 06:24, 8 April 2011 (UTC)
In Selvin's second note, this is long before Marilyn Vos Savant, Selvin quotes from a letter sent by Monty Hall himself, who did solve the problem in one sentence from the point of view of the host. For the host, the location of the car is known, and the way he will choose a goat-door if necessary is his own business. The only thing random for him, is which door the player chooses, which from his point of view has a odds of 2 to 1 against of being correct. Richard Gill (talk) 11:09, 10 April 2011 (UTC)

Will the real probability please stand up

One of the continuing problems here, in my opinion, is the assumption by some editors that there is some 'real' meaning of the the term 'probability' that everybody knows and understands. The truth is much closer to the reverse; there are two definitions of probability in common use and nobody understands what the word 'really' means.

Richard Gill and I have tried several times to discuss this important subject but to no avail. Might I suggest that it would avoid many pointless arguments if people would state which definition of probability they are using and stick to it throughout their argument. It is generally unhelpful to talk the results of 'repeating the experiment' when discussing a subject from a Bayesian perspective. Similarly, states of knowledge are of little relevance in calculating a frequentist probability.

Now, my personal opinion, at least, is that it is always possible to make the two interpretations of probability agree by carefully making sure that the frequency distributions in the frequentist interpretation match the state of knowledge in the Bayesian interpretation. However that is a subject for another time and place.

What would greatly help here would be for editors to clearly state which interpretation they are using and then stick to it. Unfortunately, not many sources do this either. Martin Hogbin (talk) 08:57, 9 April 2011 (UTC)

The following is a quote from Probability interpretations:
"The terminology of this topic is rather confusing, in part because probabilities are studied within so many different academic fields. The word 'frequentist' is especially tricky. To philosophers it refers to a particular theory of physical probability, one that has more or less been abandoned. To scientists, on the other hand, "frequentist probability" is just what philosophers call physical (or objective) probability. Those who promote Bayesian inference view 'frequentist statistics' as an approach to statistical inference that recognises only physical probabilities. Also the word 'objective', as applied to probability, sometimes means exactly what "physical" means here, but is also used of evidential probabilities that are fixed by rational constraints, such as logical and epistemic probabilities."
Also see Probability, Probability interpretations, Frequency probability, Bayesian probability, Pignistic probability, Algorithmic probability, Philosophy of probability, Sunrise problem, Two envelopes problem, Necktie paradox, Exchange paradox, and Doomsday argument. Guy Macon (talk) 14:50, 9 April 2011 (UTC)
Guy, I am not sure what you are trying to tell me. Your quote above confirms my point above that there is no general agreement on exactly what probability 'really' is.
The probability article says:
1. Frequentists talk about probabilities only when dealing with experiments that are random and well-defined. The probability of a random event denotes the relative frequency of occurrence of an experiment's outcome, when repeating the experiment. Frequentists consider probability to be the relative frequency "in the long run" of outcomes.[1]
2. Bayesians, however, assign probabilities to any statement whatsoever, even when no random process is involved. Probability, for a Bayesian, is a way to represent an individual's degree of belief in a statement, or an objective degree of rational belief, given the evidence.
My point is that in discussing probability issues regarding the MHP it would be very helpful and would avoid pointless disagreements if editors would adopt a particular perspective and stick to it for the duration of the argument.
My personal opinion is that the Baysian approach is the most appropriate approach for the MHP but I do wish to try to force that opinion on anyone else and I am happy to discuss the problem from a frequentist perspective.
I am not sure from your post whether you agree with me or not Guy. Are you asserting that there are other important interpretations of probability or do you agree that there are only two?
Do you agree that in a discussion about probability it is best to stick to one interpretation for the duration of a logical argument rather than change the meaning of the word part way through a discussion? Martin Hogbin (talk) 15:39, 9 April 2011 (UTC)
Yes. I fully agree that we need to pick a definition and stick to it. The definition I use as a working engineer is not the definition a philosopher uses, and I suspect that a statistician will have a third definition. Any of the different definitions will work if everybody uses them , but having different people use different definitions is an invitation to misunderstanding and strife. Guy Macon (talk) 20:16, 9 April 2011 (UTC)
Yes the words are confusing and are used to mean the opposite by different communities. Maybe we should just stick to using the adjectives "epistemological" and "ontological"  ;-) Richard Gill (talk) 06:56, 10 April 2011 (UTC)
Responding to Martin's original post in this thread - IMO, what we mean by probability is not even remotely the issue here. What IS the issue is what probability we mean (per my response to Martin in the thread above). Bayesians and frequentists are both completely capable of distinguishing P(win by switching) from P(win by switching|player picks door 1 and host opens door 3). If we want to reduce the confusion here, we should clarify which one of these we're talking about. Are we interested in the probability of winning of a strategy of (always) switching, or the probability of winning given the player has picked a particular door (say door 1) and has seen the host open some other door (say door 3)? Which of these we're talking about is and always has been the issue - not the meaning of probability. -- Rick Block (talk) 23:59, 9 April 2011 (UTC)
Rick, that was not a response to Martin's point at all, I think. Of course everyone ought to be able to agree on a distinction between P(A) and P(A|B). And both kinds of probability satisfy the same rules so the arithmetic is the same too. But the words around the arithmetic and the distinction of concepts *is* important. And the literature is confusing since different writers do clearly have different intended meanings of probability in mind. Moreover, the assumptions which some people want to add into the problem in order to be able to solve it their way have to be justified or explained, and the justification or explanation is different per different understanding of what the word probability is supposed to mean.
So an editor of the page his well advised to be aware of the distinction and he or she has to be master of using formulations which are probability interpretation free. It can be done but it requires careful and clear thinking. Never talk about "how many times". Aways talk about "so many times more likely". Basicly it forces you into the language of odds, not of probabilities. Richard Gill (talk) 06:51, 10 April 2011 (UTC)
Rick I am not saying that the issue we are arguing about is related to the different interpretation of probability, just that not making clear what interpretation you are using makes it very hard to discuss the subject without continual misunderstandings. I am not even asking you to stick permanently to one approach, all I am asking is that we talk about them one at a time. I have not been able to respond to your reply to me above because you seem to continually flit between one approach and another whilst using concepts from one approach in the context of another.
Let me say again, I do not want to have a philosophical discussion about the meaning of probability on this page all I ask is that during any given discussion we adopt one interpretation and stick to it. Can there be any valid reason not to do that? Martin Hogbin (talk) 21:14, 10 April 2011 (UTC)
And, what I'm saying, is that for the purposes of any discussion we've ever had, which interpretation of probability we're talking about makes no difference whatsoever. In my reply above (that you haven't responded to) I don't "flit between one approach and another" but connect what we ARE talking about to both. The OP above was asking why his professor insisted on using conditional probability. The explanation (whether you're a frequentist or a Bayesian) is the same - i.e. that at the point of making the decision whether to switch, the player knows which specific doors are involved (and the doors are not "otherwise indistinguishable"). Whether you're a Bayesian or a frequentist, this means you're interested in the conditional probabilities. Can there be any valid reason for continuing to harp on a point that has no relevance to what we are talking about, and refusing to address the actual issue?
Richard says above "everyone ought to be able to agree on a distinction between P(A) and P(A|B)". So, Martin, are we talking about P(A) or P(A|B)? Harder question - do the "simple" solutions address P(A), or P(A|B)? If you think it's necessary, feel free to specify what interpretation of probability you're talking about. -- Rick Block (talk) 23:20, 10 April 2011 (UTC)
Rick, I am not harping on, I am asking you to make your arguments clearer by stating which interpretation of probability you are using. This is simply to help other editors understand exactly what you mean. You may know what you mean but Richard and I need to as well. I am quite ready to address the actual issue provide we agree at the start to use and stick one interpretation of probability. We may well arrive at the same conclusion whichever path we take but it will be an easier journey, for me at least, if we stick one philosophy throughout. Martin Hogbin (talk) 13:27, 11 April 2011 (UTC)
Regarding your question, let us agree to take a Bayesian perspective on Whitaker's question. You asked whether simple solutions calculate the value of P(win by switching)? I think we can all agree the answer is 'yes'.
Having said 'yes', I should point out that the simple solutions are in no way rigorous and include many assumptions that are not stated. As I think Boris said, there is a very rigorous for of mathematics in which even the most simple solutions become inordinately complex and completely unsuitable for anyone except mathematical specialists. For our expected readership and especially those who may read only the first part of the article, the simple solutions give, in my opinion, perfectly adequate calculation of P(win by switching).
If you ask whether the simple solutions calculate P(win by switching|player picks door 1 and host opens door 3) my answer would be the same except that they are marginally less good in that we do not actually state the obvious fact that as we have no information to allow us to distinguish between doors P(win by switching|player picks door 1 and host opens door 3) must be equal to P(win by switching). And as we have agreed that the simple solutions are satisfactory for calculating P(win by switching) we know that they also calculate P(win by switching|player picks door 1 and host opens door 3). It is obvious that, from a Bayesian perspective, P(win by switching|player picks door 1 and host opens door 3) = P(win by switching|player picks door 1 and host opens door 2) = P(win by switching). Martin Hogbin (talk) 14:00, 11 April 2011 (UTC)

Question to Glopk and to Rick and any others regarding success rates

Article understandable to the reader, desirably to grandma and grandson likewise. Therefore my question:

MHP –  "decision to switch or to stay"   –   gains and losses as results of effectively taken decisions of frequentists

a) decision based exclusively on the so-called flawed simple solution, after the host had shown a goat and thereafter had offered to switch to his still closed door
b) decision based exclusively on the so-called only correct conditional solution as per updated actual probability calculus

Please can anyone help me to find some reliable source that ever has determined the respective results of decisions to switch (resp. to stay), that effectively had been taken on solely "a", compared to solely "b" and thereafter has presented the specific results, for any combination of door chosen by the guest and door opened by the host likewise?

Is it conceivable that decisions that had exclusively been made based on the only correct conditional probability are of any higher success rate?

Is there evidence that specific decisions for any combination of doors, that had been made exclusively based on the only correct conditional probability "b", correctly updated for every show, has a higher success rate, compared to the specific results of decisions that have strictly been made on the so called flawed simple solution as per Marilyn vos Savant, i.e. to switch in the respective "actual" game show after the host has made his offer?   And if so, to which degree?  Because that must necessarily be added to the article. And otherwise, the contrary as well.

It is about the possible superiority of the so called conditional solution. Is its success rate really indeed so significantly higher for any specific combination of doors, as it is claimed by some?

Please help, because afaik there is no better possible decision than just "to switch" in the specific actual game, irrelevant of door numbers. Is there reliable evidence in reliable sources that "updated b" is of remarkably higher success rate than "a"? Otherwise "updated b" impossibly can be presented as a relevant topic here, but just the quite other aspect that the MHP is a suitable method to train conditional probability theorems.    Thank you. Gerhardvalentin (talk) 17:32, 11 April 2011 (UTC)

The simple solution shows that always switching is a much better strategy than always staying. The conditional solution shows that there is nothing better still. But of course nobody would imagine this, anyway. This is true if all doors are initially equally likely to hide a car. It doesn't matter whether or not the host has some bias. It's easy to see that the more biased the host is the more favourable it is to the player. And with a maximally biased host you can easily see (just work out the two possibilities in the Monty Crawl game) that switching is never unfavourable. Yet also in Monty crawl always switching only has success rate 2/3. This proves rigorously that there is no better strategy than always switching. (This argument was discovered by Lambiam, it is inspired by game theory, and it makes the explicit computations of Morgan et al. superfluous and allows you also to explain the biased host case to your Grandma or Grandpa. I have written it up in the longer version [2] of my article in the StatProb encyclopedia [3]. It's also written out on my University of Leiden home page. @Lambiam got a prize from me for finding this proof. [User:Gill110951|Richard Gill]] (talk) 16:48, 14 April 2011 (UTC)