Dominance in the Monty Hall Problem

Reading time: 5 minute
...

📝 Original Info

  • Title: Dominance in the Monty Hall Problem
  • ArXiv ID: 1106.0833
  • Date: 2011-11-24
  • Authors: Alexander Gnedin

📝 Abstract

Elementary decision-theoretic analysis of the Monty Hall dilemma shows that the problem has dominance. This makes possible to discard nonswitching strategies, without making any assumptions on the prior distribution of factors out of control of the decision maker. A path to the Bayesian and the minimax decision-making environments is then straightforward.

💡 Deep Analysis

📄 Full Content

Suppose there is a set of 'doors' D with at least three elements. One door θ ∈ D is a winning door. You will be asked to choose a door x ∈ D, and then all doors except one door y ∈ D \ {x} will be revealed as not winning. Then you will be asked to guess if x = θ (action match with door x) or x = θ (action switch to door y). You win if your guess is correct.

The case of three doors is widely known as the Monty Hall problem. The observation of this note is that the problem has dominance, which makes sense of the question ’to switch or not to switch’ even in the situations when no prior distributions are assigned to the parameters out of control of the decision maker. The idea is quite simple: if the guessing strategy chooses x and in some situation plays match, so does not switch to y, then another strategy which chooses y and plays switch all the time is at least as good as the first whichever θ ∈ D. This simplistic view is basically right, modulo subtleties involved in the formal definition of ‘strategy’ and ‘situation’

A strategy is a pair (x, a x ) where x ∈ D and a x is a function of y ∈ D \ {x} with values a x (y) ∈ {match, switch}. For constant functions we call (x, match) and (x, switch) single-action strategies, and call (x, switch) always-switching strategy.

The decision-making environment is specified by two parameters. One parameter is the winning door θ ∈ D. Another parameter is the door y to which switching can occur, which is a function d θ of x ∈ D , that must be admissible in the sense that d θ (x) = θ off the diagonal θ = x, and d θ (x) = θ on the diagonal θ = x.

Let W (θ, x, match) := 1(x = θ) and W (θ, x, switch) := 1(x = θ), where 1(• • • ) denotes indicators. A strategy (x, a x ) is evaluated by the win-ornothing payoff function W (θ, x, a x (d θ (x))). Note that the payoff does not depend on y = d(θ, x) explicitly. The payoff of each single-action strategy depends on y neither explicitly nor implicitly.

Theorem The class of always-switching strategies is weakly dominant, that is for every strategy (x, a x ) there exists an always-switching strategy

for all θ ∈ D and all admissible functions d θ .

Proof. If a x is not always-switching, there exists x ′ = x with a x (x ′ ) = match. Consider the strategy (x ′ , switch). For θ = x ′ we have

For θ = x ′ we have W (x ′ , x ′ , switch) = 0, but then also

In the Bayesian setting of the decision problem the winning door Θ and the door Y to which switching is offered are random variables with given probability distributions, with distribution of Y defined conditionally on the decision variable x and Θ, with account of the rule Y = Θ in the event Θ = x. Assuming D finite, the distributions can be specified by a probability mass function p θ and transition probabilities q θ,y for θ ∈ D, y ∈ D{θ}. A Bayesian strategy is the always-switching strategy (θ * , switch), where θ * minimizes p θ . The optimality readily follows by first noting that by the dominance the class of always-switching strategies is optimal, then noting that the winning probability with (x, switch) is 1 -p x , i.e. the probability of the event Θ = x that the winning door is not the one you have chosen before facing the ’to switch or not to switch’ dilemma.

Let n be the cardinality of D (n ≥ 3). Since max x∈D (1 -p x ) ≥ (n -1)/n it is clear that in the worst-case the distribution of Θ is uniform on D, and the minimax probability of winning the prize is v := (n -1)/n. If Θ is uniform, then every always-switching strategy yields v, no matter what is the distribution of Y . On the other hand, if the decision-maker applies the mixed strategy (X, switch) with X uniformly distributed, then the probability to win is v for arbitrary (Θ, Y ) (independent of X). It follows that in the zerosum setting of the decision problem, every saddle-point solution has the form (X, switch)-versus-(Θ, Y ), with X and Θ uniformly distributed and the law of Y determined by arbitrary transition probabilities. The value of this game is then v = (n -1)/n.

The Monty Hall problem (MHP) and its numerous variants, all involving probabilities, are summarized on the Wikipedia MHP page. We refer to [1] for a recent critical analysis of the textbook solutions and the history of the MHP.

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut