The Monty Hall Problem, Proven with Python (and Math)

4 min readAug 26, 2021

image source: https://paulvanderlaken.com/2020/04/14/simulating-visualizing-monty-hall-problem-python-r/

When the movie 21 came out in 2008, I remember one scene that never quite sat right with me. MIT statistics professor Micky Rosa (played, unfortunately, by Kevin Spacey) offers an extra credit problem to his lecture hall. He demonstrates what in probability theory is known as the Monty Hall problem, named after the Let’s Make a Deal game show host. Professor Rosa tells Ben Campbell (played by Jim Sturgess) that Ben is a contestant on a game show. There are 3 doors. Hidden behind one of them is a new car. The other two? Goats. Ben picks door #1.

This is where things get interesting. The game show host (who knows what’s behind each door) decides to reveal that door #3 is hiding a goat. Ben has a second chance to choose a door. He can either stick with his first choice, door #1, or switch to door #2. Ben chooses to switch. When asked “so how do you know [the host] isn’t trying to play a trick on you? Trying to use reverse psychology to get you to pick a goat?”, Ben explains that it doesn’t matter. His answer is based on statistics, variable change, specifically. When he initially picked a door, he had a 33.3% chance of picking the car. Now that the host has taken away the last door, he’s got a 66.7% if he chooses to switch.

For years, this seemed like the least intuitive answer to me. Each door has an equal chance of having a car. What does it matter what the last door had? It should be 50/50 now. I’m the kind of person who needs to see something proven myself before I can fully commit to believing it. So that’s what I set out to do. The math goes roughly like this:

Q.E.D.
But I still wasn’t totally happy with this explanation. I wanted to see it in action. So, for you more computer-science oriented folks, the following Python code should be easy to follow and reproduce on your own machine.

import numpy as np
np.random.seed(21)doors = ['1', '2', '3']
second_choices = ['stay', 'switch']# simulation function
def simulation(first_choice='1', second_choice='stay', n_trials=10_000):
    win_count = 0
    
    for i in range(n_trials):
        # putting car behind door A, B, or C at random (w/ equal prob for each)
        car_location = np.random.choice(doors)
        
        # host must show door with a goat that we haven't chosen already
        door_shown = np.random.choice([door for door in doors if door not in (first_choice, car_location)])if second_choice == 'stay':
            final_door = first_choice
        else: # switch
            final_door = np.random.choice([door for door in doors if door not in (first_choice, door_shown)])
    
        if final_door == car_location:
            win_count += 1win_ratio = win_count / n_trials
    
    print(f'Win ratio for initially choosing door #{first_choice} then {second_choice}ing is {win_ratio}')
    return

To finally give me the output

Win ratio for initially choosing door #1 then staying is 0.329
Win ratio for initially choosing door #2 then staying is 0.3331
Win ratio for initially choosing door #3 then staying is 0.3369
Win ratio for initially choosing door #1 then switching is 0.6673
Win ratio for initially choosing door #2 then switching is 0.6653
Win ratio for initially choosing door #3 then switching is 0.6626

We can reasonably infer from the above results that the underlying probability of being correct when staying is 1/3 whereas the probability of being correct when switching is 2/3. The Central Limit Theorem comes into play here. We probably would have been fine to only run a a few dozen trials on each, as the average value of these samples will soon approach the mean, but our findings are further validated by the 10,000 simulations ran on each of the six game options.

In conclusion, as unintuitive as it may seem, we have proven through both mathematical reasoning and a Python simulation that the answer to the Monty Hall problem is to always switch doors. Q.E.D.

The Monty Hall Problem, Proven with Python (and Math)

Sign up to discover human stories that deepen your understanding of the world.

Free

Membership

Written by M S

Responses (1)

More from M S

How to Create a Choropleth Map in Python (without GeoPandas, GiS, or any GeoJSON knowledge)

I don’t know if it’s because GeoPandas is poorly maintained or that most people can’t be bothered and simply use GiS software for this type…

Web Scraping, Natural Language Processing, and Classification on Reddit

Despite the fact that online forums, a once essential part of internet culture, have all but died out, it seems that reddit it having no…

My Career Switch Towards Data Science

I’ve always been a bit of a numbers person. Math was always my favorite class in school. The homework hardly felt like work. I was always…

A Quick and Customizable Function to GridSearch Over ARIMA Model Parameters in Python

One of the things that I came to discover during the process of modelling time series is that the most time consuming part is choosing how…

Recommended from Medium

🚅 Information Theory for People in a Hurry

A quick guide to Entropy, Cross-Entropy and KL Divergence. Python code provided. 🐍

Solving Max-Cut Problems with D-Wave Quantum Annealing

Split the Network for Maximum Gain!

Lists

Coding & Development

Predictive Modeling w/ Python

Practical Guides to Machine Learning

ChatGPT

This Is How Tesla Will Die

The vultures are circling the tech giant.

How Does Our Sense of Humor Change With Age? A Statistical Analysis

How do our comedic sensibilities form and transform over time?

What is Catastrophe Theory?

Like Plato’s cave allegory, the dramatic phenomena we observe are just shadows of an unseen reality

Quantum Diary #1 — How I’m planning to self learn Quantum Physics and Computing in 1 year

Yes, two of the most difficult subjects in the entire world and my overconfident ass decides “Hm, wouldn’t it be cool if i learnt it, all…