The Obvious Problem That No One Can Agree On

VeritasiumVeritasium
Education4 min read26 min video
Mar 9, 2026|2,517,397 views|96,373|36,066
Save to Pod

Key Moments

TL;DR

Newcomb's Paradox: One box ($1M) or two ($1k + mystery box). Predictors are highly accurate, but your choice is already made.

Key Insights

1

Newcomb's Paradox presents a choice between a guaranteed $1,000 plus a potentially empty or million-dollar box, or just the mystery box.

2

The paradox highlights a fundamental disagreement on decision-making: evidential decision theory (trusting the predictor's accuracy) vs. causal decision theory (acting on what you can influence).

3

One-boxers prioritize the predictor's accuracy, believing the million dollars is already in the box based on its successful predictions.

4

Two-boxers focus on strategic dominance, arguing that their current choice cannot alter the past setup of the boxes, making taking both always more profitable.

5

The paradox touches on concepts of free will, rationality, and the nature of commitment, with no universally agreed-upon correct answer.

6

Real-world parallels exist in nuclear deterrence and game theory, where pre-commitment to a seemingly irrational action can lead to a better overall outcome.

THE SETUP: TWO BOXES, ONE AMBIGUOUS CHOICE

The core of Newcomb's Paradox involves a supercomputer that predicts human choices with extreme accuracy. The participant faces two boxes: one open with $1,000 and a mysterious, closed box. The predictor has either placed $1 million in the mystery box (if it predicted the participant would choose only the mystery box) or nothing (if it predicted the participant would choose both). The choice is between taking only the expected $1 million or taking both the $1,000 and whatever is in the mystery box, which could be nothing.

TWO CAMPS: THE DIVIDE ON RATIONAL DECISION-MAKING

This paradox creates a stark division among people, forcing them into two camps: 'one-boxers' and 'two-boxers.' While the choice seems trivial to each side, they strongly disagree on the obviousness of the solution. This fundamental disagreement stems from differing philosophical approaches to decision theory and how one factors in the predictor's accuracy against the immediate consequences of one's actions.

THE CASE FOR ONE-BOXING: EVIDENTIAL DECISION THEORY

One-boxers argue that the predictor's proven accuracy is paramount. They accept that the predictor has likely made the correct prediction before they even entered the room. Therefore, if they choose only the mystery box, it's because the predictor foresaw this and placed $1 million inside. This approach aligns with evidential decision theory, where the evidence of the predictor's success informs the expected utility of the choice, leading to the conclusion that taking one box offers the highest expected payout.

THE CASE FOR TWO-BOXING: CAUSAL DECISION THEORY

Two-boxers employ causal decision theory, asserting that their current choice cannot retroactively alter the past. The boxes are already set, meaning the money is either in the mystery box or it isn't, regardless of their immediate decision. From this perspective, taking both boxes is always the dominant strategy, as it guarantees an extra $1,000. If the mystery box is empty, they get $1,000; if it contains $1 million, they secure $1,001,000. Their decision is based on what they can directly influence now.

BROADER IMPLICATIONS: FREE WILL, RATIONALITY, AND COMMITMENT

Newcomb's Paradox extends beyond a simple financial choice, probing deeper questions. It challenges our understanding of free will: if a perfect predictor exists, does that negate free will? It also questions the definition of rationality, showing how a rational act might not always align with a rational person, or how rationality at an individual level can differ from societal rationality. The paradox also introduces the idea of pre-commitment, suggesting that the ability to commit to an action beforehand, even an apparently suboptimal one, can lead to a more desirable outcome.

REAL-WORLD ANALOGIES: DETERRENCE AND GAME THEORY

The principles behind Newcomb's Paradox are mirrored in real-world scenarios like nuclear deterrence and game theory. Mutually Assured Destruction (MAD) relies on a credible commitment to retaliate, even if launching a counter-attack results in self-destruction. Similarly, the game of chicken involves demonstrating an inability to swerve to force the opponent to yield. In these cases, a pre-commitment to a seemingly irrational or destructive action serves as a powerful deterrent, ultimately leading to a more stable or beneficial outcome.

THE POWER OF PRE-COMMITTMENT AND IDEAL RULES

The video suggests that a rational approach might involve deciding on overarching rules to live by, rather than making isolated decisions. If one could 'wire' themselves to adhere to certain principles, the optimal pre-commitment in Newcomb's Paradox would be to be the kind of person who chooses one box. This reframing emphasizes that consistent adherence to chosen principles, even when it seems disadvantageous in a single instance, can lead to better long-term outcomes, similar to how cooperation is beneficial in repeated games.

CORRELATION VERSUS CAUSATION: THE ULTIMATE QUESTION

Ultimately, Newcomb's Paradox hinges on the interpretation of the predictor's accuracy. Is it merely a strong correlation, or does it imply causation that should influence one's decision? The video posits that while there may not be one single 'correct' answer, the problem reveals much about how we process information, make decisions, and what we value. It encourages viewers to think critically about the assumptions underlying their choices and the nature of rationality itself.

Newcomb's Paradox: Expected Outcomes

Data extracted from this episode

ChoiceMystery Box ContentOutcome (Box + Mystery Box)Decision Theory
One Box$0$0Causal Decision Theory
One Box$1,000,000$1,000,000Causal Decision Theory
Two Boxes$0$1,000Causal Decision Theory
Two Boxes$1,000,000$1,001,000Causal Decision Theory
One Box$1,000,000$1,000,000Evidential Decision Theory (assuming 100% accuracy)
Two Boxes$0$1,000Evidential Decision Theory (assuming 100% accuracy)

Prisoner's Dilemma Payoffs

Data extracted from this episode

Player 1 / Player 2CooperateDefect
Cooperate3 coins eachPlayer 1: 5 coins, Player 2: 0 coins
DefectPlayer 1: 0 coins, Player 2: 5 coins1 coin each

Common Questions

Newcomb's Paradox is a thought experiment involving a supercomputer that predicts your choice between taking one or two boxes of money. If it predicts you'll take only one, it puts $1 million in it; otherwise, it puts nothing.

Topics

Mentioned in this video

More from Veritasium

View all 90 summaries

Found this useful? Build your knowledge library

Get AI-powered summaries of any YouTube video, podcast, or article in seconds. Save them to your personal pods and access them anytime.

Try Summify free