Lucas Husted: How to outsmart the Prisoner's Dilemma

Two perfectly rational gingerbread men, Crispy and Chewy, are out strolling when they’re caught by a fox. Seeing how happy they are, he decides that, instead of simply eating them, he’ll put their friendship to the test with a cruel dilemma. He’ll ask each gingerbread man whether he’d opt to Spare or Sacrifice the other. They can discuss, but neither will know what the other chose until their decisions are locked in.

שני אנשי עוגיה רציונליים לגמרי, קריספי וצ’ואי, מטיילים בחוץ כשתופס אותם שועל. כשהוא רואה כמה הם שמחים, הוא מחליט, שבמקום פשוט לאכול אותם, הוא יעמיד את החברות שלהם במבחן עם דילמה אכזרית. הוא ישאל כל איש עוגיה אם הוא בוחר להציל או להקריב את האחר. הם יכולים לדון, אבל אף אחד לא ידע מה האחר יבחר עד שהבחירות שלהם יבוצעו.

If both choose to spare the other, the fox will eat just one of each of their limbs; if one chooses to spare while the other sacrifices, the sparer will be fully eaten, while the traitor will run away with all his limbs intact. Finally, if both choose to sacrifice, the fox will eat 3 limbs from each.

אם השניים יבחרו להציל את האחר, השועל יאכל רק אחת מהגפיים שלהם; אם אחד יבחר להציל בעוד השני להקריב, זה שחס יאכל לגמרי, בעוד הבוגד יברח עם כל גפיו שלמים. לבסוף, אם השניים יבחרו להקריב, השועל יאכל 3 גפיים מכל אחד.

In game theory, this scenario is called the “Prisoner's Dilemma.” To figure out how these gingerbread men will act in their perfect rationality, we can map the outcomes of each decision. The rows represent Crispy’s choices, and the columns are Chewy’s. Meanwhile, the numbers in each cell represent the outcomes of their decisions, as measured in the number of limbs each would keep:

בתאוריית המשחקים, התרחיש הזה נקרא “דילמת האסיר.” כדי להבין איך אנשי העוגיה האלה יפעלו ברציונליות המושלמת שלהם, אנחנו יכולים למפות את התוצאות של כל החלטה. השורןת מייצגת את הבחירות של קריספי, והעמודות מייצגות את של צ’ואי. בינתיים, המספרים בכל תא מייצגים את התוצאות של ההחלטה שלהם, כמו שהן נמדדות במספר האברים שכל אחד ישמור:

So do we expect their friendship to last the game?

אז האם אנחנו מצפים שהחברות שלהם תשרוד את המשחק?

First, let’s consider Chewy’s options. If Crispy spares him, Chewy can run away scot-free by sacrificing Crispy. But if Crispy sacrifices him, Chewy can keep one of his limbs if he also sacrifices Crispy. No matter what Crispy decides, Chewy always experiences the best outcome by choosing to sacrifice his companion. The same is true for Crispy.

ראשית, בואו נשקול את האופציות של צ'ואי. אם קריספי יחוס עליו, צ’ואי יכול לברוח שלם על ידי הקרבת קריספי. אבל אם קריספי יקריב את עצמו, צ’ואי יכול לשמור את אחת הגפיים שלו אם הוא גם יקריב את קריספי. לא משנה מה קריספי יחליט, צ’ואי תמיד חווה את התוצאה הטובה ביותר על יד בחירה בלהקריב את חברו. אותו הדבר נכון לקריספי.

This is the standard conclusion of the Prisoner's Dilemma: the two characters will betray one another. Their strategy to unconditionally sacrifice their companion is what game theorists call the “Nash Equilibrium," meaning that neither can gain by deviating from it.

זו המסקנה הסטנדרטית של דילמת האסיר: שתי הדמויות יבגדו אחת בשניה. האסטרטגיה שלהם להקריב את חברייהם ללא תנאי היא מה שתאורתיקני משחקים קוראים לו “שווי משקל נאש,” מה שאומר שאף אחד לא יכול להרוויח מלסטות מזה.

Crispy and Chewy act accordingly and the smug fox runs off with a belly full of gingerbread, leaving the two former friends with just one leg to stand on.

קריספי וצ'ואי פועלים בהתאם והשועל הזחוח רץ עם בטן מלאה עוגיות גי‘נג’ר, ומשאיר את שני החברים רק עם רגל אחת לעמוד עליה.

Normally, this is where the story would end, but a wizard happened to be watching the whole mess unfold. He tells Crispy and Chewy that, as punishment for betraying each other, they’re doomed to repeat this dilemma for the rest of their lives, starting with all four limbs at each sunrise. Now what happens?

באופן נורמלי, פה הסיפור שלנו יסתיים, אבל קוסם צפה במקרה בכל הבלאגן המתרחש. הוא אומר לקריספי וצ’ואי, שכעונש על הבגידה אחד בשני, הם ידונו לחזור על הדילמה במשך שארית חייהם, כשהם מתחילים עם כל ארבע הגפיים כל זריחה. עכשיו מה קורה?

This is called an Infinite Prisoner’s Dilemma, and it’s a literal game changer. That’s because the gingerbread men can now use their future decisions as bargaining chips for the present ones. Consider this strategy: both agree to spare each other every day. If one ever chooses to sacrifice, the other will retaliate by choosing “sacrifice” for the rest of eternity. So is that enough to get these poor sentient baked goods to agree to cooperate?

זו נקרא דילמת האסיר האין סופית, והיא משנת משחק לגמרי. זה בגלל שאנשי העוגיה יכולים עכשיו להשתמש בבחירות העתידיות שלהם כקלף מיקוח לנוכחיות. שקלו את האנסטרטגיה הזו: השניים מסכימים לחוס אחד על השני כל יום. אם אחד יבחר להקריב, האחר יגיב בבחירת ההקרבה למשך שארית החיים. אז האם זה מספיק כדי לגרום למוצרי האפיה בעלי ההכרה להסכים לשתף פעולה?

To figure that out, we have to factor in another consideration: the gingerbread men probably care about the future less than they care about the present. In other words, they might discount how much they care about their future limbs by some number, which we’ll call delta. This is similar to the idea of inflation eroding the value of money. If delta is one half, on day one they care about day 2 limbs half as much as day 1 limbs, day 3 limbs 1 quarter as much as day 1 limbs, and so on.

כדי להבין את זה, אנחנו צרכים להכניס שיקול נוסף: לאנשי העוגיה כנראה אכפת מהעתיד פחות משאכפת להם מההווה. במילים אחרות, הם אולי יפחיתו בכמה אכפת להם בנוגע לגפיים העתידיות שלהם במספר מסויים, שנקרא לו דלתא. זה דומה לרעיון של אינפלציה ששוחקת את ערך הכסף. אם דלתא הוא חצי, ביום הראשון הם דואגים לגפיים של היום השני חצי מהגפים של היום הראשון, גפים של היום השלישי רבע משל היום הראשון, וכך הלאה.

A delta of 0 means that they don’t care about their future limbs at all, so they’ll repeat their initial choice of mutual sacrifice endlessly. But as delta approaches 1, they’ll do anything possible to avoid the pain of infinite triple limb consumption, which means they’ll choose to spare each other. At some point in between they could go either way. We can find out where that point is by writing the infinite series that represents each strategy, setting them equal to each other, and solving for delta.

דלתא של 0 משמעה שלא אכפת להם מהגפיים בעתיד בכלל, אז הם חוזרים על הבחירה שלהם של הקרבה הדדית לתמיד. אבל כשדלתא מתקרבת ל 1, הם יעשו כל מה שאפשרי כדי להמנע מהכאב של אכילת שלוש גפיים תמידית, מה שאומר שהם יבחרו לחוס אחד על השני. בנקודה מסויימת באמצע הם יכולים לבחור בכל דרך. אנחנו יכולים למצוא איפה הנקודה הזו על ידי כתיבת הסדרה האין סופית שמייצגת כל אסטרטגיה, הצבתם כשווים אחד לשני, ופתירה לדלתא.

That yields 1/3, meaning that as long as Crispy and Chewy care about tomorrow at least 1/3 as much as today, it’s optimal for them to spare and cooperate forever.

זה מוביל ל 1/3, מה שאומר שכל עוד לקריספי וצ’ואי אכפת מהמחר לפחות 1/3 כמו היום, זה אופטימלי עבורם לחוס ולשתף פעולה לתמיד.

This analysis isn’t unique to cookies and wizards; we see it play out in real-life situations like trade negotiations and international politics. Rational leaders must assume that the decisions they make today will impact those of their adversaries tomorrow. Selfishness may win out in the short-term, but with the proper incentives, peaceful cooperation is not only possible, but demonstrably and mathematically ideal.

הניתוח הזה לא יחודי לעוגיות וקוסמים; אנחנו רואים אותו מתרחש במצבים בחיים האמיתייים כמו שיחות סחר ופוליטיקה בין לאומית. מנהיגים רציונליים חייבים להניח שההחלטות שהם עושים היום ישפיעו על אלו של היריבים שלהם מחר. אנוכיות אולי תזכה בטווח הקצר, אבל עם תמריצים מתאימים, שיתוף פעולה לא רק אפשרי, אלא אפשר להראות ולהוכיח מתמטית שהוא אידאלי.

As for the gingerbread men, their eternity may be pretty crumby, but so long as they go out on a limb, their friendship will never again be half-baked.

ובנוגע לאנשי העוגיה, הנצח שלהם אולי יהיה די מפורר, אבל כל עוד הם ינסו, החברות שלהם לעולם לא תהיה חצי אפויה.

So do we expect their friendship to last the game?

אז האם אנחנו מצפים שהחברות שלהם תשרוד את המשחק?

Crispy and Chewy act accordingly and the smug fox runs off with a belly full of gingerbread, leaving the two former friends with just one leg to stand on.

That yields 1/3, meaning that as long as Crispy and Chewy care about tomorrow at least 1/3 as much as today, it’s optimal for them to spare and cooperate forever.

As for the gingerbread men, their eternity may be pretty crumby, but so long as they go out on a limb, their friendship will never again be half-baked.

ובנוגע לאנשי העוגיה, הנצח שלהם אולי יהיה די מפורר, אבל כל עוד הם ינסו, החברות שלהם לעולם לא תהיה חצי אפויה.

Lucas Husted: How to outsmart the Prisoner's Dilemma

Lucas Husted: How to outsmart the Prisoner's Dilemma

Related talks

Lucas Husted: Game theory challenge: Can you predict human behavior?

Jeff Dekofsky: The Infinite Hotel Paradox

Jac de Haan: Why do competitors open their stores next to one another?

Related talks

Lucas Husted: Game theory challenge: Can you predict human behavior?

Jeff Dekofsky: The Infinite Hotel Paradox

Jac de Haan: Why do competitors open their stores next to one another?