Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
The pirate game (wikipedia.org)
298 points by dbalan on Jan 13, 2016 | hide | past | favorite | 135 comments


Steven Landsburg has pointed out that the problem is a bit underdetermined, and the usual solution is far from unique. For example, "the first pirate gets all the coins and everyone else votes yes" is a subgame-perfect equilibrium which is just as good as the usual solution, because no pirate has an incentive to deviate unilaterally (one vote can't change the voting outcome). There are many other equilibria like that. The moral of the story is that games with more than two players seldom have unique solutions, unless you make very strong and explicit assumptions.

Another problem which puzzles me even more is dividing a cake among three people by majority vote. Let's say Alice and Bob make an agreement where each of them gets half of the cake and Carol is left out. But that's unstable, because Carol can offer Bob a different agreement where Bob gets 60%, Carol gets 40%, and Alice is left out. Since Bob gets 10% more than before, he has an incentive to switch. But that's unstable as well, because now Alice can make a similar offer to Carol, etc. In the end there's no possible setup that everyone will stick with. As far as I know, this problem is still mostly open (though a few advances have been made).

To sum up, the idea of "perfectly rational" decision-making is surprisingly difficult to nail down. It's been definitively solved only for the case of two-player zero-sum games. When the game is not zero-sum, you get complications like bargaining and equilibrium selection, and when you have more than two players, you get an explosion of complexity without any clear-cut answers.


> Steven Landsburg has pointed out that the problem is a bit underdetermined, and the usual solution is far from unique. For example, "the first pirate gets all the coins and everyone else votes yes" is a subgame-perfect equilibrium which is just as good as the usual solution, because no pirate has an incentive to deviate unilaterally (one vote can't change the voting outcome). There are many other equilibria like that. The moral of the story is that games with more than two players seldom have unique solutions, unless your assumptions are very strong and explicit.

I don't follow. If pirate A assigns themself all the coins, the other pirates may as well vote against the plan - doing so risks nothing, and might result in them getting more than 0 coins if others happen to also vote against.


I don't follow either. In addition to what you say, the rules state that, all else being equal, pirates would prefer other pirates to die. Pirates would prefer A to die, so they would vote against this proposal.


Right, so you need a more restrictive equilibrium concept than subgame-perfect equilibrium. It's a good math exercise to try fleshing out exactly which concept is implied by your common sense reasoning (which I'm not arguing against!) Some folks have suggested trembling hand equilibria and other ideas, none of which are obvious from a naive reading of the problem.

Maybe it's more helpful to think about such problems in terms of "how would I write a program to solve all problems in this class?", rather than "how would I act using common sense in this particular problem?" That often makes things clearer.


> "...no pirate has an incentive to deviate unilaterally (one vote can't change the voting outcome)..."

I would strongly challenge that assumption as having any realism (even for perfectly rational agents and no communication): if an agent decided to vote, he could assume all other perfectly rational agents (another usual assumption) would also take the same decision (by symmetry), so he can safely vote yes. This symmetry could only be broken if the agents have access to randomness.


Right, so we need to figure out what kinds of reasoning by symmetry are allowed under perfect rationality (in all possible games, not just this one). For example, should a rational person cooperate in the Prisoner's Dilemma because the opponent will be forced to cooperate by symmetry? In that case, won't irrational people eat rational people for breakfast, thus devaluing the idea that rational behavior maximizes utility?

If you take the time to think it though, you might well reinvent superrationality, updateless decision theory and other fascinating things. In fact, you'll quickly get to questions that I have no idea how to solve!


Yes I don't think this assumption makes a lot of sense in general if you don't assume your perfectly rational players have access to randomness (I do think it's the best assumption if the players can't communicate or have any entropy).

In the case of Prisoner's Dilemma, if the cooperate/defect outcome had sufficiently large cooperation bonus for fixed other payoffs (large enough T>>R in wikipedia's notation), then with my assumptions it's clear that each prisoner should flip a coin to decide, and each one gets ~T/4 expected payoff.

But the most glaring problem is of course assuming every player is perfectly rational. Personally I'd only assume that if your players are all game theorists with enough of time and paper :) I probably wouldn't even assume myself as rational.

In conclusion, I believe equating maximin with optimal play/perfect rationality is misguided, but maximin is a good safe bet.


What do you mean, "in all possible games"? The assumptions that hold for this game don't hold for the Prisoner's Dilemma.

Like someone else said, you seem to be speaking in jargon. "Superrationality", "updateless decision theory", none of this matters in order to understand the Pirate Game. The rules of the game are very simple, maybe the pirates thing is throwing you off? That's just a theme for the puzzle (people aren't rational like this, and of course pirates aren't, and the whole situation is extremely artificial and unrealistic). The non-intuitive solution is derived from the rules as stated without any need for additional theory.


This is Hofstadter's SuperRationality. It's main contribution isnto highlight the arbitrary simplifying assumptions of traditional game theory


Ah thanks it's exactly what I was thinking. It clearly isn't a good model for games in real life, but yes it does show the assumptions of traditional game theory can be misleading. Optimal play can only be defined with relation to your knowledge of the precise inner workings of your opponents.


I still don't understand, you seem to be responding with jargon.

The point made was that your 'equilibrium' is not valid because a pirate voting that way decreases their expected return. I don't see how you need a different 'equilibrium concept' for your proposed solution, when the basic assumption of rationality excludes it.

Not trying to be mean, but your response seems obfuscatory.


If everyone votes "yes", one pirate deciding to vote "no" doesn't affect their expected return. Therefore everyone voting "yes" is a subgame-perfect equilibrium.

Ideas like Nash equilibria and subgame-perfect equilibria are the only known formalizations of rational behavior in multiplayer games. I'm not obfuscating, I just don't know any other kind of math that would work...


> If everyone votes "yes", one pirate deciding to vote "no" doesn't affect their expected return.

If everyone knows what everyone else is going to do, then neither option is better or worse, sure. But where did you get this perfect knowledge from? Certainly you couldn't deduce your situation.

And why is 'no' your default choice: this isn't a repeated game. You're trying to decide what to vote, you don't have a mind to change, at this stage. And neither does anyone else. Therefore, if everyone knows I can choose to vote either way, your situation is irrational.

You're objecting to a problem that isn't the one being considered.

It may well be that there is a compelling mathematical argument you're trying to evoke, but at the moment you're not actually putting it forward. You're name-dropping, but not explaining. Well, not in a way I can follow, anyway.


Man, I agree with you. This kind of reasoning is bizarre, how could you possibly know the strategies of the other players? And yet it's right. Read the first sentence of https://en.wikipedia.org/wiki/Nash_equilibrium, which is brilliant in its simplicity:

> In game theory, the Nash equilibrium is a solution concept of a non-cooperative game involving two or more players, in which each player is assumed to know the equilibrium strategies of the other players, and no player has anything to gain by changing only their own strategy.

Many people have tried to define mathematically what "rational behavior" means when there are multiple players. Nash equilibrium is the one idea that stayed. It's the first chapter of every game theory textbook. Since the 1950s, literally all publications about non-cooperative games have been based on Nash equilibrium. To understand why, you'd have to look at the history of the field, or you could take my word for it. If it's any consolation, it was very unintuitive for me at first, too.


Oh don't get me wrong, my objection is not because I don't understand Nash Equilibria (my PhD was in evolutionary dynamics, so it's been a while, and I was never a game theoretician, but I'm not totally ignorant of the field). My problem is that the mathematical tools you are name checking don't apply in this case. Even your quote itself says 'changing ... strategy', and goes on:

"If each player has chosen a strategy and no player can benefit by changing strategies while the other players keep theirs unchanged..."

which is not the situation in this case.

Strategies for infinite games are not necessarily optimal if the game is finite, and vice versa.


I'm sorry, now I don't follow. If not subgame-perfect equilibrium, what do you think is the right solution concept for the pirate game? Maybe it doesn't have a name, but can you describe it (at least informally)?


Can you explain a little more how this subgame-perfect equilibrium should be reached? From the game's description it appears that, all other things equal, I as a pirate reject a proposed coin distribution if I could kill the proposer. Thus, if offered 0 coins, I would always choose to reject that plan.


Well, one pirate changing their vote to "no" wouldn't actually kill anyone. So it sounds like you want a utility function over outcomes (prefer outcomes where someone is killed) and another, separate utility function over actions (prefer voting to kill someone even if that won't actually kill them).

That's a pretty unusual setup, because game theory usually just gives you a utility function over outcomes and lets you derive strategic behavior from that, without fiddling with individual actions. Though I guess most people solving the problem have tried to merge the two utility functions somehow? That's probably the root of the disagreement in this thread...


You keep saying "change their vote to no". Perhaps it would be educational why you think they should default to "yes" in the first place when offered 0 coins. It really doesn't seem that the problem you see necessarily arises from this specific set of rules for this game. Offered 0 --> vote no; no would at worst be equal and at best be better than yes.


The point is that everyone voting yes is a possible equilibrium solution (along with the "normal" solution), so a priori there's no rule that tells you which one is the equilibrium the pirates would reach.


Ok, I guess I need to understand how this is an equilibrium and read more carefully what the rules of this game are exactly. Because from reading it once I understood that pirates are supposed to decide on a proposed distribution. And according to my understanding, a pirate would never vote YES if offered 0. Therefore, I seem to not see how the situation we are discussing is a viable equilibrium. (I do see how once in this "state" unilateral change of votes means no advantage to any one player, but is this really interesting for this game, i.e. is it a viable outcome of it despite being stable once reached? Maybe I have it all wrong though.)


> I do see how once in this "state" unilateral change of votes means no advantage to any one player

Right, so it meets the definition of an equilibrium.

> but is this really interesting for this game, i.e. is it a viable outcome of it despite being stable once reached?

In "reality" no. But the Nash Equilibrium formalism gives us no way of determining which of the two possible equilibria (this one and the usual solution) will happen in practice. So you need some more powerful idea to solve that (see references to "superrationality" and "trembling hand" elsewhere in this thread), and none of those theories is mathematically complete.


Yep okay. So is what this then shows, really, that a Nash equilibrium is not so meaningful for this game?


It's meaningful, but it's insufficient to completely solve it.


> And according to my understanding, a pirate would never vote YES if offered 0.

A pirate will vote YES if offered 0 when the alternative is getting killed in one of the following turns. Pirates always prefer to live first, and only then they prefer to maximize their money.


what "following turns"?

edit: to clarify, the only pirate that would have to consider being killed is the one making the offer. thus yes, offering yourself 0 is possible and in fact seems to be a requirement for the (one of the) stable solutions. But if I'm not most senior, I would never accept 0.


The game has turns in theory: each split proposal is one turn. The most senior pirate proposes a split; if it's not accepted, he gets killed and the next pirate proposes a split. That's the next "turn", assuming the first split gets rejected.

Theoretically, a pirate could accept a seemingly inconvenient proposal if he evaluates that rejecting it would result in himself getting killed further down the line, when it's his own turn to propose a split.

You're right that this never happens in the 5-pirate game as stated (I'm not sure about the general N-pirate, C-coins game). The pirates who are offered 0 coins reject the split, but they don't get to alter the result anyway.


No, the point being made is there is no 'equilibrium' because the game is not repeated.

Equilibrium meaning 'if everyone votes this way, there is no benefit for any individual to vote differently' - that doesn't apply when the pirates are trying to determine what to vote for the one and only time they'll ever vote on this question, not knowing what anyone else will be voting.


The idea of Nash equilibrium isn't just for repeated games, in fact it was invented for one-shot games. (Now I understand why you keep mentioning "infinite games"...)


It isn't, but it requires knowledge of the other strategies: there is at least a 'starting' strategy. That isn't the case here.

Which is why you've been unable to express your disagreement with the puzzle without resorting to language of 'changing' votes, or assuming that you know what other people are going to vote. None of which are reasonable assumptions here.


The solution is contained in the post. I can't find a problem with it. It was you who were suggesting this is a naive analysis.

Equilibria just seem to me to be the wrong tool for the job, you have to (as you have) invent a related infinite game with additional axioms and expectations before that kind of analysis kicks in.

Check your responses. If you can't describe what you're trying to say without mentioning anyone 'changing' their strategy, or assuming that a strategy is default, then you've got the wrong framework, I'd suggest (modulo my admitted lack of specific expertise).


This clip [1], from the UK game show "Golden Balls", is a very entertaining example of a two-player zero-sum game

EDIT: As explained in the replies, this is not actually a zero-sum game.

[1] https://www.youtube.com/watch?v=S0qjK3TWZE8


The comments mention that the rules were changed to disallow out-of-band bargaining after this event, which is important because this is one of the core solutions to the Prisoner's Dilemma (which is what they're playing, and is not a zero-sum game). The solution in general is contract law.

The verbal agreement "If you do this and I do that then I will split the money 50/50 with you after the show" is a verbal contract which (due to the cameras recording the event) is nevertheless indisputable. You are giving them a task to accomplish in exchange for a promised financial reward. They have all rights to sue you in court if they did what you asked and you didn't pay them for it.

What's really interesting is that he promises to split 50/50. In theory a rational actor who could commit to these things absolutely would say, "Look, I am pre-committing to steal, and I will give you $100 after the game if you choose split." Then it is still rational for the other guy to choose "split" (getting $100) rather than "steal" (getting 0). However most people, in study after study, will be insulted by the disparity of only getting 1% of the winning, and the insult is magnified by the camera: it is public shame. So in some sense being insulted by unfairness introduces out-of-band costs that also tend to pull you to splitting the money, and splitting it more fairly.


Basically there are qualitative reputation costs at play, which is why it's not really a zero sum.

An anonymous, blind experiment would be more interesting, but less entertaining.


There's decision-theoretic reasons to be the sort of person who refuses unfair bargains that make you slightly better off. Specifically, people will give you much fairer bargains, because otherwise you might refuse them.


By the way, in some cultures the opposite happened (they viewed being given large sums as an obligation they could never repay and were displeased with them).


That's not a zero-sum game, though. The total money awarded to both players isn't the same in all outcomes. That's the whole reason why the game isn't purely adversarial and it makes sense to negotiate.

I remember seeing this clip in 2012, and back then I proposed a solution based on correlated equilibria which still seems pretty good to me:

"We have two minutes to talk, right? I'm going to ask you to flip a coin (visibly to both of us) at the last possible moment, the exact second where we must cease talking. If the coin comes up heads, I promise I'll cooperate, you can just go ahead and claim the whole prize. If the coin comes up tails, I promise I'll defect. Please cooperate in this case, because you have nothing to gain by defecting, and anyway the arrangement is fair, isn't it?"


I think you have not read the rules carefully enough to appreciate the challenge of this game,

> Third, each pirate would prefer to throw another overboard, if all other results would otherwise be equal.

If a pirate found themself in a situation where they thought their vote wouldn't change the outcome -- i.e. all other results would otherwise be equal -- they would vote to murder the proposer. In the situation you describe, every pirate would find themselves in that situation, and would thus vote for blood.


This is fascinating. I'm pretty sure the wikipedia page implicitly assumes that pirates are superrational (https://en.wikipedia.org/wiki/Superrationality).

A group of superrational players may choose an outcome which is not even a NE! For example, two superrational players playing the prisoner's dilemma game will choose to cooperate.

Conversely, Nash equilibria are no longer "stable" in the original sense that there is no incentive for any player to unilaterally deviate. The concept of "unilaterally deviating" doesn't even make sense, because a group of superrational players will all choose the same strategy. The superrationality allows them to cooperatively deviate.

To demonstrate, I consider the smallest case for which your equilibrium strategy differs from the wiki pages' strategy, that is n=3. Suppose there are 3 pirates, A, B and C. Suppose the first pirate proposes that he gets all the coins. Then the state where everyone votes yes is a Nash equilibrium because no one has an incentive to unilaterally. However I claim that a group of supperrational players will vote to kill A.

Here is the proof of my claim (caveat: I don't think this is actually a valid analysis - see next paragraph. however it's the same level of rigor as the wiki page and demonstrates how B and C can cooperate to kill A, "escaping" the NE of both voting Y). C will vote No because accepting the proposal is the worst possible outcome for him (he gets 0 gold and no one dies; note that it is not possible for C to die), so strategies where he votes No in this round dominate strategies where he votes Yes in this round. B knows that C is thinking this. Hence he will vote No.

Explanation of caveat: actually, the nonstrict dominance proof in the above paragraph suggests that C does not know how B will vote and the dominance is nonstrict because of this unknown. However, since B and C are superrational, there is no such thing as not knowing how someone would vote; C knows that B will vote No, so the dominance is strict. However this argument seems circular (even though, because of superrationality, I think it really isn't).

Unrelated point: If we ammend the rules of the pirate game to include another tiebreaker with lowest priority: that all pirates prefer to vote "no", then your situation is no longer even a Nash equilibrium (much less a subgame-perfect NE). Then how many NEs are there in the ammended game?


> In the end there's no possible setup that everyone will stick with.

Wouldn't Alice, Bob and Carol realize that then and agree to divide the cake evenly since it's the only solution that is fair to everybody?


That would be the Nash equilibrium across splitting multiple cakes over and over again.

For a single cake, offering a fair split is just as hard to predict success as any other split. It also assumes there are no other signals, which is why it's not very insightful to begin with.


>Another problem which puzzles me even more is dividing a cake among three people by majority vote.

Is there a name for this problem? I can only find the fair cake cutting problem, which has been solved for 3 people.


https://www.math.hmc.edu/~su/fairdivision/theory.html

Or more specifically, https://en.wikipedia.org/wiki/Envy-free_cake-cutting

On a tangential note, something that's always bothered me about write-ups of "envy-free" division is that they assume a couple of things I don't think are true:

- That someone cutting a cake always executes exactly the cut they planned.

- That once the cake has been divided, no one will experience envy because their piece is worth, to them, at least their "fair share" of the total.

Take the classic example of envy-free division of one cake among two people. The solution is simple enough that most people understand it instantly: person A divides the cake into two portions however they please, and person B then selects one of those portions, A receiving the other. Everyone will agree that this is intuitively fair. But it's certainly not fair for either of the two reasons I dispute -- A might still envy the portion of the cake that went to B, imagining a different world where the entire cake was theirs... or A might slip up with the knife and create one portion which is obviously better than the other portion, which B then chooses.

Rather, I think the benefits of envy-free division come from two other points:

- The parties involved agree beforehand that the process is fair. This requires them to be able to understand it, which is easy for the two-person division and quite difficult for more than two.

- Instead of assuming that division goes off without a hitch, we can observe that if you receive a portion you're unhappy with, it's because you messed up. The blame falls on you rather than someone else.


I think he's describing a different problem, since he mentions majority voting. If I was splitting a cake with 2 other people (A and B), A suggested fair division into thirds, and B suggested half for me and half for B, B's suggestion would clearly win the vote.


No, there's no name AFAIK. I learned it from Robin Hanson (well-known blogger and economics professor at GMU). It's different from envy-free cake cutting, which doesn't involve voting.


> The pirates do not trust each other, and will neither make nor honor any promises between pirates apart from a proposed distribution plan that gives a whole number of gold coins to each pirate.


[deleted]


You have to remember this is merely a logic puzzle devised to have a non-intuitive solution. The pirates thing is merely a pasted-on theme. If it helps, consider these are robots who must follow a set of predetermined rules.


Arrr matey, it certainly would help because

a) not once have I seen an 'Arrr matey' in the comments b) the article didn't mention walking the plank at all c) I don't think that we can count on lower level pirates to be competent at complex logic


There's a stable solution if you can offer a credible pre-commitment. Instead of a tentative 50/50 split between Alice and Bob, they pre-commit to either voting for a 50/50 split or voting for giving Carol the entire cake.


Yeah, if you reformulate the game to allow precommitment, it will have many Nash equilibria. But I think of it as a last resort, because it gives you too many solutions with no way to choose between them.


I heard research into the 3 person cake division is held back by years due to all the researchers getting diabetes and dying young.

True story.


I really hate questions like this. Since they mistake rationality for computer-like adherence to a set of rules.

——————————————————————

Imagine a 100 rounds of ultimatum games. In each round, Alice proposes a split of $1 and Bob can either accept or reject. If Bob rejects, the $1 is burned in a fire. How much money will each end up with?

Lets work backwards, as we do in the pirate game.

In ROUND100, Alice knows Bob will take whatever non-zero offer she makes. She can offer him 1 cent and Bob will agree.

In ROUND99, Alice knows that Bob can will accept whatever non-zero offer she makes in ROUND100, so she can make whatever non-zero offer she wants this round without fear of repercussion. She offers him 1 cent and knows he will agree.

Continuing to work backwards, we find that in all 100 rounds, Alice takes 99 cents and offers Bob 1 cent. Alice ends up with $99 and Bob ends up with $1.

——————————————————————

This is obviously (and experimentally proven) not what would happen in real life. In experimental outcomes, Bob rejects offers that are too low to broadcast that he is “irrational”. As a result, Bob is able to negotiate a much higher outcome (around 40%).

So who is more rational? The "rational" Bob who gets $1, or the “irrational” Bob who gets $40?


I really love questions like this.

The insight obtained from from comparing the computer-like rationality to what would happen in the real world is truly fascinating to me.

Also, your example of Bob negotiating fits in very well with the theoretical computer-like rationality. Alice's offer of 1c is equivalent to an offer of 0c nominal in the hypothetical case.

The real-life 40% is more a commentary of utility of money as opposed to decision making. I think that you're conflating concepts.


I think what game theory could use is a little bit of the concept of price-of-entry into a new market, Ie, doing something at a loss now in order to generate profit later. This is similar but not exactly the same as the idea of a loss-leader. What a lot of classic game theory is missing is the idea of a feedback-loop.

In a game where "Bob" is continually offered one cent, if he burns the other player who is trying to get the other 99 cents, he's signaling that they'll have to cut him a better deal. The other player would be irrational to continue offering the 99/1 split after it had been previously rejected.

For the other player, even (especially!) if it's a computer, the rational choice is to keep upping Bob's percentage until it finds a number he'll accept. So you're right that 40% only speaks to a specific utility, but in any case, the only rational number is whatever number Bob (and the first player) will accept.


What are some such insights to be gained from the pirate example? What did we learn? How a set of "rational" computers might play out this scenario? How do we apply that to our decision making? These questions might seem obtuse or questioning the value of game theory, but I'm just having a hard time understanding what to take from it.

The real-life 40% is more a commentary of utility of money as opposed to decision making.

Only real-life humans make decisions though. If there is no implied utility of money then Bob might as well reject all offers. And it does seem that the only important question is "what would real humans do?" We can set up a simulation that strictly adheres to a simple set of rules and watch how it plays out, but what is that telling us?


> What are some such insights to be gained from the pirate example? What did we learn?

It's a logic puzzle. We learned that the intuitive solution of having the first proposal be "I don't want any money, I just want to live" is too conservative, and that there is a way better solution for the first pirate.

We didn't gain any deep insight in how to split money between real people, since those cases seldom involve pure, unemotional logic.


Game theory is often used as a guiding principle in human negotiations, but as you say, there is never a truly isolated system when it comes to people in real world situations. Good negotiators are cognizant of this.


I should rephrase. I hate it from an interview-question point of view. I like it from an insight point of view.


> mistake rationality for computer-like adherence to a set of rules

I might be mistaken but I think in terms of economics that this is literally the definition of rationality.

Your experimental example is with real people. Real people are not rational actors (as a rational actor is defined in economics). This is considered by many to be a major flaw in traditional economics.


> Real people are not rational actors (as a rational actor is defined in economics). This is considered by many to be a major flaw in traditional economics.

On the flip side, this is considered by many traditional economists to be a major flaw in real people.


Seeing as the "irrational" actor outperforms the "rational" actor, we should be happy real people are "irrational" at least some of the time.


This is probably because evolution operates on a genetic systematic basis, rewarded by external outputs. Capacity for rational thought is a side effect of this process, and does not necessarily proceed from first principles like much of economics would prefer.


I.e. classical rational economics is wrong -- and you provide a first principle reason why (adding to the experimental stuff I already knew about).


I hear that people on various anti-depression drugs tend to behave more like proper rational actors. Maybe if we all agreed to take the pills economists could actually understand what is going on...


By the definition I understand, rational means payoff maximizing. How an actor gets to payoff maximization is irrelevant. In other words if "irrational" actions result in payoff maximization, then the "irrational" actions ARE rational.


>In ROUND100, Alice knows Bob will take whatever non-zero offer she makes.

Wait, how does she know that? Can you explain why you consider that the most rational choice for Bob to make? As experiment shows, this is not the optimal strategy, so I don't see why you would call it the "rational" one.

Also, keep in mind that you're describing an iterative game (where the same people play the same game with the same circumstances repeatedly) and the pirate game is non-iterative, since the players and circumstances change between rounds. As you note, iterative games are much more complicated and the sort of induction used in the pirate game doesn't really work.


> As experiment shows, this is not the optimal strategy, so I don't see why you would call it the "rational" one.

It's rational because if Bob accepts an offer for a positive value of money, even if it's a cent, he gains value from it. Rejecting it just throws that cent away, which he could have had in his pocket, it's irrational to throw away this value.

There's a difference between rational and optimal for a reason though, but you need to feel like you're getting ripped off for that to come into play, you need to feel like you'd rather someone else lose $0.99 than you gain $0.01.


That only makes sense if you restrict Bob's decision making to consider each game in isolation, without regard to how how rejecting an offer in one game might lead to a higher offer in a future game. At that point, I would call it a different problem entirely, so there isn't much point in comparing the results. I certainly wouldn't label that version of the problem "rational" and the version where Bob considers future games "irrational".


But pre-commiting to a strategy of say: "accept if 40 cents or more" could net more coins overall, right?

I agree that if it was a one off game, then that would make sense, but it's an iterated game.



This 100 round progression reminds me of (though is not equivalent to) Zeno's Paradox.


This puzzle should be called "The VC game". Real pirates were actually far more egalitarian. http://www.newyorker.com/magazine/2007/07/09/the-pirates-cod...


I initially thought the parent was being unfair to the VCs, but then considered policies that exclude other investors, like preferred shares and buyouts that fall just short of any payout to common shareholders, and so forth, and realized it's pretty accurate.


Humans in general actively punish perceived unfairness. Put simply, that's because real life is iterated and this puzzle is explicitly a one-off game. Give those real pirates a guaranteed one-off game and they would be far less egalitarian.


Just on the off chance that anyone else reading this new yorker article clicked on the link to the Buccaneers.pdf file and also found that it 404'd:

Archive.org have a mirror:

http://web.archive.org/web/20081218012123/http://history.byu...


Isn't this our current economic status? The rich (A) become and stay rich by scoring votes with key voters (C and E). These key voters are kept in line by intimidation, because they know they could end up worse (like B and D).


In addition B, C, D and E are usually the same people divided by A.


This is an interesting way to think about it, i.e. getting thrown overboard == ruling class overthrown in revolution.


There's a Korean gameshow called "The Genius" which plays on these kinds of games. It's more socially oriented but the contestants and the "narrator" go through the mathematics and strategies a bit. Highly recommended.


I second that recommendation.

The usual format is that they play a game each episode, and the winner and another player of their choice are guaranteed to go to the next round. The loser of the main game then plays a one-on-one game with another player of their choice, and the loser of that game is eliminated. This creates an interesting metagame surrounding the main games.

One thing that makes it work particularly well is that the participants all seem to be relatively pleasant people. There were alliances and sudden-but-inevitable betrayal, but no one was ever unpleasant about it. I'm not sure if it's a cultural thing, or that they're all notionally "geniuses" (talented in some field) rather than random people or celebrities, but that atmosphere really made a good premise into an excellent show.


A tantalizing idea I've been trying to spell out properly that goes against the usual reasoning behind similar problem is that of putting into question the correctness of causality in hypotheticals.

Each such problem I've read about assumes that we can assume causality while reasoning about hypothetical, but strangely, if we let go of that assumptions, then we can arrive at different answers which hinge on otherwise surprising behaviour. I relate this idea to the fact that in logical reasoning, all true statements are, once proved, held to be simultaneously true. That is, given if A then B, with A being true, we don't hold B to be true 'after' A, but to have been always true, given A.

In the pirate problem, limiting ourselves to three pirates to shorten my explanation, we end up with the split being: 99 to C, 0 to D, 1 to E. This is because we assume that if only D and E were left, D would keep 100 to himself. Now given that distribution (99,0,1), D should now change his hypothetical proposal to (0,98,2). If we assume that C is not a 'non-causality' believer, but both D and E to be, then E would vote no to (99,0,1) and D would do the (0,98,2) split. You may argue that D could then do a (0,100,0) split, but that's not how a non-causality believer MUST act, because he knows that to get to that point, he must know thet E can logically know that he will do this split. This can be justified by arguing that when a pirates survives, he will enter such gold-splitting game later on. But my argument is subtler than this and doesn't require it. It basically become this: all such pirates, posited to be perfectly logical, are interchangeable. Thus they must all reason in the same fashion. Thus, my argument is that true pure logical minds see that the true way to maximize their gold profit is to hold a world view that maximize their profit, even if causality is discarded. Thus both D and E know that tehy can maximize their profit by discarding their belief in causality. That is how D ends up proposing (0,98,2) and E accepts it.

Of course, I ended up there by assuming C was not a believer, but given my argument, C must also be ready to throw causality out of the window, otherwise he will end up dead. I believe my argument ends up splitting (49,51,0), but I'm not sure. Once causality is throw out, it's hard to tell, but intuitively, with three pirates, those voting must be given almost equal gold and the remaining pirates must not have a majority.


That's not completely right, but close enough to be dangerous. Indeed, quite a few researchers around LessWrong, MIRI, FHI etc. agree that the right way to formalize decision-making should use the timeless view that you advocate. Decisions and precommitments are provable consequences of an agent's source code, agents with the same source code can use that fact to coordinate, and so on. I've done a lot of work on this, so feel free to ask :-)


I love game theory, but I hate that the focus outside of academia is almost always on simplistic single period games. These games can result in some interesting conclusions and teach some valuable concepts, but they are terrible at representing how people behave in the real world.

Anyone interested in a more complex and realistic examination of the economics of pirating should consider Peter Leeson's book, The Invisible Hook: The Hidden Economics of Pirates.

I would also recommend Leeson's Anarchy Unbound: Why Self-Governance Works Better Than You Think for some interesting applications of game theory in more realistic historical contexts.


It's not meant to teach you about people or pirates; it's a simplified tool used to teach concepts and methods of thinking.

Much like a two-body system is useful for explaining gravity, even though there's virtually no place in the real world where that particular example would be useful.


I get that, which is why I said it's useful for teaching, but I disagree with your notion that game theory is incapable of being anything other than an educational tool.

When models utilize more realistic assumptions, they can be incredibly powerful tools for understanding past behavior and predicting future behavior. Historical evaluation is particularly powerful when game theory is utilized with the analytical narrative form of analysis.


Not sure I agree with your last statement. Our solar system can be modelled quite accurately and usefully as several overlapping 2-body systems. The interplanetary interactions are very small perturbations.


This kind of reminds me of the surprise execution paradox.

https://en.m.wikipedia.org/wiki/Unexpected_hanging_paradox


I remember reading about this game on a private music torrent tracker what.cd and was working on the solution for a couple of hours. I believe there was a whole thread dedicated to these kinds of problems.

Quite an entertaining problem, skip the result and try it out yourself, the sense of accomplishment is fulfilling, especially if your mood is shaky.


Let's see if my logic works. It seems some motivations and negotiations are left out of the proposed solution.

If I were C I would have voted no on A's proposal and try to make a deal with D and E. The idea is that knowing if A were thrown overboard then B might be more apt to provide a more beneficial coin sharing program in an effort to save his life. After all, he just watched the previous guy get tossed overboard and die. The most I could lose would be 1 coin if B decided to try A's method on D and E and they went along with it.

The key is that I would have already negotiated a deal with D and E, if A or B doesn't share the wealth fairly then we vote to toss them. Once I'm in charge, I'll share the coins equally as possible.


That fails because one of the explicit stipulations in the problem is: "The pirates do not trust each other, and will neither make nor honor any promises between pirates apart from a proposed distribution plan that gives a whole number of gold coins to each pirate."

I read that as "an actual proposal, by the leader" not as a "theoretical future proposal, iff that negotiator becomes the leader" and I believe that's the intended reading.

Thus, D and E cannot trust that you'll hold your word to give them 33 coins each provided you get to be in charge.


If that were true then the whole thing falls apart as soon as the first proposal is made. They all have to trust that A will follow through with the proposal, of which they apparently do not.

But yes, I glossed over that part apparently.


They do trust A (the first proposal). If all of them accept A's proposal, they are all committed to honoring it. What they cannot trust is any hypothetical proposals made by a pirate whose turn is not to make the current proposal.

"I have the turn and I propose we split this way": trusted.

"I don't have the turn yet, but when it's my turn, I promise I'll propose this / kill him / won't kill you": not trusted.


Making a deal like you propose is explicitly forbidden by the rules. Or more accurately, pirates don't trust each other and will not honor any deals beyond the current split proposal. Of course, pirates in real life wouldn't behave like this, but these are not real-life pirates.


Yes, I admit I missed that part from another response. But, as I said there, if there's no trust at all then how does any of it make any sense to begin with?


There is trust in only one thing, as per the rules of the game: that the current proposed split (and only the current one), if accepted by all pirates, will be carried out.

Anything else cannot be trusted. Because the pirates are completely rational (think robots), they not only don't trust each other, they also know the rest don't trust them either. So no pirate will attempt any kind of deal.


This only works if C has the trust of D and E to honor his promises. And trust can only be developed in interated games.

In a one-off game a rational player C will negotiate and then propose a 99-0-1 distribution.


I guess I was going with the notion that this wasn't a group of pirates that had no prior interaction.


Pirates aren't hyperrational, so the whole game isn't a real-life scenario. Think robots, all playing with the same ruleset and having perfect knowledge of the game, if it helps.


I was asked to solve this puzzle once during a project management interview. The solution here says "work backwards," but I found that "work recursively" was a more helpful (and instructive) way to think about it.


Applying the maximization of benefit logic, "work backwards" is very meaningful to a very large population of readers, "work recursively" is somewhat more meaningful to a very small population.


The are very similar. Recursion is expressing a problem in terms of a smaller case of the same problem. Working backwards/induction is building up from a base case.


Recursion is exactly the same (as induction) because it also requires specifying what to do in a base case. I guess I think recursion is a more instructive term than "working backwards" because it speaks to a rigorous programmatic approach.


Wikipedia is not written for programmers only.


They could have explained it differently instead of going backwards. I tried to explain the backwards part to my mum and she didn't quite grok it. Then I explained it from perspective of A, B, C and so on.

When A is in charge this affects....

When B is in charge this affects...

Maybe that's recursively?

solve(pirates) {..... solve(mutatedPirates) }


I had to do this in a programming interview. Thankfully people don't do this type of question much any more.


The results of the pirate game seem a lot like the stock allocation at my company. A billion dollar acquisition and Pirate A figuring out how to distribute the coins... guess where they end up!

Note to self: don't get thrown overboard.


I think this is a great game and a portrayal of reality where a small number of people get most of the wealth, a 50% number are middle class with enough to be just okay (or at 1) and with a still a very large number who can barely have a comfortable life (say $4k/month household income).


Shouldn't it be E: 2 in the solution?

If pirates, all things being equal, prefer violence to peace (which is implied) then E getting 1 from A is less preferable than killing A and B and getting 1 from C.

Or is it that E knows that B will be offering next, and B will offer him 0, so it makes sense for him to accept A's offer of 1?


> Or is it that E knows that B will be offering next, and B will offer him 0, so it makes sense for him to accept A's offer of 1?

Yes. That's why it is sufficient to offer E: 1.


Reminded me of the game theory course by Stanford on Coursera. It was in the exercise on [backward induction](https://en.wikipedia.org/wiki/Backward_induction), if I remember correctly.


It's a nice demonstration of how the lack of trust ultimate screws everybody except the leader.


I was asked this question in an interview 10+ years ago. I thought it was a really good question to promote logical discussions. Lots of branches and extensions you can add to see how people think. Really fun.


If you like this type of puzzle, I can recommend the book: “How Would You Move Mount Fuji?: Microsoft's Cult of the Puzzle”.


This game reminds me of some employee stock option negotiations I've been on the lessor end of...


These outcomes assume you can live rich (having ~98 coins) in the same space as poor pirates without them stabbing you and taking your coins! Real pirates would distribute more evenly for fear of death after distribution.


Well now I have to choose a new interview question.


Now that you have that down, allow the pirates to secretly bribe each other out of their own savings before the vote. If multiple votes occur, previous bribes are added to a pirate's savings. If a pirate is thrown overboard, his remaining personal savings are looted and added to the pool of coins to be divided. The personal savings of each pirate are secret from all the other pirates, but all pirates always have more in savings than the next pirate in order of decreasing seniority.

According to the pirate's code, a bribe is a binding contract, but only to the extent that breaching the terms of the contract results in returning the amount of the bribe after the breach occurs. Otherwise, the bribed pirate is thrown overboard. A pirate may therefore spend the bribe money before reneging, if reneging will still yield enough money to pay back the bribe afterward.

Pirate P[x] has savings s[x], and s[x] > s[x+1].

The degenerate case where n=1 is easy to deduce. P[1] proposes {pool}, votes yes, and wins.

The case where n=2 is also easy. P[1] proposes {pool, 0}, votes yes, and wins.

When n=3, it gets more complicated. P[1] needs one more vote to win. If P[2] is able to propose a split, he will be proposing {0, pool + s[1], 0}. So the cost of P[2]'s vote is at least pool + s[1] + 1, which is normally impossible to achieve for P[1]. But since P[2] does not know how much s[1] is, other than that it is more than s[2], P[1] might be able to risk it. But since P[1] doesn't know s[2], bluffing a lower amount for s[1] is risky, as if the bluff amount is lower than s[2] + 1, then P[2] will immediately know to vote no. The cost of P[3]'s vote is at least 1. P[1] and P[2] will therefore be bidding competitively for P[3]'s vote. P[1] has the choice of offering value to P[3] publicly in the proposal, or secretly as a bribe. Either way, P[1] is likely to propose 0 for P[2].

So if P[1] proposes { pool, 0, 0 }, and bribes P[3] to vote yes with s[1], P[2] might try bribing P[3] with any amount from 1 to s[2] to vote no, with no effect. The net effect is { +pool -s[1], 0, +s[1] }.

Remember also that any bribe P[1] pays to P[2] could be added to a bribe from P[2] to P[3]. If P[1] bribes 1 to P[2] to vote yes, and s[1]-1 to P[3] to vote yes, and P[2] bribes s[2]+1 to P[3] to vote no, then if P[2] and P[3] both vote no, P[1] is thrown overboard for losing the vote, and P[2] is thrown overboard for reneging on a bribe and not having the coin to pay it back, so P[3] gets everything. If P[3] suspects that P[1] may have bribed P[2] to vote yes, and knows that P[2] bribed him to vote no--which would be useless unless P[2] himself intended to vote no--then P[3] may vote no on the possibility of getting pool + s[1] + s[2], which would otherwise be impossible for him. So knowing this, P[1] may be confident that any bribe to P[2] to vote yes would not reasonably be re-bribed to P[3] to vote no.

So P[1] could bribe 1 to P[2] to vote yes, tell P[3] that he had bribed P[2], tell P[2] that he had told P[3], and then propose {pool - 1, 0, 1}. It would be useless for P[2] to bribe P[3] unless he intended to reneg, he can't add P[1]'s bribe to his own, and he knows that s[1] - 1 >= s[2]. With the 1 from the pool, he could not match such a theoretical bribe anyway. So the net effect is now { +pool -2, 1, 1 }.

I'm not exactly sure if that is a stable solution or not.

I think maybe that P[3] might be able to get more by bribing P[1] to vote no, or P[2] to vote yes.


If the pirates were that strictly rational and interested in their own survival, they wouldn't be pirates. They'd have a nice, safe job, free from threat of death or injury.


I don't normally go for pop culture quotes here, but this one seems rather appropriate:

Oh, get a job? Just get a job? Why don't I strap on my job helmet and squeeze down into a job cannon and fire off into job land, where jobs grow on little jobbies?!


Yeah, and while you're doing that, why not have a chat to the pirates so bloodthirsty to kill each other, but with such a code of honour that they're satisfied with the booty split which has 98% of the money going to only 20% of the crew.

Why are you so disturbed about the ridiculousness of my comment, yet quite happy to take the ridiculousness of the situation's premise?


What gives you the impression that I'm "disturbed"?


Probably an artifact of seeing a bizarre response in general to the comment, really. The tone of the quote also doesn't help, as it sounds pissed-off.


Pretty much all of the jobs during pirates' heyday were dangerous, at least those available to the lower classes. Piracy was immensely profitable and comparatively safe for years until the Royal Navy clamped down heavily with summary execution being the usual punishment for captured pirates.

Basically, they had the same advantages as pirates near Somalia and Indonesia have today - very few military ships, and merchants are unwilling to fight because it risks the entire crew being massacred.


Again, if we're being realistic, nothing in the game tells us that it takes place in the golden age of piracy. It's piracy at sea (because 'overboard') and it's taking stuff ('booty'), but we have no other indicators. Well, unlikely to be modern pirates because 'gold coins', though possible. Piracy goes at least as far back as ancient Greece and up to today, with the world's hottest spot having moved to around Singapore. :)


Being a pirate was a job. A government job in some cases.


Yes, working for a privateer. Reference: https://en.wikipedia.org/wiki/Privateer


You should read The Invisible Hook. I bet you will have a different opinion by the end of it. You also forgot to put a value on self-determination.


I guess you could use the unusually democratic nature of some historical pirate crews to defend a grossly undemocratic split of 98-0-1-0-1 in the game, but I'm not sure I see the link, personally.


If you were ever curious what the phrase "couldn't see the forest for the trees" meant... here's an example.


Frankly, it's funny that there's been such a backlash against this comment. People seem outraged that I should suggest such a thing, yet find it perfectly acceptable that five pirates would be satisfied with a booty divide that has 98% of the money going to one guy. As if these pirates, so ready to kill each other, wouldn't have the four remaining pirates toss the first overboard.

If ANY of the respondedants and downvoters to my comment actually wanted a touch of realism, they'd understand that I was pointing out the ridiculousness of 'perfectly rational actors' in these games.


I don't think many jobs during the Golden Age of Piracy would be considered safe and free from threat of death or injury.


People are downvoting you and explaining why, but I'd like to ask you:

Why do you think rationality and survival are specifically at odds with being a pirate?


They're telling me about the 'job market in the age of sail', not about the problem at hand. Basically, you've pulled a strawman out of what I've said - I didn't say that rationality and survival were at odds with being a pirate. I said that if the pirates were "that strictly rational", that is, that they'd accept the ridiculous split given in the game, that they wouldn't be pirates in the first place. If you want to talk about pirates, cool, I love them and their history, let's talk about pirates! But if you want to make a game with humans adhering to ridiculous artificial rules, then lets throw some spanners in those works.

I'm actually trying to inject some humanity into the problem, highlighting (as others have) that we're talking about humans here, not logic gates. If the pirates were so extreme in their rationality, then they wouldn't be pirates in the first place. Talk of such grandiose concepts like self-determination seems out of place when you have 80% of the crew being satisfied with a measly two coins.


  "If the pirates were that strictly rational and interested
   in their own survival, they wouldn't be pirates"
to which I replied

  "Why do you think rationality and survival are 
  specifically at odds with being a pirate?"
Your comment is distilled down to

  if strictly rational and & interested in survival, then not pirates
So my question asking why you think this is not following incorrect logic, let alone a strawman -- if you're going to be pedantic about rhetoric then at least know what fallacy you're accusing of me means. You can't just say 'thats a strawman!' when someone drills down on your argument.


I never said that rationality and survival were at odds with being a pirate - that's something you injected into my words, hence strawman. I said that the strict rationality in the article was; the extreme nature of the rationality described.

I mean, I even italicised the word 'strictly' in my reply, to clearly indicate the important bit of my comment that you blithely removed to create your strawman. You didn't 'drill down' my argument, you twisted it to say something I never said. It's like if I said "if a person inhales too much water, they can drown" and you replied "why do you think water is at odds with survival?".

Hell, even in this new "distillation" of yours, the word 'strictly' is used. If you're a fan of rhetoric, then get on board with the meanings of words. And if you want to be pedantic, I didn't just say "that's a strawman", I pointed out why it was.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: