1. Introduction
The three-person game
baccara banque (or
baccara à deux tableaux) is closely related to the two-person game
baccara chemin de fer. In fact,
baccara banque has been described as “a game in which a banker plays
chemin-de-fer simultaneously against two players” (Downton and Lockwood [
1]). Game-theoretic analyses of
baccara chemin de fer have been provided by Kemeny and Snell [
2], Foster [
3], Downton and Lockwood [
4], Deloche and Oguer [
5], and Ethier and Gámez [
6]. The more complicated game
baccara banque has received less attention. Foster [
7] was the first to approach the game from the perspective of game theory, though the details of his research were not published. Kendall and Murchland [
8] used simulation to study the game. Downton and Holder [
9] discussed the special case of highly unbalanced stakes. Judah and Ziemba [
10] analyzed a variant of the game in which two of the three players have mandated strategies. Downton and Lockwood [
1] provided the most detailed study of
baccara banque, although it is partially incorrect.
To explain the purpose of this paper, we must first describe the rules of baccara banque. There are three players, Player 1, Player 2, and Banker. Two bets are available to participants, a bet on the hand of Player 1 and a bet on the hand of Player 2. Six 52-card decks are mixed together and dealt from a sabot or shoe. Denominations A, 2–9, 10, J, Q, K have values 1, 2–9, 0, 0, 0, 0, respectively, and suits are irrelevant. The total of a hand, comprising two or three cards, is the sum of the values of the cards, modulo 10. In other words, only the final digit of the sum is used to evaluate a hand. Two cards are dealt face down to each of Player 1, Player 2, and Banker. A two-card total of 8 or 9 is a natural. First, if either Banker or both Players have naturals, play ends. If only one Player has a natural and Banker does not, that Player wins the amount bet from Banker, while play continues between the other Player and Banker. Next, if neither Player 1 nor Banker has a natural, Player 1 has the option of drawing a third card. Then, if neither Player 2 nor Banker has a natural, Player 2 has the option of drawing a third card. In either case, the Player must draw on a two-card total of 4 or less and stand on a two-card total of 6 or 7. When his two-card total is 5, he is free to draw or stand as he chooses. Any third card is dealt face up. Finally, if at least one Player and Banker fail to have naturals, Banker has the option of drawing a third card, and his strategy is unconstrained. Bets are then settled, both Player 1 vs. Banker and Player 2 vs. Banker. In both competitions, the higher total wins. Winning bets are paid by Banker at even odds. Losing bets are collected by Banker. Equal totals result in a push (no money changes hands).
There is a subtle point in the rules that is left ambiguous in most descriptions of the game, concerning the information available to Player 2 when he makes his decision. In what Downton and Lockwood [
1] called the traditional form of the game, Player 2 sees Player 1’s third card, if any, or that he has a natural. The traditional rule is unambiguously stated in Morehead and Mott-Smith ([
11] pp. 522–523), for example. In a more recent variation, Player 2 would know only Player 1’s intention to draw or stand, or that he has a natural. This rule has been used in Great Britain (Downton and Lockwood [
1]) and in Monte Carlo (Barnhart ([
12] pp. 42–43)).
Thus, we have a three-person zero-sum game. Let us assume, as did Kemeny and Snell [
2], that cards are dealt with replacement and that only two-card totals (not compositions) are seen. Player 1 has two pure strategies, draw or stand on two-card totals of 5. Assuming the traditional form of the game, Player 2 also has a draw-or-stand decision on two-card totals of 5 in each of 12 possible situations (Player 1 third-card value 0–9; or stand or natural). Banker then has a draw-or-stand decision in each of
possible situations (12 possibilities for Player 1, 12 for Player 2, and 8 for Banker, except when both Players have naturals). Therefore, the game is a
trimatrix game, which is zero-sum. Under the rules of the more recent variation, the game would be
.
A simplified model assumes that Player 2 ignores his information about Player 1’s third card (or stand or natural). We then have a trimatrix game, which is again zero-sum.
In any case, the payoffs depend on
, where the amounts bet on Player 1’s and Player 2’s hands are in the proportions
. To see intuitively why
θ plays an important role, suppose Player 1’s third card is 7 and Player 2’s third card is 8. If Banker were playing
baccara chemin de fer against Player 1, he would draw on 0–6 and stand on 7. If he were playing
baccara chemin de fer against Player 2, he would draw on 0–2 and stand on 3–7. Notice that Banker would act differently against the two Players if his two-card total were 3–6. In
baccarat banque, however, he must make the same move (draw or stand) against both Players. Le Myre ([
13] p. 114) called this a “cruel embarrassment” for Banker. The parameter
θ determines his correct choice in these conflicting situations.
Downton and Lockwood [
1] claimed that the Nash equilibrium is of only academic interest because “it implies an attitude to the game by all three participants, which is unlikely to be realized in practice.” Presumably, they meant that the two Players regard themselves as competing against Banker but not against each other. Downton and Lockwood’s preferred alternative is what we call the
independent cooperative equilibrium, that is, the solution of the two-person zero-sum game in which Players 1 and 2, acting independently, form a coalition against Banker. Actually, the idea goes back to Foster [
7]. The independence is a requirement of rules not previously stated, which do not permit collaboration between Players 1 and 2. We emphasize that
baccara banque is a noncooperative game, so any cooperation between Players 1 and 2 must be limited to their agreement, prior to the game, that they use, independently, the strategies of the independent cooperative equilibrium. But, as we will show, this solution does not necessarily exist, in the sense that the lower and upper values of the game may differ. Nevertheless, the lower value and the Players’ maximin strategy are of relevance to the Players. A third solution, which we call the
correlated cooperative equilibrium, always exists. Its value and Banker’s minimax strategy are of relevance to Banker. Here there is no independence constraint, so the implementation of these strategies requires collaboration between Players 1 and 2 during the course of play. Despite the fact that these strategies are technically illegal, this concept will turn out to be a useful one.
Downton and Lockwood [
1] assumed the full model (in which Player 2 has
pure strategies). They evaluated the Players’ behavioral strategies in the independent cooperative equilibrium at
, reporting Banker’s behavioral strategy at
. They rounded the Players’ mixing probabilities to two decimal places, and rounded Banker’s mixing probabilities to 0 or 1. These results are, with minor exceptions, correct. They also derived the Nash equilibrium at the same level of detail, but these results are incorrect because their algorithm for finding the Nash equilibrium is flawed, as we explain later.
In this paper we focus our attention on the simplified model (in which Player 2 has two pure strategies). In particular, by symmetry we may assume, without loss of generality, that
. In effect, we interpret Player 1 as the Player on whose hand the smaller amount is bet. We have found, for every
, the correlated cooperative equilibrium, which is typically unique in behavioral strategies, and the Nash equilibrium, which is often nonunique in behavioral strategies. As for the independent cooperative equilibrium, we have found the maximin strategy of the Players, which is again typically unique in behavioral strategies; the minimax strategy of Banker coincides with the corresponding strategy in the correlated cooperative equilibrium. As Downton and Lockwood [
1] put it, the latter Banker strategy “provides a safety-first strategy which guarantees a return to the bank, whatever strategy the players actually adopt.” It is for this reason that we regard the correlated cooperative equilibrium as more useful, from Banker’s perspective, than the Nash equilibrium. On the other hand, from the Players’ perspective, it is arguable whether the independent cooperative equilibrium is more useful than the Nash equilibrium. (The Players’ strategy in the correlated cooperative equilibrium typically requires collaboration and is therefore illegal.)
To mention a few of our findings, the two cooperative equilibria coincide when , and the Nash equilibrium is nearly the same. Elsewhere, with two exceptions, they differ. The correlated cooperative equilibrium is piecewise continuous in θ but with 109 discontinuities in . The Nash equilibrium is piecewise continuous in θ but with 102 discontinuities in . In both cases, the discontinuities come from Banker’s strategy. The Players’ strategies are continuous in θ in the Nash equilibrium except for one point of discontinuity. In the independent cooperative equilibrium, there are as many as 13 points of discontinuity in the Players’ strategies. The game’s lower value (to the Players) is continuous on and increasing on , approximately, and it is maximized at about . The maximum value is about . The game’s upper value (to the Players) is continuous on andincreasing on , approximately, and it is maximized at about . The maximum value is about .
The correlated cooperative equilibrium, the independent cooperative equilibrium, and the Nash equilibrium are not easy to describe precisely. Complete, albeit necessarily lengthy, descriptions are provided in Appendixes A, B, and C of the arXiv version of this paper [
14]. Examples of the
Mathematica notebook files we used can be downloaded [
15,
16,
17].
As we have already noticed, several British statisticians [
1,
7,
8,
9] studied
baccara banque in the 1960s and 1970s. Their primary concern was in the fairness of the game and in particular whether it met the standards of the British Gaming Acts of 1960 and 1968. In fact, Foster was hired by a London gambling club specifically to investigate the legality of
baccara banque. Our motivation for this paper was different. We wanted to know whether technology not available in the 1970s (specifically, computer algebra software) would allow a more complete analysis of a complex game such as
baccara banque than was possible at that time.
Baccara banque is a three-player game complicated not just by the large number of strategy profiles but also by the fact that it depends on a continuous parameter
θ. It is not a contrived game—it is a relative of
baccara chemin de fer, which attracted the interest of game theorists in the 1950s [
2]. As we have seen, the work of Downton and Lockwood [
1] is the most complete study of the game in the scientific literature. They considered two solution concepts and computed approximate solutions for several values of
θ. In this paper we introduce a third solution concept and we compute exact solutions for all
θ. How does one even describe a Nash equilibrium, for example, of such a game as a function of
θ? Our goal in this paper is to answer questions like this, to clarify the distinctions between the three solution concepts, and in doing so to better understand the game
baccara banque.
There have been a number of attempts to quantify Banker’s advantage at
baccara banque in the case of equal amounts bet on Players 1 and 2 (
i.e.,
). Le Myre ([
13] p. 166) and Boll ([
18] pp. 43, 70) made the first estimates (1.11% and 0.87%, resp.), assuming that the Players independently draw on 5 with probability 1/2, and Banker makes a best response. The same assumption was made by Barnhart ([
12] p. 81), who obtained 0.84%. None of these authors was familiar with game theory (or with computers). Foster [
7] and Downton and Lockwood [
1] gave the first game-theoretic estimates (0.87% and 0.85%, resp.), which are in fact accurate to two significant digits under their respective assumptions. Kendall and Murchland [
8] gave a simulated estimate (0.819%), which is inaccurate owing to small sample size. Judah and Ziemba [
10] determined Banker’s best response when the Players always draw on 5, and obtained 0.81685% (the correct figure is about 0.922104%). Under the simplified model that we are assuming and with
, Banker’s advantage is about 0.8677394%, as we will see below.
We conclude this introduction with a historical note.
Baccara banque was made famous by the Prince of Wales (later Edward VII) in the Royal Baccarat Scandal of 1891 (Shore [
19]). It became the game of choice for wealthy gamblers in 1922 when Nicolas Zographos, a founding member of the Greek Syndicate, announced “Tout va” or unlimited stakes. As he put it (Graves ([
20] pp. 27–28)),
My idea is so sensational that practically nobody will play chemin-de-fer. If I guarantee to take any stake of any size, all the millionaires will want to take part in this fantastic party. The biggest gamblers in the world will come to ruin me. I suggest we start at Deauville.
In recent years, both baccara banque and baccara chemin de fer have been largely superseded by a nonstrategic form of the game. Nevertheless, baccara banque is still offered at the Salons Privés of the Casino de Monte-Carlo, Thurs.–Sun. from 5 p.m.
3. Correlated Cooperative Equilibrium
We first find the correlated cooperative equilibrium, that is, the solution of the two-person zero-sum game in which the two Players form a coalition against Banker and are not constrained to act independently. This is a
matrix game with payoff matrix having entries
as defined in
Section 2. For fixed
, we can obtain a solution as follows: Given an arbitrary mixture
of the four pure strategies of the Players, minimize
as a function of
T (this is Banker’s best response
; for information sets where Banker is indifferent, Banker can either draw or stand; for specificity we let Banker stand in such cases), and then maximize
as a function of
p. The maximizing
p is the Players’ maximin strategy, and the maximal value of
is the value of the game, assuming
θ is fixed.
For a given Banker information set ( Player 1’s third-card value, Player 2’s third-card value, Banker’s two-card total), Banker’s optimal move (draw or stand) may or may not depend on the Players’ mixed strategy p. In fact, for only m of the 1144 information sets, where , is there dependence on p.
Let us elaborate on this point. First, assume that
. Suppose Player 1’s third card is 7 and Player 2’s third card is 8 (as in the example mentioned in
Section 1). We can compute, for each Banker total
, the difference between the Players’ expectation when Banker draws and when Banker stands. We find that Banker draws with totals
, regardless of
p, and stands with total 7, regardless of
p, but may draw or stand with totals
, depending on
p, accounting for four undetermined cases. Doing the same analysis for the 142 other possible pairs of Player 1 and 2 third cards, we find that there are an additional 64 undetermined cases. Thus,
when
. On the other hand, it is easy to see that
when
θ is near 0 but positive. In that case Banker plays
baccara chemin de fer against Player 2 and ignores Player 1. In
baccara chemin de fer it is well known that
. This must be multiplied by 12 (possible Player 1 third cards), and there are an additional four cases when Player 2 has a natural, in which Banker plays
baccara chemin de fer against Player 1. Thus,
for
θ positive and small enough. In general, we have found that
.
Writing
, it follows that
where the constants
are computable rational numbers (if
θ is rational) depending on
θ.
For fixed
θ, the function
is concave in
p and its maximum occurs at an intersection of three of the
planes
and
,
,
, and
. The
m planes in Equation (18) might be called “indifference planes”. This leads to a simple algorithm to find the optimal
p. For each of the
potential points
p just mentioned, check whether
,
,
, and
, and if so, evaluate
. Then determine at which such
p the value
is largest and if it isuniquely so.
In the case
, the number of summands is
, hence there are
potential points of intersection
p. Of these only 2364 belong to the three-dimensional simplex and there is a unique maximum at
in addition,
there. The point
p is the intersection of three planes, the two indifference planes for
(Player 1’s third card is 6, Player 2 stands, Banker’s two-card total is 6) and
and the plane
. Banker’s best response is displayed in
Table 1.
Table 1.
Banker’s strategy in the correlated cooperative equilibrium when . Specifically the table displays Banker’s maximum drawing total as a function of Player 1’s and Player 2’s third-card values. For example, if Player 1’s third card is 7 and Player 2’s third card is 8, the entry 3 signifies that Banker draws on 0–3 and stands on 4–7. 5+ signifies that Banker draws on 0–5, mixes on 6, and stands on 7. The table is symmetric in Player 1 and Player 2. Similar entries are shaded similarly for readability.
Table 1.
Banker’s strategy in the correlated cooperative equilibrium when . Specifically the table displays Banker’s maximum drawing total as a function of Player 1’s and Player 2’s third-card values. For example, if Player 1’s third card is 7 and Player 2’s third card is 8, the entry 3 signifies that Banker draws on 0–3 and stands on 4–7. 5+ signifies that Banker draws on 0–5, mixes on 6, and stands on 7. The table is symmetric in Player 1 and Player 2. Similar entries are shaded similarly for readability.
Player 1’s Third-Card Value | Player 2’s Third-Card Value (10 = Stand, 11 = Natural) |
---|
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 |
---|
0 | 3 | 3 | 3 | 4 | 4 | 4 | 4 | 3 | 3 | 3 | 5 | 3 |
1 | 3 | 3 | 4 | 4 | 4 | 4 | 4 | 4 | 3 | 3 | 5 | 3 |
2 | 3 | 4 | 4 | 4 | 4 | 4 | 5 | 4 | 3 | 3 | 5 | 4 |
3 | 4 | 4 | 4 | 4 | 4 | 5 | 5 | 4 | 4 | 3 | 5 | 4 |
4 | 4 | 4 | 4 | 4 | 5 | 5 | 5 | 5 | 4 | 4 | 5 | 5 |
5 | 4 | 4 | 4 | 5 | 5 | 5 | 5 | 5 | 5 | 4 | 5 | 5 |
6 | 4 | 4 | 5 | 5 | 5 | 5 | 6 | 6 | 5 | 4 | 5+ | 6 |
7 | 3 | 4 | 4 | 4 | 5 | 5 | 6 | 6 | 3 | 3 | 6 | 6 |
8 | 3 | 3 | 3 | 4 | 4 | 5 | 5 | 3 | 2 | 3 | 5 | 2 |
9 | 3 | 3 | 3 | 3 | 4 | 4 | 4 | 3 | 3 | 3 | 5 | 3 |
10 | 5 | 5 | 5 | 5 | 5 | 5 | 5+ | 6 | 5 | 5 | 5 | 5 |
11 | 3 | 3 | 4 | 4 | 5 | 5 | 6 | 6 | 2 | 3 | 5 | |
There are two information sets,
and
, at which Banker is indifferent. This leads to a
matrix game with payoff matrix
Rows correspond to pure strategies of the Players, SS, SD, DS, and DD on 5 by Player 1 and Player 2. Columns correspond to pure strategies of Banker, which follow
Table 1 except for SS, SD, DS, DD on
and
. This game has two extreme equilibria
, where
p is as above, and
or
The corresponding Banker behavioral strategies are the same for both equilibria,
This completes the derivation in the case
.
Next, we extend this solution to the largest
θ-interval in
for which the best response coincides with
Table 1. The maximum of
found above occurred at the intersection of three planes, the two indifference planes for
and
and the plane
. The intersection of the three corresponding
θ-dependent planes occurs at
With this choice of
we can ask, what is the smallest
θ for which Banker’s best response coincides with
Table 1? Checking each of the 1144 Banker information sets, we find that the first change occurs at
. The contribution to the difference between the Players’ expectation when Banker draws and the Players’ expectation when Banker stands (due to
) vanishes at
; more precisely,
is a root of the cubic polynomial
.
On the interval
Banker mixes at
and
, and it remains to determine the mixing probabilities. With
denoting the
θ-dependent version of Equation (19), the value
of the game satisfies
We find that
Since the Players have three strategies active, we seek a
kernel, and two of the four possibilities give nonnegative Banker mixing probabilities, the ones corresponding to
and to
. The resulting two solutions of
where
x(
θ) ≤
v(
θ), give the same Banker behavioral strategies, namely,
This completes the derivation for the interval (
θ*, 1/2].
Repeating this process (from right to left, or from left to right), we find that there are 110 such intervals in (0, 1/2]. That is, there exist 0 = θ0 < θ1 < θ2 < … θ109 = θ8 < θ110 = 1/2 such that the correlated cooperative equilibrium is a rational function of θ on interval i, namely (θi−1, θi), for i = 1, 2, …, 110. Each θi is a root of a polynomial of degree 4 or less. At the boundary points, discontinuities occur in Banker’s strategy.
There are two types of intervals, those in which the number of Banker information sets at which Banker mixes is two and those in which it is three. When it is two, the resulting 4 × 4 game has a 3 × 3 kernel. This is a consequence of the fact that the payoff matrix has the form
When it is three, the resulting 4 × 8 game has a 4 × 4 kernel. For intervals 1–41,
p11(
θ) = 0; for intervals 42–46, 102–103, and 107–110,
p00(
θ) = 0; for internals 61-66,
p10(
θ) = 0; and for all remaining intervals the Players have all strategies active. In all cases, despite the correlated cooperative equilibrium being nonunique in mixed strategies, it is unique in behavioral strategies. (This can be proved algebraically.) However, there are exceptions. At each boundary point, the solutions from both adjacent intervals apply, so there is nonuniqueness of Banker behavioral strategies at the 109 such
θ.
The value function is continuous on
, increasing on
and decreasing on
(see Equation (20)). Its maximum value is
. See
Figure 1 for a sketch of the graph.
Figure 1.
The graph of the value of the game to the Players (or minus the value to Banker), assuming the correlated cooperative equilibrium.
Figure 1.
The graph of the value of the game to the Players (or minus the value to Banker), assuming the correlated cooperative equilibrium.
Of particular interest are intervals 75–97, in which the Players’ strategy at equilibrium is, , and , that is, Players 1 and 2 play independently, drawing on 5 with probabilities . In this case, the correlated and independent cooperative equilibria coincide, as we demonstrate in Proposition 1 below.
4. Independent Cooperative Equilibrium
Let and be Player 1’s and Player 2’s two-card totals, and let and be their third-card values. Let Y be Banker’s two-card total, and let be Banker’s third-card value. Let be the probability that Player 1 draws on 5, and let be the probability that Player 2 draws on 5. Let and be the mixed strategies of Players 1 and 2, that is, random variables with distributions and . Assume they are independent of but not necessarily of each other.
Then Equation (2.1b) of Downton and Lockwood [
1], which represents the conditional expected gain to Banker when he draws (measured in units of total amount bet), given that Player 1’s third-card value is
, Player 2’s third-card value is
, and Banker’s two-card total is
, can be writte
Generally, one does not add two conditional expectations when they are conditioned on different events. However, if
and
are independent, then
which is evidently what was intended. Here we have used the simple fact that
The point is that Downton and Lockwood [
1] effectively assumed that Player 1 and Player 2 act independently, even though they make no such assumption explicitly. Thus, their “co-operative optimum strategy” is what we call the independent cooperative equilibrium.
Let us find the independent cooperative equilibrium, that is, the solution of the two-person zero-sum game in which the two Players form a coalition against Banker but are constrained to act independently. This is a
matrix game with payoff matrix having entries
as defined in
Section 2, but mixtures of the four pure strategies of the Players must have the form
for some
. The following proposition shows that the solution need not exist, in the sense that the lower and upper values of the game may differ. (For a closely related result, see Maschler
et al. ([
21] p. 179).) Let
.
Proposition 1 Given ,
let A be the payoff matrix for a matrix game, with the additional constraint that the row player is required to use a mixed strategy of the formfor some .
Let us describe such elements of as belonging to .
Then the lower value of the game iswhile the upper value of the game iswhich is equal to the value of the unconstrained game. In particular, if and only if the maximum in Equation (22)
occurs at a point in .
Proof. Equation (21)
is by definition. The value of the unconstrained game is, by the minimax theorem,
the right side of which is, by definition, the upper value of the constrained game. The last equality uses the fact that a linear function has the same maximum over Δ
4 as over Δ
2 × Δ
2 because the latter contains the extreme points of the former (namely (0, 0, 0, 1), (0, 0, 1, 0), (0, 1, 0, 0), (1, 0, 0, 0).
Remark 1. A mixed strategy p for the row player that achieves the maximum in Equation (21) is called a maximin strategy, and it assures the row player of an expected gain of at least v. A mixed strategy q for the column player that achieves the minimum in the center or on the right side of Equation (23) is called a minimax strategy, and it assures the column player of an expected loss of at most .
For fixed
, we can obtain the Players’ maximin strategy and the lower value of the game as follows: Given an arbitrary probabilities
and
(of drawing on 5 for Player 1 and Player 2), minimize
as a function of
T (this is Banker’s best response
), and then maximize
as a function of
. The maximizing
is the Players’ maximin strategy, and the maximal value of
is the lower value of the game, assuming
θ is fixed. (
Cf. Equation (21).)
For a given Banker information set
, Banker’s optimal move (draw or stand) may or may not depend on the Players’ strategy
. In fact, for only
m of the 1144 information sets, where
, is there dependence on
. It follows that
has the form
where the constants
are computable rational numbers (if
θ is rational) depending on
θ—but they are not the same as the ones in
Section 3.
For fixed
θ, the function
, although it depends on only two variables instead of three, is more complicated than
. It is not concave, and its maximum does not necessarily occur at an intersection of two of the
curves
and
,
,
, and
. Its maximum could occur at a point on a single curve but typically occurs at a point of intersection. The
m curves in Equation (25) might be called “indifference curves”. This leads to an algorithm to find the optimal
. For each of the
potential points
just mentioned, check whether
and
, and if so, evaluate
. Then determine at which such
the value
is largest and if it is uniquely so. Finally, confirm that this gives a global maximum. (If it does not, look for a global maximum along one of the
curves. The global maximum cannot occur at a point that avoids all of these curves because 1,
,
, and
are harmonic in
; a smooth function
is
harmonic if
throughout its domain.)
In the case
, the number of summands is
, hence there are
potential points of intersection
. Of these, only 1003 belong to the unit square and there is a unique maximum at
and
there. It can then be confirmed that this determines a global maximum. Furthermore, Banker’s best response to this choice of
is exactly as in
Table 1. This completes the derivation in the case
.
Next, we extend this solution to the largest
θ-interval in
for which Banker’s best response coincides with
Table 1. The maximum of
found above occurred at the intersection of the two indifference curves for
and
. The
θ-dependent versions of these two indifference curves intersect at the point
, where
and
. Then, with
as before,
with
The first change in the matrix of
Table 1 occurs at the
entry, which changes from 3 to 4 as
θ goes from
to
, where
(
is a root of a quartic polynomial). However, for
θ close to but greater than
, we find that Banker is indifferent at
and
, so there must be a change in Banker’s best response in
. To find the
θ at which the first change occurs, we determine where the lower value functions for
θ near and to the left of 1/2, which is
, and for
θ near and to the right of
are equal. This occurs at about
. When
θ is close to this value, we find that the global maximum occurs along the indifference curve for
. This leads to a third expression for the lower value function, call it
, and the
θ at which
, call it
, is the actual left endpoint of the first interval,
. We find that
(
is a root of a polynomial of degree 8). This completes the derivation for the interval
.
Figure 2.
The graphs of the Players’ strategies and in the independent cooperative equilibrium, restricted to . ( for and for , approximately.) has 13 discontinuities on , whereas has 11 discontinuities.
Figure 2.
The graphs of the Players’ strategies and in the independent cooperative equilibrium, restricted to . ( for and for , approximately.) has 13 discontinuities on , whereas has 11 discontinuities.
Repeating this process, we find that there are 131 such intervals in
. That is, there exist
such that the independent cooperative equilibrium (described by the Players’ maximin strategy and Banker’s best response; the latter, as we saw in Proposition 1, is not Banker’s minimax strategy so is useful primarily for determining the lower value function) is a continuous function of
θ on interval
i, namely
, for
. Each
is a root of a polynomial of degree 13 or less. In exactly nine of these intervals, the maximum occurs along a single indifference curve rather than at a point of intersection. At the boundary points there are discontinuities in Banker’s best response, whereas the Players’ strategies are typically continuous except for a number of discontinuities. Actually, at the boundary points, solutions from both adjacent intervals apply, so there is nonuniqueness of the Players’ strategies at points of discontinuity. In
Figure 2 we graph
and
as functions of
θ.
We have seen that the correlated and independent cooperative equilibria coincide when
. With two exceptions, these are the only
θ values at which the two equilibria coincide. The exceptions are
and
, as can be seen from the plot of the difference between the upper and lower value functions in
Figure 3.
Figure 3.
The graph of the difference between the upper and lower value functions, multiplied by , in the independent cooperative equilibrium.
Figure 3.
The graph of the difference between the upper and lower value functions, multiplied by , in the independent cooperative equilibrium.
5. Nash Equilibrium
Here we find the Nash equilibrium of our
trimatrix game. Our method involves finding one or more Nash equilibria explicitly, but it does not permit any uniqueness assertions. Let us begin with the case
. For now we simply claim that there exists a Nash equilibrium in this case with
. Banker’s best response is as in
Table 1 but with five changes: Entry 5 at
,
, and
becomes 5+. Entry 5+ at
and
becomes 6. Thus, there are now three information sets at which Banker is indifferent,
,
, and
.
Let
A and
B be the
payoff matrices for Player 1
vs. Banker and for Player 2
vs. Banker, but with the rows labeled by the Players’ pure strategies: SS, SD, DS, DD on 5 by Player 1 and Player 2. The columns are labeled by Banker’s eight pure strategies: SSS, SSD, SDS, SDD, DSS, DSD, DDS, DDD on
,
, and
. Of course, Banker makes a best response except in the three cases in which he is indifferent. We find that
and
Since the Players must act independently, we let
We now determine whether there is a mixture
of Banker’s eight pure strategies such that
is constant in
p1 and
b(9/11,
p2)
q⊺ is constant in
p2. This would ensure that
p1 = 9/11 (in fact any strategy
p1 of Player 1) is a best response to
p2 = 9/11 and
q; similarly,
p2 = 9/11 (in fact any
p2) is a best response to
p1 = 9/11 and
q. And of course
q is automatically a best response to
p1 =
p2 9/11. A necessary and sufficient condition on
q is
and
qj ≥ 0 for
j = 1, 2, …, 8. Summing the three equations gives
q6 +
q7 +
q8 = 1 −
q1 −
q2 −
q3 −
q4 −
q5, so any such
q is automatically a probability vector.
By testing all possible supports of size two or three, we find that the eight Banker pure strategies are mixed in 11 extreme Nash equilibria as follows:
1. | (0, 15175619=33313280, 15175619=33313280, 0, 0, 0, 0, 1481021=16656640). |
2. | (0, 1=2, 4229827=11421696, 0, 0, 0, 1481021=11421696, 0). |
3. | (0, 4229827=11421696, 1=2, 0, 0, 1481021=11421696, 0, 0). |
4. | (0, 4705731=12373504, 4705731=12373504, 0, 1481021=6186752, 0, 0, 0). |
5. | (0, 3753923=10469888, 3753923=10469888, 1481021=5234944, 0, 0, 0, 0). |
6. | (15175619=21891584, 0, 0, 0, 0, 0, 0, 6715965=21891584). |
7. | (1988135=3331328, 0, 0, 0, 0, 1343193=6662656, 1343193=6662656, 0). |
8. | (1568577=3807232, 0, 0, 0, 2238655=3807232, 0, 0, 0). |
9. | (3753923=10469888, 0, 0, 6715965=10469888, 0, 0, 0, 0). |
10. | (4229827=10945792, 0, 6715965=21891584, 0, 0, 6715965=21891584, 0, 0). |
11. | (4229827=10945792, 6715965=21891584, 0, 0, 0, 0, 6715965=21891584, 0). |
If Player 1, Player 2, and Banker play according to their equilibrium strategies, Banker’s expected gain per unit stake is
A list of extreme Nash equilibria is the usual way to express the solutions of a noncooperative game, but it is unnecessarily complicated in this case. A better approach is to express these equilibria in terms of behavioral strategies. Only one of the three information sets, (10; 10; 6), (10; 11; 6), and (11; 10; 6), is encountered during the play of a single game. Thus, knowing the draw probabilities in each of the three cases is sufficient. Expressed in terms of these behavioral strategies, the Banker strategies in the 11 extreme equilibria all have the form (
r1,
r2,
r2), and there are only two extreme points, namely
This completes the derivation in the case
θ = 1/2.
In fact the same Nash equilibria apply on the interval
for
. (Here they are not
θ-dependent.) This is the first
θ (moving right to left) at which a change occurs in
Table 1 (beyond the five changes already noted). As
θ moves from
to
, the
entry in
Table 1 changes from 4 to 5.
We can repeat this process 40 times. In each new interval the Players’ strategy is the same (independent with ), while Banker’s strategy changes from the previous interval. This determines the Nash equilibria for all . At the next interval there is no mixture q satisfying the required properties.
Now let us consider what happens when
. Each of the remaining 62 intervals is one of two types: Banker mixes on two information sets (40 cases), or Banker mixes on one information set and
(22 cases). First we consider the interval whose right endpoint is
. Suppose we know that Banker mixes on
and on
in this interval. The intersection of the two indifference curves occurs at
, where
Evaluating the payoff matrices
A and
B for Players 1 and 2 (now
), we can argue as above, and this leads to two extreme equilibria, which have the same Banker behavioral strategies. The left endpoint of the interval is
because of a change in Banker’s strategy at
. As for the information sets on which Banker mixes in the next interval, there are three candidates, namely any two of the three
,
, and
, with the first two being most likely. This approach allows us to move from one interval to the next in a systematic way. In those cases where Banker mixes on only one information set and
, we do not need to have
constant in
p1; it suffices that it be maximized at
p1 = 0.
In
Figure 4 we graph
and
as functions of
θ. We restrict to
since
whenever
.
Figure 4.
The graphs of the Players’ strategies and in the Nash equilibrium, restricted to . ( for .) Both and have a unique discontinuity, at .
Figure 4.
The graphs of the Players’ strategies and in the Nash equilibrium, restricted to . ( for .) Both and have a unique discontinuity, at .
Downton and Lockwood [
1] observed that Player 1 has positive expectation for
. The reason is clear: Banker focusses his attention on Player 2 and therefore plays suboptimally against Player 1. How large is this expectation for small
θ? Let us consider interval 1 (
), in which Player 2 bets about 56.9383 or more times as much as Player 1. Player 1’s expectation, per unit bet by Player 1, when both Players and Banker use their equilibrium strategies, is
which is about
at the left endpoint of the interval and about
at the right endpoint. A 0.475% advantage is substantial but it presumably occurs rarely.
Finally, let us explain why Downton and Lockwood’s [
1] Nash equilibrium algorithm is incorrect. Let
be the expectation of Player 1 per unit bet by Player 1 when Players 1 and 2 independently draw on 5 with probabilities
and
and Banker makes a best response to
and
θ. Let
be defined analogously. Then, recalling Equation (24),
(
is continuous in
for fixed
θ, but
and
are not.) Downton and Lockwood [
1] proposed an algorithm for evaluating the Nash equilibrium based on the functions
and
. (We consider it in the context of our simplified model rather than in terms of the more elaborate model they analyzed.) Specifically, they defined
and
It appears that the aim was to find
such that
and
though only the second of these two inequalities actually follows from Equations (26) and (27). Then Banker would make a best response to
. While inequality (28) appears to say that
is a best response by Player 1, it does
not do so because its right-hand side
is defined in terms of a Banker best response to
, not
. Inequality (29) has the same problem. We observe that, when
and
, Equations (28) and (29) fail, which helps to confirm that the method is flawed.