Consistent Beliefs in Extensive Form Games

Paulo Barelli

doi:10.3390/g1040415

¹

Department of Economics, University of Rochester, 214 Harkness Hall, Rochester, NY 14627, USA

²

Insper Institute of Education and Research, Rua Quatá, 300 - Vila Olímpia 04546-042, São Paulo, Brazil

Games 2010, 1(4), 415-421; https://doi.org/10.3390/g1040415

This article belongs to the Special Issue Epistemic Game Theory and Modal Logic

Abstract

We introduce consistency of beliefs in the space of hierarchies of conditional beliefs (Battigalli and Siniscalchi) and use it to provide epistemic conditions for equilibria in finite multi-stage games with observed actions.

Keywords: hierarchies of conditional beliefs; epistemic conditions; common belief; correlated subgame perfect equilibrium

1. Introduction

Battigalli and Siniscalchi [1] constructed the space of hierarchies of conditional beliefs and used it to provide epistemic foundations for solution concepts in dynamic games. We consider the question of consistency of beliefs in the space of hierarchies of conditional beliefs. In the space of hierarchies of beliefs, Aumann [2], Aumann and Brandenburger [3] and Barelli [4], among others, have used consistency of beliefs to provide epistemic foundations for solution concepts in games in normal form. Here we provide an analogous analysis for multi-stage games with observable actions, in the corresponding space of hierarchies of conditional beliefs. In particular, we show that consistency of beliefs and extensive form rationality provide epistemic foundations for correlated subgame perfect equilibrium (correlated SPE), and these two conditions, plus a notion of constancy of conjectures, provide epistemic foundations for subgame perfect equilibrium (SPE).^1

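The following simple example helps to understand the ideas involved. Consider the standard Battle of the Sexes game, with the payoff matrix below (player 1 chooses the row, player 2 the column, and player 1's payoff is listed first),

	Player 2: F	Player 2: O
Player 1: F	$2, 1$	$0, 0$
Player 1: O	$0, 0$	$1, 2$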

The story is that the players decide simultaneously where to meet (either at a football game, F, or an opera house, O), and each player would rather go to the same place as the other, but has a preference for one venue over the other. Let $A_{i} = \{F, O\}$ for $i = 1, 2$ and $A = A_{1} \times A_{2}$. A correlated equilibrium for such a simultaneous move game is a Nash equilibrium of the game augmented by some payoff irrelevant state space, which is understood by both players. Consider, for instance, that each player chooses F if the weather is good, and chooses O otherwise (that is, they go to the outdoor event if the weather is good, and to the indoor event if the weather is not good). It is clear that such a strategy is a Nash equilibrium of the game augmented by the state space {good weather, weather not good}: if the other player uses the strategy, it is in the given player's interest to use it as well (if the weather is good (not good), a given player knows that the other will go to the football game (opera house), and will do well to go there too). Let p be the probability of the weather being good. Then the pair of strategies above gives rise to the distribution of joint actions $\eta \in \Delta(A)$ given by $\eta(F, F) = p$ and $\eta(O, O) = 1 - p$, and it is without loss to focus directly on such distributions in describing a correlated equilibrium. It suffices that, for each $a_{i} \in A_{i}$, the expected payoff of $a_{i}$ given $\eta(a_{i}, \cdot) \in \Delta(A_{j})$, $j \neq i$, is not smaller than the expected payoff of any other action $a_{i}'$, for $i = 1, 2$.
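The condition just stated can be checked mechanically for the weather example. The following is a minimal sketch, not part of the original analysis: the names `joint`, `is_correlated_equilibrium` and `weather_distribution` are hypothetical helpers, and exact rational arithmetic is used so that the inequalities are tested without rounding.

```python
from fractions import Fraction

ACTIONS = ("F", "O")
# Stage-game payoffs (u_1, u_2) from the matrix above; player 1's action is listed first.
U = {("F", "F"): (2, 1), ("F", "O"): (0, 0), ("O", "F"): (0, 0), ("O", "O"): (1, 2)}


def joint(i, own, other):
    """Joint action in which player i (0 or 1) plays `own` and the opponent plays `other`."""
    return (own, other) if i == 0 else (other, own)


def is_correlated_equilibrium(eta, payoffs=U):
    """Check that no player gains by deviating from any recommended action."""
    for i in (0, 1):
        for rec in ACTIONS:          # recommended action a_i
            for dev in ACTIONS:      # candidate deviation a_i'
                advantage = sum(
                    (payoffs[joint(i, rec, b)][i] - payoffs[joint(i, dev, b)][i])
                    * eta.get(joint(i, rec, b), 0)
                    for b in ACTIONS
                )
                if advantage < 0:
                    return False
    return True


def weather_distribution(p):
    """eta(F, F) = p and eta(O, O) = 1 - p; all other joint actions have probability 0."""
    return {("F", "F"): p, ("O", "O"): 1 - p}
```

Calling `is_correlated_equilibrium(weather_distribution(Fraction(1, 3)))` returns `True`, while a distribution that puts all mass on the miscoordinated outcome (F, O) fails the check.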

Now consider that the players play the game twice. That is, the players play the game once, observe its outcome, play it again, and get the sum of the payoffs obtained in each round. Let $H = \{\emptyset\} \cup A$ denote the set of histories. The empty history represents the first round, and each of the four joint actions in A represents a possible second round. Recall that a SPE is a Nash equilibrium of the entire game that induces Nash equilibria at each subgame. Analogously, a correlated SPE is a correlated equilibrium of the entire game that induces correlated equilibria at each subgame. It can be described as follows. Let $\eta \in \Delta(A)$ be a correlated equilibrium of the original Battle of the Sexes game, like the $\eta$ described above. A correlated SPE is a list of probability distributions $(\nu_{h})_{h \in H}$ with $\nu_{h} \in \Delta(A)$ for each $h \in H$, where $\nu_{h}$ is a correlated equilibrium for the continuation game at history $h \in A$ and $\nu_{\emptyset}$ is a correlated equilibrium of the one-shot game given by the first round outcome and the contingent second round outcome, given $\nu_{h}$ with $h \in A$. That is, each of the four continuation games is simply the original Battle of the Sexes game played after the first round. So a correlated equilibrium for a continuation game is a probability distribution $\eta \in \Delta(A)$. In the first round, on the other hand, each joint action gives rise to a (potentially) different continuation strategy. So it is not a simple stage game as the games in the second round. But it can be viewed as a one-shot game, with payoffs given by the sum of what is obtained in the first round and of the conditional payoffs in the second round, given the correlated equilibria of the four potential continuation games. Then, for instance, $\nu_{h} = \eta$ for all $h \in H$ is a correlated SPE, because $\nu_{a}$ ($= \eta$) is a correlated equilibrium of the continuation game after history $h = a$ for each $a \in A$, and given the four continuation correlated moves $(\nu_{a})_{a \in A}$, $\nu_{\emptyset}$ ($= \eta$) is a correlated equilibrium of the game with payoffs $u_{i}(a) + u_{i}(\eta)$, where $u_{i}$ is player i's stage game payoff and $u_{i}(\eta)$ is the expected payoff given $\eta$ (so, in particular, the expected payoff given $\nu_{\emptyset}$ is simply $u_{i}(\eta) + u_{i}(\eta)$). More complex correlated SPE involving different correlated continuation strategies can be constructed analogously.
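The two-step verification described in this paragraph (second round first, then the induced one-shot game at the empty history) can be sketched as follows. This is an illustrative sketch for the twice-played Battle of the Sexes only; the function names `obeys`, `first_round_payoffs` and `is_correlated_spe` are hypothetical and do not come from the original text.

```python
from fractions import Fraction

ACTIONS = ("F", "O")
# Stage-game payoffs (u_1, u_2); player 1's action is listed first.
U = {("F", "F"): (2, 1), ("F", "O"): (0, 0), ("O", "F"): (0, 0), ("O", "O"): (1, 2)}


def expected_payoff(eta, payoffs, i):
    """Expected payoff u_i(eta) of player i under the distribution eta."""
    return sum(prob * payoffs[a][i] for a, prob in eta.items())


def obeys(eta, payoffs):
    """True if eta satisfies the correlated equilibrium inequalities for `payoffs`."""
    for i in (0, 1):
        for rec in ACTIONS:
            for dev in ACTIONS:
                advantage = 0
                for other in ACTIONS:
                    a = (rec, other) if i == 0 else (other, rec)
                    a_dev = (dev, other) if i == 0 else (other, dev)
                    advantage += (payoffs[a][i] - payoffs[a_dev][i]) * eta.get(a, 0)
                if advantage < 0:
                    return False
    return True


def first_round_payoffs(continuation):
    """Payoffs u_i(a) + u_i(nu_a) of the one-shot game induced at the empty history."""
    return {
        a: tuple(U[a][i] + expected_payoff(continuation[a], U, i) for i in (0, 1))
        for a in U
    }


def is_correlated_spe(nu_empty, continuation):
    """Check the second-round games first, then the induced first-round game."""
    second_round_ok = all(obeys(nu_a, U) for nu_a in continuation.values())
    return second_round_ok and obeys(nu_empty, first_round_payoffs(continuation))
```

With `eta = {("F", "F"): Fraction(1, 4), ("O", "O"): Fraction(3, 4)}` and `continuation = {a: eta for a in U}`, the call `is_correlated_spe(eta, continuation)` returns `True`. When every continuation is the same $\eta$, the term $u_{i}(\eta)$ is a constant added to every first-round payoff, so it cancels in each inequality and the first-round check reduces to the stage-game check.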

Likewise, let $\eta \in \Delta(A_{1}) \times \Delta(A_{2})$ be a joint distribution associated with a Nash equilibrium of the stage game. For instance, $\eta(F, F) = \eta(O, O) = \frac{2}{9}$, $\eta(F, O) = \frac{4}{9}$ and $\eta(O, F) = \frac{1}{9}$, which is the joint distribution associated with the Nash equilibrium of the original Battle of the Sexes game in non-degenerate mixed strategies. Then a list $(\nu_{h})_{h \in H}$ with $\nu_{h} = \eta$ for all $h \in H$ is a SPE of the game, for the same reason as above. More complex SPE with different Nash equilibria of the continuation games can be constructed analogously.
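The numbers $\frac{2}{9}$, $\frac{4}{9}$ and $\frac{1}{9}$ follow from the usual indifference conditions of a mixed equilibrium. The sketch below recomputes them; the function name `mixed_equilibrium_distribution` is a hypothetical helper, and the closed-form expressions assume a 2x2 game with an interior mixed equilibrium (nonzero denominators), as is the case here.

```python
from fractions import Fraction

# Stage-game payoffs (u_1, u_2); player 1's action is listed first.
U = {("F", "F"): (2, 1), ("F", "O"): (0, 0), ("O", "F"): (0, 0), ("O", "O"): (1, 2)}


def mixed_equilibrium_distribution(u=U):
    """Joint distribution of the fully mixed Nash equilibrium, from indifference conditions."""
    # x = P(player 1 plays F) makes player 2 indifferent between F and O:
    #   x*u2(F,F) + (1-x)*u2(O,F) = x*u2(F,O) + (1-x)*u2(O,O)
    x = Fraction(
        u[("O", "O")][1] - u[("O", "F")][1],
        u[("F", "F")][1] - u[("F", "O")][1] - u[("O", "F")][1] + u[("O", "O")][1],
    )
    # y = P(player 2 plays F) makes player 1 indifferent between F and O:
    #   y*u1(F,F) + (1-y)*u1(F,O) = y*u1(O,F) + (1-y)*u1(O,O)
    y = Fraction(
        u[("O", "O")][0] - u[("F", "O")][0],
        u[("F", "F")][0] - u[("F", "O")][0] - u[("O", "F")][0] + u[("O", "O")][0],
    )
    mix1 = {"F": x, "O": 1 - x}
    mix2 = {"F": y, "O": 1 - y}
    # Independent mixing: the joint distribution is the product of the two mixtures.
    return {(a1, a2): mix1[a1] * mix2[a2] for a1 in mix1 for a2 in mix2}
```

Here player 1 plays F with probability $\frac{2}{3}$ and player 2 plays F with probability $\frac{1}{3}$, which reproduces the joint distribution given in the text.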

Now let us perform an epistemic analysis on the game, that is, an analysis of the knowledge and beliefs of the players. In order to do so, we append a type structure $(T_{1}, T_{2}, g_{1}, g_{2})$ with $g_{i, h}(t_{i}) \in \Delta(S \times T_{j})$ for each $t_{i} \in T_{i}$ and $h \in H$, where $S = S_{1} \times S_{2}$ with $S_{i} = \{F, O\}^{H}$ for $i = 1, 2$. The beliefs of a type $t_{i}$, $(g_{i, h}(t_{i}))_{h \in H}$, form a conditional probability system (CPS) (the formal definitions are provided below). A "state" for a player is a strategy-type pair $(s_{i}, t_{i})$, describing the player's strategy choice and beliefs. Epistemic statements can now be stated in terms of the states of the players. For instance, let $S_{i}(h)$ be player i's set of strategies consistent with history $h \in H$. Let $\eta = (\eta(\cdot \mid S_{j}(h)))_{h \in H}$, where $\eta(\cdot \mid S_{j}(h)) \in \Delta(S_{j}(h))$ for each $h \in H$. We say that $s_{i}$ is a best response to $\eta$, written $s_{i} \in r_{i}(\eta)$, if $s_{i}$ maximizes the expected utility with respect to $\eta(\cdot \mid S_{j}(h))$ for every history h consistent with $s_{i}$. And we say that the strategy-type pair $(s_{i}, t_{i}) \in S_{i} \times T_{i}$ is rational if $s_{i} \in r_{i}((\mathrm{marg}_{S_{j}} g_{i, h}(t_{i}))_{h \in H})$. Statements like "rationality is common knowledge among the players" can be described by a type structure where in each state $(s, t) \in S_{1} \times S_{2} \times T_{1} \times T_{2}$ both players are rational. Note that a type of a player determines the conditional beliefs at every history, and rationality captures sequentially rational choices, after every history (given the conditional beliefs).

Assume that the beliefs of the players are consistent in the following sense. There is a CPS $(\mu_{h})_{h \in H}$ with $\mu_{h} \in \Delta(S(h) \times T)$ for each $h \in H$, such that $g_{i, h}(t_{i})(E \times T_{j}) = \mu_{h}(E \times T \mid t_{i})$ for all $E \subset S$, $t_{i} \in T_{i}$ and $i = 1, 2$. The idea is analogous to action-consistency in Barelli [4], which is a generalization of the standard common prior assumption. Because strategies are in principle verifiable entities, we can conceive of an outside observer offering bets on S, conditional on each history, where the payouts of the bets are measured in utils. The two players will be in a no-bets situation if there does not exist a bet that yields a sure gain to an outsider. In Barelli [4] it is shown that this is equivalent to consistency of beliefs, as defined above.

Now, if consistency and rationality obtain at every $(s, t) \in S \times T$,^2 then we can identify a correlated SPE $(\nu_{h})_{h \in H}$ from the CPS $(\mu_{h})_{h \in H}$ by putting $\nu_{h}(a) = \mu_{h}(\{s : s_{h} = a\} \times T)$, for all $a \in A$. Indeed, if it is the case that beliefs are consistent and the CPS $(\mu_{h})_{h \in H}$ satisfies $\mu_{h}(\{s : s_{h} = (F, F)\} \times T) = p$ and $\mu_{h}(\{s : s_{h} = (O, O)\} \times T) = 1 - p$ for every $h \in H$, then it is straightforward to verify that rationality is obtained at every state, and that we obtain the correlated SPE described above. Indeed, rationality implies that no player wants to deviate from the recommended action, as required in a correlated SPE, and $(\nu_{h})_{h \in H}$ is exactly the correlated SPE above. Other correlated SPE are analogously obtained as we vary the consistent CPS $(\mu_{h})_{h \in H}$. The key observation here is that, under consistency, rationality ensures that the system of inequalities defining a correlated SPE is met.

If instead $\mu_{h}(\{s : s_{h} = (F, F)\} \times T) = \mu_{h}(\{s : s_{h} = (O, O)\} \times T) = \frac{2}{9}$, $\mu_{h}(\{s : s_{h} = (F, O)\} \times T) = \frac{4}{9}$ and $\mu_{h}(\{s : s_{h} = (O, F)\} \times T) = \frac{1}{9}$ for every $h \in H$, then we again have rationality at every state, and the SPE described above is obtained. As in Aumann and Brandenburger [3] and Barelli [4], the key observation is that constancy of conjectures in the support of the CPS $(\mu_{h})_{h \in H}$ ensures that $(\mu_{h}(\{s : s_{h} = a\} \times T))_{a \in A}$ is the product of its marginals, just as above. So rationality, consistency and constancy of conjectures in the support of the CPS are sufficient conditions for a SPE. It is important to note that constancy of conjectures is implied by (but does not imply) conjectures being commonly known among the players.

2. Set Up

The set up is as in Battigalli and Siniscalchi [1]. Let X be a Polish space, and let $\mathcal{A}$ be its Borel sigma algebra. Let $\mathcal{B} \subset \mathcal{A}$ be a countable collection of clopen sets, with $\emptyset \notin \mathcal{B}$. The collection $\mathcal{B}$ represents the relevant hypotheses. A CPS on $(X, \mathcal{A}, \mathcal{B})$ is a mapping $\mu(\cdot \mid \cdot) : \mathcal{A} \times \mathcal{B} \to [0, 1]$ satisfying: (i) $\mu(B \mid B) = 1$ for all $B \in \mathcal{B}$, (ii) $\mu(\cdot \mid B) \in \Delta(X)$ for all $B \in \mathcal{B}$, and (iii) for all $A \in \mathcal{A}$, $B, C \in \mathcal{B}$, if $A \subset B \subset C$ then $\mu(A \mid B) \mu(B \mid C) = \mu(A \mid C)$.^3 The set of CPSs on $(X, \mathcal{A}, \mathcal{B})$ is a closed subset of $[\Delta(X)]^{\mathcal{B}}$, and it is denoted by $\Delta^{\mathcal{B}}(X)$.
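For a finite set X, conditions (i) to (iii) can be tested directly. The following sketch is only an illustration under that finiteness assumption (the definition above covers Polish spaces in general); the helper names `prob` and `is_cps` are hypothetical, and hypotheses are represented as `frozenset` objects so that they can serve as dictionary keys.

```python
from fractions import Fraction


def prob(dist, event):
    """Probability that the measure `dist` (a dict point -> mass) assigns to `event`."""
    return sum(mass for point, mass in dist.items() if point in event)


def is_cps(mu, X, hypotheses):
    """Check conditions (i)-(iii) for mu[B] = mu(. | B), with B ranging over `hypotheses`."""
    for B in hypotheses:
        dist = mu[B]
        # (ii) mu(. | B) is a probability measure on X.
        if not set(dist) <= X or any(m < 0 for m in dist.values()) or sum(dist.values()) != 1:
            return False
        # (i) mu(B | B) = 1.
        if prob(dist, B) != 1:
            return False
    # (iii) mu(A | B) mu(B | C) = mu(A | C) whenever A is a subset of B and B of C.
    # On a finite X it is enough to test singletons A = {x}; additivity gives the rest.
    for B in hypotheses:
        for C in hypotheses:
            if B <= C:
                for x in B:
                    if prob(mu[B], {x}) * prob(mu[C], B) != prob(mu[C], {x}):
                        return False
    return True
```

For example, with `X = frozenset({1, 2, 3})` and `B = frozenset({2, 3})`, the system `{X: {1: Fraction(1)}, B: {2: Fraction(1, 2), 3: Fraction(1, 2)}}` passes the check even though B has probability zero given X. This is the feature that distinguishes a CPS from ordinary Bayesian conditioning: beliefs remain well defined after hypotheses that were assigned probability zero.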

Consider a finite multi-stage game G with observable actions (Fudenberg and Tirole [5], Chap. 3). Let $H$ be the set of histories and let $S_{i}$ be the set of strategies $s_{i} : H \to A_{i}$, where $A_{i}$ is the set of all possible actions for player $i \in I$, and $s_{i}(h) \in A_{i}(h)$ for each $h \in H$, where $A_{i}(h)$ is the set of actions available at h. Let $u_{i} : S \to \mathbb{R}$ denote player i's utility function, with $S = \times_{i \in I} S_{i}$. As usual, we use $A_{-i} = \times_{j \neq i} A_{j}$ and $A = \times_{i \in I} A_{i}$ (likewise for other sets, like $T_{i}$, $T_{-i}$ and T below).

A correlated equilibrium of a finite normal form game $(A_{i}, u_{i})_{i \in I}$ is a probability distribution $\eta \in \Delta(A)$ satisfying

$$\sum_{a_{-i} \in A_{-i}} [u_{i}(a_{i}, a_{-i}) - u_{i}(a_{i}', a_{-i})] \, \eta(a) \geq 0$$

for all $i \in I$ and all $a_{i}, a_{i}' \in A_{i}$. The interpretation is the one provided in the Introduction: the players use some external random device to peg their actions to, and assuming that the other players follow the recommended choices with the implied likelihoods, a given player has no incentive to deviate from his/her recommended choices. Because any such equilibrium generates a probability distribution over the joint actions, it is convenient to focus directly on such distributions (in the same way that a mixed strategy Nash equilibrium is defined directly on distributions, and not on the random variables that generate the distributions).

A correlated SPE of a finite multi-stage game with observed actions is given by $\nu = (\nu_{h})_{h \in H}$, with $\nu_{h} \in \Delta(A(h))$, which induces correlated equilibria at every subgame. That is, for each history h we have a continuation game $G(h)$ where the payoffs are defined for the histories that are consistent with h. Given h, we have a continuation correlated strategy $\nu | h$, given by the restriction of $\nu$ to histories consistent with h. Then a correlated SPE is $\nu$ such that $\nu | h$ is a correlated equilibrium of $G(h)$ for every $h \in H$. Standard dynamic programming arguments show that this is equivalent to the description provided in the Introduction. An SPE is a correlated SPE $\nu$ with $\nu_{h} \in \times_{i \in I} \Delta(A_{i}(h))$ for each $h \in H$.

Let $S_{i}(h)$ be player i's set of strategies consistent with history $h \in H$, and let $H(s_{i})$ be the set of histories consistent with $s_{i}$. The relevant hypotheses for the players are thus the collection $\mathcal{B} = \{S(h) : h \in H\}$. For a given player i, the hypotheses that are consistent with i's strategies are $\mathcal{B}_{i} = \{S_{i}(h) : h \in H\}$. As in Battigalli and Siniscalchi [1], we simplify notation by writing $\Delta^{\mathcal{B}_{i}}(\cdot)$ and $\Delta^{\mathcal{B}}(\cdot)$ as $\Delta^{H}(\cdot)$.

In order to perform an epistemic analysis, we append a type structure to the game, describing the beliefs of the players. A type space is a tuple $\mathcal{T} = (T_{i}, g_{i})_{i \in I}$ with $g_{i} : T_{i} \to \Delta^{H}(S \times T_{-i})$ for each $i \in I$. Again, to simplify notation we write $(g_{i, h}(t_{i}))_{h \in H} \in \Delta^{H}(S \times T_{-i})$ instead of $(g_{i, S(h)}(t_{i}))_{S(h) \in \mathcal{B}} \in \Delta^{\mathcal{B}}(S \times T_{-i})$.

Let $\eta = (\eta(\cdot \mid S_{-i}(h)))_{h \in H} \in \Delta^{H}(S_{-i})$. We say that $s_{i}$ is a best response to $\eta$, written $s_{i} \in r_{i}(\eta)$, if for all $h \in H(s_{i})$ and $s_{i}' \in S_{i}(h)$, we have

$$\sum_{s_{-i} \in S_{-i}(h)} [u_{i}(s_{i}, s_{-i}) - u_{i}(s_{i}', s_{-i})] \, \eta(s_{-i} \mid S_{-i}(h)) \geq 0.$$

We then say that a strategy-type pair $(s_{i}, t_{i})$ is rational if $s_{i} \in r_{i}((\mathrm{marg}_{S_{-i}} g_{i, h}(t_{i}))_{h \in H})$, and if $s_{i} \in S_{i}(h)$ then $g_{i, h}(t_{i})(\{s_{i}\} \times S_{-i} \times T_{-i}) = 1$. We say that player i is rational at state $(s, t) \in S \times T$ if the strategy-type pair $(s_{i}, t_{i}) \in S_{i} \times T_{i}$ is rational.

A CPS $\mu \in \Delta^{H}(S \times T)$ is called a consistent prior if

$$\mu_{h}(E \times T) = \int_{T_{i}} g_{i, h}(t_{i})(E \times T_{-i}) \, \mathrm{marg}_{T_{i}} \mu_{h}(d t_{i})$$

for all $i \in I$, all $h \in H$ and all $E \subset S$. It then follows that $g_{i, h}(t_{i})(E \times T_{-i}) = \mu_{h}(E \times T \mid t_{i})$ for all $i \in I$, all $h \in H$, all $E \subset S$ and $\mathrm{marg}_{T_{i}} \mu_{h}$-a.e. $t_{i}$. Let $\mathrm{supp}\, \mu = \bigcup_{h \in H} \mathrm{supp}\, \mu_{h}$ denote the support of the consistent prior. As advanced above, consistency is founded on players being in a no-bets situation, that is, a situation where an outside observer cannot make a sure gain on the group of players by offering bets on the strategy choices of the players. Proposition 5.3 in Barelli [4] establishes the equivalence between consistency and no-bets, and the reader is referred to that paper for further details.
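With finitely many types, the integral in the definition of a consistent prior becomes a weighted sum, and the condition can be checked at a single history h. The sketch below does this under that finiteness assumption; the names `marginal` and `is_consistent_at_history`, the type labels "g" and "b" in the usage note, and the choice to store only the marginal on S of each type's belief are all illustrative assumptions, not constructions from the original text.

```python
from collections import defaultdict
from fractions import Fraction


def marginal(dist, key):
    """Push the finite distribution `dist` forward through the function `key`."""
    out = defaultdict(Fraction)
    for outcome, p in dist.items():
        out[key(outcome)] += p
    return dict(out)


def is_consistent_at_history(mu_h, beliefs, n_players=2):
    """Check mu_h(E x T) = sum over t_i of g_{i,h}(t_i)(E x T_{-i}) * marg mu_h(t_i).

    mu_h:       dict mapping (s, t) to a probability, with s and t tuples.
    beliefs[i]: dict mapping each type t_i to the marginal on S of g_{i,h}(t_i).
    Testing singletons E = {s} suffices on a finite S, by additivity.
    """
    marg_S = marginal(mu_h, lambda st: st[0])
    for i in range(n_players):
        marg_Ti = marginal(mu_h, lambda st: st[1][i])
        averaged = defaultdict(Fraction)
        for t_i, weight in marg_Ti.items():
            for s, q in beliefs[i][t_i].items():
                averaged[s] += weight * q
        profiles = set(marg_S) | set(averaged)
        if any(marg_S.get(s, 0) != averaged.get(s, 0) for s in profiles):
            return False
    return True
```

As a usage example in the spirit of the weather story, let each player have a type "g" (good weather) that is certain of (F, F) and a type "b" that is certain of (O, O), and let `mu_h` put probability p on `(("F", "F"), ("g", "g"))` and 1 - p on `(("O", "O"), ("b", "b"))`. The check then succeeds for both players, and it fails if one player's type "g" is instead certain of (O, O).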

For the sake of comparison with the literature, consider a finite normal form game $G = (A_{i}, u_{i})_{i \in I}$ and a type space $\mathcal{T} = (T_{i}, \lambda_{i})_{i \in I}$, with $\lambda_{i}(t_{i}) \in \Delta(A \times T_{-i})$ capturing hierarchies of beliefs. Player i is rational at state $(a, t)$ if $a_{i}$ is a best response to his conjecture $\mathrm{marg}_{A_{-i}} \lambda_{i}(t_{i})$ and $\lambda_{i}(t_{i})(\{a_{i}\} \times A_{-i} \times T_{-i}) = 1$. A common prior is a probability measure $p \in \Delta(A \times T)$ such that $\lambda_{i}(t_{i}) = p(\cdot \mid t_{i})$ for $\mathrm{marg}_{T_{i}} p$-a.e. $t_{i}$. An action-consistent prior is a probability measure $\pi \in \Delta(A \times T)$ such that $\mathrm{marg}_{A} \lambda_{i}(t_{i}) = \mathrm{marg}_{A} \pi(\cdot \mid t_{i})$ for $\mathrm{marg}_{T_{i}} \pi$-a.e. $t_{i}$. Aumann [2] showed that, when there is a common prior, common knowledge of rationality implies that players play a correlated equilibrium. Aumann and Brandenburger [3] showed that common knowledge of rationality and of conjectures and the existence of a common prior are sufficient conditions for players to play a Nash equilibrium. These results were extended in Barelli [4] with the use of action-consistency in the place of common prior, rationality in the support of the action-consistent prior in the place of common knowledge of rationality and constancy of conjectures in the support of the action-consistent prior in the place of common knowledge of conjectures.

Note that the notion of consistency used here is much more demanding than using action-consistency in the normal form of the game. Consistency requires that players be in a no-bets situation after every history $h \in H$, whereas action-consistency allows for players to not be in a no-bets situation after histories that are not compatible with the strategy profiles in the support of the action-consistent prior. Players are required to be aware of a potential outsider at every counterfactual that they envisage while choosing their strategies.

3. Results

For a given consistent prior $\mu$, let $\nu = (\nu_{h})_{h \in H}$ be given by $\nu_{h}(a) = \mathrm{marg}_{S} \mu_{h}(\{s : s_{h} = a\})$ for each $a \in A(h)$, so that $\nu_{h} \in \Delta(A(h))$. We have:

Proposition 1. Let G be a finite multi-stage game with observed actions, and let $\mathcal{T}$ be a type space associated with G. Assume that there exists a consistent prior $\mu \in \Delta^{H}(S \times T)$ such that player i is rational at all $(s, t) \in \mathrm{supp}\, \mu$, for every $i \in I$. Then $\nu$ defined above is a correlated SPE.

Proof. By consistency, we have $\mathrm{marg}_{S_{-i}} g_{i, h}(t_{i}) = \mathrm{marg}_{S_{-i}} \mu_{h}(\cdot \mid t_{i})$ for every $i \in I$, $h \in H$ and $t_{i} \in \mathrm{supp}\, \mathrm{marg}_{T_{i}} \mu_{h}$. By rationality we then have, for each $i \in I$ and every $h \in H(s_{i})$,

$$\sum_{s_{-i} \in S_{-i}(h)} [u_{i}(s_{i}, s_{-i}) - u_{i}(s_{i}', s_{-i})] \, \mathrm{marg}_{S_{-i}} \mu_{h}(s_{-i} \mid t_{i}) \geq 0$$

for every $(s_{i}, t_{i})$ and every $s_{i}' \in S_{i}(h)$. Let $\eta = (\eta_{h})_{h \in H}$ with $\eta_{h} \in \Delta(S(h))$ be given by

$$\eta_{h}(s) = \int_{T_{i}(s_{i})} \mathrm{marg}_{S_{-i}} \mu_{h}(s_{-i} \mid t_{i}) \, \mathrm{marg}_{T_{i}} \mu_{h}(d t_{i}),$$

where $T_{i}(s_{i}) = \{t_{i}' \in T_{i} : (s_{i}, t_{i}') \text{ is rational}\}$, so that

$$\sum_{s_{-i} \in S_{-i}(h)} [u_{i}(s_{i}, s_{-i}) - u_{i}(s_{i}', s_{-i})] \, \eta_{h}(s) \geq 0$$

for every $i \in I$, $s_{i}, s_{i}'$ in $S_{i}(h)$ and $h \in H(s_{i})$. Now notice that the restriction of $\nu$ to a history $h \in H$, $\nu | h$, is the behavioral representation of $\eta_{h}$. By Kuhn's Theorem, the distribution over final outcomes induced by $\eta_{h}$ is the same as that induced by $\nu | h$, so $\nu | h$ is a correlated equilibrium of the continuation game $G(h)$ for every history $h \in H$, and we are done.

Let $\phi_{i, h}(t_{i}) = \mathrm{marg}_{S_{-i}} g_{i, h}(t_{i})$ denote the conjecture of type $t_{i}$ at $h \in H$. We have:

Proposition 2. Let G be a finite multi-stage game with observed actions, and let $\mathcal{T}$ be a type space associated with G. Assume that there exists a consistent prior $\mu \in \Delta^{H}(S \times T)$ such that (i) player i is rational at all $(s, t) \in \mathrm{supp}\, \mu$ for every $i \in I$ and (ii) $\phi_{i, h}(t_{i}) = \phi_{i, h}(t_{i}')$ for every $i \in I$ and every $t_{i}, t_{i}' \in \mathrm{supp}\, \mathrm{marg}_{T_{i}} \mu_{h}$, for each $h \in H$. Then $\nu$ defined above is a SPE.

Proof. Fix $h \in H$ and let $\phi_{i, h}$ be player i's constant conjecture in the support of $\mathrm{marg}_{T_{i}} \mu_{h}$. By consistency and rationality, we have for each $s \in S(h)$

$$\mathrm{marg}_{S} \mu_{h}(s) = \mathrm{marg}_{T_{i}} \mu_{h}(T_{i}(s_{i})) \, \phi_{i, h}(s_{-i}),$$

where $T_{i}(s_{i})$ is as in Proposition 1. Hence

$$\mathrm{marg}_{S} \mu_{h} = \mathrm{marg}_{S_{i}} \mu_{h} \otimes \mathrm{marg}_{S_{-i}} \mu_{h}.$$

Now induction on the number of players shows that $\mathrm{marg}_{S} \mu_{h} = \otimes_{i \in I} \mathrm{marg}_{S_{i}} \mu_{h}$, and a fortiori $\nu_{h} \in \times_{i \in I} \Delta(A_{i}(h))$, for all $h \in H$. The result then follows from Proposition 1.
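The step that separates Proposition 2 from Proposition 1 is the product structure of $\nu_{h}$. As an illustration on the Battle of the Sexes distributions from the Introduction, the following sketch tests whether a finite joint distribution equals the product of its marginals; the helper names `marginal` and `is_product_of_marginals` are hypothetical.

```python
from fractions import Fraction
from itertools import product


def marginal(nu, i):
    """Marginal of the joint distribution `nu` on player i's action."""
    out = {}
    for a, p in nu.items():
        out[a[i]] = out.get(a[i], 0) + p
    return out


def is_product_of_marginals(nu, action_sets):
    """True if nu(a) equals the product of the marginal probabilities of each a_i."""
    margs = [marginal(nu, i) for i in range(len(action_sets))]
    for a in product(*action_sets):
        expected = Fraction(1)
        for i, a_i in enumerate(a):
            expected *= margs[i].get(a_i, 0)
        if nu.get(a, 0) != expected:
            return False
    return True
```

The mixed equilibrium distribution with probabilities $\frac{2}{9}$, $\frac{4}{9}$, $\frac{1}{9}$, $\frac{2}{9}$ passes this test, whereas the weather distribution with $p = \frac{1}{2}$ does not: it is a correlated equilibrium distribution that no pair of independent mixed strategies can generate.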

4. Conclusion

The results in Section 3 tell us the following: under consistency, rationality yields correlated SPE, and adding constancy of conjectures to these two conditions yields SPE. These results are analogous to the results in Aumann [2], Aumann and Brandenburger [3] and Barelli [4]. As in the latter, beliefs are required to be consistent only at events that are potentially observable by an outsider, who could in principle force beliefs to be consistent by offering bets on the observable events. Rationality and constancy of conjectures have to hold in the support of the consistent prior. Because rationality and/or constancy of conjectures are implied by (but do not imply) rationality and/or conjectures being commonly known among the players, we have that rationality need not be common knowledge for players to play a correlated SPE, and neither do conjectures have to be common knowledge for players to play a SPE.

References

1. Battigalli, P.; Siniscalchi, M. Hierarchies of Conditional Beliefs and Interactive Epistemology in Dynamic Games. J. Econ. Theory 1999, 88, 188–230.
2. Aumann, R. Correlated Equilibrium as an Expression of Bayesian Rationality. Econometrica 1987, 55, 1–18.
3. Aumann, R.; Brandenburger, A. Epistemic Conditions for Nash Equilibrium. Econometrica 1995, 63, 1161–1180.
4. Barelli, P. Consistency of Beliefs and Epistemic Conditions for Nash and Correlated Equilibria. Games Econ. Behav. 2009, 67, 363–375.
5. Fudenberg, D.; Tirole, J. Game Theory; The MIT Press: Cambridge, MA, USA, 1992.

^1. For simplicity we deal only with finite multi-stage games with observed actions, so sequential rationality is well captured by subgame perfection; the analysis can be generalized to include incomplete information and/or more complex information structures, where sequential equilibrium is the relevant equilibrium concept to capture sequential rationality.
^2. More precisely, if throughout the support of the CPS $(\mu_{h})_{h \in H}$ defined above we have rational strategy-type pairs.
^3. $\Delta(X)$ denotes the space of probability measures on $(X, \mathcal{A})$.

© 2010 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license http://creativecommons.org/licenses/by/3.0/.
