1. Introduction
Battigalli and Sinischalchi [
1] constructed the space of hierarchies of conditional beliefs and used it to provide epistemic foundations for solution concepts in dynamic games. We consider the question of consistency of beliefs in the space of hierarchies of conditional beliefs. In the space of hierarchies of beliefs, Aumann [
2], Aumann and Brandenburger [
3] and Barelli [
4], among others, have used consistency of beliefs to provide epistemic foundations for solution concepts in games in normal form. Here we provide an analogous analysis for multi-stage games with observable actions, in the corresponding space of hierarchies of conditional beliefs. In particular, we show that consistency of beliefs and extensive form rationality provide epistemic foundations for correlated subgame perfect equilibrium (correlated SPE), and these two conditions, plus a notion of constancy of conjectures, provide epistemic foundations for subgame perfect equilibrium (SPE).
1The following simple example helps understand the ideas involved. Consider the standard Battle of Sexes game, with the payoff matrix below,
| F | O |
F | | |
O | | |
The story is that the players decide simultaneously where to meet (either at a football game, F, or an opera house, O), and each player would rather go to the same place as the other, but has a preference for one venue over the other. Let for and . A correlated equilibrium for such a simultaneous move game is a Nash equilibrium of the game augmented by some payoff irrelevant state space, which is understood by both players. Consider, for instance, that each player chooses F if the weather is good, and chooses O otherwise (that is, they go to the outdoor event if the weather is good, and to the indoor event if the weather is not good). It is clear that such a strategy is a Nash equilibrium of the game augmented by the state space {good weather, weather not good}: if the other player uses the strategy, it is in the given player’s interest use it as well (if the weather is good (not good), a given player knows that the other will go to the football game (opera house), and will do well to go there too). Let p be the probability of the weather being good. Then the pair of strategies above gives rise to the distribution of joint actions given by and , and it is without loss to focus directly on such distributions in describing a correlated equilibrium. It suffices that, for each , the expected payoff of given , , is not smaller than the expected payoff of any other action , for .
Now consider that the players play the game twice. That is, the players play the game once, observe its outcome, play it again, and get the sum of the payoffs obtained in each round. Let denote the set of histories. The empty history represents the first round, and each of the four joint strategies in A represents a possible second round. Recall that a SPE is a Nash equilibrium of the entire game that induces Nash equilibria at each subgame. Analogously, a correlated SPE is a correlated equilibrium of the entire game that induces correlated equilibria at each subgame. It can be described as follows. Let be a correlated equilibrium of the original Battle of the Sexes game, like the η described above. A correlated SPE is a list of probability distributions with for each , where a correlated equilibrium for the continuation game at history and a correlated equilibrium of the one shot game given by the first round outcome and the contingent second round outcome, given with . That is, each of the four continuation games is simply the original Battle of the Sexes game played after the first round. So a correlated equilibrium for a continuation game is a probability distribution . In the first round, on the other hand, each joint action gives rise to a (potentially) different continuation strategy. So it is not a simple stage game as the games in the second round. But it can be viewed as an one-shot game, with payoffs given by the sum of what is obtained in the first round and of the conditional payoffs in the second round, given the correlated equilibria of the four potential continuation games. Then, for instance, for all is a correlated SPE, because () is a correlated equilibrium of the continuation game after history for each , and given the four continuation correlated moves , () is a correlated equilibrium of the game with payoffs , where is player i’s stage game payoff and is the expected payoff given η (so, in particular, the expected payoff given is simply ). More complex correlated SPE involving different correlated continuation strategies can be constructed analogously.
Likewise, let be a joint distribution associated with a Nash equilibrium of the stage game. For instance, , and , which is the joint distribution associated with the Nash equilibrium of the original Battle of the Sexes game in non degenerate mixed strategies. Then a list with for all is a SPE of the game, for the same reason as above. More complex SPE with different Nash equilibria of the continuation games can be constructed analogously.
Now let’s perform an epistemic analysis on the game, that is, an analysis of knowledge and beliefs of the players. In order to do so, we append a type structure with for each , where with for . The beliefs of a type , form a conditional probability system (CPS), (the formal definitions are provided below). A “state" for a player is a strategy-type pair , describing the player’s strategy choice and beliefs. Epistemic statements can now be stated in terms of the states of the players. For instance, let be player i’s set of strategies consistent with history . Let , where for each . We say that is a best response to η, written , if maximizes the expected utility with respect to for every history h consistent with . And we say that the strategy-type pair is rational if . Statements like “rationality is common knowledge among the players" can be described by a type structure where in each state both players are rational. Note that a type of a player determines the conditional beliefs at every history, and rationality captures sequentially rational choices, after every history (given the conditional beliefs).
Assume that the beliefs of the players are consistent in the following sense. There is a CPS
with
for each
, such that
for all
,
and
. The idea is analogous to action-consistency in Barelli [
4], which is a generalization of the standard common prior assumption. Because strategies are in principle verifiable entities, we can conceive of an outside observer offering bets on
S, conditional on each history, where the payouts of the bets are measured in utils. The two players will be in a
no-bets situation if there does not exist a bet that yields a sure gain to an outsider. In Barelli [
4] it is shown that this is equivalent to consistency of beliefs, as defined above.
Now, if consistency and rationality obtain at every
,
2 then we can identify a correlated SPE
from the CPS
by putting
, for all
. Indeed, if it is the case that beliefs are consistent and the CPS
satisfies
and
for every
, then it is straightforward to verify that rationality is obtained at every state, and that we obtain the correlated SPE described above. Indeed, rationality implies that no player wants to deviate from the recommended action, as required in a correlated SPE, and
is exactly the correlated SPE above. Other correlated SPE are analogously obtained as we vary the consistent CPS
. The key observation here is that, under consistency, rationality ensures that the system of inequalities defining a correlated SPE is met.
If instead
,
and
for every
, then we again have rationality at every state, and the SPE described above is obtained. As in Aumann and Brandenburger [
3] and Barelli [
4], the key observation is that constancy of conjectures in the support of the CPS
ensures that
is the product of its marginals, just as above. So rationality, consistency and constancy of conjectures in the support of the CPS are sufficient conditions for a SPE. It is important to note that constancy of conjectures is implied by (but does not imply) conjectures being commonly known among the players.
2. Set Up
The set up is as in Battigalli and Siniscalchi [
1]. Let
X be a Polish space, and let
be its Borel sigma algebra. Let
be a countable collection of clopen sets, with
. The collection
represents the relevant hypotheses. A CPS on
is a mapping
satisfying: (i)
for all
, (ii)
, and (iii) for all
,
, if
then
.
3 The set of CPSs on
is a closed subset of
, and it denoted by
.
Consider a finite multi-stage game
G with observable actions (Fudenberg and Tirole [
5], Chap. 3). Let
be the set of histories and let
be the set of strategies
, where
is the set of all possible actions for player
, and
for each
, where
is the set of actions available at
h. Let
denote player
i’s utility function, with
. As usual, we use
and
(likewise for other sets, like
,
and
T below.)
A correlated equilibrium of a finite normal form game
is a probability distribution
satisfying
for all
and all
. The interpretation is the one provided in the Introduction: the players use some external random device to peg their actions to, and assuming that the other players follow the recommended choices with the implied likelihoods, a given player has no incentive to deviate from his/her recommended choices. Because any such equilibrium generates a probability distribution over the joint actions, it is convenient to focus directly on such distributions (in the same way that a mixed strategy Nash equilibrium is defined directly on distributions, and not on the random variables that generate the distributions).
A correlated SPE of a finite multi-stage game with observed actions is given by , with , which induces correlated equilibria at every subgame. That is, for each history h we have a continuation game where the payoffs are defined for the histories that are consistent with h. Given h, we have a continuation correlated strategy , given by the restriction of ν to histories consistent with h. Then a correlated SPE is ν such that is a correlated equilibrium of for every . Standard dynamic programming arguments show that this is equivalent to the description provided in the Introduction. An SPE is a correlated SPE ν with for each .
Let
be player
i’s set of strategies consistent with history
, and let
be the set of histories consistent with
. The relevant hypotheses for the players are thus the collection
. For a given player
i, the hypotheses that are consistent with
i’s strategies are
. As in Battigalli and Siniscalchi [
1], we simplify notation by writing
and
as
.
In order to perform an epistemic analysis, we append a type structure to the game, describing the beliefs of the players. A type space is a tuple with for each . Again, to simplify notation we write instead of .
Let
. We say that
is a best response to
η, written
, if for all
and
, we have
We then say that a strategy-type pair
is rational if
, and if
then
. We say that player
i is rational at state
if the strategy-type pair
is rational.
A CPS
is called a consistent prior if
for all
, all
and all
. It then follows that
for all
, all
, all
and
-a.e.
. Let
denote the support of the consistent prior. As advanced above, consistency is founded on players being in a
no-bets situation, that is, a situation where an outside observer cannot make a sure gain on the group of players by offering bets on the strategy choices of the players. Proposition 5.3 in Barelli [
4] establishes the equivalence between consistency and no-bets, and the reader is referred to that paper for further details.
For the sake of comparison with the literature, consider a finite normal form game
and a type space
, with
capturing hierarchies of beliefs. Player
i is rational at state
if
is a best response to his conjecture
and
. A common prior is a probability measure
such that
for
-a.e.
. An action-consistent prior is a probability measure
such that
for
-a.e.
. Aumann [
2] showed that, when there is a common prior, common knowledge of rationality implies that players play a correlated equilibrium. Aumann and Brandenbuger [
3] showed that common knowledge of rationality and of conjectures and the existence of a common prior are sufficient conditions for players to play a Nash equilibrium. These results were extended in Barelli [
4] with the use of action-consistency in the place of common prior, rationality in the support of the action-consistent prior in the place of common knowledge of rationality and constancy of conjectures in the support of the action-consistent prior in the place of common knowledge of conjectures.
Note that the notion of consistency used here is much more demanding than using action-consistency in the normal form of the game. Consistency requires that players be at a no-bets situation after every history , whereas action-consistency allows for players to not be at a no-bets situation after histories that are not compatible with the strategy profiles in the support of the action-consistent prior. Players are required to be aware of a potential outsider at every counter factual that they envisage while choosing their strategies.