Abstract
We introduce consistency of beliefs in the space of hierarchies of conditional beliefs (Battigalli and Siniscalchi) and use it to provide epistemic conditions for equilibria in finite multi-stage games with observed actions.
1. Introduction
Battigalli and Sinischalchi [] constructed the space of hierarchies of conditional beliefs and used it to provide epistemic foundations for solution concepts in dynamic games. We consider the question of consistency of beliefs in the space of hierarchies of conditional beliefs. In the space of hierarchies of beliefs, Aumann [], Aumann and Brandenburger [] and Barelli [], among others, have used consistency of beliefs to provide epistemic foundations for solution concepts in games in normal form. Here we provide an analogous analysis for multi-stage games with observable actions, in the corresponding space of hierarchies of conditional beliefs. In particular, we show that consistency of beliefs and extensive form rationality provide epistemic foundations for correlated subgame perfect equilibrium (correlated SPE), and these two conditions, plus a notion of constancy of conjectures, provide epistemic foundations for subgame perfect equilibrium (SPE).1
The following simple example helps understand the ideas involved. Consider the standard Battle of Sexes game, with the payoff matrix below,
F | O | |
F | ||
O |
The story is that the players decide simultaneously where to meet (either at a football game, F, or an opera house, O), and each player would rather go to the same place as the other, but has a preference for one venue over the other. Let for and . A correlated equilibrium for such a simultaneous move game is a Nash equilibrium of the game augmented by some payoff irrelevant state space, which is understood by both players. Consider, for instance, that each player chooses F if the weather is good, and chooses O otherwise (that is, they go to the outdoor event if the weather is good, and to the indoor event if the weather is not good). It is clear that such a strategy is a Nash equilibrium of the game augmented by the state space {good weather, weather not good}: if the other player uses the strategy, it is in the given player’s interest use it as well (if the weather is good (not good), a given player knows that the other will go to the football game (opera house), and will do well to go there too). Let p be the probability of the weather being good. Then the pair of strategies above gives rise to the distribution of joint actions given by and , and it is without loss to focus directly on such distributions in describing a correlated equilibrium. It suffices that, for each , the expected payoff of given , , is not smaller than the expected payoff of any other action , for .
Now consider that the players play the game twice. That is, the players play the game once, observe its outcome, play it again, and get the sum of the payoffs obtained in each round. Let denote the set of histories. The empty history represents the first round, and each of the four joint strategies in A represents a possible second round. Recall that a SPE is a Nash equilibrium of the entire game that induces Nash equilibria at each subgame. Analogously, a correlated SPE is a correlated equilibrium of the entire game that induces correlated equilibria at each subgame. It can be described as follows. Let be a correlated equilibrium of the original Battle of the Sexes game, like the η described above. A correlated SPE is a list of probability distributions with for each , where a correlated equilibrium for the continuation game at history and a correlated equilibrium of the one shot game given by the first round outcome and the contingent second round outcome, given with . That is, each of the four continuation games is simply the original Battle of the Sexes game played after the first round. So a correlated equilibrium for a continuation game is a probability distribution . In the first round, on the other hand, each joint action gives rise to a (potentially) different continuation strategy. So it is not a simple stage game as the games in the second round. But it can be viewed as an one-shot game, with payoffs given by the sum of what is obtained in the first round and of the conditional payoffs in the second round, given the correlated equilibria of the four potential continuation games. Then, for instance, for all is a correlated SPE, because () is a correlated equilibrium of the continuation game after history for each , and given the four continuation correlated moves , () is a correlated equilibrium of the game with payoffs , where is player i’s stage game payoff and is the expected payoff given η (so, in particular, the expected payoff given is simply ). More complex correlated SPE involving different correlated continuation strategies can be constructed analogously.
Likewise, let be a joint distribution associated with a Nash equilibrium of the stage game. For instance, , and , which is the joint distribution associated with the Nash equilibrium of the original Battle of the Sexes game in non degenerate mixed strategies. Then a list with for all is a SPE of the game, for the same reason as above. More complex SPE with different Nash equilibria of the continuation games can be constructed analogously.
Now let’s perform an epistemic analysis on the game, that is, an analysis of knowledge and beliefs of the players. In order to do so, we append a type structure with for each , where with for . The beliefs of a type , form a conditional probability system (CPS), (the formal definitions are provided below). A “state" for a player is a strategy-type pair , describing the player’s strategy choice and beliefs. Epistemic statements can now be stated in terms of the states of the players. For instance, let be player i’s set of strategies consistent with history . Let , where for each . We say that is a best response to η, written , if maximizes the expected utility with respect to for every history h consistent with . And we say that the strategy-type pair is rational if . Statements like “rationality is common knowledge among the players" can be described by a type structure where in each state both players are rational. Note that a type of a player determines the conditional beliefs at every history, and rationality captures sequentially rational choices, after every history (given the conditional beliefs).
Assume that the beliefs of the players are consistent in the following sense. There is a CPS with for each , such that for all , and . The idea is analogous to action-consistency in Barelli [], which is a generalization of the standard common prior assumption. Because strategies are in principle verifiable entities, we can conceive of an outside observer offering bets on S, conditional on each history, where the payouts of the bets are measured in utils. The two players will be in a no-bets situation if there does not exist a bet that yields a sure gain to an outsider. In Barelli [] it is shown that this is equivalent to consistency of beliefs, as defined above.
Now, if consistency and rationality obtain at every ,2 then we can identify a correlated SPE from the CPS by putting , for all . Indeed, if it is the case that beliefs are consistent and the CPS satisfies and for every , then it is straightforward to verify that rationality is obtained at every state, and that we obtain the correlated SPE described above. Indeed, rationality implies that no player wants to deviate from the recommended action, as required in a correlated SPE, and is exactly the correlated SPE above. Other correlated SPE are analogously obtained as we vary the consistent CPS . The key observation here is that, under consistency, rationality ensures that the system of inequalities defining a correlated SPE is met.
If instead , and for every , then we again have rationality at every state, and the SPE described above is obtained. As in Aumann and Brandenburger [] and Barelli [], the key observation is that constancy of conjectures in the support of the CPS ensures that is the product of its marginals, just as above. So rationality, consistency and constancy of conjectures in the support of the CPS are sufficient conditions for a SPE. It is important to note that constancy of conjectures is implied by (but does not imply) conjectures being commonly known among the players.
2. Set Up
The set up is as in Battigalli and Siniscalchi []. Let X be a Polish space, and let be its Borel sigma algebra. Let be a countable collection of clopen sets, with . The collection represents the relevant hypotheses. A CPS on is a mapping satisfying: (i) for all , (ii) , and (iii) for all , , if then .3 The set of CPSs on is a closed subset of , and it denoted by .
Consider a finite multi-stage game G with observable actions (Fudenberg and Tirole [], Chap. 3). Let be the set of histories and let be the set of strategies , where is the set of all possible actions for player , and for each , where is the set of actions available at h. Let denote player i’s utility function, with . As usual, we use and (likewise for other sets, like , and T below.)
A correlated equilibrium of a finite normal form game is a probability distribution satisfying
for all and all . The interpretation is the one provided in the Introduction: the players use some external random device to peg their actions to, and assuming that the other players follow the recommended choices with the implied likelihoods, a given player has no incentive to deviate from his/her recommended choices. Because any such equilibrium generates a probability distribution over the joint actions, it is convenient to focus directly on such distributions (in the same way that a mixed strategy Nash equilibrium is defined directly on distributions, and not on the random variables that generate the distributions).
A correlated SPE of a finite multi-stage game with observed actions is given by , with , which induces correlated equilibria at every subgame. That is, for each history h we have a continuation game where the payoffs are defined for the histories that are consistent with h. Given h, we have a continuation correlated strategy , given by the restriction of ν to histories consistent with h. Then a correlated SPE is ν such that is a correlated equilibrium of for every . Standard dynamic programming arguments show that this is equivalent to the description provided in the Introduction. An SPE is a correlated SPE ν with for each .
Let be player i’s set of strategies consistent with history , and let be the set of histories consistent with . The relevant hypotheses for the players are thus the collection . For a given player i, the hypotheses that are consistent with i’s strategies are . As in Battigalli and Siniscalchi [], we simplify notation by writing and as .
In order to perform an epistemic analysis, we append a type structure to the game, describing the beliefs of the players. A type space is a tuple with for each . Again, to simplify notation we write instead of .
Let . We say that is a best response to η, written , if for all and , we have
We then say that a strategy-type pair is rational if , and if then . We say that player i is rational at state if the strategy-type pair is rational.
A CPS is called a consistent prior if
for all , all and all . It then follows that for all , all , all and -a.e. . Let denote the support of the consistent prior. As advanced above, consistency is founded on players being in a no-bets situation, that is, a situation where an outside observer cannot make a sure gain on the group of players by offering bets on the strategy choices of the players. Proposition 5.3 in Barelli [] establishes the equivalence between consistency and no-bets, and the reader is referred to that paper for further details.
For the sake of comparison with the literature, consider a finite normal form game and a type space , with capturing hierarchies of beliefs. Player i is rational at state if is a best response to his conjecture and . A common prior is a probability measure such that for -a.e. . An action-consistent prior is a probability measure such that for -a.e. . Aumann [] showed that, when there is a common prior, common knowledge of rationality implies that players play a correlated equilibrium. Aumann and Brandenbuger [] showed that common knowledge of rationality and of conjectures and the existence of a common prior are sufficient conditions for players to play a Nash equilibrium. These results were extended in Barelli [] with the use of action-consistency in the place of common prior, rationality in the support of the action-consistent prior in the place of common knowledge of rationality and constancy of conjectures in the support of the action-consistent prior in the place of common knowledge of conjectures.
Note that the notion of consistency used here is much more demanding than using action-consistency in the normal form of the game. Consistency requires that players be at a no-bets situation after every history , whereas action-consistency allows for players to not be at a no-bets situation after histories that are not compatible with the strategy profiles in the support of the action-consistent prior. Players are required to be aware of a potential outsider at every counter factual that they envisage while choosing their strategies.
3. Results
For a given consistent prior μ, let be given by for each , so that . We have:
Proposition 1.
Let G be a finite multi-stage game with observed actions, and let be a type space associated with G. Assume that there exists a consistent prior such that player i is rational at all , for every . Then ν defined above is a correlated SPE.
Proof.
By consistency, we have for every , and . By rationality we then have for each and every
for every and every . Let with be given by
where , so that
for every , in and . Now notice that the restriction of ν to a history , , is the behavioral representation of . By Kuhn’s Theorem, the distribution over final outcomes induced by is the same as that induced by , so is a correlated equilibrium of the continuation game for every history , and we are done.
Let denote the conjecture of type at . We have:
Proposition 2.
Let G be a finite multi-stage game with observed actions, and let be a type space associated with G. Assume that there exists a consistent prior such that (i) player i is rational at all for every and (ii) for every for every , for each . Then ν defined above is a SPE.
Proof.
Fix and let be player i’s constant conjecture in the support of . By consistency and rationality, we have for each
where is as in Proposition 1. Hence
Now induction in the number of players shows that , and a fortiori , for all . The result then follows from Proposition 1.
4. Conclusion
The results in section 3 tell us the following: under consistency, rationality yields correlated SPE and adding constancy of conjectures to these two conditions yield SPE. These results are analogous to the results in Aumann [], Aumann and Brandenburger [] and Barelli []. As in the latter, beliefs are required to be consistent only at events that are potentially observable by an outsider, who could in principle force beliefs to be consistent by offering bets on the observable events. Rationality and constancy of conjectures have to hold in the support of the consistent prior. Because rationality and/or constancy of conjectures are implied by (but do not imply) rationality and/or conjectures being commonly known among the players, we have that rationality need not be common knowledge for players to play a correlated SPE, and neither do conjectures have to be common knowledge for players to play a SPE.
References
- Battigalli, P.; Sinischalchi, M. Hierarchies of Conditional Beliefs and Interactive Epistemology in Dynamic Games. J. Econ. Theory 1999, 88, 188–230. [Google Scholar] [CrossRef]
- Aumann, R. Correlated Equilibrium as an Expression of Bayesian Rationality. Econometrica 1987, 55, 1–18. [Google Scholar] [CrossRef]
- Aumann, R.; Brandenburger, A. Epistemic Conditions for Nash Equilibrium. Econometrica 1995, 63, 1161–1180. [Google Scholar] [CrossRef]
- Barelli, P. Consistency of Beliefs and Epistemic Conditions for Nash and Correlated Equilibria. Game. Econ. Behav. 2009, 67, 363–375. [Google Scholar] [CrossRef]
- Fudenberg, D.; Tirole, J. Game Theory; The MIT Press: Cambridge, MA, USA, 1992. [Google Scholar]
- 1.For simplicity we deal only with finite multi-stage games with with observed actions, so sequential rationality is well captured by subgame perfection; the analysis can be generalized to include incomplete information and/or more complex information structures, where sequential equilibrium is the relevant equilibrium concept to capture sequential rationality.
- 2.More precisely, if throughout the support of the CPS defined above we have rational strategy-type pairs.
- 3. denotes the space of probability measures on .
© 2010 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license http://creativecommons.org/licenses/by/3.0/.