Abstract
In (Bonanno, 2013), a solution concept for extensive-form games, called perfect Bayesian equilibrium (PBE), was introduced and shown to be a strict refinement of subgame-perfect equilibrium; it was also shown that, in turn, sequential equilibrium (SE) is a strict refinement of PBE. In (Bonanno, 2016), the notion of PBE was used to provide a characterization of SE in terms of a strengthening of the two defining components of PBE (besides sequential rationality), namely AGM consistency and Bayes consistency. In this paper we explore the gap between PBE and SE by identifying solution concepts that lie strictly between PBE and SE; these solution concepts embody a notion of “conservative” belief revision. Furthermore, we provide a method for determining if a plausibility order on the set of histories is choice measurable, which is a necessary condition for a PBE to be a SE.
1. Introduction
Since its introduction in 1982 [], sequential equilibrium has been the most widely used solution concept for extensive-form games. In applications, however, checking the “consistency” requirement for beliefs has proved to be rather difficult; thus, similarly motivated—but simpler—notions of equilibrium have been sought. The simplest solution concept is “weak sequential equilibrium” [,] which is defined as an assessment that is sequentially rational and satisfies Bayes’ rule at information sets that are reached with positive probability by the strategy profile (while no restrictions are imposed on the beliefs at information sets that have zero probability of being reached). However, this solution concept is too weak in that it is possible for an assessment (where σ is a strategy profile and μ is a system of beliefs) to be a weak sequential equilibrium without σ being a subgame-perfect equilibrium []. Hence the search in the literature for a “simple” (yet sufficiently strong) solution concept that lies in the gap between subgame-perfect equilibrium and sequential equilibrium. The minimal desired properties of such a solution concept, which is usually referred to as “perfect Bayesian equilibrium” (PBE), are sequential rationality and the “persistent” application of Bayes’ rule. The exact meaning of the latter requirement has not been easy to formalize.
Several attempts have been made in the literature to provide a satisfactory definition of PBE; they are reviewed in Section 5. In this paper we continue the study of one such notion, introduced in [], where it is shown that (a) the proposed solution concept is a strict refinement of subgame-perfect equilibrium; and (b) in general, the set of sequential equilibria is a proper subset of the set of perfect Bayesian equilibria. This definition of PBE is based on two notions (besides sequential rationality): (1) the qualitative property of AGM-consistency relative to a plausibility order1; and (2) the quantitative property of Bayes consistency. This notion of PBE was further used in [] to provide a new characterization of sequential equilibrium, in terms of a strengthening of both AGM consistency and Bayes consistency. In this paper we explore the gap between PBE and sequential equilibrium, by identifying solution concepts that lie strictly between PBE and sequential equilibrium. These solution concepts capture the notion of revising one’s beliefs in a “conservative” or “minimal” way.
The paper is organized as follows. Section 2 reviews the notation, definitions and main results of [,]. The new material is contained in Section 3 and Section 4. In Section 3 we introduce properties of the plausibility order that can be used to define solution concepts that lie between PBE and sequential equilibrium; the main result in this section is Proposition 2. In Section 4 we offer a method (Proposition 3) for determining whether a plausibility order satisfies the property of “choice measurability”, which is one of the two conditions that, together, are necessary and sufficient for a PBE to be a sequential equilibrium. Section 5 discusses related literature and Section 6 concludes. The proofs are given in Appendix A.
2. Perfect Bayesian Equilibrium and Sequential Equilibrium
In this section we review the notation and the main definitions and results of [,].
We adopt the history-based definition of extensive-form game (see, for example, []). If A is a set, we denote by the set of finite sequences in A. If and , the sequence is called a prefix of h; if then we say that is a proper prefix of h. If and , we denote the sequence by .
A finite extensive form is a tuple whose elements are:
- A finite set of actions A.
- A finite set of histories which is closed under prefixes (that is, if and is a prefix of h, then ). The null history denoted by ∅, is an element of H and is a prefix of every history. A history such that, for every , , is called a terminal history. The set of terminal histories is denoted by Z. denotes the set of non-terminal or decision histories. For every history , we denote by the set of actions available at h, that is, . Thus if and only if . We assume that (that is, we restrict attention to actions that are available at some decision history).
- A finite set of players. In some cases there is also an additional, fictitious, player called chance.
- A function that assigns a player to each decision history. Thus is the player who moves at history h. A game is said to be without chance moves if for every For every , let be the set of histories assigned to player i. Thus is a partition of If history h is assigned to chance, then a probability distribution over is given that assigns positive probability to every .
- For every player , is an equivalence relation on . The interpretation of is that, when choosing an action at history h, player i does not know whether she is moving at h or at . The equivalence class of is denoted by and is called an information set of player i; thus . The following restriction applies: if then , that is, the set of actions available to a player is the same at any two histories that belong to the same information set of that player.
- The following property, known as perfect recall, is assumed: for every player , if , and is a prefix of then for every there exists an such that is a prefix of . Intuitively, perfect recall requires a player to remember what she knew in the past and what actions she took previously.
Given an extensive form, one obtains an extensive gameby adding, for every player ,a utility (or payoff) function (where denotes the set of real numbers).
A total pre-order on the set of histories H is a binary relation ≾ which is complete2 and transitive3. We write as a short-hand for the conjunction: and , and write as a short-hand for the conjunction: and not .
Definition 1.
Given an extensive form, a plausibility order is a total pre-order ≾ on H that satisfies the following properties: ,
| , | |
| (i) such that , | |
| (ii) if then, , | |
| if history h is assigned to chance, then , |
The interpretation of is that history h is at least as plausible as history ; thus means that h is more plausible than and means that h is just as plausible as 4. Property says that adding an action to a decision history h cannot yield a more plausible history than h itself. Property says that at every decision history h there is at least one action a which is “plausibility preserving” in the sense that adding a to h yields a history which is as plausible as h; furthermore, any such action a performs the same role with any other history that belongs to the same information set as h. Property says that all the actions at a history assigned to chance are plausibility preserving.
Definition 2.
Given an extensive-form, an assessment is AGM-consistent if there exists a plausibility order ≾ on the set of histories H such that:
- (i)
- the actions that are assigned positive probability by σ are precisely the plausibility-preserving actions: ,
- (ii)
- the histories that are assigned positive probability by μ are precisely those that are most plausible within the corresponding information set:
If ≾ satisfies properties and with respect to , we say that ≾ rationalizes .
An assessment is sequentially rational if, for every player i and every information set I of hers, player i’s expected payoff—given the strategy profile σ and her beliefs at I (as specified by μ)—cannot be increased by unilaterally changing her choice at I and possibly at information sets of hers that follow I6.
Consider the extensive-form game shown in Figure 17 and the assessment where and μ is the following system of beliefs: and . This assessment is AGM-consistent, since it is rationalized by the following plausibility order8:
Figure 1.
An extensive-form game.
Furthermore is sequentially rational9. The property of AGM-consistency imposes restrictions on the support of the behavior strategy σ and on the support of the system of beliefs μ. The following property imposes constraints on how probabilities can be distributed over those supports.
Definition 3.
Given an extensive form, let ≾ be a plausibility order that rationalizes the assessment . We say that is Bayes consistent (or Bayesian) relative to ≾ if, for every equivalence class E of ≾ that contains some decision history h with [that is, , where ], there exists a probability density function (recall that H is a finite set) such that:
Property requires that if and only if and . Property requires to be consistent with the strategy profile σ in the sense that if , , and then the probability that assigns to is equal to the probability that assigns to h multiplied by the probabilities (according to σ) of the actions that lead from h to 10. Property requires the system of beliefs μ to satisfy Bayes’ rule in the sense that if and (so that E is the equivalence class of the most plausible elements of ) then, for every history , (the probability assigned to by μ) coincides with the probability of conditional on using the probability density function 11.
Consider again the game of Figure 1, and the assessment where and and . Let ≾ be the plausibility order (1) given above, which rationalizes . Then is Bayes consistent relative to ≾. In fact, we have that and the equivalence classes of ≾ that have a non-empty intersection with are , and . Let , , and . Then the three probability density functions , and satisfy the properties of Definition 3 and hence is Bayes consistent relative to ≾.
Definition 4.
An assessment is a perfect Bayesian equilibrium (PBE) if it is sequentially rational, it is rationalized by a plausibility order on the set of histories and is Bayes consistent relative to it.
We saw above that, for the game illustrated in Figure 1, the assessment where and and is sequentially rational, it is rationalized by the plausibility order (1) and is Bayes consistent relative to it. Thus it is a perfect Bayesian equilibrium.
Remark 1.
It is proved in [] that if is a perfect Bayesian equilibrium then σ is a subgame-perfect equilibrium and that every sequential equilibrium is a perfect Bayesian equilibrium. Furthermore, the notion of PBE is a strict refinement of subgame-perfect equilibrium and sequential equilibrium is a strict refinement of PBE.
Next we recall the definition of sequential equilibrium []. An assessment is KW-consistent (KW stands for ‘Kreps-Wilson’) if there is an infinite sequence of completely mixed behavior strategy profiles such that, letting be the unique system of beliefs obtained from by applying Bayes’ rule12, . An assessment is a sequential equilibrium if it is KW-consistent and sequentially rational. In [] it is shown that sequential equilibrium can be characterized as a strengthening of PBE based on two properties: (1) a property of the plausibility order that constrains the supports of the belief system; and (2) a strengthening of the notion of Bayes consistency, that imposes constraints on how the probabilities can be distributed over those supports. The details are given below.
Given a plausibility order ≾ on the finite set of histories H, a function (where denotes the set of non-negative integers) is said to be an ordinal integer-valued representation of ≾ if, for every ,
Since H is finite, the set of ordinal integer-valued representations is non-empty. A particular ordinal integer-valued representation, which we will call canonical and denote by ρ, is defined as follows.
Definition 5.
Let , and, in general, for every integer , . Thus is the equivalence class of ≾ containing the most plausible histories, is the equivalence class containing the most plausible among the histories left after removing those in , etc.13 The canonical ordinal integer-valued representation of ≾, , is given by
We call the rank of history
Instead of an ordinal integer-valued representation of the plausibility order one could seek a cardinal representation which, besides (2), satisfies the following property: if h and belong to the same information set (that is, ) and , then
If we think of F as measuring the “plausibility distance” between histories, then we can interpret as a distance-preserving condition: the plausibility distance between two histories in the same information set is preserved by the addition of the same action.
Definition 6.
A plausibility order ≾ on the set of histories H is choice measurable if it has at least one integer-valued representation that satisfies property .
For example, the plausibility order (1) is not choice measurable, since any integer-valued representation F of it must be such that and .
Let be an assessment which is rationalized by a plausibility order ≾. As before, let be the set of decision histories to which μ assigns positive probability: . Let be the set of equivalence classes of ≾ that have a non-empty intersection with . Clearly is a non-empty, finite set. Suppose that is Bayesian relative to ≾ and let be a collection of probability density functions that satisfy the properties of Definition 3. We call a probability density function a full-support common prior of if, for every , , that is, for all , . Note that a full support common prior assigns positive probability to all decision histories, not only to those in .
Definition 7.
Consider an extensive form. Let be an assessment which is rationalized by the plausibility order ≾ and is Bayesian relative to it and let be a collection of probability density functions that satisfy the properties of Definition 3. We say that is uniformly Bayesian relative to ≾ if there exists a full-support common prior of that satisfies the following properties.
We call such a function ν auniform full-support common prior of .
requires that the common prior ν be consistent with the strategy profile σ, in the sense that if then (thus extending Property of Definition 3 from to D). requires that the relative probability, according to the common prior ν, of any two histories that belong to the same information set remain unchanged by the addition of the same action.
It is shown in [] that choice measurability and uniform Bayesian consistency are independent properties. The following proposition is proved in [].
Proposition 1.
(I) and (II) below are equivalent:
- (I)
- () is a perfect Bayesian equilibrium which is rationalized by a choicemeasurable plausibility order and is uniformly Bayesian relative to it.
- (II)
- () is a sequential equilibrium.
3. Exploring the Gap between PBE and Sequential Equilibrium
The notion of perfect Bayesian equilibrium (Definition 4) incorporates—through the property of AGM-consistency—a belief revision policy which can be interpreted either as the epistemic state of an external observer14 or as a belief revision policy which is shared by all the players15. For example, the perfect Bayesian equilibrium considered in Section 2 for the game of Figure 1, namely and , reflects the following belief revision policy: the initial beliefs are that Player 1 will play d; conditional on learning that Player 1 did not play d, the observer would become convinced that Player 1 played either b or c (that is, she would judge a to be less plausible than b and she would consider c to be as plausible as b) and would expect Player 2 to play e; upon learning that (Player 1 did not play d and) Player 2 played f, the observer would become convinced that Player 1 played either a or b, hence judging to be as plausible as , thereby modifying her earlier judgment that a was less plausible than b. Although such a belief revision policy does not violate the rationality constraints introduced in [], it does involve a belief change that is not “minimal”or “conservative”. Such “non-minimal” belief changes can be ruled out by imposing the following restriction on the plausibility order: if h and belong to the same information set (that is, ) and a is an action available at h(), then
says that if h is deemed to be at least as plausible as then the addition of any available action a must preserve this judgment, in the sense that must be deemed to be at least as plausible as , and vice versa; it can also be viewed as an “independence” condition, in the sense that observation of a new action cannot lead to a change in the relative plausibility of previous histories16. Any plausibility order that rationalizes the assessment and , for the game of Figure 1 must violate (since while ).
We can obtain a strengthening of the notion of perfect Bayesian equilibrium (Definition 4) by (1) adding property ; and (2) strengthening Bayes consistency (Definition 3) to uniform Bayesian consistency (Definition 7).
Definition 8.
Given an extensive-form game, an assessment (σ,μ) is a weakly independent perfect Bayesian equilibrium if it is sequentially rational, it is rationalized by a plausibility order that satisfies and is uniformly Bayesian relative to that plausibility order.
As an example of a weakly independent PBE consider the game of Figure 2 and the assessment (σ,μ) where (highlighted by double edges in Figure 2) and (thus ) (the decision histories x such that are shown as black nodes and the decision histories x such that are shown as gray nodes)). This assessment is sequentially rational and is rationalized by the following plausibility order:
Figure 2.
It is straightforward to check that plausibility order (4) satisfies 17. To see that (σ,μ) is uniformly Bayesian relative to plausibility order (4), note that and thus the only equivalence classes that have a non-empty intersection with are , , and . Letting , , and , this collection of probability distributions satisfies the Properties of Definition 3. Let ν be the uniform distribution over the set of decision histories (thus for every ). Then ν is a full support common prior of the collection and satisfies Properties and of Definition 7.
Note, however, that (σ,μ) is not a sequential equilibrium. This can be established by showing that (σ,μ) is not KW-consistent; however, we will show it by appealing to the following lemma (proved in Appendix A) which highlights a property that will motivate a further restriction on belief revision (property below).
Lemma 1.
Let ≾ be a plausibility order over the set H of histories of an extensive-form game and let be an integer-valued representation of ≾ (that is, for all , if and only if ). Then the following are equivalent:
- (A)
- F satisfies Property (Definition 6)
- (B)
- F satisfies the following property: for all and , if then.
Using Lemma 1 we can prove that the assessment (σ,μ) where and , for the game of Figure 2, is not a sequential equilibrium. By Proposition 1 it will be sufficient to show that (σ,μ) cannot be rationalized by a choice measurable plausibility order (Definition 6). Let ≾ be a plausibility order that rationalizes (σ,μ) and let F be an integer-valued representation of ≾. Then, by ( of Definition 2, it must be that (because and ) and (because and ); thus and , so that F violates property ; hence, by Lemma 1, F violates property and thus ≾ is not choice measurable.
The ordinal counterpart to Property is Property below, which can be viewed as another “independence” condition: it says that if action a is implicitly judged to be at least as plausible as action b, conditional on history h (that is, ), then the same judgment must be made conditional on any other history that belongs to the same information set as h: if and , then
Note that Properties and are independent. An example of a plausibility order that violates but satisfies is plausibility order (1) for the game of Figure 1: is violated because but and is satisfied because at every non-singleton information set there are only two choices, one of which is plausibility preserving and the other is not. An example of a plausibility order that satisfies but violates is plausibility order (4) for the game of Figure 218. Adding Property to the properties given in Definition 8 we obtain a refinement of the notion of weakly independent perfect Bayesian equilibrium.
Definition 9.
Given an extensive-form game, an assessment (σ,μ) is a strongly independent perfect Bayesian equilibrium if it is sequentially rational, it is rationalized by a plausibility order that satisfies Properties and , and is uniformly Bayesian relative to that plausibility order.
The following proposition states that the notions of weakly/strongly independent PBE identify two (nested) solution concepts that lie strictly in the gap between PBE and sequential equilibrium. The proof of the first part of Proposition 2 is given in Appendix A, while the example of Figure 3 establishes the second part.
Figure 3.
Proposition 2.
Consider an extensive-form game and an assessment (σ,μ). If (σ,μ) is a sequential equilibrium then it is a strongly independent perfect Bayesian equilibrium (PBE). Furthermore, there are games where there is a strongly independent PBE which is not a sequential equilibrium.
To see that the notion of strongly independent PBE is weaker than sequential equilibrium, consider the game of Figure 3 (which is based on an example discussed in [,,]) and the assessment where (highlighted by double edges), for and for every other decision history x (the decision histories x such that are shown as black nodes and the decision histories x such that are shown as gray nodes). This assessment is rationalized by the following plausibility order:
It is straightforward to check that plausibility order (5) satisfies Properties 19 and 20. Furthermore (σ,μ) is trivially uniformly Bayesian relative to plausibility order (5)21. Thus (σ,μ) is a strongly independent PBE. Next we show that (σ,μ) is not a sequential equilibrium, by appealing to Proposition 1 and showing that any plausibility order that rationalizes (σ,μ) is not choice measurable22. Let ≾ be a plausibility order that rationalizes ; then it must satisfy the following properties:
- (because they belong to the same information set and while ). Thus if F is any integer-valued representation of ≾ it must be that
- (because and belong to the same information set and while ; furthermore, ℓ is a plausibility-preserving action since ). Thus if F is any integer-valued representation of ≾ it must be that
- (because ℓ is a plausibility-preserving action, and belong to the same information set and while ). Thus if F is any integer-valued representation of ≾ it must be that
Suppose that ≾ is choice measurable and let F be an integer-valued representation of it that satisfies Property . From (6) and (7) we get that
and by Property it must be that
Subtracting from both sides of (8) we obtain
It follows from (11) and (12) that , which can be written as , yielding a contradiction, because Property requires that .
Are the notions of weakly/strongly independent PBE “better” or “more natural” than the basic notion of PBE? This issue will be discussed briefly in Section 6.
4. How to Determine if a Plausibility Order Is Choice Measurable
In this section we provide a method for determining if a plausibility order is choice measurable. More generally, we provide a necessary and sufficient condition that applies not only to plausibility orders over sets of histories in a game but to a more general class of structures.
Let S be an arbitrary finite set and let ≾ be a total pre-order on S. Let be the set of ≾-equivalence classes of S. If the equivalence class of s is denoted by (where, as before, is a short-hand for “ and ”); thus . Let ≐ be an equivalence relation on . The interpretation of is that the distance between the equivalence classes and is required to be equal to the distance between the equivalence classes and .
Remark 2.
In the special case of a plausibility order ≾ on the set of histories H of a game, we shall be interested in the following equivalence relation ≐ on , which is meant to capture property above: if , , and are equivalence classes of ≾ then if and only if there exist two decision histories that belong to the same information set [] and a non-plausibility-preserving action such that and (or and ).
The general problem that we are addressing is the following.
Problem 1.
Given a pair , where ≾ is a total pre-order on a finite set S and ≐ is an equivalence relation on the set of pairs of equivalence classes of ≾, determine whether there exists a function such that, for all , (1) if and only if and (2) if , with and , then .
Instead of expressing the equivalence relation ≐ in terms of pairs of elements of , we shall express it in terms of pairs of numbers obtained by using the canonical ordinal representation ρ of ≾23. That is, if and then we shall write this as . For example, let and let ≾ be as shown in (13) below, together with the corresponding canonical representation ρ24:
If the equivalence relation ≐ contains the following pairs25:
A bag (or multiset) is a generalization of the notion of set in which members are allowed to appear more than once. An example of a bag is . Given two bags and their union, denoted by , is the bag that contains those elements that occur in either or and, furthermore, the number of times that each element occurs in is equal to the number of times it occurs in plus the number of times it occurs in . For instance, if and then . We say that is a proper sub-bag of , denoted by if and each element that occurs in occurs also, and at least as many times, in For example,
Given a pair with , we associate with it the set . For example, Given a set of pairs (with for every ) we associate with it the bag . For example, if then
Definition 10.
For every element of ≐, expressed (using the canonical representation ρ) as (with and ), the equation corresponding to it is . By the system of equations corresponding to ≐ we mean the set of all such equations26.
For example, consider the total pre-order given in (13) and the following equivalence relation ≐ (expressed in terms of ρ and omitting the reflexive pairs):
Then the corresponding system of equations is given by:
We are now ready to state the solution to Problem 1. The proof is given in Appendix A.
Proposition 3.
Given a pair , where ≾ is a total pre-order on a finite set S and ≐ is an equivalence relation on the set of pairs of equivalence classes of ≾, (A), (B) and (C) below are equivalent.
- (A)
- There is a function such that, for all , (1) if and only if ; and (2) if , with and , then ,
- (B)
- The system of equations corresponding to ≐ (Definition 10) has a solution consisting of positive integers.
- (C)
- There is no sequence in ≐ (expressed in terms of the canonical representation ρ of ≾ ) such that where and .
As an application of Proposition 3 consider again the game of Figure 3 and plausibility order (5) which rationalizes the assessment , for and for every other decision history x; the order is reproduced below together with the canonical integer-valued representation ρ:
By Remark 2, two elements of ≐ are and , which—expressed in terms of the canonical ordinal representation ρ—can be written as
Then and . Thus, since , by Part (C) of Proposition 3 ≾ is not choice measurable.
As a further application of Proposition 3 consider the total pre-order ≾ given in (13) together with the subset of the equivalence relation ≐ given in (14). Then there is no cardinal representation of ≾ that satisfies the constraints expressed by ≐, because of Part (C) of the above proposition and the following sequence27:
where
In fact, the above sequence corresponds to the following system of equations:
Adding the four equations we get which simplifies to , which is incompatible with a positive solution.
Remark 3.
In [] an algorithm is provided for determining whether a system of linear equations has a positive solution and for calculating such a solution if one exists. Furthermore, if the coefficients of the equations are integers and a positive solution exists, then the algorithm yields a solution consisting of positive integers.
5. Related Literature
As noted in Section 1, the quest in the literature for a “simple” solution concept intermediate between subgame-perfect equilibrium and sequential equilibrium has produced several attempts to provide a general definition of perfect Bayesian equilibrium.
In [] a notion of perfect Bayesian equilibrium was provided for a small subset of extensive-form games (namely the class of multi-stage games with observed actions and independent types), but extending that notion to arbitrary games proved to be problematic28.
In [] a notion of perfect Bayesian equilibrium is provided that can be applied to general extensive-form games (although it was defined only for games without chance moves); however, the proposed definition is in terms of a more complex object, namely a “tree-extended assessment” where ν is a conditional probability system on the set of terminal nodes. The main idea underlying the notion of perfect Bayesian equilibrium proposed in [] is what the author calls “strategic independence”: when forming beliefs, the strategic choices of different players should be regarded as independent events.
Several more recent contributions [,,] have re-addressed the issue of providing a definition of perfect Bayesian equilibrium that applies to general extensive-form games. Since [] has been the focus of this paper, here we shall briefly discuss [,]. In [] the notion of “simple perfect Bayesian equilibrium” is introduced and it is shown to lie strictly between subgame-perfect equilibrium and sequential equilibrium. This notion is based on an extension of the definition of sub-tree, called “quasi sub-tree”, which consists of an information set I together with all the histories that are successors of histories in I (that is, is a quasi-subtree that starts at I if if and only if there exists an such that h is a prefix of ). A quasi sub-tree is called regular if it satisfies the following property: if and then (that is, every information set that has a non-empty intersection with is entirely included in ). An information set I is called regular if the quasi-subtree that starts at I is regular. For example, in the game of Figure 4, the singleton information set of Player 2 is not regular. An assessment () is defined to be a “simple perfect Bayesian equilibrium” if it is sequentially rational and, for every regular quasi-subtree , Bayes’ rule is satisfied at every information set that is reached with positive probability by σ in (in other words, if the restriction of () to every regular quasi-subtree is a weak sequential equilibrium of the quasi-subtree). This notion of perfect Bayesian equilibrium is weaker than the notion considered in this paper (Definition 4). For example, in the game of Figure 4, the pure-strategy profile (highlighted by double edges), together with the system of beliefs , is a simple perfect Bayesian equilibrium, while (as shown in []) there is no system of beliefs such that is a perfect Bayesian equilibrium. A fortiori, the notion of simple perfect Bayesian equilibrium is weaker than the refinements of PBE discussed in the Section 3.
Figure 4.
In [], the author proposes a definition of perfect Bayesian equilibrium which is framed not in terms of assessments but in terms of “appraisals”. Each player is assumed to have a (possibly artificial) information set representing the beginning of the game and an appraisal for player i is a map that associates with every information set of player i a probability distribution over the set of pure-strategy profiles that reach that information set. Thus an appraisal for player i captures, for every information set of hers, her conjecture about how the information set was reached and what will happen from this point in the game. An appraisal system is defined to be “plainly consistent” if, whenever an information set of player i has a product structure (each information set is identified with the set of pure-strategy profiles that reach that information set), the player’s appraisal at that information set satisfies independence29. A strategy profile σ is defined to be a perfect Bayesian equilibrium if there is a plainly consistent appraisal system P that satisfies sequential rationality and is such that at their “initial” information sets all the players assign probability 1 to σ; in [] (p. 15), the author summarizes the notion of PBE as being based on “a simple foundation: sequential rationality and preservation of independence and Bayesian updating where applicable” (that is, on subsets of strategy profiles that have the appropriate product structure and independence property). Despite the fact that the notion of PBE suggested in [] incorporates a notion of independence, it can be weaker than the notion of PBE discussed in Section 2 (Definition 4) and thus, a fortiori, weaker than the notion of weakly independent PBE (Definition 8, Section 3). This can be seen from the game of Figure 5, which essentially reproduces an example given in []. The strategy profile (highlighted by double edges), together with any system of beliefs μ such that cannot be a PBE according to Definition 4 (Section 2). In fact, since while , any plausibility order that rationalizes must be such that , which implies that (because cannot be most plausible in the set ). On the other hand, σ can be a PBE according to the definition given in [] (p. 15), since the information set of Player 3 does not have a product structure so that Player 3 is not able to separate the actions of Players 1 and 2. For example, consider the appraisal system P where, initially, all the players assign probability 1 to σ and, at his information set, Player 2 assigns probability 1 to the strategy profile of Players 1 and 3 and, at her information set, Player 3 assigns probability to each of the strategy profiles and of Players 1 and 2. Then P is plainly consistent and sequentially rational, so that σ is a PBE as defined in [].
Figure 5.
Thus, a fortiori, the notion of perfect Bayesian equilibrium given in [] can be weaker than the notions of weakly/strongly independent PBE introduced in Section 3.
6. Conclusions
Besides sequential rationality, the notion of perfect Bayesian equilibrium (Definition 4) introduced in [] is based on two elements: (1) rationalizability of the assessment by a plausibility order (Definition 2); and (2) the notion of Bayesian consistency relative to the plausibility order. The first property identifies the set of decision histories that can be assigned positive conditional probability by the system of beliefs, while the second property imposes constraints on how conditional probabilities can be distributed over that set in order to guarantee “Bayesian updating as long as possible”30. In [] it was shown that by strengthening these two conditions one obtains a “limit free” characterization of sequential equilibrium. The strengthening of the first condition is that the plausibility order that rationalizes the given assessment be choice measurable, that it, that there be a cardinal representation of it (which can be interpreted as measuring the plausibility distance between histories in a way that is preserved by the addition of a common action). The strengthening of the second condition imposes “uniform consistency” on the conditional probability density functions on the equivalence classes of the plausibility order, by requiring that there be a full-support common prior that preserves the relative probabilities of two decision histories in the same information set when a common action is added. There is a “substantial” gap between the notion of PBE and that of sequential equilibrium. In this paper we identified two solution concepts that lie in this gap. The first notion, weakly independent PBE (Definition 8), is obtained by adding a restriction on the belief revision policy encoded in the plausibility order that rationalizes the given assessment (together with strengthening Bayes consistency to uniform Bayes consistency). This restriction says that observation of a new action at an information set should not alter the relative plausibility of any two histories in that information set (condition ); it can be interpreted as an “independence” or “conservative” principle, in the sense that observation of a new action should not lead to a reversal of judgment of plausibility concerning past histories. The second notion, strongly independent PBE (Definition 9), is obtained by adding to the first notion a further restriction, according to which the implicit plausibility ranking of two actions available at the same information set should be independent of the history at which the actions are taken.
A further contribution of this paper has been to provide a method for determining if a plausibility order is choice measurable, which is one of the two conditions that, together, are necessary and sufficient for a PBE to be a sequential equilibrium.
This paper highlights the need to conceptualize refinements of subgame-perfect equilibrium in extensive form games in terms of principles of belief revision. Through the notion of plausibility order and AGM-consistency we have appealed to the principles of belief revision that underlie the so-called AGM theory []. However, in a dynamic game, beliefs typically need to be revised several times in a sequence as new information sets are reached. Thus the relevant notion of belief revision is iterated belief revision. There is an extensive literature on the topic of iterated belief revision (see, for example, [,,,] and the special issue of the Journal of Philosophical Logic, Vol. 40 (2), April 2011). An exploration of different solution concepts in the gap between PBE and sequential equilibrium, based on different principles of iterated belief revision, seems to be a promising area of research.
Conflicts of Interest
The author declares no conflict of interest.
Appendix A. Proofs
Proof of Lemma 1.
Let ≾ be a plausibility order on the set of histories H and let be an integer-valued representation of ≾. We want to show that properties and below are equivalent (for arbitrary , with , and ))
First of all, note that, without loss of generality, we can assume that 31.
First we show that Let F be an integer-valued representation of ≾ that satisfies . For every decision history h and action , define
The function is well defined, since, by , implies that . Then, for every history , . In fact,
Thus, for every and , . Hence, and and, therefore,
Next we show that Let ≾ be a plausibility order and let be an integer-valued representation that satisfies . Select arbitrary and . Let be a plausibility-preserving action at h (there must be at least one such action: see Definition 1); then, and . Hence, since F is a representation of ≾, and and thus
By , . From this and (A2) it follows that . ☐
Proof of Proposition 2.
Let be a sequential equilibrium. We want to show that is a strongly independent PBE (Definition 9). By Proposition 1, it is sufficient to show that is rationalized by a plausibility order ≾ that satisfies Properties and . By Proposition 1 there is a choice measurable plausibility order ≾ that rationalizes . Let F be an integer-valued representation of ≾ that satisfies Property . Let h and be decision histories that belong to the same information set and let . Then, by ,
If then ; by (A3), it follows that and thus . Conversely, if then and thus, by (A3), so that . Hence ≾ satisfies .
Let h and be decision histories that belong to the same information set and let . We want to show that holds, that is, that if and only if . Let F be an integer-valued representation of ≾ that satisfies Property . By Lemma 1 F satisfies Property , that is,
Proof of Proposition 3.
. Let satisfy the properties of Part (A). Select an arbitrary and define by . Then F is also a function that satisfies the properties of Part (A) (note that since, for all , , ; furthermore, for all ). Let for some (where ρ is the canonical ordinal representation of ≾: see Footnote 23). For every , define
Thus is the distance, as measured by F, between the equivalence class of some t such that and the immediately preceding equivalence class (that is, the equivalence class of some s such that )32. Note that is well defined since, if are such that and , then and and thus, by (1) of Property (A), and . Note also that, for all , is a positive integer, since and imply that and thus, by (1) of Property (A), . We want to show that the values defined in (A5) provide a solution to the system of equations corresponding to ≐ (Definition 10). Select an arbitrary element of ≐, (with and and express it, using the canonical ordinal representation ρ (see Footnote 23), as (thus , , , , and ). Then the corresponding equation (see Definition 10) is: . By (2) of Property (A),
Similarly, . Thus, by (A6), .
. Assume that the system of equations corresponding to ≐ has a solution consisting of positive integers . Define as follows: if (equivalently, ) then and if (equivalently, for ) then (where ρ and the sets are as defined in Footnote 23). We need to show that F satisfies the properties of Part (A). Select arbitrary with . Then and thus . Conversely, suppose that are such that . Then and thus , so that . Thus Property (1) of Part (A) is satisfied. Now let be such that , and Let , , and (thus and ). Then, by (A5), and . Since is the equation corresponding to (which - using ρ - can be expressed as ), by our hypothesis and thus , so that (2) of Property (A) is satisfied.
. Suppose that there is a sequence in ≐ (expressed in terms of the canonical representation ρ of ≾) such that
where and . Let be the system of equations corresponding to the above sequence (for example, is the equation ). Let L be the sum of the left-hand-side and R be the sum of the right-hand-side of the equations . Note that for every integer i, is a summand of L if and only if i appears in exactly n times and similarly is a summand of R if and only if i appears in exactly n times. By (A7), if is a summand of L then is a summand of R with and, furthermore, . Thus there cannot be a positive solution of , because it would be incompatible with . Since is a subset of the system of equations corresponding to ≐, it follows that the latter cannot have a positive solution either.
It only remains to prove that . We will return to this below after providing an additional result. ☐
First some notation. Given two vectors we write (1) if , for every ; (2) if and ; and (3) if , for every .
Lemma 2.
Let A be the matrix such that the system of equations corresponding to ≐ (Definition 10) can be expressed as (note that each entry of A is either , 0 or 1; furthermore, by symmetry of ≐, for each row of A there is another row such that )33. If the system of equations does not have a positive integer solution then there exist r rows of A, with and r positive integers such that if B is the submatrix of A consisting of the r rows (thus for every , , where is the row of B) then .
Proof.
By Stiemke’s theorem34 if the system of equations does not have a positive integer solution then there exists a (where denotes the set of integers) such that (that is, ). Let . Let r be the cardinality of K; then, without loss of generality, we can assume that 35. Furthermore, again without loss of generality, we can assume that for every , 36. Let B be the submatrix of A consisting of those rows of A such that and for let be the vector corresponding to 37.
Then and for all . ☐
Completion of Proof of Proposition 3.
It remains to prove that . Let A be the matrix such that the system of equations corresponding to ≐ can be expressed as and assume that does not have a positive integer solution. Let B be the submatrix of A and the vector of positive integers of Lemma 2 such that . Define two matrices and as follows (recall that each entry of B is either , 0 or 1):
Then, for every , and thus (since )
Let be the matrix obtained from C by replacing each row of C with copies of it and let be constructed from D similarly. Then, letting , and are matrices whose entries are either 0 or 1. It follows from (A9) that
Consider the system of equations
For every , the coordinate of is the number of times that the variable appears on the left-hand-side of (A11) and the coordinate of is the number of times that the variable appears on the right-hand-side of (A11). Hence, by (A10), for every , the number of times that the variable appears on the left-hand-side of (A11) is less than or equal to the number of times that it appears on the right-hand-side of (A11) and for at least one j it is less. Thus, letting be the sequence of elements of ≐ corresponding to the equations in (A11), we have that where and . ☐
References
- Kreps, D.; Wilson, R. Sequential equilibrium. Econometrica 1982, 50, 863–894. [Google Scholar] [CrossRef]
- Mas-Colell, A.; Whinston, M.D.; Green, J.R. Microeconomic Theory; Oxford University Press: Oxford, UK, 1995. [Google Scholar]
- Myerson, R. Game Theory: Analysis of Conflict; Harvard University Press: Cambridge, MA, USA, 1991. [Google Scholar]
- Selten, R. Re-examination of the perfectness concept for equilibrium points in extensive games. Int. J. Game Theory 1975, 4, 25–55. [Google Scholar] [CrossRef]
- Bonanno, G. AGM-consistency and perfect Bayesian equilibrium. Part I: Definition and properties. Int. J. Game Theory 2013, 42, 567–592. [Google Scholar] [CrossRef]
- Alchourrón, C.; Gärdenfors, P.; Makinson, D. On the logic of theory change: Partial meet contraction and revision functions. J. Symb. Log. 1985, 50, 510–530. [Google Scholar] [CrossRef]
- Bonanno, G. AGM belief revision in dynamic games. In Proceedings of the 13th Conference on Theoretical Aspects of Rationality and Knowledge (TARK XIII), Groningen, The Netherlands, 12–14 July 2011; Apt, K.R., Ed.; ACM: New York, NY, USA, 2011; pp. 37–45. [Google Scholar]
- Bonanno, G. AGM-consistency and perfect Bayesian equilibrium. Part II: From PBE to sequential equilibrium. Int. J. Game Theory 2005. [Google Scholar] [CrossRef]
- Osborne, M.; Rubinstein, A. A Course In Game Theory; MIT Press: Cambridge, MA, USA, 1994. [Google Scholar]
- Hendon, E.; Jacobsen, J.; Sloth, B. The one-shot-deviation principle for sequential rationality. Games Econ. Behav. 1996, 12, 274–282. [Google Scholar] [CrossRef]
- Perea, A. A note on the one-deviation property in extensive form games. Games Econ. Behav. 2002, 40, 322–338. [Google Scholar] [CrossRef]
- Kohlberg, E.; Reny, P. Independence on relative probability spaces and consistent assessments in game trees. J. Econ. Theory 1997, 75, 280–313. [Google Scholar] [CrossRef]
- Perea, A.; Jansen, M.; Peters, H. Characterization of consistent assessments in extensive-form games. Games Econ. Behav. 1997, 21, 238–252. [Google Scholar] [CrossRef]
- Battigalli, P. Strategic independence and perfect Bayesian equilibria. J. Econ. Theory 1996, 70, 201–234. [Google Scholar] [CrossRef]
- Dines, L. On positive solutions of a system of linear equations. Ann. Math. 1926–1927, 28, 386–392. [Google Scholar] [CrossRef]
- Fudenberg, D.; Tirole, J. Perfect Bayesian equilibrium and sequential equilibrium. J. Econ. Theory 1991, 53, 236–260. [Google Scholar] [CrossRef]
- González-Díaz, J.; Meléndez-Jiménez, M.A. On the notion of perfect Bayesian equilibrium. TOP J. Span. Soc. Stat. Oper. Res. 2014, 22, 128–143. [Google Scholar] [CrossRef]
- Watson, J. Perfect Bayesian Equilibrium: General Definitions and Illustrations; Working Paper; University of California San Diego: San Diego, CA, USA, 2016. [Google Scholar]
- Bonanno, G. Belief change in branching time: AGM-consistency and iterated revision. J. Philos. Log. 2012, 41, 201–236. [Google Scholar] [CrossRef]
- Boutilier, C. Iterated revision and minimal change of conditional beliefs. J. Philos. Log. 1996, 25, 262–305. [Google Scholar] [CrossRef]
- Darwiche, A.; Pearl, J. On the logic of iterated belief revision. Artif. Intell. 1997, 89, 1–29. [Google Scholar] [CrossRef]
- Stalnaker, R. Iterated belief revision. Erkenntnis 2009, 70, 189–209. [Google Scholar] [CrossRef]
- Schrijver, A. Theory of Linear and Integer Programming; John Wiley & Sons: Hoboken, NJ, USA, 1986. [Google Scholar]
- Fishburn, P.C. Finite linear qualitative probability. J. Math. Psychol. 1996, 40, 64–77. [Google Scholar] [CrossRef]
- 1The acronym ‘AGM’ stands for ‘Alchourrón-Gärdenfors-Makinson’ who pioneered the literature on belief revision: see []. As shown in [], AGM-consistency can be derived from the primitive concept of a player’s epistemic state, which encodes the player’s initial beliefs and her disposition to revise those beliefs upon receiving (possibly unexpected) information. The existence of a plausibility order that rationalizes the epistemic state of each player guarantees that the belief revision policy of each player satisfies the so-called AGM axioms for rational belief revision, which were introduced in [].
- 2, either or .
- 3, if and then .
- 4As in [] we use the notation rather than the, perhaps more natural, notation , for two reasons: (1) it is the standard notation in the extensive literature that deals with AGM belief revision (for a recent survey of this literature see the special issue of the Journal of Philosophical Logic, Vol. 40 (2), April 2011); and (2) when representing the order ≾ numerically it is convenient to assign lower values to more plausible histories. An alternative reading of is “history h (weakly) precedes in terms of plausibility”.
- 5A behavior strategy profile is a list of probability distributions, one for every information set, over the actions available at that information set. A system of beliefs is a collection of probability distributions, one for every information set, over the histories in that information set.
- 6The precise definition is as follows. Let Z denote the set of terminal histories and, for every player i, let be player i’s von Neumann-Morgenstern utility function. Given a decision history h, let be the set of terminal histories that have h as a prefix. Let be the probability distribution over induced by the strategy profile σ, starting from history h (that is, if z is a terminal history and then ). Let I be an information set of player i and let be player i’s expected utility at I if σ is played, given her beliefs at I (as specified by μ). We say that player i’s strategy is sequentially rational at I if for every strategy of player i (where denotes the strategy profile of the players other than i). An assessment is sequentially rational if, for every player i and for every information set I of player i, is sequentially rational at Note that there are two definitions of sequential rationality: the weakly localone—which is the one adopted here—according to which at an information set a player can contemplate changing her choice not only there but possibly also at subsequent information sets of hers, and a strictly local one, according to which at an information set a player contemplates changing her choice only there. If the definition of perfect Bayesian equilibrium (Definition 4 below) is modified by using the strictly local definition of sequential rationality, then an extra condition needs to be added, namely the “pre-consistency” condition identified in [,] as being necessary and sufficient for the equivalence of the two notions. For simplicity we have chosen the weakly local definition.
- 7Rounded rectangles represent information sets and the payoffs are listed in the following order: Player 1’s payoff at the top, Player 2’s payoff in the middle and Player 3’s payoff at the bottom.
- 8We use the following convention to represent a total pre-order: if the row to which history h belongs is above the row to which belongs, then (h is more plausible than ) and if h and belong to the same row then (h is as plausible as ). ∅ denotes the empty history, which corresponds to the root of the tree. In (1) the plausibility-preserving actions are d, e and g; the most plausible histories in the information set are b and c and the two histories in the information set are equally plausible.
- 9Given σ, for Player 1 d yields a payoff of 2 while a and c yield 1 and b yields 2; thus d is sequentially rational. Given σ and μ, at her information set with e Player 2 obtains an expected payoff of 4 while with f her expected payoff is 3; thus e is sequentially rational. Given μ, at his information set , Player 3’s expected payoff from playing with g is 1.5 while his expected payoff from playing with k is 1; thus g is sequentially rational.
- 10Note that if and , then , for all . In fact, since , every action is plausibility preserving and therefore, by Property of Definition 2, .
- 11For an interpretation of the probabilities see [].
- 12That is, for every , (where is the set of actions that occur in history h). Since is completely mixed, for every and thus for all
- 13Since H is finite, there is an such that is a partition of H and, for every , with , and for every , if and then .
- 14For example, [] adopts this interpretation.
- 15For such an interpretation see [].
- 16Note, however, that is compatible with the following: (with ) and (with ).
- 17We have that (1) , and , (2) , and , (3) , and .
- 18That is satisfied was shown in Footnote 17. is violated because and but .
- 19In fact, (1) and for every ; (2) and for every ; (3) and for every ; (4) and for every ; (5) and for every ; and (6) and for every .
- 20This is easily verified: the important observation is that and and . The other comparisons involve a plausibility-preserving action versus a non-plausibility-preserving action and thus is trivially satisfied.
- 21As uniform full support common prior one can take, for example, the uniform distribution over the set of decision histories. Note that, for every equivalence class E of the order, is either empty or a singleton.
- 22To prove that is not a sequential equilibrium it is not sufficient to show that plausibility order (5) is not choice measurable, because there could be another plausibility order which is choice measurable and rationalizes .
- 23As in Definition 5, let , and, for every integer , . The canonical ordinal integer-valued representation of ≾, , is given by if and only if
- 24Thus for every , , , etc.
- 25For example, ≐ is the smallest reflexive, symmetric and transitive relation that contains the pairs given in (14).
- 26The system of linear equations of Definition 10 is somewhat related to the system of multiplicative equations considered in [] (Theorem 5.1). A direct comparison is beyond the scope of this paper and is not straightforward, because the structures considered in Definition 10 are more general than those considered in [].
- 27By symmetry of ≐, we can express the third and fourth constraints as and instead of and , respectively.
- 28The main element of the notion of PBE put forward in [] is the “no signaling what you don’t know” condition on beliefs. For example, if Player 2 observes Player 1’s action and Player 1 has observed nothing about a particular move of Nature, then Player 2 should not update her beliefs about Nature’s choice based on Player 1’s action.
- 29Intuitively, on consecutive information sets, a player does not change her beliefs about the actions of other players, if she has not received information about those actions.
- 30By “Bayesian updating as long as possible” we mean the following: (1) when information causes no surprises, because the play of the game is consistent with the most plausible play(s) (that is, when information sets are reached that have positive prior probability), then beliefs should be updated using Bayes’ rule; and (2) when information is surprising (that is, when an information set is reached that had zero prior probability) then new beliefs can be formed in an arbitrary way, but from then on Bayes’ rule should be used to update those new beliefs, whenever further information is received that is consistent with those beliefs.
- 31It is straightforward to check that if is an integer-valued representation of ≾ then so is defined by ; furthermore if satisfies property () then so does F.
- 32For example, if and ≾ is given by then and ; if F is given by and then and .
- 33For example, the system of Equation (15) can be written as , where and
- 34See, for example, [] (p. 216) or [] (Theorem 1.1, p. 65).
- 35Proof. Recall that for each row of A there is a row such that . If and for some i and k such that thenwhere all the multipliers (of or ) are positive. Thus one can set one of the two values of and to zero and replace the other value with the relevant of the above values while keeping unchanged. For example, if then one can replace with 0 and with thereby reducing the cardinality of K by one. This process can be repeated until the multipliers of half of the rows of A have been replaced by zero.
- 36Proof. Suppose that for some . Recall that there exists an i such that . By the argument of the previous footnote, . Then replace by 0 and replace by .
- 37For example, if and , , , then B is the matrix where and and , and .
© 2016 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).