Exploring the Gap between Perfect Bayesian Equilibrium and Sequential Equilibrium

Giacomo Bonanno

doi:10.3390/g7040035

Department of Economics, University of California, Davis, CA 95616-8578, USA

I am grateful to two anonymous reviewers for helpful comments and suggestions.

Games 2016, 7(4), 35; https://doi.org/10.3390/g7040035

This article belongs to the Special Issue Epistemic Game Theory and Logic

Abstract

In (Bonanno, 2013), a solution concept for extensive-form games, called perfect Bayesian equilibrium (PBE), was introduced and shown to be a strict refinement of subgame-perfect equilibrium; it was also shown that, in turn, sequential equilibrium (SE) is a strict refinement of PBE. In (Bonanno, 2016), the notion of PBE was used to provide a characterization of SE in terms of a strengthening of the two defining components of PBE (besides sequential rationality), namely AGM consistency and Bayes consistency. In this paper we explore the gap between PBE and SE by identifying solution concepts that lie strictly between PBE and SE; these solution concepts embody a notion of “conservative” belief revision. Furthermore, we provide a method for determining if a plausibility order on the set of histories is choice measurable, which is a necessary condition for a PBE to be a SE.

Keywords:

plausibility order; minimal belief revision; Bayesian updating; independence; sequential equilibrium

1. Introduction

Since its introduction in 1982 [1], sequential equilibrium has been the most widely used solution concept for extensive-form games. In applications, however, checking the “consistency” requirement for beliefs has proved to be rather difficult; thus, similarly motivated—but simpler—notions of equilibrium have been sought. The simplest solution concept is “weak sequential equilibrium” [2,3] which is defined as an assessment that is sequentially rational and satisfies Bayes’ rule at information sets that are reached with positive probability by the strategy profile (while no restrictions are imposed on the beliefs at information sets that have zero probability of being reached). However, this solution concept is too weak in that it is possible for an assessment

(σ, μ)

(where σ is a strategy profile and μ is a system of beliefs) to be a weak sequential equilibrium without σ being a subgame-perfect equilibrium [4]. Hence the search in the literature for a “simple” (yet sufficiently strong) solution concept that lies in the gap between subgame-perfect equilibrium and sequential equilibrium. The minimal desired properties of such a solution concept, which is usually referred to as “perfect Bayesian equilibrium” (PBE), are sequential rationality and the “persistent” application of Bayes’ rule. The exact meaning of the latter requirement has not been easy to formalize.

Several attempts have been made in the literature to provide a satisfactory definition of PBE; they are reviewed in Section 5. In this paper we continue the study of one such notion, introduced in [5], where it is shown that (a) the proposed solution concept is a strict refinement of subgame-perfect equilibrium; and (b) in general, the set of sequential equilibria is a proper subset of the set of perfect Bayesian equilibria. This definition of PBE is based on two notions (besides sequential rationality): (1) the qualitative property of AGM-consistency relative to a plausibility order1; and (2) the quantitative property of Bayes consistency. This notion of PBE was further used in [8] to provide a new characterization of sequential equilibrium, in terms of a strengthening of both AGM consistency and Bayes consistency. In this paper we explore the gap between PBE and sequential equilibrium, by identifying solution concepts that lie strictly between PBE and sequential equilibrium. These solution concepts capture the notion of revising one’s beliefs in a “conservative” or “minimal” way.

The paper is organized as follows. Section 2 reviews the notation, definitions and main results of [5,8]. The new material is contained in Section 3 and Section 4. In Section 3 we introduce properties of the plausibility order that can be used to define solution concepts that lie between PBE and sequential equilibrium; the main result in this section is Proposition 2. In Section 4 we offer a method (Proposition 3) for determining whether a plausibility order satisfies the property of “choice measurability”, which is one of the two conditions that, together, are necessary and sufficient for a PBE to be a sequential equilibrium. Section 5 discusses related literature and Section 6 concludes. The proofs are given in Appendix A.

2. Perfect Bayesian Equilibrium and Sequential Equilibrium

In this section we review the notation and the main definitions and results of [5,8].

We adopt the history-based definition of extensive-form game (see, for example, [9]). If A is a set, we denote by

A^{*}

the set of finite sequences in A. If

h = ⟨a_{1}, \dots, a_{k}⟩ \in A^{*}

and

1 \leq j \leq k

, the sequence

h^{'} = ⟨a_{1}, \dots, a_{j}⟩

is called a prefix of h; if

j < k

then we say that

h^{'}

is a proper prefix of h. If

h = ⟨a_{1}, \dots, a_{k}⟩ \in A^{*}

and

a \in A

, we denote the sequence

⟨a_{1}, \dots, a_{k}, a⟩ \in A^{*}

by

h a

.

A finite extensive form is a tuple

⟨A, H, N, ι, {\{\approx_{i}\}}_{i \in N}⟩

whose elements are:

A finite set of actions A.
A finite set of histories $H \subseteq A^{*}$ which is closed under prefixes (that is, if $h \in H$ and $h^{'} \in A^{*}$ is a prefix of h, then $h^{'} \in H$). The null history $⟨⟩$, denoted by ∅, is an element of H and is a prefix of every history. A history $h \in H$ such that, for every $a \in A$, $h a \notin H$, is called a terminal history. The set of terminal histories is denoted by Z. $D = H \setminus Z$ denotes the set of non-terminal or decision histories. For every history $h \in H$, we denote by $A (h)$ the set of actions available at h, that is, $A (h) = \{a \in A : h a \in H\}$. Thus $A (h) \neq \emptyset$ if and only if $h \in D$. We assume that $A = \bigcup_{h \in D} A (h)$ (that is, we restrict attention to actions that are available at some decision history).
A finite set $N = \{1, \dots, n\}$ of players. In some cases there is also an additional, fictitious, player called chance.
A function $ι : D \to N \cup \{\text{chance}\}$ that assigns a player to each decision history. Thus $ι (h)$ is the player who moves at history h. A game is said to be without chance moves if $ι (h) \in N$ for every $h \in D$. For every $i \in N \cup \{\text{chance}\}$, let $D_{i} = ι^{- 1} (i)$ be the set of histories assigned to player i. Thus $\{D_{\text{chance}}, D_{1}, \dots, D_{n}\}$ is a partition of $D$. If history h is assigned to chance, then a probability distribution over $A (h)$ is given that assigns positive probability to every $a \in A (h)$.
For every player $i \in N$, $\approx_{i}$ is an equivalence relation on $D_{i}$. The interpretation of $h \approx_{i} h^{'}$ is that, when choosing an action at history h, player i does not know whether she is moving at h or at $h^{'}$. The equivalence class of $h \in D_{i}$ is denoted by $I_{i} (h)$ and is called an information set of player i; thus $I_{i} (h) = \{h^{'} \in D_{i} : h^{'} \approx_{i} h\}$. The following restriction applies: if $h^{'} \in I_{i} (h)$ then $A (h^{'}) = A (h)$, that is, the set of actions available to a player is the same at any two histories that belong to the same information set of that player.
The following property, known as perfect recall, is assumed: for every player $i \in N$, if $h_{1}, h_{2} \in D_{i}$, $a \in A (h_{1})$ and $h_{1} a$ is a prefix of $h_{2}$ then for every $h^{'} \in I_{i} (h_{2})$ there exists an $h \in I_{i} (h_{1})$ such that $h a$ is a prefix of $h^{'}$. Intuitively, perfect recall requires a player to remember what she knew in the past and what actions she took previously.
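As an illustration of the history-based definitions, the following minimal sketch encodes histories as tuples of action labels and derives the sets Z, D and A(h) from H. The toy set of histories and all function names are hypothetical; they are not taken from any game discussed in this paper.

```python
# Toy history-based extensive form. The set H below is hypothetical.
H = {(), ("a",), ("b",), ("a", "e"), ("a", "f"), ("b", "e"), ("b", "f")}

def prefixes(h):
    """All prefixes of h, including the null history () and h itself."""
    return [h[:j] for j in range(len(h) + 1)]

def is_prefix_closed(histories):
    """True if every prefix of every history is itself a history."""
    return all(p in histories for h in histories for p in prefixes(h))

def available_actions(h, histories):
    """A(h): the actions a such that ha is again a history."""
    return {g[-1] for g in histories if len(g) == len(h) + 1 and g[:-1] == h}

Z = {h for h in H if not available_actions(h, H)}   # terminal histories
D = H - Z                                           # decision histories

print(is_prefix_closed(H))
print(sorted(D))
print(sorted(available_actions(("a",), H)))
```

Representing a history by the sequence of actions leading to it makes prefix closure and the sets Z and D mechanical to compute, which is the main convenience of the history-based formulation.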

Given an extensive form, one obtains an extensive game by adding, for every player

i \in N

, a utility (or payoff) function

U_{i} : Z \to \mathbb{R}

(where

\mathbb{R}

denotes the set of real numbers).

A total pre-order on the set of histories H is a binary relation ≾ which is complete2 and transitive3. We write

h \sim h^{'}

as a short-hand for the conjunction:

h ≾ h^{'}

and

h^{'} ≾ h

, and write

h ≺ h^{'}

as a short-hand for the conjunction:

h ≾ h^{'}

and not

h^{'} ≾ h

.

Definition 1.

Given an extensive form, a plausibility order is a total pre-order ≾ on H that satisfies the following properties:

for every $h \in D$,

PL1. $h ≾ h a, \forall a \in A (h)$,
PL2. (i) $\exists a \in A (h)$ such that $h \sim h a$,
     (ii) $\forall a \in A (h)$, if $h \sim h a$ then, $\forall h^{'} \in I (h)$, $h^{'} \sim h^{'} a$,
PL3. if history h is assigned to chance, then $h \sim h a$, $\forall a \in A (h)$.

The interpretation of

h ≾ h^{'}

is that history h is at least as plausible as history

h^{'}

; thus

h ≺ h^{'}

means that h is more plausible than

h^{'}

and

h \sim h^{'}

means that h is just as plausible as

h^{'}

4. Property

P L 1

says that adding an action to a decision history h cannot yield a more plausible history than h itself. Property

P L 2

says that at every decision history h there is at least one action a which is “plausibility preserving” in the sense that adding a to h yields a history which is as plausible as h; furthermore, any such action a performs the same role with any other history that belongs to the same information set as h. Property

P L 3

says that all the actions at a history assigned to chance are plausibility preserving.

An assessment is a pair

(σ, μ)

where σ is a behavior strategy profile and μ is a system of beliefs5.

Definition 2.

Given an extensive form, an assessment

(σ, μ)

is AGM-consistent if there exists a plausibility order ≾ on the set of histories H such that:

(i): the actions that are assigned positive probability by σ are precisely the plausibility-preserving actions: $\forall h \in D, \forall a \in A (h)$,

$σ (a) > 0 \text{ if and only if } h \sim h a,$

(P1)

(ii): the histories that are assigned positive probability by μ are precisely those that are most plausible within the corresponding information set: $\forall h \in D$,

$μ (h) > 0 \text{ if and only if } h ≾ h^{'}, \forall h^{'} \in I (h).$

(P2)

If ≾ satisfies properties

P 1

and

P 2

with respect to

(σ, μ)

, we say that ≾ rationalizes

(σ, μ)

.

An assessment

(σ, μ)

is sequentially rational if, for every player i and every information set I of hers, player i’s expected payoff—given the strategy profile σ and her beliefs at I (as specified by μ)—cannot be increased by unilaterally changing her choice at I and possibly at information sets of hers that follow I6.

Consider the extensive-form game shown in Figure 1 7 and the assessment

(σ, μ)

where

σ = (d, e, g)

and μ is the following system of beliefs:

μ (a) = 0, μ (b) = \frac{1}{3}, μ (c) = \frac{2}{3}

and

μ (a f) = μ (b f) = \frac{1}{2}

. This assessment is AGM-consistent, since it is rationalized by the following plausibility order8:

\begin{pmatrix}
\text{most plausible} & \emptyset, d \\
 & b, c, b e, c e \\
 & a, a e \\
 & a f, b f, c f, a f g, b f g \\
\text{least plausible} & a f k, b f k
\end{pmatrix}

(1)

Figure 1. An extensive-form game.
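The claim that plausibility order (1) rationalizes the assessment can be checked mechanically. The sketch below encodes the order as ranks and tests Properties PL1 and PL2 of Definition 1 and Properties P1 and P2 of Definition 2. Since the figure is not reproduced here, the information sets are an assumption read off from the beliefs stated above (Player 2 moves at {a, b, c} and Player 3 at {af, bf}); the function names are hypothetical.

```python
# Plausibility order (1) as ranks (0 = most plausible); histories are strings.
classes = [["", "d"], ["b", "c", "be", "ce"], ["a", "ae"],
           ["af", "bf", "cf", "afg", "bfg"], ["afk", "bfk"]]
rho = {h: k for k, cls in enumerate(classes) for h in cls}
H = set(rho)

# ASSUMPTION: information sets inferred from the beliefs given in the text.
info_sets = [[""], ["a", "b", "c"], ["af", "bf"]]
sigma_support = {"d", "e", "g"}          # actions with sigma(a) > 0
mu = {"": 1, "a": 0, "b": 1 / 3, "c": 2 / 3, "af": 1 / 2, "bf": 1 / 2}

def actions(h):
    """A(h): one-character actions a such that ha is a history."""
    return {g[-1] for g in H if len(g) == len(h) + 1 and g.startswith(h)}

def check_PL1():
    return all(rho[h] <= rho[h + a]
               for I in info_sets for h in I for a in actions(h))

def check_PL2():
    # Each information set must have a non-empty set of plausibility-preserving
    # actions, and that set must be the same at every history in it.
    for I in info_sets:
        preserving = [{a for a in actions(h) if rho[h] == rho[h + a]} for h in I]
        if not preserving[0] or any(p != preserving[0] for p in preserving):
            return False
    return True

def check_P1():
    return all((a in sigma_support) == (rho[h] == rho[h + a])
               for I in info_sets for h in I for a in actions(h))

def check_P2():
    return all((mu[h] > 0) == (rho[h] == min(rho[x] for x in I))
               for I in info_sets for h in I)

print(check_PL1(), check_PL2(), check_P1(), check_P2())
```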

Furthermore

(σ, μ)

is sequentially rational9. The property of AGM-consistency imposes restrictions on the support of the behavior strategy σ and on the support of the system of beliefs μ. The following property imposes constraints on how probabilities can be distributed over those supports.

Definition 3.

Given an extensive form, let ≾ be a plausibility order that rationalizes the assessment

(σ, μ)

. We say that

(σ, μ)

is Bayes consistent (or Bayesian) relative to ≾ if, for every equivalence class E of ≾ that contains some decision history h with

μ (h) > 0

[that is,

E \cap D_{μ}^{+} \neq \emptyset

, where

D_{μ}^{+} = \{h \in D : μ (h) > 0\}

], there exists a probability density function

ν_{E} : H \to [0, 1]

(recall that H is a finite set) such that:

B1. $ν_{E} (h) > 0$ if and only if $h \in E \cap D_{μ}^{+}$.
B2. If $h, h^{'} \in E \cap D_{μ}^{+}$ and $h^{'} = h a_{1} \dots a_{m}$ (that is, h is a prefix of $h^{'}$) then $ν_{E} (h^{'}) = ν_{E} (h) \times σ (a_{1}) \times \dots \times σ (a_{m})$.
B3. If $h \in E \cap D_{μ}^{+}$, then, $\forall h^{'} \in I (h)$, $μ (h^{'}) = ν_{E} (h^{'} | I (h)) \overset{def}{=} \frac{ν_{E} (h^{'})}{\sum_{h^{''} \in I (h)} ν_{E} (h^{''})}$.

Property

B 1

requires that

ν_{E} (h) > 0

if and only if

h \in E

and

μ (h) > 0

. Property

B 2

requires

ν_{E}

to be consistent with the strategy profile σ in the sense that if

h, h^{'} \in E

,

μ (h) > 0

,

μ (h^{'}) > 0

and

h^{'} = h a_{1} \dots a_{m}

then the probability that

ν_{E}

assigns to

h^{'}

is equal to the probability that

ν_{E}

assigns to h multiplied by the probabilities (according to σ) of the actions that lead from h to

h^{'}

10. Property

B 3

requires the system of beliefs μ to satisfy Bayes’ rule in the sense that if

h \in E

and

μ (h) > 0

(so that E is the equivalence class of the most plausible elements of

I (h)

) then, for every history

h^{'} \in I (h)

,

μ (h^{'})

(the probability assigned to

h^{'}

by μ) coincides with the probability of

h^{'}

conditional on

I (h)

using the probability density function

ν_{E}

11.

Consider again the game of Figure 1, and the assessment

(σ, μ)

where

σ = (d, e, g)

and

μ (a) = 0, μ (b) = \frac{1}{3}, μ (c) = \frac{2}{3}

and

μ (a f) = μ (b f) = \frac{1}{2}

. Let ≾ be the plausibility order (1) given above, which rationalizes

(σ, μ)

. Then

(σ, μ)

is Bayes consistent relative to ≾. In fact, we have that

D_{μ}^{+} = \{\emptyset, b, c, a f, b f\}

and the equivalence classes of ≾ that have a non-empty intersection with

D_{μ}^{+}

are

E_{1} = \{\emptyset, d\}

,

E_{2} = \{b, c, b e, c e\}

and

E_{3} = \{a f, b f, c f, a f g, b f g\}

. Let

ν_{E_{1}} (\emptyset) = 1

,

ν_{E_{2}} (b) = \frac{1}{3}

,

ν_{E_{2}} (c) = \frac{2}{3}

and

ν_{E_{3}} (a f) = ν_{E_{3}} (b f) = \frac{1}{2}

. Then the three probability density functions

ν_{E_{1}}

,

ν_{E_{2}}

and

ν_{E_{3}}

satisfy the properties of Definition 3 and hence

(σ, μ)

is Bayes consistent relative to ≾.
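The verification just described can be reproduced with exact arithmetic. The sketch below checks Properties B1 and B3 of Definition 3 for the three density functions; Property B2 holds vacuously in this example, because no history in any E ∩ D_μ^+ is a proper prefix of another one. The information sets are an assumption inferred from the stated beliefs, and the function names are hypothetical.

```python
from fractions import Fraction as Fr

mu = {"": Fr(1), "a": Fr(0), "b": Fr(1, 3), "c": Fr(2, 3),
      "af": Fr(1, 2), "bf": Fr(1, 2)}
# ASSUMPTION: information sets inferred from the beliefs given in the text.
info_sets = [[""], ["a", "b", "c"], ["af", "bf"]]
classes = {"E1": ["", "d"],
           "E2": ["b", "c", "be", "ce"],
           "E3": ["af", "bf", "cf", "afg", "bfg"]}
# The density functions from the text; histories not listed get probability 0.
nu = {"E1": {"": Fr(1)},
      "E2": {"b": Fr(1, 3), "c": Fr(2, 3)},
      "E3": {"af": Fr(1, 2), "bf": Fr(1, 2)}}

D_plus = {h for h, p in mu.items() if p > 0}

def check_B1(E):
    """The support of nu_E is exactly E intersected with D_plus."""
    return set(nu[E]) == set(classes[E]) & D_plus and all(p > 0 for p in nu[E].values())

def check_B3(E):
    """mu coincides with nu_E conditioned on the relevant information set."""
    for h in set(classes[E]) & D_plus:
        I = next(s for s in info_sets if h in s)
        total = sum(nu[E].get(x, Fr(0)) for x in I)
        if any(mu[x] != nu[E].get(x, Fr(0)) / total for x in I):
            return False
    return True

print(all(check_B1(E) and check_B3(E) for E in classes))
```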

Definition 4.

An assessment

(σ, μ)

is a perfect Bayesian equilibrium (PBE) if it is sequentially rational, it is rationalized by a plausibility order on the set of histories and is Bayes consistent relative to it.

We saw above that, for the game illustrated in Figure 1, the assessment

(σ, μ)

where

σ = (d, e, g)

and

μ (a) = 0, μ (b) = \frac{1}{3}, μ (c) = \frac{2}{3}

and

μ (a f) = μ (b f) = \frac{1}{2}

is sequentially rational, it is rationalized by the plausibility order (1) and is Bayes consistent relative to it. Thus it is a perfect Bayesian equilibrium.

Remark 1.

It is proved in [5] that if

(σ, μ)

is a perfect Bayesian equilibrium then σ is a subgame-perfect equilibrium and that every sequential equilibrium is a perfect Bayesian equilibrium. Furthermore, the notion of PBE is a strict refinement of subgame-perfect equilibrium and sequential equilibrium is a strict refinement of PBE.

Next we recall the definition of sequential equilibrium [1]. An assessment

(σ, μ)

is KW-consistent (KW stands for ‘Kreps-Wilson’) if there is an infinite sequence

⟨σ^{1}, \dots, σ^{m}, \dots⟩

of completely mixed behavior strategy profiles such that, letting

μ^{m}

be the unique system of beliefs obtained from

σ^{m}

by applying Bayes’ rule12,

\lim_{m \to \infty} (σ^{m}, μ^{m}) = (σ, μ)

. An assessment

(σ, μ)

is a sequential equilibrium if it is KW-consistent and sequentially rational. In [8] it is shown that sequential equilibrium can be characterized as a strengthening of PBE based on two properties: (1) a property of the plausibility order that constrains the supports of the belief system; and (2) a strengthening of the notion of Bayes consistency, that imposes constraints on how the probabilities can be distributed over those supports. The details are given below.
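To see how KW-consistency constrains beliefs across information sets, consider again the assessment discussed above for the game of Figure 1. The sketch below is only an illustration under stated assumptions: the information sets ({a, b, c} for Player 2 and {af, bf} for Player 3) are inferred from the stated beliefs, and the particular sequence of completely mixed strategies is a hypothetical choice. Because Player 2 must choose f with the same probability at a and at b, Bayes' rule forces the ratio μ^m(af)/μ^m(bf) to equal μ^m(a)/μ^m(b) for every m, so beliefs that converge to μ(a) = 0 and μ(b) = 1/3 cannot also converge to μ(af) = μ(bf) = 1/2.

```python
from fractions import Fraction as Fr

def beliefs(m):
    """Beliefs obtained by Bayes' rule from a hypothetical completely mixed profile (m >= 4)."""
    # Player 1's mixed choice at the root (hypothetical sequence).
    p = {"a": Fr(1, m * m), "b": Fr(1, m), "c": Fr(2, m)}
    p["d"] = 1 - sum(p.values())
    assert all(x > 0 for x in p.values()), "strategy must be completely mixed"
    q_f = Fr(1, m)   # Player 2's probability of f, identical at a, b and c
    # Beliefs at Player 2's information set {a, b, c}.
    total = p["a"] + p["b"] + p["c"]
    mu = {h: p[h] / total for h in "abc"}
    # Beliefs at Player 3's information set {af, bf}.
    reach = {h + "f": p[h] * q_f for h in "ab"}
    total_f = sum(reach.values())
    mu.update({h: r / total_f for h, r in reach.items()})
    return mu

for m in (10, 100, 1000):
    b = beliefs(m)
    print(m, float(b["a"]), float(b["b"]), float(b["af"]))
```

Along this sequence μ^m(a) tends to 0 and μ^m(b) tends to 1/3, while μ^m(af) tends to 0 rather than 1/2; the ratio identity holds for any completely mixed profile, not only for the sequence chosen here.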

Given a plausibility order ≾ on the finite set of histories H, a function

F : H \to \mathbb{N}

(where

\mathbb{N}

denotes the set of non-negative integers) is said to be an ordinal integer-valued representation of ≾ if, for every

h, h^{'} \in H

,

F (h) \leq F (h^{'}) \text{ if and only if } h ≾ h^{'}.

(2)

Since H is finite, the set of ordinal integer-valued representations is non-empty. A particular ordinal integer-valued representation, which we will call canonical and denote by ρ, is defined as follows.

Definition 5.

Let

H_{0} = \{h \in H : h ≾ x, \forall x \in H\}

,

H_{1} = \{h \in H \setminus H_{0} : h ≾ x, \forall x \in H \setminus H_{0}\}

and, in general, for every integer

k \geq 1

,

H_{k} = \{h \in H \setminus (H_{0} \cup \dots \cup H_{k - 1}) : h ≾ x, \forall x \in H \setminus (H_{0} \cup \dots \cup H_{k - 1})\}

. Thus

H_{0}

is the equivalence class of ≾ containing the most plausible histories,

H_{1}

is the equivalence class containing the most plausible among the histories left after removing those in

H_{0}

, etc.13 The canonical ordinal integer-valued representation of ≾,

ρ : H \to \mathbb{N}

, is given by

ρ (h) = k \text{ if and only if } h \in H_{k}.

(3)

We call

ρ (h)

the rank of history

h .
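The canonical representation can be computed from any ordinal integer-valued representation by "dense ranking" its values, as in the following minimal sketch. The function name and the sample representation F are hypothetical and serve only to illustrate Definition 5.

```python
def canonical_rank(F):
    """Map each history to k, where its F-value is the k-th smallest distinct value.

    F is any ordinal integer-valued representation, given as a dict
    from histories to non-negative integers.
    """
    levels = sorted(set(F.values()))
    index = {value: k for k, value in enumerate(levels)}
    return {h: index[value] for h, value in F.items()}

# A hypothetical representation with gaps between its values.
F = {"": 0, "d": 0, "b": 3, "c": 3, "a": 10, "af": 25, "bf": 25}
rho = canonical_rank(F)
print(rho)
```

Here the histories with the smallest value form H_0, those with the next value form H_1, and so on, so that ρ uses the integers 0, 1, 2, ... without gaps.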

Instead of an ordinal integer-valued representation of the plausibility order one could seek a cardinal representation which, besides (2), satisfies the following property: if h and

h^{'}

belong to the same information set (that is,

h^{'} \in I (h)

) and

a \in A (h)

, then

F (h^{'}) - F (h) = F (h^{'} a) - F (h a) .

(CM)

If we think of F as measuring the “plausibility distance” between histories, then we can interpret

C M

as a distance-preserving condition: the plausibility distance between two histories in the same information set is preserved by the addition of the same action.

Definition 6.

A plausibility order ≾ on the set of histories H is choice measurable if it has at least one integer-valued representation that satisfies property

C M

.

For example, the plausibility order (1) is not choice measurable, since any integer-valued representation F of it must be such that

F (a) - F (b) > 0

and

F (a f) - F (b f) = 0

.
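Property CM can be tested for a given representation by direct enumeration. The sketch below applies such a test to the canonical representation of plausibility order (1) and lists the triples (h, h', action) at which CM fails. It checks one representation only; that no representation of order (1) satisfies CM follows from the argument in the text, since F(a) − F(b) > 0 and F(af) − F(bf) = 0 hold for every representation. The information sets are an assumption inferred from the beliefs stated earlier, and the function name is hypothetical.

```python
# Canonical ranks of plausibility order (1); histories are strings.
classes = [["", "d"], ["b", "c", "be", "ce"], ["a", "ae"],
           ["af", "bf", "cf", "afg", "bfg"], ["afk", "bfk"]]
rho = {h: k for k, cls in enumerate(classes) for h in cls}

# ASSUMPTION: non-singleton information sets inferred from the stated beliefs.
info_sets = [["a", "b", "c"], ["af", "bf"]]

def cm_violations(F, info_sets):
    """Triples (h, h2, a) with F(h2) - F(h) != F(h2 a) - F(h a)."""
    out = set()
    for I in info_sets:
        for h in I:
            # one-step continuations of h give the actions available at h
            acts = [g[-1] for g in F if len(g) == len(h) + 1 and g.startswith(h)]
            for h2 in I:
                for a in acts:
                    if F[h2] - F[h] != F[h2 + a] - F[h + a]:
                        out.add((h, h2, a))
    return out

print(sorted(cm_violations(rho, info_sets)))
```

All violations involve action f at Player 2's information set: histories a and b (and likewise a and c) differ in plausibility, whereas af, bf and cf do not.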

Let

(σ, μ)

be an assessment which is rationalized by a plausibility order ≾. As before, let

D_{μ}^{+}

be the set of decision histories to which μ assigns positive probability:

D_{μ}^{+} = \{h \in D : μ (h) > 0\}

. Let

E_{μ}^{+}

be the set of equivalence classes of ≾ that have a non-empty intersection with

D_{μ}^{+}

. Clearly

E_{μ}^{+}

is a non-empty, finite set. Suppose that

(σ, μ)

is Bayesian relative to ≾ and let

{\{ν_{E}\}}_{E \in E_{μ}^{+}}

be a collection of probability density functions that satisfy the properties of Definition 3. We call a probability density function

ν : D \to (0, 1]

a full-support common prior of

{\{ν_{E}\}}_{E \in E_{μ}^{+}}

if, for every

E \in E_{μ}^{+}

,

ν_{E} (\cdot) = ν (\cdot | E \cap D_{μ}^{+})

, that is, for all

h \in E \cap D_{μ}^{+}

,

ν_{E} (h) = \frac{ν (h)}{\sum_{h^{'} \in E \cap D_{μ}^{+}} ν (h^{'})}

. Note that a full support common prior assigns positive probability to all decision histories, not only to those in

D_{μ}^{+}

.

Definition 7.

Consider an extensive form. Let

(σ, μ)

be an assessment which is rationalized by the plausibility order ≾ and is Bayesian relative to it and let

{\{ν_{E}\}}_{E \in E_{μ}^{+}}

be a collection of probability density functions that satisfy the properties of Definition 3. We say that

(σ, μ)

is uniformly Bayesian relative to ≾ if there exists a full-support common prior

ν : D \to (0, 1]

of

{\{ν_{E}\}}_{E \in E_{μ}^{+}}

that satisfies the following properties.

UB1. If $a \in A (h)$ and $h a \in D$, then (i) $ν (h a) \leq ν (h)$ and, (ii) if $σ (a) > 0$ then $ν (h a) = ν (h) \times σ (a)$.
UB2. If $a \in A (h)$, h and $h^{'}$ belong to the same information set and $h a, h^{'} a \in D$ then $\frac{ν (h)}{ν (h^{'})} = \frac{ν (h a)}{ν (h^{'} a)}$.

We call such a function ν a uniform full-support common prior of

{\{ν_{E}\}}_{E \in E_{μ}^{+}}

.

U B 1

requires that the common prior ν be consistent with the strategy profile σ, in the sense that if

σ (a) > 0

then

ν (h a) = ν (h) \times σ (a)

(thus extending Property

B 2

of Definition 3 from

D_{μ}^{+}

to D).

U B 2

requires that the relative probability, according to the common prior ν, of any two histories that belong to the same information set remain unchanged by the addition of the same action.

It is shown in [8] that choice measurability and uniform Bayesian consistency are independent properties. The following proposition is proved in [8].

Proposition 1.

(I) and (II) below are equivalent:

(I): $(σ, μ)$ is a perfect Bayesian equilibrium which is rationalized by a choice measurable plausibility order and is uniformly Bayesian relative to it.
(II): $(σ, μ)$ is a sequential equilibrium.

3. Exploring the Gap between PBE and Sequential Equilibrium

The notion of perfect Bayesian equilibrium (Definition 4) incorporates—through the property of AGM-consistency—a belief revision policy which can be interpreted either as the epistemic state of an external observer14 or as a belief revision policy which is shared by all the players15. For example, the perfect Bayesian equilibrium considered in Section 2 for the game of Figure 1, namely

σ = (d, e, g)

and

μ (a) = 0, μ (b) = \frac{1}{3}, μ (c) = \frac{2}{3}

,

μ (a f) = μ (b f) = \frac{1}{2}

reflects the following belief revision policy: the initial beliefs are that Player 1 will play d; conditional on learning that Player 1 did not play d, the observer would become convinced that Player 1 played either b or c (that is, she would judge a to be less plausible than b and she would consider c to be as plausible as b) and would expect Player 2 to play e; upon learning that (Player 1 did not play d and) Player 2 played f, the observer would become convinced that Player 1 played either a or b, hence judging

a f

to be as plausible as

b f

, thereby modifying her earlier judgment that a was less plausible than b. Although such a belief revision policy does not violate the rationality constraints introduced in [7], it does involve a belief change that is not “minimal” or “conservative”. Such “non-minimal” belief changes can be ruled out by imposing the following restriction on the plausibility order: if h and

h^{'}

belong to the same information set (that is,

h^{'} \in I (h)

) and a is an action available at h (

a \in A (h)

), then

h ≾ h^{'} \text{ if and only if } h a ≾ h^{'} a.

(IND₁)

I N D_{1}

says that if h is deemed to be at least as plausible as

h^{'}

then the addition of any available action a must preserve this judgment, in the sense that

h a

must be deemed to be at least as plausible as

h^{'} a

, and vice versa; it can also be viewed as an “independence” condition, in the sense that observation of a new action cannot lead to a change in the relative plausibility of previous histories16. Any plausibility order that rationalizes the assessment

σ = (d, e, g)

and

μ (a) = 0, μ (b) = \frac{1}{3}, μ (c) = \frac{2}{3}

,

μ (a f) = μ (b f) = \frac{1}{2}

for the game of Figure 1 must violate

I N D_{1}

(since

b ≺ a

while

b f \sim a f

).

We can obtain a strengthening of the notion of perfect Bayesian equilibrium (Definition 4) by (1) adding property

I N D_{1}

; and (2) strengthening Bayes consistency (Definition 3) to uniform Bayesian consistency (Definition 7).

Definition 8.

Given an extensive-form game, an assessment (σ,μ) is a weakly independent perfect Bayesian equilibrium if it is sequentially rational, it is rationalized by a plausibility order that satisfies

I N D_{1}

and is uniformly Bayesian relative to that plausibility order.

As an example of a weakly independent PBE consider the game of Figure 2 and the assessment (σ,μ) where

σ = (c, d, g, ℓ)

(highlighted by double edges in Figure 2) and

μ (b) = μ (a e) = μ (b f) = 1

(thus

μ (a) = μ (a f) = μ (b e) = 0

) (the decision histories x such that

μ (x) > 0

are shown as black nodes and the decision histories x such that

μ (x) = 0

are shown as gray nodes). This assessment is sequentially rational and is rationalized by the following plausibility order:

\begin{pmatrix}
\text{most plausible} & \emptyset, c \\
 & b, b d \\
 & a, a d \\
 & b f, b f ℓ \\
 & b e, b e ℓ \\
 & a e, a e g \\
 & a f, a f g \\
 & b f m \\
 & b e m \\
 & a e k \\
\text{least plausible} & a f k
\end{pmatrix}

(4)

Figure 2.

It is straightforward to check that plausibility order (4) satisfies

I N D_{1}

17. To see that (σ,μ) is uniformly Bayesian relative to plausibility order (4), note that

D_{μ}^{+} = \{\emptyset, b, a e, b f\}

and thus the only equivalence classes that have a non-empty intersection with

D_{μ}^{+}

are

E_{1} = \{\emptyset, c\}

,

E_{2} = \{b, b d\}

,

E_{3} = \{a e, a e g\}

and

E_{4} = \{b f, b f ℓ\}

. Letting

ν_{E_{1}} (\emptyset) = 1

,

ν_{E_{2}} (b) = 1

,

ν_{E_{3}} (a e) = 1

and

ν_{E_{4}} (b f) = 1

, this collection of probability distributions satisfies the Properties of Definition 3. Let ν be the uniform distribution over the set of decision histories

D = \{\emptyset, a, b, a e, a f, b e, b f\}

(thus

ν (h) = \frac{1}{7}

for every

h \in D

). Then ν is a full support common prior of the collection

\{ν_{E_{i}}\}_{i \in \{1, 2, 3, 4\}}

and satisfies Properties

U B 1

and

U B 2

of Definition 7.
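The claims about the uniform prior can be verified with exact arithmetic. The sketch below checks that ν is a common prior of the four density functions and that it satisfies UB1 and UB2. The information sets of Figure 2 are an assumption inferred from the text ({a, b}, {ae, af} and {be, bf}), and the function names are hypothetical. Note that UB1(ii) holds vacuously here, since every action chosen with positive probability leads to a terminal history.

```python
from fractions import Fraction as Fr

D = ["", "a", "b", "ae", "af", "be", "bf"]           # decision histories
nu = {h: Fr(1, 7) for h in D}                        # uniform prior from the text
sigma = {"c": 1, "d": 1, "g": 1, "ℓ": 1}             # unlisted actions have probability 0
# ASSUMPTION: information sets inferred from the discussion of Figure 2.
info_sets = [[""], ["a", "b"], ["ae", "af"], ["be", "bf"]]
# Density functions from the text, restricted to their supports.
nu_E = {"E1": {"": Fr(1)}, "E2": {"b": Fr(1)},
        "E3": {"ae": Fr(1)}, "E4": {"bf": Fr(1)}}

def successors(h):
    """Pairs (a, ha) such that ha is a decision history extending h by one action."""
    return [(g[-1], g) for g in D if len(g) == len(h) + 1 and g.startswith(h)]

def is_common_prior():
    """nu_E equals nu conditioned on the support of nu_E, for every class E."""
    return all(p == nu[h] / sum(nu[x] for x in dist)
               for dist in nu_E.values() for h, p in dist.items())

def check_UB1():
    return all(nu[ha] <= nu[h]
               and (sigma.get(a, 0) == 0 or nu[ha] == nu[h] * sigma[a])
               for h in D for a, ha in successors(h))

def check_UB2():
    return all(nu[h] / nu[h2] == nu[ha] / nu[h2 + a]
               for I in info_sets for h in I for h2 in I
               for a, ha in successors(h) if h2 + a in D)

print(is_common_prior(), check_UB1(), check_UB2())
```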

Note, however, that (σ,μ) is not a sequential equilibrium. This can be established by showing that (σ,μ) is not KW-consistent; however, we will show it by appealing to the following lemma (proved in Appendix A) which highlights a property that will motivate a further restriction on belief revision (property

I N D_{2}

below).

Lemma 1.

Let ≾ be a plausibility order over the set H of histories of an extensive-form game and let

F : H \to \mathbb{N}

be an integer-valued representation of ≾ (that is, for all

h, h^{'} \in H

,

F (h) \leq F (h^{'})

if and only if

h ≾ h^{'}

). Then the following are equivalent:

(A): F satisfies Property $C M$ (Definition 6).
(B): F satisfies the following property: for all $h, h^{'} \in H$ and $a, b \in A (h)$, if $h^{'} \in I (h)$ then

$F (h b) - F (h a) = F (h^{'} b) - F (h^{'} a).$

(CM′)

Using Lemma 1 we can prove that the assessment (σ,μ) where

σ = (c, d, g, ℓ)

and

μ (b) = μ (a e) = μ (b f) = 1

, for the game of Figure 2, is not a sequential equilibrium. By Proposition 1 it will be sufficient to show that (σ,μ) cannot be rationalized by a choice measurable plausibility order (Definition 6). Let ≾ be a plausibility order that rationalizes (σ,μ) and let F be an integer-valued representation of ≾. Then, by (P2) of Definition 2, it must be that

a e ≺ a f

(because

μ (a e) > 0

and

μ (a f) = 0

) and

b f ≺ b e

(because

μ (b f) > 0

and

μ (b e) = 0

); thus

F (a e) - F (a f) < 0

and

F (b e) - F (b f) > 0

, so that F violates property

C M^{'}

; hence, by Lemma 1, F violates property

C M

and thus ≾ is not choice measurable.

The ordinal counterpart to Property

C M^{'}

is Property

I N D_{2}

below, which can be viewed as another “independence” condition: it says that if action a is implicitly judged to be at least as plausible as action b, conditional on history h (that is,

h a ≾ h b

), then the same judgment must be made conditional on any other history that belongs to the same information set as h: if

h^{'} \in I (h)

and

a, b \in A (h)

, then

h a ≾ h b \text{ if and only if } h^{'} a ≾ h^{'} b.

(IND₂)

Note that Properties

I N D_{1}

and

I N D_{2}

are independent. An example of a plausibility order that violates

I N D_{1}

but satisfies

I N D_{2}

is plausibility order (1) for the game of Figure 1:

I N D_{1}

is violated because

b ≺ a

but

b f \sim a f

and

I N D_{2}

is satisfied because at every non-singleton information set there are only two choices, one of which is plausibility preserving and the other is not. An example of a plausibility order that satisfies

I N D_{1}

but violates

I N D_{2}

is plausibility order (4) for the game of Figure 2 18. Adding Property

I N D_{2}

to the properties given in Definition 8 we obtain a refinement of the notion of weakly independent perfect Bayesian equilibrium.
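The independence of the two properties can be confirmed by enumeration. The following sketch tests IND₁ and IND₂ on plausibility orders (1) and (4), encoded as ranks. The non-singleton information sets of the two games are assumptions inferred from the text, since the figures are not reproduced here, and the function names are hypothetical.

```python
def rank_map(classes):
    """Turn a list of equivalence classes (most plausible first) into ranks."""
    return {h: k for k, cls in enumerate(classes) for h in cls}

def actions(h, rho):
    """One-character actions a such that ha is a history."""
    return sorted(g[-1] for g in rho if len(g) == len(h) + 1 and g.startswith(h))

def satisfies_IND1(rho, info_sets):
    return all((rho[h] <= rho[h2]) == (rho[h + a] <= rho[h2 + a])
               for I in info_sets for h in I for h2 in I
               for a in actions(h, rho))

def satisfies_IND2(rho, info_sets):
    return all((rho[h + a] <= rho[h + b]) == (rho[h2 + a] <= rho[h2 + b])
               for I in info_sets for h in I for h2 in I
               for a in actions(h, rho) for b in actions(h, rho))

# Plausibility order (1), game of Figure 1 (information sets are an ASSUMPTION).
order1 = rank_map([["", "d"], ["b", "c", "be", "ce"], ["a", "ae"],
                   ["af", "bf", "cf", "afg", "bfg"], ["afk", "bfk"]])
sets1 = [["a", "b", "c"], ["af", "bf"]]

# Plausibility order (4), game of Figure 2 (information sets are an ASSUMPTION).
order4 = rank_map([["", "c"], ["b", "bd"], ["a", "ad"], ["bf", "bfℓ"],
                   ["be", "beℓ"], ["ae", "aeg"], ["af", "afg"],
                   ["bfm"], ["bem"], ["aek"], ["afk"]])
sets4 = [["a", "b"], ["ae", "af"], ["be", "bf"]]

print(satisfies_IND1(order1, sets1), satisfies_IND2(order1, sets1))
print(satisfies_IND1(order4, sets4), satisfies_IND2(order4, sets4))
```

Under these assumptions the check reports that order (1) violates IND₁ but satisfies IND₂, while order (4) satisfies IND₁ but violates IND₂, in agreement with the text.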

Definition 9.

Given an extensive-form game, an assessment (σ,μ) is a strongly independent perfect Bayesian equilibrium if it is sequentially rational, it is rationalized by a plausibility order that satisfies Properties

I N D_{1}

and

I N D_{2}

, and is uniformly Bayesian relative to that plausibility order.

The following proposition states that the notions of weakly/strongly independent PBE identify two (nested) solution concepts that lie strictly in the gap between PBE and sequential equilibrium. The proof of the first part of Proposition 2 is given in Appendix A, while the example of Figure 3 establishes the second part.

Figure 3.

Proposition 2.

Consider an extensive-form game and an assessment (σ,μ). If (σ,μ) is a sequential equilibrium then it is a strongly independent perfect Bayesian equilibrium (PBE). Furthermore, there are games where there is a strongly independent PBE which is not a sequential equilibrium.

To see that the notion of strongly independent PBE is weaker than sequential equilibrium, consider the game of Figure 3 (which is based on an example discussed in [12,13,14]) and the assessment

(σ, μ)

where

σ = (M, ℓ, a, c, e)

(highlighted by double edges),

μ (x) = 1

for

x \in \{\emptyset, M, M r, L m, R ℓ\}

and

μ (x) = 0

for every other decision history x (the decision histories x such that

μ (x) > 0

are shown as black nodes and the decision histories x such that

μ (x) = 0

are shown as gray nodes). This assessment is rationalized by the following plausibility order:

\begin{pmatrix}
\text{most plausible} & \emptyset, M, M ℓ \\
 & R, R ℓ, R ℓ e \\
 & M m, M m e \\
 & M r, M r a \\
 & L, L ℓ, L ℓ a \\
 & R m \\
 & L m, L m c \\
 & R r, R r c \\
 & L r \\
 & R ℓ f \\
 & M m f \\
 & L m d \\
 & R r d \\
 & M r b \\
\text{least plausible} & L ℓ b
\end{pmatrix}

(5)

It is straightforward to check that plausibility order (5) satisfies Properties

I N D_{1}

19 and

I N D_{2}

20. Furthermore (σ,μ) is trivially uniformly Bayesian relative to plausibility order (5)21. Thus (σ,μ) is a strongly independent PBE. Next we show that (σ,μ) is not a sequential equilibrium, by appealing to Proposition 1 and showing that any plausibility order that rationalizes (σ,μ) is not choice measurable22. Let ≾ be a plausibility order that rationalizes

(σ, μ)

; then it must satisfy the following properties:

$L m ≺ R r$ (because they belong to the same information set and $μ (L m) > 0$ while $μ (R r) = 0$ ). Thus if F is any integer-valued representation of ≾ it must be that

$F (L m) < F (R r) .$

(6)
$M r ≺ L ℓ \sim L$ (because $M r$ and $L ℓ$ belong to the same information set and $μ (M r) > 0$ while $μ (L ℓ) = 0$ ; furthermore, ℓ is a plausibility-preserving action since $σ (ℓ) > 0$ ). Thus if F is any integer-valued representation of ≾ it must be that

$F (M r) < F (L) .$

(7)
$R \sim R ℓ ≺ M m$ (because ℓ is a plausibility-preserving action, $R ℓ$ and $M m$ belong to the same information set and $μ (R ℓ) > 0$ while $μ (M m) = 0$ ). Thus if F is any integer-valued representation of ≾ it must be that

$F (R) < F (M m) .$

(8)

Suppose that ≾ is choice measurable and let F be an integer-valued representation of it that satisfies Property

C M

. From (6) and (7) we get that

F (L m) - F (L) < F (R r) - F (M r)

(9)

and by Property

C M

it must be that

F (R r) - F (M r) = F (R) - F (M) .

(10)

It follows from (9) and (10) that

F (L m) - F (L) < F (R) - F (M) .

(11)

Subtracting

F (M)

from both sides of (8) we obtain

F (R) - F (M) < F (M m) - F (M) .

(12)

It follows from (11) and (12) that

F (L m) - F (L) < F (M m) - F (M)

, which can be written as

F (M) - F (L) < F (M m) - F (L m)

, yielding a contradiction, because Property

C M

requires that

F (M) - F (L) = F (M m) - F (L m)

.

Are the notions of weakly/strongly independent PBE “better” or “more natural” than the basic notion of PBE? This issue will be discussed briefly in Section 6.

4. How to Determine if a Plausibility Order Is Choice Measurable

In this section we provide a method for determining if a plausibility order is choice measurable. More generally, we provide a necessary and sufficient condition that applies not only to plausibility orders over sets of histories in a game but to a more general class of structures.

Let S be an arbitrary finite set and let ≾ be a total pre-order on S. Let

S /_{\sim}

be the set of ≾-equivalence classes of S. If

s \in S,

the equivalence class of s is denoted by

[s] = {t \in S : s \sim t}

(where, as before,

s \sim t

is a short-hand for “

s ≾ t

and

t ≾ s

”); thus

S /_{\sim} = {[s] : s \in S}

. Let ≐ be an equivalence relation on

S /_{\sim} \times S /_{\sim}

. The interpretation of

([s_{1}], [s_{2}]) ≐ ([t_{1}], [t_{2}])

is that the distance between the equivalence classes

[s_{1}]

and

[s_{2}]

is required to be equal to the distance between the equivalence classes

[t_{1}]

and

[t_{2}]

.

Remark 2.

In the special case of a plausibility order ≾ on the set of histories H of a game, we shall be interested in the following equivalence relation ≐ on

H /_{\sim} \times H /_{\sim}

, which is meant to capture property

C M

above: if

E_{1}

,

E_{2}

,

F_{1}

and

F_{2}

are equivalence classes of ≾ then

(E_{1}, E_{2}) ≐ (F_{1}, F_{2})

if and only if there exist two decision histories

h, h^{'} \in H

that belong to the same information set [

h^{'} \in I (h)

] and a non-plausibility-preserving action

a \in A (h)

such that

h \in E_{1}, h^{'} \in E_{2}, h a \in F_{1}

and

h^{'} a \in F_{2}

(or

h a \in E_{1}, h^{'} a \in E_{2}, h \in F_{1}

and

h^{'} \in F_{2}

).

The general problem that we are addressing is the following.

Problem 1.

Given a pair

(≾, ≐)

, where ≾ is a total pre-order on a finite set S and ≐ is an equivalence relation on the set of pairs of equivalence classes of ≾, determine whether there exists a function

F : S \to N

such that, for all

s, t, x, y \in S

, (1)

F (s) \leq F (t)

if and only if

s ≾ t

and (2) if

([s], [t]) ≐ ([x], [y])

, with

s ≺ t

and

x ≺ y

, then

F (t) - F (s) = F (y) - F (x)

.

Instead of expressing the equivalence relation ≐ in terms of pairs of elements of

S /_{\sim}

, we shall express it in terms of pairs of numbers

(j, k)

obtained by using the canonical ordinal representation ρ of ≾23. That is, if

s_{1}, s_{2}, t_{1}, t_{2} \in S

and

([s_{1}], [s_{2}]) ≐ ([t_{1}], [t_{2}])

then we shall write this as

(ρ (s_{1}), ρ (s_{2})) ≐ (ρ (t_{1}), ρ (t_{2}))

. For example, let

S = {a, b, c, d, e, f, g, h, ℓ, m}

and let ≾ be as shown in (13) below, together with the corresponding canonical representation ρ24:

(\begin{matrix} ≾ : & ρ : \\ a & 0 \\ b, c & 1 \\ d & 2 \\ e & 3 \\ f, g & 4 \\ h, ℓ & 5 \\ m & 6 \end{matrix})

(13)

If the equivalence relation ≐ contains the following pairs25:

\begin{matrix} ([a], [b]) ≐ ([h], [m]) \\ ([a], [b]) ≐ ([e], [f]) \\ ([a], [d]) ≐ ([f], [m]) \\ ([b], [e]) ≐ ([e], [f]) \\ ([b], [e]) ≐ ([f], [m]) \end{matrix}

then we express them (using ρ) as

\begin{matrix} (0, 1) ≐ (5, 6) \\ (0, 1) ≐ (3, 4) \\ (0, 2) ≐ (4, 6) \\ (1, 3) ≐ (3, 4) \\ (1, 3) ≐ (4, 6) \end{matrix}

(14)
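The translation from pairs of equivalence classes to pairs of numbers can be sketched in a few lines of Python. This is only an illustration: the names `canonical_rho` and `translate` are hypothetical, the order (13) is entered by hand as a list of equivalence classes (most plausible first), and the element ℓ is written as `"l"`.

```python
# Order (13): equivalence classes listed from most plausible to least plausible.
classes = [["a"], ["b", "c"], ["d"], ["e"], ["f", "g"], ["h", "l"], ["m"]]

def canonical_rho(ordered_classes):
    """Map every element to the index of its equivalence class (0 = most plausible)."""
    return {s: k for k, cls in enumerate(ordered_classes) for s in cls}

rho = canonical_rho(classes)

# The five pairs listed in (14), written with one representative per class.
pairs = [
    (("a", "b"), ("h", "m")),
    (("a", "b"), ("e", "f")),
    (("a", "d"), ("f", "m")),
    (("b", "e"), ("e", "f")),
    (("b", "e"), ("f", "m")),
]

def translate(pair):
    """Rewrite ([s1],[s2]) ≐ ([t1],[t2]) as (rho(s1),rho(s2)) ≐ (rho(t1),rho(t2))."""
    (s1, s2), (t1, t2) = pair
    return ((rho[s1], rho[s2]), (rho[t1], rho[t2]))

numeric_pairs = [translate(p) for p in pairs]
```

Under these assumptions, `numeric_pairs` reproduces the numeric form of the five pairs displayed in (14).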

A bag (or multiset) is a generalization of the notion of set in which members are allowed to appear more than once. An example of a bag is

\{1, 2, 2, 3, 4, 4, 5, 6\}

. Given two bags

B_{1}

and

B_{2}

their union, denoted by

B_{1} ⋓ B_{2}

, is the bag that contains those elements that occur in either

B_{1}

or

B_{2}

and, furthermore, the number of times that each element occurs in

B_{1} ⋓ B_{2}

is equal to the number of times it occurs in

B_{1}

plus the number of times it occurs in

B_{2}

. For instance, if

B_{1} = \{1, 2, 2, 3, 4, 4, 5, 6\}

and

B_{2} = \{2, 3, 6, 6\}

then

B_{1} ⋓ B_{2} = \{1, 2, 2, 2, 3, 3, 4, 4, 5, 6, 6, 6\}

. We say that

B_{1}

is a proper sub-bag of

B_{2}

, denoted by

B_{1} ⊏ B_{2},

if

B_{1} \neq B_{2}

and each element that occurs in

B_{1}

occurs also, and at least as many times, in

B_{2} .

For example,

\{1, 2, 4, 4, 5, 6\} ⊏ \{1, 1, 2, 4, 4, 5, 5, 6\} .
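The two bag operations just defined can be illustrated with Python's `collections.Counter`, which stores each element together with its multiplicity. The function names `bag_union` and `is_proper_subbag` are hypothetical; this is a sketch of the definitions above, not part of the original construction.

```python
from collections import Counter

def bag_union(b1, b2):
    """Bag union: the multiplicity of each element is the sum of its multiplicities."""
    return Counter(b1) + Counter(b2)

def is_proper_subbag(b1, b2):
    """True if b1 != b2 and each element of b1 occurs at least as many times in b2."""
    c1, c2 = Counter(b1), Counter(b2)
    return c1 != c2 and all(c2[x] >= n for x, n in c1.items())

B1 = [1, 2, 2, 3, 4, 4, 5, 6]
B2 = [2, 3, 6, 6]

# Elements of the union, listed with repetitions in increasing order.
union = sorted(bag_union(B1, B2).elements())
```

With the bags from the text, `union` lists the twelve elements of the union given above, and `is_proper_subbag([1, 2, 4, 4, 5, 6], [1, 1, 2, 4, 4, 5, 5, 6])` evaluates to `True`.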

Given a pair

(i, j)

with

i < j

, we associate with it the set

B_{(i, j)} = {i + 1, i + 2, \dots, j}

. For example,

B_{(2, 5)} = {3, 4, 5} .

Given a set of pairs

P = {(i_{1}, j_{1}), (i_{2}, j_{2}), \dots, (i_{m}, j_{m})}

(with

i_{k} < j_{k},

for every

k = 1, \dots, m

) we associate with it the bag

B_{P} = B_{(i_{1}, j_{1})} ⋓ B_{(i_{2}, j_{2})} ⋓ \dots ⋓ B_{(i_{m}, j_{m})}

. For example, if

P = {(0, 2), (1, 4), (2, 5)}

then

B_{P} = {1, 2} ⋓ {2, 3, 4} ⋓ {3, 4, 5} = {1, 2, 2, 3, 3, 4, 4, 5} .
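As a small illustration of the construction of the bag associated with a set of pairs, the following Python sketch builds it from the individual sets B_(i,j). The helper names `bag_of_pair` and `bag_of_pairs` are hypothetical.

```python
from collections import Counter

def bag_of_pair(i, j):
    """B_(i,j) = {i+1, i+2, ..., j}, defined only for i < j."""
    if not i < j:
        raise ValueError("expected i < j")
    return Counter(range(i + 1, j + 1))

def bag_of_pairs(pairs):
    """Bag union of B_(i,j) over all pairs (i, j) in the given collection."""
    total = Counter()
    for i, j in pairs:
        total += bag_of_pair(i, j)
    return total

# The example from the text: P = {(0, 2), (1, 4), (2, 5)}.
B_P = sorted(bag_of_pairs([(0, 2), (1, 4), (2, 5)]).elements())
```

Here `B_P` is the sorted list of elements (with repetitions) of the bag computed in the example above.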

Definition 10.

For every element of ≐, expressed (using the canonical representation ρ) as

(i, j) ≐ (k, ℓ)

(with

i < j

and

k < ℓ

), the equation corresponding to it is

x_{i + 1} + x_{i + 2} + \dots + x_{j} = x_{k + 1} + x_{k + 2} + \dots + x_{ℓ}

. By the system of equations corresponding to ≐ we mean the set of all such equations26.

For example, consider the total pre-order given in (13) and the following equivalence relation ≐ (expressed in terms of ρ and omitting the reflexive pairs):

\{(0, 3) ≐ (2, 4), (2, 4) ≐ (0, 3), (2, 4) ≐ (3, 5), (3, 5) ≐ (2, 4), (0, 3) ≐ (3, 5), (3, 5) ≐ (0, 3)\}

Then the corresponding system of equations is given by:

\begin{matrix} x_{1} + x_{2} + x_{3} = x_{3} + x_{4} \\ x_{3} + x_{4} = x_{1} + x_{2} + x_{3} \\ x_{3} + x_{4} = x_{4} + x_{5} \\ x_{4} + x_{5} = x_{3} + x_{4} \\ x_{1} + x_{2} + x_{3} = x_{4} + x_{5} \\ x_{4} + x_{5} = x_{1} + x_{2} + x_{3} \end{matrix}

(15)
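One possible way to encode the system of equations of Definition 10 is as rows of integer coefficients, each row representing "left-hand side minus right-hand side = 0". The sketch below does this for the example above; the function name `equation_row` and the choice of six variables (one for each positive value of ρ in (13)) are assumptions made for illustration, and only one of each pair of symmetric equations in (15) is generated.

```python
def equation_row(left, right, n):
    """Coefficients of x_1..x_n in (x_{i+1}+...+x_j) - (x_{k+1}+...+x_l)."""
    (i, j), (k, l) = left, right
    row = [0] * n
    for idx in range(i + 1, j + 1):
        row[idx - 1] += 1   # variable appears on the left-hand side
    for idx in range(k + 1, l + 1):
        row[idx - 1] -= 1   # variable appears on the right-hand side
    return row

# One representative for each symmetric pair of elements of the relation.
relation = [((0, 3), (2, 4)), ((2, 4), (3, 5)), ((0, 3), (3, 5))]
n = 6  # rho in (13) takes the values 0..6, so the variables are x_1..x_6
rows = [equation_row(a, b, n) for a, b in relation]

def satisfies(rows, x):
    """Check that the vector x solves every equation encoded in rows."""
    return all(sum(c * v for c, v in zip(row, x)) == 0 for row in rows)
```

For instance, `satisfies(rows, (1, 1, 1, 2, 1, 1))` is `True`, so this particular system does admit a solution in positive integers.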

We are now ready to state the solution to Problem 1. The proof is given in Appendix A.

Proposition 3.

Given a pair

(≾, ≐)

, where ≾ is a total pre-order on a finite set S and ≐ is an equivalence relation on the set of pairs of equivalence classes of ≾, (A), (B) and (C) below are equivalent.

(A): There is a function $F : S \to N$ such that, for all $s, t, x, y \in S$ , (1) $F (s) \leq F (t)$ if and only if $s ≾ t$ ; and (2) if $([s], [t]) ≐ ([x], [y])$ , with $s ≺ t$ and $x ≺ y$ , then $F (t) - F (s) = F (y) - F (x)$ ,
(B): The system of equations corresponding to ≐ (Definition 10) has a solution consisting of positive integers.
(C): There is no sequence $⟨((i_{1}, j_{1}) ≐ (k_{1}, ℓ_{1})), \dots, ((i_{m}, j_{m}) ≐ (k_{m}, ℓ_{m}))⟩$ in ≐ (expressed in terms of the canonical representation ρ of ≾ ) such that $B_{l e f t} ⊏ B_{r i g h t}$ where $B_{l e f t} = B_{(i_{1}, j_{1})} ⋓ \dots ⋓ B_{(i_{m}, j_{m})}$ and $B_{r i g h t} = B_{(k_{1}, ℓ_{1})} ⋓ \dots ⋓ B_{(k_{m}, ℓ_{m})}$ .

As an application of Proposition 3 consider again the game of Figure 3 and plausibility order (5) which rationalizes the assessment

σ = (M, ℓ, a, c, e)

,

μ (x) = 1

for

x \in {\emptyset, M, M r, L m, R ℓ}

and

μ (x) = 0

for every other decision history x; the order is reproduced below together with the canonical integer-valued representation ρ:

(\begin{matrix} & ≾ : & ρ : \\ most plausible & \emptyset, M, M ℓ & 0 \\ & R, R ℓ, R ℓ e & 1 \\ & M m, M m e & 2 \\ & M r, M r a & 3 \\ & L, L ℓ, L ℓ a & 4 \\ & R m & 5 \\ & L m, L m c & 6 \\ & R r, R r c & 7 \\ & L r & 8 \\ & R ℓ f & 9 \\ & M m f & 10 \\ & L m d & 11 \\ & R r d & 12 \\ & M r b & 13 \\ least plausible & L ℓ b & 14 \end{matrix})

(16)

By Remark 2, two elements of ≐ are

([M], [R]) ≐ ([M r], [R r])

and

([M m], [L m]) ≐ ([M], [L])

, which—expressed in terms of the canonical ordinal representation ρ—can be written as

(0, 1) ≐ (3, 7)

(2, 6) ≐ (0, 4)

Then

B_{l e f t} = {1} ⋓ {3, 4, 5, 6} = {1, 3, 4, 5, 6}

and

B_{r i g h t} = {4, 5, 6, 7} ⋓ {1, 2, 3, 4} = {1, 2, 3, 4, 4, 5, 6, 7}

. Thus, since

B_{l e f t} ⊏ B_{r i g h t}

, by Part (C) of Proposition 3 ≾ is not choice measurable.
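The check behind this application of Part (C) is mechanical and can be sketched in Python. The function name `violates_part_c` is hypothetical; the code merely tests, for a given sequence of elements of ≐ written in terms of ρ, whether the left bag is a proper sub-bag of the right bag.

```python
from collections import Counter

def bag_of_pair(i, j):
    """B_(i,j) = {i+1, ..., j} as a bag."""
    return Counter(range(i + 1, j + 1))

def violates_part_c(sequence):
    """True if B_left is a proper sub-bag of B_right for the given sequence."""
    left, right = Counter(), Counter()
    for (i, j), (k, l) in sequence:
        left += bag_of_pair(i, j)
        right += bag_of_pair(k, l)
    return left != right and all(right[x] >= n for x, n in left.items())

# The two elements of the relation used above: (0,1) ≐ (3,7) and (2,6) ≐ (0,4).
sequence = [((0, 1), (3, 7)), ((2, 6), (0, 4))]
not_choice_measurable = violates_part_c(sequence)
```

A `True` result exhibits a sequence of the kind ruled out by Part (C); a `False` result for one particular sequence says nothing about other sequences.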

As a further application of Proposition 3 consider the total pre-order ≾ given in (13) together with the subset of the equivalence relation ≐ given in (14). Then there is no cardinal representation of ≾ that satisfies the constraints expressed by ≐, because of Part (C) of the above proposition and the following sequence27:

⟨((0, 1) ≐ (3, 4)), ((1, 3) ≐ (4, 6)), ((3, 4) ≐ (1, 3)), ((4, 6) ≐ (0, 2))⟩

where

B_{l e f t} = {1} ⋓ {2, 3} ⋓ {4} ⋓ {5, 6} = {1, 2, 3, 4, 5, 6} ⊏ B_{r i g h t} = {4} ⋓ {5, 6} ⋓ {2, 3} ⋓ {1, 2} = {1, 2, 2, 3, 4, 5, 6} .

In fact, the above sequence corresponds to the following system of equations:

\begin{matrix} x_{1} = x_{4} & corresponding to & (0, 1) ≐ (3, 4) \\ x_{2} + x_{3} = x_{5} + x_{6} & corresponding to & (1, 3) ≐ (4, 6) \\ x_{4} = x_{2} + x_{3} & corresponding to & (3, 4) ≐ (1, 3) \\ x_{5} + x_{6} = x_{1} + x_{2} & corresponding to & (4, 6) ≐ (0, 2) \end{matrix}

Adding the four equations we get

x_{1} + x_{2} + x_{3} + x_{4} + x_{5} + x_{6} = x_{1} + 2 x_{2} + x_{3} + x_{4} + x_{5} + x_{6}

which simplifies to

0 = x_{2}

, which is incompatible with a positive solution.

Remark 3.

In [15] an algorithm is provided for determining whether a system of linear equations has a positive solution and for calculating such a solution if one exists. Furthermore, if the coefficients of the equations are integers and a positive solution exists, then the algorithm yields a solution consisting of positive integers.
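For very small systems, the existence of a solution in positive integers can be explored by exhaustive search. The sketch below is not the algorithm of [15]; it is a naive bounded search, with the hypothetical name `find_positive_integer_solution`, and a `None` result only means that no solution exists within the chosen bound.

```python
from itertools import product

def find_positive_integer_solution(rows, bound):
    """Search x in {1..bound}^n with row . x == 0 for every row; None if not found."""
    n = len(rows[0])
    for x in product(range(1, bound + 1), repeat=n):
        if all(sum(c * v for c, v in zip(row, x)) == 0 for row in rows):
            return x
    return None

# System (15) over x_1..x_6, each row encoding "left side minus right side".
rows_15 = [
    [1, 1, 0, -1, 0, 0],   # x1 + x2 + x3 = x3 + x4
    [0, 0, 1, 0, -1, 0],   # x3 + x4 = x4 + x5
    [1, 1, 1, -1, -1, 0],  # x1 + x2 + x3 = x4 + x5
]
solution = find_positive_integer_solution(rows_15, bound=2)

# The four equations of the second application of Proposition 3 above.
rows_bad = [
    [1, 0, 0, -1, 0, 0],    # x1 = x4
    [0, 1, 1, 0, -1, -1],   # x2 + x3 = x5 + x6
    [0, -1, -1, 1, 0, 0],   # x4 = x2 + x3
    [-1, -1, 0, 0, 1, 1],   # x5 + x6 = x1 + x2
]
no_solution = find_positive_integer_solution(rows_bad, bound=3)
```

The search is exponential in the number of variables, so it is suitable only for toy examples such as these.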

5. Related Literature

As noted in Section 1, the quest in the literature for a “simple” solution concept intermediate between subgame-perfect equilibrium and sequential equilibrium has produced several attempts to provide a general definition of perfect Bayesian equilibrium.

In [16] a notion of perfect Bayesian equilibrium was provided for a small subset of extensive-form games (namely the class of multi-stage games with observed actions and independent types), but extending that notion to arbitrary games proved to be problematic28.

In [14] a notion of perfect Bayesian equilibrium is provided that can be applied to general extensive-form games (although it was defined only for games without chance moves); however, the proposed definition is in terms of a more complex object, namely a “tree-extended assessment”

(ν, σ, μ)

where ν is a conditional probability system on the set of terminal nodes. The main idea underlying the notion of perfect Bayesian equilibrium proposed in [14] is what the author calls “strategic independence”: when forming beliefs, the strategic choices of different players should be regarded as independent events.

Several more recent contributions [5,17,18] have re-addressed the issue of providing a definition of perfect Bayesian equilibrium that applies to general extensive-form games. Since [5] has been the focus of this paper, here we shall briefly discuss [17,18]. In [17] the notion of “simple perfect Bayesian equilibrium” is introduced and it is shown to lie strictly between subgame-perfect equilibrium and sequential equilibrium. This notion is based on an extension of the definition of sub-tree, called “quasi sub-tree”, which consists of an information set I together with all the histories that are successors of histories in I (that is,

Γ_{I}

is a quasi-subtree that starts at I if

h^{'} \in Γ_{I}

if and only if there exists an

h \in I

such that h is a prefix of

h^{'}

). A quasi sub-tree

Γ_{I}

is called regular if it satisfies the following property: if

h \in Γ_{I}

and

h^{'} \in I (h)

then

h^{'} \in Γ_{I}

(that is, every information set that has a non-empty intersection with

Γ_{I}

is entirely included in

Γ_{I}

). An information set I is called regular if the quasi-subtree that starts at I is regular. For example, in the game of Figure 4, the singleton information set

{b}

of Player 2 is not regular. An assessment (

σ, μ

) is defined to be a “simple perfect Bayesian equilibrium” if it is sequentially rational and, for every regular quasi-subtree

Γ_{I}

, Bayes’ rule is satisfied at every information set that is reached with positive probability by σ in

Γ_{I}

(in other words, if the restriction of (

σ, μ

) to every regular quasi-subtree is a weak sequential equilibrium of the quasi-subtree). This notion of perfect Bayesian equilibrium is weaker than the notion considered in this paper (Definition 4). For example, in the game of Figure 4, the pure-strategy profile

σ = (c, d, f)

(highlighted by double edges), together with the system of beliefs

μ (a) = μ (b d) = 0, μ (b e) = 1

, is a simple perfect Bayesian equilibrium, while (as shown in [5]) there is no system of beliefs

μ^{'}

such that

(σ, μ^{'})

is a perfect Bayesian equilibrium. A fortiori, the notion of simple perfect Bayesian equilibrium is weaker than the refinements of PBE discussed in Section 3.

Figure 4.

In [18], the author proposes a definition of perfect Bayesian equilibrium which is framed not in terms of assessments but in terms of “appraisals”. Each player is assumed to have a (possibly artificial) information set representing the beginning of the game and an appraisal for player i is a map that associates with every information set of player i a probability distribution over the set of pure-strategy profiles that reach that information set. Thus an appraisal for player i captures, for every information set of hers, her conjecture about how the information set was reached and what will happen from this point in the game. An appraisal system is defined to be “plainly consistent” if, whenever an information set of player i has a product structure (each information set is identified with the set of pure-strategy profiles that reach that information set), the player’s appraisal at that information set satisfies independence29. A strategy profile σ is defined to be a perfect Bayesian equilibrium if there is a plainly consistent appraisal system P that satisfies sequential rationality and is such that at their “initial” information sets all the players assign probability 1 to σ; in [18] (p. 15), the author summarizes the notion of PBE as being based on “a simple foundation: sequential rationality and preservation of independence and Bayesian updating where applicable” (that is, on subsets of strategy profiles that have the appropriate product structure and independence property). Despite the fact that the notion of PBE suggested in [18] incorporates a notion of independence, it can be weaker than the notion of PBE discussed in Section 2 (Definition 4) and thus, a fortiori, weaker than the notion of weakly independent PBE (Definition 8, Section 3). This can be seen from the game of Figure 5, which essentially reproduces an example given in [18]. The strategy profile

σ = (b, d, e)

(highlighted by double edges), together with any system of beliefs μ such that

μ (a c) > 0

cannot be a PBE according to Definition 4 (Section 2). In fact, since

σ (d) > 0

while

σ (c) = 0

, any plausibility order that rationalizes

(σ, μ)

must be such that

a \sim a d ≺ a c

, which implies that

μ (a c) = 0

(because

a c

cannot be most plausible in the set

{a c, a d, b c}

). On the other hand, σ can be a PBE according to the definition given in [18] (p. 15), since the information set of Player 3 does not have a product structure so that Player 3 is not able to separate the actions of Players 1 and 2. For example, consider the appraisal system P where, initially, all the players assign probability 1 to σ and, at his information set, Player 2 assigns probability 1 to the strategy profile

(b, e)

of Players 1 and 3 and, at her information set, Player 3 assigns probability

\frac{1}{3}

to each of the strategy profiles

(a, c), (a, d)

and

(b, c)

of Players 1 and 2. Then P is plainly consistent and sequentially rational, so that σ is a PBE as defined in [18].

Figure 5.

Thus, a fortiori, the notion of perfect Bayesian equilibrium given in [18] can be weaker than the notions of weakly/strongly independent PBE introduced in Section 3.

6. Conclusions

Besides sequential rationality, the notion of perfect Bayesian equilibrium (Definition 4) introduced in [5] is based on two elements: (1) rationalizability of the assessment by a plausibility order (Definition 2); and (2) the notion of Bayesian consistency relative to the plausibility order. The first property identifies the set of decision histories that can be assigned positive conditional probability by the system of beliefs, while the second property imposes constraints on how conditional probabilities can be distributed over that set in order to guarantee “Bayesian updating as long as possible”30. In [8] it was shown that by strengthening these two conditions one obtains a “limit free” characterization of sequential equilibrium. The strengthening of the first condition is that the plausibility order that rationalizes the given assessment be choice measurable, that is, that there be a cardinal representation of it (which can be interpreted as measuring the plausibility distance between histories in a way that is preserved by the addition of a common action). The strengthening of the second condition imposes “uniform consistency” on the conditional probability density functions on the equivalence classes of the plausibility order, by requiring that there be a full-support common prior that preserves the relative probabilities of two decision histories in the same information set when a common action is added. There is a “substantial” gap between the notion of PBE and that of sequential equilibrium. In this paper we identified two solution concepts that lie in this gap. The first notion, weakly independent PBE (Definition 8), is obtained by adding a restriction on the belief revision policy encoded in the plausibility order that rationalizes the given assessment (together with strengthening Bayes consistency to uniform Bayes consistency). 
This restriction says that observation of a new action at an information set should not alter the relative plausibility of any two histories in that information set (condition

I N D_{1}

); it can be interpreted as an “independence” or “conservative” principle, in the sense that observation of a new action should not lead to a reversal of judgment of plausibility concerning past histories. The second notion, strongly independent PBE (Definition 9), is obtained by adding to the first notion a further restriction, according to which the implicit plausibility ranking of two actions available at the same information set should be independent of the history at which the actions are taken.

A further contribution of this paper has been to provide a method for determining if a plausibility order is choice measurable, which is one of the two conditions that, together, are necessary and sufficient for a PBE to be a sequential equilibrium.

This paper highlights the need to conceptualize refinements of subgame-perfect equilibrium in extensive-form games in terms of principles of belief revision. Through the notion of plausibility order and AGM-consistency we have appealed to the principles of belief revision that underlie the so-called AGM theory [7]. However, in a dynamic game, beliefs typically need to be revised several times in a sequence as new information sets are reached. Thus the relevant notion of belief revision is iterated belief revision. There is an extensive literature on the topic of iterated belief revision (see, for example, [19,20,21,22] and the special issue of the Journal of Philosophical Logic, Vol. 40 (2), April 2011). An exploration of different solution concepts in the gap between PBE and sequential equilibrium, based on different principles of iterated belief revision, seems to be a promising area of research.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Proofs

Proof of Lemma 1.

Let ≾ be a plausibility order on the set of histories H and let

F : H \to N

be an integer-valued representation of ≾. We want to show that properties

C M

and

C M^{'}

below are equivalent (for arbitrary

h, h^{'} \in H

, with

h^{'} \in I (h)

, and

a, b \in A (h)

)

F (h^{'}) - F (h) = F (h^{'} a) - F (h a) .

(CM)

F (h b) - F (h a) = F (h^{'} b) - F (h^{'} a) .

(CM^′)

First of all, note that, without loss of generality, we can assume that

F (\emptyset) = 0

31.

First we show that

C M \Rightarrow C M^{'} .

Let F be an integer-valued representation of ≾ that satisfies

C M

. For every decision history h and action

a \in A (h)

, define

λ (a) = F (h a) - F (h) .

(A1)

The function

λ : A \to N

is well defined, since, by

C M

,

h^{'} \in I (h)

implies that

F (h^{'} a) - F (h^{'}) = F (h a) - F (h)

. Then, for every history

h = ⟨a_{1}, a_{2}, \dots, a_{m}⟩

,

F (h) = \sum_{i = 1}^{m} λ (a_{i})

. In fact,

\begin{array}{l} λ (a_{1}) + λ (a_{2}) + \dots + λ (a_{m}) = \\ = (F (a_{1}) - F (\emptyset)) + (F (a_{1} a_{2}) - F (a_{1})) + \dots + (F (a_{1} a_{2} \dots a_{m}) - F (a_{1} a_{2} \dots a_{m - 1})) = \\ = F (a_{1} a_{2} \dots a_{m}) = F (h) (recall the hypothesis that F (\emptyset) = 0) \end{array}

Thus, for every

h \in D

and

a \in A (h)

,

F (h a) = F (h) + λ (a)

. Hence,

F (h b) - F (h a) = F (h) + λ (b) - F (h) - λ (a) = λ (b) - λ (a)

and

F (h^{'} b) - F (h^{'} a) = F (h^{'}) + λ (b) - F (h^{'}) - λ (a) = λ (b) - λ (a)

and, therefore,

F (h b) - F (h a) = F (h^{'} b) - F (h^{'} a) .

Next we show that

C M^{'} \Rightarrow C M .

Let ≾ be a plausibility order and let

F : H \to N

be an integer-valued representation that satisfies

C M^{'}

. Select arbitrary

h^{'} \in I (h)

and

a \in A (h)

. Let

b \in A (h)

be a plausibility-preserving action at h (there must be at least one such action: see Definition 1); then,

h \sim h b

and

h^{'} \sim h^{'} b

. Hence, since F is a representation of ≾,

F (h b) = F (h)

and

F (h^{'} b) = F (h^{'})

and thus

F (h^{'}) - F (h) = F (h^{'} b) - F (h b) .

(A2)

By

C M^{'}

,

F (h^{'} b) - F (h b) = F (h^{'} a) - F (h a)

. From this and (A2) it follows that

F (h^{'}) - F (h) = F (h^{'} a) - F (h a)

. ☐

Proof of Proposition 2.

Let

(σ, μ)

be a sequential equilibrium. We want to show that

(σ, μ)

is a strongly independent PBE (Definition 9). By Proposition 1, it is sufficient to show that

(σ, μ)

is rationalized by a plausibility order ≾ that satisfies Properties

I N D_{1}

and

I N D_{2}

. By Proposition 1 there is a choice measurable plausibility order ≾ that rationalizes

(σ, μ)

. Let F be an integer-valued representation of ≾ that satisfies Property

C M

. Let h and

h^{'}

be decision histories that belong to the same information set and let

a \in A (h)

. Then, by

C M

,

F (h) - F (h^{'}) = F (h a) - F (h^{'} a) .

(A3)

If

h ≾ h^{'}

then

F (h) \leq F (h^{'})

; by (A3), it follows that

F (h a) \leq F (h^{'} a)

and thus

h a ≾ h^{'} a

. Conversely, if

h a ≾ h^{'} a

then

F (h a) \leq F (h^{'} a)

and thus, by (A3),

F (h) \leq F (h^{'})

so that

h ≾ h^{'}

. Hence ≾ satisfies

I N D_{1}

.

Let h and

h^{'}

be decision histories that belong to the same information set and let

a, b \in A (h)

. We want to show that

I N D_{2}

holds, that is, that

h a ≾ h b

if and only if

h^{'} a ≾ h^{'} b

. Let F be an integer-valued representation of ≾ that satisfies Property

C M

. By Lemma 1 F satisfies Property

C M^{'}

, that is,

F (h a) - F (h b) = F (h^{'} a) - F (h^{'} b) .

(A4)

If

h a ≾ h b

then

F (h a) \leq F (h b)

and thus, by (A4),

F (h^{'} a) \leq F (h^{'} b)

, that is,

h^{'} a ≾ h^{'} b

. Conversely, if

h^{'} a ≾ h^{'} b

then

F (h^{'} a) \leq F (h^{'} b)

and thus, by (A4),

F (h a) \leq F (h b)

, so that

h a ≾ h b

. ☐

Proof of Proposition 3.

(A) \Rightarrow (B)

. Let

F^{'} : S \to N

satisfy the properties of Part (A). Select an arbitrary

s_{0} \in S_{0} = {s \in S : s ≾ t, \forall t \in S}

and define

F : S \to N

by

F (s) = F^{'} (s) - F^{'} (s_{0})

. Then F is also a function that satisfies the properties of Part (A) (note that since, for all

s \in S

,

F^{'} (s_{0}) \leq F^{'} (s)

,

F (s) \in N

; furthermore,

F (s^{'}) = 0

for all

s^{'} \in S_{0}

). Let

K = {k \in N : k = ρ (s) for some s \in S}

(where ρ is the canonical ordinal representation of ≾: see Footnote 23). For every

k \in K

, define

\begin{matrix} {\hat{x}}_{0} = 0 \\ and, for k > 0, \\ {\hat{x}}_{k} = F (t) - F (s) & for some s, t \in S such that ρ (t) = k and ρ (s) = k - 1 . \end{matrix}

(A5)

Thus

{\hat{x}}_{k}

is the distance, as measured by F, between the equivalence class of some t such that

ρ (t) = k

and the immediately preceding equivalence class (that is, the equivalence class of some s such that

ρ (s) = k - 1

)32. Note that

{\hat{x}}_{k}

is well defined since, if

x, y \in S

are such that

ρ (y) = k

and

ρ (x) = k - 1

, then

x \sim s

and

y \sim t

and thus, by (1) of Property (A),

F (x) = F (s)

and

F (y) = F (t)

. Note also that, for all

k \in K \ {0}

,

{\hat{x}}_{k}

is a positive integer, since

ρ (t) = k

and

ρ (s) = k - 1

imply that

s ≺ t

and thus, by (1) of Property (A),

F (s) < F (t)

. We want to show that the values

{\{{\hat{x}}_{k}\}}_{k \in K \ {0}}

defined in (A5) provide a solution to the system of equations corresponding to ≐ (Definition 10). Select an arbitrary element of ≐,

([s_{1}], [s_{2}]) ≐ ([t_{1}], [t_{2}])

(with

s_{1} ≺ s_{2}

and

t_{1} ≺ t_{2}

) and express it, using the canonical ordinal representation ρ (see Footnote 23), as

(i_{1}, i_{2}) ≐ (j_{1}, j_{2})

(thus

i_{1} = ρ (s_{1})

,

i_{2} = ρ (s_{2})

,

j_{1} = ρ (t_{1})

,

j_{2} = ρ (t_{2})

,

i_{1} < i_{2}

and

j_{1} < j_{2}

). Then the corresponding equation (see Definition 10) is:

x_{i_{1} + 1} + x_{i_{1} + 2} + \dots + x_{i_{2}} = x_{j_{1} + 1} + x_{j_{1} + 2} + \dots + x_{j_{2}}

. By (2) of Property (A),

F (s_{2}) - F (s_{1}) = F (t_{2}) - F (t_{1})

(A6)

Using (A5),

F (s_{2}) - F (s_{1}) = {\hat{x}}_{i_{1} + 1} + {\hat{x}}_{i_{1} + 2} + \dots + {\hat{x}}_{i_{2}} .

To see this, for every

k \in \{i_{1} + 1, i_{1} + 2, \dots, i_{2} - 1\}

, select an arbitrary

r_{k} \in S

such that

ρ (r_{k}) = k

; then, by (A5),

F (s_{2}) - F (s_{1}) = \underbrace{{\hat{x}}_{i_{1} + 1}}_{= F (r_{i_{1} + 1}) - F (s_{1})} + \underbrace{{\hat{x}}_{i_{1} + 2}}_{= F (r_{i_{1} + 2}) - F (r_{i_{1} + 1})} + \dots + \underbrace{{\hat{x}}_{i_{2}}}_{= F (s_{2}) - F (r_{i_{2} - 1})} .

Similarly,

F (t_{2}) - F (t_{1}) = {\hat{x}}_{j_{1} + 1} + {\hat{x}}_{j_{1} + 2} + \dots + {\hat{x}}_{j_{2}}

. Thus, by (A6),

{\hat{x}}_{i_{1} + 1} + {\hat{x}}_{i_{1} + 2} + \dots + {\hat{x}}_{i_{2}} = {\hat{x}}_{j_{1} + 1} + {\hat{x}}_{j_{1} + 2} + \dots + {\hat{x}}_{j_{2}}

.

(B) \Rightarrow (A)

. Assume that the system of equations corresponding to ≐ has a solution consisting of positive integers

{\hat{x}}_{1}, \dots, {\hat{x}}_{m}

. Define

F : S \to N

as follows: if

ρ (s) = 0

(equivalently,

s \in S_{0}

) then

F (s) = 0

and if

ρ (s) = k > 0

(equivalently,

s \in S_{k}

for

k > 0

) then

F (s) = {\hat{x}}_{1} + {\hat{x}}_{2} + \dots + {\hat{x}}_{k}

(where ρ and the sets

S_{k}

are as defined in Footnote 23). We need to show that F satisfies the properties of Part (A). Select arbitrary

s, t \in S

with

s ≾ t

. Then

ρ (s) \leq ρ (t)

and thus

F (s) = {\hat{x}}_{1} + {\hat{x}}_{2} + \dots + {\hat{x}}_{ρ (s)} \leq F (t) = {\hat{x}}_{1} + {\hat{x}}_{2} + \dots + {\hat{x}}_{ρ (s)} + {\hat{x}}_{ρ (s) + 1} + \dots + {\hat{x}}_{ρ (t)}

. Conversely, suppose that

s, t \in S

are such that

F (s) \leq F (t)

. Then

{\hat{x}}_{1} + {\hat{x}}_{2} + \dots + {\hat{x}}_{ρ (s)} \leq {\hat{x}}_{1} + {\hat{x}}_{2} + \dots + {\hat{x}}_{ρ (t)}

and thus

ρ (s) \leq ρ (t)

, so that

s ≾ t

. Thus Property (1) of Part (A) is satisfied. Now let

s, t, x, y \in S

be such that

s ≺ t

,

x ≺ y

and

([s], [t]) ≐ ([x], [y]) .

Let

ρ (s) = i

,

ρ (t) = j

,

ρ (x) = k

and

ρ (y) = ℓ

(thus

i < j

and

k < ℓ

). Then, by (A5),

F (t) - F (s) = {\hat{x}}_{i + 1} + {\hat{x}}_{i + 2} + \dots + {\hat{x}}_{j}

and

F (y) - F (x) = {\hat{x}}_{k + 1} + {\hat{x}}_{k + 2} + \dots + {\hat{x}}_{ℓ}

. Since

x_{i + 1} + x_{i + 2} + \dots + x_{j} = x_{k + 1} + x_{k + 2} + \dots + x_{ℓ}

is the equation corresponding to

([s], [t]) ≐ ([x], [y])

(which - using ρ - can be expressed as

(i, j) ≐ (k, ℓ)

), by our hypothesis

{\hat{x}}_{i + 1} + {\hat{x}}_{i + 2} + \dots + {\hat{x}}_{j} = {\hat{x}}_{k + 1} + {\hat{x}}_{k + 2} + \dots + {\hat{x}}_{ℓ}

and thus

F (t) - F (s) = F (y) - F (x)

, so that (2) of Property (A) is satisfied.
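The construction of F used in this step can be illustrated with a short Python sketch: F assigns 0 to the most plausible class and the running sum of the solution values to each later class. The name `representation_from_solution` is hypothetical, the order is (13) with ℓ written as `"l"`, and the solution vector (1, 1, 1, 2, 1, 1) is one assumed solution of system (15).

```python
from itertools import accumulate

def representation_from_solution(ordered_classes, x_hat):
    """F(s) = x_hat_1 + ... + x_hat_rho(s), with F(s) = 0 when rho(s) = 0.

    ordered_classes: equivalence classes from most to least plausible.
    x_hat: positive integers, one for each class after the first.
    """
    levels = [0] + list(accumulate(x_hat))
    return {s: levels[k] for k, cls in enumerate(ordered_classes) for s in cls}

classes = [["a"], ["b", "c"], ["d"], ["e"], ["f", "g"], ["h", "l"], ["m"]]
F = representation_from_solution(classes, (1, 1, 1, 2, 1, 1))

# Distances required to be equal by (0,3) ≐ (2,4) ≐ (3,5).
d_03 = F["e"] - F["a"]
d_24 = F["f"] - F["d"]
d_35 = F["h"] - F["e"]
```

With this input the three distances coincide, as condition (2) of Part (A) requires, and elements of the same equivalence class receive the same value.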

not (C) \Rightarrow not (B)

. Suppose that there is a sequence in ≐ (expressed in terms of the canonical representation ρ of ≾)

⟨((i_{1}, j_{1}) ≐ (k_{1}, ℓ_{1})), \dots, ((i_{m}, j_{m}) ≐ (k_{m}, ℓ_{m}))⟩

such that

B_{l e f t} ⊏ B_{r i g h t}

(A7)

where

B_{l e f t} = B_{(i_{1}, j_{1})} ⋓ \dots ⋓ B_{(i_{m}, j_{m})}

and

B_{r i g h t} = B_{(k_{1}, ℓ_{1})} ⋓ \dots ⋓ B_{(k_{m}, ℓ_{m})}

. Let

E = {E_{1}, \dots, E_{m}}

be the system of equations corresponding to the above sequence (for example,

E_{1}

is the equation

x_{i_{1} + 1} + x_{i_{1} + 2} + \dots + x_{j_{1}} = x_{k_{1} + 1} + x_{k_{1} + 2} + \dots + x_{ℓ_{1}}

). Let L be the sum of the left-hand-side and R be the sum of the right-hand-side of the equations

E_{1}, \dots, E_{m}

. Note that for every integer i,

n x_{i}

is a summand of L if and only if i appears in

B_{l e f t}

exactly n times and similarly

n x_{i}

is a summand of R if and only if i appears in

B_{r i g h t}

exactly n times. By (A7), if

n x_{i}

is a summand of L then

m x_{i}

is a summand of R with

m \geq n

and, furthermore,

L \neq R

. Thus there cannot be a positive solution of

E

, because it would be incompatible with

L = R

. Since

E

is a subset of the system of equations corresponding to ≐, it follows that the latter cannot have a positive solution either.

It only remains to prove that

not (B) \Rightarrow not (C)

. We will return to this below after providing an additional result. ☐

First some notation. Given two vectors

x, y \in R^{m}

we write (1)

x \leq y

if

x_{i} \leq y_{i}

, for every

i = 1, \dots, m

; (2)

x < y

if

x \leq y

and

x \neq y

; and (3)

x ≪ y

if

x_{i} < y_{i}

, for every

i = 1, \dots, m

.

Lemma 2.

Let A be the

m \times n

matrix such that the system of equations corresponding to ≐ (Definition 10) can be expressed as

A x = 0

(note that each entry of A is either

- 1

, 0 or 1; furthermore, by symmetry of ≐, for each row

a_{i}

of A there is another row

a_{k}

such that

a_{k} = - a_{i}

)33. If the system of equations

A x = 0

does not have a positive integer solution then there exist r rows of A,

a_{i_{1}}, \dots, a_{i_{r}}

with

1 < r \leq \frac{m}{2}

and r positive integers

α_{1}, \dots, α_{r} \in N \ {0}

such that if B is the submatrix of A consisting of the r rows

a_{i_{1}}, \dots, a_{i_{r}}

(thus for every

k = 1, \dots, r

,

b_{k} = a_{i_{k}}

, where

b_{k}

is the

k^{t h}

row of B) then

\sum_{k = 1}^{r} α_{k} b_{k} < 0

.

Proof.

By Stiemke’s theorem34 if the system of equations

A x = 0

does not have a positive integer solution then there exists a

y \in Z^{m}

(where

Z

denotes the set of integers) such that

y A < 0

(that is,

\sum_{i = 1}^{m} y_{i} a_{i} < 0

). Let

K = {k \in Z : y_{k} \neq 0}

. Let r be the cardinality of K; then, without loss of generality, we can assume that

r \leq \frac{m}{2}

35. Furthermore, again without loss of generality, we can assume that for every

k \in K

,

y_{k} > 0

36. Let B be the

r \times n

submatrix of A consisting of those rows

a_{k}

of A such that

k \in K

and for

i = 1, \dots, r

let

α = (α_{1}, \dots, α_{r})

be the vector corresponding to

{(y_{k})}_{k \in K}

37. Then

α B = \sum_{j = 1}^{r} α_{j} b_{j} = y A < 0

and

α_{i} \in N \ {0}

for all

i = 1, \dots, r

. ☐

Completion of Proof of Proposition 3.

It remains to prove that

not (B) \Rightarrow not (C)

. Let A be the

m \times n

matrix such that the system of equations corresponding to ≐ can be expressed as

A x = 0

and assume that

A x = 0

does not have a positive integer solution. Let B be the

r \times n

submatrix of A and

α = (α_{1}, \dots, α_{r})

the vector of positive integers of Lemma 2 such that

α B = \sum_{j = 1}^{r} α_{j} b_{j} < 0

. Define two

r \times n

matrices

C = {(c_{i j})}_{i = 1, \dots, r; j = 1, \dots, n}

and

D = {(d_{i j})}_{i = 1, \dots, r; j = 1, \dots, n}

as follows (recall that each entry of B is either

- 1

, 0 or 1):

c_{i j} = \{\begin{matrix} 1 & i f b_{i j} = 1 \\ 0 & otherwise \end{matrix} and d_{i j} = \{\begin{matrix} 1 & i f b_{i j} = - 1 \\ 0 & otherwise \end{matrix} .

Then, for every

i = 1, \dots, r

,

b_{i} = c_{i} - d_{i}

and thus (since

\sum_{i = 1}^{r} α_{i} b_{i} < 0

)

\sum_{i = 1}^{r} α_{i} c_{i} < \sum_{i = 1}^{r} α_{i} d_{i} .

(A9)

Let

C^{'}

be the matrix obtained from C by replacing each row

c_{i}

of C with

α_{i}

copies of it and let

D^{'}

be constructed from D similarly. Then, letting

s = \sum_{i = 1}^{r} α_{i}

,

C^{'}

and

D^{'}

are

s \times n

matrices whose entries are either 0 or 1. It follows from (A9) that

\sum_{i = 1}^{s} c_{i}^{'} < \sum_{i = 1}^{s} d_{i}^{'} .

(A10)

Consider the system of equations

C^{'} x = D^{'} x .

(A11)

For every

j = 1, \dots, n

, the

j^{t h}

coordinate of

\sum_{i = 1}^{s} c_{i}^{'}

is the number of times that the variable

x_{j}

appears on the left-hand-side of (A11) and the

j^{t h}

coordinate of

\sum_{i = 1}^{s} d_{i}^{'}

is the number of times that the variable

x_{j}

appears on the right-hand-side of (A11). Hence, by (A10), for every

j = 1, \dots, n

, the number of times that the variable

x_{j}

appears on the left-hand-side of (A11) is less than or equal to the number of times that it appears on the right-hand-side of (A11) and for at least one j it is less. Thus, letting

⟨((i_{1}, j_{1}) ≐ (k_{1}, ℓ_{1})), \dots, ((i_{s}, j_{s}) ≐ (k_{s}, ℓ_{s}))⟩

be the sequence of elements of ≐ corresponding to the equations in (A11), we have that

B_{l e f t} ⊏ B_{r i g h t}

where

B_{l e f t} = B_{(i_{1}, j_{1})} ⋓ \dots ⋓ B_{(i_{m}, j_{m})}

and

B_{r i g h t} = B_{(k_{1}, ℓ_{1})} ⋓ \dots ⋓ B_{(k_{m}, ℓ_{m})}

. ☐
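The construction of $C^{'}$ and $D^{'}$ in the proof above can be traced on a toy instance with the following Python sketch. The matrix `B` and the vector `alpha` are hypothetical values chosen only so that $α B < 0$ holds; they are not derived from any game in the paper, and the function names are likewise assumptions of this sketch.

```python
from typing import List, Tuple


def split_and_replicate(B: List[List[int]], alpha: List[int]) -> Tuple[List[List[int]], List[List[int]]]:
    """Build C' and D' from B and alpha as in the proof.

    C marks the +1 entries of B, D marks the -1 entries (so B = C - D);
    row i of each matrix is then repeated alpha[i] times.
    """
    C = [[1 if b == 1 else 0 for b in row] for row in B]
    D = [[1 if b == -1 else 0 for b in row] for row in B]
    C_rep = [list(row) for row, a in zip(C, alpha) for _ in range(a)]
    D_rep = [list(row) for row, a in zip(D, alpha) for _ in range(a)]
    return C_rep, D_rep


def column_sums(M: List[List[int]]) -> List[int]:
    """Sum of the rows of M: how often each variable x_j appears on one side of (A11)."""
    return [sum(col) for col in zip(*M)]


# Hypothetical toy data: alpha * B = (1, -1, 0) + 2 * (-1, 0, 0) = (-1, -1, 0) < 0.
B = [[1, -1, 0],
     [-1, 0, 0]]
alpha = [1, 2]

C_rep, D_rep = split_and_replicate(B, alpha)
left, right = column_sums(C_rep), column_sums(D_rep)
print(left, right)  # [1, 0, 0] [2, 1, 0]

# Inequality (A10): left <= right coordinatewise, strictly in at least one coordinate.
assert all(l <= r for l, r in zip(left, right)) and left != right
```

Here $s = 3$, and every variable appears on the left-hand side of the replicated system no more often than on the right-hand side, with strictly fewer appearances for $x_{1}$ and $x_{2}$, which mirrors the multiset comparison used in the last step of the proof.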

References

1. Kreps, D.; Wilson, R. Sequential equilibrium. Econometrica 1982, 50, 863–894.
2. Mas-Colell, A.; Whinston, M.D.; Green, J.R. Microeconomic Theory; Oxford University Press: Oxford, UK, 1995.
3. Myerson, R. Game Theory: Analysis of Conflict; Harvard University Press: Cambridge, MA, USA, 1991.
4. Selten, R. Re-examination of the perfectness concept for equilibrium points in extensive games. Int. J. Game Theory 1975, 4, 25–55.
5. Bonanno, G. AGM-consistency and perfect Bayesian equilibrium. Part I: Definition and properties. Int. J. Game Theory 2013, 42, 567–592.
6. Alchourrón, C.; Gärdenfors, P.; Makinson, D. On the logic of theory change: Partial meet contraction and revision functions. J. Symb. Log. 1985, 50, 510–530.
7. Bonanno, G. AGM belief revision in dynamic games. In Proceedings of the 13th Conference on Theoretical Aspects of Rationality and Knowledge (TARK XIII), Groningen, The Netherlands, 12–14 July 2011; Apt, K.R., Ed.; ACM: New York, NY, USA, 2011; pp. 37–45.
8. Bonanno, G. AGM-consistency and perfect Bayesian equilibrium. Part II: From PBE to sequential equilibrium. Int. J. Game Theory 2016.
9. Osborne, M.; Rubinstein, A. A Course in Game Theory; MIT Press: Cambridge, MA, USA, 1994.
10. Hendon, E.; Jacobsen, J.; Sloth, B. The one-shot-deviation principle for sequential rationality. Games Econ. Behav. 1996, 12, 274–282.
11. Perea, A. A note on the one-deviation property in extensive form games. Games Econ. Behav. 2002, 40, 322–338.
12. Kohlberg, E.; Reny, P. Independence on relative probability spaces and consistent assessments in game trees. J. Econ. Theory 1997, 75, 280–313.
13. Perea, A.; Jansen, M.; Peters, H. Characterization of consistent assessments in extensive-form games. Games Econ. Behav. 1997, 21, 238–252.
14. Battigalli, P. Strategic independence and perfect Bayesian equilibria. J. Econ. Theory 1996, 70, 201–234.
15. Dines, L. On positive solutions of a system of linear equations. Ann. Math. 1926–1927, 28, 386–392.
16. Fudenberg, D.; Tirole, J. Perfect Bayesian equilibrium and sequential equilibrium. J. Econ. Theory 1991, 53, 236–260.
17. González-Díaz, J.; Meléndez-Jiménez, M.A. On the notion of perfect Bayesian equilibrium. TOP J. Span. Soc. Stat. Oper. Res. 2014, 22, 128–143.
18. Watson, J. Perfect Bayesian Equilibrium: General Definitions and Illustrations; Working Paper; University of California San Diego: San Diego, CA, USA, 2016.
19. Bonanno, G. Belief change in branching time: AGM-consistency and iterated revision. J. Philos. Log. 2012, 41, 201–236.
20. Boutilier, C. Iterated revision and minimal change of conditional beliefs. J. Philos. Log. 1996, 25, 262–305.
21. Darwiche, A.; Pearl, J. On the logic of iterated belief revision. Artif. Intell. 1997, 89, 1–29.
22. Stalnaker, R. Iterated belief revision. Erkenntnis 2009, 70, 189–209.
23. Schrijver, A. Theory of Linear and Integer Programming; John Wiley & Sons: Hoboken, NJ, USA, 1986.
24. Fishburn, P.C. Finite linear qualitative probability. J. Math. Psychol. 1996, 40, 64–77.

¹The acronym ‘AGM’ stands for ‘Alchourrón-Gärdenfors-Makinson’ who pioneered the literature on belief revision: see [6]. As shown in [7], AGM-consistency can be derived from the primitive concept of a player’s epistemic state, which encodes the player’s initial beliefs and her disposition to revise those beliefs upon receiving (possibly unexpected) information. The existence of a plausibility order that rationalizes the epistemic state of each player guarantees that the belief revision policy of each player satisfies the so-called AGM axioms for rational belief revision, which were introduced in [6].
² $\forall h, h^{'} \in H$ , either $h ≾ h^{'}$ or $h^{'} ≾ h$ .
³ $\forall h, h^{'}, h^{''} \in H$ , if $h ≾ h^{'}$ and $h^{'} ≾ h^{''}$ then $h ≾ h^{''}$ .
⁴As in [5] we use the notation $h ≾ h^{'}$ rather than the, perhaps more natural, notation $h ≿ h^{'}$ , for two reasons: (1) it is the standard notation in the extensive literature that deals with AGM belief revision (for a recent survey of this literature see the special issue of the Journal of Philosophical Logic, Vol. 40 (2), April 2011); and (2) when representing the order ≾ numerically it is convenient to assign lower values to more plausible histories. An alternative reading of $h ≾ h^{'}$ is “history h (weakly) precedes $h^{'}$ in terms of plausibility”.
⁵A behavior strategy profile is a list of probability distributions, one for every information set, over the actions available at that information set. A system of beliefs is a collection of probability distributions, one for every information set, over the histories in that information set.
⁶The precise definition is as follows. Let Z denote the set of terminal histories and, for every player i, let $U_{i} : Z \to \mathbb{R}$ be player i’s von Neumann-Morgenstern utility function. Given a decision history h, let $Z (h)$ be the set of terminal histories that have h as a prefix. Let $P_{h, σ}$ be the probability distribution over $Z (h)$ induced by the strategy profile σ, starting from history h (that is, if z is a terminal history and $z = h a_{1} \dots a_{m}$ then $P_{h, σ} (z) = \prod_{j = 1}^{m} σ (a_{j})$ ). Let I be an information set of player i and let $u_{i} (I | σ, μ) = \sum_{h \in I} μ (h) \sum_{z \in Z (h)} P_{h, σ} (z) U_{i} (z)$ be player i’s expected utility at I if σ is played, given her beliefs at I (as specified by μ). We say that player i’s strategy $σ_{i}$ is sequentially rational at I if $u_{i} (I | (σ_{i}, σ_{- i}), μ) \geq u_{i} (I | (τ_{i}, σ_{- i}), μ)$ for every strategy $τ_{i}$ of player i (where $σ_{- i}$ denotes the strategy profile of the players other than i). An assessment $(σ, μ)$ is sequentially rational if, for every player i and for every information set I of player i, $σ_{i}$ is sequentially rational at $I$. Note that there are two definitions of sequential rationality: the weakly local one (which is the one adopted here), according to which at an information set a player can contemplate changing her choice not only there but possibly also at subsequent information sets of hers, and a strictly local one, according to which at an information set a player contemplates changing her choice only there. If the definition of perfect Bayesian equilibrium (Definition 4 below) is modified by using the strictly local definition of sequential rationality, then an extra condition needs to be added, namely the “pre-consistency” condition identified in [10,11] as being necessary and sufficient for the equivalence of the two notions. For simplicity we have chosen the weakly local definition.
⁷Rounded rectangles represent information sets and the payoffs are listed in the following order: Player 1’s payoff at the top, Player 2’s payoff in the middle and Player 3’s payoff at the bottom.
⁸We use the following convention to represent a total pre-order: if the row to which history h belongs is above the row to which $h^{'}$ belongs, then $h ≺ h^{'}$ (h is more plausible than $h^{'}$ ) and if h and $h^{'}$ belong to the same row then $h \sim h^{'}$ (h is as plausible as $h^{'}$ ). ∅ denotes the empty history, which corresponds to the root of the tree. In (1) the plausibility-preserving actions are d, e and g; the most plausible histories in the information set ${a, b, c}$ are b and c and the two histories in the information set ${a f, b f}$ are equally plausible.
⁹Given σ, for Player 1 d yields a payoff of 2 while a and c yield 1 and b yields 2; thus d is sequentially rational. Given σ and μ, at her information set ${a, b, c}$ with e Player 2 obtains an expected payoff of 4 while with f her expected payoff is 3; thus e is sequentially rational. Given μ, at his information set ${a f, b f}$ , Player 3’s expected payoff from playing with g is 1.5 while his expected payoff from playing with k is 1; thus g is sequentially rational.
¹⁰Note that if $h, h^{'} \in E$ and $h^{'} = h a_{1} \dots a_{m}$ , then $σ (a_{j}) > 0$ , for all $j = 1, \dots, m$ . In fact, since $h^{'} \sim h$ , every action $a_{j}$ is plausibility preserving and therefore, by Property $P 1$ of Definition 2, $σ (a_{j}) > 0$ .
¹¹For an interpretation of the probabilities $ν_{E} (h)$ see [8].
¹²That is, for every $h \in D \setminus \{\emptyset\}$, $μ^{m} (h) = \frac{\prod_{a \in A_{h}} σ^{m} (a)}{\sum_{h^{'} \in I (h)} \prod_{a \in A_{h^{'}}} σ^{m} (a)}$ (where $A_{h}$ is the set of actions that occur in history h). Since $σ^{m}$ is completely mixed, $σ^{m} (a) > 0$ for every $a \in A$ and thus $μ^{m} (h) > 0$ for all $h \in D \setminus \{\emptyset\}$.
¹³Since H is finite, there is an $m \in N$ such that ${H_{0}, \dots, H_{m}}$ is a partition of H and, for every $j, k \in N$ , with $j < k \leq m$ , and for every $h, h^{'} \in H$ , if $h \in H_{j}$ and $h^{'} \in H_{k}$ then $h ≺ h^{'}$ .
¹⁴For example, [12] adopts this interpretation.
¹⁵For such an interpretation see [6].
¹⁶Note, however, that $I N D_{1}$ is compatible with the following: $a ≺ b$ (with $b \in I (a)$ ) and $b c ≺ a d$ (with $b c \in I (a d), c, d \in A (a), c \neq d$ ).
¹⁷We have that (1) $b ≺ a$ , $b d ≺ a d, b e ≺ a e$ and $b f ≺ a f$ , (2) $a e ≺ a f$ , $a e g ≺ a f g$ and $a e k ≺ a f k$ , (3) $b f ≺ b e$ , $b f ℓ ≺ b e ℓ$ and $b f m ≺ b e m$ .
¹⁸That $I N D_{1}$ is satisfied was shown in Footnote 17. $I N D_{2}$ is violated because $b \in I (a)$ and $b f ≺ b e$ but $a e ≺ a f$ .
¹⁹In fact, (1) $M ≺ L$ and $M x ≺ L x$ for every $x \in {ℓ, m, r}$ ; (2) $M ≺ R$ and $M x ≺ R x$ for every $x \in {ℓ, m, r}$ ; (3) $R ≺ L$ and $R x ≺ L x$ for every $x \in {ℓ, m, r}$ ; (4) $M r ≺ L ℓ$ and $M r x ≺ L ℓ x$ for every $x \in {a, b}$ ; (5) $L m ≺ R r$ and $L m x ≺ R r x$ for every $x \in {c, d}$ ; and (6) $R ℓ ≺ M m$ and $R ℓ x ≺ M m x$ for every $x \in {e, f}$ .
²⁰This is easily verified: the important observation is that $M m ≺ M r$ and $L m ≺ L r$ and $R m ≺ R r$ . The other comparisons involve a plausibility-preserving action versus a non-plausibility-preserving action and thus $I N D_{2}$ is trivially satisfied.
²¹As uniform full support common prior one can take, for example, the uniform distribution over the set of decision histories. Note that, for every equivalence class E of the order, $E \cap D_{μ}^{+}$ is either empty or a singleton.
²²To prove that $(σ, μ)$ is not a sequential equilibrium it is not sufficient to show that plausibility order (5) is not choice measurable, because there could be another plausibility order which is choice measurable and rationalizes $(σ, μ)$ .
²³As in Definition 5, let $S_{0} = \{s \in S : s ≾ t, \forall t \in S\}$, and, for every integer $k \geq 1$, $S_{k} = \{s \in S \setminus (S_{0} \cup \dots \cup S_{k - 1}) : s ≾ t, \forall t \in S \setminus (S_{0} \cup \dots \cup S_{k - 1})\}$. The canonical ordinal integer-valued representation of ≾, $ρ : S \to \mathbb{N}$, is given by $ρ (s) = k$ if and only if $s \in S_{k}$.
²⁴Thus $a ≺ x$ for every $x \in S \ {a}$ , $[b] = {b, c}$ , $b ≺ d$ , etc.
²⁵For example, ≐ is the smallest reflexive, symmetric and transitive relation that contains the pairs given in (14).
²⁶The system of linear equations of Definition 10 is somewhat related to the system of multiplicative equations considered in [13] (Theorem 5.1). A direct comparison is beyond the scope of this paper and is not straightforward, because the structures considered in Definition 10 are more general than those considered in [13].
²⁷By symmetry of ≐, we can express the third and fourth constraints as $(4, 6) ≐ (0, 2)$ and $(3, 4) ≐ (1, 3)$ instead of $(0, 2) ≐ (4, 6)$ and $(1, 3) ≐ (3, 4)$ , respectively.
²⁸The main element of the notion of PBE put forward in [16] is the “no signaling what you don’t know” condition on beliefs. For example, if Player 2 observes Player 1’s action and Player 1 has observed nothing about a particular move of Nature, then Player 2 should not update her beliefs about Nature’s choice based on Player 1’s action.
²⁹Intuitively, on consecutive information sets, a player does not change her beliefs about the actions of other players, if she has not received information about those actions.
³⁰By “Bayesian updating as long as possible” we mean the following: (1) when information causes no surprises, because the play of the game is consistent with the most plausible play(s) (that is, when information sets are reached that have positive prior probability), then beliefs should be updated using Bayes’ rule; and (2) when information is surprising (that is, when an information set is reached that had zero prior probability) then new beliefs can be formed in an arbitrary way, but from then on Bayes’ rule should be used to update those new beliefs, whenever further information is received that is consistent with those beliefs.
³¹It is straightforward to check that if $F^{'} : H \to N$ is an integer-valued representation of ≾ then so is $F : H \to N$ defined by $F (h) = F^{'} (h) - F^{'} (\emptyset)$ ; furthermore if $F^{'}$ satisfies property $C M$ ( $C M'$ ) then so does F.
³²For example, if $S = {a, b, c, d, e, f}$ and ≾ is given by $a \sim b ≺ c ≺ d \sim e ≺ f$ then $ρ (a) = ρ (b) = 0, ρ (c) = 1, ρ (d) = ρ (e) = 2$ and $ρ (f) = 3$ ; if F is given by $F (a) = F (b) = 0, F (c) = 3, F (d) = F (e) = 5$ and $F (f) = 9$ then ${\hat{x}}_{0} = 0, {\hat{x}}_{1} = 3, {\hat{x}}_{2} = 2$ and ${\hat{x}}_{3} = 4$ .
³³For example, the system of equations (15) can be written as $A x = 0$, where $x = (x_{1}, \dots, x_{5})$ and

$A = \begin{pmatrix} 1 & 1 & 0 & - 1 & 0 \\ - 1 & - 1 & 0 & 1 & 0 \\ 0 & 0 & 1 & 0 & - 1 \\ 0 & 0 & - 1 & 0 & 1 \\ 1 & 1 & 1 & - 1 & - 1 \\ - 1 & - 1 & - 1 & 1 & 1 \end{pmatrix}$

(A8)
³⁴See, for example, [23] (p. 216) or [24] (Theorem 1.1, p. 65).
³⁵Proof. Recall that for each row $a_{i}$ of A there is a row $a_{k}$ such that $a_{i} = - a_{k}$. If $y_{i} \neq 0$ and $y_{k} \neq 0$ for some i and k such that $a_{i} = - a_{k}$ then

$y_{i} a_{i} + y_{k} a_{k} = \begin{cases} 0 & \text{if } y_{i} = y_{k} \\ (y_{k} - y_{i}) a_{k} & \text{if } 0 < y_{i} < y_{k} \\ (y_{i} - y_{k}) a_{i} & \text{if } 0 < y_{k} < y_{i} \\ (|y_{i}| + y_{k}) a_{k} & \text{if } y_{i} < 0 < y_{k} \\ (y_{i} + |y_{k}|) a_{i} & \text{if } y_{k} < 0 < y_{i} \\ (|y_{k}| - |y_{i}|) a_{i} & \text{if } y_{k} < y_{i} < 0 \\ (|y_{i}| - |y_{k}|) a_{k} & \text{if } y_{i} < y_{k} < 0 \end{cases}$

where all the multipliers (of $a_{i}$ or $a_{k}$ ) are positive. Thus one can set one of the two values $y_{i}$ and $y_{k}$ to zero and replace the other with the relevant multiplier above while keeping $y A$ unchanged. For example, if $y_{k} < y_{i} < 0$ then one can replace $y_{k}$ with 0 and $y_{i}$ with $(|y_{k}| - |y_{i}|)$, thereby reducing the cardinality of K by one. This process can be repeated until the multipliers of half of the rows of A have been replaced by zero.
³⁶Proof. Suppose that $y_{k} < 0$ for some $k \in K$ . Recall that there exists an i such that $a_{k} = - a_{i}$ . By the argument of the previous footnote, $y_{i} = 0$ . Then replace $y_{k}$ by 0 and replace $y_{i} = 0$ by ${\tilde{y}}_{i} = - y_{k}$ .
³⁷For example, if $K = {3, 6, 7}$ and $y_{3} = 2$ , $y_{6} = 1$ , $y_{7} = 3$ , then B is the $3 \times n$ matrix where $b_{1} = a_{3}, b_{2} = a_{6}$ and $b_{3} = a_{7}$ and $α_{1} = 2$ , $α_{2} = 1$ and $α_{3} = 3$ .

© 2016 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).
