1. Introduction
In most applications of noncooperative game theory, Nash equilibrium is used as a tool to predict behavior. Under what conditions, if any, is this approach justified? In his Ph.D. thesis, Nash [
1] suggested two interpretations of Nash equilibrium, one rationalistic, in which all players are fully rational, know the game, and play it exactly once. In the other, “mass action” interpretation, there is a large population of actors for each player role of the game, and now and then exactly one actor from each player population is drawn at random to play the game in his or her player role, and this is repeated (i.i.d.) indefinitely over time. Whereas the latter interpretation is studied in the literature on evolutionary game theory and social learning, the former—which is the interpretation we will be concerned with here—is studied in a sizeable literature on epistemic foundations of Nash equilibrium. It is by now well-known from this literature that players’ rationality and beliefs or knowledge about the game and each others’ rationality in general do not imply that they necessarily play a Nash equilibrium or even that their conjectures about each others’ actions form a Nash equilibrium; see Bernheim [
2], Pearce [
3], Aumann and Brandenburger [
4].
The problem is not only a matter of coordination of beliefs (conjectures or expectations), as in a game with multiple equilibria. It also concerns the fact that, in Nash equilibrium interpreted as an equilibrium in belief (see [
4], Theorems A and B), beliefs are supposed to correspond to
specific randomizations over the others’ strategies. In particular, a player might have opponents with multiple pure strategies that maximize their expected payoffs, given their equilibrium beliefs. Hence, for these opponents, any randomization over their pure best replies maximizes their expected payoffs. Yet in Nash equilibrium, the player is assumed to have a belief that singles out a randomization over the best replies of her opponents that serves to keep this player indifferent across the support of her equilibrium strategies, and ensures that none of the player’s other strategies are better replies. In addition, a player’s belief concerning the behavior of others assigns positive probability
only to best replies; players are not allowed to entertain any doubt about the rationality of their opponents.
Our aim is to formalize a notion of epistemic robustness that relaxes these requirements. In order to achieve this, we have to move away from point-valued to set-valued solution concepts. In line with the terminology of epistemic game theory, let a player’s epistemic
type correspond to a belief over the profiles of opponent strategies and types. Assume that the epistemic model is complete in the sense that all possible types are represented in the model. Let non-empty Cartesian products of (pure-strategy or type) subsets be referred to as (strategy or type)
blocks [
5]. Say that a strategy block
is epistemically robust if there exists a corresponding type block
such that: for each player
i,
- (I)
the strategy subset coincides with the set of best replies of the types in ;
- (II)
the set contains all player types that believe with sufficient probability that the others are of types in and play best replies.
Here, for each player, (II) requires the player’s type subset to be robust in the sense of including all possible probability distributions over opponent pure-strategy profiles that consist of best replies to the beliefs of opponent types that are included in the opponents’ type subsets, even including player types with a smidgen of doubt that only these strategies are played. In particular, our epistemic model does not allow a player to pinpoint a specific opponent type or a specific best reply for an opponent type that has multiple best replies. The purpose of (I) is, for each player, to map this robust type subset into a robust subset of pure strategies by means of the best reply correspondence.
Consider, in contrast, the case where point (II) above is replaced by:
- (II′)
the set contains only player types that believe with probability 1 that the others are of types in and play best replies.
Tan and Werlang [
6] show that the strategy block
X is a
best reply set [
3] if there exists a corresponding type block
Y such that (I) and (II
) hold for all players. This epistemic characterization of a best reply set
X explains why, for each player
i, all strategies in
are included. In contrast, the concept of epistemic robustness explains why all strategies outside
are excluded, as a rational player will never choose such a strategy, not even if the player with small probability believes that opponents will not stick to their types
or will not choose best replies.
Any strict Nash equilibrium, viewed as a singleton strategy block, is epistemically robust. In this case, each player has opponents with unique pure strategies that maximize their expected payoffs, given their equilibrium beliefs. The player’s equilibrium strategy remains her unique best reply, as long as she is sufficiently sure that the others stick to their unique best replies. By contrast, non-strict pure-strategy Nash equilibria by definition have ‘unused’ best replies and are consequently not epistemically robust: a player, even if she is sure that her opponents strive to maximize their expected payoffs given their equilibrium beliefs, might well believe that her opponents play such alternative best replies.
In informal terms, our Proposition 1 establishes that epistemic robustness is sufficient and necessary for the non-existence of such ‘unused’ best replies. Consequently, epistemic robustness captures, through restrictions on the players’ beliefs, a property satisfied by strict Nash equilibria, but not by non-strict pure-strategy Nash equilibria. The restrictions on players’ beliefs implied by epistemic robustness can be imposed also on games without strict Nash equilibria. Indeed, our Propositions 2–5 show how epistemic robustness is achieved by variants of
curb sets. A
curb set (mnemonic for ‘closed under rational behavior’) is a strategy block that contains, for each player, all best replies to all probability distributions over the opponent strategies in the block
1. Hence, if a player believes that her opponents stick to strategies from their components of a
curb set, then she’d better stick to her strategies as well.
A strategy block is fixed under rational behavior (
furb; or ‘tight’
curb in the terminology of Basu and Weibull [
7]) if each player’s component not only contains, but is identical with the set of best replies to all probability distributions over the opponent strategies in the block. Basu and Weibull [
7] show that minimal
curb (
mincurb) sets and the unique largest
furb set are important special cases of
furb sets. The latter equals the strategy block of rationalizable strategies [
2,
3]. At the other extreme,
mincurb is a natural set-valued generalization of strict Nash equilibrium. The main purpose of this paper is to provide epistemic foundations for set-valued generalizations of strict Nash equilibrium. Our results are not intended to advocate any particular point- or set-valued solution concept, only to propose a definition of epistemic robustness and apply this to some set-valued solution concepts currently in use
2.
In order to illustrate our line of reasoning, consider first the two-player game
In its unique Nash equilibrium, player 1’s equilibrium strategy assigns probability
to her first pure strategy and player 2’s equilibrium strategy assigns probability
to his first pure strategy. However, even if player 1’s belief about the behavior of player 2 coincides with his equilibrium strategy,
, player 1 would be indifferent between her two pure strategies. Hence, any pure or mixed strategy would be optimal for her, under the equilibrium belief about player 2. For all other beliefs about her opponent’s behavior, only one of her pure strategies would be optimal, and likewise for player 2. The unique
curb set and unique epistemically robust subset in this game is the full set
of pure-strategy profiles.
Add a third pure strategy for each player to obtain the two-player game
Strategy profile
is a Nash equilibrium (indeed a perfect and proper equilibrium). However, if player 2’s belief concerning the behavior of 1 coincides with
, then 2 is indifferent between his pure strategies
l and
c, and if 1 assigns equal probability to these two pure strategies of player 2, then 1 will play the unique best reply
d, a pure strategy outside the support of the equilibrium
3. Moreover, if player 2 expects 1 to reason this way, then 2 will play
r: the smallest epistemically robust subset containing the support of the mixed equilibrium
is the entire pure strategy space. By contrast, the pure-strategy profile
is a strict equilibrium. In this equilibrium, no player has any alternative best reply and each equilibrium strategy remains optimal also under some uncertainty as to the other player’s action: the set
is epistemically robust. In this game, all pure strategies are rationalizable,
is a
furb set, and the game’s unique
mincurb set (thus, the unique minimal
furb set) is
. These are also the epistemically robust subsets; in particular,
is not epistemically robust.
Our results can be described as follows. First, the intuitive link between strict Nash equilibria and our concept of epistemic robustness in terms of ruling out the existence of ‘unused’ best replies is formalized in Proposition 1: a strategy block
X is
not epistemically robust if and only if for each type block
Y raised in its defense—so that
X is the set of best reply profiles associated with
Y—there is a player
i and a type
with a best reply outside
, even if
believes with high probability that his opponents are of types in
and play best replies. Second, in part (a) of Proposition 2, we establish that epistemically robust strategy blocks are
curb sets. As a consequence (see [
15]), every epistemically robust strategy block contains at least one strategically stable set in the sense of Kohlberg and Mertens [
16]. In part (b) of Proposition 2, although not every
curb set is epistemically robust (since a
curb set may contain non-best replies), we establish that every
curb set contains an epistemically robust strategy block and we also characterize the largest such subset. As a by-product, we obtain the existence of epistemically robust strategy blocks in all finite games. Third, in Proposition 3, we show that a strategy block is
furb if and only if it satisfies the definition of epistemic robustness with equality, rather than inclusion, in (II).
furb sets thus have a clean epistemic robustness characterization in the present framework. Fourth, in Proposition 4, instead of starting with strategy blocks, we start from a type block and show how an epistemically robust strategy block can be algorithmically obtained; we also show that this is the smallest
curb set that contains all best replies for the initial type block. Fifth, Proposition 5 shows how
mincurb sets (which are necessarily
furb and hence epistemically robust) can be characterized by initiating the above algorithm with a single type profile, while no proper subset has this property. We argue that this latter result shows how
mincurb sets capture characteristics of strict Nash equilibrium.
As our notion of epistemic robustness checks for player types with ‘unused’ best replies on the basis of their beliefs about the opponents’ types and rationality, we follow, for instance, Asheim [
17] and Brandenburger, Friedenberg, and Keisler [
18], and model players as having beliefs about the opponents without modeling the players’ actual behavior. Moreover, we consider epistemic models that are complete in the sense of including all possible beliefs. In these respects, our modeling differs from that of Aumann and Brandenburger [
4]’s characterization of Nash equilibrium. In other respects, our modeling resembles that of Aumann and Brandenburger [
4]. They assume that players’ beliefs about opponent play is commonly known. Here, we require the existence of a type block
Y and consider, for each player
i, types of player
i who believe that opponent types are in
. In addition, as do Aumann and Brandenburger [
4], we consider types of players that believe that their opponents are rational.
The notion of persistent retracts [
19] goes part of the way towards epistemic robustness. These are product sets requiring the presence of
at least one best reply to arbitrary beliefs
close to the set. In other words, they are robust against small belief perturbations, but admit alternative best replies outside the set, in contrast to our concept of epistemic robustness. Moreover, as pointed out by (van Damme [
20] Section 4.5) and Myerson and Weibull [
5], persistence is sensitive to certain game details that might be deemed strategically inessential.
The present approach is related to Tercieux [
21]’s analysis in its motivation in terms of epistemic robustness of solution concepts and in its use of
p-belief. His epistemic approach, however, is completely different from ours. Starting from a two-player game, he introduces a Bayesian game where payoff functions are perturbations of the original ones and he investigates which equilibria are robust against this kind of perturbation. Zambrano [
22] studies the stability of non-equilibrium concepts in terms of mutual belief and is hence more closely related to our analysis. In fact, our Proposition 3 overlaps with but is distinct from his main results. Also Hu [
23] restricts attention to rationalizability, but allows for
p-beliefs, where
. In the games considered in Hu [
23], pure strategy sets are permitted to be infinite. By contrast, our analysis is restricted to finite games, but under the weaker condition of mutual, rather than Hu [
23]’s common,
p-belief of opponent rationality and of opponents’ types belonging to given type sets.
The remainder of the paper is organized as follows.
Section 2 contains the game theoretic and epistemic definitions used.
Section 3 characterizes variants of
curb sets in terms of epistemic robustness. An
appendix contains proofs of the propositions.
3. Epistemic Robustness
We define a strategy block
to be
epistemically robust if there exists a
such that, for each probability
, there is a type block
(possibly dependent on
p) such that
and
Hence, epistemic robustness requires the existence of a type block
Y satisfying, for each player
i, that
is the set of best replies of the types in
, and that every type of player
i who
p-believes that opponents are rational and of types in
is included in
. Condition (
2) is thus not an equilibrium condition as it is not interactive: it relates each player’s type subset to the same player’s strategy subset. The interactivity enters through condition (
3), which relates each player’s type subset to the type subsets of the other players. For each
, condition (
3) allows each player
i to attach a positive probability to the event that others do not play best replies and/or are of types outside
Y. It follows from the monotonicity of
with respect to
p that, for a fixed type block
Y, if inclusion (
3) is satisfied for
, then inclusion (
3) is satisfied also for all
.
Note that if condition (
2) is combined with a variant of condition (
3), with the weak inclusion reversed and
p set to 1, then we obtain a characterization of Pearce [
3]’s best reply set; see [
6].
In line with what we mentioned in the introduction, we can now formally show that if
is a strict Nash equilibrium, then
is epistemically robust. To see this, define for all
,
. Since the game is finite, there is, for each player
, a
such that
for all
with
. Let
. Then it holds for each
:
Thus, by condition (
2) and condition (
3),
is epistemically robust.
Also, as discussed in the introduction, non-strict pure-strategy Nash equilibria have ‘unused’ best replies. Our first result demonstrates that epistemic robustness is sufficient and necessary for the non-existence of such ‘unused’ best replies.
Proposition 1. The following two statements are equivalent:- (a)
is not epistemically robust.
- (b)
For all , there exists such that if satisfies , then there exist and such that and .
Hence, while an epistemically robust subset is defined by a set of profiles of player types, it suffices with one player and one possible type of this player to determine that a strategy block is not epistemically robust.
We now relate epistemically robust subsets to
curb sets. To handle the fact that all strategy profiles in any epistemically robust subset are profiles of best replies, while
curb sets may involve strategies that are not best replies, introduce the following notation: For each
and
, let
denote the pre-image (upper inverse) of
under player
i’s best reply correspondence
8. For a given subset
of
i’s pure strategies,
consists of the beliefs over others’ strategy profiles having the property that all best replies to these beliefs are contained in
.
Proposition 2. Let .
- (a)
If X is epistemically robust, then X is a curb set.
- (b)
If X is a curb set, then is epistemically robust. Furthermore, it is the largest epistemically robust subset of X.
Claim (a) implies that every epistemically robust subset contains at least one strategically stable set, both as defined in Kohlberg and Mertens [
16] and as defined in Mertens [
32], see Ritzberger and Weibull [
15] and Demichelis and Ritzberger [
33], respectively
9. Claim (a) also implies that subsets of epistemically robust sets need not be epistemically robust. Concerning claim (b), note that
equals the set of profiles of pure strategies that are best replies to some belief. Hence, since for each
, both
and
are monotonic with respect to set inclusion, it follows from Proposition 2(b) that any epistemically robust subset involves only strategies surviving one round of strict elimination. Thus,
is the largest epistemically robust subset, while the characterization of the smallest one(s) will be dealt with by Proposition 5.
Our proof shows that Proposition 2(a) can be slightly strengthened, as one only needs the robustness conditions with
; as long as there is a
such that
and condition (
3) holds with
,
X is
curb.
10 Moreover, although epistemic robustness allows that
depends on
p, the proof of (b) defines
Y independently of
p.
The following result shows that furb sets are characterized by epistemic robustness when player types that do not believe with sufficient probability that the others play best replies are removed:
Proposition 3. The following two statements are equivalent:
- (a)
is a furb set.
- (b)
There exists a such that, for each probability , there is a type block satisfying condition (2) such that condition (3) holds with equality.
The block of rationalizable strategies [
2,
3] is the game’s largest
furb set [
7]. Thus, it follows from Proposition 3 that epistemic robustness yields a characterization of the block of rationalizable strategies, without involving any explicit assumption of common belief of rationality. Instead, only mutual
p-belief of rationality and type sets are assumed. Proposition 3 also applies to
mincurb sets, as these sets are
furb. In particular, it follows from Propositions 2(a) and 3 that a strategy block is
mincurb if and only if it is a minimal epistemically robust subset
11.
As much of the literature on curb sets (recall footnote 1) focuses on minimal ones, we now turn to how smallest curb sets can be characterized in terms of epistemic robustness. This characterization is presented through Propositions 4 and 5.
Proposition 4 starts from an arbitrary block
Y of types and generates an epistemically robust subset by including all beliefs over the opponents’ best replies, and all beliefs over opponents’ types that have such beliefs over their opponents, and so on. Formally, define for any
the sequence
by
and, for each
and
,
Define the correspondence
, for any
, by
We show that the strategy block
of best replies is epistemically robust and is the smallest
curb set that includes
.
12Proposition 4. Let . Then is the smallest curb set satisfying . Furthermore, is epistemically robust.
Remark 1. If the strategy block contains strategies that are not rationalizable, then will not be furb. Therefore, the epistemic robustness of does not follow from Proposition 3; its robustness is established by invoking Proposition 2(b).
Note that if a strategy block
X is epistemically robust, then there exists a type block
Y satisfying condition (
2) such that condition (
3) is satisfied for
. Thus,
, showing that all epistemically robust strategy blocks can be obtained using the algorithm of Proposition 4.
The final Proposition 5 shows how mincurb sets can be characterized by epistemically robust subsets obtained by initiating the algorithm of Proposition 4 with a single type profile: a strategy block X is a mincurb set if and only if (a) the algorithm leads to X from a single type profile, and (b) no single type profile leads to a strict subset of X.
Proposition 5. is a mincurb set if and only if there exists a such that and there exists no such that .
Strict Nash equilibria (interpreted as equilibria in beliefs) satisfy ‘coordination’, in the sense that there is mutual belief about the players’ sets of best replies, ‘concentration’, in the sense that each player has only one best reply, and epistemic robustness (as defined here), implying that each player’s set of beliefs about opponent choices contains all probability distributions over opponent strategies that are best replies given their beliefs. In Proposition 5, starting with a single type profile t that corresponds to ‘coordination’, using the algorithm of Proposition 4 and ending up with ensures epistemic robustness, while the non-existence of such that is a proper subset of X corresponds to ‘concentration’. Hence, these three characteristics of strict Nash equilibria characterize mincurb sets in Proposition 5.
In order to illustrate Propositions 4 and 5, consider the Nash equilibrium
in game (
1) in the introduction. This equilibrium corresponds to a type profile
where
assigns probability
to
and probability
to
, and where
assigns probability
to
and probability
to
. We have that
, while the full strategy space
S is the smallest
curb set that includes
. Proposition 4 shows that
is not epistemically robust, since it does not coincide with the smallest
curb set that includes it. Recalling the discussion from the introduction: if player 2’s belief concerning the behavior of 1 coincides with
, then 2 is indifferent between his pure strategies
l and
c, and if 1 assigns equal probability to these two pure strategies of player 2, then 1 will play the unique best reply
d, a pure strategy outside the support of the equilibrium. Moreover, if player 2 expects 1 to reason this way, then 2 will play
r. Hence, to assure epistemic robustness, starting from type set
, the repeated inclusion of all beliefs over opponents’ best replies eventually leads to the smallest
curb set, here
S, that includes the Nash equilibrium that was our initial point of departure. By contrast, for the type profile
where
assigns probability 1 to
and
assigns probability 1 to
we have that
coincides with the smallest
curb set that includes it. Thus, the strict equilibrium
to which
corresponds is epistemically robust, when viewed as a singleton set. Furthermore, by Proposition 5,
is the unique
mincurb set.