Evolutionary Stable Strategies in Multistage Games

Petrosyan, Leon A.; Liu, Xiuxiu

doi:10.3390/math11112492

Open AccessArticle

Evolutionary Stable Strategies in Multistage Games

by

Leon A. Petrosyan

^*

and

Xiuxiu Liu

^*

Faculty of Applied Mathematics and Control Processes, Saint-Petersburg State University, Universitetskii Prospekt 35, 198504 St. Petersburg, Russia

^*

Authors to whom correspondence should be addressed.

Mathematics 2023, 11(11), 2492; https://doi.org/10.3390/math11112492

Submission received: 27 March 2023 / Revised: 14 May 2023 / Accepted: 26 May 2023 / Published: 29 May 2023

(This article belongs to the Special Issue Evolutionary Games, Propagation Processes and Control in Complex Systems)

Download

Browse Figures

Versions Notes

Abstract

:

Direct ESS has some disadvantages, which are seen even in the case of repeated games when the sequence of stage ESSs may not constitute the direct ESS in the repeated game. We present here the refinement of the ESS definition, which eliminates these disadvantages and represents the base for the definition of ESS in games in extensive form. The effectiveness of this approach for multistage n-person games is shown for metagame (this notion is used for the first time), in which under some relevant conditions, the existence of ESS is proved, and ESSs are constructed using threat strategies.

Keywords:

multistage game; ESS; evolutionary stability; strict Nash equilibrium

MSC:

91A11

1. Introduction

Evolutionary games were first formulated in [1]. We shall follow [2] in the definition of evolutionary stable strategies (ESSs) [3,4,5,6] for symmetric bimatrix games; here, the definition of ESS (so-called “direct ESS”) applicable for extensive-form game with perfect recall (see [7]) is also purposed. This definition is based upon the concept of symmetry in extensive-form games introduced in [8]. As we saw earlier (see [9]), the classical definition of ESS proposed for normal two-person games cannot be applied to repeated and multistage games. In this paper, we propose a refinement of this definition, which can be considered an attempt to solve the problem. First, we present an example of a two-stage Hawk and Dove game, for which we try to explain the problem and show the effectiveness of the new refined definition. After, we propose the new ESS definition for general n-person games and specially for repeated and multistage games (metagames). In the last section, we present an algorithm for constructing ESS in general n-person multistage games (metagames) and prove the corresponding theorem. This result is illustrated by an example.

2. Definition of ESS for Two-Person Games

Following [2], the symmetric extensive-form 2-person game is a pair

(Γ, T)

where

Γ

is an extensive-form game and T is a symmetry of

Γ

. If

b_{1}, b_{2}

are the behavior strategies of player 1 in

(Γ, T)

and

b_{1}^{T}, b_{2}^{T}

(behavior strategies of player 2) are the symmetric images of

b_{1}, b_{2}

, respectively, then the probability that the endpoint z is reached when

(b_{1}, b_{2}^{T})

is played is equal to the probability that

z^{T}

is reached when

(b_{2}, b_{1}^{T})

is played. Therefore, the expected payoff of player 1 when

(b_{1}, b_{2}^{T})

is played is equal to player 2’s expected payoff when

(b_{2}, b_{1}^{T})

is played [10]:

E_{1} (b_{1}, b_{2}^{T}) = E_{2} (b_{2}, b_{1}^{T}),

(1)

Equation (1), restricted to pure strategies, defines the symmetric normal form of

(Γ, T)

.

Definition 1.

Direct ESS in

(Γ, T)

is a behavior strategy

\bar{b}

of player 1 that satisfies

E_{1} (\bar{b}, {\bar{b}}^{T}) = max_{b} E_{1} (b, {\bar{b}}^{T})

(2)

and if b \neq \bar{b} and E_{1} (b, {\bar{b}}^{T}) = E_{1} (\bar{b}, {\bar{b}}^{T}),

then E_{1} (b, b^{T}) < E_{1} (\bar{b}, b^{T}) .

(3)

We try to purpose some refinement of this definition. Let

μ (b^{'}, b^{″})

be the probability generated over the set of endpoints in the game if players choose behavior strategies

b^{'}

,

b^{″}

, respectively.

Definition 2.

The behavior strategy

\bar{b}

is called ESS in

(Γ, T)

if

\bar{b}

satisfies

E_{1} (\bar{b}, {\bar{b}}^{T}) = max_{b} E_{1} (b, {\bar{b}}^{T})

(4)

and if for b^{'} such that μ (b^{'}, {\bar{b}}^{T}) \neq μ (\bar{b}, {\bar{b}}^{T})

the payoff E_{1} (b^{'}, {\bar{b}}^{T}) = E_{1} (\bar{b}, {\bar{b}}^{T}),

then E_{1} (b^{'}, {b^{'}}^{T}) < E_{1} (\bar{b}, {b^{'}}^{T}) .

(5)

Note that in Definition 2, the important condition

μ (b^{'}, {\bar{b}}^{T}) \neq μ (\bar{b}, {\bar{b}}^{T})

is weak, and if we revert to biological interpretations of ESS, we have to take into account that the biological populations may not react to the changes of strategies in extensive-form games (remember that the strategy in an extensive game has a very complicated structure), and it is clear that “animals” cannot realize the deviation from it and may react to changes in probability measure on the final positions of the game (on the set of outcomes). Thus, deviations which do not affect measure

μ

on the endpoints cannot be taken into account when considering ESS.

Example 1.

We repeated the Hawk and Dove game [11]. This game is a two-person bimatrix game Γ with payoff matrices:

A = \begin{matrix} \begin{matrix} H & D \end{matrix} \\ \begin{matrix} H \\ D \end{matrix} & [\begin{matrix} \frac{1}{2} (V - C) & V \\ 0 & \frac{1}{2} V \end{matrix}] \end{matrix} A^{T} = \begin{matrix} \begin{matrix} H & D \end{matrix} \\ \begin{matrix} H \\ D \end{matrix} & [\begin{matrix} \frac{1}{2} (V - C) & 0 \\ V & \frac{1}{2} V \end{matrix}] \end{matrix}

If

V > C

,

(H, H)

is ESS in Γ. Consider now a two-stage version of this game, which can be represented on Figure 1.

The strategy of player I (II) in this game is a rule, which defines the choice of one from two alternatives H or D in each information set of a player. Player I (II) has 5 information sets, and thus, each of them has 32 strategies, which can be represented as sequence

(H, H, D, H, D)

. Denote this strategy of player as

u (\cdot)

.

Consider the strategy

u (\cdot) = (H, H, H, H, H)

, which is composed from ESS (case V > C) in each stage game. It would be appropriate if this strategy is ESS in our two-stage game [12,13]. Unfortunately, it does not satisfy Definition 1, which was the reason to change in our paper this definition to Definition 2.

It can be easily seen that condition (2) holds since

(u (\cdot), u (\cdot))

is NE in the game Γ. However, there exist a strategy

v (\cdot) = (H, H, D, D, D)

for which the payoff

E_{1} (v (\cdot), u (\cdot)) = E_{1} (u (\cdot), u (\cdot)) .

Since

E_{1} (v (\cdot), u (\cdot)) = E_{1} (u (\cdot), u (\cdot)) = V - C

and

E (v (\cdot), v (\cdot)) = E (u (\cdot), v (\cdot)) = V - C

this shows that the condition (3) is not satisfied.

However, according to Definition 2, the strategy

u (\cdot) = (H, H, H, H, H)

is ESS since the strategy

v (\cdot) = (H, H, D, D, D)

giving the same payoff against

u (\cdot)

as

u (\cdot)

itself is excluded from consideration because of condition (5) of Definition 2.

Remark: In our example, ESS is in pure strategies, and thus in definitions ((2)–(5)), the mathematical expectation of the payoff coincides with the payoff itself [14].

Suppose now that

Γ

is the n-stage repeated bimatrix game. Let G be a stage symmetric bimatrix game. The strategies in G are alternatives in

Γ

. To each strategy i of player 1 in

Γ

, we correspond a strategy

T (i) = i

of player 2 in G with the same index i. Each alternative

c \in C_{i} (i = 1, 2)

in

Γ

is a strategy (index) in some stage game G in

Γ

. The mapping

T (c)

corresponds to the alternative c (strategy) of player 1 in stage game G, the alternative

T (c) = c

(strategy) of player 2 in the same stage game (strategy with the same index). To each information set

u_{1}

of player 1, mapping T corresponds the information set

u_{2}

of player 2 in the same stage game (the bimatrix game can be represented as a game in extensive form with two moves and two successive information sets

u_{1}

for player 1 and

u_{2}

for player 2).

Theorem 1.

If

\bar{β}

is a ESS in G, then the behavior strategy

\bar{b}

prescribing the behavior

\bar{β}

to the alternatives of each information set (

\bar{β}

is ESS in stage game G) is ESS in

(Γ, T)

.

3. Definition of ESS for n-Person Games

There are many different approaches to how the ESS should be extended to the n-person case. We shall follow the definition given in [15]. Suppose we have a game G in normal form:

G = < N; X_{1}, . . ., X_{n}; K_{1}, . . ., K_{n} >,

when

N = 1, . . ., n

is the set of players,

X_{i} = {x_{i}}

is the set of strategies of player i, and

K_{i} (x_{1}, . . ., x_{n})

is the payoff function of player i. We suppose for simplicity that the sets

X_{i}, i = 1, . . ., n

are finite.

Note that the strategy profile

\bar{x} = ({\bar{x}}_{1}, . . ., {\bar{x}}_{n})

is an ESS in G [16], if it is a strict Nash equilibrium, i.e., if

K_{i} (\bar{x} | | x_{i}) < K_{i} (\bar{x}) for all x_{i} \in X_{i}, i = 1, . . ., n .

(6)

It is proved that condition (6) protects the strategy

{\bar{x}}_{i}

against the invasion of a few mutants playing another strategy

y_{i}

.

It is also clear that (6) cannot be used to define ESS in multistage games since there is always a large number of strategies

y_{i} \in X_{i}

such that for any strategy profile,

x = (x_{1}, . . ., x_{n}), K_{i} (x | | y_{i}) = K_{i} (x)

.

Following the ideas of the previous section, try to refine the ESS concept specified in (6) in such a way that it could be useful also for n-person multistage games.

For this reason, we have to mention that (6) automatically excludes the mixed strategy profiles from consideration. Additionally, the refinement of this concept will act only with pure strategy profiles.

Denote by

U_{i}

the strategy set of player i in

Γ

.

u_{i} \in U_{i}

is the strategy of player i, and

H_{i} (u_{1}, . . ., u_{n})

is the payoff function of player i. Let

Γ

be a multistage n-person game.

Definition 3.

The strategy profile

\bar{u} = ({\bar{u}}_{1}, . ., {\bar{u}}_{n})

in Γ is called ESS if

H_{i} (\bar{u} | | u_{i}) \leq H_{i} (\bar{u}), u_{i} \in U_{i}, i = 1, . . ., n

(7)

and if

H_{i} (\bar{u} | | u_{i}) = H_{i} (\bar{u})

for some

i \in N, u_{i} \in U_{i}

, then paths corresponding to

(\bar{u} | | u_{i})

and

\bar{u}

necessarily coincide.

From Definition 3, it follows that strict inequality in (7) is valid for all those deviations, for which the resulting paths differ from that generated by the ESS strategy profile.

4. Existence of ESS in Multistage Repeated n-Person Games

Suppose that

Γ

is a finite stage repeated n-person game with simultaneous n-person stage game G. Suppose that G has an ESS (strict Nash equilibrium) [17]. Denote ESS in G as

\bar{x} = ({\bar{x}}_{1}, . . ., {\bar{x}}_{n})

, and the payoff in G as

K_{i} ({\bar{x}}_{1}, . . ., {\bar{x}}_{n})

. Denote

{\bar{K}}_{i} = K_{i} ({\bar{x}}_{1}, . . ., {\bar{x}}_{n})

. Consider zero-sum games

G^{i}

between player i as first player and subset

N \ {i}

as second player with strategy sets

X_{i}, X_{N \ {i}} = \prod_{k \in N \ {i}} X_{k}

correspondingly and payoff of player i equal to

K_{i} (x_{1}, . . ., x_{n})

. (The payoff of second player

N \ {i}

equals −

K_{i} (x_{1}, . . ., x_{n})

.) Denote by

μ^{*} = (μ_{i}^{*}, μ_{N \ {i}}^{*})

the corresponding mixed-strategy saddle point in

G^{i}

, and by

υ_{i}

the value of game

G^{i}

. Fix some n-tuple

\tilde{x} = ({\tilde{x}}_{1}, . . ., {\tilde{x}}_{n})

,

{\tilde{K}}_{i} = K_{i} (\tilde{x})

, and consider

\tilde{\tilde{K_{i}}} = max_{x_{i} \in X_{i}} K_{i} (\tilde{x} | | x_{i}) .

Suppose that the following conditions hold

{\tilde{K}}_{i} + {\bar{K}}_{i} > \tilde{\tilde{K_{i}}} + υ_{i}, {\tilde{K}}_{i} > {\bar{K}}_{i}, {\bar{K}}_{i} > υ_{i} .

(8)

Theorem 2.

If there exists such an n-tuple of strategies

\tilde{x} = ({\tilde{x}}_{1}, . . ., {\tilde{x}}_{n})

in G that (8) holds, then Γ has an ESS which is constructed as follows.

If l is a number of stages in Γ, then each player i has to play

{\tilde{x}}_{i}

on the first

l - 1

stages and

{\bar{x}}_{i}

(ESS in G) on the last stage l: in case on some stage

t < l

, player i first deviates for the first timefrom

{\tilde{x}}_{i}

, starting from the stage

t + 1

coalition of players,

(N \ {i})

chooses

μ_{N \ {i}}^{*}

from the mixed-strategy saddle point in

G^{i}

.

5. ESS for Metagames

Finite multistage game

Γ

, at each stage of which some n-person game G is played, is called metagame; the game realized at each stage depends on the players’ choices in previous games.

Over the strategy profiles x in the stage game G, the mapping

T_{G} (x)

is defined, which corresponds to each stage game G and strategy profile x for the next stage game

G^{1} = T_{G} (x)

.

Suppose that in metagame

Γ

, on the first stage, the stage game

G^{1}

is played. If in

G^{1}

, players choose strategy profile

x^{1} = (x_{1}^{1}, . . ., x_{n}^{1})

, then on the second stage, the game

G^{2} = T_{G^{1}} (x^{1})

is played. If on stage k, players playing the stage game

G^{k}

choose strategy profile

x^{k} = (x_{1}^{k}, . . ., x_{n}^{k})

, on the next stage, the game

G^{k + 1} = T_{G^{k}} (x^{k})

is played. The metagame ends on stage m. The payoff of player

i \in N

in the metagame is equal to the sum of their payoffs in stage games. Denote by

K_{i}^{l} (x_{1}^{l}, . ., x_{n}^{l})

, the payoff of player

i \in N

in stage game

G^{l}

, then the payoff of player

i \in N

in metagame is equal to

H_{i} = \sum_{l = 1}^{m} K_{i}^{l} (x_{1}^{l}, . . ., x_{n}^{l}), i \in N .

It is important that after each stage, players know all of the prehistory (prehistory—players’ choices before current stage of metagame).

The strategy

u_{i}

of player

i \in N

in

Γ

is a mapping which corresponds to the choice of strategy in stage game G as a function of the strategy profiles of all players in stage games realized before the stage game G.

Suppose that stage game G has ESS (strict Nash equilibrium). Note that strategy profile

\bar{x} = ({\bar{x}}_{1}, . . ., {\bar{x}}_{n})

is ESS in G, and

K_{i} ({\bar{x}}_{1}, . . ., {\bar{x}}_{n}) = {\bar{K}}_{i}

is the payoff of player i in G under the strategy profile

\tilde{u}

.

Suppose that under strategy profile

\bar{u} = (u_{1}, . . ., u_{i}, . . ., u_{n})

, the sequence of stage games

G^{1}, . . ., G^{k}, . . ., G^{n}

is realized. This sequence of stage games we shall call path corresponding to n is the strategy profile

u = (u_{1}, . . ., u_{i}, . . ., u_{n})

.

Now consider the stage game

G^{k}, k = 1, . . ., l - 1

. Note that the game

G^{k}

depends also upon choices made by players in previous stage game

G^{k - 1}

. This means that on stage k dependent on previous strategy choices, different games of type

G^{k}

can be realized. For each stage game

G^{k}

, denote by

G_{i}^{k}

the zero-sum game between player i as the first player and subset

N \ {i}

as the second player with sets of strategies

X_{i}^{k}, X_{N \ {i}}^{k} = \prod_{m \in N \ {i}}^{} X_{m}^{k}

respectively, and the payoff of player i is given by

K_{i}^{k} (x_{1}^{k}, . . ., x_{n}^{k})

. (Payoff of the second player

N \ {i}

is given by −

K_{i}^{k} (x_{1}^{k}, . ., x_{n}^{k})

.)

Denote by

({\hat{η}}_{i}^{k}, {\hat{η}}_{N \ {i}}^{k})

the corresponding mixed-strategy profile in the saddle point of

G_{i}^{k}

and by

υ_{i}^{k}

the value of

G_{i}^{k}

. Fix some strategy profile in

G^{k}

as

{\tilde{x}}^{k} = ({\tilde{x}}_{1}^{k}, . . ., {\tilde{x}}_{n}^{k}), {\tilde{K}}_{i}^{k} = K_{i}^{k} ({\tilde{x}}^{k})

and suppose that

{\tilde{K}}_{i}^{k} > {\bar{K}}_{i}^{k}, i = 1, . . ., n .

Consider

{\tilde{\tilde{K}}}_{i}^{k} = {max}_{x_{i}^{k} \in X_{i}^{k}} K_{i}^{k} ({\tilde{x}}^{k} | | x_{i}^{k})

.

Definition 4.

The strategy profile

u^{*} = (u_{1}^{*}, . . ., u_{n}^{*})

is ESS in the metagame if

H_{i} (u^{*}) \geq H_{i} (u^{*} | | u_{i})

for all i and all

u_{i}

, and if

H_{i} (u^{*} | u_{i}) = H_{i} (u^{*})

for some

i \in N, u_{i} \in U_{i}

, then paths corresponding to

(u^{*} | u_{i})

and

u^{*}

coincide.

This definition is common for definition of ESS for n-person games.

Generate strategy

u_{i}^{*}

of player i in metagame

Γ

as the following: in games

G^{k}, j = 1, . . ., l - 1

, players chooses strategies

{\tilde{x}}_{i}^{k}

, and at last stage in

G^{k}

—

{\bar{x}}_{i}^{k}

. Then, strategy profile

u^{*} = (u_{1}^{*}, . . ., u_{n}^{*})

realizes a sequence of stage games

G^{1 *}, G^{2 *}, . . ., G^{l *}

in metagame

Γ

, which we will call the optimal trajectory. Denote by

{\tilde{K}}_{i}^{k}

the payoff of player i in

{\tilde{G}}_{i}^{k}

.

Suppose that player i deviates from

u_{i}^{*}

at some stage

t < l

, then, beginning from stage

t + 1

, players from

N \ {i}

choose

{\hat{η}}_{N \ {i}}^{k}, k = t - 1, . . ., l

, see Figure 2. Define

{\bar{\bar{u}}}_{i}^{k}

, satisfying

H_{i} ({\tilde{u}}^{k} | | {\bar{\bar{u}}}_{i}^{k}) \geq H_{i} ({\tilde{u}}^{k})

. After stage t, players

N \ {i}

choose strategy

{\hat{η}}_{N \ {i}}^{l}, l > t

, optimal in the zero-sum game

G_{i}^{l}

.

Denote

W_{i} = {max}_{0 \leq k \leq l} [{max}_{G^{k}} υ_{i}^{k}]

. Suppose that

\sum_{k = t}^{l - 1} {\tilde{K}}_{i}^{k} + {\bar{K}}_{i}^{k} > {\tilde{\tilde{K}}}_{i}^{k} + (l - t) W_{i}, t = 1, . . ., l - 1 .

(9)

Theorem 3.

If there exist strategies

{\tilde{x}}_{i}^{k_{j}}

in games

G^{k_{j}}

such that (9) holds, then the strategy profile

u^{*}

, mentioned above, is ESS in metagame Γ.

Proof.

The payoff of player i when the strategy profile

u^{*}

is used is

H_{i} (u^{*}) = \sum_{k = 1}^{l - 1} {\tilde{K}}_{i}^{k} + {\bar{K}}_{i}^{k} = \sum_{k = 1}^{l - 1} {\tilde{K}}_{i}^{k} ({\tilde{x}}_{1}^{k}, . . ., {\tilde{x}}_{n}^{k}) + {\bar{K}}_{i}^{k} ({\bar{x}}_{1}^{k}, . . ., {\bar{x}}_{n}^{k}) .

It is important to note that

x_{i}^{k}

are pure strategies.

Suppose that player i deviates from

u_{i}^{*}

, and this happens at stage t of metagame

Γ

. Denote by

u_{i}

this new strategy of player i. Then we obtain a new strategy profile

(u^{*} | | u_{i})

in

Γ

, which realizes the path, different from the optimal trajectory. Consider the payoff of player i under strategy profile

(u^{*} | | u_{i})

, realizing the path different from the optimal trajectory. From (9), we obtain

\begin{matrix} \begin{matrix} H_{i} (u^{*} | | u_{i}) = \sum_{k = 1}^{t - 1} {\tilde{K}}_{i}^{k} + {\tilde{\tilde{K}}}_{i}^{k} + \sum_{k = t + 1}^{l} υ_{i}^{k} \leq \sum_{k = 1}^{t - 1} {\tilde{K}}_{i}^{k} + {\tilde{\tilde{K}}}_{i}^{k} + \sum_{j = t + 1}^{l} W_{i} \\ = \sum_{k = 1}^{t - 1} {\tilde{K}}_{i}^{k} + {\tilde{\tilde{K}}}_{i}^{k} + (l - t) W_{i} < \sum_{k = 1}^{l - 1} {\tilde{K}}_{i}^{k} + {\bar{K}}_{i}^{k} = H_{i} (u^{*}) \end{matrix} \end{matrix}

Thus,

u^{*}

is ESS (see Definition 4).

The theorem is proved. □

Example 2.

Consider a metagame Γ, in which one of two possible games

G^{'}

and

G^{″}

is played on each stage.

G^{'}

and

G^{″}

are two-player games with strategy sets

X_{1}^{'} = (x_{11}^{'}, x_{12}^{'}, x_{13}^{'})

,

X_{2}^{'} = (x_{21}^{'}, x_{22}^{'}, x_{23}^{'})

in

G^{'}

of players I and II, and strategy sets

X_{1}^{″} = (x_{11}^{″}, x_{12}^{″}, x_{13}^{″})

,

X_{2}^{″} = (x_{21}^{″}, x_{22}^{″}, x_{23}^{″})

in

G^{″}

of player I and II, correspondingly. The payoffs in

G^{'}

are defined as Table 1.

In

G^{″}

as Table 2.

In both games, the Nash equilibrium is

{\bar{x}}^{'} = (x_{12}^{'}, x_{22}^{'})

,

{\bar{x}}^{″} = (x_{12}^{″}, x_{22}^{″})

with payoffs

{\bar{K}}_{1}^{'} (x_{12}^{'}, x_{22}^{'}) = {\bar{K}}_{2}^{'} (x_{12}^{'}, x_{22}^{'}) = {\bar{K}}_{1}^{'} (2, 2) = {\bar{K}}_{2}^{'} (2, 2) = 6

and

{\bar{K}}_{1}^{″} (x_{12}^{″}, x_{22}^{″}) = {\bar{K}}_{2}^{″} (x_{12}^{″}, x_{22}^{″}) = {\bar{K}}_{1}^{″} (2, 2) = {\bar{K}}_{2}^{″} (2, 2) = 6 .

Also we have

K_{i}^{'} (1, 1) = 10 > 6 = K_{i}^{'} (2, 2), i = 1, 2

and

K_{i}^{″} (1, 1) = 11 > 6 = K_{i}^{″} (2, 2), i = 1, 2 .

Suppose

{\tilde{K}}_{i}^{'} = K_{i}^{'} (1, 1) = 10, i = 1, 2

. In both stage games, if player i deviates from

\tilde{x} = (1, 1) = (x_{11}^{'}, x_{21}^{'})

(or

(x_{11}^{″}, x_{21}^{″})

), they can obtain at most

{\tilde{\tilde{K}}}_{1}^{'} = max_{l} K_{1}^{'} (\tilde{x} | | x_{1 l}) = max_{l} K_{1}^{'} (x_{11}^{'}, x_{21}^{'} | | x_{1 l}) = K_{1}^{'} (2, 1) = 15 .

Similarly,

{\tilde{\tilde{K}}}_{2}^{″} = 15

. The metagame Γ proceeds as follows. On the first stage, players play the game

G^{'}

(

G^{1} = G^{'}

) and if in

G^{'}

, they choose strategy profile (1, 1) or (1, 2), on the next stage, the game

G^{'}

is repeated

(G^{2} = G^{'})

. In the other case (if strategy profiles (1, 2), (1, 3), (2, 1), (2, 3), (3, 1), (3, 2), and (3, 3) are chosen), on the next stage, the game

G^{″}

is played

(G^{2} = G^{″})

. If on stage k the game

G^{'} (G^{k} = G^{'})

is played, the next stage game is defined as in the first stage. If on stage k, the game

G^{″} (G^{k} = G^{''})

is played, on the next stage, the game

G^{'} (G^{k + 1} = G^{'})

is played if in stage game

G^{k}

, the strategy profiles (1, 1) or (1, 2) are chosen. In other cases, on stage

k + 1

, the game

G^{″}

is played

(G^{k + 1} = G^{″})

. The metagame ends on stage m.

In each case, when one of the players (player i) deviates from strategy profile

\tilde{x}

, the other player will choose strategy 2 on the next stages of the metagame. Hence, the payoff of the deviating player in all future stage games will be equal to 0. We see that the condition

{\tilde{K}}_{i} (2, 2) + {\bar{K}}_{i} (1, 1) = 6 + 10 > {\tilde{\tilde{K}}}_{i} + v_{i} = 15 + 0

is satisfied, and the strategy profile

u^{*}

constructed above is strong NE and, thus, ESS.

6. Conclusions

In this paper, based on the definition of “direct ESS”, we try to provide a broader definition of ESS, from two-person games to multistage repeated n-person games. We propose the concept of the “meta-game”, in which the superiority of a broad ESS definition can be highlighted. We prove the existence of ESS under certain conditions and show an example.

The proposed refinement of ESS in the repeated games has the natural property that the repetition of ESS in the stage game will constitute an ESS in the whole game. This is not true if the classical ESS definition is used for repeated games.

Additionally, the ESS definition and its construction are proposed for general multistage n-person games (we call them metagames). It is determined that the use of ESS in each stage game of the metagame (note that this case is different from repeated games, since in the metagame, the stage games are different and depend upon the history of the game process) does not give the ESS in the whole game. In spite of that, we propose an algorithm of constructing the ESS in metagames using the ESS in stage games and threat strategies.

Author Contributions

Conceptualization, L.A.P.; methodology, L.A.P.; validation, L.A.P. and X.L.; investigation, L.A.P. and X.L.; resources, L.A.P.; writing—original draft preparation, L.A.P. and X.L.; writing—review and editing, L.A.P. and X.L.; visualization, L.A.P. and X.L.; supervision, L.A.P.; project administration, L.A.P.; funding acquisition, L.A.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research is funded by the China Scholarship Council, grant number 202209010015.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Maynard Smith, J.; Price, G.R. The Logic of Animal Conflict. Nature 1973, 246, 15–18. [Google Scholar] [CrossRef]
Van Damme, E. Stability and Perfection of Nash Equilibria; Springer: Berlin/Heidelberg, Germany, 1991; Volume 340, pp. 215–258. [Google Scholar]
MacArthur, R.H. Theoretical and Mathematical Biology; Waterman, T., Horowitz, H., Eds.; Blaisdell: New York, NY, USA, 1965. [Google Scholar]
Hamilton, W.D. Extraordinary sex ratios. Science 1967, 156, 477–488. [Google Scholar] [CrossRef] [PubMed]
Maynard Smith, J. The Theory of Games and the Evolution of Animal Conflicts. J. Theor. Biol. 1974, 47, 209–221. [Google Scholar] [CrossRef] [PubMed]
Maynard Smith, J. Evolution and the Theory of Games; Cambridge University Press: Cambridge, UK, 1982; ISBN 0-521-28884-3. [Google Scholar]
Kuhn, H.W. Extensive Games and the Problem of Information. Ann. Math. Stud. 1953, 28, 193–216. [Google Scholar]
Selten, R. Evolutionary stability in extensive 2-person games. Math. Soc. Sci. 1983, 5, 269–363. [Google Scholar] [CrossRef]
Petrosjan, L.A. Dynamic Evolutionary Game Theory in Biology and Economics; University of Waterloo: Waterloo, ON, Canada, 1995; p. 4. [Google Scholar]
Hofbauer, J.; Sigmund, K. Evolutionary Games and Population Dynamics; Cambridge University Press: Cambridge, UK, 1998. [Google Scholar]
Cowden, C. Game Theory, Evolutionary Stable Strategies and the Evolution of Biological Interactions. Nat. Educ. Knowl. 2012, 3, 6. [Google Scholar]
Lin, Z. An algorithm of evolutionarily stable strategies for the single-population evolutionary game. J. Comput. Appl. Math. 2008, 217, 157–165. [Google Scholar] [CrossRef]
Loertscher, S. Rock–Scissors–Paper and evolutionarily stable strategies. Econ. Lett. 2013, 118, 473–474. [Google Scholar] [CrossRef]
Deng, X.; Wang, Z.; Liu, Q.; Deng, Y.; Mahadevan, S. A Belief-Based Evolutionarily Stable Strategy. J. Theor. Biol. 2014, 361, 81–86. [Google Scholar] [CrossRef] [PubMed]
Sandholm, W.H. Population Games and Evolutionary Dynamics; The MIT Press: Cambridge, MA, USA, 2010; pp. 269–363. [Google Scholar]
Apaloo, J.; Brown, J.; McNickle, G.G.; Vincent, T.L.; Vincent, T.L. ESS versus Nash: Solving evolutionary games. Evol. Ecol. Res. 2015, 16, 293–314. [Google Scholar]
Weibull, J.W. Evolutionary Game Theory; Cambridge University Press: Cambridge, UK, 1995. [Google Scholar]

Figure 1. Hawk and Dove game with two stage.

Figure 2. Multistage game G with deviation of player i.

Table 1. The payoffs in

G^{'}

.

Table 1. The payoffs in

G^{'}

.

	$x_{21}^{'}$	$x_{22}^{'}$	$x_{23}^{'}$
	1	2	3
$x_{11}^{'}$ 1	(10, 10)	(0, 15)	(0, 0)
$x_{12}^{'}$ 2	(15, 0)	(6, 6)	(0, 0)
$x_{13}^{'}$ 3	(0, 0)	(0, 0)	(0, 0)

Table 2. The payoffs in

G^{″}

.

Table 2. The payoffs in

G^{″}

.

	$x_{21}^{″}$	$x_{22}^{″}$	$x_{23}^{″}$
	1	2	3
$x_{11}^{″}$ 1	(11, 11)	(0, 15)	(2, 2)
$x_{12}^{″}$ 2	(15, 0)	(6, 6)	(2, 2)
$x_{13}^{″}$ 3	(2, 2)	(2, 2)	(2, 2)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Petrosyan, L.A.; Liu, X. Evolutionary Stable Strategies in Multistage Games. Mathematics 2023, 11, 2492. https://doi.org/10.3390/math11112492

AMA Style

Petrosyan LA, Liu X. Evolutionary Stable Strategies in Multistage Games. Mathematics. 2023; 11(11):2492. https://doi.org/10.3390/math11112492

Chicago/Turabian Style

Petrosyan, Leon A., and Xiuxiu Liu. 2023. "Evolutionary Stable Strategies in Multistage Games" Mathematics 11, no. 11: 2492. https://doi.org/10.3390/math11112492

APA Style

Petrosyan, L. A., & Liu, X. (2023). Evolutionary Stable Strategies in Multistage Games. Mathematics, 11(11), 2492. https://doi.org/10.3390/math11112492

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Evolutionary Stable Strategies in Multistage Games

Abstract

1. Introduction

2. Definition of ESS for Two-Person Games

3. Definition of ESS for n-Person Games

4. Existence of ESS in Multistage Repeated n-Person Games

5. ESS for Metagames

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI