Dynamic Multicriteria Games with Finite Horizon

Rettieva, Anna

doi:10.3390/math6090156

Open AccessArticle

Dynamic Multicriteria Games with Finite Horizon

by

Anna Rettieva

^1,2,†

¹

Institute of Applied Mathematical Research of the Karelian Research Centre of RAS, 11, Pushkinskaya Str., Petrozavodsk 185910, Russia

²

Saint-Petersburg State University, 7–9, Universitetskaya Nab., Saint-Petersburg 199034, Russia

^†

Current address: 11, Pushkinskaya str., Petrozavodsk 185910, Russia.

Mathematics 2018, 6(9), 156; https://doi.org/10.3390/math6090156

Submission received: 31 July 2018 / Revised: 8 August 2018 / Accepted: 3 September 2018 / Published: 5 September 2018

(This article belongs to the Special Issue Mathematical Game Theory)

Download

Browse Figures

Versions Notes

Abstract

The approaches to construct optimal behavior in dynamic multicriteria games with finite horizon are presented. To obtain a multicriteria Nash equilibrium, the bargaining construction (Nash product) is adopted. To construct a multicriteria cooperative equilibrium, a Nash bargaining scheme is applied. Dynamic multicriteria bioresource management problem with finite harvesting times is considered. The players’ strategies and the payoffs are obtained under cooperative and noncooperative behavior.

Keywords:

dynamic games; multicriteria games; Nash bargaining solution; cooperative equilibrium

MSC:

22E46

1. Introduction

Mathematical models involving more than one objective seem more adherent to real problems. Players can have more than one goal which are often not comparable. These situations are typical for game-theoretic models in economy and ecology. For example, in management problems the decision maker wants to maximize her profit and to minimize the production costs, in bioresource management problems the players wish to maximize their exploitation rates and to minimize the harm to the environment, and so on. Hence, a multicriteria game approach helps to make decisions in multi-objective problems.

Traditionally, equilibrium analysis in multicriteria problems is based on the static variant. Some concepts have been suggested to solve multicriteria games (e.g., the ideal Nash equilibrium [1], the E-equilibrium concept [2]). However the notion of Pareto equilibrium is the most-studied concept in multicriteria game theory.

This paper is dedicated to optimal behavior design in dynamic multicriteria games with finite horizon. To construct noncooperative equilibrium, we adopt the approach in [3]. The multicriteria Nash equilibrium is obtained by applying the bargaining concept (via Nash products), with the guaranteed payoffs playing the role of the status quo points. To determine the cooperative equilibrium in dynamic games with many objectives, we adopt the Nash bargaining scheme. Namely, the cooperative strategies and payoffs are constructed via a Nash bargaining solution, with the multicriteria Nash equilibrium payoffs playing the role of the status quo points.

Further exposition has the following structure. Classical solution concepts for noncooperative and cooperative multicriteria games are given in Section 2. Section 3 describes the proposed noncooperative and cooperative solution concepts for a finite horizon multicriteria dynamic game with two participants in discrete time. A two-player discrete-time game-theoretic bioresource management model (harvesting problem) with a finite planning horizon is treated in Section 4. The noncooperative behavior is obtained in Section 4.1, whereas the cooperative case is treated in Section 4.2. Finally, Section 5 provides the basic results and their discussion.

2. Multicriteria Games and Solution Concepts

A multicriteria noncooperative game is

G = 〈 {(X_{i})}_{i \in N}, {(u_{i})}_{i \in N} 〉,

where

N = {1, \dots, n}

gives the set of players,

X_{i}

is the set of strategies of player i, and

u_{i}

denotes the payoff function of player i,

u_{i} : \prod_{i = 1}^{n} X_{i} \to R^{m}

,

i = 1, \dots, n

.

Shapley [4] gave a generalization of the classical Nash equilibrium to Pareto equilibrium for such games.

Definition 1.

A strategy profile

x \in X = \prod_{i = 1}^{n} X_{i}

is a

1.: weak Pareto equilibrium if $\forall i \in N$

$\neg \exists y_{i} \in X_{i} : u_{i} (y_{i}, x_{- i}) > u_{i} (x),$
2.: strong Pareto equilibrium if $\forall i \in N$

$\neg \exists y_{i} \in X_{i} : u_{i} (y_{i}, x_{- i}) ≧ u_{i} (x) .$

Here

a > b \Leftrightarrow a_{i} > b_{i}

,

a ≧ b \Leftrightarrow a_{i} \geq b_{i}

,

\forall i = 1, \dots, m

.

Other solution concepts for multicriteria games, namely ideal Nash equilibrium and E-equilibrium, were introduced in [1,2], respectively. Reference [5] connected multicriteria games with potential games, and the coalition formation processes for multicriteria games were considered in [6].

A multicriteria cooperative game is defined as

〈 N, v 〉,

where

N = {1, \dots, n}

is the set of players,

v : 2^{N} \to R^{m}

denotes the characteristic function,

v (\emptyset) = 0

, and

v (S) = (\begin{matrix} v^{1} (S) \\ v^{2} (S) \\ \dots \\ v^{m} (S) \end{matrix}), \forall S \in 2^{N} .

For cooperative multicriteria games, the natural generalization of the Shapley value is applied to distribute the cooperative payoff among the players.

Definition 2.

The Shapley value

ϕ (v)

of the multicriteria game

〈 N, v 〉

is

ϕ_{i} (v) = \sum_{S \subset N, i \in S} \frac{(s - 1)! (n - s)!}{n!} (\begin{matrix} v^{1} (S) - v^{1} (S \ {i}) \\ v^{2} (S) - v^{2} (S \ {i}) \\ \dots \\ v^{m} (S) - v^{m} (S \ {i}) \end{matrix}) .

3. Dynamic Multicriteria Model with Finite Horizon and Solution Concepts

Consider a multicteria dynamic game with two participants in discrete time. The players exploit a common resource, and both wish to optimize m different criteria. The state dynamics is in the form

x_{t + 1} = f (x_{t}, u_{1 t}, u_{2 t}), x_{0} = x,

(1)

where

x_{t} \geq 0

is the resource size at time

t \geq 0

,

f (x_{t}, u_{1 t}, u_{2 t})

gives the natural growth function, and

u_{i t} \in U_{i}

denotes the strategy of player i at time

t \geq 0

,

i = 1, 2

.

The payoff functions of the players over the finite time horizon are defined by

J_{1} = (\begin{matrix} J_{1}^{1} = \sum_{t = 0}^{n} δ^{t} g_{1}^{1} (u_{1 t}, u_{2 t}) \\ \dots \\ J_{1}^{m} = \sum_{t = 0}^{n} δ^{t} g_{1}^{m} (u_{1 t}, u_{2 t}) \end{matrix}), J_{2} = (\begin{matrix} J_{2}^{1} = \sum_{t = 0}^{n} δ^{t} g_{2}^{1} (u_{1 t}, u_{2 t}) \\ \dots \\ J_{2}^{m} = \sum_{t = 0}^{n} δ^{t} g_{2}^{m} (u_{1 t}, u_{2 t}) \end{matrix}),

(2)

where

g_{i}^{j} (u_{1 t}, u_{2 t}) \geq 0

gives the instantaneous utility,

i = 1, 2

,

j = 1, \dots, m

, and

δ \in (0, 1)

denotes a common discount factor.

3.1. Multicriteria Nash Equilibrium

We design the equilibrium in dynamic multicriteria game applying the Nash bargaining products [3]. Therefore, we begin with the construction of guaranteed payoffs which play the role of status quo points.

There are three possible concepts to determine the guaranteed payoffs [3]. In the first one, four guaranteed payoff points are obtained as the solutions of zero-sum games. In particular, the first guaranteed payoff point is a solution of a zero-sum game where player 1 wishes to maximize her first criterion and player 2 wants to minimize it. Other points are obtained by analogy. Namely,

G_{1}^{j} is the solution of zero-sum game 〈 I, I I, U_{1}, U_{2}, J_{1}^{j} 〉, j = 1, \dots, m,

G_{2}^{j} is the solution of zero-sum game 〈 I, I I, U_{1}, U_{2}, J_{2}^{j} 〉, j = 1, \dots, m .

The second approach can be applied when the players’ objectives are comparable. Consequently, the guaranteed payoff points for player 1 (

G_{1}^{1}

, …,

G_{1}^{m}

) are obtained as the solution of a zero-sum game where she wants to maximize the sum of her criteria and player 2 wishes to minimize it (and, by analogy, for player 2). Namely,

G_{1}^{1}, \dots, G_{1}^{m} are the solution of zero-sum game 〈 I, I I, U_{1}, U_{2}, J_{1}^{1} + \dots + J_{1}^{m} 〉,

G_{2}^{1}, \dots, G_{2}^{m} are the solution of zero-sum game 〈 I, I I, U_{1}, U_{2}, J_{2}^{1} + \dots + J_{2}^{m} 〉 .

In the third approach, the guaranteed payoff points are constructed as the Nash equilibrium with the appropriate criteria of both players, respectively. Namely,

G_{1}^{1} and G_{2}^{1} is the Nash equilibrium in the game 〈 I, I I, U_{1}, U_{2}, J_{1}^{1}, J_{2}^{1} 〉,

\dots

G_{1}^{m} and G_{2}^{m} is the Nash equilibrium in the game 〈 I, I I, U_{1}, U_{2}, J_{1}^{m}, J_{2}^{m} 〉 .

To construct multicriteria payoff functions, we adopt the Nash products. The role of the status quo points belongs to the guaranteed payoffs of the players:

\begin{matrix} H_{1} (u_{1 t}, u_{2 t}) = (J_{1}^{1} (u_{1 t}, u_{2 t}) - G_{1}^{1}) \cdot \dots \cdot (J_{1}^{m} (u_{1 t}, u_{2 t}) - G_{1}^{m}), \\ H_{2} (u_{1 t}, u_{2 t}) = (J_{2}^{1} (u_{1 t}, u_{2 t}) - G_{2}^{1}) \cdot \dots \cdot (J_{2}^{m} (u_{1 t}, u_{2 t}) - G_{2}^{m}) . \end{matrix}

(3)

Definition 3.

A strategy profile

(u_{1 t}^{N}, u_{2 t}^{N})

is called a multicriteria (or multicriteria by-product) Nash equilibrium [3] of the problem (1), (2) if

\begin{matrix} H_{1} (u_{1 t}^{N}, u_{2 t}^{N}) \geq H_{1} (u_{1 t}, u_{2 t}^{N}) \forall u_{1 t} \in U_{1}, \\ H_{2} (u_{1 t}^{N}, u_{2 t}^{N}) \geq H_{2} (u_{1 t}^{N}, u_{2 t}) \forall u_{2 t} \in U_{2} . \end{matrix}

(4)

A two-player discrete-time game-theoretic bioresource management model with an infinite planning horizon was considered in [3]. The multicriteria Nash equilibrium was obtained for different variants of the guaranteed payoffs’ construction. It was shown that the worst variant for the environment is the first one since it leads to overexploitation. The variant where the guaranteed payoffs are determined as Nash equilibrium is beneficial for both players and, moreover, improves the ecological situation as it limits bioresource exploitation.

3.2. Multicriteria Cooperative Equilibrium

The multicriteria cooperative equilibrium is obtained as a solution of a Nash bargaining scheme with the multicriteria Nash equilibrium payoffs playing the role of the status quo points.

First we have to determine noncooperative payoffs as players’ gains when they apply multicriteria Nash equilibrium strategies

(u_{1 t}^{N}, u_{2 t}^{N})

:

J_{1}^{N} = (\begin{matrix} J_{1}^{1 N} = \sum_{t = 0}^{n} δ^{t} g_{1}^{1} (u_{1 t}^{N}, u_{2 t}^{N}) \\ \dots \\ J_{1}^{m N} = \sum_{t = 0}^{n} δ^{t} g_{1}^{m} (u_{1 t}^{N}, u_{2 t}^{N}) \end{matrix}), J_{2}^{N} = (\begin{matrix} J_{2}^{1 N} = \sum_{t = 0}^{n} δ^{t} g_{2}^{1} (u_{1 t}^{N}, u_{2 t}^{N}) \\ \dots \\ J_{2}^{m N} = \sum_{t = 0}^{n} δ^{t} g_{2}^{m} (u_{1 t}^{N}, u_{2 t}^{N}) \end{matrix}) .

(5)

Then, we construct a Nash product where the sum of players’ noncooperative payoffs plays a role as a status quo point. To construct the cooperative behavior we adopt a Nash bargaining solution, so it is required to solve the following problem:

\begin{matrix} (V_{1}^{1 c} + V_{2}^{1 c} - J_{1}^{1 N} - J_{2}^{1 N}) \cdot \dots \cdot (V_{1}^{m c} + V_{2}^{m c} - J_{1}^{m N} - J_{2}^{m N}) = \\ = (\sum_{t = 0}^{n} δ^{t} (g_{1}^{1} (u_{1 t}^{c}, u_{2 t}^{c}) + g_{2}^{1} (u_{1 t}^{c}, u_{2 t}^{c})) - J_{1}^{1 N} - J_{2}^{1 N}) \cdot \dots \\ \cdot (\sum_{t = 0}^{n} δ^{t} (g_{1}^{m} (u_{1 t}^{c}, u_{2 t}^{c}) + g_{2}^{m} (u_{1 t}^{c}, u_{2 t}^{c})) - J_{1}^{m N} - J_{2}^{m N}) \to max_{u_{1 t}^{c} \in U_{1}, u_{2 t}^{c} \in U_{2}}, \end{matrix}

(6)

where

J_{i}^{j N}

are the noncooperative gains determined in (5),

i = 1, 2

,

j = 1, \dots, m

.

Definition 4.

A strategy profile

(u_{1 t}^{c}, u_{2 t}^{c})

is called a multicriteria cooperative equilibrium of the problem (1), (2) if it solves the problem (6).

Now we pass to a dynamic bicriteria model related with the bioresource management problem (harvesting problem) to show how the suggested concepts work.

4. Dynamic Multicriteria Model with Finite Harvesting Times

Consider a bicriteria discrete-time dynamic bioresource management model with two participants and fixed harvesting times. Suppose that the two players (countries or fishing firms) harvest a fish stock during finite time horizon

[0, n]

. The fish population evolves according to the equation

x_{t + 1} = ε x_{t} - u_{1 t} - u_{2 t}, x_{0} = x,

(7)

where

x_{t} \geq 0

is the population size at time

t \geq 0

,

ε \geq 1

denotes the natural birth rate, and

u_{i t} \geq 0

gives the catch of player i at time t,

i = 1, 2

.

Each player has two goals to optimize: they wish to maximize their profit from selling fish and minimize the catching cost. Suppose that the market price of the resource differs for both players, but their costs are identical and depend on both of players’ catches. Specifically, the payoff functions of the players over the finite time horizon are defined by

J_{1} = (\begin{matrix} J_{1}^{1} = \sum_{t = 0}^{n} δ^{t} p_{1} u_{1 t} \\ J_{1}^{2} = - \sum_{t = 0}^{n} δ^{t} c u_{1 t} u_{2 t} \end{matrix}), J_{2} = (\begin{matrix} J_{2}^{1} = \sum_{t = 0}^{n} δ^{t} p_{2} u_{2 t} \\ J_{2}^{2} = - \sum_{t = 0}^{n} δ^{t} c u_{1 t} u_{2 t} \end{matrix}),

(8)

where, for

i = 1, 2

,

p_{i} \geq 0

is the market price of the resource for player i,

c \geq 0

indicates the catching cost, and

δ \in (0, 1)

denotes the discount factor.

4.1. Multicriteria Nash Equilibrium

We begin with the construction of guaranteed payoffs applying the Bellman optimality principle. The third variant of the guaranteed payoff points’ construction is adopted as it is beneficial for both players, and, moreover, improves the ecological situation [3].

In this case the guaranteed payoff points

G_{1}^{1}

and

G_{2}^{1}

are defined as the Nash equilibrium in the game

〈 I, I I, U_{1}, U_{2}, J_{1}^{1}, J_{2}^{1} 〉

. Let

V_{1} (t, x)

be a value function for player 1, and

V_{2} (t, x)

for player 2.

Applying the Bellman principle, the value functions satisfy

\begin{matrix} V_{1} (t, x_{t}) = max_{u_{1 t} \geq 0} {δ^{t} p_{1} u_{1 t} + V_{1} (t + 1, ε x_{t} - u_{1 t} - u_{2 t})}, \\ V_{2} (t, x_{t}) = max_{u_{2 t} \geq 0} {δ^{t} p_{2} u_{2 t} + V_{2} (t + 1, ε x_{t} - u_{1 t} - u_{2 t})} . \end{matrix}

Assuming the value functions and the strategies have the linear forms, we get the solution

u_{1 t} = u_{2 t} = (ε - 1) x_{t},

and the dynamics becomes

x_{t} = {(2 - ε)}^{t} x_{0} .

Hence, the guaranteed payoffs take the forms

\begin{matrix} G_{1}^{1} = \sum_{t = 0}^{n} δ^{t} p_{1} u_{1 t} = \frac{p_{1} (ε - 1) (δ^{n + 1} {(2 - ε)}^{n} (ε - 2) + 1)}{1 - δ (2 - ε)} x_{0}, \\ G_{2}^{1} = \sum_{t = 0}^{n} δ^{t} p_{2} u_{2 t} = \frac{p_{2} (ε - 1) (δ^{n + 1} {(2 - ε)}^{n} (ε - 2) + 1)}{1 - δ (2 - ε)} x_{0} . \end{matrix}

By analogy, determining the Nash equilibrium in the game with the second criteria of both players

J_{1}^{2} (u_{1 t}, u_{2 t})

and

J_{2}^{2} (u_{1 t}, u_{2 t})

, we get two more guaranteed payoff points

G_{1}^{2} = G_{2}^{2} = G = \frac{c {(ε^{2} - 1)}^{2} (δ^{n + 1} - 1)}{4 ε^{4} (δ - ε^{2})} x_{0}^{2} .

To determine the multicriteria Nash equilibrium of problem (7), (8), it is required to solve the following problem:

\begin{matrix} (\sum_{t = 0}^{n} δ^{t} p_{1} u_{1 t} - G_{1}^{1}) (- \sum_{t = 0}^{n} δ^{t} c u_{1 t} u_{2 t} - G) \to max_{u_{1 t} \geq 0}, \\ (\sum_{t = 0}^{n} δ^{t} p_{2} u_{2 t} - G_{2}^{1}) (- \sum_{t = 0}^{n} δ^{t} c u_{1 t} u_{2 t} - G) \to max_{u_{2 t} \geq 0} . \end{matrix}

(9)

Considering the process starting from one-step until n-step game and seeking the linear strategies, we get the multicriteria Nash equilibrium.

Theorem 1.

The multicriteria Nash equilibrium strategies in the problem (7), (8) have the form

u_{1 t}^{N} = u_{2 t}^{N} = \frac{ε^{t - 1} γ_{11}^{N}}{1 + 2 γ_{11}^{N} \sum_{j = 0}^{t - 2} ε^{j}} x_{t}, t = 1, \dots, n .

(10)

The players’ strategy on the last step

γ_{11}^{N}

takes the form

\begin{matrix} γ_{11}^{N} = \frac{c A ε^{n - 1} - 2 \tilde{G} \sum_{j = 0}^{n - 2} ε^{j} + ε^{n - 1} \sqrt{c^{2} A^{2} - 3 \tilde{G} c \sum_{j = 0}^{n - 1} δ^{j}}}{3 c ε^{2 (n - 1)} \sum_{j = 0}^{n - 1} δ^{j} + 4 \tilde{G} {(1 + ε)}^{n - 2} - 4 c ε^{n - 1} A \sum_{j = 0}^{n - 2} ε^{j}}, \end{matrix}

(11)

where

A = \frac{(ε - 1) (δ^{n + 1} {(2 - ε)}^{n} (ε - 2) + 1)}{1 - δ (2 - ε)}

,

\tilde{G} = \frac{{(ε^{2} - 1)}^{2} (δ^{n + 1} - 1)}{4 ε^{4} (δ - ε^{2})}

.

Proof.

See the cooperative case that is given below. ☐

4.2. Cooperative Equilibrium

Suppose that the players wish to cooperate. We construct the cooperative payoffs and strategies applying the Nash bargaining solution [7]. First, we have to determine noncooperative payoffs as the players’ gains when they apply multicriteria Nash strategies. Then, we construct a Nash product where the sum of players’ noncooperative payoffs plays a role as status quo points.

According to (10), (11), the noncooperative payoffs have the forms

\begin{matrix} J_{1}^{1 N} (x) = \sum_{t = 0}^{n} δ^{t} p_{1} u_{1 t}^{N} = p_{1} K_{1} x_{0}, \\ J_{2}^{1 N} (x) = \sum_{t = 0}^{n} δ^{t} p_{2} u_{2 t}^{N} = p_{2} K_{1} x_{0}, \\ J_{1}^{2 N} (x) = J_{2}^{2 N} (x) = - c \sum_{t = 0}^{n} δ^{t} u_{1 t}^{N} u_{2 t}^{N} = K_{2} x_{0}^{2}, \end{matrix}

(12)

where

K_{1} = γ_{11}^{N} (ε - 2 γ_{11}^{N}) \sum_{t = 0}^{n} δ^{t} L_{t}^{2}, K_{2} = {(γ_{11}^{N})}^{2} {(ε - 2 γ_{11}^{N})}^{2} \sum_{t = 0}^{n} δ^{t} L_{t}^{4}, L_{t} = \frac{ε^{t - 1}}{1 + 2 γ_{11}^{N} \sum_{j = 0}^{t - 2} ε^{j}} .

According to Definition 4, in order to construct the cooperative strategies it is required to solve the problem (6). Hence,

(V_{1}^{1 c} + V_{2}^{1 c} - J_{1}^{1 N} - J_{2}^{1 N}) (V_{1}^{2 c} + V_{2}^{2 c} - J_{1}^{2 N} - J_{2}^{2 N}) \to max_{u_{1 t}^{c}, u_{2 t}^{c} \geq 0},

(13)

where

J_{i}^{j N} (x)

are the noncooperative payoffs (12) (

i, j = 1, 2

), or

(\sum_{t = 0}^{n} δ^{t} (p_{1} u_{1 t}^{c} + p_{2} u_{2 t}^{c}) - G_{1} x) (- 2 c \sum_{t = 0}^{n} δ^{t} u_{1 t}^{c} u_{2 t}^{c} - G_{2} x^{2}) \to max_{u_{1 t}^{c}, u_{2 t}^{c} \geq 0},

(14)

where

G_{1} x = J_{1}^{1 N} (x) + J_{2}^{1 N} (x) = (p_{1} + p_{2}) K_{1} x

,

G_{2} x^{2} = J_{1}^{2 N} (x) + J_{2}^{2 N} (x) = 2 K_{2} x^{2}

.

We start with the one-step game. We seek the players’ strategies in linear form

u_{11}^{c} = γ_{11} x

and

u_{21}^{c} = γ_{21} x

.

To determine cooperative strategies for this one-step game, we solve the following problem:

\begin{matrix} (H_{11}^{c} (γ_{11}^{c}, γ_{21}^{c}; x) - G_{1} x) (H_{21}^{c} (γ_{11}^{c}, γ_{21}^{c}; x) - G_{2} x^{2}) = \\ (p_{1} γ_{11}^{c} x + p_{2} γ_{21}^{c} - G_{1} x) (- 2 c γ_{11}^{c} γ_{21}^{c} x^{2} - G_{2} x^{2}) \to max_{γ_{11}^{c}, γ_{21}^{c} \geq 0} . \end{matrix}

(15)

From the first-order conditions, we obtain the strategies

\begin{matrix} γ_{11}^{c} = \frac{c G_{1} + \sqrt{c^{2} G_{1}^{2} - 6 c p_{1} p_{2} G_{2}}}{6 c p_{1}}, \\ γ_{21}^{c} = \frac{c G_{1} + \sqrt{c^{2} G_{1}^{2} - 6 c p_{1} p_{2} G_{2}}}{6 c p_{2}} . \end{matrix}

(16)

We can now consider problem (13) for the two-step game. The objective function for the first criterion for the two-step game is

H_{12}^{c} (γ_{11}^{c}, γ_{12}^{c}, γ_{12}^{c}, γ_{22}^{c}; x) = = p_{1} γ_{12}^{c} x + p_{2} γ_{22}^{c} x + δ (p_{1} γ_{11}^{c} + p_{2} γ_{21}^{c}) (ε - γ_{12}^{c} - γ_{22}^{c}) x

and that for the second criterion is

H_{22}^{c} (γ_{11}^{c}, γ_{21}^{c}, γ_{12}^{c}, γ_{22}^{c}; x) = - 2 c γ_{12}^{c} γ_{22}^{c} x^{2} - 2 c δ γ_{11}^{c} γ_{21}^{c} {(ε - γ_{12}^{c} - γ_{22}^{c})}^{2} x^{2} .

To determine cooperative strategies for this two-step game we solve the following problem:

\begin{matrix} (H_{12}^{c} (γ_{11}^{c}, γ_{21}^{c}, γ_{12}^{c}, γ_{22}^{c}; x) - G_{1} x) \cdot \\ \cdot (H_{22}^{c} (γ_{11}^{c}, γ_{21}^{c}, γ_{12}^{c}, γ_{22}^{c}; x) - G_{2} x^{2}) \to max_{γ_{11}^{c}, γ_{21}^{c}, γ_{12}^{c}, γ_{22}^{c} \geq 0} . \end{matrix}

(17)

From the first-order conditions, we obtain the relationship between the players’ strategies in the one-step and two-step games:

\begin{matrix} γ_{22}^{c} = \frac{p_{1}}{p_{2}} γ_{12}^{c}, γ_{21}^{c} = \frac{p_{1}}{p_{2}} γ_{11}^{c}, \\ γ_{12}^{c} = \frac{ε p_{2} γ_{11}^{c}}{p_{2} + γ_{11}^{c} (p_{1} + p_{2})} . \end{matrix}

(18)

The first player’s strategy on the last step

γ_{11}^{c}

takes the form

γ_{11}^{c} = p_{2} \frac{- G_{2} (p_{1} + p_{2}) + c ε G_{1} + \sqrt{c ε^{2} (c G_{1}^{2} - 6 p_{1} p_{2} G_{2} (1 + δ))}}{6 c δ ε^{2} (1 + δ) + G_{2} {(p_{1} + p_{2})}^{2} - 2 c ε G_{1} (p_{1} + p_{2})} .

(19)

By continuing the described process for the n-step game, we easily obtain the cooperative behavior.

Theorem 2.

The multicriteria cooperative equilibrium strategies in the problem (7), (8) have the form

\begin{matrix} u_{1 t}^{c} = γ_{1 t}^{c} x_{t} = \frac{ε^{t - 1} (1 - ε) p_{2} γ_{11}^{c}}{p_{2} (1 - ε) + γ_{11}^{c} (p_{1} + p_{2}) (1 - ε^{t - 1})} x_{t}, t = 2, \dots, n, \\ u_{2 t}^{c} = γ_{2 t}^{c} x_{t} = \frac{p_{1}}{p_{2}} γ_{1 t}^{c} x_{t}, t = 1, \dots, n . \end{matrix}

(20)

The first player’s strategy on the last step

γ_{11}^{c}

takes the form

\begin{matrix} γ_{11}^{c} = p_{2} (- G_{2} (p_{1} + p_{2}) \sum_{j = 0}^{n - 2} ε^{j} + c ε^{n - 1} G_{1} + ε^{n - 1} \sqrt{c (c G_{1}^{2} - 6 p_{1} p_{2} G_{2} \sum_{j = 0}^{n - 1} δ^{j})}) / \\ (6 c δ ε^{2 (n - 1) \sum_{j = 0}^{n - 1} δ^{j}} + (p_{1} + p_{2}) (- 2 c ε^{n - 1} G_{1} \sum_{j = 0}^{n - 2} ε^{j} + G_{2} (p_{1} + p_{2}) {(1 + ε)}^{2 (n - 2)})) . \end{matrix}

(21)

We performed a numerical simulation for a 50-step game with the following parameters:

ε = 1.3, p_{1} = 100, p_{2} = 150, c = 50, δ = 0.8 .

Figure 1 shows the dynamics of the population size, whereas Figure 2 shows the catch of player 1 for noncooperative and cooperative behavior. As one can notice, the cooperation is beneficial for players and, moreover, improves the ecological situation as it limits bioresource exploitation.

5. Conclusions

An approach to constructing cooperative equilibrium in multicriteria dynamic games with finite horizon is presented. Cooperative behavior design was performed adopting the Nash bargaining solution. First, we evaluated the multicriteria Nash equilibrium strategies, and players’ payoffs played the role of the status quo points [3]. Then, we constructed the multicriteria cooperative strategies and payoffs via the bargaining scheme.

We studied a bicriteria discrete-time bioresource management problem, where the players differ in their aims and have finite planning horizons. Multicriteria Nash and cooperative equilibria strategies were derived analytically in linear forms, which allows their direct application to concrete populations with appropriate parameters. The results of numerical modeling showed that the presented approach stimulates cooperation, as it more beneficial for players to cooperate. Moreover, an important result related to ecological systems is that the cooperative behavior determined in such way leads to sparing the exploitation rate and improves the ecological situation.

Funding

This research was supported by the Russian Science Foundation, project no. 17-11-01079.

Conflicts of Interest

The author declares no conflict of interest.

References

Voorneveld, M.; Grahn, S.; Dufwenberg, M. Ideal equilibria in noncooperative multicriteria games. Math. Methods Oper. Res. 2000, 52, 65–77. [Google Scholar] [CrossRef]
Pusillo, L.; Tijs, S. E-equilibria for multicriteria games. Ann. ISDG 2013, 12, 217–228. [Google Scholar]
Rettieva, A.N. Equilibria in dynamic multicriteria games. IGTR 2017, 19, 1750002. [Google Scholar] [CrossRef]
Shapley, L.S. Equilibrium points in games with vector payoffs. Nav. Res. Logist. Q. 1959, 6, 57–61. [Google Scholar] [CrossRef]
Patrone, F.; Pusillo, L.; Tijs, S.H. Multicriteria games and potentials. TOP 2007, 15, 138–145. [Google Scholar] [CrossRef]
Pieri, G.; Pusillo, L. Multicriteria Partial Cooperative Games. Appl. Math. 2015, 6, 2125–2131. [Google Scholar] [CrossRef]
Rettieva, A.N. A discrete-time bioresource management problem with asymmetric players. Autom. Remote Control 2014, 75, 1665–1676. [Google Scholar] [CrossRef]

Figure 1. Population size: dark—cooperation, light—Nash equilibrium.

Figure 2. Player 1’s catch: dark—cooperation, light—Nash equilibrium.

© 2018 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rettieva, A. Dynamic Multicriteria Games with Finite Horizon. Mathematics 2018, 6, 156. https://doi.org/10.3390/math6090156

AMA Style

Rettieva A. Dynamic Multicriteria Games with Finite Horizon. Mathematics. 2018; 6(9):156. https://doi.org/10.3390/math6090156

Chicago/Turabian Style

Rettieva, Anna. 2018. "Dynamic Multicriteria Games with Finite Horizon" Mathematics 6, no. 9: 156. https://doi.org/10.3390/math6090156

APA Style

Rettieva, A. (2018). Dynamic Multicriteria Games with Finite Horizon. Mathematics, 6(9), 156. https://doi.org/10.3390/math6090156

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Dynamic Multicriteria Games with Finite Horizon

Abstract

1. Introduction

2. Multicriteria Games and Solution Concepts

3. Dynamic Multicriteria Model with Finite Horizon and Solution Concepts

3.1. Multicriteria Nash Equilibrium

3.2. Multicriteria Cooperative Equilibrium

4. Dynamic Multicriteria Model with Finite Harvesting Times

4.1. Multicriteria Nash Equilibrium

4.2. Cooperative Equilibrium

5. Conclusions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI