Numerical Solution of Open-Loop Nash Differential Games Based on the Legendre Tau Method

Dehghan Banadaki, Mojtaba; Navidi, Hamidreza

doi:10.3390/g11030028


by Mojtaba Dehghan Banadaki and Hamidreza Navidi ^*

Department of Applied Mathematics, Shahed University, Tehran P.O. Box 18151-159, Iran

^* Author to whom correspondence should be addressed.

Games 2020, 11(3), 28; https://doi.org/10.3390/g11030028

Submission received: 28 February 2020 / Revised: 6 April 2020 / Accepted: 9 April 2020 / Published: 23 July 2020


Abstract

In this paper, an efficient implementation of the Tau method is presented for finding the open-loop Nash equilibrium of noncooperative nonzero-sum two-player differential game problems with a finite-time horizon. In this approach, the two-point boundary value problem derived from Pontryagin’s maximum principle is reduced to a system of algebraic equations that can be solved numerically. Finally, a differential game arising from bioeconomics among firms harvesting a common renewable resource is included to illustrate the accuracy and efficiency of the proposed method, and a comparison is made with the result obtained by the fourth-order Runge–Kutta method.

Keywords: differential game theory; open-loop Nash equilibrium; Pontryagin’s maximum principle; Tau method; bioeconomics

1. Introduction

Differential game theory, a natural extension of optimal control theory, deals with problems in which each control agent (player) tries to maximize his own profit, which conflicts with the profits of the others. It has received considerable attention in economics and management sciences in recent decades and covers a large area in macroeconomics, microeconomics, resource management and bioeconomics. Applications of this theory have been considered in many textbooks. In [1], an introduction to the theory of noncooperative differential games and its applications, such as marketing, natural resources and environmental economics, is offered. Advertising competition and the Lanchester model are studied in [2]. Both deterministic and stochastic cooperative differential games are covered in [3], together with some applications in resource and environmental economics.

The Nash strategy is regarded as an equilibrium solution for simultaneous games, in which players cannot improve their payoffs by deviating unilaterally from it [4]. There exist two main types of equilibrium solutions for differential games, namely, closed-loop (or feedback) and open-loop. In a closed-loop equilibrium, each player’s strategy is a function of time and the state variables, whereas in an open-loop equilibrium, the strategy of each player is a function of time and the initial state. To identify the open-loop Nash equilibrium in a differential game, the system of two-point boundary value problems (TPBVPs) derived from Pontryagin’s maximum principle as the necessary conditions for the existence of an open-loop Nash equilibrium must be solved [5]. With this approach, the differential game is reduced to a system of ordinary differential equations with boundary conditions, which can be treated using well-known analytical and numerical techniques for such systems [6]. Because analytical solutions of differential game problems are not always available, solving them numerically is the most practical way to treat them. The main research studies in this field focus on obtaining open-loop Nash equilibria in linear quadratic dynamic games [7,8,9,10,11]. In [12], the solution of a nonlinear differential game arising from a pollution control problem is considered. The quasi-equilibrium of a special case of nonlinear differential games is found by studying the state-dependent Riccati equations [13]. In [14], a dynamic programming approach is presented to obtain the saddle point of a class of nonlinear zero-sum differential games.

One of the best methods in terms of accuracy and efficiency, for a numerical solution of different kinds of differential equations by means of truncated series of orthogonal polynomials, is the spectral method [15,16,17,18,19]. There are three well-known spectral methods, namely, the Galerkin, Tau, and collocation methods, and the selection of the suitable spectral method depends on the type of differential equation and the boundary conditions governed by it [20,21]. The aim of this paper is to propose a numerical approach based on Pontryagin’s maximum principle and the Tau method to find the open-loop Nash equilibrium of noncooperative nonzero-sum differential games.

The remainder of the paper is organized as follows: In Section 2, the definition of a noncooperative nonzero-sum two-player differential game, the open-loop Nash equilibrium, and the analytical form of the necessary conditions for an open-loop Nash equilibrium are reviewed. In Section 3, the Tau method for obtaining the open-loop Nash equilibrium of such games is introduced. In Section 4, a differential game arising from bioeconomics is presented to illustrate the accuracy and efficiency of the proposed method. Finally, Section 5 concludes the paper.

2. Problem Statement

In this section, we deal with a noncooperative nonzero-sum two-player differential game that is described by the following definition:

Definition 1.

A noncooperative nonzero-sum two-player differential game is defined as follows [22]:

\begin{array}{l} \max_{u_{i} (.)} J_{i} (u_{i} (.), u_{j} (.)) = \max_{u_{i} (.)} \int_{0}^{T} L_{i} (t, x (t), u_{i} (t), u_{j} (t)) d t + ψ_{i} (x (T)) \\ \overset{\cdot}{x} (t) = f (t, x (t), u_{1} (t), u_{2} (t)) \\ x (0) = x_{0} \in R \end{array}

(1)

with $i, j \in \{1, 2\}$ and $i \neq j$.

In the performance index $J_{i} (u_{i} (.), u_{j} (.))$ given in (1), $u_{i} (.)$ and $u_{j} (.)$ are the controls (strategies) of players $i$ and $j$, respectively; the function $L_{i}$ is player $i$’s instantaneous payoff, and the function $ψ_{i}$ is his terminal payoff. The goal of each player is to maximize his own performance index by choosing a suitable control action $u_{i}$, $i = 1, 2$.

A player’s open-loop strategy is the planned time path of his action. This type of equilibrium concept is time consistent, meaning that along the equilibrium path, no player is incentivized to deviate from his original plan [23]. Thus, the definition of an open-loop solution concept (equilibrium) can be as follows:

Definition 2.

The ordered pair $(ϕ^{1}, ϕ^{2})$ of functions $ϕ^{i} : [0, T] \to R$, $i = 1, 2$, is called an open-loop Nash equilibrium if, for each $i$, an optimal control path $u_{i}$ of problem (1) exists and is given by the open-loop Nash strategy $u_{i} = ϕ^{i}$ [1].

An open-loop Nash equilibrium is characterized by the first-order necessary conditions of optimality for the nonzero-sum differential game (1). To formulate these conditions, the Hamiltonian functions are introduced as follows [24]:

H_{i} (t, x, u_{i}, u_{j}, λ_{i}) = L_{i} (t, x, u_{i}, u_{j}) + λ_{i} f (t, x, u_{i}, u_{j}), i, j \in {1, 2}, i \neq j,

where the variables $λ_{i}$, $i = 1, 2$, are called the costate variables or the adjoint variables associated with the state variable $x$.

To simplify the notation in the Hamiltonian functions, the time dependence has been omitted in the functions $x, u_{i}, u_{j}, λ_{i}$.^1

Assuming that all functions $f, L_{1}, L_{2}, ψ_{1}, ψ_{2}$ in (1) are continuously differentiable, first-order necessary conditions for optimality are provided by Pontryagin’s maximum principle.

Based on Pontryagin’s maximum principle, the set of necessary conditions for the open-loop Nash equilibrium of a nonzero-sum differential game is obtained as follows:

\overset{\cdot}{x} = f (t, x, u_{1}, u_{2})

(2)

\overset{\cdot}{λ_{i}} = - \frac{\partial H_{i}}{\partial x} (t, x, u_{1}, u_{2}, λ_{i})

(3)

\frac{\partial H_{i}}{\partial u_{i}} (t, x, u_{i}, u_{j}, λ_{i}) = 0

(4)

x (0) = x_{0}

λ_{i} (T) = \frac{\partial ψ_{i} (x (T))}{\partial x}

with $i, j \in \{1, 2\}$ and $i \neq j$.

The algebraic Equation (4) can be solved to obtain an expression for $u_{i}$, $i = 1, 2$, in terms of $x$ and $λ_{i}$; that is,

u_{i} = ϕ_{i} (t, x, λ_{i}) .

Substituting this expression into Equations (2) and (3), a system of differential equations is obtained involving only $t$, $x$ and $λ_{i}$, $i = 1, 2$. This system of TPBVPs can be expressed as:

\overset{\cdot}{x} = f (t, x, ϕ_{1}, ϕ_{2})

(5)

\overset{\cdot}{λ_{i}} = - \frac{\partial H_{i}}{\partial x} (t, x, ϕ_{1}, ϕ_{2}, λ_{i})

(6)

x (0) = x_{0}

(7)

λ_{i} (T) = \frac{\partial ψ_{i} (x (T))}{\partial x}

(8)

where $ϕ_{i} = ϕ_{i} (t, x, λ_{i})$ for $i = 1, 2$.

In general, this system of TPBVPs is nonlinear with split boundary values, hence obtaining an exact and analytical solution for the open-loop Nash equilibrium is difficult. Therefore, using a suitable numerical method is indispensable.

3. The Tau Method for Nonzero-Sum Differential Games

In this section, the implementation of the Tau method for solving the system of TPBVPs and finding the open-loop Nash equilibrium of a nonzero-sum differential game is presented.

The fundamental idea of this approach is the expansion of a function $f (x) \in L_{w}^{2} (- 1, 1)$ into a finite series of basis functions as

f (x) \approx f_{N} (x) = \sum_{i = 0}^{N} f_{i} P_{i} (x),

where $P_{i} (x)$, $i = 0, 1, \dots, N$, are Legendre polynomials and $f_{i}$, $i = 0, 1, \dots, N$, are spectral coefficients [25].

Definition 3.

The Legendre polynomials $P_{n} (x)$, $n = 0, 1, 2, \dots$, are the eigenfunctions of the singular Sturm–Liouville problem

(1 - x^{2}) P_{n}'' (x) - 2 x P_{n}' (x) + n (n + 1) P_{n} (x) = 0.

They are orthogonal on the interval $[- 1, 1]$ with respect to the weight function $w (x) = 1$ and satisfy the following recurrence formula:

P_{n + 1} (x) = \frac{2 n + 1}{n + 1} x P_{n} (x) - \frac{n}{n + 1} P_{n - 1} (x), n = 1, 2, \dots,

where $P_{0} (x) = 1$ and $P_{1} (x) = x$.
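The three-term recurrence of Definition 3 can be sketched in a few lines of Python. This is only an illustration, not code from the paper: the function name `legendre_recurrence` is hypothetical, and NumPy's `numpy.polynomial.legendre.legvander` is used solely as an independent reference.

```python
import numpy as np
from numpy.polynomial import legendre as leg


def legendre_recurrence(n_max, x):
    """Return P with P[n] = P_n(x) for n = 0..n_max (hypothetical helper)."""
    x = np.asarray(x, dtype=float)
    P = np.empty((n_max + 1,) + x.shape)
    P[0] = 1.0                      # P_0(x) = 1
    if n_max >= 1:
        P[1] = x                    # P_1(x) = x
    for n in range(1, n_max):
        # P_{n+1} = ((2n+1) x P_n - n P_{n-1}) / (n+1)
        P[n + 1] = ((2 * n + 1) * x * P[n] - n * P[n - 1]) / (n + 1)
    return P


x = np.linspace(-1.0, 1.0, 7)
P = legendre_recurrence(5, x)
reference = leg.legvander(x, 5).T   # NumPy's own evaluation, row n = P_n
print(np.max(np.abs(P - reference)))
```

The recurrence needs only the two previous polynomials, so evaluating all degrees up to $N$ costs $O(N)$ operations per point.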

Theorem 1.

Let $f (x) \in H_{w}^{k} (- 1, 1)$ (Sobolev space) and let $f_{N} (x) = \sum_{i = 0}^{N} f_{i} P_{i} (x)$ be the best approximation of $f (x)$ in the $L_{w}^{2}$-norm. Then

\| f (x) - f_{N} (x) \|_{L_{w}^{2} [- 1, 1]} \leq C_{0} N^{- k} \| f (x) \|_{H_{w}^{k} (- 1, 1)},

where $C_{0}$ is a positive constant, which depends on the selected norm and is independent of $f (x)$ and $N$.

Proof of Theorem 1. See [26]. □

From Theorem 1, it follows that the approximation rate of Legendre polynomials is $N^{- k}$.

The basic results of the presented approach and theoretical treatment of its convergence are based on the well-known Weierstrass approximation theorem.

Theorem 2.

(Weierstrass approximation theorem) Let $f \in L_{w}^{2} [- 1, 1]$ and $N \in ℕ$. Then there exists a unique $f_{N}^{*} \in P_{N}$, the space of all polynomials of degree at most $N$, such that

\| f - f_{N}^{*} \|_{w} = \inf_{f_{N} \in P_{N}} \| f - f_{N} \|_{w},

where

f_{N}^{*} (x) = \sum_{k = 0}^{N} \hat{f}_{k} η_{k} (x), \hat{f}_{k} = \frac{\langle f, η_{k} \rangle_{w}}{\| η_{k} \|_{w}^{2}},

and $\{η_{k}\}_{k = 0}^{N}$ form an $L_{w}^{2}$-orthogonal basis of $P_{N}$.

Proof of Theorem 2. See [27]. □
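As a hedged illustration of Theorems 1 and 2, the following sketch computes the coefficients $\hat{f}_{k} = \langle f, P_{k} \rangle / \| P_{k} \|^{2}$ for the Legendre basis with $w = 1$, using $\| P_{k} \|^{2} = 2 / (2 k + 1)$, and prints the $L^{2}$ error for increasing $N$. The test function $e^{x}$, the quadrature size and the helper names are assumptions chosen for the example, not part of the paper.

```python
import numpy as np
from numpy.polynomial import legendre as leg


def legendre_projection(f, N, nq=64):
    """Coefficients of the L2-best approximation of f on [-1, 1] (w = 1)."""
    s, w = leg.leggauss(nq)                 # Gauss-Legendre nodes and weights
    V = leg.legvander(s, N)                 # V[m, k] = P_k(s_m)
    norms2 = 2.0 / (2.0 * np.arange(N + 1) + 1.0)   # ||P_k||^2
    return (V.T @ (w * f(s))) / norms2      # <f, P_k> / ||P_k||^2


def l2_error(f, coef, nq=64):
    """Quadrature estimate of ||f - f_N|| in L2(-1, 1)."""
    s, w = leg.leggauss(nq)
    return np.sqrt(np.sum(w * (f(s) - leg.legval(s, coef)) ** 2))


degrees = (2, 4, 6, 8)
errors = [l2_error(np.exp, legendre_projection(np.exp, N)) for N in degrees]
for N, err in zip(degrees, errors):
    print(N, err)
```

For a smooth function such as $e^{x}$ the bound of Theorem 1 holds for every $k$, so the printed errors fall faster than any fixed power of $N$.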

To use the Legendre polynomials on the interval $[0, T]$, it is necessary to shift the defining domain by the following variable substitution:

x = \frac{2 t}{T} - 1

It is assumed that the solutions $x$ and $λ_{i}$, $i = 1, 2$, of the TPBVPs (5)–(8) are approximated by a linear combination of the shifted Legendre polynomials as follows:

x \approx x_{N} = \sum_{i = 0}^{N} a_{i} P_{i}^{*}

(9)

λ_{1} \approx λ_{1 N} = \sum_{i = 0}^{N} b_{i} P_{i}^{*}

(10)

λ_{2} \approx λ_{2 N} = \sum_{i = 0}^{N} c_{i} P_{i}^{*},

(11)

where $a_{i}$, $b_{i}$ and $c_{i}$ are unknown coefficients and $P_{i}^{*} = P_{i} (\frac{2 t}{T} - 1)$, $i = 0, \dots, N$, is the shifted Legendre polynomial on the interval $[0, T]$.

The first derivatives of $x$ and $λ_{i}$, $i = 1, 2$, can be approximated as follows:

\overset{\cdot}{x} \approx {\overset{\cdot}{x}}_{N} = \frac{2}{T} \sum_{i = 0}^{N} a_{i} P_{i} {^{*}}^{'}

(12)

{\overset{\cdot}{λ}}_{1} \approx {\overset{\cdot}{λ}}_{1 N} = \frac{2}{T} \sum_{i = 0}^{N} b_{i} P_{i} {^{*}}^{'}

(13)

{\overset{\cdot}{λ}}_{2} \approx {\overset{\cdot}{λ}}_{2 N} = \frac{2}{T} \sum_{i = 0}^{N} c_{i} P_{i} {^{*}}^{'} .

(14)

Equations (9)–(14) can be restated as the following vector forms:

x \approx x_{N} = A^{T} P^{*}

(15)

λ_{1} \approx λ_{1 N} = B^{T} P^{*}

(16)

λ_{2} \approx λ_{2 N} = C^{T} P^{*}

(17)

\overset{\cdot}{x} \approx {\overset{\cdot}{x}}_{N} = A^{T} S

(18)

{\overset{\cdot}{λ}}_{1} \approx {\overset{\cdot}{λ}}_{1 N} = B^{T} S

(19)

{\overset{\cdot}{λ}}_{2} \approx {\overset{\cdot}{λ}}_{2 N} = C^{T} S,

(20)

where

\begin{matrix} A^{T} = [a_{0}, \dots, a_{N}], B^{T} = [b_{0}, \dots, b_{N}], C^{T} = [c_{0}, \dots, c_{N}], \\ P^{*} = {[P_{0}^{*}, \dots, P_{N}^{*}]}^{T}, S = \frac{2}{T} {[P_{0} {^{*}}^{'}, \dots, P_{N} {^{*}}^{'}]}^{T} . \end{matrix}
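The factor $\frac{2}{T}$ in Equations (12)–(14) and in the vector $S$ is the chain-rule factor of the substitution $\frac{2 t}{T} - 1$. A minimal sketch, assuming NumPy's Legendre helpers and purely hypothetical coefficients `A`, checks the derivative formula against a central finite difference:

```python
import numpy as np
from numpy.polynomial import legendre as leg


def shifted_eval(coef, t, T):
    """x_N(t) = sum_i coef[i] * P_i(2t/T - 1)."""
    return leg.legval(2.0 * t / T - 1.0, coef)


def shifted_deriv(coef, t, T):
    """d/dt x_N(t): derivative in the Legendre argument times 2/T."""
    return (2.0 / T) * leg.legval(2.0 * t / T - 1.0, leg.legder(coef))


T = 2.0
A = np.array([0.3, -1.2, 0.5, 0.25])     # hypothetical coefficients a_0..a_3
t = np.linspace(0.0, T, 5)
h = 1e-6
finite_diff = (shifted_eval(A, t + h, T) - shifted_eval(A, t - h, T)) / (2 * h)
print(np.max(np.abs(shifted_deriv(A, t, T) - finite_diff)))
```

Here `legder` differentiates the series with respect to the Legendre argument, which corresponds to the primed polynomials in the text.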

To implement the Tau method, Equations (15)–(20) are first substituted into the differential Equations (5) and (6) under study to form the residuals as follows:

\begin{array}{l} R_{1} = {\overset{\cdot}{x}}_{N} - f (t, x_{N}, ϕ_{1 N}, ϕ_{2 N}) \\ R_{2} = {\overset{\cdot}{λ}}_{1 N} + \frac{\partial H_{1}}{\partial x_{N}} (t, x_{N}, ϕ_{1 N}, ϕ_{2 N}, λ_{1 N}) \\ R_{3} = {\overset{\cdot}{λ}}_{2 N} + \frac{\partial H_{2}}{\partial x_{N}} (t, x_{N}, ϕ_{1 N}, ϕ_{2 N}, λ_{2 N}) \end{array}

Then, the residuals are multiplied by $P_{i}^{*}$, $i = 0, \dots, N - 1$, integrated over the domain $[0, T]$ and finally set equal to zero. This procedure, along with the initial and boundary conditions (7) and (8), generates the following system of $3 (N + 1)$ algebraic equations:

{\begin{cases} \int_{0}^{T} R_{1} P_{i}^{*} d t = 0 \\ \int_{0}^{T} R_{2} P_{i}^{*} d t = 0 \\ \int_{0}^{T} R_{3} P_{i}^{*} d t = 0 \\ x_{N} (0) = x_{0} \\ λ_{j N} (T) = \frac{\partial ψ_{j} (x_{N} (T))}{\partial x_{N}}, j = 1, 2, \end{cases}

where the unknown coefficients of the vectors $A$, $B$ and $C$ are determined by solving this system.
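To make the procedure concrete, the sketch below applies the same steps to a deliberately simple scalar test problem that is not taken from the paper, $\dot{y} = - y$, $y (0) = 1$ on $[0, T]$, whose exact solution $e^{- t}$ is known. The residual is tested against $P_{0}^{*}, \dots, P_{N - 1}^{*}$ and the last row enforces the initial condition. The function name and the use of Gauss–Legendre quadrature for the integrals are assumptions of this sketch.

```python
import numpy as np
from numpy.polynomial import legendre as leg


def tau_linear_decay(N, T=1.0):
    """Legendre Tau coefficients for y' = -y, y(0) = 1 on [0, T]."""
    s, w = leg.leggauss(N + 2)              # exact for the polynomial integrands
    V = leg.legvander(s, N)                 # V[m, j] = P_j(s_m), s = 2t/T - 1
    # dV[m, j] = P_j'(s_m), built column by column from unit coefficient vectors
    dV = np.column_stack([leg.legval(s, leg.legder(e)) for e in np.eye(N + 1)])
    M = np.zeros((N + 1, N + 1))
    # Rows 0..N-1: integral over [0, T] of (y_N' + y_N) * P_i^* dt = 0
    M[:N, :] = (T / 2.0) * (V[:, :N].T @ (w[:, None] * ((2.0 / T) * dV + V)))
    # Row N: initial condition y_N(0) = 1, i.e. s = -1
    M[N, :] = leg.legvander(np.array([-1.0]), N)[0]
    rhs = np.zeros(N + 1)
    rhs[N] = 1.0
    return np.linalg.solve(M, rhs)


coef = tau_linear_decay(10)
y_end = leg.legval(1.0, coef)               # approximation of y(T) with T = 1
print(abs(y_end - np.exp(-1.0)))
```

Because this test problem is linear, the Tau equations form a linear system. For the nonlinear TPBVPs (5)–(8), the same construction leads to nonlinear algebraic equations that require an iterative solver.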

4. Illustrative Example

In this section, a differential game arising from a bioeconomic model is investigated to demonstrate the accuracy and efficiency of the Legendre Tau method (LTM). In this model, each firm harvests a common natural renewable resource (e.g., in a fishery).

The motivation for using this bioeconomic model is that its system of TPBVPs, in contrast to many other economic models such as the competitive advertising model of Sorger [28], is strongly nonlinear, so it can properly show the accuracy and efficiency of the presented numerical method. To check the accuracy of the presented method for this example, a comparison is made with the numerical solution obtained by using the discretization of time and the fourth-order Runge–Kutta method (RK4) with time step $Δ t = 10^{- 4}$.

The rate of change of the natural renewable resource population over the time interval $[0, T]$ is described by the following state equation and initial condition [29]:

\overset{\cdot}{x} (t) = F (x (t)) - q_{1} x (t) u_{1} (t) - q_{2} x (t) u_{2} (t), x (0) = x_{0},

where the differentiable function $F (.) : R \to R$ is the natural growth rate of the renewable resource, described by the logistic growth function $F (x (t)) = r x (t) (1 - \frac{x (t)}{k})$, where $r$ is the intrinsic growth rate and $k$ is the carrying capacity. The quantity $x (t) > 0$ is the population level of the renewable resource at time $t$, the quantities $u_{1} (t) \geq 0$ and $u_{2} (t) \geq 0$ are the harvesting efforts of the firms at time $t$, and the constants $q_{1} > 0$ and $q_{2} > 0$ denote the catchability coefficients.

The payoff of each firm over the time interval $[0, T]$ is given by

J_{1} (u_{1} (.), u_{2} (.)) = \int_{0}^{T} (π_{1} q_{1} x (t) u_{1} (t) - \frac{1}{2} u_{1}^{2} (t)) d t

for firm 1, and by

J_{2} (u_{1} (.), u_{2} (.)) = \int_{0}^{T} (π_{2} q_{2} x (t) u_{2} (t) - \frac{1}{2} u_{2}^{2} (t)) d t

for firm 2, where the constants $π_{1}$ and $π_{2}$ represent the unit price of the natural renewable resource for each firm. Furthermore, $\frac{1}{2} u_{1}^{2}$ and $\frac{1}{2} u_{2}^{2}$ are the harvesting costs at effort levels $u_{1}$ and $u_{2}$, respectively [29].

To derive the Nash equilibrium of this bioeconomic game, the Hamiltonian for each firm is defined as the following:

H_{1} (t, x, u_{1}, u_{2}, λ_{1}) = π_{1} q_{1} x u_{1} - \frac{1}{2} u_{1}^{2} + λ_{1} (F (x) - q_{1} x u_{1} - q_{2} x u_{2})

H_{2} (t, x, u_{1}, u_{2}, λ_{2}) = π_{2} q_{2} x u_{2} - \frac{1}{2} u_{2}^{2} + λ_{2} (F (x) - q_{1} x u_{1} - q_{2} x u_{2})

By maximizing $H_{1} (t, x, u_{1}, u_{2}, λ_{1})$ and $H_{2} (t, x, u_{1}, u_{2}, λ_{2})$ with respect to $u_{1}$ and $u_{2}$, respectively, the open-loop Nash equilibrium controls of firm 1 and firm 2 are determined by

\begin{matrix} \frac{\partial H_{1}}{\partial u_{1}} = 0 \Rightarrow π_{1} q_{1} x - u_{1} - λ_{1} q_{1} x = 0 \Rightarrow \\ u_{1} = q_{1} x (π_{1} - λ_{1}) . \end{matrix}

(21)

\begin{matrix} \frac{\partial H_{2}}{\partial u_{2}} = 0 \Rightarrow π_{2} q_{2} x - u_{2} - λ_{2} q_{2} x = 0 \Rightarrow \\ u_{2} = q_{2} x (π_{2} - λ_{2}) . \end{matrix}

(22)
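The stationarity condition behind Equation (21) can be checked numerically. The sketch below evaluates $H_{1}$ on a grid of effort levels and compares the grid maximizer with $q_{1} x (π_{1} - λ_{1})$. The sample values of $x$, $λ_{1}$ and $u_{2}$ are hypothetical; the model parameters are those of the standard case used for the numerical experiment in this section.

```python
import numpy as np


def H1(x, u1, u2, lam1, pi1=2.0, q1=1.0, q2=1.0, r=0.1, k=100.0):
    """Hamiltonian of firm 1 with logistic growth F(x) = r x (1 - x/k)."""
    growth = r * x * (1.0 - x / k)
    return pi1 * q1 * x * u1 - 0.5 * u1 ** 2 + lam1 * (growth - q1 * x * u1 - q2 * x * u2)


x, lam1, u2 = 0.1, 0.3, 0.12           # hypothetical sample point
u_formula = 1.0 * x * (2.0 - lam1)     # u_1 = q_1 x (pi_1 - lambda_1)
grid = np.linspace(0.0, 0.5, 5001)     # candidate efforts u_1 >= 0
u_grid = grid[np.argmax(H1(x, grid, u2, lam1))]
print(u_formula, u_grid)
```

Since $H_{1}$ is a concave quadratic in $u_{1}$, the stationary point is its unique maximizer, and the same check applies to $H_{2}$ and Equation (22).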

The adjoint dynamic of player 1 is as follows:

{\overset{\cdot}{λ}}_{1} = - \frac{\partial H_{1}}{\partial x} = - π_{1} q_{1} u_{1} - λ_{1} \overset{\cdot}{F} (x) + λ_{1} q_{1} u_{1} + λ_{1} q_{2} u_{2} .

(23)

Substituting Equations (21) and (22) into Equation (23) yields:

{\overset{\cdot}{λ}}_{1} = - π_{1} q_{1}^{2} x (π_{1} - λ_{1}) - λ_{1} \overset{\cdot}{F} (x) + λ_{1} q_{1}^{2} x (π_{1} - λ_{1}) + λ_{1} q_{2}^{2} x (π_{2} - λ_{2})

and the adjoint dynamic of player 2 is as follows:

{\overset{\cdot}{λ}}_{2} = - \frac{\partial H_{2}}{\partial x} = - π_{2} q_{2} u_{2} - λ_{2} \overset{\cdot}{F} (x) + λ_{2} q_{1} u_{1} + λ_{2} q_{2} u_{2},

(24)

where substituting Equations (21) and (22) into Equation (24) yields:

{\overset{\cdot}{λ}}_{2} = - π_{2} q_{2}^{2} x (π_{2} - λ_{2}) - λ_{2} \overset{\cdot}{F} (x) + λ_{2} q_{1}^{2} x (π_{1} - λ_{1}) + λ_{2} q_{2}^{2} x (π_{2} - λ_{2})

Therefore, the system of TPBVPs for this differential game can be expressed as follows:

\overset{\cdot}{x} = F (x) - q_{1}^{2} x^{2} (π_{1} - λ_{1}) - q_{2}^{2} x^{2} (π_{2} - λ_{2})

(25)

{\overset{\cdot}{λ}}_{1} = - π_{1} q_{1}^{2} x (π_{1} - λ_{1}) - λ_{1} \overset{\cdot}{F} (x) + λ_{1} q_{1}^{2} x (π_{1} - λ_{1}) + λ_{1} q_{2}^{2} x (π_{2} - λ_{2})

(26)

{\overset{\cdot}{λ}}_{2} = - π_{2} q_{2}^{2} x (π_{2} - λ_{2}) - λ_{2} \overset{\cdot}{F} (x) + λ_{2} q_{1}^{2} x (π_{1} - λ_{1}) + λ_{2} q_{2}^{2} x (π_{2} - λ_{2})

(27)

x (0) = x_{0}

(28)

λ_{1} (T) = 0, λ_{2} (T) = 0 .

(29)

Suppose that the unique solution of Equation (25) with the initial condition shown in Equation (28) is denoted by $y$. Furthermore, let the unique solutions of Equations (26) and (27) with the terminal conditions shown in Equation (29) be denoted by $γ_{1}$ and $γ_{2}$, respectively.

By the following theorem, the unique open-loop Nash equilibrium of the introduced bioeconomic game is characterized.

Theorem 3.

The unique open-loop Nash equilibrium for the introduced differential game is given by

u_{1} = q_{1} y (π_{1} - γ_{1})

(30)

u_{2} = q_{2} y (π_{2} - γ_{2}) .

(31)

Proof of Theorem 3.

For given controls $v_{i} \geq 0$, $i = 1, 2$, the following optimal control problems are considered:

\begin{array}{l} \max_{u_{1} \geq 0} J_{1} (u_{1} (.), v_{2} (.)) = \int_{0}^{T} (π_{1} q_{1} x u_{1} - \frac{1}{2} u_{1}^{2}) d t \\ s . t . \overset{\cdot}{x} = F (x) - q_{1} x u_{1} - q_{2} x v_{2}, x (0) = x_{0} \end{array}

and

\begin{array}{l} \max_{u_{2} \geq 0} J_{2} (v_{1} (.), u_{2} (.)) = \int_{0}^{T} (π_{2} q_{2} x u_{2} - \frac{1}{2} u_{2}^{2}) d t \\ s . t . \overset{\cdot}{x} = F (x) - q_{1} x v_{1} - q_{2} x u_{2}, x (0) = x_{0} . \end{array}

The dynamical system of these problems is linear with respect to the control variables $u_{i}$, $i = 1, 2$, and the integrand $L_{i}$ of the performance index $J_{i}$, $i = 1, 2$, is concave with respect to $u_{i}$, because

\frac{\partial^{2} L_{i}}{\partial u_{i}^{2}} = - 1 < 0, i = 1, 2 .

Therefore, these optimal control problems satisfy the existence and uniqueness conditions of the Filippov–Cesari existence theorem [30]. From this analysis, it is clear that the only candidates which satisfy these conditions are determined by Equations (30) and (31), and hence, the unique open-loop Nash equilibrium for the mentioned differential game is determined. □

The system of TPBVPs shown in Equations (25)–(29) is a system of nonlinear differential equations with split boundary values, and has no analytical solution in general. To solve it numerically by the method presented in the previous section, the numerical values of the parameters in the standard case are chosen as the following:

x_{0} = 0.1, q_{1} = q_{2} = 1, π_{1} = 2, π_{2} = 1.5, r = 0.1, k = 100, T = 1

Thus, the system of TPBVPs that should be solved numerically is as follows:

{\begin{cases} \overset{\cdot}{x} = 0.1 x - 3.501 x^{2} + x^{2} λ_{1} + x^{2} λ_{2} \\ {\overset{\cdot}{λ}}_{1} = - 4 x - 0.1 λ_{1} + 5.502 x λ_{1} - x λ_{1}^{2} - x λ_{1} λ_{2} \\ {\overset{\cdot}{λ}}_{2} = - 2.25 x - 0.1 λ_{2} + 5.002 x λ_{2} - x λ_{2}^{2} - x λ_{1} λ_{2} \\ x (0) = 0.1 \\ λ_{1} (1) = 0, λ_{2} (1) = 0. \end{cases}
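The numerical coefficients of this system follow from expanding Equations (25)–(27) with $F (x) = r x (1 - x / k)$, so that the derivative of $F$ is $r - 2 r x / k$. The short sketch below recomputes them from the chosen parameters. The dictionary keys are just labels introduced for this example.

```python
# Parameters of the standard case
x0, q1, q2, pi1, pi2, r, k, T = 0.1, 1.0, 1.0, 2.0, 1.5, 0.1, 100.0, 1.0

coeffs = {
    # x' = r x - (r/k + q1^2 pi1 + q2^2 pi2) x^2 + q1^2 x^2 lam1 + q2^2 x^2 lam2
    "x' : x": r,
    "x' : x^2": -(r / k + q1**2 * pi1 + q2**2 * pi2),
    # lam1' = -(pi1 q1)^2 x - r lam1 + (2r/k + 2 pi1 q1^2 + pi2 q2^2) x lam1 - ...
    "lam1' : x": -(pi1 * q1) ** 2,
    "lam1' : lam1": -r,
    "lam1' : x*lam1": 2 * r / k + 2 * pi1 * q1**2 + pi2 * q2**2,
    # lam2' = -(pi2 q2)^2 x - r lam2 + (2r/k + pi1 q1^2 + 2 pi2 q2^2) x lam2 - ...
    "lam2' : x": -(pi2 * q2) ** 2,
    "lam2' : lam2": -r,
    "lam2' : x*lam2": 2 * r / k + pi1 * q1**2 + 2 * pi2 * q2**2,
}

for name, value in coeffs.items():
    print(f"{name:>16s} = {value:.4f}")
```

The remaining quadratic and mixed costate terms have coefficients $- q_{1}^{2}$ or $- q_{2}^{2}$, which equal $- 1$ for the chosen parameters.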

In order to solve the above system of TPBVPs, the following approximations for $x$, $λ_{1}$ and $λ_{2}$ are considered:

\begin{array}{l} x \approx x_{N} = \sum_{i = 0}^{N} a_{i} P_{i}^{*} = A^{T} P^{*} \\ λ_{1} \approx λ_{1 N} = \sum_{i = 0}^{N} b_{i} P_{i}^{*} = B^{T} P^{*} \\ λ_{2} \approx λ_{2 N} = \sum_{i = 0}^{N} c_{i} P_{i}^{*} = C^{T} P^{*}, \end{array}

where $A^{T} = [a_{0}, \dots, a_{N}]$, $B^{T} = [b_{0}, \dots, b_{N}]$ and $C^{T} = [c_{0}, \dots, c_{N}]$ are unknown vectors and $P^{*} = {[P_{0}^{*}, \dots, P_{N}^{*}]}^{T}$ is the vector of the shifted Legendre polynomials.

These approximations are substituted into the equations of this system of TPBVPs to form the residuals as follows:

\begin{array}{l} R_{1} = \frac{2}{T} \sum_{i = 0}^{N} a_{i} P_{i} {^{*}}^{'} - 0.1 \sum_{i = 0}^{N} a_{i} P_{i}^{*} + 3.501 {(\sum_{i = 0}^{N} a_{i} P_{i}^{*})}^{2} - {(\sum_{i = 0}^{N} a_{i} P_{i}^{*})}^{2} \sum_{i = 0}^{N} b_{i} P_{i}^{*} - {(\sum_{i = 0}^{N} a_{i} P_{i}^{*})}^{2} \sum_{i = 0}^{N} c_{i} P_{i}^{*} \\ R_{2} = \frac{2}{T} \sum_{i = 0}^{N} b_{i} P_{i} {^{*}}^{'} + 4 \sum_{i = 0}^{N} a_{i} P_{i}^{*} + 0.1 \sum_{i = 0}^{N} b_{i} P_{i}^{*} - 5.502 \sum_{i = 0}^{N} a_{i} P_{i}^{*} \sum_{i = 0}^{N} b_{i} P_{i}^{*} + \sum_{i = 0}^{N} a_{i} P_{i}^{*} {(\sum_{i = 0}^{N} b_{i} P_{i}^{*})}^{2} \\ + \sum_{i = 0}^{N} a_{i} P_{i}^{*} \sum_{i = 0}^{N} b_{i} P_{i}^{*} \sum_{i = 0}^{N} c_{i} P_{i}^{*} \\ R_{3} = \frac{2}{T} \sum_{i = 0}^{N} c_{i} P_{i} {^{*}}^{'} + 2.25 \sum_{i = 0}^{N} a_{i} P_{i}^{*} + 0.1 \sum_{i = 0}^{N} c_{i} P_{i}^{*} - 5.002 \sum_{i = 0}^{N} a_{i} P_{i}^{*} \sum_{i = 0}^{N} c_{i} P_{i}^{*} + \sum_{i = 0}^{N} a_{i} P_{i}^{*} {(\sum_{i = 0}^{N} c_{i} P_{i}^{*})}^{2} \\ + \sum_{i = 0}^{N} a_{i} P_{i}^{*} \sum_{i = 0}^{N} b_{i} P_{i}^{*} \sum_{i = 0}^{N} c_{i} P_{i}^{*} . \end{array}
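A minimal sketch of how these residuals might be turned into a solvable algebraic system is given below. It is not the authors' implementation: the Gauss–Legendre quadrature, the Newton iteration with a finite-difference Jacobian, the initial guess and all function names are assumptions of this example. The residual formulas and boundary rows are those stated above with $T = 1$ and $x_{0} = 0.1$.

```python
import numpy as np
from numpy.polynomial import legendre as leg

X0, T = 0.1, 1.0


def tau_system(z, N):
    """3N weighted-residual equations plus 3 boundary rows for z = [A, B, C]."""
    a, b, c = np.split(z, 3)
    s, w = leg.leggauss(2 * N + 2)          # exact for the polynomial integrands
    V = leg.legvander(s, N)                 # V[m, i] = P_i(s_m), s = 2t/T - 1
    x, l1, l2 = V @ a, V @ b, V @ c
    d = lambda coef: (2.0 / T) * leg.legval(s, leg.legder(coef))  # time derivative
    R1 = d(a) - 0.1 * x + 3.501 * x**2 - x**2 * l1 - x**2 * l2
    R2 = d(b) + 4 * x + 0.1 * l1 - 5.502 * x * l1 + x * l1**2 + x * l1 * l2
    R3 = d(c) + 2.25 * x + 0.1 * l2 - 5.002 * x * l2 + x * l2**2 + x * l1 * l2
    test = V[:, :N]                         # test functions P_0^*, ..., P_{N-1}^*
    rows = [(T / 2.0) * (test.T @ (w * R)) for R in (R1, R2, R3)]
    bc = np.array([leg.legval(-1.0, a) - X0,    # x_N(0) = x0
                   leg.legval(1.0, b),          # lambda_1N(T) = 0
                   leg.legval(1.0, c)])         # lambda_2N(T) = 0
    return np.concatenate(rows + [bc])


def solve_tau(N=8, iters=25, h=1e-7):
    """Newton iteration with a forward-difference Jacobian (assumed solver)."""
    z = np.zeros(3 * (N + 1))
    z[0] = X0                               # initial guess: x = x0, costates = 0
    for _ in range(iters):
        F = tau_system(z, N)
        if np.max(np.abs(F)) < 1e-13:
            break
        J = np.column_stack([(tau_system(z + h * e, N) - F) / h
                             for e in np.eye(z.size)])
        z = z - np.linalg.solve(J, F)
    return np.split(z, 3)


def payoffs(a, b, c, nq=40):
    """J_1 and J_2 along the approximate equilibrium, by Gauss quadrature."""
    s, w = leg.leggauss(nq)
    x, l1, l2 = (leg.legval(s, v) for v in (a, b, c))
    u1, u2 = x * (2.0 - l1), x * (1.5 - l2)     # Equations (21) and (22)
    J1 = (T / 2.0) * np.sum(w * (2.0 * x * u1 - 0.5 * u1**2))
    J2 = (T / 2.0) * np.sum(w * (1.5 * x * u2 - 0.5 * u2**2))
    return J1, J2


A, B, C = solve_tau(N=8)
J1, J2 = payoffs(A, B, C)
print(J1, J2)    # to be compared with the LTM rows of Table 1
```

Any nonlinear solver could replace the Newton loop. The finite-difference Jacobian is chosen here only to keep the sketch short.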

The numerical results for the optimal payoff functionals $J_{1}$ and $J_{2}$ with different values of $N$ are shown in Table 1 and compared with RK4. The graphs of the approximate solutions for the open-loop Nash equilibrium for $N = 14$ are given in Figure 1.
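The paper does not spell out how RK4 was combined with the split boundary conditions. One common possibility, sketched here purely as an assumption, is single shooting: guess $λ_{1} (0)$ and $λ_{2} (0)$, integrate the numerical system forward with RK4, and adjust the guesses by Newton's method until $λ_{1} (T) = λ_{2} (T) = 0$. The step size below is coarser than the $Δ t = 10^{- 4}$ used for Table 1, and the payoff integrands are carried along as two extra state components.

```python
import numpy as np


def rhs(y):
    """Right-hand side for [x, lam1, lam2, running J1, running J2]."""
    x, l1, l2, _, _ = y
    u1, u2 = x * (2.0 - l1), x * (1.5 - l2)     # Equations (21) and (22)
    return np.array([
        0.1 * x - 3.501 * x**2 + x**2 * l1 + x**2 * l2,
        -4 * x - 0.1 * l1 + 5.502 * x * l1 - x * l1**2 - x * l1 * l2,
        -2.25 * x - 0.1 * l2 + 5.002 * x * l2 - x * l2**2 - x * l1 * l2,
        2.0 * x * u1 - 0.5 * u1**2,             # integrand of J1
        1.5 * x * u2 - 0.5 * u2**2,             # integrand of J2
    ])


def integrate(lam0, steps=200, T=1.0):
    """Classical RK4 from t = 0 to t = T for given initial costates."""
    h = T / steps
    y = np.array([0.1, lam0[0], lam0[1], 0.0, 0.0])
    for _ in range(steps):
        k1 = rhs(y)
        k2 = rhs(y + 0.5 * h * k1)
        k3 = rhs(y + 0.5 * h * k2)
        k4 = rhs(y + h * k3)
        y = y + h * (k1 + 2 * k2 + 2 * k3 + k4) / 6.0
    return y


def shoot(iters=20, eps=1e-7):
    """Newton shooting on (lam1(0), lam2(0)) so that lam1(T) = lam2(T) = 0."""
    lam0 = np.zeros(2)
    for _ in range(iters):
        F = integrate(lam0)[1:3]
        if np.max(np.abs(F)) < 1e-13:
            break
        J = np.column_stack([(integrate(lam0 + eps * e)[1:3] - F) / eps
                             for e in np.eye(2)])
        lam0 = lam0 - np.linalg.solve(J, F)
    return lam0, integrate(lam0)


lam_start, y_T = shoot()
print(lam_start, y_T[3], y_T[4])    # to be compared with the RK4 row of Table 1
```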

5. Conclusions

This paper has dealt with the numerical solution for obtaining the open-loop Nash equilibrium of nonlinear nonzero-sum differential games in a finite horizon based on the Legendre Tau method (LTM). In this method, the solution functions of the system of TPBVPs derived from Pontryagin’s maximum principle were expanded in terms of Legendre polynomials and then a system of algebraic equations was obtained. A differential game arising from a bioeconomic model was considered to demonstrate the accuracy and efficiency of the proposed method. A comparison between the presented method and the fourth-order Runge–Kutta method (RK4) was reported in Table 1.

Author Contributions

Conceptualization, all authors; investigation, all authors; methodology, all authors; software, all authors; Supervision, all authors; validation, all authors; writing–original draft, all authors; writing–review and editing, all authors. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Dockner, E.J.; Jørgensen, S.; Van Long, N.; Sorger, G. Differential Games in Economics and Management Science; Cambridge University Press: Cambridge, UK, 2000.
2. Erickson, G.M. Dynamic Models of Advertising Competition; Kluwer: Boston, MA, USA, 2003.
3. Yeung, D.W.K.; Petrosjan, L. Cooperative Stochastic Differential Games; Springer: Berlin/Heidelberg, Germany, 2005.
4. Jafari, S.; Navidi, H. A game-theoretic approach for modeling competitive diffusion over social networks. Games 2018, 9, 8.
5. Bressan, A. Bifurcation analysis of a non-cooperative differential game with one weak player. J. Differ. Equ. 2010, 248, 1297–1314.
6. Bressan, A. Noncooperative differential games. A tutorial. Milan J. Math. 2011, 79, 357–427.
7. Starr, A.; Ho, Y. Further properties of nonzero-sum differential games. J. Optim. Theory Appl. 1969, 3, 207–219.
8. Starr, A.; Ho, Y. Nonzero-sum differential games. J. Optim. Theory Appl. 1969, 3, 184–206.
9. Engwerda, J.C. LQ Dynamic Optimization and Differential Games; John Wiley and Sons: Hoboken, NJ, USA, 2005.
10. Engwerda, J.C. On the open-loop Nash equilibrium in LQ games. J. Econom. Dynam. Control 1998, 22, 729–762.
11. Engwerda, J.C. Feedback Nash equilibria in the scalar infinite horizon LQ game. Automatica 2000, 36, 135–139.
12. Kossioris, G.; Plexousakis, M.; Xepapadeas, A.; de Zeeuw, A.; Mäler, K.G. Feedback Nash equilibria for non-linear differential games in pollution control. J. Econom. Dynam. Control 2008, 32, 1312–1331.
13. Jiménez-Lizárraga, M.; Basin, M.; Rodríguez, V.; Rodríguez, P. Open-loop Nash equilibrium in polynomial differential games via state-dependent Riccati equation. Automatica 2015, 53, 155–163.
14. Zhang, H.; Wei, Q.; Liu, D. An iterative dynamic programming method for solving a class of nonlinear zero-sum differential games. Automatica 2011, 47, 207–214.
15. Canuto, C.; Hussaini, M.Y.; Quarteroni, A.; Zang, T.A. Spectral Methods: Fundamentals in Single Domains; Springer: New York, NY, USA, 2006.
16. Bhrawy, A.H.; Zaky, M.A.; Baleanu, D. New numerical approximations for space-time fractional Burgers’ equations via a Legendre spectral-collocation method. Rom. Rep. Phys. 2015, 67, 2.
17. Bhrawy, A.H. An efficient Jacobi pseudospectral approximation for nonlinear complex generalized Zakharov system. Appl. Math. Comput. 2014, 247, 30–46.
18. Doha, E.H.; Bhrawy, A.H.; Hafez, R.M. A Jacobi-Jacobi dual-Petrov-Galerkin method for third- and fifth-order differential equations. Math. Comput. Modell. 2011, 53, 1820–1832.
19. Doha, E.H.; Abd-Elhameed, W.M.; Bhrawy, A.H. Efficient spectral ultraspherical-Galerkin algorithms for the direct solution of 2nth-order linear differential equations. Appl. Math. Modell. 2009, 33, 1982–1996.
20. Guo, B.Y. Spectral Methods and Their Applications; World Scientific: Singapore, 1998.
21. Doha, E.H.; Abd-Elhameed, W.M.; Youssri, Y.H. New algorithms for solving third- and fifth-order two-point boundary value problems based on nonsymmetric generalized Jacobi Petrov–Galerkin method. J. Adv. Res. 2015, 6, 673–686.
22. Grosset, L. A note on open loop Nash equilibrium in linear-state differential games. Appl. Math. Sci. 2014, 8, 7239–7248.
23. Moosavi Mohseni, R. Mathematical Analysis of the Chaotic Behavior in Monetary Policy Games. Ph.D. Thesis, Auckland University of Technology, Auckland, New Zealand, 2019.
24. Nikooeinejad, Z.; Delavarkhalafi, A.; Heydari, M. A numerical solution of open-loop Nash equilibrium in nonlinear differential games based on Chebyshev pseudospectral method. J. Comput. Appl. Math. 2016, 300, 369–384.
25. Doha, E.H.; Bhrawy, A.H.; Hafez, R.M. On shifted Jacobi spectral method for high-order multi-point boundary value problems. Commun. Nonlinear Sci. Numer. Simul. 2012, 17, 3802–3810.
26. Canuto, C.; Hussaini, M.Y.; Quarteroni, A.; Zang, T.A. Spectral Methods in Fluid Dynamics; Springer: Berlin/Heidelberg, Germany, 1988.
27. Shen, J.; Tang, T.; Wang, L.L. Spectral Methods: Algorithms, Analysis and Applications; Springer: Berlin/Heidelberg, Germany, 2011.
28. Sorger, G. Competitive dynamic advertising: A modification of the case game. J. Econom. Dynam. Control 1989, 13, 55–80.
29. Carlson, D.A.; Leitmann, G. An extension of the coordinate transformation method for open-loop Nash equilibria. J. Optim. Theory Appl. 2004, 123, 27–47.
30. Cesari, L. Optimization-Theory and Applications: Problems with Ordinary Differential Equations; Springer: New York, NY, USA, 1983.

1	The removal of variable $t$ in the remaining parts of the paper has also been done for simplification matters.

Figure 1. Plots of the approximate open-loop Nash equilibrium for the illustrative example when $N = 14$.

Table 1. Optimal payoff functionals $J_{1}$ and $J_{2}$ for the illustrative example with LTM as compared with RK4.

Method	$N$ or $Δ t$	$J_{1}$	$J_{2}$
LTM	$N = 8$	0.016380209069964074615873141557194	0.0092479570969023022143164502992464
LTM	$N = 10$	0.016380209069964074615873178289759	0.0092479570969023022143164746205906
LTM	$N = 12$	0.016380209069964074615873178289820	0.0092479570969023022143164746206322
LTM	$N = 14$	0.016380209069964074615873178289819	0.0092479570969023022143164746206318
RK4	$Δ t = 10^{- 4}$	0.016380209069970129334132078913143	0.0092479570969076035409551440677725

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
