Abstract
In this paper, an efficient implementation of the Tau method is presented for finding the open-loop Nash equilibrium of noncooperative nonzero-sum two-player differential game problems with a finite-time horizon. Regarding this approach, the two-point boundary value problem derived from Pontryagin’s maximum principle is reduced to a system of algebraic equations that can be solved numerically. Finally, a differential game arising from bioeconomics among firms harvesting a common renewable resource is included to illustrate the accuracy and efficiency of the proposed method and a comparison is made with the result obtained by fourth order Runge–Kutta method.
1. Introduction
Differential game theory, as a natural extension of optimal control theory, deals with the problem where each control agent (player) tries to maximize his own profit, which conflicts with others, and it has received considerable attention in economics and management sciences in recent decades. It covers a large area in macroeconomics, microeconomics, resource management and bioeconomics. Some of the applications of this theory have been considered in many textbooks. In [1], an introduction to the theory of noncooperative differential games and its applications, such as marketing, natural resources and environmental economics are offered. Advertising competition and the Lanchester model are studied in [2]. Both deterministic and stochastic cooperative differential games are covered in [3], and some applications in resources and environmental economics are contained therein.
The Nash strategy is regarded as an equilibrium solution for simultaneous games, in which players cannot improve their payoffs by deviating unilaterally from it [4]. There exist two main types of equilibrium solutions for differential games, namely, closed-loop (or feedback) and open-loop. The closed-loop equilibrium is where each player’s strategy is a function of time and state variables, whereas in open-loop equilibrium, the strategy of each player is a function of time and initial state. To identify the open-loop Nash equilibrium in a differential game, the system of two-point boundary value problems (TPBVPs) derived from Pontryagin’s maximum principle as the necessary conditions for the existence of an open-loop Nash equilibrium must be solved [5]. Regarding this approach, the obtained system of TPBVPs is reduced to a system of algebraic equations that can be solved using well-known analytical and numerical techniques for systems of ordinary differential equations [6]. Solving differential game problems numerically is the most logical way to treat them as their analytical solutions are not always available. The main research studies in this field contain obtaining open-loop Nash equilibrium in linear quadratic dynamic games [7,8,9,10,11]. In [12], solving a nonlinear differential game arising from a pollution control problem is considered. The quasi-equilibrium of a special case of nonlinear differential games is found by studying the state-dependent Riccati equations [13]. In [14], a dynamic programming approach is presented to obtain the saddle point of a kind of nonlinear zero-sum differential game.
One of the best methods in terms of accuracy and efficiency, for a numerical solution of different kinds of differential equations by means of truncated series of orthogonal polynomials, is the spectral method [15,16,17,18,19]. There are three well-known spectral methods, namely, the Galerkin, Tau, and collocation methods, and the selection of the suitable spectral method depends on the type of differential equation and the boundary conditions governed by it [20,21]. The aim of this paper is to propose a numerical approach based on Pontryagin’s maximum principle and the Tau method to find the open-loop Nash equilibrium of noncooperative nonzero-sum differential games.
The remainder of the paper is organized as follows: In Section 2, the definition of a noncooperative nonzero-sum two-player differential game, open-loop Nash equilibrium, and the analytical form of the necessary conditions for an open-loop Nash equilibrium are revised. In Section 3, the Tau method for obtaining the open-loop Nash equilibrium of such games is introduced. In Section 4, a differential game arising from bioeconomics is presented to illustrate the accuracy and efficiency of the proposed method. Finally, the paper is concluded with a conclusion.
2. Problem Statement
In this section, we deal with a noncooperative nonzero-sum two-player differential game that is described by the following definition:
Definition 1.
A noncooperative nonzero-sum two-player differential game is defined as follows [22]:
withand.
In performance index given in (1), and are the controls (strategies) of players and , respectively; function is player ’s instantaneous payoff, and function is terminal payoff. The goal of game for players is maximizing their performance indices by choosing suitable control actions .
A player’s open-loop strategy is the planned time path of his action. This type of equilibrium concept is time consistent, meaning that along the equilibrium path, no player is incentivized to deviate from his original plan [23]. Thus, the definition of an open-loop solution concept (equilibrium) can be as follows:
Definition 2.
The ordered pairof functionsis called an open-loop Nash equilibrium if, for each, an optimal control pathof the problem (1) exists and is given by the open-loop Nash strategy[1].
An open-loop Nash equilibrium is characterized by introducing the Hamiltonian functions for formulating the first order necessary conditions of optimality for nonzero-sum differential games (1), and are introduced as the following [24]:
where the variables , are called the costate variables or the adjoint variables associated with the state variable .
To simplify the notation in the Hamiltonian functions, the time dependence has been neglected in the functions .1
Assuming that all functions in (1) are continuously differentiable, first order necessary conditions for optimality are provided by Pontryagin’s maximum principle.
Based on Pontryagin’s maximum principle, the set of necessary conditions for the open-loop Nash equilibrium of a nonzero-sum differential game is obtained as follows:
with and .
Algebraic Equation (4) can be solved to obtain an expression for , in terms of and ; that is,
Substituting this expression into Equations (2) and (3), a system of differential equations is obtained involving only , and , . This system of TPBVPs can be expressed as:
where for .
In general, this system of TPBVPs is nonlinear with split boundary values, hence obtaining an exact and analytical solution for the open-loop Nash equilibrium is difficult. Therefore, using a suitable numerical method is indispensable.
3. The Tau Method for Nonzero-Sum Differential Games
In this section, the implementation of the Tau method for solving the system of TPBVPs and finding the open-loop Nash equilibrium of a nonzero-sum differential game is presented.
The fundamental idea of this approach is the expansion of the function into the form of a finite series of basis functions as
where are Legendre polynomials and are spectral coefficients [25].
Definition 3.
The Legendre polynomialsare the eigenfunctions of the singular Sturm–Liouville problem
They are orthogonal on the interval with respect to the weight function and satisfy the following recurrence formula:
where
Theorem 1.
Let(Sobolev space),be the best approximation ofin, then
Proof of Theorem 1.
[26]. □
Regarding Theorem 1, it is concluded that approximation rate of Legendre polynomials is
The basic results of the presented approach and theoretical treatment of its convergence are based on the well-known Weierstrass approximation theorem.
Theorem 2.
(Weierstrass approximation theorem)Letand. Then there exists a unique, the space of all polynomials of degree at most, such that
Proof of Theorem 2.
[27]. □
To use the Legendre polynomials on interval , it is necessary to shift the defining domain by the following variable substitution:
It is assumed that the solutions and of the TPBVPs 5–8 are approximated by a linear combination of the shifted Legendre polynomials as follows:
where and are unknown coefficients and is the shifted Legendre polynomial on interval .
The first derivative of and can be approximated as follows:
Equations (9)–(14) can be restated as the following vector forms:
where
To implement the Tau method, Equations (15)–(20) are substituted at first into the understudied differential Equations (5) and 6 to form the residuals as follows:
Then, the residuals are multiplied by , integrated over the domain and finally set equal to zero. This procedure, along with the initial and boundary conditions 7 and 8, generate the following system of algebraic equations:
where unknown coefficients of the vectors , and are determined by solving it.
4. Illustrative Example
In this section, a differential game arising from a bioeconomic model is investigated to demonstrate the accuracy and efficiency of the Legendre Tau method (LTM). In this model, each firm harvests a common natural renewable resource (e.g., in a fishery).
The motivation for using this bioeconomic model is that its system of TPBVPs, in contrast to many other economic models such as the competitive advertising in Sorger [28], is a strong nonlinear one, which can properly show the accuracy and efficiency of the presented numerical method. To check the accuracy of the presented method for this example, a comparison is made with the numerical solution obtained by using the discretization of time and the fourth order Runge–Kutta method (RK4) with time step
The rate of change of the natural renewable resource population over the time interval is described by the following state equation and initial condition [29]:
where the differentiable function is the natural growth rate of the renewable resource, described by the logistic growth function as , where is an intrinsic growth rate and is a carrying capacity. The quantity is the population level of the renewable resource at time , the quantities and are the harvesting efforts of the firms at time and the constants and denote the catchability coefficients.
The payoff of each firm over the time interval is given by
for firm 1, and by
for firm 2, where constants and represent the unit price of natural renewable resource for each firm. Furthermore, and show the harvesting costs at effort levels and , respectively [29].
To derive the Nash equilibrium of this bioeconomic game, the Hamiltonian for each firm is defined as the following:
By minimizing and with respect to and , the open-loop Nash equilibriums for firm 1 and firm 2 are determined respectively by
The adjoint dynamic of player 1 is as follows:
Substituting Equation (22) into Equation (23) yields:
and the adjoint dynamic of player 2 is as follows:
where substituting Equation (22)into Equation (24) yields:
Therefore, the system of TPBVPs for this differential game can be expressed as follows:
Suppose that the unique solution of Equation (25) with the initial condition shown in Equation (28) is denoted by . Furthermore, let the unique solutions of Equations (26) and (27) with terminal conditions shown in Equation (29) be denoted by and , respectively.
By the following theorem, the unique open-loop Nash equilibrium of the introduced bioeconomic game is characterized.
Theorem 3.
The unique open-loop Nash equilibrium for the introduced differential game is given by
Proof of Theorem 3.
For given controls , , the following optimal control problems are considered:
and
The dynamical system of these problems is linear with respect to the control variables , and the integrand of performance index , , is concave with respect to , , because
Therefore, these optimal control problems satisfy the existence and uniqueness conditions of the Filippov–Cesari existence theorem [30]. From this analysis, it is clear that the only candidates which satisfy these conditions are determined by Equations (30) and (31), and hence, the unique open-loop Nash equilibrium for the mentioned differential game is determined. □
The system of TPBVPs shown in Equations (25)–(29) is a system of nonlinear differential equations with split boundary values, and has no analytical solution in general. To solve it numerically by the method presented in the previous section, the numerical values of the parameters in the standard case are chosen as the following:
Thus, the system of TPBVPs that should be solved numerically is as follows:
In order to solve the above system of TPBVPs, the following approximations for , and are considered:
where and are unknown vectors and is the vector of the shifted Legendre Polynomials.
These approximations are substituted into the equations of this system of TPBVPs to form the residuals as follows:
The numerical results for optimal payoff functionals and with different values of are shown in Table 1 and compared with RK4. The graphs of approximate solutions for open-loop Nash equilibrium for are given in Figure 1.
Table 1.
Optimal payoff functionals and for illustrative example with LTM as compared with RK4.

Figure 1.
Plots of approximate open-loop Nash equilibrium for illustrative example when .
5. Conclusions
This paper has dealt with the numerical solution for obtaining the open-loop Nash equilibrium of nonlinear nonzero-sum differential games in a finite horizon based on the Legendre Tau method (LTM). Regarding this method, the solution functions of the system of TPBVPs derived from Pontryagin’s maximum principle were expanded in terms of Legendre polynomials and then a system of algebraic equations was obtained. A differential game arising from a bioeconomic model was considered to demonstrate the accuracy and efficiency of the proposed method. A comparative study between the presented method and the fourth order Runge–Kutta method (RK4) was presented graphically.
Author Contributions
Conceptualization, all authors; investigation, all authors; methodology, all authors; software, all authors; Supervision, all authors; validation, all authors; writing–original draft, all authors; writing–review and editing, all authors. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Dockner, E.J.; Jørgensen, S.; Van Long, N.; Sorger, G. Differential Games in Economics and Management Science; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
- Erickson, G.M. Dynamic Models of Advertising Competition; Kluwer: Boston, MA, USA, 2003. [Google Scholar]
- Yeung, D.W.K.; Petrosjan, L. Cooperative Stochastic Differential Games; Springer: Berlin/Heidelberg, Germany, 2005. [Google Scholar]
- Jafari, S.; Navidi, H. A game-theoretic approach for modeling competitive diffusion over social networks. Games 2018, 9, 8. [Google Scholar] [CrossRef]
- Bressan, A. Bifurcation analysis of a non-cooperative differential game with one weak player. J. Differ. Equ. 2010, 248, 1297–1314. [Google Scholar] [CrossRef]
- Bressan, A. Noncooperative differential games. A tutorial. Milan J. Math. 2011, 79, 357–427. [Google Scholar] [CrossRef]
- Starr, A.; Ho, Y. Further propeties of nonzero-sum differential games. J. Optim. Theory Appl. 1969, 3, 207–219. [Google Scholar] [CrossRef]
- Starr, A.; Ho, Y. Nonzero-sum differential games. J. Optim. Theory Appl. 1969, 3, 184–206. [Google Scholar] [CrossRef]
- Engwerda, J.C. LQ Dynamic Optimization and Differential Games; John Wiley and Sons: Hoboken, NJ, USA, 2005. [Google Scholar]
- Engwerda, J.C. On the open-loop Nash equilibrium in LQ games. J. Econom. Dynam. Control 1998, 22, 729–762. [Google Scholar] [CrossRef]
- Engwerda, J.C. Feedback Nash equilibria in the scalar infinite horizon LQ game. Automatica 2000, 36, 135–139. [Google Scholar] [CrossRef]
- Kossiorisa, G.; Plexousakis, M.; Xepapadeas, A.; de Zeeuwe, A.; Maler, K.G. Feedback Nash equilibria for non-linear differential games in pollution control. J. Econom. Dynam. Control 2008, 32, 1312–1331. [Google Scholar] [CrossRef]
- Jiménez-Lizárraga, M.; Basin, M.; Rodríguez, V.; Rodríguez, P. Open-loop Nash equilibrium in polynomial differential games via state-dependent Riccati equation. Automatica 2015, 53, 155–163. [Google Scholar] [CrossRef]
- Zhang, H.; Wei, Q.; Liu, D. An iterative dynamic programming method for solving a class of nonlinear zero-sum differential games. Automatica 2011, 47, 207–214. [Google Scholar] [CrossRef]
- Canuto, C.; Hussaini, M.Y.; Quarteroni, A.; Zang, T.A. Spectral Methods: Fundamentals in Single Domains; Springer: New York, NY, USA, 2006. [Google Scholar]
- Bhrawy, A.H.; Zaky, M.A.; Baleanu, D. New numerical approximations for space-time fractional Burgers’ equations via a Legendre spectral-collocation method. Rom. Rep. Phys. 2015, 67, 2. [Google Scholar]
- Bhrawy, A.H. An efficient Jacobi pseudospectral approximation for nonlinear complex generalized Zakharov system. Appl. Math. Comput. 2014, 247, 30–46. [Google Scholar] [CrossRef]
- Doha, E.H.; Bhrawy, A.H.; Hafez, R.M. A Jacobi-Jacobi dual-Petrov-Galerkin method for third- and fifth-order differential equations. Math. Comput. Modell. 2011, 53, 1820–1832. [Google Scholar] [CrossRef]
- Doha, E.H.; Abd-Elhameed, W.M.; Bhrawy, A.H. Efficient spectral ultraspherical-Galerkin algorithms for the direct solution of 2nth-order linear differential equations. Appl. Math. Modell. 2009, 33, 1982–1996. [Google Scholar] [CrossRef]
- Guo, B.Y. Spectral Methods and Their Applications; World Scientific: Singapore, 1998. [Google Scholar]
- Doha, E.H.; Abd-Elhameed, W.M.; Youssri, Y.H. New algorithms for solving third- and fifth-order two- point boundary value problems based on nonsymmetric generalized Jacobi Petrov–Galerkin method. J. Adv. Res. 2015, 6, 673–686. [Google Scholar] [CrossRef] [PubMed]
- Grosset, L. A note on open loop Nash equilibrium in linear-state differential games. Appl. Math. Sci. 2014, 8, 7239–7248. [Google Scholar] [CrossRef]
- Moosavi Mohseni, R. Mathematical Analysis of the Chaotic Behavior in Monetary Policy Games. Ph.D. Thesis, Auckland University of Technology, Auckland, New Zealand, 2019. [Google Scholar]
- Nikooeinejad, Z.; Delavarkhalafi, A.; Heydari, M. A numerical solution of open-loop Nash equilibrium in nonlinear differential games based on Chebyshev pseudospectral method. J. Comput. Appl. Math. 2016, 300, 369–384. [Google Scholar] [CrossRef]
- Doha, E.H.; Bhrawy, A.H.; Hafez, R.M. On shifted Jacobi spectral method for high-order multi-point boundary value problems. Commun. Nonlinear Sci. Numer. Simul. 2012, 17, 3802–3810. [Google Scholar] [CrossRef]
- Canuto, C.; Hussaini, M.Y.; Quarteroni, A.; Zang, T.A. Spectral Methods on Fluid Dynamics; Springer: Berlin/Heidelberg, Germany, 1988. [Google Scholar]
- Shen, J.; Tang, T.; Wang, L.L. Spectral Methods, in: Algorithms, Analysis and Applications; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
- Sorger, G. Competitive dynamic advertising: A modification of the case game. J. Econom. Dynam. Control 1989, 13, 55–80. [Google Scholar] [CrossRef]
- Carlson, D.A.; Leitmann, G. An extension of the coordinate transformation method for open-loop Nash equilibria. J. Optim. Theory Appl. 2004, 123, 27–47. [Google Scholar] [CrossRef]
- Cesari, L. Optimization-Theory and Applications: Problems with Ordinary Differential Equations; Springer: New York, NY, USA, 1983. [Google Scholar]
| 1 | The removal of variable in the remaining parts of the paper has also been done for simplification matters. |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).