Next Article in Journal
f(R,T)-Gravity Model with Perfect Fluid Admitting Einstein Solitons
Previous Article in Journal
Damage Detection and Isolation from Limited Experimental Data Using Simple Simulations and Knowledge Transfer
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Nikaido Isoda-Based Hybrid Genetic Algorithm and Relaxation Method for Finding Nash Equilibrium

by
Napat Harnpornchai
1 and
Wiriyaporn Wonggattaleekam
2,*
1
Center of Excellence in Econometrics, Faculty of Economics, Chiang Mai University, Chiang Mai 50200, Thailand
2
Department of Agricultural Economy and Development, Faculty of Agriculture, Chiang Mai University, Chiang Mai 50200, Thailand
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(1), 81; https://doi.org/10.3390/math10010081
Submission received: 29 November 2021 / Revised: 22 December 2021 / Accepted: 24 December 2021 / Published: 27 December 2021
(This article belongs to the Special Issue Game Theory: Mathematical Approaches and Applications)

Abstract

:
Nash Equilibrium (NE) plays a crucial role in game theory. The relaxation method in conjunction with the Nikaido–Isoda (NI) function, namely the NI-based relaxation method, has been widely applied to the determination of NE. Genetic Algorithm (GA) with adaptive penalty is introduced and incorporated in the original NI-based relaxation method. The GA enhances the capability in the optimization step for computing the optimum response function. The optimization of the non-convex and non-concave NI function is made possible by GA. The proposed method thus combines the advantageous feature of the GA in its optimization capability and that of the relaxation method in its implementation simplicity together. The applicability of the method is shown through the illustrative examples, including the generalized Nash Equilibrium problem with nonlinear payoff functions and coupled constraints, the game with multiple strategic variables for individual players, and the non-differentiable payoff functions. All test example results suggest the appropriate crossover and mutation rate to be 0.05 and 0.002 for use in GA. These numbers are closed to the recommended values by DeJong. The proposed method shows its capability of finding correct NEs in all test examples.

1. Introduction

The determination of Nash Equilibrium (NE) is the subject of interest in game theory. Among the NE finding methods, the relaxation method that is applied in conjunction with the Nikaido–Isoda (NI) function [1], referred to as the NI-based relaxation method, has been widely recognized [2]. Further improvement includes the NI-based relaxation method with inexact line search [3] and the gradient-based NI function [4].
The NI-based relaxation method and its variants have the virtue of its conceptual simplicity. The method consists of two steps, namely the optimization and relaxation. Given the present actions of all players, the optimal actions that create the improvement in the payoffs are obtained from the optimization step. The new actions that replace the present ones are defined as the weighted combination of the optimal and present actions via the relaxation algorithm in the second step. The procedure repeats both steps until the value of NI function becomes zero. The actions that result in the zero NI function value are taken as the NE. Regarding the relaxation algorithm, it is proved to converge to an NE for a wide class of problems [5,6]. The optimal actions are generally derived in an analytical form. However, when the NI function is highly non-linear, the analytical form of the optimal actions may not be obtained. Without the optimal actions, the relaxation step cannot be processed.
To extend the relaxation method to the case of the non-differentiable NI function, the methodologies that can solve the optimization problems with non-differentiable objective functions are used. It was discussed that analytical methods are not able to handle complicated or non-differentiable payoff functions while mathematical programming approaches may not reach the NE point, due to not being reliable due to their dependence on initial searching points [7]. Meta-heuristic algorithms are more effective in dealing with non-differentiable, non-linear, complex payoff functions from their fundamental procedures [7]. Accordingly, the evolutionary computation was proposed for finding the NE points [8]. The so-called Nash dominance concept was defined and has been directly used or modified in later research [7,9,10]. The concept was implemented in Non-dominated Sorting Genetic Algorithm II (NSGAII) [11] with changing from the Pareto dominance to the defined Nash dominance. Other Nash dominance concept-based algorithms include Nash Domination Evolutionary Multiplayer Optimization (NDEMO) [9] and co-evolutionary approach [10,12]. NDMO employed Differential Evolution (DE) as its optimizer in conjunction with Nash dominance sorting. NDEMO was applied to transportation systems management electricity markets. The co-evolutionary approach was applied for strategic bidding in competitive electricity markets. Genetic Algorithm (GA) and DE were both used in that work in which a non-convex, non-smooth, bi-level optimization problem was solved. The co-evolutionary approach was later combined with a ranking technique called Nash non-dominated sorting (NNDS) for the determination of NE [12]. Genetic Algorithm (GA) and DE were again utilized as optimization tools. Instead of checking the Nash Equilibrium condition based on the Nash dominance concept, the NI function was employed for the purpose [7]. However, the new actions were still created by the evolutionary algorithm DE. The use of the NI function shows shorter runtimes than the Nash dominance sorting. Most recently, Nash Equilibrium sorting Genetic Algorithm (NESGA) was introduced to identify NE [13]. The fitness assignment was based on sorting. The methodology solved a combinatorial game problem. Around the same time, a heuristic algorithm, namely a distributed algorithm was developed to solve non-smooth aggregative games [14]. Nevertheless, the distributed algorithm imposed many mathematical assumptions and therefore was limited to the shown aggregative games.
The motivation of this work is to extend the NI-based relaxation method with its simplicity of implementation to a novel method that can cope with non-linear, non-differentiable, and combinatorial forms of payoff and NI functions. GA has such desired properties. GA is a derivative-free algorithm and is applicable to continuous as well as combinatorial optimization problems [15]. Fundamentally, GA can be used for solving non-differentiable objective functions [13] as well as non-convex and non-concave optimization problems [16,17,18]. One real-world application is the portfolio optimization with typical transaction costs [19]. Considering the potential of GA in detecting the NE from the afore-reviewed literature and its ability in solving non-linear, non-differentiable, and combinatorial forms of payoff and NI functions, GA will be employed in the present paper as the optimization tool. The GA is also combined with an adaptive penalty scheme [20] and makes the proposed hybrid GA and relaxation method applicable to the general Nash Equilibrium problem as well.
The novelty of this paper is it is the first time that presents an NI-based hybrid GA and relaxation method for finding Nash Equilibrium. The proposed method utilizes the straightforward computation of the NI function, which has significantly less complication than the dominance evaluation and sorting of those GA/DE-extended literature methods. Consequently, the contribution of the paper is providing a simple but effective computational method for detecting the NE in generalized Nash Equilibrium problems. The presented method thus combines the advantageous feature of the GA in its optimization capability and the implementation simplicity of the relaxation method together.
The comparison of the literature and presented method is summarized in Table 1.
It is noted that there are more than 110 meta-heuristic algorithms (see e.g., [21,22]) that can be used instead of GA. As mentioned before, this is an initiative paper on combining a meta-heuristic with a relaxation method to obtain a simple method for tackling the problem of non-regular payoff and NI function. The comparison of the optimization algorithms is thus beyond the scope of the paper.
The applicability of the proposed method is shown through the illustrative examples. These examples include the generalized Nash Equilibrium problem with nonlinear payoff functions and coupled constraints, the game with multiple strategic variables for individual players, and the non-differentiable payoff functions. The results are compared with those from the literature and some meta-heuristic algorithms, such as DE, Gravitational Search Algorithm—GSA [23], Inclined Planes System Optimization Algorithm—IPO [24], and the most recent algorithms including Capuchin Search Algorithm—CSA [25] as well as the distributed algorithm [14]. The proposed method shows its capability of finding correct NEs in all test examples.
The structure of the paper is as follows. After this introduction, the fundamental concept is described. The details of the proposed method are next explained. The application of the method is demonstrated via the numerical examples. The conclusions are made at the end.

2. Problem Formulation

2.1. Definitions

Consider a game of n players. Each player takes an individual action, which is represented by a vector xi in the Euclidean space R m i . All players take a collective action x = ( x 1 , . . . . , x n ) R m 1 × × R m n . Let X i R m i be an action set of player i, and ϕ i : X i R be the corresponding payoff function. Let X be the collective action set. By definition, X X 1 × × X n R m 1 × × R m n = R m . Let x = ( x 1 , . . . . , x n ) and y = ( y 1 , . . . . , y n ) be elements of the collective action set X 1 × × X n . An element ( y i | x ) ( x 1 , . . . . , x i 1 , y i , x i + 1 , , x n ) of the collective action set is interpreted as a collection of actions when the i-th player tries yi, while the remaining players still keep x j , j = 1 , , i 1 , i + 1 , , n .
Definition 1.
A point  x = ( x 1 , . . . . , x n )  is called the Nash Equilibrium point [1] if, for each i,
ϕ i ( x ) = m a x ( x i | x ) X ϕ i ( x i | x )
Definition 2.
The NI function  Ψ : ( X 1 × × X n ) × ( X 1 × × X n ) R  is defined as [1]
Ψ ( x , y ) = i = 1 n [ ϕ i ( y i | x ) ϕ i ( x ) ]
Note that
Ψ ( x , x ) = 0
Each summand of the NI function represents the improvement in payoff that a player will receive when the player changes the action from xi to yi, while all other players continue playing according to x. That means that one player changes the action, while the others do not. Thus, the function represents the sum of these improvements in the payoff. Note that the maximum value of this function is always nonnegative for a given x. Furthermore, the function is nonpositive for all feasible when it is a NE, since no player can improve his payoff at equilibrium. In consequence, each summand can be at most zero at the NE [1].
Definition 3.
An element,  x X , is referred to as a Nash Equilibrium point [26], if
m a x y X Ψ ( x , y ) = 0
In addition to the payoff functions, the action spaces of each player may depend on the actions chosen by all other players. This type of problem is referred to as the generalized Nash Equilibrium problem. The dependency among different players’ actions is generally represented by a set of equality constraints,
c e q , r ( x ) = 0   ( r = 1 , , N e q )
and a set of inequality constraints,
c i e q , q ( x ) 0   ( q = 1 , , N i e q )
where Neq and Nieq are the number of the equality and inequality constraints, respectively. Consequently, the NE needs to satisfy not only the condition (4), but also the constraints (5) and (6).

2.2. NI-Based Relaxation Method

The NI-based relaxation method starts from determining the optimum response function Z(x):
Z ( x ) = arg   max y X Ψ ( x , y ) ;   x , Z ( x ) X
Let y that is corresponding to Z(x) be denoted by ymax(x). In case of the generalized Nash Equilibrium problem, ymax(x) must fulfill all constraint requirements (5) and (6). The optimum response function returns the set of players’ actions whereby the players all try to unilaterally maximize their respective payoffs. By playing actions ymax(x) rather than x, the players approach the equilibrium.
The relaxation algorithm is applied next. The relaxation algorithm follows
x s + 1 = ( 1 α s ) x s + α s y m a x ( x s ) ,   s = 1 , 2 ,
where 0 α s 1 . Assigning xs to the NI function (2), the optimum response function y m a x ( x s ) is obtained from solving the optimization problem (7). y m a x ( x s ) is then input into (8) and results in xs+1, which is successively used in the NI function (2). The procedure repeats until the condition (4) is satisfied, i.e.,
Ψ ( x t , y m a x ( x t ) ) = 0
The xt is the NE. There are two remarks regarding the determination of the NE. First, it can be difficult to attain the zero value of the NI function. In practice, the procedure is allowed to stop when
| Ψ ( x t , y m a x ( x t ) ) | ε
ε is the tolerance and has a small value close to zero. Second, the determination of ymax(x) is normally carried out in analytical manners. However, when the NI function is complicated, the analytical form of the optimum response function may not be possible to obtain. The next section explains an alternative method of obtaining y m a x ( x s ) , from which its numerical value is computed instead of deriving the closed form solution.

3. NI-Based Hybrid Genetic Algorithm and Relaxation Method

3.1. Methodological Concept

The previous application of the relaxation method relies only on the close form solution of ymax(x) [2,3]. This section proposes a method for determining the numerical value ymax(x). The so-called Genetic Algorithm (GA) is applied for this purpose. Let Ψ m i n be the minimum value of the NI function during the computational procedure, Ψ i n t be the intermediate value during the computation loop, count be the counter of computation loop, and L be the maximum number of computation loop. The NI-based hybrid GA and relaxation method has the following steps:
  • Starting with s = 1, initialize xs which satisfies the constraints (5) and (6). Then initialize Ψ m i n and Ψ i n t to large values compared to ε, e.g., equal to 1.
  • While Ψ i n t > ε and c o u n t L , perform steps 3 to 7.
  • Using GA, compute y m a x ( x s ) from the constrained optimization problem:
    Maximize
    Ψ ( x s , y m a x ( x s ) )
    subject to
    c e q , r ( y m a x ( x s ) ) = 0 ( r = 1 , , N e q )
    c i e q , q ( y m a x ( x s ) ) 0   ( q = 1 , , N i e q )  
  • Compute Ψ ( x s , y m a x ( x s ) ) , if | Ψ ( x s , y m a x ( x s ) ) | Ψ m i n :
    • Assign | Ψ ( x s , y m a x ( x s ) ) | to Ψ m i n .
    • Assign x s to x* and y m a x ( x s ) to y*.
  • Assign | Ψ ( x s , y m a x ( x s ) ) | to Ψ i n t .
  • Compute x s + 1 = ( 1 α s ) x s + α s y m a x ( x s ) and assign x s + 1 to x s .
  • Compute count = count + 1.
After the computational procedure terminates, the NE is equal to x*. The procedure of the NI-based hybrid GA and relaxation method is summarized in Figure 1. The GA is described in the following section.

3.2. Genetic Algorithm (GA)

GA is a stochastic search technique that has a population-based nature. The algorithm comes from the concept of natural selection. GA has randomized operators in its exploring and exploiting strategy [27]. Various optimization problems were successfully solved by GA (see, for example [28,29]). The use of the population and stochastic transition scheme reduces the probability of getting local optima. GA does not require mathematical assumptions and makes use of only numerical values of the objective function. Consequently, the algorithm is applicable to non-differentiable functions. Yet, the algorithm is easy to understand and thus simple for implementation.
The computational procedure starts with a random sampling of a set of the population, i.e., trial solution set. Each element in the population is encrypted as a chromosome. The goodness of a chromosome to be the solution is measured through a fitness function. Each chromosome is selected for further reproduction based on its fitness. The offspring undergoes the cross-over and mutation to yield new chromosomes [27]. The new chromosomes form a new population generation. The evolution repeats again until the termination condition is satisfied. Normally, a prescribed number of generations is used as a termination condition.
In the present paper, the design variable of the GA procedure is y m a x ( x s ) , to which a simple binary for the real value is applied for its encryption to become a chromosome. Given xs, the fitness function F ( y m a x ( x s ) ) of a chromosome is
F ( y m a x ( x s ) ) = { Ψ ( x s , y m a x ( x s ) ) ,   y m a x ( x s )   is   feasible Ψ ( x s , y m a x ( x s ) ) r = 1 N e q λ r v r ( y m a x ( x s ) ) q = 1 N i e q κ q u q ( y m a x ( x s ) ) ,   y m a x ( x s )   is   infeasible
v r ( y m a x ( x s ) ) = m a x ( c e q , r 2 ( y m a x ( x s ) ) , 0 )
u q ( y m a x ( x s ) ) = m a x ( c i e q , q y m a x ( x s ) , 0 )
Since original GA was created for non-constrained optimization problems. Accordingly, an adaptive penalty scheme was used for handling the constraints [20,30]. The adaptive penalty scheme proves its excellency in handling a variety of constrained optimization problems [31,32]. Based on the adaptive penalty scheme, the penalty parameters are given by
λ r = | m a x [ Ψ i n f ( x s , y m a x ( x s ) ) ] | v r j e q = 1 N e q [ v j e q ] 2
κ q = | m a x [ Ψ i n f ( x s , y m a x ( x s ) ) ] | u q l e q = 1 N i e q [ u l e q ] 2
m a x [ Ψ i n f ( x s , y m a x ( x s ) ) ] is the maximum of the objective function values in the current population in the infeasible region. v r is the violation magnitude of the r-th equality constraint. v r is the average of v r over the current population. λ r is the penalty parameter for the r-th equality constraint defined at each generation. u q is the violation magnitude of the q-th inequality constraint. u q is the average of u q over the current population. κ q is the penalty parameter for the q-th inequality constraint defined at each generation. The fulfillment of the equality constraint can be made less strict by alternatively using
v r ( y m a x ( x s ) ) = m a x ( c e q , r 2 ( y m a x ( x s ) ς ) , 0 )
where ς > 0 is the tolerance of deviation from zero and ς 0 .
After obtaining the fitness of chromosomes, the well-known reproduction process, namely the roulette wheel selection [27,29] is applied. The j-th chromosome is reproduced with the probability of
P j = F j u = 1 N Pop F u
in which NPop is the population size. The fitness value Fj is obtained from (20). Two-point crossover and binary mutation are then applied to the reproduced chromosomes to result in the new population generation [27]. The whole procedure is to be explained through the use of numerical examples in the following section.

4. Illustrative Examples

The following GA parameters are used in all examples. The population size is 500. DeJong [33] suggested the crossover rate to be 0.6 and the mutation rate to be 0.001. The suggested mutation and crossover rate have been found in many GA applications [34]. All combinations of the mutation rates of 0.0005, 0.001, 0.0015, as well as 0.002 and the crossover rates of 0.5, 0.6, and 0.7 are employed herein. Ten runs of GA are carried out for each parameter combination. The mutation and crossover rate that yields the best result for each example is reported in the example. The computation is performed using Intel(R) Core(TM) i5-2430M CPU @ 2.40GHz with installed RAM 4.00 GB. The system type is a 64-bit operating system, x64-based processor.

4.1. River Basin Pollution Game

This exemplified game [35] considers three players. Each player is engaged in paper pulp producing at a chosen level xj. The players must meet environmental conditions set by a local authority. Pollutants may be expelled into the river, where they disperse. Two monitoring stations l = 1, 2 are located along the river, at which the local authority has set maximum pollutant concentration levels.
The revenue for player j is
R j ( x ) = [ d 1 d 2 ( i = 1 3 x i ) ] x j
with expenditure
F j ( x ) = ( c 1 j + c 2 j x j ) x j
The payoff for the player j is thus
ϕ j ( x ) = [ d 1 d 2 ( i = 1 3 x i ) ( c 1 j + c 2 j x j ) ] x j
The emission constraint imposed by the local authority at location l is
q l ( x ) = j = 1 3 u j l e j x j K l ;   l = 1 ,   2
The economic constants d1 and d2 determine the inverse demand law and are equal to 3.0 and 0.1, respectively. The values of c1j and c2j are given in Table 1, and Kl = 100 (l = 1, 2). ujl are the decay and transportation coefficients from player j to location l. ej is the emission coefficient of player j, which is also given in Table 2 [35].
This game is a generalized Nash Equilibrium problem in which the solution is required to satisfy the imposed constraints apart from minimizing the NI function. The NI function corresponding to the prescribed payoffs is
ψ ( x , y ) = [ d 1 d 2 ( y 1 + x 2 + x 3 ) ( c 11 + c 21 y 1 ) ] y 1 +   [ d 1 d 2 ( x 1 + y 2 + x 3 ) ( c 12 + c 22 y 2 ) ] y 2 +   [ d 1 d 2 ( x 1 + x 2 + y 3 ) ( c 13 + c 23 y 3 ) ] y 3   [ d 1 d 2 ( i = 1 3 x i ) ( c 1 j + c 2 j x j ) ] x j
The NI-based hybrid GA and relaxation method is employed to determine the NE. α s = 1 is assigned to the relaxation algorithm and ensured the constraint satisfaction of x s + 1 . The convergence tolerance for the solution ε is set to be 1.0 × 10−4. The evolution of the NI function value is plotted in Figure 2.
The obtained NE is compared to the literature [35] in Table 3.
The best solution is obtained using the crossover rate of 0.05 and the mutation rate of 0.002. The adaptive penalty scheme in GA handles the constraint in an automatic manner without the assumption of the equal burden of satisfying constraints as in the previous work [35]. As is known, the equilibrium with coupled constraints can be a multiple equilibrium and depends on how the burden of satisfying the constraints is to be distributed among the players [36]. A priori fixing of burden may lead to local optima. The population-based search of GA and the adaptive penalty scheme as used here reduces the possibility of being trapped in the local optima and makes the solution satisfy constraints at the same time.

4.2. Electricity Market Game

Consider an electricity market using the IEEE 30-bus system [26]. In this case study, it is assumed that there are three generating companies and each of them possesses several generating units, as shown in Table 4 [37]. Pg and PC are the power generation of a unit and a company, respectively.
The cost of a generating unit i is of the type
C i ( P g i ) = ( c i / 2 ) P g i 2 + d i P g i + e i
whose coefficients are given in Table 5 [37].
The standard loading condition of the IEEE 30-bus system in a selected interval is supposed to be
P l o a d ( p ) = 189.2 0.5 p
p is the electricity price. The payoff made by company j that owns ngj generating unit is
ϕ j ( P g j ) = p j = 1 n g j P g j j = 1 n g j C j ( P g j ) = ( 378.4 2 i = 1 n g P g i ) j = 1 n g j P g j   j = 1 n g j ( c j 2 ) P g j 2 + d j P g j + e j
subject to the boundary constraints P m i n   g j P g j P m a x   g j . The NI function for (28) is
ψ ( P g j , y j ) = { [ 378.4 2 ( y 1 + P g 2 + + P g 6 ) ] y 1 ( c 1 2 y 1 2 + d 1 y 1 + e 1 ) } { [ 378.4 2 ( P g 1 + P g 2 + + P g 6 ) ] P g 1 ( c 1 2 P g 1 2 + d 1 P g 1 + e 1 ) } + { [ 378.4 2 ( P g 1 + y 2 + y 3 + P g 6 ) ] ( y 2 + y 3 ) ( c 2 2 y 2 2 + d 2 y 2 + e 2 ) ( c 3 2 y 3 2 + d 3 y 3 + e 3 ) } { [ 378.4 2 ( P g 1 + P g 2 + P g 3 + P g 6 ) ] ( P g 2 + P g 3 ) ( c 2 2 P g 2 2 + d 2 P g 2 + e 2 ) ( c 3 2 P g 3 2 + d 3 P g 3 + e 3 ) } + { [ 378.4 2 ( P g 1 + + y 4 + y 5 + y 6 ) ] ( y 4 + y 5 + y 6 ) ( c 4 2 y 4 2 + d 4 y 4 + e 4 ) ( c \ 5 2 y 5 2 + d 5 y 5 + e 5 ) ( c \ 6 2 y 6 2 + d 6 y 6 + e 6 ) } { [ 378.4 2 ( P g 1 + + P g 4 + P g 5 + P g 6 ) ] ( P g 4 + P g 5 + P g 6 ) ( c 4 2 P g 4 2 + d 4 P g 4 + e 4 ) ( c \ 5 2 P g 5 2 + d 5 P g 5 + e 5 ) ( c \ 6 2 P g 6 2 + d 6 P g 6 + e 6 ) }
It is noted that the NI function is complicated. Another characteristic is that while most games contain only one strategic variable for each player, some players of this game have more than one strategic variable. The company #1 owns one generator, the company #2 owns two generators, and the company #3 owns three generators. In other words, there are two and three strategic variables for the company #2 and #3, respectively. The two characteristics of the complicated NI function and multiple strategic variables make the close-form derivation of y m a x ( x s ) not tractable. The determination of NE using the proposed NI-based hybrid GA and relaxation method with α s = 0.5 is then performed. The convergence tolerance for the solution ε is set to be 1.0 × 10−4. The evolution of NI function value is plotted in Figure 3. The search terminates after the maximum number of loops, 50, is reached. The values of the strategic variables that yield the minimum NI function, ψ m i n = 1.3 × 10 4 , are taken as the NE. The obtained NE is compared to the literature [38] in Table 6.
The best solution is obtained using the crossover rate of 0.05 and the mutation rate of 0.0015. While the original NI-based relaxation method [38] requires the concavity-convexity property in the NI function, the employed GA has no computational limitation with respect to the requirement. This is due to the capability of the GA in solving non-convex and non-concave optimization problems, see e.g., [19].

4.3. Nash–Cournot Aggregative Game of Power Generation Firms

Consider five power generation firms [14]. The communication among the firms is delineated by an undirected ring graph. The power generation firm i { 1 , , 5 } has the following payoff function:
ϕ i ( P i ) = P i p ( σ ) c i ( P i )
All the firms subject to
i = 1 5 P i = i = 1 5 d i
and
P i   m i n P i P i   m a x
The payoff is in M$. Pi is the output power of firm i in MW, di is the local demand in MW, and Pimin and Pimax are the upper and lower bounds of the output power Pi, respectively. p ( σ ) is the electricity price and is defined by
p ( σ ) = p 0 a ( i = 1 5 P i )
p 0 = 10 and a = 0.001. c i ( P i ) is the generation cost and has the following form:
c i ( P i ) = α i + β i | P i ς i | + γ i P i 2
where α i , β i , γ i , and ς i are the characteristic parameters of firm i. All of the parameters are shown in Table 7 [14].
The NI function corresponding to the payoff function is
ψ ( P , y ) = y 1 [ p 0 a ( y 1 + P 2 + P 3 + P 4 + P 5 ) ] ( α 1 + β 1 | y 1 ς 1 | + γ 1 y 1 2 ) { P 1 [ p 0 a ( P 1 + P 2 + P 3 + P 4 + P 5 ) ] ( α 1 + β 1 | P 1 ς 1 | + γ 1 P 1 2 ) } + y 2 [ p 0 a ( P 1 + y 2 + P 3 + P 4 + P 5 ) ] ( α 2 + β 2 | y 2 ς 2 | + γ 2 y 2 2 ) { P 2 [ p 0 a ( P 1 + P 2 + P 3 + P 4 + P 5 ) ] ( α 2 + β 2 | P 2 ς 2 | + γ 2 P 2 2 ) } + y 3 [ p 0 a ( P 1 + P 2 + y 3 + P 4 + P 5 ) ] ( α 3 + β 3 | y 3 ς 3 | + γ 3 y 3 2 ) { P 3 [ p 0 a ( P 1 + P 2 + P 3 + P 4 + P 5 ) ] ( α 3 + β 3 | P 3 ς 3 | + γ 3 P 3 2 ) } + y 4 [ p 0 a ( P 1 + P 2 + P 3 + y 4 + P 5 ) ] ( α 4 + β 4 | y 4 ς 4 | + γ 4 y 4 2 ) { P 4 [ p 0 a ( P 1 + P 2 + P 3 + P 4 + P 5 ) ] ( α 4 + β 4 | P 4 ς 4 | + γ 4 P 4 2 ) }   + y 5 [ p 0 a ( P 1 + P 2 + P 3 + P 4 + y 5 ) ] ( α 5 + β 5 | y 5 ς 5 | + γ 5 y 5 2 ) { P 5 [ p 0 a ( P 1 + P 2 + P 3 + P 4 + P 5 ) ] ( α 5 + β 5 | P 5 ς 5 | + γ 5 P 5 2 ) }
where P = [ P 1 P 2 P 3 P 4 P 5 ] T and y = [ y 1 y 2 y 3 y 4 y 5 ] T .
Obviously, c i ( P i ) is a non-differentiable function [39,40]. Consequently, the payoff function and the NI function are also non-differentiable and the derivative-based optimization is not possible. The derivative-free GA is used for computing the optimum response function y m a x ( x s ) . The determination of NE using the proposed NI-based hybrid GA and relaxation method with α s = 0.5 is then performed. The convergence tolerance for the solution ε is set to be 1.0 × 10−4.
The evolution of the NI function value is plotted in Figure 4. The search terminates after the maximum number of loops, 50, is reached. The values of the strategic variables that yield the minimum NI function, ψ m i n = 9.78 × 10 2 , are taken as the NE. The results are shown in Table 8.
The best solution is obtained using the crossover rate of 0.07 and the mutation rate of 0.002. The results are close to those in Figure 1 from the distributed algorithm [14]. The distributed algorithm is specifically designed for this aggregative game and thus has a limited application when it is compared with the proposed NI-based hybrid GA and relaxation method. Furthermore, the present example shows for the first that the concept of NI function is applicable to a non-differentiable payoff function having the absolute value function.

4.4. Game with Local Optima

Consider a two-player game with non-concave payoff functions [7]:
ϕ 1 ( x 1 , x 2 ) = 21 + x 1 s i n ( π x 1 ) + x 1 x 2 s i n ( π x 2 )
ϕ 2 ( x 1 , x 2 ) = 21 + x 2 s i n ( π x 2 ) + x 1 x 2 s i n ( π x 1 )
x i [ 0 , 7.5 ] ; ( i = 1 , 2 ) .
The NI function corresponding to the payoff functions is:
ψ ( x , y ) = ϕ 1 ( y 1 , x 2 ) + ϕ 2 ( x 1 , y 2 ) ϕ 1 ( x 1 , x 2 ) ϕ 2 ( x 1 , x 2 )
The NI function contains several local optima. The determination of NE using the proposed NI-based hybrid GA and relaxation method with the standard value of α s = 0.5 is then performed. The evolution of the NI function value is plotted in Figure 5. The search terminates after the convergence tolerance is satisfied. The values of the variables that yield the minimum NI function, ψ m i n = 4.2 × 10 9 , are taken as the NE. Since there is a reference solution from the DE-based method [7,41], the comparison with other heuristic algorithms is done in this example. Three additional algorithms called the Gravitational Search Algorithm (GSA) [23,42], the Inclined Planes System Optimization Algorithm (IPO) [24,43], and the Capuchin Search Algorithm (CSA) [25,44] are performed for comparison. All computational programs were coded by the inventors of those three algorithms [25,45,46]. The convergence tolerance is for checking the NE, and therefore, it is the same for all algorithms. The convergence tolerance for the solution ε is set to be 1.0 × 10−8.
The parameters used by DE, GSA, IPO, and CSA are shown in Table 9, Table 10, Table 11 and Table 12, respectively.
The result comparison is reported in Table 13. The best solution by the present method is obtained using the crossover rate of 0.05 and the mutation rate of 0.002. The results of GA are significantly the same as those from the literature for DE and GSA, while IPO and CSA cannot detect the NE. This affirms the capability of GA as an optimizer on finding the NE, similar to previous research [7,8,9,10,12,13].
It is, however, not claimed that GA is the best method. It is indeed known that the GA solution is the NE because it satisfies the optimality condition, i.e., the zero or near-zero value of the NI function. Therefore, GA proves to be effective in itself without the necessity of comparing it with the other algorithms when using the NI function as a measure of reaching NE.

5. Conclusions

A Nikaido–Isoda (NI)-based hybrid Genetic Algorithm (GA) and relaxation method for finding Nash Equilibrium (NE) is proposed. The method combines the capability of the GA in optimizing non-convex and non-concave problems together with the implementation simplicity of the relaxation method. The adaptive penalty scheme is incorporated in the GA that the proposed method is applicable to the generalized Nash Equilibrium problem as well. The examples of the method applicability include the generalized Nash Equilibrium problem with nonlinear payoff functions and coupled constraints, the game with multiple strategic variables for individual players, and the non-differentiable payoff functions. All test example results suggest the appropriate crossover and mutation rate to be 0.05 and 0.002 for use in GA. These numbers are close to the recommended values by DeJong [33]. With respect to the performance in detecting the NE, the proposed method shows its capability of finding correct NEs in all test examples.
Future research should include the comparative study of meta-heuristic algorithms for use as alternative tools in the optimization step of this hybrid method. Another research direction is to apply the proposed method to the game problems with combinatorial payoff functions.

Author Contributions

Conceptualization, N.H.; data curation, N.H.; funding acquisition, N.H.; investigation, N.H.; methodology, N.H.; resources, N.H. and W.W.; software, N.H.; supervision, N.H.; visualization, N.H.; writing—original draft, N.H.; writing—review and editing, N.H. and W.W. All authors have read and agreed to the published version of the manuscript.

Funding

Center of Excellence in Econometrics, Faculty of Economics, Chiang Mai University, Chiang Mai 50200, Thailand.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

NENash Equilibrium
GAGenetic Algorithm
DEDifferential Evolution
NINikaido–Isoda
NSGAIINon-dominated Sorting Genetic Algorithm II
NDEMONash Domination Evolutionary Multiplayer Optimization
NIDENikaido–Isoda Differential Evolution
NESGANash Equilibrium Sorting Genetic Algorithm
GSAGravitational Search Algorithm
IPOInclined Planes System Optimization Algorithm
CSACapuchin Search Algorithm

References

  1. Nikaido, H.; Isoda, K. Note on cooperative convex games. Pac. J. Math. 1955, 5, 807–815. [Google Scholar] [CrossRef]
  2. Krawczyk, J.B.; Uryasev, S. Relaxation algorithms to find Nash equilibria with economic applications. Environ. Model. Assess. 2000, 5, 63–73. [Google Scholar] [CrossRef]
  3. Heusinger, V.; Kanzow, C. Relaxation Methods for Generalized Nash Equilibrium Problems with Inexact Line Search. J. Optim. Theory Appl. 2009, 143, 159–183. [Google Scholar] [CrossRef] [Green Version]
  4. Raghunathan, A.U.; Cherian, A.; Jha, D.K. Game Theoretic Optimization via Gradient-based Nikaido-Isoda Function. In Proceedings of the 36th International Conference on Machine Learning, Long Beach, CA, USA, 10–15 June 2019; Volume 97, pp. 1–10. [Google Scholar]
  5. Uryasev, S.; Rubinstein, R.Y. On relaxation algorithms in computation of non-cooperative equilibria. IEEE Trans. Automat. Contr. 1994, 39, 1263–1267. [Google Scholar] [CrossRef]
  6. Rosen, J.B. Existence and Uniqueness of Equilibrium Points for Concave N-Person Games. Econometrica 1965, 33, 520. [Google Scholar] [CrossRef] [Green Version]
  7. He, F.; Zhang, W.; Zhang, G. A Differential Evolution Algorithm Based on Nikaido-Isoda Function for Solving Nash Equilibrium in Nonlinear Continuous Games. PLoS ONE 2016, 11, e0161634. [Google Scholar] [CrossRef] [PubMed]
  8. Lung, R.I.; Dumitrescu, D. Computing Nash Equilibria by Means of Evolutionary Computation. Int. J. Comput. Commun. Control. 2008, 3, 364–368. [Google Scholar]
  9. Koh, A. An evolutionary algorithm based on Nash Dominance for Equilibrium Problems with Equilibriun Constraints. Appl. Soft Comput. 2012, 12, 161–173. [Google Scholar] [CrossRef] [Green Version]
  10. Zaman, F.; Elsayed, S.M.; Ray, T.; Sarker, R.A. Co-evolutionary approach for strategic bidding in competitive electricity markets. Appl. Soft Comput. 2017, 51, 1–22. [Google Scholar] [CrossRef]
  11. Deb, K.; Agrawal, S.; Pratab, A.; Meyarivan, T. A Fast Elitist Non-Dominated Sorting Genetic Algorithm for Multi-Objective Optimization: NSGA-II. IEEE Trans. Evol. Comput. 2002, 6, 182–197. [Google Scholar] [CrossRef] [Green Version]
  12. Zaman, F.; Elsayed, S.; Ray, T.; Sarker, R.A. Evolutionary Algorithms for Finding Nash Equilibria in Electricity Markets. IEEE Trans. Evol. Comput. 2017, 22, 536–549. [Google Scholar] [CrossRef]
  13. Konak, A.; Kulturel-Konak, S.; Snyder, L.V. Nash equilibrium sorting genetic algorithm for simultaneous competitive maximal covering location with multiple players. Eng. Optim. 2021, 1–15. [Google Scholar] [CrossRef]
  14. Deng, Z. Distributed generalized Nash equilibrium seeking algorithm for nonsmooth aggregative games. Automatica 2021, 132, 109794. [Google Scholar] [CrossRef]
  15. Arram, A.; Ayob, M. A novel multi-parent order crossover in genetic algorithm for combinatorial optimization problems. Comput. Ind. Eng. 2019, 133, 267–274. [Google Scholar] [CrossRef]
  16. Koeh, J.G. New directions in genetic algorithm theory. Annu. Oper. Res. 1997, 75, 49–68. [Google Scholar]
  17. Haupt, R.L.; Haupt, S.E. Practical Genetic Algorithms; Wiley: Hoboken, NJ, USA, 2004. [Google Scholar]
  18. Aryanezhad, M.B.; Hemati, M. A new genetic algorithm for solving nonconvex nonlinear programming problems. Appl. Math. 2008, 199, 186–194. [Google Scholar] [CrossRef]
  19. Lu, Z.; Wang, X. Improved portfolio optimization with non-convex and non-concave cost using genetic algorithms. In Proceedings of the 2013 International Conference on Mechatronic Sciences, Electric Engineering and Computer (MEC), Shenyang, China, 20–22 December 2013; pp. 2567–2570. [Google Scholar] [CrossRef]
  20. Obadage, A.; Harnpornchai, N. Determination of point of maximum likelihood in failure domain using genetic algorithms. Int. J. Press. Vessel. Pip. 2006, 83, 276–282. [Google Scholar] [CrossRef]
  21. Moshtaghi, H.R.; Eshlaghy, A.T.; Motadel, M.R. A Comprehensive Review on Meta-Heuristic Algorithms and their Classification with Novel Approach. J. Appl. Res. Ind. Eng. 2021, 8, 63–89. [Google Scholar]
  22. Campelo, F.; Aranha, C. EC Bestiary: A Bestiary of Evolutionary, Swarm and Other Metaphor-Based Algorithms (v2.0.1). 2018. Available online: https://zenodo.org/record/1293352#.Ycl8RWjMJPY (accessed on 12 December 2021).
  23. Rashedi, E.; Nezamabadi-Pour, H.; Saryazdi, S. GSA: A Gravitational Search Algorithm. Inf. Sci. 2009, 179, 2232–2248. [Google Scholar] [CrossRef]
  24. Mozaffari, M.H.; Abdy, H.; Zahiri, S.-H. IPO: An Inclined Planes System Optimization Algorithm. Comput. Inform. 2016, 35, 222–240. [Google Scholar]
  25. Braik, M.; Sheta, A.; Al-Hiary, H. A novel meta-heuristic search algorithm for solving optimization problems: Capuchin search algorithm. Neural Comput. Appl. 2020, 33, 2515–2547. [Google Scholar] [CrossRef]
  26. Aubin, J.P. Mathematical Methods of Game and Economic Theory; Elsevier: Amsterdam, The Netherlands, 1980. [Google Scholar]
  27. Goldberg, D.E. Genetic Algorithms in Search, Optimization and Machine Learning; Addison-Wesley: Massachusetts, MA, USA, 1989. [Google Scholar]
  28. Gen, M.; Cheng, R. Genetic Algorithms and Engineering Design; John Wiley & Sons: New York, NY, USA, 1997. [Google Scholar] [CrossRef]
  29. Deb, K. Optimization for Engineering Design Algorithms and Examples; Prentice-Hall: New York, NY, USA, 1995. [Google Scholar]
  30. Barbosa, H.J.; Lemonge, A.C. A new adaptive penalty scheme for genetic algorithms. Inf. Sci. 2003, 156, 215–251. [Google Scholar] [CrossRef]
  31. Harnpornchai, N.; Chakpitak, N.; Chandarasupsang, T.; Chaikijkosi, T.; Dahal, K. Dynamic adjustment of age distribution in Human Resource Management by genetic algorithms. In Proceedings of the IEEE Congress on Evolutionary computation, Singapore, 25–28 September 2007; IEEE Press: Piscataway, NJ, USA, 2007; pp. 1234–1239. [Google Scholar]
  32. Harnpornchai, N. Genetic algorithm-aided reliability analysis. Proc. Inst. Mech. Eng. Part O J. Risk Reliab. 2011, 225, 62–80. [Google Scholar] [CrossRef]
  33. DeJong, K. Analysis of the Behavior of a Class of Genetic Adaptive. Ph.D. Thesis, University of Michigan, Ann Arbor, MI, USA, 1975. [Google Scholar]
  34. Srinivas, M.; Patnaik, L.M. Genetic algorithms: A survey. Computer 1994, 27, 17–26. [Google Scholar] [CrossRef]
  35. Haurie, A.; Krawczyk, J.B. Optimal charges on river effluent from lumped and distributed sources. Environ. Model. Assess. 1997, 2, 177–189. [Google Scholar] [CrossRef]
  36. Krawczyk, J.B.; Pourtallier, O.; Tidball, M. A steady-state satisfactory solution to an environmental game with piece-wise defined payoffs. Nat. Resour. Model. 1998, 11, 301–329. [Google Scholar] [CrossRef]
  37. Maiorano, Y.; Song, H.; Trovato, M. Dynamics of noncollusive oligopolistic electricity markets. In Proceedings of the IEEE Power Engineering Society Winter Meeting, Singapore, 23–27 January 2000; pp. 838–844. [Google Scholar]
  38. Contreras, J.; Klusch, M.; Krawczyk, J. Numerical Solutions to Nash–Cournot Equilibria in Coupled Constraint Electricity Markets. IEEE Trans. Power Syst. 2004, 19, 195–206. [Google Scholar] [CrossRef] [Green Version]
  39. Binetti, G.; Davoudi, A.; Lewis, F.L.; Naso, D.; Turchiano, B. Distributed consensus-based economic dispatch with transmission losses. IEEE Trans. Power Syst. 2014, 29, 1711–1720. [Google Scholar] [CrossRef]
  40. Deng, Z.; Liang, S.; Hong, Y. Distributed Continuous-Time Algorithms for Resource Allocation Problems over Weight-Balanced Digraphs. IEEE Trans. Cybern. 2017, 48, 3116–3125. [Google Scholar] [CrossRef]
  41. Das, S.; Suganthan, P.N. Differential Evolution: A Survey of the State-of-the-Art. IEEE Trans. Evol. Comput. 2011, 15, 4–31. [Google Scholar] [CrossRef]
  42. Rashedi, E.; Rashedi, E.; Nezamabadi-Pour, H. A comprehensive survey on gravitational search algorithm. Swarm Evol. Comput. 2018, 41, 141–158. [Google Scholar] [CrossRef]
  43. Shahraki, N.S.; Zahiri, S.-H. Inclined planes optimization algorithm in optimal architecture of MLP neural networks. In Proceedings of the 3rd International Conference on Pattern Recognition and Image Analysis (IPRIA), Shahrekord, Iran, 19–20 April 2017; pp. 189–194. [Google Scholar] [CrossRef]
  44. Braik, M. A Hybrid Multi-gene Genetic Programming with Capuchin Search Algorithm for Modeling a Nonlinear Challenge Problem: Modeling Industrial Winding Process, Case Study. Neural Process. Lett. 2021, 53, 2873–2916. [Google Scholar] [CrossRef]
  45. Rashedi, E. Gravitational Search Algorithm (GSA). MATLAB Central File Exchange. Available online: www.mathworks.com/matlabcentral/fileexchange/27756-gravitational-search-algorithm-gsa (accessed on 11 December 2021).
  46. Mohammadi, A. IPO-Inclined-Planes-System-Optimization-Algorithm. Available online: https://github.com/ali-ece/IPO-Inclined-Planes-system-Optimization-Algorithm (accessed on 11 December 2021).
Figure 1. The procedure of the NI-based hybrid GA and relaxation method.
Figure 1. The procedure of the NI-based hybrid GA and relaxation method.
Mathematics 10 00081 g001
Figure 2. Evolution of the NI function value.
Figure 2. Evolution of the NI function value.
Mathematics 10 00081 g002
Figure 3. Evolution of the NI function value.
Figure 3. Evolution of the NI function value.
Mathematics 10 00081 g003
Figure 4. Evolution of the NI function value.
Figure 4. Evolution of the NI function value.
Mathematics 10 00081 g004
Figure 5. Evolution of the NI function value.
Figure 5. Evolution of the NI function value.
Mathematics 10 00081 g005
Table 1. Comparison of methods for finding NE.
Table 1. Comparison of methods for finding NE.
MethodCheck of Nash Equilibrium ConditionCreation of New ActionsAbility to Solve Generalized Nash ProblemsHardship in Implementation
Evolutionary Computation [8]Nash Non-Dominated SortingEvolutionary Algorithm (NSGAII)Not AddressedHigh
NDEMO [9]Nash Dominance SortingEvolutionary Algorithm (DE)ApplicableHigh
NIDE [7]Nikai–Isoda Function ComputationEvolutionary Algorithm (DE)Not AddressedModerate
Co-evolutionary Approach [10,21]Nash Non-Dominated SortingCo-evolutionary Algorithm (GA and DE)ApplicableHigh
NESGA [13]Sorting of Payoff
Function Values
Evolutionary Algorithm (GA)Not AddressedHigh
Distributed Algorithm [14]Payoff Function Evaluation and SortingDynamic EquationsLimited to An Equality ConstraintHigh
NI-based Hybrid GA and Relaxation Method (Proposed Method)Nikai–Isoda Function ComputationRelaxation MethodApplicableLow
Table 2. Parameters for the river basin pollution game.
Table 2. Parameters for the river basin pollution game.
Player jc1jc2jejuj1uj2
10.100.010.506.54.853
20.120.050.255.06.250
30.150.010.755.53.750
Table 3. The NE for the river basin pollution game.
Table 3. The NE for the river basin pollution game.
Pulp Producing LevelThe Proposed Method[Krawczyk and Uryasev, 2000] [35]
x121.57021.140
x215.86016.030
x32.4402.728
Table 4. IEEE 30-bus system market data.
Table 4. IEEE 30-bus system market data.
Company #Generator # P g m i n (MW) P g m a x (MW) P C m i n (MW) P C m a x (MW)
11080080
220800130
3050
340550125
5030
6040
Table 5. Generating units cost coefficients.
Table 5. Generating units cost coefficients.
Generator # c i [$/MW2h] d i [$/MWh] e i [$/h]
10.0420
20.0351.750
30.12510
40.01663.250
50.0530
60.0530
Table 6. The NE for the electricity market game.
Table 6. The NE for the electricity market game.
Power Generation UnitThe Proposed Method (MW)(Contreras et al., 2004) [38] (MW)
P g 1 46.6646.66
P g 2 32.1532.16
P g 3 15.0115
P g 4 22.1322.13
P g 5 12.3512.33
P g 6 12.3112.33
Table 7. Characteristic parameters of all firms.
Table 7. Characteristic parameters of all firms.
ParameterFirm 1Firm 2Firm 3Firm 4Firm 5
α i 58697
β i 1210111113
γ i 2131.50.5
ς i 2548153045
di4535254030
Pimin2045102025
Pimax3755253340
Table 8. The NE for the Nash–Cournot game of power generation firms.
Table 8. The NE for the Nash–Cournot game of power generation firms.
Firm iOutput Power Pi (MW)
128.51
254.66
318.84
432.99
540.00
Table 9. Parameters used by DE.
Table 9. Parameters used by DE.
ParameterValue
Population size50
Maximum number of iterations400
Probability of crossover0.35
Mutation factor0.45
Termination tolerance0.0001
Table 10. Parameters used by GSA.
Table 10. Parameters used by GSA.
ParameterValue
Population size50
Maximum number of iterations1000
Number of runs30
G0100
α20
K050 and is decreased linearly to 1
Table 11. Parameters used by IPO.
Table 11. Parameters used by IPO.
ParameterValue
Population size50
Maximum number of iterations1000
Number of runs30
c10.72
c22.76
shift172.47
shift2188.51
scale10.04
scale20.82
Table 12. Parameters used by CSA.
Table 12. Parameters used by CSA.
ParameterValue
Population size30
Maximum number of iterations1000
Number of runs30
Velocity control constant1
Inertia parameter0.7
Balance factor0.7
Elasticity factor9
Table 13. The NE for the Nash–Cournot game of power generation firms.
Table 13. The NE for the Nash–Cournot game of power generation firms.
VariableProposed MethodDE [7]GSA [23]IOP [24]CSA [25]
x 1 6.6118088276.616.6117998883.214313.47359735
x 2 6.6118040396.616.6118014893.214314.982799
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Harnpornchai, N.; Wonggattaleekam, W. A Nikaido Isoda-Based Hybrid Genetic Algorithm and Relaxation Method for Finding Nash Equilibrium. Mathematics 2022, 10, 81. https://doi.org/10.3390/math10010081

AMA Style

Harnpornchai N, Wonggattaleekam W. A Nikaido Isoda-Based Hybrid Genetic Algorithm and Relaxation Method for Finding Nash Equilibrium. Mathematics. 2022; 10(1):81. https://doi.org/10.3390/math10010081

Chicago/Turabian Style

Harnpornchai, Napat, and Wiriyaporn Wonggattaleekam. 2022. "A Nikaido Isoda-Based Hybrid Genetic Algorithm and Relaxation Method for Finding Nash Equilibrium" Mathematics 10, no. 1: 81. https://doi.org/10.3390/math10010081

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop