Next Article in Journal
Subspace Detours Meet Gromov–Wasserstein
Previous Article in Journal
Metaheuristics in the Humanitarian Supply Chain
Previous Article in Special Issue
Short Communication: Optimally Solving the Unit-Demand Envy-Free Pricing Problem with Metric Substitutability in Cubic Time
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Branch-and-Bound Algorithm for Polymatrix Games ϵ-Proper Nash Equilibria Computation

Interdisciplinary Research Center on Smart Mobility and Logistics, Department of Mathematics, King Fahd University of Petroleum and Minerals, Dhahran 31261, Saudi Arabia
Algorithms 2021, 14(12), 365; https://doi.org/10.3390/a14120365
Submission received: 17 November 2021 / Revised: 5 December 2021 / Accepted: 14 December 2021 / Published: 16 December 2021
(This article belongs to the Special Issue Algorithmic Game Theory 2021)

Abstract

:
When several Nash equilibria exist in the game, decision-makers need to refine their choices based on some refinement concepts. To this aim, the notion of a ϵ -proper equilibria set for polymatrix games is used to develop 0–1 mixed linear programs and compute ϵ -proper Nash equilibria. A Branch-and-Bound exact arithmetics algorithm is proposed. Experimental results are provided on polymatrix games randomly generated with different sizes and densities.

1. Introduction

For quite a long time, Nash equilibrium solutions have been depicted as the most desirable outcome for games [1]. Avis and Fukuda [2], Audet et al. [3], and Audet, Belhaiza, and Hansen [4] proposed algorithmic approaches to enumerate the extreme Nash equilibria of two-player games. Daskalakis et al. [5] and Hazan and Krauthgamer [6] focused on the complexity of Nash equilibrium computation for two-player games. Etessami and Yannakis [7] focused on the complexity of approximated Nash equilibria computation for three and more player finite games.
Yanovskaya [8] used the Complementary Pivoting Method to compute Nash equilibria for polymatrix games. Howson [9], Eaves [10], and Howson and Rosenthal [11] used a similar approach. In particular, Quintas [12] demonstrated that a Nash equilibria set is a union of a finite number of convex polytopes for polymatrix games. Miller and Zucker [13] reduced the polymatrix game equilibrium computation problem to a copositive-plus linear complementarity problem (LCP) that requires a single-time application of Lemke’s algorithm [14]. On the same line, Wilson [15] adapted the Lemke and Howson algorithm [16] originally designed for two-player games to n-player games. Govindan and Wilson [17] developed polymatrix game sequences to approximate n-player games Nash equilibrium solutions. The enumeration of all polymatrix game extreme Nash equilibria was addressed in [4]. Strekalovskii and Ekhbat [18] reduced the search for Nash equilibria to a simple optimization problem. They also brought a simple proof on the existence of a Nash equilibrium for polymatrix games.
Meanwhile, an important concern emerged as decision-makers facing multiple Nash equilibria needed to refine their choices using more strict criteria. Although many refinement concepts were introduced, results on polymatrix games are quite limited. Papadimitriou and Roughgarden [19] demonstrated that the computation of a polymatrix game correlated equilibrium may be achieved in polynomial time. Belhaiza [20] demonstrated that perfect equilibria sets for polymatrix games are also unions of finite numbers of convex polytopes, possibly disjoint.
The proper refinement of the Nash equilibrium concept was introduced by Myerson [21] with the idea that reasonable players should try much harder to avoid the most costly mistakes than they should try to avoid the less costly ones. In fact, the proper refinement defines more stability conditions related to small rationality imperfections also called “trembling-hand perfection”. Myerson [21] demonstrated that there is always at least one proper equilibrium for any game in strategic form. For bimatrix games, Belhaiza et al. [22] introduced the concept of a set of ϵ -proper equilibria. They used sequences of quadratic programs to validate or reject the properness of a given Nash equilibrium point.
This paper brings theoretical and computational results on proper Nash equilibria for polymatrix games. To the best of our knowledge, no such results were published before. The sections of this paper are ordered as follows. Section 2 gives a formal description of polymatrix games. Section 3 recalls the definition of ϵ -proper equilibria and proposes a novel formulation of the ϵ -properness conditions. Section 4 proposes mixed 0–1 quadratic and linear formulations to detect ϵ -proper and non-proper Nash equilibria, and shows on a three-player polymatrix game how ϵ -proper sequences of equilibria can be analytically generated. Section 5 presents a Branch-and-Bound algorithm for polymatrix games ϵ -proper equilibria computation. The algorithm is based on a mixed 0–1 linear program combining Nash equilibrium primal and dual conditions and the ϵ -properness conditions. Finally, computational results on polymatrix games randomly generated are presented in Section 6.

2. Polymatrix Games

A polymatrix is a strategic interaction of n 2 players in a non-cooperative and normal setting, where the payoff of each player is the total sum of a series of partial payoffs. Particularly, when n = 2 , such interaction defines a bimatrix game. With N = { 1 , , n } as the set of all players, every player i N formulates his choice over a set of strategies S i = { s i 1 , . . . . , s i m i } , where | S i | = m i is finite and all element strategies are pure. A given player i’s partial payoff matrix with respect to a player j’s strategic decisions is a m i × m j matrix denoted by A i j = ( a i j k l ) . A partial payoff a i j ( s i k , s j l ) is rewarded to player i, if player i selects his strategy s i k and player j selects his strategy s j l . We define A i . = [ A i 1 . . A i j . . A i n ] the matrix of player i’s payoffs with respect to all players. The overall payoff for player i with respect to any choice ( s 1 k , , s n l ) in pure strategies of the n players, is
A i ( s 1 k , , s n l ) = j i a i j ( s i k , s j l ) .
Hence, a given player i’s payoff with respect to player j’s decisions is not related to the remaining players’ choices. Every player i sets a mixed strategy vector X i over his set of pure strategies and seeks to maximize his total game payoff. The vector X i is such that ( X i ) T = ( x i 1 , . . . . , x i m i ) , where for every k { 1 , , m i } , x i k represents the probability that player i selects his strategy s i k S i . A given player i’s mixed strategies vector X i belongs to the set S i ˜ = { X i : e m i t X i = 1 , X i 0 } , where e m i t is a one-row matrix with all entries set to 1. The game total payoff for player i is
α i ( X ) = ( X i ) T j i A i j X j = j i k = 1 m i l = 1 m j a i j k l x i k x j l .
Polymatrix games are used in some applications where the players’ payoff matrices are additive. For instance, Belhaiza et al. [23] used a polymatrix game to model a manager–controller–board of directors’ conflict. As for other strategic form games, a polymatrix game has indeed at least one Nash equilibrium as shown in [1]. We can define a Nash equilibrium X ^ = ( X ^ 1 , , X ^ n ) as a n-tuple of mixed strategies such that for any other n-tuple X = ( X ^ 1 , . . , X ^ i 1 , X i , X ^ i + 1 , . . , X ^ n ) the following condition is satisfied
( X ^ i ) T j i A i j X ^ j ( X i ) T j i A i j X ^ j , for i N .
Differently stated, a Nash equilibrium is a situation where a given player i’s payoff with respect to all remaining players is maximized, and similarly for all other players in N. A polymatrix game set of Nash equilibria N E is the union of a finite number of polytopes, also referred to as maximal Nash subsets. The set of extreme Nash equilibria is defined as the set of all different vertices of all the maximal Nash subsets. Any extreme Nash equilibrium is an extreme point of at least one of the maximal Nash subsets. Hence, a subset T NE is considered as a Nash subset, if and only if for every pair of elements ( X ^ i , Y ^ i ) , such that ( X ^ 1 , . . , X ^ i , . . , X ^ n ) T and ( Y ^ 1 , . . , Y ^ i , . . , Y ^ n ) T , we have ( X ^ 1 , . . , Y ^ i , . . , X ^ n ) T and ( Y ^ 1 , . . , X ^ i , . . , Y ^ n ) T , for all i N . Hence, X ^ i and Y ^ i are interchangeable. A given Nash subset T is considered as maximal if it is shown that it is not properly contained in any another Nash subset [24].

3. Set of ϵ -Proper Equilibria

The proper refinement originates from the assumption that reasonable players should try much harder to avoid more costly mistakes than they should try to avoid less costly ones. While it is well known that any proper equilibrium is always perfect, a perfect Nash equilibrium may be non-proper. Given a finite game in strategic form, Myerson [21] defines a proper equilibrium as the limit of an infinite sequence of ϵ -proper equilibria, when ϵ converges to zero. We here apply Myerson’s definition of the ϵ -proper equilibrium to polymatrix games. Let us first note A i · k and A i · q as the k t h and q t h rows of the payoff matrix A i · of player i, respectively.
Definition 1.
A polymatrix game profile X ^ = ( X ^ 1 , , X ^ n ) is ϵ-proper, for ϵ > 0 , if for all i N :
if j i A i j k X ^ j < j i A i j q X ^ j , then
x ^ i k ϵ x ^ i q , k , q 1 , 2 , , m i ,
and x ^ i k > 0 , k 1 , 2 , , m i .
For a given ϵ 0 and σ 0 , we now introduce the following set Ω ϵ σ such that:
Ω ϵ σ = { X = ( X 1 , , X n ) : i N , u i   and   L   such   that e m i t X i = 1 , σ x i k , k 1 , 2 , , m i , and k , q 1 , 2 , , m i , k q : j i A i j k X j j i A i j q X j + L u q k i , x i k + u k q i ϵ x i q + 1 , u k q i + u q k i 1 , with k < q , u k q i 0 , 1 . }
In the above, u k q i is a binary variable, and the parameter L + is set to a sufficiently large value. The following Proposition 1 gives a novel formulation of the ϵ -properness conditions and shows that each element of Ω ϵ σ is an ϵ -proper equilibrium.
Proposition 1.
If X = ( X 1 , , X n ) Ω ϵ σ , for ϵ > 0 and σ > 0 , then X is an ϵ-proper equilibrium.
Proof. 
Assume that X = ( X 1 , , X n ) Ω ϵ σ , for ϵ > 0 and σ > 0 . Let k and q be indices in 1 , 2 , , m i , such that k q . The inequalities u k q i + u q k i 1 ensure that the binary combination u k q i = 1 and u q k i = 1 is impossible. Moreover,
-
if u k q i = 0 and u q k i = 0 , then j i A i j k X j = j i A i j q X j ;
-
if u k q i = 1 , then ϵ x i k x i k ϵ x i q x i q implies that x i q ϵ x i k and u q k i = 0 , thus j i A i j k X j j i A i j q X j .
It follows that conditions (1) are satisfied. Finally, with 0 < σ x i k , for all k 1 , 2 , , m i , conditions (2) are satisfied. □
Proposition 2 states that every ϵ -proper equilibrium belongs to Ω ϵ σ , for every sufficiently small value of σ .
Proposition 2.
If X = ( X 1 , , X n ) is an ϵ-proper equilibrium for a given ϵ > 0 , there exists a real number σ ¯ > 0 , such that ( X 1 , , X n ) Ω ϵ σ , for 0 σ σ ¯ .
Proof. 
If a profile X = ( X 1 , , X n ) is an ϵ -proper equilibrium for some ϵ > 0 , for all i N , conditions (1) can be reformulated using the binary variables u k q i , for all k q 1 , 2 , , m i :
If j i A i j k X j < j i A i j q X j , x i k ϵ x i q , then j i A i j k X j j i A i j q X j + L u q k i , x i k + u k q i ϵ x i q + 1 , u q k i = 0 , u k q i = 1 .
If j i A i j q X j < j i A i j k X j , x i q ϵ x i k , then j i A i j q X j j i A i j k X j + L u k q i , x i q + u q k i ϵ x i k + 1 , u k q i = 0 , u q k i = 1 .
If j i A i j k X j = j i A i j q X j x i k 1 , x i q 1 , then j i A i j k X j j i A i j q X j + L u q k i , x i k + u k q i ϵ x i q + 1 , j i A i j q X j j i A i j k X j + L u k q i , x i q + u q k i ϵ x i k + 1 , u k q i = 0 , u q k i = 0 .
Finally, conditions (2) ensure the existence of a σ ¯ > 0 , such that σ ¯ x i k , for all k 1 , 2 , , m i and for all i N . Then, for every σ such that 0 < σ σ ¯ :
σ x i k , f o r a l l k 1 , 2 , , m i .
Thus, X = ( X 1 , , X n ) Ω ϵ σ for any σ , such that 0 < σ σ ¯ . □
The set Ω ϵ σ can now be called a set of ϵ -proper equilibria. Myerson [21] also showed that for every finite game in strategic form the set of proper equilibria is non-empty. Hence, such a sequence of ϵ -proper equilibria exists for every polymatrix game. Finding such sequences is not an easy task. While we proposed a 0–1 mixed quadratic programming approach to generate such sequences for bimatrix games [22], we now extend our approach to polymatrix games.

4. Computation of ϵ Proper Equilibria

Trying analytically to obtain such sequences of ϵ -proper equilibria would be unsuccessful if the equilibrium is non-proper. Therefore, one should try to detect equilibrium profiles that could help generate such sequences, and eliminate equilibrium profiles offering no possibility. To do so, we propose 0–1 mixed quadratic formulations that help detect the existence of ϵ -proper equilibria. To overcome the numerical noise that may appear while performing quadratic optimization with infinitesimal ϵ , we propose a 0–1 mixed linear formulation and implement it using exact arithmetics. We also show how the analytical computation of ϵ -proper equilibria could be performed, provided that ϵ -proper equilibria close to a Nash equilibrium are detected.

4.1. Mixed 0–1 Quadratic Formulations

We now propose a sequence of parametrized 0–1 mixed quadratic programs with optimal solutions defining a sequence of ϵ -proper equilibria while σ converges to 0.
Proposition 3.
A Nash equilibrium X ^ = ( X ^ 1 , , X ^ n ) is proper if and only if the following 0–1 mixed quadratic program ( Q ) is feasible for all σ ¯ > 0 , and if lim σ 0 + ϵ ( σ ) = 0 .
( Q ) ϵ ( σ ) = min ϵ s u b j e c t   t o x ^ i k ϵ x i k x ^ i k + ϵ , i N , k 1 , 2 , , m i , ( X 1 , , X n ) Ω ϵ σ , 0 ϵ 1 .
Proof. 
Assume that ( X 1 ( σ ) , , X n ( σ ) , ϵ ( σ ) ) is the optimal solution of ( Q ) for a known Nash equilibrium X ^ = ( X ^ 1 , , X ^ n ) . Proposition 1 asserts that ( X 1 ( σ ) , , X n ( σ ) ) is an ϵ ( σ ) -proper equilibrium. In addition, the convergence conditions are here reformulated through the minimization of ϵ such that x ^ i k ϵ x i k x ^ i k + ϵ , & k 1 , 2 , , m i , & i N , to make sure that the ϵ -proper equilibrium converges towards ( X ^ 1 , , X ^ n ) .
Hence, if the 0–1 mixed quadratic program ( Q ) is feasible for all σ > 0 and converging to 0, we may conclude from proposition (2) that there always exists an ϵ -proper equilibrium ( X 1 , , X n ) Ω ϵ σ .
In addition, if the Nash equilibrium X ^ = ( X ^ 1 , , X ^ n ) is proper, the optimal objective value ϵ ( σ ) converges necessarily towards 0, when σ > 0 converges to 0, to force the solution ( X 1 ( σ ) , , X n ( σ ) ) to converge towards ( X ^ 1 , , X ^ n ) simultaneously. We can also add that ϵ ( 0 ) = 0 in this case.
Otherwise, if ϵ ( σ ) does not converge towards 0, when σ > 0 converges to 0, such ϵ -proper sequence ( X 1 ( σ ) , , X n ( σ ) ) does not even exist, when ϵ converges to 0. In this case, it is obvious that the given Nash equilibrium X ^ cannot be proper. □
In conclusion, if ϵ ( σ ) converges to 0, while σ > 0 converges to 0, it is possible to obtain a sequence ( X 1 ( σ ) , , X n ( σ ) ) ϵ -proper converging to ( X ^ 1 , , X ^ n ) , when ϵ ( σ ) converges to 0. Therefore, we have to compute the values of ϵ ( σ ) for some small values of σ .
Corollary 1.
If ( X 1 ( σ ) , , X n ( σ ) , ϵ ( σ ) ) is the optimal solution to the 0–1 mixed quadratic program ( Q ), for some σ > 0 , then ( X 1 ( σ ) , , X n ( σ ) ) is an ϵ σ -proper equilibrium, and if 0 < σ < σ , then 0 ϵ ( σ ) ϵ ( σ ) .
Proof. 
If 0 < σ < σ , then the 0–1 mixed quadratic program ( Q ), for σ > 0 , is a relaxation of the same 0–1 mixed quadratic program ( Q ), but for σ > 0 . The main difference between the two programs lies in fact in the conditions of Ω ϵ σ and Ω ϵ σ :
σ x i k , k 1 , 2 , , m i , i N , a n d σ x i k , k 1 , 2 , , m i , i N ,
σ < σ x i k , k 1 , 2 , , m i , i N .
Thus, Ω ϵ σ Ω ϵ σ and 0 ϵ ( σ ) ϵ ( σ ) . □
Hence, there exists only two probable outcomes when computing ϵ ( σ ) for tiny values of σ . The first possible outcome corresponds to the case where ϵ ( σ ) is found to be bounded from below by a strictly positive value ϵ ¯ . The second possible outcome corresponds to the case where ϵ ( σ ) converges to zero.
Case 1: ϵ ( σ ) ϵ ¯ When ϵ ( σ ) is found to be bounded from below by an ϵ ¯ strictly positive indicates that there are no ϵ -proper equilibria close to ( X ^ 1 , , X ^ n ) , for ϵ values less than ϵ ¯ . Therefore, X ^ = ( X ^ 1 , , X ^ n ) would not be proper.
Now, let us assume that ϵ ( σ ) converges to some value ϵ ¯ > 0 , when σ > 0 converges to 0 as in ( Q ). We then define a 0–1 mixed quadratic program under the same conditions of Ω ϵ σ , but with ϵ ϵ ¯ / 2 and seeking to maximize σ . When the optimal objective value of this program is found to be equal to z e r o , we can obviously conclude that it is not possible to obtain a sequence ( X 1 ( σ ) , , X n ( σ ) ) of ϵ -proper equilibria converging to the Nash equilibrium X ^ . Hence, the Nash equilibrium X ^ = ( X ^ 1 , , X ^ n ) is not proper.
Proposition 4.
Consider the following 0–1 mixed quadratic program:
( Q ¯ ) max ϵ , σ σ s u b j e c t   t o x ^ i k ϵ x i k x ^ i k + ϵ , k 1 , 2 , , m i , i N , 0 ϵ ϵ ¯ / 2 , ( X 1 , , X n ) Ω ϵ σ .
If the optimal objective value of ( Q ¯ ) is zero for a given ϵ ¯ > 0 , the Nash equilibrium X ^ = ( X ^ 1 , , X ^ n ) is not proper.
Proof. 
If the optimal objective value of ( Q ¯ ) is equal to 0, it is not possible to obtain a sequence ( X 1 ( σ ) , , X n ( σ ) ) ϵ -proper converging to ( X ^ 1 , , X ^ n ) . Therefore, the Nash equilibrium X ^ = ( X ^ 1 , , X ^ n ) is not proper. □
At this stage, a numerical noise may appear with infinitesimal values of ϵ . It is risky to rely just on the nonlinear optimization results to detect ϵ -properness of a Nash equilibrium. The following mixed 0–1 linear formulations are proposed to overcome this problem.

4.2. Mixed 0–1 Linear Formulation

It is possible to resolve all numerical issues due to the non-linear formulation if ϵ is set to a fixed value. The following Corollary 2 can be formulated. It establishes a 0–1 mixed linear formulation from Proposition 4.
Corollary 2.
For a given ϵ > 0 , consider the following 0–1 mixed linear program
( P ) max σ s u b j e c t   t o x ^ i k ϵ x i k x ^ i k + ϵ , k 1 , 2 , , m i , i N , ( X 1 , , X n ) Ω ϵ σ .
If the optimal objective value of ( P ) is equal to 0, or ( P ¯ ) is unfeasible, the Nash equilibrium X ^ = ( X ^ 1 , , X ^ n ) is not proper.
The proof of this corollary is trivial (see Proof of Proposition 3). Hence, we can rely on the 0–1 mixed linear optimization results to certify that an equilibrium is non-proper. Computationally, this can be done using exact arithmetics. Exact arithmetics perform operations on real numbers defined as rationals instead of floats or doubles. A rational number consists of two very large integers, namely a numerator and a denominator. Overloading the elementary operators to handle algebraic operations on these rationals is fundamental. A Greatest Common denominator function is useful to reduce the size of the stored Big Integers. Exact arithmetics was successfully used by Audet et al. [25] and Avis et al. [26] in the context of two-person games equilibrium enumeration and refinement. The automatic detection of non-proper Nash equilibria can then safely be carried out for polymatrix games.
Case 2: ϵ ( σ ) converges to zero If ϵ ( σ ) converges towards zero, we can conclude on the existence of at least one sequence of ϵ -proper equilibria necessarily close to the given Nash equilibrium X ^ we are testing for properness. Computationally speaking, unless exact arithmetics is used here, this may not be enough to conclude on the properness of the given Nash equilibrium. Therefore, it is imperative to use exact arithmetics to get consistent results.
As an extension of Corollary 2, the following Corollary 3 can be formulated, when ϵ is set to a fixed value.
Corollary 3.
For some ϵ > 0 , consider the mixed 0–1 linear program ( P ) of corollary 2:
( P ) σ ( ϵ ) = max σ s u b j e c t   t o x ^ i k ϵ x i k x ^ i k + ϵ , k 1 , 2 , , m i , i N , ( X 1 , , X n ) Ω ϵ σ .
If the optimal objective value σ ( ϵ ) of ( P ) converges to 0 and remains strictly positive, when ϵ converges to 0, then ( X ^ 1 , , X ^ n ) is a proper equilibrium.
Proof. 
Let ( X 1 ( ϵ ) , , X n ( ϵ ) , σ ( ϵ ) ) be the optimal solution to (4), for some given equilibrium profile ( X ^ 1 , , X ^ n ) .
If σ ( ϵ ) is strictly positive, Proposition (1) ensures that ( X 1 ( ϵ ) , , X n ( ϵ ) ) is an ϵ -proper equilibrium.
Hence, if the optimal objective value σ ( ϵ ) converges to 0 and remains strictly positive when ϵ converges to 0, ( X 1 ( ϵ ) , , X n ( ϵ ) ) remains an ϵ -proper equilibrium and converges to the equilibrium ( X ^ 1 , , X ^ n ) . Therefore, ( X ^ 1 , , X ^ n ) is a proper equilibrium. □
Since it is based on a linear formulation, this corollary resolves all numerical issues due to the nonlinear formulation of Proposition 3. First, we illustrate our results with a polymatrix game where one of the extreme Nash equilibria is not perfect. With the following example, we show how the Corollary 3 can be applied.
Example Consider the three players polymatrix game ( 4 × 3 × 2 ) , where the following matrices A 1 . , A 2 . and A 3 . detail the payoffs of players I, II, and III.
A 1. = ( 1 1 3 1 2 0 3 4 1 2 3 2 | 1 1 4 1 3 2 1 2 ) A 2. = ( 5 2 2 3 2 5 3 4 1 4 2 1 | 2 5 1 2 4 1 )
A 3. ( 1 1 2 3 2 1 2 1 | 5 4 3 2 1 3 )
Using exact arithmetic, the E χ -MIP algorithm [4,25] enumerated three different extreme Nash equilibria for this game, as detailed in Table 1.
The optimization results in Table 2 (as shown in [20], a triplet of linear programs was used to conclude on the perfectness of the extreme Nash equilibria) and Table 3 point towards the existence of ϵ -proper equilibria sequences close to extreme Nash equilibria 1 and 3.
The optimization results in Table 3 show how the value of σ ( ϵ ) decreases while ϵ converges to 0 and the quest of ϵ -proper equilibria close to extreme Nash equilibrium 3 is successful.
The following figure (Figure 1) illustrates how the value of σ ( ϵ ) converges to 0, when ϵ converges to 0, for the extreme Nash equilibrium 3.

4.3. Analytical Computation

We now use the optimization results in Table 2 and Table 3 to analytically generate sequences of ϵ -proper equilibria from extreme Nash equilibrium 3. In the extreme Nash equilibrium 3 strategy profile, player I randomizes on x 11 and x 13 . A sequence of ϵ -proper equilibria should then take into account that player II is indifferent between his first and third strategies. One should think that if player I is not indifferent between his first and third strategies, he would have to assign to the less interesting strategy a probability less than ϵ times the probability of the other strategy.
Therefore, 3 x 21 + 4 x 22 + x 23 + 3 x 31 + 2 x 32 = x 21 + x 22 + 3 x 23 + x 31 + x 32 .
Thus, we have
2 x 23 = 2 x 21 + 3 x 22 + 2 x 31 + x 32
x 23 = 1 2 + x 21 + 3 2 x 22 + 1 2 x 31 .
One should also note that this is satisfied by equilibrium 3. Hence, player II would have to assign a probability larger than 1 2 to his third strategy.
Simultaneously, a sequence of ϵ -proper equilibria should take into account that player II is indifferent between his first and third strategies. Therefore, 5 x 11 + 2 x 12 + 2 x 13 + 3 x 14 + 2 x 31 + 5 x 32 = x 11 + 4 x 12 + 2 x 13 + x 14 + 4 x 31 + x 32 .
Thus, we have
2 x 11 + x 14 + 3 x 32 = x 12 + 1 .
This condition is satisfied by equilibrium 3. Taking into account that x 12 and x 14 would receive very small but strictly positive probabilities, player I would have to assign a probability less than 1 2 to his first strategy, and player III would have to assign a probability less than 1 3 to his second strategy.
Finally, a sequence of ϵ -proper equilibria would take into account that player III is indifferent between his first and second strategies. Therefore, x 11 + x 12 + 2 x 13 + 3 x 14 + 5 x 21 + 4 x 22 + 3 x 23 = 2 x 11 + x 12 + 2 x 13 + x 14 + 2 x 21 + x 22 + 3 x 23 .
Thus, we have
2 x 14 + 3 x 21 + 2 x 22 = x 11 .
This condition is also satisfied by equilibrium 3. Player II would have to assign a probability less than 1 6 to his first strategy.
We here detail some of the steps to generate a sequence of ϵ -proper equilibria converging to the extreme Nash equilibrium 3.
We first set x 31 = 7 9 α ϵ , x 32 = 2 9 + α ϵ , x 21 = 1 18 + λ ϵ , x 22 = 1 36 ϵ and x 23 = 17 18 + ρ ϵ , where λ + ρ + 1 36 = 0 .
With conditions (3), we obtain λ = 36 α 5 144 and ρ = 1 36 α 144 .
Since x 32 needs to be less than 1 3 , we have
α ϵ 1 9 .
For simplification purposes, we consider that player I would be indifferent between his second and fourth strategies, which yields x 14 = x 12 . This assumption does not eliminate the possibility for player I to have a marked preference for one of these two strategies.
We then set x 14 = x 12 and substitute condition (5) in (4).
We obtain
x 12 = x 14 = 7 324 α 284 × ϵ .
Since x 12 1 6 ϵ , because player I prefers his first strategy to his second, we obtain a lower bound on α :
α 121 972 .
Then, by (5) we have x 11 = 1 6 + ( 2 β + 3 λ + 1 8 ) ϵ and x 13 = 5 6 ( 4 β + 3 λ + 1 18 ) ϵ , where
β = 7 324 α 284 .
It has now been made clear that we obtained a sequence of ϵ -proper equilibria converging to the extreme Nash equilibrium 3 when ϵ converges to 0.

5. Branch-and-Bound Algorithm for ϵ -Proper Equilibrium Computation

We now propose a Branch-and-Bound algorithm for ϵ - proper equilibrium computation. The algorithm is based on a 0–1 mixed linear program which provides at optimality an ϵ -proper equilibrium to a polymatrix game. The 0–1 mixed linear program combines Nash equilibrium primal and dual conditions detailed in [4] and the ϵ -properness conditions. Note that ϵ here is set to a fixed rational value which is strictly positive, but very close to 0 .

5.1. The 0–1 Mixed Linear Master Program

The following 0–1 mixed linear program ( L ) combines the Nash equilibrium primal and dual conditions, for polymatrix games, and the ϵ -properness conditions aforementioned. The optimization of ( L ) using exact arithmetics, provides at optimality a Nash equilibrium X ^ and an ϵ -proper equilibrium X, when ϵ is set to a fixed value.
Proposition 5.
Given a polymatrix game, for some ϵ > 0 , consider the 0–1 mixed linear program ( L ) :
σ ( ϵ ) = M a x i m i z e X ^ , X , V , σ σ                          
Subject   to      e m t X ^ i = 1 ,      i N ,
         X ^ i + V i 1 ,      i N ,
   α i e i j = 1 , j i n A i j X ^ j 0 ,     i N
α i e i j = 1 , j i n A i j X ^ j L V i 0 ,    i N
x ^ i k ϵ x i k x ^ i k + ϵ , k 1 , , m i , i N ,
( X 1 , , X n ) Ω ϵ σ
       X ^ i 0 ,         i N ,
      V i { 0 , 1 } m i ,       i N .
This 0–1 mixed linear program ( L ) is always feasible for any ϵ > 0 and its optimal objective value σ ( ϵ ) is always strictly positive.
The mixed strategies vector X ^ = ( X ^ 1 , , X ^ n ) is a Nash equilibrium of the given polymatrix game and the completely mixed strategies vector X = ( X 1 , , X n ) is an ϵ-proper equilibrium.
Proof. 
Constraints (9)–(12) assert that X ^ = ( X ^ 1 , , X ^ n ) is a Nash equilibrium of the given polymatrix game (see [4]). Constraints (13) and (14) assert that X = ( X 1 , , X n ) is an ϵ -proper equilibrium as shown earlier. For any given polymatrix game, the set of Nash equilibria and the set of proper equilibria are non-empty. Therefore, for any ϵ > 0 , there exists a pair of vectors X ^ and X such that X ^ is a Nash equilibrium, and X is an ϵ -proper equilibrium. Hence, the optimal objective value σ ( ϵ ) is always strictly positive. □
The mixed 0–1 linear program ( L ) serves as the master program in the proposed ϵ -proper Branch-and-Bound algorithm.

5.2. The ϵ -Proper Branch-and-Bound Algorithm

Let P be defined as a master linear program obtained after relaxation of the 0–1 conditions in ( L ) . The proposed ϵ -proper Branch-and-Bound algorithm follows a principal branching tree scheme and many secondary branching trees. At every vertex on the principal tree, a binary branching is performed on one of the 0–1 variables. At every vertex of the principal tree, the algorithm solves a master linear program P with at most two constraints fixing one or more binary variables and cutting one or more binary variables combinations. Each time a Nash equilibrium X ^ is obtained, the objective function value σ is compared to a recorded maximum σ ¯ . In the case where the optimal objective σ is equal to zero or lower than σ ¯ , the given principal tree vertex and its sons cannot provide any ϵ -proper equilibrium with a larger σ than σ ¯ . The current σ ¯ stored serves as a bound to shorten the algorithm’s execution.
At every vertex of a secondary tree node, a linear sub-program Q is solved. This linear sub-program Q results from the master linear program P, with one combination of binary variables fixing constraints and at most two constraints fixing one or more continuous x variables. If a Nash equilibrium X ^ is obtained, the objective function value σ is compared to a recorded maximum σ ¯ . Again, in the case where the optimal objective σ is equal to zero or lower than σ ¯ , the given secondary tree vertex and its sons cannot provide any ϵ -proper equilibrium with a larger σ than σ ¯ .
The ϵ -proper Branch-and-Bound algorithm is formally stated as follows.
Step 1: Initialization.
Define:
-
  P; Master linear program defined by the relaxation of the 0–1 conditions in ( L ) .
-
   Q ; Sub-program defined by a secondary linear program where the U and V variables in P are fixed.
-
   X ^ , X; L ’s real decision variables.
-
   U , V ; L ’s binary decision variables.
-
   E = ; List of Nash equilibria visited.
-
   E ϵ = ; List of ϵ -proper Nash equilibria obtained.
-
   D = 0 ; Depth level reached in the principal tree.
-
   C R ; Principal tree root node.
-
  C; Current node.
-
   x ^ i k , x i k ; Real decision variables on the k t h strategy of player i ( i = 1 , , n ) .
-
   v i q ; Binary decision variables on the q t h strategy of player i ( i = 1 , , n ) .
-
   u k q i ; Binary decision variables on the k t h and q t h strategies of player i ( i = 1 , , n ) .
-
   σ ¯ ; Best value of σ found.
Set C = CR, σ ¯ = 0 and Go to step 2.
  Step 2: Solving master program.
If D | X ^ | + | U | , solve current node’s program P.
-
  If the program P is infeasible or the optimal objective σ is equal to 0; STOP.
-
  Else;
  -
  If D < | X ^ | + | U | , Go to step 3.
  -
  Else, Let  S = ( X ^ ; X ; V ) be the solution obtained;
  -
  If the optimal objective σ of P is less than σ ¯ , STOP.
  -
  Else, if the optimal objective σ of P is ≥ σ ¯ ,
Update σ ¯ = σ , AddX to E and X σ to E ϵ and Go to step 4.
  Step 3: Principal Branching.
Select a binary variable v i q V or u k q i U over which no principal branching was performed.
Let p = D + 1 :
If p | X ^ | + | U | set D = p and
-
  Set v i q V or u k q i U selected to 1 in P and Go to step 2.
-
  Set v i q V or u k q i U selected to 0 in P and Go to step 2.
Step 4: Secondary Branching.
Set binary variable vectors V and U at their values in S .
For every x ^ i k > 0 X ^ : Set x ^ i k = 0 and Go to step 5.
  Step 5: Solving subprogram.
Solve Q ( X ^ ) to optimality.
-
  If the program Q is infeasible or the optimal objective σ is equal to 0; STOP.
-
  Else, if the optimal objective σ of Q ( X ^ ) is less than σ ¯ , STOP.
-
  If the optimal objective σ of Q ( X ^ ) is σ ¯ , Update σ ¯ = σ , Add  X ^ to E and X to E ϵ . Go to step 4.
The sets E and E ϵ are non-empty and contain respectively all Nash equilibria and all ϵ -proper equilibria encountered. The maximum value of σ is σ ( ϵ ) = σ ¯ . The last element entering E ϵ corresponds to σ ( ϵ ) .
Theorem 1.
The ϵ-proper Branch-and-Bound algorithm generates at least one ϵ-proper equilibrium for any polymatrix game.
Proof. 
The algorithm searches for all possible ways to satisfy the Nash equilibrium, complementary slackness, and properness conditions through the principal branching tree and the secondary trees. If X ( ϵ ) is the unique ϵ -proper equilibrium, there exists at least one path in the algorithm’s generated branching tree leading to it. □
The complexity of the ϵ -proper Branch-and-Bound algorithm is certainly non-polynomial. It increases exponentially when the size of the polymatrix game increases.

6. Experimental Results

Table 4 presents the computational results obtained on randomly generated sets of three-player polymatrix games with different sizes and density. The size of the polymatrix games generated goes from ( 2 × 2 × 2 ) to ( 5 × 5 × 5 ) . The density of the polymatrix games generated goes from 12.5 % to 100 %. The entries of the polymatrix games generated are integers. The optimization of the master and secondary linear programs P and Q uses a C++ implementation of the Simplex algorithm in exact arithmetics.
In Table 4, column (Size) lists the different numbers of strategies for each player before elimination of strategies strongly dominated. The column (d) lists the different densities of the partial payoff matrices generated. The column ( ϵ ) lists the values of the parameter ϵ used. The column ( n e ) indicates the average number of ϵ -proper equilibria encountered over 10 polymatrix games randomly generated. Finally, the value (Time) indicates the average time in seconds required to find an ϵ -proper equilibrium with a maximum value of σ . Finally, the experimental results are obtained under MS Windows on workstations equipped with 3.3 GHz Intel Core i5 vPro processors and 3.2 GB RAM.
In particular, one can observe that reducing the partial payoff matrices density increases the computing time required to find an ϵ -proper equilibrium as it also increases the total number of ϵ -proper equilibria detected for the polymatrix game.

7. Conclusions

In this paper we have shown how to compute ϵ -proper Nash equilibria for polymatrix games. The notion of a set of ϵ -proper equilibria for polymatrix games was used to develop a class of 0–1 mixed linear programs to detect proper and non-proper Nash equilibria. Consequently, a 0–1 mixed linear program was proposed to compute an ϵ -proper Nash equilibrium for a polymatrix game. We have finally presented our experimental results on randomly generated polymatrix games. A future extension of this work should naturally tackle proper equilibrium computation for more general forms of n-player games.

Funding

This work was partly funded by King Fahd University of Petroleum and Minerals research grant POC19103 and the Interdisciplinary Research Center on Smart Mobility and Logistics research grant INML2107. This support is gratefully acknowledged.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Nash, J.F. Equilibrium points in n-person games. Proc. Natl. Acad. Sci. USA 1950, 36, 48–49. [Google Scholar] [CrossRef] [Green Version]
  2. Avis, D.; Fukuda, K. A pivoting algorithm for convex hulls and vertex enumeration of arrangements and polyhedra. Discret. Comput. Geom. 1992, 8, 295–313. [Google Scholar] [CrossRef]
  3. Audet, C.; Hansen, P.; Jaumard, B.; Savard, G. Enumeration of all extreme equilibrium strategies of bimatrix games. Siam J. Sci. Comput. 2001, 23, 323–338. [Google Scholar] [CrossRef]
  4. Audet, C.; Belhaiza, S.; Hansen, P. Enumeration of all Extreme Equilibria in Game Theory: Bimatrix and Polymatrix Games. J. Optim. Theory Appl. 2006, 129, 349–372. [Google Scholar] [CrossRef]
  5. Daskalakis, C.; Goldberg, P.; Papadimitriou, C. The complexity of computing a Nash equilibrium. Siam J. Comput. 2009, 39, 195–259. [Google Scholar] [CrossRef]
  6. Hazan, E.; Krauthgamer, R. How hard is it to approximate the best Nash equilibrium? Siam J. Comput. 2011, 40, 79–91. [Google Scholar] [CrossRef]
  7. Etessami, K.; Yannakakis, M. On the complexity of Nash equilibria and other fixed points. Siam J. Comput. 2010, 39, 2531–2597. [Google Scholar] [CrossRef] [Green Version]
  8. Yanovskaya, E.B. Equilibrium points in polymatrix games. Latv. Math. Collect. 1968, 8, 381–384. [Google Scholar]
  9. Howson, J.T. Equilibria of polymatrix games. Manag. Sci. 1972, 18, 312–318. [Google Scholar] [CrossRef]
  10. Eaves, C.B. Polymatrix games with joint constraints. Siam J. Appl. Math. 1973, 24, 418–423. [Google Scholar] [CrossRef]
  11. Howson, J.T.; Rosenthal, R.W. Bayesian equilibria of finite two-person games with incomplete information. Manag. Sci. 1974, 21, 313–315. [Google Scholar] [CrossRef]
  12. Quintas, L. A Note on Polymatrix Games. Int. J. Game Theory 1989, 18, 261–272. [Google Scholar] [CrossRef]
  13. Miller, D.A.; Zucker, S.W. Copositive-plus Lemke Algorithm Solves Polymatrix Games. Oper. Res. Lett. 1991, 10, 285–290. [Google Scholar] [CrossRef]
  14. Lemke, C.E. Bimatrix equilibrium points and mathematical programming. Manag. Sci. 1965, 11, 681–689. [Google Scholar] [CrossRef] [Green Version]
  15. Wilson, R. Computing Equilibria of N-Person Games. Siam J. Appl. Math. 1971, 21, 80–87. [Google Scholar] [CrossRef] [Green Version]
  16. Lemke, C.E.; Howson, J.T. Equilibrium points of bimatrix games. Siam J. Appl. Math. 1964, 12, 413–423. [Google Scholar] [CrossRef]
  17. Govindan, S.; Wilson, R. Computing Nash equilibria by iterated polymatrix approximation. J. Econ. Dyn. Control 2004, 28, 1229–1241. [Google Scholar] [CrossRef]
  18. Strekalovskii, A.S.; Enkhbat, R. Polymatrix games and Optimization Problems. Autom. Remote Control 2014, 75, 632–645. [Google Scholar] [CrossRef]
  19. Papadimitriou, C.H.; Roughgarden, T. Computing Correlated Equilibria in Multi-Player Games. J. ACM 2008, 55, 1–29. [Google Scholar] [CrossRef]
  20. Belhaiza, S. Computing Perfect Nash Equilibria for Polymatrix Games. Game Theory 2014, 2014, 937070. [Google Scholar] [CrossRef] [Green Version]
  21. Myerson, R.B. Refinements of the Nash equilibrium concept. Int. J. Game Theory 1978, 7, 73–80. [Google Scholar] [CrossRef] [Green Version]
  22. Belhaiza, S.; Audet, C.; Hansen, P. On Proper Refinement of Bimatrix Games Nash Equilibria. Automatica 2012, 48, 297–303. [Google Scholar] [CrossRef]
  23. Belhaiza, S.; Charrad, S.; M’Hallah, R. On the Performance of Managers and Controllers: A Polymatrix Game Approach for the Manager-Controller-Board of Directors’ Conflict. J. Optim. Theory Appl. 2018, 177, 584–602. [Google Scholar] [CrossRef]
  24. Borm, P.E.M.; Jansen, M.J.M.; Potters, J.A.M.; Tijs, S.H. On the structure of the set of perfect equilibria in bimatrix games. O-R Spektrum 1993, 15, 17–20. [Google Scholar] [CrossRef] [Green Version]
  25. Audet, C.; Belhaiza, S.; Hansen, P. A new sequence form approach for the enumeration and refinement of all extreme Nash equilibria for extensive form games. Int. Game Theory Rev. 2009, 11, 437–451. [Google Scholar] [CrossRef]
  26. Avis, D.; Rosenberg, G.D.; Savani, R.; von Stengel, B. Enumeration of Nash equilibria for two-player games. Econ. Theory 2009, 42, 9–37. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Eq3; Plot of σ ( ϵ ) .
Figure 1. Eq3; Plot of σ ( ϵ ) .
Algorithms 14 00365 g001
Table 1. Extreme Nash equilibria.
Table 1. Extreme Nash equilibria.
Eq α 1 α 2 α 3 X 1 X 2 X 3
1465 ( 0 , 0 , 1 , 0 ) ( 0 , 0 , 1 ) ( 1 , 0 )
2484 ( 0 , 1 , 0 , 0 ) ( 0 , 0 , 1 ) ( 1 , 0 )
3 35 9 31 6 89 18 ( 1 6 , 0 , 5 6 , 0 ) ( 1 18 , 0 , 17 18 ) ( 7 9 , 2 9 )
Table 2. Optimization results.
Table 2. Optimization results.
Eq.Perfect ϵ -Proper ϵ σ ( ϵ )
1yesyes 1 1000 1 100 , 100 , 100
2yesno 1 1000 0
3yesyes 1 1000 1 5 , 998 , 996
Table 3. Optimization results for equilibrium 3.
Table 3. Optimization results for equilibrium 3.
ϵ σ ( ϵ ) X 1 ( ϵ ) X 2 ( ϵ ) X 3 ( ϵ )
1 2 1 36 ( 11 36 , 1 18 , 11 18 , 1 36 ) ( 1 18 , 1 36 , 11 12 ) ( 49 54 , 5 54 )
1 4 1 90 ( 17 90 , 2 45 , 34 45 , 1 90 ) ( 2 45 , 1 90 , 17 18 ) ( 211 270 , 59 270 )
1 8 1 333 ( 56 219 , 7 219 , 17 24 , 7 1752 ) ( 8 333 , 1 333 , 36 37 ) ( 8 9 , 1 9 )
1 9 1 473 ( 81 473 , 9 473 , 382 473 , 1 473 ) ( 76 1419 , 1 473 , 1340 1419 ) ( 100 129 , 29 129 )
1 10 1 586 ( 50 293 , 5 293 , 475 586 , 1 586 ) ( 5 1758 , 1 586 , 830 879 ) ( 1363 1758 , 395 1758 )
1 100 1 59 , 896 ( 1250 7487 , 25 14 , 974 , 49 , 795 59 , 896 , 1 59 , 896 )( 9995 179 , 688 , 1 59 , 896 , 84845 89 , 844 ) ( 139 , 693 179 , 688 , 39 , 995 179 , 688 )
1 1000 1 5 , 998 , 996 ( 250 , 000 1 , 499 , 749 , 250 1 , 499 , 749 , 4 , 997 , 995 5 , 998 , 996 , 1 5 , 998 , 996 ) ( 999 , 995 17 , 996 , 988 , 1 5 , 998 , 996 , 274 , 145 290 , 274 ) ( 13 , 996 , 993 17 , 996 , 988 , 3 , 999 , 995 17 , 996 , 988 )
1 10 , 000 1 599 , 989 , 996 ( 25 , 000 , 000 149 , 997 , 499 , 2500 149 , 997 , 499 , 499 , 979 , 995 599 , 989 , 996 , 1 599 , 989 , 996 ) ( 99 , 999 , 995 1 , 799 , 969 , 988 , 1 599 , 989 , 996 , 849 , 984 , 995 899 , 984 , 994 ) ( 1 , 399 , 969 , 993 1 , 799 , 969 , 988 , 399 , 999 , 995 1 , 799 , 969 , 988 )
1 100 , 000 1 59 , 999 , 899 , 996 ( 2 , 500 , 000 , 000 14 , 999 , 974 , 999 , 25 , 000 14 , 999 , 974 , 999 , 49 , 999 , 799 , 995 59 , 999 , 899 , 996 , 1 59 , 999 , 899 , 996 ) ( 9 , 999 , 999 , 995 179 , 999 , 699 , 988 , 1 59 , 999 , 899 , 996 , 84 , 999 , 849 , 995 89 , 999 , 849 , 994 ) ( 139 , 999 , 699 , 993 179 , 999 , 699 , 988 , 39 , 999 , 999 , 995 179 , 999 , 699 , 988 )
1 1 , 000 , 000 1 5 , 999 , 998 , 999 , 996 ( 250 , 000 , 000 , 000 1 , 499 , 999 , 749 , 999 , 250 , 000 1 , 499 , 999 , 749 , 999 , 4 , 999 , 997 , 999 , 995 5 , 999 , 998 , 999 , 996 , 1 5 , 999 , 998 , 999 , 996 ) ( 999 , 999 , 999 , 995 17 , 999 , 996 , 999 , 988 , 1 5 , 999 , 998 , 999 , 996 , 8 , 499 , 998 , 499 , 995 8 , 999 , 998 , 499 , 994 ) ( 13 , 999 , 996 , 999 , 993 17 , 999 , 996 , 999 , 988 , 3 , 999 , 999 , 999 , 995 17 , 999 , 996 , 999 , 988 )
Table 4. ϵ -proper experimental results.
Table 4. ϵ -proper experimental results.
ϵ = 10 4 d = 0.125 d = 0.25 d = 0.5 d = 0.75 d = 1.0
SizeneTimeneTimeneTimeneTimeneTime
2 × 2 × 25.111.72.59.31.87.91.57.81.06.7
3 × 2 × 27.322.34.920.33.518.51.68.71.28.1
3 × 3 × 28.635.25.324.32.521.21.820.81.519.5
3 × 3 × 311.047.87.635.62.927.42.027.01.626.7
4 × 4 × 413.772.78.561.83.956.43.745.22.641.5
5 × 4 × 315.287.79.486.54.782.44.279.92.874.6
5 × 4 × 416.8109.410.097.16.293.65.381.43.277.1
5 × 5 × 518.2123.610.3115.46.7106.95.991.14.784.5
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Belhaiza, S. A Branch-and-Bound Algorithm for Polymatrix Games ϵ-Proper Nash Equilibria Computation. Algorithms 2021, 14, 365. https://doi.org/10.3390/a14120365

AMA Style

Belhaiza S. A Branch-and-Bound Algorithm for Polymatrix Games ϵ-Proper Nash Equilibria Computation. Algorithms. 2021; 14(12):365. https://doi.org/10.3390/a14120365

Chicago/Turabian Style

Belhaiza, Slim. 2021. "A Branch-and-Bound Algorithm for Polymatrix Games ϵ-Proper Nash Equilibria Computation" Algorithms 14, no. 12: 365. https://doi.org/10.3390/a14120365

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop