An Enhanced Gradient Algorithm for Computing Generalized Nash Equilibrium Applied to Electricity Market Games

Adriano C. Lisboa; Fellipe F. G. Santos; Douglas A. G. Vieira; Rodney R. Saldanha; Felipe A. C. Pereira

doi:10.3390/en18030727

,

and

¹

Gaia, Belo Horizonte 31310-260, MG, Brazil

²

ENACOM, Belo Horizonte 31275-100, MG, Brazil

³

Electrical Engineering Department, Federal University of Minas Gerais, Belo Horizonte 31270-901, MG, Brazil

⁴

Gratuated Program on Computational and Mathematical Modelling, Federal Center of Technological Education of Minas Gerais, Belo Horizonte 30421-169, MG, Brazil

Energies2025, 18(3), 727;https://doi.org/10.3390/en18030727

This article belongs to the Section C: Energy Economics and Policy

Version Notes

Order Reprints

Abstract

This paper introduces an enhanced algorithm for computing generalized Nash equilibria for multiple player nonlinear games, which degenerates in a gradient algorithm for single player games (i.e., optimization problems) or potential games (i.e., equivalent to minimizing the respective potential function), based on the Rosen gradient algorithm. Analytical examples show that it has similar theoretical guarantees of finding a generalized Nash equilibrium when compared to the relaxation algorithm, while numerical examples show that it is faster. Furthermore, the proposed algorithm is as fast as, but more stable than, the Rosen gradient algorithm, especially when dealing with constraints and non-convex games. The algorithm is applied to an electricity market game representing the current electricity market model in Brazil.

Keywords:

Brazilian electricity market; Rosen algorithm; generalized Nash equilibrium

1. Introduction

The energy sector in many countries around the world has been reformed since the end of the 1980s [1,2]. It changed from a centralized, monopolistic sector to one that stimulates competition, with an open market that gencos (and often consumers) can make bids for energy prices and negotiate future energy contracts. The energy spot price is then defined by the market operator (and/or system operator) using the marginal unit dispatched price, considering the bids and transmission constraints [3].

The change in the energy markets also created challenges for healthy market operation. While the move from regulated energy amounts and prices gave gencos the freedom to define better business strategies, it also gave larger companies the possibility to manipulate prices and exert market power [1,4,5,6]. Furthermore, a more open market tends to require shorter time bids [7].

The Brazilian electric sector reforms started in 1996 [8], and now it works in a mix of regulated and open markets. The open market allows gencos and consumers to negotiate contracts directly, and has spot prices calculated from audited costs and future prices of water (as hydroelectric power plants are the main source of energy), using a complex chain of models. There are proposals for moving the market to a bid-based model that would allow generators to make bids, declaring the price and quantity curves they are offering for each time slot [8,9].

The use of game theory as a tool to study competitive energy markets is well established [1,10]. As the electricity markets usually derived from an integrated, monopolistic and state-owned company, the analyses usually focus on oligopolist competitive game models [10], particularly looking for Nash equilibrium strategies and their relationships to marginal cost curves. In this sense, Nash equilibrium analysis can be used by market participants to define best bid strategies and by regulatory agencies to evaluate market power.

A Nash equilibrium is defined by

\begin{matrix} x_{1}^{★} & = arg min_{x_{1}} f_{1} (x_{1}, x_{- 1}^{★}) \end{matrix}

(1)

\begin{matrix} x_{2}^{★} & = arg min_{x_{2}} f_{2} (x_{2}, x_{- 2}^{★}) \\ \dots \end{matrix}

(2)

\begin{matrix} x_{p}^{★} & = arg min_{x_{p}} f_{p} (x_{p}, x_{- p}^{★}) \end{matrix}

(3)

where

x = (x_{1}, x_{2}, \dots, x_{p}) \in R^{n}

are the player variables (i.e., strategy),

f_{v} : R^{n} \to R

is the objective function of player v,

x_{v} \in R^{n_{v}}

are the variables of player v and

x_{- v}

denotes all variables except the ones of player v. The Nash equilibrium concept can be extended to concurrent constrained optimization problems, as defined further in this paper.

The first known use of the Nash equilibrium concept dates back to 1838. However, it was named after John Nash in 1950 who proved its existence for continuous convex objectives relative to player own variables and nonempty convex compact feasible sets [11]. Rosen has proved extra conditions on continuously differentiable convex games in order to guarantee uniqueness of a Nash equilibrium [12]. Facchinei et al., have analyzed the existence of a Nash equilibrium for pseudo-convex objective functions relatively to player own variables in terms of variational inequalities on compact convex feasible sets [13].

The Best Response Dynamics (BRD) is a straightforward algorithm to find a Nash equilibrium where at each iteration a different player optimizes its own strategy looking only to its own objective, while fixing other player strategies. It enjoys theoretical guarantees of convergence to a Nash equilibrium only in special cases, including potential games [14], aggregative games [15], acyclic games [16] and quasi-acyclic games [17,18].

Rosen has proposed a gradient algorithm with guaranteed convergence and optimality for games satisfying certain convex conditions [12], which introduces the concept of normalized Nash equilibrium for constrained games and also the fundamental idea for the search direction of this paper.

The relaxation algorithm considers the optimization of Nikaido–Isoda [19] function at each iteration, which is built from player objective functions, so that the next player strategies lies in the segment connecting the previous iteration point and the Nikaido–Isoda function solution. It enjoys theoretical guarantees of convergence to a Nash equilibrium for games with weakly convex–concave Nikaido–Isoda functions [20,21] respecting a few extra conditions, which is a broader class of games than BRD and Rosen gradient algorithm can cope with with theoretical guarantees.

This paper proposes and analyzes an enhanced algorithm based on a line search along a composition of objective function gradients of players as search direction at each iteration, without the need of an optimization on player variables at each iteration as in the relaxation method. The result is a faster and more stable algorithm, especially in games of many variables. This algorithm has already been used, but not analyzed in depth [22].

2. The Gradient Algorithm

2.1. Fundamental Idea

The fundamental idea of the proposed algorithm is to follow the vector field composed of a negatively weighted gradient of each player objective

F (x) = - (λ_{1} \nabla_{x_{1}} f_{1} (x), λ_{2} \nabla_{x_{2}} f_{2} (x), \dots, λ_{p} \nabla_{x_{p}} f_{p} (x)), λ > 0

(4)

until it eventually stops at a Nash equilibrium. Let

\hat{F} (x)

denote the unit vector towards a non-null

F (x)

. Then, the algorithm can be defined by the iterative update

x_{k + 1} = x_{k} + α^{★} \hat{F} (x_{k})

(5)

where the step length is

α^{★} = max_{α} α : \hat{F} {(x_{k})}^{T} \hat{F} (x_{k + 1}) \geq 1 - η, η \in (0, 2)

(6)

where

η

controls the change direction threshold to follow the vector field (

η = 0

for no direction change).

The search direction (4) is the same as proposed by Rosen [12]. However, the step length (6) is different (i.e., it is not the one that minimizes

∥F (x + α F (x))∥

) and behaves better in practice).

Notice that for single player games (i.e., optimization problems) and

η = 1

, this algorithm degenerates into the classical gradient descent algorithm [23]. Furthermore, if

η = 1

,

λ = 1

and the vector field (4) is conservative, i.e., the game is potential [14], then the algorithm also degenerates into the classical gradient descent algorithm for minimizing the associated potential function.

2.2. Algorithm Analysis

The convergence of the algorithm depends on the following concept of attractor and basin of attraction.

Definition 1

(Attractor and basin of attraction). Let

0 \leq t \in R

represent time and

s (t, x)

be a function which specifies the dynamics of the system, so that

s (0, x) = x \in R^{n}

is the initial state of the system and

s (t, x)

is the result of the evolution of this state after t units of time. An attractor

A \subset R^{n}

of a dynamic system

s (t, x)

is characterized by the following three conditions:

1.: If $s (t_{a}, x) \in A$ then $s (t, x) \in A$ , $\forall t > t_{a}$ ;
2.: There exists a neighborhood of $A$ , called the basin of attraction $B$ of $A$ , such that for any open neighborhood $V$ of $A$ there is a positive constant T such that $f (t, b) \in V$ , $\forall t > T$ and $\forall b \in B$ ;
3.: There is no non-empty subset of $A$ having the first two properties.

Indeed, the iterative update (5) with a step length (6) can be seen as a dynamic system

s (t, x)

, where t is the iteration k and x define the player variables. When

η \to 0

, the dynamic system can be visualized by the vector field given by (4). Convergence and optimality proofs for the proposed algorithm are given next.

Theorem 1

(Convergence). The iterative update (5) with step length (6) converges to an attractor of vector field (4) for

η \to 0

and a enough large k if at least one attractor exists and the starting point

x_{0}

belongs to its basin of attraction.

Proof.

The iterative update (5) with given

η \to 0

for step length (6) defines a dynamic system given by the vector field (4) and the convergence to an attractor follows by construction. □

Theorem 2

(Optimality). Every fixed point attractor of vector field (4) has necessary conditions to be also a Nash equilibrium of game (1)–(3)

\forall λ > 0

for functions

f_{v}

continuously differentiable in x.

Proof.

If the attractor is a single fixed point, then

F (x) = 0

because otherwise the respective dynamic system would continue moving. This

F (x) = 0

defines a necessary optimality condition for each player, i.e., a Nash equilibrium. □

For pseudo-convex (i.e., objective functions are pseudo-convex to respective player own variables) unconstrained games, the condition

F (x) = 0

also becomes sufficient for Nash equilibria. Indeed, if pseudo-convexity is proven for a game and the proposed algorithm converges to a single point, then it is a Nash equilibrium. Furthermore, for finite pseudo-convex potential games, the proposed algorithm converges to a Nash equilibrium just like a classical gradient algorithm converges to the minimum of finite pseudo-convex optimization problems. For convex games, the proposed enhanced gradient algorithm converges to the Nash equilibrium just like Rosen gradient algorithm because

∥F (x_{k})∥

is reduced after each iteration k for a small enough

η

.

2.3. Examples for Convergence Analysis

Consider a 2-player game where one player controls the price that it will sell a product and the other player controls the amount of the product it will buy. Both players are penalized for rising their control variables squared by a factor

c / 2

. The game of maximum gain of player 1 and minimum expenses of player 2 can be written as

\begin{matrix} x_{1}^{★} & = arg min_{x_{1}} f_{1} (x_{1}, x_{2}^{★}) = \frac{1}{2} c x_{1}^{2} - x_{1} x_{2}^{★} \end{matrix}

(7)

\begin{matrix} x_{2}^{★} & = arg min_{x_{2}} f_{2} (x_{1}^{★}, x_{2}) = \frac{1}{2} c x_{2}^{2} + x_{1}^{★} x_{2} \end{matrix}

(8)

for

c > 0

, so that each objective function is convex relative to respective player variable.

The best response curves are given by

\begin{matrix} X_{1}^{★} (x_{2}) & = \frac{x_{2}}{c} \end{matrix}

(9)

\begin{matrix} X_{2}^{★} (x_{1}) & = - \frac{x_{1}}{c} \end{matrix}

(10)

so that it has a unique Nash equilibrium at

x^{★} = (0, 0)

(11)

The iterative update (5) with step length (6) converges to the Nash equilibrium of the game (7) and (8) whenever

η < \bar{η} = \frac{2 c^{2}}{1 + c^{2}}, \forall λ

(12)

given by

η = 1 - \hat{F} {(x_{k})}^{T} \hat{F} (x_{k + 1})

where

∥x_{k}∥ > ∥x_{k + 1}∥

and

x_{k + 1} = x_{k} + α F (x_{k})

. Furthermore, it converges at maximum rate for

η^{★} = 1 - \frac{1}{\sqrt{1 + c^{2}}}, \forall λ

(13)

given by

η = 1 - \hat{F} {(x_{k})}^{T} \hat{F} (x_{k + 1})

, where

x_{k + 1}^{T} F (x_{k}) = 0

and

x_{k + 1} = x_{k} + α F (x_{k})

. Notice that

η^{★} < \bar{η}

,

\forall c, λ

.

Hence, even for convex games with a unique Nash equilibrium, the iterative update (5) may not converge if

η \in (0, 2)

is large enough. Furthermore, a large convergent step length may lead to a slower convergence, just like a small step length. Indeed, even the relaxation algorithm [20] only converges to the Nash equilibrium of the game (7) and (8) for specific fixed step lengths. For

c = 0

, the game becomes linear (i.e., objective functions are linear to respective player own variables), and there is still a Nash equilibrium at

x = (0, 0)

, but both algorithms cannot find it starting from somewhere else.

An interesting example of a non-convex game is

\begin{matrix} x_{1}^{★} & = arg min_{x_{1}} f_{1} (x_{1}, x_{2}^{★}) = - x_{1} x_{2}^{★} + c x_{1}^{2} (\frac{1}{4} x_{1}^{2} + \frac{1}{2} x_{2}^{★ 2} - r^{2}) \end{matrix}

(14)

\begin{matrix} x_{2}^{★} & = arg min_{x_{2}} f_{2} (x_{1}^{★}, x_{2}) = x_{1}^{★} x_{2} + c x_{2}^{2} (\frac{1}{4} x_{2}^{2} + \frac{1}{2} x_{1}^{★ 2} - r^{2}) \end{matrix}

(15)

whose attractor is the circle

x_{1}^{2} + x_{2}^{2} = r^{2}

(16)

for all

R^{2}

, except at the origin

x = (0, 0)

, where

F (x) = 0

, but it is not a Nash equilibrium. Indeed, this game has no Nash equilibrium. The proposed algorithm will never stop for this game, being trapped in the circular attractor, unless the starting point is the origin, where it stops immediately because of a necessary optimality condition that is not sufficient.

3. Generalized Nash Equilibrium

A generalized Nash equilibrium is defined by

\begin{matrix} x_{1}^{★} & = arg min_{x_{1}} f_{1} (x_{1}, x_{- 1}^{★}) : x_{1} \in X_{1} (x_{- 1}^{★}) \end{matrix}

(17)

\begin{matrix} x_{2}^{★} & = arg min_{x_{2}} f_{2} (x_{2}, x_{- 2}^{★}) : x_{2} \in X_{2} (x_{- 2}^{★}) \\ \dots \end{matrix}

(18)

\begin{matrix} x_{p}^{★} & = arg min_{x_{p}} f_{p} (x_{p}, x_{- p}^{★}) : x_{p} \in X_{p} (x_{- p}^{★}) \end{matrix}

(19)

where

X_{v} (x_{- v}) \subseteq R^{n_{v}}

is the feasible set of player v as a function of other player variables

x_{- v}

.

Consider the joint constraints

X_{v} (x_{- v}) = {x_{v} | (x_{v}, x_{- v}) \in X}

(20)

where

X = {x | g (x) \leq 0} \subseteq R^{n}

is the feasible set for joint constraint functions

g : R^{n} \to R^{m}

.

3.1. Generalized Gradient Algorithm

The generalized algorithm can be defined by the iterative update

x_{k + 1} = x_{k} + α^{★} {\hat{d}}_{k}

(21)

where the step length is

α^{★} = max_{α} α : {\hat{d}}_{k}^{T} \hat{F} (x_{k + 1}) \geq 1 - η, g (x_{k + 1}) \leq 0, η \in (0, 2)

(22)

starting from a feasible point

x_{0} \in X

.

Define the directions

D = [F (x) - \nabla g_{J} (x)]

(23)

where

J = {j | g_{j} (x) = 0}

(24)

is the active set of constraints. Then, the unit search direction

\hat{d}

must lie inside the cone

D^{T} d > 0

in order to walk at least locally in the direction of the vector field while reducing the constraint function values. A good choice is a direction d so that not only

D^{T} d > 0

, but also

d = D w

,

w \geq 0

and

w \neq 0

[24]. A direction in this intersection can be found by the linear optimization problem,

\begin{matrix} minimize & - s \end{matrix}

(25)

\begin{matrix} subject to & - D^{T} D w + σ s \leq 0 \end{matrix}

(26)

\begin{matrix} \sum_{j = 1}^{n_{d}} w_{j} = 1 \end{matrix}

(27)

\begin{matrix} w \in {[0, 1]}^{n_{d}} \end{matrix}

(28)

\begin{matrix} s \in R \end{matrix}

(29)

where

σ \in R^{n_{d}}

defines a weight for each director. These weights

σ

allow us to obtain directions

\hat{d}

closer to

F (x_{k})

, which improves the convergence rate and also reduces divergent behaviors of the algorithm (as seen for Problems (7) and (8)). A weight w relative to

F (x)

four times the ones for activating nonlinear constraints and 4000 times the ones for activating linear constraints is typical for a good performance in practice. The solution of this linear problem

(w^{★}, s^{★})

defines the search direction,

d_{k} = D w^{★}

(30)

if

s^{★} > 0

, otherwise the algorithm stops.

3.2. Generalized Gradient Algorithm Analysis

The primal cone of directions

d = D w

,

w \geq 0

,

w \neq 0

, is introduced in the formulation (25)–(29) only to lead to a better (i.e., more centralized) search direction. The following lemma proves that its intersection with a dual nonempty cone

D^{d} > 0

is also always nonempty.

Lemma 1

(cone intersection). If

D^{T} d > 0

then

\exists w \geq 0, w \neq 0

so that

D w = d

.

Proof.

Suppose the direction d can be written as

D w

. If it belongs to the cone

D^{T} d > 0

, then must exist a

w \geq 0

,

w \neq 0

such that

\begin{matrix} D^{T} d = D^{T} D w & > 0 \end{matrix}

(31)

\begin{matrix} \Rightarrow c^{T} D^{T} D w & > 0, \forall c \geq 0, c \neq 0 \end{matrix}

(32)

which can be interpreted as if exists a strictly supporting hyperplane at the origin to the cone

C = {d | d = D c, c \geq 0 c \neq 0}

whose normal

D w

belongs to

C

. Because

C

is a convex set contained in an open halfspace intersection (i.e.,

D^{T} d > 0

), the origin lies in the boundary of the closure of

C

, the existence of w is guaranteed by the supporting hyperplane theorem for convex sets. □

The proofs of convergence and optimality of the proposed algorithm to find a generalized Nash equilibrium are given next.

Theorem 3

(generalized convergence). The iterative update (21) with step length (22) converges to an attractor of vector field (4) constrained to

x \in X

for

η \to 0

and a large enough k if at least one attractor exists and the starting point

x_{0}

belongs to its basin of attraction.

Proof.

The linear programming provides a direction

d = D w

,

w \geq 0

,

w \neq 0

, such that

D^{T} d > 0

if

s^{★} > 0

. Because of Lemma 1, it is sufficient to consider a direction d such that

D^{T} d > 0

, since in this case d can always be written as

D w

,

w \geq 0

,

w \neq 0

. When there is a direction d that satisfies

D^{T} d > 0

, then there exists an infinitesimal step towards d that does not violate any constraint and still has a component on vector field

F (x)

. Hence, the algorithm converges to an attractor of vector field (4) constrained to

x \in X

for

η \to 0

by construction. □

Theorem 4

(Generalized optimality). Every fixed point attractor

x^{★}

of the vector field (4) constrained to

x \in X

satisfies the variational constraint

F {(x^{★})}^{T} (y - x^{★}) \leq 0

,

\forall y \in X \cap B

, where

B

is a ball of infinitesimal radius around

x^{★}

. Furthermore, every solution

x^{★}

to the variational inequality

F {(x^{★})}^{T} (y - x^{★}) \leq 0

,

\forall y \in X

, for a closed convex set

X

and the vector field

F (x)

in (4) is also a generalized Nash equilibrium for all

λ > 0

of the game (17)–(19), with constraints respecting (20) and objective functions

f_{v}

continuously differentiable in x and pseudo-convex in

x_{v}

.

Proof.

Every fixed point attractor

x^{★}

of

F (x)

constrained to

x \in X

is characterized when there is no feasible direction to step towards

F (x)

by definition, so that mathematically leads to

F {(x^{★})}^{T} d \leq 0

,

\forall d : \nabla g {(x^{★})}^{T} d \leq 0

, which is equivalent to

F {(x^{★})}^{T} (y - x^{★}) \leq 0

,

\forall y \in X \cap B

.

Let

x_{v}

be any point in

X_{v} (x_{- v}^{★})

. From (20), it is clear that

y = (x_{v}, x_{- v}^{★}) \in X

. Since

x^{★}

is a solution to the variational inequality and

y \in X

, the definition of vector field (4) based on diffentiability of

f_{v}

leads to

0 \leq - F {(x)}^{T} (y - x^{★}) = λ_{v} \nabla_{x_{v}} f_{v} {(x_{y}, x_{- v}^{★})}^{T} (x_{v} - x_{v}^{★}), \forall λ_{v} > 0, \forall x_{v} \in X_{v} (x_{- v}^{★}),

(33)

so that

x^{★}

is a solution of game (17)–(19) by the minimum principle under the assumed pseudo-convexity of

f_{v}

. □

Every point that satisfies the variational inequality

F {(x^{★})}^{T} (y - x^{★}) \leq 0

is called normalized Nash equilibrium [12] or variational equilibrium [25], which are the type of equilibrium that the proposed gradient algorithm is able to find. Every normalized Nash equilibrium is a generalized Nash equilibrium, but the converse is not true in general [13]. The following example illustrates this fact.

3.3. An Example with Infinite Generalized Nash Equilibria

Consider the two-player game [13],

\begin{matrix} x_{1}^{★} & = arg min_{x_{1}} f_{1} (x_{1}, x_{2}^{★}) = {(x_{1} - 1)}^{2} : x_{1} + x_{2}^{★} \leq 1 \end{matrix}

(34)

\begin{matrix} x_{2}^{★} & = arg min_{x_{2}} f_{2} (x_{1}^{★}, x_{2}) = {(x_{2} - \frac{1}{2})}^{2} : x_{1}^{★} + x_{2} \leq 1 \end{matrix}

(35)

whose best response curves are given by

\begin{matrix} X_{1}^{★} (x_{2}) & = \{\begin{matrix} 1, & x_{2} \leq 0 \\ 1 - x_{2}, & x_{2} > 0 \end{matrix} \end{matrix}

(36)

\begin{matrix} X_{2}^{★} (x_{1}) & = \{\begin{matrix} \frac{1}{2}, & x_{1} \leq \frac{1}{2} \\ 1 - x_{1}, & x_{1} > \frac{1}{2} \end{matrix} \end{matrix}

(37)

so that the generalized Nash equilibria are

x^{★} = (\frac{1}{2} (1 + a), \frac{1}{2} (1 - a)), a \in [0, 1]

(38)

Considering that

x = (1, 1 / 2)

is infeasible so that

F (x) \neq 0

for all feasible points, the variational inequality

F {(x)}^{T} (y - x) \leq 0

,

\forall y \in X

, holds true for

X = {x = (x_{1}, x_{2}) | x_{1} + x_{2} \leq 1}

only when

F (x) = (b, b)

,

b \geq 0

for

x_{1} + x_{2} = 1

, so that

\begin{matrix} F (x) & = - (λ_{1} (2 x_{1} - 2), (1 - λ_{1}) (2 x_{2} - 1)), λ_{1} \in (0, 1) \\ = - (λ_{1} (2 x_{1} - 2), (1 - λ_{1}) (1 - 2 x_{1})), λ_{1} \in (0, 1) \\ = (b, b) \end{matrix}

(39)

so that, because

λ \neq 0

and

λ \neq 1

by definition of the vector field (4), only the Nash equilibria

x^{★} = (\frac{1}{2} (1 + λ_{1}), \frac{1}{2} (1 - λ_{1})), λ_{1} \in (0, 1)

(40)

can be found by the proposed algorithm, leaving two generalized Nash equilibria out of reach:

x^{★} = (1, 0)

and

x^{★} = (1 / 2, 1 / 2)

. However, if no positive weights

λ

were associated with

F (x)

, only a single generalized Nash equilibria could be found:

x^{★} = (3 / 4, 1 / 4)

, the one where

λ_{1} = λ_{2} = 1 / 2

.

3.4. A Numerical Example: Internet Game

Consider the extended internet switching p-player symmetrical game [26]

x_{v}^{★} = arg min_{x_{v}} \frac{\sum_{i = 1}^{n} x_{v, i}}{B} - \frac{\sum_{i = 1}^{n} x_{v, i}}{\sum_{u = 1}^{p} \sum_{i = 1}^{n} x_{u, i}} : \sum_{u = 1}^{p} \sum_{i = 1}^{n} x_{u, i} \leq B, x_{v} \geq ϵ, v = 1, \dots, p

(41)

which has Nash equilibria at points satisfying

\sum_{i = 1}^{n} x_{v, i}^{★} = \frac{B (p - 1)}{p^{2}}, v = 1, \dots, p

(42)

which is unique when

n = 1

.

Figure 1 shows the typical convergence of the proposed enhanced gradient algorithm (EGNE) for

η = 1

compared to the relaxation algorithm (RNE) for a fixed step of

1 / 2

, Rosen gradient algorithm (GNE), and BRD algorithm (BRD), measured by number of function evaluations. Notice that EGNE, GNE, and BRD are typically one order of magnitude faster than RNE for

p = 5

. As the number of players increases, or the number of variables increases, this difference becomes even greater as shown in Figure 2 and Figure 3, respectively. This is because relaxation algorithm solves an optimization problem every iteration, while the proposed algorithm and GNE only performs a line search, and BRD solves an optimization problem with less variables. Notice also that the relaxation algorithm (RNE) becomes unstable for

n \geq 4

.

Figure 1. Convergence of proposed Enhanced Gradient Algorithm (EGNE) for

η = 1

and

λ_{v} = 1 / 5,

v = 1, \dots, 5

(EGNE), relaxation algorithm for a fixed step of

1 / 2

(RNE), Rosen gradient algorithm for

λ_{v} = 1 / 5, v = 1, \dots, 5

(GNE), and the BRD algorithm (BRD), for game (41) with

p = 5

,

n = 1

,

B = 1

and

ϵ = 10^{- 2}

.

Figure 2. Average computational cost of 30 solutions of game (41) with proposed enhanced gradient algorithm for

η = 1

and

λ_{v} = 1 / p, v = 1, \dots, p

(EGNE), relaxation algorithm for a fixed step of

1 / 2

(RNE), Rosen gradient algorithm for

λ_{v} = 1 / p, v = 1, \dots, p

(GNE), and BRD algorithm (BRD), for an increasing number of players p starting from a random feasible point

x_{0, v} \in [ϵ, B / p]

,

v = 1, \dots, p

, with

n = 1

,

B = 1

and

ϵ = 10^{- 2}

. The stop criterion is to move closer than a Euclidean distance of

10^{- 6}

to the know Nash equilibrium.

Figure 3. Average computational cost of 30 solutions of game (41) with proposed enhanced gradient algorithm for

η = 1

and

λ_{v} = 1 / p, v = 1, \dots, p

(EGNE), relaxation algorithm for a fixed step of

1 / 2

(RNE), Rosen gradient algorithm for

λ_{v} = 1 / p, v = 1, \dots, p

(GNE), and BRD algorithm (BRD), for an increasing number of variables n starting from a random feasible point

x_{0, v} \in [ϵ, B / (n p)]

,

v = 1, \dots, p

, with

p = 4

,

B = 1

and

ϵ = 10^{- 6}

. The stop criterion is to move closer than a Euclidean distance of

10^{- 6}

to the known Nash equilibrium.

4. Energy Market Application

The current Brazilian electric sector is mainly powered by hydroelectric plants, that are dispatched centrally by the system operator, that calculates the water future cost of each to estimate the energy spot prices. As the generation of hydroelectric power plants are strongly affected by water inflow, which depends on the rain regime and upstream hydroelectric plant generation, there is a compulsory financial hedge mechanism to share the hydrological risk of all hydroelectric power plants. In this mechanism, called energy reallocation mechanism, the generation of all hydroelectric power plants is bundled together and each plant payment is proportional to its physical guarantee (PG) that is prefixed for commercialization and does not depend on its actual generation [22].

In this context, each genco v can seasonalize their hydroelectric plants PG

x_{v, t}

along months

t = 1, \dots, T

, where T is the number of periods (

T = 12

in this paper for real seasonalization game), as long as its yearly average PG

X_{v}

is unchanged. This opens possibility to each genco to seasonalize strategically in order to maximize its own total income

R_{v, t}

along periods t, which will be modulated by a generation scaling factor, i.e., the ratio between the actual generation

H_{t}

and the amount of PG allocated by all players in each period t. The Nash equilibrium of the seasonalization non-cooperative game with p-player can be formulated by [22]:

x_{v}^{★} = arg max_{x_{v}} \sum_{t = 1}^{T} R_{v, t} (x) : \sum_{t = 1}^{T} x_{v, t} \leq T X_{v}, \underset{̲}{ρ} X_{v} \leq x_{v} \leq \bar{ρ} X_{v}, v = 1, \dots, p

(43)

where the income of each player v in each period t is given by

R_{v, t} (x) = h_{t} P_{m_{v}, t} [\frac{x_{v} H_{t}}{\sum_{u = 1}^{4} x_{u, t}} - (1 - s) X_{v} η_{t}]

(44)

where

h_{t}

is the duration of period t,

P_{m, t}

is the spot price of period t in sub-market m,

m_{v}

is the sub-market where player v is located, s is the reduction ratio, and

η_{t}

is the generation scaling factor for a flat seasonalization so that a negative income means a worse income than a reference flat seasonalization. Considering

\bar{ρ} > \underset{̲}{ρ} > 0

and

X_{v} > 0

,

\forall v

, the objective function of each player is smooth and pseudo-convex over the feasible domain. The linear constraint functions tend to be activated as the players maximize their income, which is especially difficult to handle, and makes them a challenging test for the constraint handling proposed in this paper.

Figure 4 shows a typical convergence for the seasonalization 4-player game considering one player per sub-market (i.e.,

m_{v} = v

) and year 2020 input data given by

X_{1} = 32,926

,

X_{2} = 7186.4

,

X_{3} = 5962.1

,

X_{4} = 9948.7

,

s = 0.1

,

\underset{̲}{ρ} = 0.5

,

\bar{ρ} = 1.6

, and the parameters given in Table 1. Table 2 shows the respective Nash equilibrium obtained by the proposed algorithm. The maximum variable variation for the proposed enhanced gradient algorithm in 5 runs starting from random points was about

10^{- 6}

, while for the relaxation algorithm was about

10^{- 1}

and the Rosen gradient algorithm was

10^{1}

. The proposed enhanced gradient algorithm convergence is about six times faster than the relaxation algorithm, and a bit faster than the Rosen gradient algorithm.

Figure 4. Typical convergence for the seasonalization game using the proposed gradient algorithm for

η = 1

and

λ_{v} = 1 / 4, i = 1, \dots, 4

(EGNE) the relaxation method for a fixed step of

1 / 2

(RNE), and Rosen gradient algorithm for

λ_{v} = 1 / 4, i = 1, \dots, 4

(GNE), compared to EGNE solution.

Table 1. Seasonalization game parameters.

Table 2. Nash equilibrium for seasonalization game.

Figure 5 shows how the EGNE algorithm scales with an increasing number of players for

\underset{̲}{ρ} = 1 / 2

,

\bar{ρ} = 3 / 2

,

T = 12

,

M = 4

,

P_{m, t} \in [10, 600]

,

H_{t} \in [0, 4 \times 10^{3}]

,

X_{v} \in [10^{3}, 3 \times 10^{3}]

and

m_{v} \in {1, \dots, M}

.

Figure 5. Average computational cost of 30 solutions of seasonalization game with proposed enhanced gradient algorithm for

η = 1

and

λ_{v} = 1 / p, v = 1, \dots, p

(EGNE), for an increasing number of players p starting from a random feasible point

x_{0, v} \in [\underset{̲}{ρ} X_{v}, X_{v}]

,

v = 1, \dots, p

.

5. Conclusions

This paper proposes an enhanced gradient algorithm to find a generalized Nash equilibrium with joint constraints that seems to have similar convergence and optimality behavior when compared to the state-of-the-art relaxation method, but it is typically about one order of magnitude faster in practice. A promising future application is to open or semi-open electricity market game formulations, where the players make price offers and the consumers or system-market operators define which player will actually generate energy. Furthermore, further details can be added to the electricity market game formulation, e.g., control mechanisms to assert proper operation or prevent spurious behavior under any scenario.

The convergence and optimality guarantees of the proposed algorithm are based on the concept of attractor. Future developments in vector field conditions for existence and uniqueness of fixed point attractors can improve the ability to give theoretical guarantees to find a Nash equilibrium for specific game instances.

Author Contributions

Conceptualization, A.C.L.; Methodology, A.C.L.; Software, A.C.L.; Validation, F.F.G.S.; Formal analysis, A.C.L.; Investigation, A.C.L.; Resources, F.F.G.S.; Data curation, F.F.G.S.; Writing—original draft, A.C.L.; Writing—review & editing, F.A.C.P.; Supervision, F.F.G.S., D.A.G.V. and R.R.S.; Project administration, F.F.G.S.; Funding acquisition, D.A.G.V. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by an R&D ANEEL project by CEMIG, CAPES, CNPq and FAPEMIG, Brazil.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

Author Adriano C. Lisboa was employed by the company Gaia. Authors Adriano C. Lisboa, Douglas A. G. Vieira and Felipe A. C. Pereira were employed by the company ENACOM. Author Fellipe F. G. Santos was employed by the company Companhia Energética de Minas Gerais S.A. Authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Green, R.J.; Newbery, D.M. Competition in the British Electricity Spot Market. J. Political Econ. 1992, 100, 929–953. [Google Scholar] [CrossRef]
Carvalho, M.; Pedroso, J.P.; Saraiva, J. Electricity day-ahead markets: Computation of Nash equilibria. J. Ind. Manag. Optim. 2015, 11, 985–998. [Google Scholar] [CrossRef]
PJM. PJM Manual 11: Energy & Ancillary Services Market Operations; PJM: Norristown, PA, USA, 2024. [Google Scholar]
Marshall, L.; Bruce, A.; MacGill, I. Assessing wholesale competition in the Australian National Electricity Market. Energy Policy 2021, 149, 112066. [Google Scholar] [CrossRef]
Hao, Y.; Vand, B.; Delgado, B.M.; Baldi, S. Market Manipulation in Stock and Power Markets: A Study of Indicator-Based Monitoring and Regulatory Challenges. Energies 2023, 16, 1894. [Google Scholar] [CrossRef]
Yu, L.; Wang, P.; Chen, Z.; Li, D.; Li, N.; Cherkaoui, R. Finding Nash equilibrium based on reinforcement learning for bidding strategy and distributed algorithm for ISO in imperfect electricity market. Appl. Energy 2023, 350, 121704. [Google Scholar] [CrossRef]
Wu, C.; Gu, W.; Yi, Z.; Lin, C.; Long, H. Non-cooperative differential game and feedback Nash equilibrium analysis for real-time electricity markets. Int. J. Electr. Power Energy Syst. 2023, 144, 108561. [Google Scholar] [CrossRef]
Ribeiro, L.; Street, A.; Valladão, D.; Freire, A.C.; Barroso, L. Technical and economical aspects of wholesale electricity markets: An international comparison and main contributions for improvements in Brazil. Electr. Power Syst. Res. 2023, 220, 109364. [Google Scholar] [CrossRef]
PSR. Propostas de Metodologias Para a Formação de Preços por Oferta No Brasil, Entregável 3: Proposta de Desenho de Mecanismo Conceitual; Technical Report; PSR: Rio de Janeiro, Braizl, 2021. [Google Scholar]
Lin, X.; Wang, B.; Xiang, Z.; Zheng, Y. A review of market power-mitigation mechanisms in electricity markets. Energy Convers. Econ. 2022, 3, 304–318. [Google Scholar] [CrossRef]
Nash, J.F. Equilibrium points in n-person games. Proc. Natl. Acad. Sci. USA 1950, 36, 48–49. [Google Scholar] [CrossRef] [PubMed]
Rosen, J. Existence and uniqueness of equilibrium points for concave n-person games. Econometrica 1965, 33, 520–534. [Google Scholar] [CrossRef]
Facchinei, F.; Fischer, A.; Piccialli, V. On generalized Nash games and variational inequalities. Oper. Res. Lett. 2007, 35, 159–164. [Google Scholar] [CrossRef]
Monderer, D.; Shapley, L.S. Potential games. Games Econ. Behav. 1996, 14, 124–143. [Google Scholar] [CrossRef]
Dindos, M.; Mezzetti, C. Better-reply dynamics and global convergence to Nash equilibrium in aggregative games. Games Econ. Behav. 2006, 54, 261–292. [Google Scholar] [CrossRef]
Fabrikant, A.; Jaggard, A.D.; Schapira, M. On the structure of weakly acyclic games. Theory Comput. Syst. 2013, 53, 107–122. [Google Scholar] [CrossRef]
Friedman, J.W.; Mezzetti, C. Learning in games by random sampling. J. Econ. Theory 2001, 98, 55–84. [Google Scholar] [CrossRef]
Takahashi, S.; Yamamori, T. The pure Nash equilibrium property and the quasi-acyclic condition. Econ. Bull. 2002, 3, 1–6. [Google Scholar]
Hukuhane, N.; Kazuo, I. Note on noncooperative convex games. Pac. J. Math. 1955, 5, 807–815. [Google Scholar]
Uryas’ev, S.; Rubinstein, R.Y. On relaxation algorithm in computation of noncooperative equilibria. IEEE Trans. Autom. Control 1994, 39, 1263–1267. [Google Scholar] [CrossRef]
Krawczyk, J.B.; Uryas’ev, S. Relaxation algorithms to find Nash equilibria with economic applications. Environ. Model. Assess. 2000, 5, 63–73. [Google Scholar] [CrossRef]
dos Santos, F.F.G.; de Castro Lobato, M.V.; Vieira, D.A.G.; Lisboa, A.C.; Saldanha, R.R. A Nash equilibrium approach to the Brazilian seasonalization of energy certificates. Energies 2022, 15, 2156. [Google Scholar] [CrossRef]
Lemaréchal, C. Cauchy and the gradient method. Doc. Math. Extra 2012, 251, 10. [Google Scholar]
Vieira, D.A.G.; Takashi, R.H.C.; Saldanha, R.R. Multicriteria optimization with a multiobjective golden section line search. Math. Program. 2012, 131, 131–161. [Google Scholar] [CrossRef]
Facchinei, F.; Kanzow, C. Generalized Nash equilibrium problems. Q. J. Oper. Res. 2007, 5, 173–210. [Google Scholar] [CrossRef]
Kesselman, A.; Leonardi, S.; Bonifaci, V. Game-theoretic analysis of internet switching with selfish users. Notes Comput. Sci. 2005, 3828, 236–245. [Google Scholar]

Figure 1. Convergence of proposed Enhanced Gradient Algorithm (EGNE) for

η = 1

and

λ_{v} = 1 / 5,

v = 1, \dots, 5

(EGNE), relaxation algorithm for a fixed step of

1 / 2

(RNE), Rosen gradient algorithm for

λ_{v} = 1 / 5, v = 1, \dots, 5

(GNE), and the BRD algorithm (BRD), for game (41) with

p = 5

,

n = 1

,

B = 1

and

ϵ = 10^{- 2}

.

Figure 2. Average computational cost of 30 solutions of game (41) with proposed enhanced gradient algorithm for

η = 1

and

λ_{v} = 1 / p, v = 1, \dots, p

(EGNE), relaxation algorithm for a fixed step of

1 / 2

(RNE), Rosen gradient algorithm for

λ_{v} = 1 / p, v = 1, \dots, p

(GNE), and BRD algorithm (BRD), for an increasing number of players p starting from a random feasible point

x_{0, v} \in [ϵ, B / p]

,

v = 1, \dots, p

, with

n = 1

,

B = 1

and

ϵ = 10^{- 2}

. The stop criterion is to move closer than a Euclidean distance of

10^{- 6}

to the know Nash equilibrium.

Figure 3. Average computational cost of 30 solutions of game (41) with proposed enhanced gradient algorithm for

η = 1

and

λ_{v} = 1 / p, v = 1, \dots, p

(EGNE), relaxation algorithm for a fixed step of

1 / 2

(RNE), Rosen gradient algorithm for

λ_{v} = 1 / p, v = 1, \dots, p

(GNE), and BRD algorithm (BRD), for an increasing number of variables n starting from a random feasible point

x_{0, v} \in [ϵ, B / (n p)]

,

v = 1, \dots, p

, with

p = 4

,

B = 1

and

ϵ = 10^{- 6}

. The stop criterion is to move closer than a Euclidean distance of

10^{- 6}

to the known Nash equilibrium.

Figure 4. Typical convergence for the seasonalization game using the proposed gradient algorithm for

η = 1

and

λ_{v} = 1 / 4, i = 1, \dots, 4

(EGNE) the relaxation method for a fixed step of

1 / 2

(RNE), and Rosen gradient algorithm for

λ_{v} = 1 / 4, i = 1, \dots, 4

(GNE), compared to EGNE solution.

Figure 5. Average computational cost of 30 solutions of seasonalization game with proposed enhanced gradient algorithm for

η = 1

and

λ_{v} = 1 / p, v = 1, \dots, p

(EGNE), for an increasing number of players p starting from a random feasible point

x_{0, v} \in [\underset{̲}{ρ} X_{v}, X_{v}]

,

v = 1, \dots, p

.

Table 1. Seasonalization game parameters.

t	$h_{t}$	$H_{t}$	$η_{t}$	$P_{1, t}$	$P_{2, t}$	$P_{3, t}$	$P_{4, t}$
1	744	54,169	0.99670380	13,767	13,757	13,735	13,701
2	672	56,765	1.03818298	12,148	12,148	11,182	11,117
3	744	58,476	1.00380676	11,781	11,782	10,617	9953
4	720	52,581	0.83536170	10,342	10,334	9652	8529
5	744	48,936	0.86609332	10,067	10,042	9505	8848
6	720	46,301	0.91454965	10,048	9937	9742	9772
7	744	45,573	0.98870119	10,234	10,080	9672	9782
8	744	45,880	1.01770868	10,161	9991	9736	9879
9	720	46,472	1.06667054	10,369	10,167	9984	10,094
10	744	48,262	1.09996905	10,293	10,115	10,021	10,086
11	720	48,630	1.09499348	10,126	10,021	9802	9830
12	744	51,927	1.07686986	8659	8645	8419	8401

Table 2. Nash equilibrium for seasonalization game.

t	$x_{1}$	$x_{2}$	$x_{3}$	$x_{4}$
1	46,805	8982.91	9539	15,918
2	38,630	9569	5685	11,247
3	42,278	11,498	6779	9215
4	32,363	9014	6590	5655
5	30,513	7621	5851	7303
6	28,156	5434	5240	9628
7	29,064	5962	4,816	9687
8	29,121	5574	5102	10,117
9	29,136	5415	5285	10,141
10	31,089	5635	5916	10,903
11	29,806	5811	5500	10,138
12	28,151	5719	5240	9433

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

An Enhanced Gradient Algorithm for Computing Generalized Nash Equilibrium Applied to Electricity Market Games

Abstract

1. Introduction

2. The Gradient Algorithm

2.1. Fundamental Idea

2.2. Algorithm Analysis

2.3. Examples for Convergence Analysis

3. Generalized Nash Equilibrium

3.1. Generalized Gradient Algorithm

3.2. Generalized Gradient Algorithm Analysis

3.3. An Example with Infinite Generalized Nash Equilibria

3.4. A Numerical Example: Internet Game

4. Energy Market Application

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics