Hierarchical Structures and Leadership Design in Mean-Field-Type Games with Polynomial Cost

El Oula Frihi, Zahrate; Barreiro-Gomez, Julian; Eddine Choutri, Salah; Tembine, Hamidou

doi:10.3390/g11030030

Open AccessArticle

Hierarchical Structures and Leadership Design in Mean-Field-Type Games with Polynomial Cost

¹

Lab. of Probability and Statistics (LaPS), Department of Mathematics, Badji-Mokhtar University, B.P.12, Annaba 23000, Algeria

²

Learning & Game Theory Laboratory, Engineering Division, New York University Abu Dhabi, Saadiyat Campus, PO Box 129188, Abu Dhabi 44966, UAE

³

Research Center on Stability, Instability and Turbulence, New York University Abu Dhabi, Abu Dhabi 44966, UAE

^*

Author to whom correspondence should be addressed.

Games 2020, 11(3), 30; https://doi.org/10.3390/g11030030

Submission received: 8 June 2020 / Revised: 28 July 2020 / Accepted: 30 July 2020 / Published: 6 August 2020

(This article belongs to the Special Issue Mean-Field-Type Game Theory)

Download

Browse Figures

Versions Notes

Abstract

:

This article presents a class of hierarchical mean-field-type games with multiple layers and non-quadratic polynomial costs. The decision-makers act in sequential order with informational differences. We first examine the single-layer case where each decision-maker does not have the information about the other control strategies. We derive the Nash mean-field-type equilibrium and cost in a linear state-and-mean-field feedback form by using a partial integro-differential system. Then, we examine the Stackelberg two-layer problem with multiple leaders and multiple followers. Numerical illustrations show that, in the symmetric case, having only one leader is not necessarily optimal for the total sum cost. Having too many leaders may also be suboptimal for the total sum cost. The methodology is extended to multi-level hierarchical systems. It is shown that the order of the play plays a key role in the total performance of the system. We also identify a specific range of parameters for which the Nash equilibrium coincides with the hierarchical solution independently of the number of layers and the order of play. In the heterogeneous case, it is shown that the total cost is significantly affected by the design of the hierarchical structure of the problem.

Keywords:

mean-field-type hierarchical control; mean-field-type games; design of hierarchical structure

1. Introduction

The idea of hierarchy dates back at least to 1934, when Stackelberg [1] introduced a game that models markets where some firms have a stronger influence on others. Stackelberg games consist of two players, a leader and a follower. The leader who moves first decides an optimal strategy after anticipating the best response of the follower. Then, the follower eventually chooses the anticipated best response to optimize their cost or payoff. Therefore, this game is a game with two-level hierarchy. A dynamic Linear-Quadratic (LQ) Stackelberg differential game was studied by Samaan and Cruz [2]. A stochastic LQ Stackelberg differential game was investigated by Bagchi and Başar [3]. Bensoussan et al. [4] derived a maximum principle for the leader’s Stackelberg solution under the adapted closed-loop memoryless information structure.

Having two or more players, the Stackelberg game is called a hierarchical game, and it becomes more interesting and involved due to its multi-layer structure, including various forms of information. The players act in sequential order, such that each one of them is a leader for the previous and a follower of the next player in the hierarchy. For hierarchical mean-field-free differential games, see, for example, [5,6,7,8,9].

Only a few papers have considered hierarchical structures in mean-field-related games. Open-loop Stackelberg solutions are addressed in a linear-quadratic setting in [10,11]; and in the context of large populations, mean-field Stackelberg games are investigated in [12,13,14,15,16]. Besides, the leader-follower configuration has been used in several problems and fields to illustrate and model a variety of hierarchical behaviors. For instance, in [17], a leader-follower stochastic differential game with asymmetric information is studied, motivated by applications in finance, economics, and management engineering. In [18], a large-population leader-follower stochastic multi-agent system is analyzed with coupled cost functions and by using a mean-field Linear-Quadratic-Gaussian (LQG) approach. Regarding control applications, [19] presents a tracking control design in a distributed manner in a multi-agent system configured in a leader-follower fashion, and it is shown that the setup can be used to model the power sharing problem in microgrids. In [20], a security problem in networked control systems is studied by means of a Stackelberg approach, and in [21] a hierarchical control structure or sequential predictive control is designed for a large-scale water system. In [22], leadership is studied in the context of public goods games by means of the reward and punishment effects. The works mentioned above do not consider a hierarchical mean-field-type game setting where the payoff functionals are non-linear with respect to the probability measure of the state.

mean-field-type control was first introduced by Anderson and Djehiche [23], as well as Buckdahn, Djehiche, and Li [24]. The authors solved a one-player mean-field-type game in which the state dynamics and the payoff function depend on the first moment of the state (the mean-field coupling). The stochastic mean-field-type control problem is generalized to the stochastic mean-field-type game with several players—see, for example, [25,26,27,28,29,30,31].

The hierarchical mean-field-type game theory studies a class of hierarchical games in which the payoffs and/or state dynamics depend not only on the state-action pairs, but also the distribution of them [30]. This class of games offers several features:

A single decision-maker can have a strong impact on the mean-field terms;
The expected payoffs are not necessarily linear with respect to the state distribution;
The number of decision-makers is not necessarily infinite.

Games with non-linear distribution-dependent quantity-of-interest are very attractive in terms of applications, since the non-linear dependence of the payoff functions in terms of state distribution allow us to capture risk measures, which are functionals of variance, inverse quantile, and/or higher moments. In portfolio optimization, for instance, payoff functions may include the third and the fourth moments, known as the kurtosis and skewness (e.g., [32,33]). Generally, equilibrium solutions to mean-field-type games are presented as either open-loop or closed-loop solutions. The open-loop solutions are controls that do not explicitly depend on the state process at time t, that is, they are rather adapted processes that depend only on time and the initial data. The stochastic maximum principle can be used as a methodology for finding such optimal control strategies. Closed-loop solutions (i.e., feedback solutions) are deterministic functions that depend on the state of the process at time t, as well as its marginal distribution. The dual adjoint functions which are obtained from the Hamilton-Jacobi-Bellman (HJB) equations can be used for finding feedback optimal controls. We will use this approach throughout this paper. For linear quadratic stochastic differential games, Sun and Yong [34] established that the existence of open-loop optimal control strategies is equivalent to the solvability of the corresponding optimality system, which is a forward-backward Stochastic Differential Equation (SDE), and the existence of closed-loop optimal strategies is equivalent to the existence of a regular solution to the corresponding Riccati equation.

Our contribution can be summarized as follows. This work examines a class of hierarchical mean-field-type games with multiple layers, leaders, and followers. Based on infinite-dimensional partial integro-differential equations (PIDEs) on the space of measures, we provide semi-explicit solutions in closed-loop form of a class of master systems with hierarchical structure and non-quadratic cost, which are not covered in the earlier works. Recall that the non-quadratic costs allow for analysis other classes of higher risk terms, such as kurtosis [32,33]. The novelty of this paper mainly lies in the analysis of the effect of hierarchy and leadership on the solutions.

The rest of this article is structured as follows. We present the model setup in Section 2. Section 3 investigates the Nash equilibrium (no leader). Section 4 presents the Stackelberg solution. The multi-layer case is presented in Section 5. Numerical examples are presented in Section 6. Finally, concluding remarks are drawn in Section 7.

2. The Setup

There are

I \geq 2

decision-makers interacting within the time horizon

[t_{0}, t_{1}], t_{0} < t_{1} .

The set of decision-makers is denoted by

I = {1, 2, \dots, I} .

Decision-maker

i \in I

has a control action

u_{i} \in U_{i} = R .

The state x is driven by a Drift-Jump-Diffusion process of mean-field-type, given by

d x = b d t + σ d B + \int_{Θ} μ (., θ) \tilde{N} (d t, d θ), x (t_{0}) \sim m (t_{0}, .),

where

$Drift : b : [t_{0}, t_{1}] \times R \times \prod_{j = 1}^{I} U_{j} \times P (R) \to R,$
$Diffusion coefficient : σ : [t_{0}, t_{1}] \times R \times \prod_{j = 1}^{I} U_{j} \times P (R) \to R,$
$Brownian motion B,$
$Set of jump size : Θ = R_{+} \ {0},$
$Jump : N (d t, d θ),$
$Compensated jump : \tilde{N} (d t, d θ) = N (d t, d θ) - ν (d θ) d t,$
$Jump rate : μ : [t_{0}, t_{1}] \times R \times \prod_{j = 1}^{I} U_{j} \times P (R) \times Θ \to R,$

where

P (R)

denotes the set of probability measures on

R .

We assume that

x (t_{0}), B

and N are mutually independent. The performance functional of decision-maker i is

L_{i} (u, m_{0}) = h_{i} (x (t_{1}), m (t_{1})) + \int_{t_{0}}^{t_{1}} l_{i} (t, x, u, m) d t,

where

m (t, d y) = P_{x (t)} (d y)

is the probability measure of the state

x (t)

at time

t,

and

\begin{matrix} l_{i} : [t_{0}, t_{1}] \times R \times \prod_{j = 1}^{I} U_{j} \times P (R) \to R, \\ h_{i} : R \times P (R) \to R . \end{matrix}

In addition, each decision-maker is assumed to have a computational capability, such as being able to compute an aggregative term of m from the model. Let

U_{i}

be the set of control strategies of decision-maker i that are progressively measurable with respect to the filtration generated by the unions of events in

{B, N} .

2.1. Games with Polynomial Cost

We investigate the mean-field-type game with the following data:

\begin{matrix} t_{0} & = 0, t_{1} = T > 0, \\ l_{i} (t, x, u, m) & = q_{i} \frac{{(x - \bar{x})}^{2 k_{i}}}{2 k_{i}} + r_{i} \frac{{(u_{i} - {\bar{u}}_{i})}^{2 k_{i}}}{2 k_{i}} + c_{i} {(x - \bar{x})}^{2 k_{i} - 1} (u_{i} - {\bar{u}}_{i}) \\ + \sum_{j \in I \ {i}} ϵ_{i j} {(x - \bar{x})}^{2 (k_{i} - 1)} (u_{i} - {\bar{u}}_{i}) (u_{j} - {\bar{u}}_{j}) \end{matrix}

(1a)

\begin{matrix} + {\bar{q}}_{i} \frac{{\bar{x}}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}} + {\bar{r}}_{i} \frac{{\bar{u}}_{i}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}} + {\bar{c}}_{i} {\bar{x}}^{2 {\bar{k}}_{i} - 1} {\bar{u}}_{i} + \sum_{j \neq i} {\bar{ϵ}}_{i j} {\bar{x}}^{2 ({\bar{k}}_{i} - 1)} {\bar{u}}_{i} {\bar{u}}_{j}, \end{matrix}

(1b)

\begin{matrix} h_{i} (x, m) & = q_{i T} \frac{{(x_{T} - {\bar{x}}_{T})}^{2 k_{i}}}{2 k_{i}} + {\bar{q}}_{i T} \frac{{\bar{x}}_{T}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}}, \end{matrix}

(1c)

\begin{matrix} b (t, x, u, m) & = b_{1} (x - \bar{x}) + {\bar{b}}_{1} \bar{x} + \sum_{j \in I} [b_{2 j} (u_{j} - {\bar{u}}_{j}) + {\bar{b}}_{2 j} {\bar{u}}_{j}], \end{matrix}

(1d)

\begin{matrix} σ (t, x, u, m) & = (x - \bar{x}) \tilde{σ}, \end{matrix}

(1e)

\begin{matrix} μ (t, x, u, m, θ) & = (x - \bar{x}) \tilde{μ} (., θ), \end{matrix}

(1f)

\begin{matrix} \bar{x} (t) & = \int y m (t, d y), \end{matrix}

(1g)

\begin{matrix} {\bar{u}}_{i} (t) & = \int u_{i} (t, y, m) m (t, d y), i \in I, \end{matrix}

(1h)

where

k_{i} \geq 1, {\bar{k}}_{i} \geq 1,

are natural numbers, and the coefficients are time-dependent. The coefficient functions

q_{i}, r_{i},

{\bar{q}}_{i}

and

{\bar{r}}_{i}

are nonnegative functions, and

\begin{matrix} \int_{t_{0}}^{t_{1}} [{\tilde{σ}}^{2} (t) + \int_{Θ} ({(1 + \tilde{μ} (t, θ))}^{2 k_{i}} - 1 - 2 k_{i} \tilde{μ} (t, θ)) ν (d θ)] d t < + \infty . \end{matrix}

2.2. Hierarchical Leader Design and Algorithmic Approach

The hierarchical leadership design consists of finding the optimal number of hierarchical layers h and the non-empty subsets of players

I_{1}, \dots, I_{h}

, partitioning the set of all players as

I = ⋃_{k = 1}^{h} I_{k}, and if k \neq k^{'}, I_{k} \cap I_{k^{'}} = \emptyset .

The performance functional for the hierarchical design is the sum cost at the chosen hierarchical solution, that is,

inf_{h} inf_{(I_{1}, \dots, I_{h}) : \cup_{k = 1}^{h} I_{k} = I} S (h, I_{1}, \dots, I_{h}) .

Here, we take into consideration three main game scenarios, described as follows. First, the game has a unique layer, that is, a situation in which all the players select their strategies simultaneously. Second, the game is played in two layers (bi-level hierarchy). The players are grouped into two sets (

h = 2

): leaders, which are those who decide first, as well as simultaneously; and followers, which are those who react against the decision of the leaders. Third, the game is structured to take into account as many layers as the number of players (fully hierarchical configuration with

h = I

), that is, players select strategically in sequence one-by-one in I layers. For all configurations, let

L_{i}^{*}

denote the optimal cost of the player

i \in I

in the hierarchical mean-field-type game problem, and

S (h, I_{1}, \dots, I_{h}) = \sum_{i \in I} L_{i}^{*}

denotes the total (social) cost at the hierarchical solution. The hierarchical leadership design consists in determining the optimal leaders, followers, and/or number of layers, such that the total cost is minimized.

Notice that, for both the bi-level and fully hierarchical cases, there are multiple combinations for the players. In the bi-level scenario, the set of all possible sets of leaders is given by the power set

2^{I},

and any set of leaders is denoted by

I_{L} \subseteq 2^{I}

with the corresponding set of followers,

I_{F} = I \ I_{L}

. Regarding the fully hierarchical game, there are as many possibilities in the strategic ordering as permutations of the set of players

I

. All possible permutations of the players are considered.

For the bi-level case, the optimal set for leaders and followers is

\begin{matrix} I_{L}^{*} & \in arg min_{2^{I}} S (2, I_{1}, I_{2}), \\ I_{F}^{*} & = I \ I_{L}^{*} . \end{matrix}

On the other hand, for the fully hierarchical case, we have that the optimal permutation is

\begin{matrix} (I_{1}^{*}, \dots, I_{I}^{*}) & \in arg min_{I_{1}, \dots, I_{I}} S (I, I_{1}, \dots, I_{I}) . \end{matrix}

In this paper, we study the three aforementioned scenarios involving one, two, and I layers, as presented in Figure 1. We also present under which conditions all the three configurations have the same solution, that is, when the Nash solution coincides with the hierarchical solutions at different layers. Furthermore, we present numerical examples considering different levels of hierarchy. The problem addressed in this paper can be interpreted as a mechanism design that, instead of determining the appropriate cost functionals or utility functions to induce a desired output, we design the best hierarchical structure in order to reduce the overall social cost.

Remark 1

(Feasibility and Existence). The set of possible combinations for the layers/levels and players per level is non-empty and finite. Then, the optimal hierarchical leader design is feasible, and there exists an optimal solution (combination) such that the social cost is minimized.

Since the feasible set of possible combinations for the hierarchical configurations is non-empty and finite, then it is possible to find the best hierarchical structure by means of Algorithm 1. The main results evoked in the Algorithm 1, given by Propositions 1–3, are presented throughout the paper.

Algorithm 1: Finding the best hierarchical structure

According to the procedure in Algorithm 1, one of the main concerns in the leadership design problem is related to the dimensionality of the feasible set for the hierarchical structures (NP-hard problem). The total number of combinations is, given by the total number of ordered partitions from a set, where such total combinations are computed by means of the ordered Bell number

B : N \to N

—that is, for I players we have:

B (I) = \sum_{k = 0}^{I} \sum_{j = 0}^{k} {(- 1)}^{k - j} (\begin{matrix} k \\ j \end{matrix}) j^{n} .

For instance, if

I = 2

, then there are

B (2) = 3

possible leadership configurations, as shown in Figure 2; i

I = 3

, and then there are

B (3) = 13

possible leadership structures presented in Figure 3, and

B (4) = 75

,

B (5) = 541

, and

B (6) = 4683

. Figure 4 illustrates the rapid increment of the number of combinations as the decision-makers increase. Notice that it is not possible to have more levels than players in the hierarchical game (

h \leq I

). The following sections are devoted to the presentation of semi-explicit solutions for hierarchical mean-field-type games with different levels from one (Nash scenario) up to the number of players I (fully hierarchical scenario).

3. Nash Mean-Field-Type Equilibrium

The risk-neutral mean-field-type game is, given by

{(I, U_{i}, U_{i}, E [L_{i}])}_{i \in I} .

A risk-neutral Nash mean-field-type Equilibrium is a solution of the following fixed-point problem:

\begin{matrix} i \in I, \\ E [L_{i} (u^{*})] = inf_{u_{i} \in U_{i}} E [L_{i} (u_{1}^{*}, \dots, u_{i - 1}^{*}, u_{i}, u_{i + 1}^{*}, \dots, u_{I}^{*})] . \end{matrix}

Let

{\hat{V}}_{i} (t, m)

be the optimal cost-to-go from m at time

t \in (t_{0}, t_{1})

given the strategies of the others, that is,

\begin{matrix} {\hat{V}}_{i} (t, m) & = inf_{u_{i}} E [h_{i} (x (t_{1}), m (t_{1})) + \int_{t}^{t_{1}} l_{i} (t, x, u, m) d t^{'} | m (t) = m] . \end{matrix}

We say that

{\hat{V}}_{i, m} (t, x, m) : = {\hat{V}}_{i, m} (t, m) (x)

is a Gâteaux derivative of

{\hat{V}}_{i} (t, m)

, with respect to the measure m, if

lim_{τ \to 0} \frac{d}{d τ} {\hat{V}}_{i} (t, m + τ \tilde{m}) = \int {\hat{V}}_{i, m} (t, m) (x) \tilde{m} (d x) .

(2)

If

\int \tilde{m} (d x) = 0

, then adding a constant to

{\hat{V}}_{i, m} (t, x, m)

does not change the value of the integral in (2). For any scalar

λ

and

m \in P (R)

one has,

λ = λ \int m (d x) .

Thus,

λ

is also a Gâteaux-derivative of the constant function

λ .

However, in our problem, the term

{\hat{V}}_{i, x m}

, which is the gradient of

x \mapsto {\hat{V}}_{i, m} (t, x, m)

, will be used in the Hamiltonian, and

{\hat{V}}_{i, x m}

does not have the constant ambiguity. Let us denote the jump operator J as

J [ϕ_{i}] : = \int_{Θ} [ϕ_{i, m} (t^{-}, x + μ) - ϕ_{i, m} - μ ϕ_{i, x m}] ν (d θ) .

Let us introduce the integrand Hamiltonian as

\begin{matrix} H_{i} (t, x, m, {\hat{V}}_{m}, & {\hat{V}}_{x m}, {\hat{V}}_{x x m}) \\ = inf_{u_{i} \in U_{i}} \{l_{i} + b {\hat{V}}_{i, x m} + \frac{σ^{2}}{2} {\hat{V}}_{i, x x m} + \int_{Θ} [{\hat{V}}_{i, m} (t^{-}, x + μ) - {\hat{V}}_{i, m} - μ {\hat{V}}_{i, x m}] ν (d θ)\} . \end{matrix}

A sufficiency condition for a risk-neutral Nash equilibrium system is, given by the following PIDE system:

\begin{matrix} 0 & = {\hat{V}}_{i, t} (t, m) + \int H_{i} (t, x, m, {\hat{V}}_{m}, {\hat{V}}_{x m}, {\hat{V}}_{x x m}) m (d x), \end{matrix}

(3a)

\begin{matrix} {\hat{V}}_{i} (t_{1}, m) & = \int m (d y) h_{i} (y, m), i \in I . \end{matrix}

(3b)

We refer the reader to [35] for a derivation of this equilibrium system. The system (3) is an infinite-dimensional PIDE system in m and it provides the Nash equilibrium values of the mean-field-type game. Notice that from (3), the equilibrium strategies have the best response to the integrand Hamiltonian and can be expressed as functions of

t, x, m, {\hat{V}}_{i, m}, {\hat{V}}_{i, x m}, {\hat{V}}_{i, x x m} .

Next, we semi-explicitly provide the Nash mean-field-type equilibrium in linear state-and-mean -field feedback strategies. To do so, we use (3).

Proposition 1.

A risk-neutral Nash mean-field-type equilibrium is given in a semi-explicit way, as follows:

\begin{matrix} u_{i}^{n e} & = - η_{i} (x - \int y m (d y)) - {\bar{η}}_{i} \int y m (d y), \end{matrix}

(4a)

\begin{matrix} 0 & = - r_{i} η_{i}^{2 k_{i} - 1} - \sum_{j \neq i} ϵ_{i j} η_{j} + b_{2 i} α_{i} + c_{i}, \end{matrix}

(4b)

\begin{matrix} 0 & = - {\bar{r}}_{i} {\bar{η}}_{i}^{2 {\bar{k}}_{i} - 1} - \sum_{j \neq i} {\bar{ϵ}}_{i j} {\bar{η}}_{j} + {\bar{b}}_{2 i} {\bar{α}}_{i} + {\bar{c}}_{i}, \end{matrix}

(4c)

\begin{matrix} {\hat{V}}_{i} (t, m) & = α_{i} \int_{x} \frac{{(x - \int y m (d y))}^{2 k_{i}}}{2 k_{i}} m (d x) + {\bar{α}}_{i} \frac{{(\int y m (d y))}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}}, \\ 0 & = {\dot{α}}_{i} + q_{i} + r_{i} η_{i}^{2 k_{i}} - 2 k_{i} c_{i} η_{i} + 2 k_{i} \sum_{j \neq i} ϵ_{i j} η_{i} η_{j} + 2 k_{i} α_{i} [b_{1} - \sum_{j \in I} b_{2 j} η_{j}] + 2 k_{i} (2 k_{i} - 1) α_{i} \frac{1}{2} {\tilde{σ}}^{2} \end{matrix}

(4d)

\begin{matrix} + α_{i} \int_{Θ} [{(1 + \tilde{μ})}^{2 k_{i}} - 1 - 2 k_{i} \tilde{μ}] ν (d θ), \end{matrix}

(4e)

\begin{matrix} α_{i} (T) & = q_{i T}, \end{matrix}

(4f)

\begin{matrix} 0 & = {\dot{\bar{α}}}_{i} + {\bar{q}}_{i} + {\bar{r}}_{i} {\bar{η}}_{i}^{2 {\bar{k}}_{i}} - 2 {\bar{k}}_{i} {\bar{c}}_{i} {\bar{η}}_{i} + 2 {\bar{k}}_{i} \sum_{j \neq i} {\bar{ϵ}}_{i j} {\bar{η}}_{i} {\bar{η}}_{j} + 2 {\bar{k}}_{i} {\bar{α}}_{i} [{\bar{b}}_{1} - \sum_{j} {\bar{b}}_{2 j} {\bar{η}}_{j}], \end{matrix}

(4g)

\begin{matrix} {\bar{α}}_{i} (T) & = {\bar{q}}_{i T}, \end{matrix}

(4h)

for all

i \in I

with

\begin{matrix} \int y m (t, d y) & = [\int y m (0, d y)] e^{\int_{0}^{t} [{\bar{b}}_{1} - \sum_{j} {\bar{b}}_{2 j} {\bar{η}}_{j}] d t^{'}}, \end{matrix}

(4i)

whenever the above coefficient system admits a solution which does not escape within

[t_{0}, t_{1}]

.

Proof.

The proof is presented in Appendix A. □

The following Remark discusses the existence and uniqueness of the

η

terms in Proposition 1.

Remark 2.

The uniqueness of the coefficient system (4) in η requires a strong condition, that is,

0 = - r_{i} η_{i}^{2 k_{i} - 1} - \sum_{j \neq i} ϵ_{i j} η_{j} + b_{2 i} α_{i} + c_{i} .

Let I be an arbitrary integer and $k_{i} = k = 1$ , the system in η becomes linear and has a unique solution if, and only if the determinant of the matrix M is non-zero, with $M_{i i} = r_{i}$ and $M_{i j} = ϵ_{i j}, i \neq j .$ When the determinant is zero, the resulting control strategies become non-admissible and the costs become infinite.
For $k_{i} = k = 2$ , and $I = 2$ the system in η becomes a binary cubic polynomial, given by

$\begin{matrix} r_{1} η_{1}^{3} + ϵ_{12} η_{2} - b_{21} α_{1} - c_{1} & = 0, \\ r_{2} η_{2}^{3} + ϵ_{21} η_{1} - b_{22} α_{2} - c_{2} & = 0 . \end{matrix}$

For $ϵ_{12} = 0$ , there is a unique solution, given by

$η_{1} = {(\frac{b_{21} α_{1} + c_{1}}{r_{1}})}^{\frac{1}{3}}, η_{2} = {(\frac{- ϵ_{21} η_{1} + b_{22} α_{2} + c_{2}}{r_{2}})}^{\frac{1}{3}} .$

For $ϵ_{12} \neq 0$ , we derive from the first equation that

$η_{2} = \frac{- r_{1} η_{1}^{3} + b_{21} α_{1} + c_{1}}{ϵ_{12}} .$

By substituting it to the second equation, we arrive at

$r_{2} {(\frac{- r_{1} η_{1}^{3} + b_{21} α_{1} + c_{1}}{ϵ_{12}})}^{3} + ϵ_{21} η_{1} - b_{21} α_{1} - c_{1} = 0 .$

The latter equation is a polynomial of odd degree “9”. It has a unique real root in $η_{1}$ if its derivative has a constant sign. Its derivative is

$ϵ_{21} - 9 \frac{r_{1} r_{2}}{ϵ_{12}} η_{1}^{2} {(\frac{- r_{1} η_{1}^{3} + b_{21} α_{1} + c_{1}}{ϵ_{12}})}^{2} .$

It has a constant sign if $ϵ_{21}$ and $\frac{r_{1} r_{2}}{ϵ_{12}}$ have opposite signs. If $r_{1}$ and $r_{2}$ are positive, then the condition is reduced to

$ϵ_{21} ϵ_{12} \leq 0 .$
$I = 2$ and arbitrary $k_{i} \geq 1$ . Thus, a sufficiency condition is that $ϵ_{j i}$ and $(2 k_{i} - 1) (2 k_{j} - 1) \frac{r_{i} r_{j}}{ϵ_{i j}}$ have opposite signs. In particular if $k_{i} \geq 1, k_{j} \geq 1, r_{i} > 0, r_{j} > 0$ , then the condition reduces to

$ϵ_{i j} ϵ_{j i} \leq 0 .$
The same reasoning applies to the system in $\bar{η}$ , and has a unique real solution if

${\bar{ϵ}}_{i j} {\bar{ϵ}}_{j i} \leq 0 .$
For $I \geq 3$ decision-makers and arbitrary $k_{i} \geq 1$ , the system can be rewritten as a fixed-point equation which fulfils a contraction mapping condition if the norms of r and ϵ are sufficiently small. In this case, there is a unique solution.

In the next section, we investigate the bi-level case with multiple leaders and multiple followers.

4. Multiple Leaders and Multiple Followers

We consider the description in (1) in a bi-level hierarchical game with two and more leaders, that is,

| I_{L} | \geq 2,

and two and more followers, that is,

| I_{F} | \geq 2 .

We restrict our attention to the admissible strategies, which are Lipschitz, in the state

x .

Given the strategies of the leaders

{(u_{i})}_{i \in I_{L}} \in \prod_{i \in I_{L}} U_{i},

a risk-neutral best-response strategy of follower j is a strategy that solves

{inf}_{U_{j}} E L_{j} .

The set of risk-neutral best responses of j is denoted by

{rnBR}_{j} ({(u_{i})}_{i \in I_{L}}, {(u_{j^{'}})}_{j^{'} \in I_{F} \ {j}}) .

A mean-field-type risk-neutral Nash equilibrium among the followers given the first movers’ strategies

{(u_{i})}_{i \in I_{L}} \in \prod_{i \in I_{L}} U_{i},

is a strategy profile

(u_{j}, j \in I_{F})

of all followers, such that for every decision-maker

j \in I_{F},

u_{j} \in {rnBR}_{j} ({(u_{i})}_{i \in I_{L}}; {(u_{j^{'}}^{rn})}_{j^{'} \in I_{F} \ {j}}) .

The followers solve the following Nash game given the strategy of the leaders

{(u_{i})}_{i \in I_{L}}

, that is,

\begin{matrix} j \in I_{F} : \end{matrix}

\begin{matrix} 0 = {\hat{V}}_{j, t} (t, m) + \int H_{j}^{r} (x, m, {({\hat{V}}_{j^{'}, m}, {\hat{V}}_{j^{'}, x m}, {\hat{V}}_{j^{'}, x x m})}_{j^{'} \in I_{F}} | {(u_{i})}_{i \in I_{L}}) m (d x), \end{matrix}

(5a)

\begin{matrix} {\hat{V}}_{j} (t_{1}, m) = \int m (d y) h_{j} (y, m), \end{matrix}

(5b)

\begin{matrix} H_{j}^{r} = inf_{u_{j} \in U_{j}} \{l_{j} + b {\hat{V}}_{j, x m} + \frac{σ^{2}}{2} {\hat{V}}_{j, x x m} + J [{\hat{V}}_{j, m}] | {(u_{i})}_{i \in I_{L}}\} . \end{matrix}

(5c)

Then, the leaders solve the following PIDE system:

\begin{matrix} i & \in I_{L} : \end{matrix}

\begin{matrix} 0 & = {\hat{V}}_{i, t} (t, m) + \int H_{i}^{r} (x, m, {({\hat{V}}_{i^{'}, m}, {\hat{V}}_{i^{'}, x m}, {\hat{V}}_{i, x x m})}_{i^{'} \in I_{L} \cup I_{F}}) m (d x), \end{matrix}

(6a)

\begin{matrix} {\hat{V}}_{i} (t_{1}, m) & = \int m (d y) h_{i} (y, m), \end{matrix}

(6b)

\begin{matrix} H_{i}^{r} & = inf_{u_{i} \in U_{i}} \{l_{i} + b {\hat{V}}_{i, x m} + \frac{σ^{2}}{2} {\hat{V}}_{i, x x m} + J [{\hat{V}}_{i, m}] | {u_{j}^{*} (., {(u_{i})}_{i \in I_{L}})}_{j \in I_{F}}\} . \end{matrix}

(6c)

A minimizer of the integrand Hamiltonian

H_{i}^{r}

, denoted by

u_{i}^{s s} = u_{i}^{s s} (t, x, m, {({\hat{V}}_{i^{'}, m}, {\hat{V}}_{i^{'}, x m}, {\hat{V}}_{i^{'}, x x m})}_{i^{'} \in I_{L} \cup I_{F}}),

provides a candidate Stackelberg strategy of the leader i. A mean-field-type risk-neutral Stackelberg solution between multiple leaders and multiple followers is a strategy

({(u_{i}^{s s})}_{i \in I_{L}}, {(u_{j}^{s s})}_{j \in I_{F}})

of all decision-makers, such that

\begin{matrix} i & \in I_{L}, \\ u_{i}^{s s} & \in arg min_{u_{i} \in U_{i}} \{E L_{i} (x, u_{i}, {(u_{i^{'}}^{s s})}_{i \in I_{L} \ {i}}, {(u_{j}^{s s})}_{j \in I_{F}}) : u_{j}^{s s} \in {rnBR}_{j} ({(u_{i}^{s s})}_{i \in I_{L}}; {(u_{j^{'}}^{s s})}_{j^{'} \in I_{F} \ {j}}\}, \end{matrix}

and for every follower,

j \in I_{F}, u_{j}^{s s} \in {rnBR}_{j} ({(u_{i}^{s s})}_{i \in I_{L}}; {(u_{j^{'}}^{s s})}_{j^{'} \in I_{F} \ {j}}) .

The next result presents the Stackelberg mean-field-type solution involving several leaders and followers in a semi-explicit manner.

Proposition 2.

The risk-neutral Stackelberg mean-field-type solution with multiple leaders and multiple followers is given in a semi-explicit way, as follows:

\begin{matrix} u_{j}^{s s} & = - η_{j} (x - \int y m (d y)) - {\bar{η}}_{j} \int y m (d y), j \in I_{F}, \\ j & \in I_{F} : \\ 0 & = - r_{j} η_{j}^{2 k_{j} - 1} - \sum_{j^{'} \in I_{F} \ {j}} ϵ_{j j^{'}} η_{j^{'}} - \sum_{i \in I_{L}} ϵ_{j i} η_{i} + b_{2 j} α_{j} + c_{j}, \\ 0 & = - {\bar{r}}_{j} {\bar{η}}_{j}^{2 {\bar{k}}_{j} - 1} - \sum_{j^{'} \in I_{F} \ {j}} {\bar{ϵ}}_{j j^{'}} {\bar{η}}_{j^{'}} - \sum_{i \in I_{L}} {\bar{ϵ}}_{j i} {\bar{η}}_{i} + {\bar{b}}_{2 j} {\bar{α}}_{j} + {\bar{c}}_{j}, \\ i & \in I_{L} : \\ 0 & = - r_{i} η_{i}^{2 k_{i} - 1} - \sum_{i^{'} \in I_{L} \ {i}} ϵ_{i i^{'}} η_{i^{'}} - \sum_{j \in I_{F}} ϵ_{i j} η_{j} + b_{2 i} α_{i} + \sum_{j \in I_{F}} ϵ_{i j} η_{i} \frac{ϵ_{j i}}{(2 k_{j} - 1) r_{j} η_{j}^{2 k_{j} - 2}} \\ - \sum_{j \in I_{F}} b_{2 j} \frac{ϵ_{j i}}{(2 k_{j} - 1) r_{j} η_{j}^{2 k_{j} - 2}} α_{i} + c_{i}, \\ 0 & = - {\bar{r}}_{i} {\bar{η}}_{i}^{2 {\bar{k}}_{i} - 1} - \sum_{i^{'} \in I_{L} \ {i}} {\bar{ϵ}}_{i i^{'}} {\bar{η}}_{i^{'}} - \sum_{j \in I_{F}} {\bar{ϵ}}_{i j} {\bar{η}}_{j} + {\bar{b}}_{2 i} {\bar{α}}_{i} + \sum_{j \in I_{F}} {\bar{ϵ}}_{i j} {\bar{η}}_{i} \frac{{\bar{ϵ}}_{j i}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j} {\bar{η}}_{j}^{2 {\bar{k}}_{j} - 2}} \\ - \sum_{j \in I_{F}} {\bar{b}}_{2 j} \frac{{\bar{ϵ}}_{j i}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j} {\bar{η}}_{j}^{2 {\bar{k}}_{j} - 2}} {\bar{α}}_{i} + {\bar{c}}_{i}, \end{matrix}

(7a)

and

\begin{matrix} {\hat{V}}_{i} & (0, m) = α_{i} (0) \int_{x} \frac{{(x - \int y m_{0} (d y))}^{2 k_{i}}}{2 k_{i}} m_{0} (d x) + {\bar{α}}_{i} (0) \frac{{(\int y m_{0} (d y))}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}}, \\ 0 & = {\dot{α}}_{i} + q_{i} + r_{i} η_{i}^{2 k_{i}} - 2 k_{i} c_{i} η_{i} + 2 k_{i} \sum_{i^{'} \in I_{L} \ {i}} ϵ_{i i^{'}} η_{i} η_{i^{'}} + 2 k_{i} \sum_{j \in I_{F}} ϵ_{i j} η_{i} η_{j} \\ + 2 k_{i} [b_{1} - \sum_{i^{'} \in I_{L}} b_{2 i^{'}} η_{i^{'}} - \sum_{j \in I_{F}} b_{2 j} η_{j}] α_{i} + 2 k_{i} (2 k_{i} - 1) α_{i} \frac{1}{2} {\tilde{σ}}^{2} \end{matrix}

(7b)

\begin{matrix} + α_{i} \int_{Θ} [{(1 + \tilde{μ})}^{2 k_{i}} - 1 - 2 k_{i} \tilde{μ}] ν (d θ), \end{matrix}

(7c)

\begin{matrix} α_{i} & (T) = q_{i T}, \\ 0 & = {\dot{\bar{α}}}_{i} + {\bar{q}}_{i} + {\bar{r}}_{i} {\bar{η}}_{i}^{2 {\bar{k}}_{i}} - 2 {\bar{k}}_{i} {\bar{c}}_{i} {\bar{η}}_{i} + 2 {\bar{k}}_{i} \sum_{i^{'} \in I_{L} \ {i}} {\bar{ϵ}}_{i i^{'}} {\bar{η}}_{i} {\bar{η}}_{i^{'}} + 2 {\bar{k}}_{i} \sum_{j \in I_{F}} {\bar{ϵ}}_{i j} {\bar{η}}_{i} {\bar{η}}_{j} \end{matrix}

(7d)

\begin{matrix} + 2 {\bar{k}}_{i} [{\bar{b}}_{1} - \sum_{i^{'} \in I_{L}} {\bar{b}}_{2 i^{'}} {\bar{η}}_{i^{'}} - \sum_{j \in I_{F}} {\bar{b}}_{2 j} {\bar{η}}_{j}] {\bar{α}}_{i}, \end{matrix}

(7e)

\begin{matrix} {\bar{α}}_{i} & (T) = {\bar{q}}_{i T}, \end{matrix}

(7f)

with

\begin{matrix} \int y m (t, d y) & = [\int y m (0, d y)] e^{\int_{0}^{t} [{\bar{b}}_{1} - \sum_{j} {\bar{b}}_{2 j} {\bar{η}}_{j}] d t^{'}}, \end{matrix}

(7g)

whenever the above coefficient system admits a unique solution.

Proof.

The proof is presented in Appendix A. □

Remark 3.

Clearly, the mean-field-type Nash equilibrium in (4) differs from the Stackelberg solution in (7) when the

ϵ_{i j}

are non-zero.

4.1. No Control-Coupling within Classes

It follows from (7) that, for

ϵ_{j j^{'}} = 0 = {\bar{ϵ}}_{j j^{'}}

for

(j, j^{'}) \in I_{F}^{2},

the term

η_{j}

is explicitly, given by

η_{j} = {\{\frac{- \sum_{i \in I_{L}} ϵ_{j i} η_{i} + b_{2 j} α_{j} + c_{j}}{r_{j}}\}}^{\frac{1}{2 k_{j} - 1}},

and

{\bar{η}}_{j} = {\{\frac{- \sum_{i \in I_{L}} {\bar{ϵ}}_{j i} {\bar{η}}_{i} + {\bar{b}}_{2 j} {\bar{α}}_{j} + {\bar{c}}_{j}}{{\bar{r}}_{j}}\}}^{\frac{1}{2 {\bar{k}}_{j} - 1}} .

4.1.1. No Leader and All Followers

In this case, there is no leader. All decision-makers are followers. This case is similar to the model proposed in the Nash game above. The solution is given by (4).

4.1.2. One Leader and Multiple Followers

There is a unique leader in

I_{L}

, and the remaining decision-makers in

I_{F}

are followers.

I = I_{L} \cup I_{F} .

We assume that the leader (decision-maker

1 \in I_{L}

) uses a state- and mean-field-type feedback strategy

u_{1} (t, x, m)

and each of the followers (decision-maker

j \in I_{F}

) finds a state- and mean-field-type feedback strategy

u_{j} (t, x, m, u_{1})

given

u_{1} .

The followers solve a Nash game given the strategy of the leader

u_{1} .

4.1.3. Multiple Leaders and One Follower

Since there is only one follower, the reaction set of the follower will be computed given the strategies of the leaders.

4.1.4. All Leaders and No Follower

In this case, there is no follower. All decision-makers are leaders. In terms of the information structure, this case is similar to the model proposed in the Nash game above. The solution is given by (4).

5. Fully Hierarchical Game

In the previous sections, we had only bi-level game problems. In this section, we make as many levels as the number of decision-makers. There are

| I |

hierarchical levels. At each layer

i,

decision-maker i chooses a control strategy

u_{i}

knowing the control strategy of the preceding decision-makers, that is,

{i - 1, \dots, 1} .

This becomes a sequential decision-making problem. We use a backward induction method to solve the hierarchical game problem. This means that the decision-making problem at the last layer I, which is the reaction of decision-maker I, can be seen as a mean-field-type control problem. This is because at the

i -

th level, the strategies

{(u_{i^{'}})}_{i^{'} \in {1, \dots, i - 1}}

are already known by decision-maker

i .

The Proposition 1 next presents the multi-level hierarchical-structure solution in the context of mean-field-type games in a semi-explicit manner.

Proposition 1.

The risk-neutral

I -

level hierarchical mean-field-type solution is given in a semi-explicit way, as follows:

\begin{matrix} u_{i}^{h s} & = - η_{i} (x - \int y m (d y)) - {\bar{η}}_{i} \int y m (d y), i \in I, \end{matrix}

(8a)

\begin{matrix} {\hat{V}}_{i} & (0, m) = α_{i} (0) \int_{x} \frac{{(x - \int y m_{0} (d y))}^{2 k_{i}}}{2 k_{i}} m_{0} (d x) + {\bar{α}}_{i} (0) \frac{{(\int y m_{0} (d y))}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}}, \end{matrix}

(8b)

with

\begin{matrix} \int y m (t, d y) & = [\int y m (0, d y)] e^{\int_{0}^{t} [{\bar{b}}_{1} - \sum_{j} {\bar{b}}_{2 j} {\bar{η}}_{j}] d t}, \end{matrix}

(8c)

where the coefficient functions are, given by

\begin{matrix} L e v e l & 1 : \\ 0 & = - r_{1} η_{1}^{2 k_{1} - 1} + c_{1} - \sum_{j = 2}^{I} ϵ_{1, j} η_{j} + \sum_{j = 2}^{I} ϵ_{1, j} η_{i} \frac{ϵ_{j i}}{(2 k_{j} - 1) r_{j}} η_{j}^{- 2 (k_{j} - 1)} \\ + [b_{2, 1} - \sum_{j = 2}^{I} b_{2 j} \frac{ϵ_{j 1}}{(2 k_{j} - 1) r_{j}} η_{j}^{- 2 (k_{j} - 1)}] α_{1}, \\ 0 & = {\dot{α}}_{1} + q_{1} + r_{1} η_{1}^{2 k_{1}} - 2 k_{1} c_{1} η_{1} + 2 k_{1} \sum_{j = 2}^{I} ϵ_{1 j} η_{1} η_{j} + 2 k_{1} {b_{1} - b_{21} η_{1} - \sum_{j = 2}^{I} b_{2 j} η_{j}} α_{1} \\ + 2 k_{1} (2 k_{1} - 1) α_{1} \frac{1}{2} {\tilde{σ}}^{2} + α_{1} \int_{Θ} [{(1 + \tilde{μ})}^{2 k_{1}} - 1 - 2 k_{1} \tilde{μ}] ν (d θ), \\ α_{1} (T) & = q_{1 T}, \\ 0 & = - {\bar{r}}_{1} {\bar{η}}_{1}^{2 {\bar{k}}_{1} - 1} + {\bar{c}}_{1} - \sum_{j = 2}^{I} {\bar{ϵ}}_{1, j} {\bar{η}}_{j} + \sum_{j = 2}^{I} {\bar{ϵ}}_{1, j} {\bar{η}}_{1} \frac{{\bar{ϵ}}_{j 1}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j}} {\bar{η}}_{j}^{- 2 ({\bar{k}}_{j} - 1)} \\ + [{\bar{b}}_{21} - \sum_{j = 2}^{I} {\bar{b}}_{2 j} \frac{{\bar{ϵ}}_{j 1}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j}} {\bar{η}}_{j}^{- 2 ({\bar{k}}_{j} - 1)}] {\bar{α}}_{1}, \\ 0 & = {\dot{\bar{α}}}_{1} + {\bar{q}}_{1} + {\bar{r}}_{1} {\bar{η}}_{1}^{2 {\bar{k}}_{1}} - 2 {\bar{k}}_{1} {\bar{c}}_{1} {\bar{η}}_{1} + 2 {\bar{k}}_{1} \sum_{j = 2}^{I} {\bar{ϵ}}_{1 j} {\bar{η}}_{i} {\bar{η}}_{j} + 2 {\bar{k}}_{1} {{\bar{b}}_{1} - {\bar{b}}_{21} {\bar{η}}_{1} - \sum_{j = 2}^{I} {\bar{b}}_{2 j} {\bar{η}}_{j}} {\bar{α}}_{1}, \\ {\bar{α}}_{1} (T) & = {\bar{q}}_{1 T} . \\ L e v e l & i : \\ 0 & = - r_{i} η_{i}^{2 k_{i} - 1} + c_{i} - \sum_{i^{'} = 1}^{i - 1} ϵ_{I - 1, i^{'}} η_{i^{'}} - \sum_{j = i + 1}^{I} ϵ_{i, j} η_{j} + \sum_{j = i + 1}^{I} ϵ_{i, j} η_{i} \frac{ϵ_{j i}}{(2 k_{j} - 1) r_{j}} η_{j}^{- 2 (k_{j} - 1)} \\ + [b_{2 i} - \sum_{j = i + 1}^{I} b_{2 j} \frac{ϵ_{j i}}{(2 k_{j} - 1) r_{j}} η_{j}^{- 2 (k_{j} - 1)}] α_{i}, \\ 0 & = {\dot{α}}_{i} + q_{i} + r_{i} η_{i}^{2 k_{i}} - 2 k_{i} c_{i} η_{i} + 2 k_{i} \sum_{i^{'} = 1}^{i - 1} ϵ_{i i^{'}} η_{i} η_{i^{'}} + 2 k_{i} \sum_{j = i + 1}^{I} ϵ_{i j} η_{i} η_{j} \\ + 2 k_{i} {b_{1} - \sum_{i^{'} = 1}^{i - 1} b_{2 i^{'}} η_{i^{'}} - b_{2 i} η_{i} - \sum_{j = i + 1}^{I} b_{2 j} η_{j}} α_{i} + 2 k_{i} (2 k_{i} - 1) α_{i} \frac{1}{2} {\tilde{σ}}^{2} \\ + α_{i} \int_{Θ} [{(1 + \tilde{μ})}^{2 k_{i}} - 1 - 2 k_{i} \tilde{μ}] ν (d θ), \\ α_{i} (T) & = q_{i T}, \\ 0 & = - {\bar{r}}_{i} {\bar{η}}_{i}^{2 {\bar{k}}_{i} - 1} + {\bar{c}}_{i} - \sum_{i^{'} = 1}^{i - 1} {\bar{ϵ}}_{I - 1, i^{'}} {\bar{η}}_{i^{'}} - \sum_{j = i + 1}^{I} {\bar{ϵ}}_{i, j} {\bar{η}}_{j} + \sum_{j = i + 1}^{I} {\bar{ϵ}}_{i, j} {\bar{η}}_{i} \frac{{\bar{ϵ}}_{j i}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j}} {\bar{η}}_{j}^{- 2 ({\bar{k}}_{j} - 1)} \\ + [{\bar{b}}_{2 i} - \sum_{j = i + 1}^{I} {\bar{b}}_{2 j} \frac{{\bar{ϵ}}_{j i}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j}} {\bar{η}}_{j}^{- 2 ({\bar{k}}_{j} - 1)}] {\bar{α}}_{i}, \\ 0 & = {\dot{\bar{α}}}_{i} + {\bar{q}}_{i} + {\bar{r}}_{i} {\bar{η}}_{i}^{2 {\bar{k}}_{i}} - 2 {\bar{k}}_{i} {\bar{c}}_{i} {\bar{η}}_{i} + 2 {\bar{k}}_{i} \sum_{i^{'} = 1}^{i - 1} {\bar{ϵ}}_{i i^{'}} {\bar{η}}_{i} {\bar{η}}_{i^{'}} + 2 {\bar{k}}_{i} \sum_{j = i + 1}^{I} {\bar{ϵ}}_{i j} {\bar{η}}_{i} {\bar{η}}_{j} \\ + 2 {\bar{k}}_{i} [{\bar{b}}_{1} - \sum_{i^{'} = 1}^{i - 1} {\bar{b}}_{2 i^{'}} {\bar{η}}_{i^{'}} - {\bar{b}}_{2 i} {\bar{η}}_{i} - \sum_{j = i + 1}^{I} {\bar{b}}_{2 j} {\bar{η}}_{j}] {\bar{α}}_{i}, \\ {\bar{α}}_{i} (T) & = {\bar{q}}_{i T} . \\ L e v e l & I : \\ η_{I} & = {(\frac{- \sum_{j = 1}^{I - 1} ϵ_{I, j} η_{j} + b_{2 I} α_{I} + c_{I}}{r_{I}})}^{\frac{1}{2 k_{I} - 1}}, \\ 0 & = {\dot{α}}_{I} + q_{I} + r_{I} η_{I}^{2 k_{I}} - 2 k_{I} c_{I} η_{I} + 2 k_{I} \sum_{i^{'} = 1}^{I - 1} ϵ_{I i^{'}} η_{I} η_{i^{'}} + 2 k_{I} {b_{1} - \sum_{i^{'} = 1}^{I - 1} b_{2 i^{'}} η_{i^{'}} - b_{2 I} η_{I}} α_{I} \\ + 2 k_{I} (2 k_{I} - 1) α_{I} \frac{1}{2} {\tilde{σ}}^{2} + α_{I} \int_{Θ} [{(1 + \tilde{μ})}^{2 k_{I}} - 1 - 2 k_{I} \tilde{μ}] ν (d θ), \\ α_{I} (T) & = q_{I T}, \\ {\bar{η}}_{I} & = {(\frac{- \sum_{j = 1}^{I - 1} {\bar{ϵ}}_{I, j} {\bar{η}}_{j} + {\bar{b}}_{2 I} {\bar{α}}_{I} + {\bar{c}}_{I}}{{\bar{r}}_{I}})}^{\frac{1}{2 {\bar{k}}_{I} - 1}}, \\ 0 & = {\dot{\bar{α}}}_{I} + {\bar{q}}_{I} + {\bar{r}}_{I} {\bar{η}}_{I}^{2 {\bar{k}}_{I}} - 2 {\bar{k}}_{I} {\bar{c}}_{I} {\bar{η}}_{I} + 2 {\bar{k}}_{I} \sum_{i^{'} = 1}^{I - 1} {\bar{ϵ}}_{I i^{'}} {\bar{η}}_{I} {\bar{η}}_{i^{'}} + 2 {\bar{k}}_{I} {{\bar{b}}_{1} - \sum_{i^{'} = 1}^{i - 1} {\bar{b}}_{2 i^{'}} {\bar{η}}_{i^{'}} - {\bar{b}}_{2 I} {\bar{η}}_{I}} {\bar{α}}_{I}, \\ {\bar{α}}_{I} (T) & = {\bar{q}}_{I T}, \end{matrix}

whenever these equations admit a solution.

Proof.

This proof is presented in Appendix A. □

From the analysis above, the following remarks are in order:

For $ϵ_{i j} \neq 0, {\bar{ϵ}}_{i j} \neq 0,$ the order of the play matters because of the informational difference between the decision-makers at different levels of hierarchy in (8). One open question that we leave for future investigation is: How to determine the optimal ordering among all permutations of heterogenous decision-makers?
When all the $ϵ_{i j}$ and ${\bar{ϵ}}_{i j}$ are zero, the Nash equilibrium coincides with the bi-level solution, which coincides with any level of hierarchical solution. The order of the play and the informational difference do not generate an extra advantage for the first mover in this particular case. Consequently, the hierarchical leader design is only performed when the parameters $ϵ_{i j} \neq 0, {\bar{ϵ}}_{i j} \neq 0$ .

6. Numerical Investigation

In this section, we perform some numerical examples in order to analyze two main scenarios. We study the effect of the number of leaders on the total cost for both homogeneous and heterogeneous scenarios, and we investigate the effect of the hierarchical structure considering a heterogeneous scenario.

6.1. Effect of the Number of Leaders on the Total Cost

We investigate the effect of the number of leaders on the total performance of the system. The total cost at the Stackelberg solution is

S (I_{L}, m_{0}) = \sum_{i \in I_{L}} {\hat{V}}_{i} (0, m_{0}) + \sum_{j \in I_{F}} {\hat{V}}_{j} (0, m_{0}) .

For

m_{0} = δ_{x_{0}},

and

{\bar{k}}_{i} = \bar{k} \geq 1,

the total cost is

S (I_{L}, m_{0}) = (\sum_{i \in I_{L}} {\bar{α}}_{i} (0) + \sum_{j \in I_{F}} {\bar{α}}_{j} (0)) \frac{x_{0}^{2 \bar{k}}}{2 \bar{k}} .

6.1.1. Uniform Coupling and Homogeneous Players

When all other parameters are identical across the players except their role,

S (I_{L}, m_{0})

can be expressed as a function

| I_{L} | .

It follows from (7) that

\begin{matrix} χ & : = | I_{L} |, \\ 0 & = - \bar{r} {({\bar{η}}^{f o})}^{2 \bar{k} - 1} - (| I | - χ - 1) \bar{ϵ} {\bar{η}}^{f o} - χ \bar{ϵ} {\bar{η}}^{l e a d} + {\bar{b}}_{2} {\bar{α}}^{f o} + c, \\ 0 & = - \bar{r} {({\bar{η}}^{l e a d})}^{2 \bar{k} - 1} - (χ - 1) \bar{ϵ} {\bar{η}}^{l e a d} - (| I | - χ) \bar{ϵ} {\bar{η}}^{f o} + {\bar{b}}_{2} {\bar{α}}^{l e a d} + \bar{c} + \frac{\bar{ϵ} (| I | - χ) (\bar{ϵ} {\bar{η}}^{l e a d} - {\bar{α}}^{l e a d} {\bar{b}}_{2})}{(2 \bar{k} - 1) \bar{r} {({\bar{η}}^{f o})}^{2 \bar{k} - 2}}, \\ {\bar{α}}^{l e a d} (t_{0}) & = {\bar{q}}_{t_{1}} + \int_{t_{0}}^{t_{1}} {\bar{q} + \bar{r} {({\bar{η}}^{l e a d})}^{2 \bar{k}} - 2 \bar{k} \bar{c} {\bar{η}}^{l e a d} + 2 \bar{k} \bar{ϵ} {\bar{η}}^{l e a d} [(χ - 1) {\bar{η}}^{l e a d} + (| I | - χ) {\bar{η}}^{f o}] \\ + 2 \bar{k} {\bar{α}}^{l e a d} [{\bar{b}}_{1} - {\bar{b}}_{2} {\bar{η}}^{l e a d} χ - {\bar{b}}_{2} {\bar{η}}^{f o} (| I | - χ)]} d t \\ {\bar{α}}^{f o} (t_{0}) & = {\bar{q}}_{t_{1}} + \int_{t_{0}}^{t_{1}} {\bar{q} + \bar{r} {({\bar{η}}^{f o})}^{2 \bar{k}} - 2 \bar{k} \bar{c} {\bar{η}}^{f o} + 2 \bar{k} \bar{ϵ} {\bar{η}}^{f o} [(| I | - χ - 1) {\bar{η}}^{f o} + χ {\bar{η}}^{l e a d}] \\ + 2 \bar{k} {\bar{α}}^{f o} [{\bar{b}}_{1} - {\bar{b}}_{2} {\bar{η}}^{l e a d} χ - {\bar{b}}_{2} {\bar{η}}^{f o} (| I | - χ)]} d t . \end{matrix}

The optimal number of leaders is, given by

| I_{L} | \in arg min_{χ} [χ {\bar{α}}^{l e a d} (0) + (| I | - χ) {\bar{α}}^{f o} (0)],

where

\bar{α}

depends on

χ

as well. We observe that the latter function is not necessarily monotone in

χ = | I_{L} | .

This means that increasing the number of leaders in the interaction does not necessarily improve the total performance of the system.

We numerically investigate

S (| I_{L} |, δ_{x_{0}})

as a function of

χ = | I_{L} |

for

| I | = 6 .

Let us consider a symmetric six-player game problem involving the parameters presented here:

\begin{matrix} {\bar{c}}_{i} & = \bar{c} = 0, \forall i \in I, & {\bar{k}}_{i} & = \bar{k} = 1, \forall i \in I, \\ {\bar{ϵ}}_{i} & = \bar{ϵ} = 1, \forall i \in I, & b_{2 i} & = b_{2} = 0.1, \forall i \in I, \\ {\bar{b}}_{2 i} & = {\bar{b}}_{2} = 0.5, \forall i \in I, & {\bar{r}}_{i} & = \bar{r} = 2, \forall i \in I, \\ {\bar{q}}_{i} & = \bar{q} = 1, \forall i \in I, & {\bar{q}}_{i T} & = {\bar{q}}_{T} = 2, \forall i \in I, \\ T & = 0.1 . \end{matrix}

Figure 5 presents the evolution of both

{\dot{\bar{α}}}^{leader}

and

{\dot{\bar{α}}}^{follower}

for a different number of leaders

| I_{L} |

. Notice that the initial values

{\bar{α}}_{0}^{leader}

and

{\bar{α}}_{0}^{follower}

determine the optimal cost considering that

\begin{matrix} {\bar{x}}_{0} = \bar{x} (0) = \int y m (0, d y) = \int y δ_{x_{0}} (d y) = x_{0} . \end{matrix}

(9)

Figure 5 and Table 1 also show that, under the considered parameters, the lowest total cost is obtained when

| I_{L} | = 2,

corresponding to a cost

S (| I_{L} |, δ_{x_{0}}) = 7.911

. These results offer an insight into the game’s structural design for the sake of either individual or total costs. We observe that having only one leader is suboptimal for the total cost. Having too many leaders (where the majority of the decision-makers are leaders) is not suboptimal for the total cost. In this setting, there is a tradeoff between leaders and followers, so that the system’s cost gets balanced.

6.1.2. Uniform Coupling and Heterogeneous Players

Now we investigate the two-layer case with uniform coupling, that is,

{\bar{ϵ}}_{i j} = 0.1

, for all combinations

i, j \in I

and for the heterogeneous case with

| I | = 3

. We consider the following parameters:

\begin{matrix} b_{21} & = 0.1, & b_{22} & = 0.2, & b_{23} & = 0.3, \\ {\bar{b}}_{21} & = 0.5, & {\bar{b}}_{22} & = 0.6, & {\bar{b}}_{23} & = 0.7, \\ {\bar{r}}_{1} & = 2, & {\bar{r}}_{2} & = 2.1, & {\bar{r}}_{3} & = 2.2, \\ {\bar{q}}_{1} & = 1, & {\bar{q}}_{2} & = 2, & {\bar{q}}_{3} & = 3, \\ {\bar{q}}_{1 T} & = 4, & {\bar{q}}_{2 T} & = 6, & {\bar{q}}_{3 T} & = 8, \\ {\bar{b}}_{1} & = 2, & T & = 1, & {\bar{k}}_{i} & = \bar{k} = 1, \forall i \in I, \end{matrix}

Figure 6 shows the evolution of

{\bar{α}}_{1}, {\bar{α}}_{2},

and

{\bar{α}}_{3}

for the different topologies presented in Table 2. It can be seen in Figure 7 that all the structures return a close value for the total cost. However, Table 2 shows that the best topology is the last one, where the third player acts as the unique leader assuming an initial condition, such that (10) holds.

6.2. Impact of the Hierarchical Structures

Here, we analyze the impact on the order of the strategic selection, that is, the hierarchical order on the heterogeneous case with

| I | = 3

. We consider the following heterogeneous parameters:

\begin{matrix} b_{21} & = 0.1, & b_{22} & = 0.2, & b_{23} & = 0.3, \\ {\bar{b}}_{21} & = 0.4, & {\bar{b}}_{22} & = 0.5, & {\bar{b}}_{23} & = 0.6, \\ {\bar{r}}_{1} & = 1, & {\bar{r}}_{2} & = 2, & {\bar{r}}_{3} & = 3, \\ {\bar{q}}_{1} & = 1.1, & {\bar{q}}_{2} & = 1.2, & {\bar{q}}_{3} & = 1.3, \\ {\bar{q}}_{1 T} & = 2.1, & {\bar{q}}_{2 T} & = 2.2, & {\bar{q}}_{3 T} & = 2.3, \\ {\bar{b}}_{1} & = 2, & T & = 0.1, & {\bar{k}}_{i} & = \bar{k} = 1, \forall i \in I, \end{matrix}

and

\begin{matrix} \bar{ϵ} = (\begin{matrix} 1 & 1.2 & 1.1 \\ 1.5 & 1 & 1.6 \\ 1.3 & 1.4 & 1 \end{matrix}) . \end{matrix}

Table 3 shows the summary of the total costs for the six different possible hierarchical orders assuming an initial condition, such that (10) holds. It can be seen that the third configuration is the best to minimize the total cost. Moreover, Figure 8 presents the evolution of the equations

\sum_{j \in I} {\dot{\bar{α}}}_{j} (t)

for all the possible structures.

7. Conclusions

In this paper, we have examined multi-layer hierarchical mean-field-type games with non-quadratic polynomial costs. We derived hierarchical mean-field-type solutions in linear state- and mean-field feedback form by using a partial integro-differential system, and also established the relationship between the Nash and the hierarchical solutions. Furthermore, we studied the impact of the number of leaders on a bi-level Stackelberg problem for both symmetric and non-symmetric scenarios. In addition, we have shown that the number of layers, permutations of the decision-makers per layer, and their identity significantly affect the total cost of the system. We have also numerically shown that the ordering among all permutations of heterogenous decision-makers may reduce the cost by a significant proportion, depending on the horizon. One open question that we leave for future investigation is to find, theoretically, the optimal ordering among all permutations of heterogenous decision-makers, and to examine the benefits/costs of structure design and leadership.

Author Contributions

All authors have equally contributed. All authors have read and agreed to the published version of the manuscript.

Funding

U.S. Air Force Office of Scientific Research under grant number FA9550-17-1-0259.

Acknowledgments

We gratefully acknowledge support from U.S. Air Force Office of Scientific Research under grant number FA9550-17-1-0259.

Conflicts of Interest

There is no conflict of interest.

Appendix A

Proof of Proposition 1.

Under the assumption of perfect state observation and perfect knowledge of the model, a sufficiency condition for equilibrium is, given by the PIDE system (3). We aim to solve (3). To do so, we start with the following guess functional of decision-maker i as

\begin{matrix} {\hat{V}}_{i} (t, m) & = α_{i} (t) \int_{x} \frac{{(x - \int y m (d y))}^{2 k_{i}}}{2 k_{i}} m (d x) + {\bar{α}}_{i} (t) \frac{{(\int y m (d y))}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}}, \end{matrix}

where the coefficient functions

α_{i}

and

{\bar{α}}_{i}

need to be determined. Notice that, for

k_{i} = 1

, the functional

{\hat{V}}_{i} (t, m)

becomes a mean-variance-dependent functional, and for an arbitrary parameter

k_{i}

, the functional may support higher order moments. We compute the key terms

{\hat{V}}_{i, m} (t, m)

,

{\hat{V}}_{i, x m} (t, m)

,

{\hat{V}}_{i, x x m} (t, m) .

\begin{matrix} {\hat{V}}_{i, m} (t, m) & = - α_{i} x \int {(y - \int z m (d z))}^{2 k_{i} - 1} m (d y) + α_{i} \frac{{(x - \int y m (d y))}^{2 k_{i}}}{2 k_{i}} \end{matrix}

\begin{matrix} + {\bar{α}}_{i} x {(\int y m (d y))}^{2 k_{i} - 1}, \\ {\hat{V}}_{i, x m} (t, m) & = - α_{i} \int {(y - \int z m (d z))}^{2 k_{i} - 1} m (d y) + α_{i} {(x - \int y m (d y))}^{2 k_{i} - 1} \end{matrix}

(A1a)

\begin{matrix} + {\bar{α}}_{i} {(\int y m (d y))}^{2 k_{i} - 1}, \end{matrix}

(A1b)

\begin{matrix} {\hat{V}}_{i, x x m} (t, m) & = (2 k_{i} - 1) α_{i} {(x - \int y m (d y))}^{2 (k_{i} - 1)}, \end{matrix}

(A1c)

\begin{matrix} {\hat{V}}_{i, m} (t, m) (x + μ) & - {\hat{V}}_{i, m} (t, m) (x) - μ {\hat{V}}_{i, x m} (t, m) (x) = α_{i} \frac{{(x - \int y m (d y))}^{2 k_{i}}}{2 k_{i}} [{(1 + \tilde{μ})}^{2 k_{i}} - 1 - 2 k_{i} \tilde{μ}] + \tilde{ϵ}, \end{matrix}

(A1d)

with

\int \tilde{ϵ} m (d y) = 0 .

The Integrand Hamiltonian is strictly convex in

(u_{i} - {\bar{u}}_{i}, {\bar{u}}_{i})

. The optimal control strategy is the unique minimizer of

\begin{matrix} r_{i} \frac{{(u_{i} - {\bar{u}}_{i})}^{2 k_{i}}}{2 k_{i}} + c_{i} {(x - \bar{x})}^{2 k_{i} - 1} (u_{i} - {\bar{u}}_{i}) + \sum_{j \neq i} ϵ_{i j} {(x - \bar{x})}^{2 (k_{i} - 1)} (u_{i} - {\bar{u}}_{i}) (u_{j} - {\bar{u}}_{j}) \\ + [{\hat{V}}_{i, x m} (t, m) - \int {\hat{V}}_{i, x m} (t, m) (x) m (d x)] \sum_{j \in I} b_{2 j} (u_{j} - {\bar{u}}_{j}) + {\bar{r}}_{i} \frac{{\bar{u}}_{i}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}} + {\bar{c}}_{i} {\bar{x}}^{2 {\bar{k}}_{i} - 1} {\bar{u}}_{i} + \sum_{j \neq i} {\bar{ϵ}}_{i j} {\bar{x}}^{2 ({\bar{k}}_{i} - 1)} {\bar{u}}_{i} {\bar{u}}_{j} \\ + [\int {\hat{V}}_{i, x m} (t, m) (x) m (d x)] \sum_{j} {\bar{b}}_{2 j} {\bar{u}}_{j} . \end{matrix}

(A2)

By strictly convexity and by orthogonality between

(u_{i} - {\bar{u}}_{i})

and

{\bar{u}}_{i}

the following condition system holds:

\begin{matrix} i & \in I, \\ 0 & = r_{i} {(u_{i} - {\bar{u}}_{i})}^{2 k_{i} - 1} + c_{i} {(x - \bar{x})}^{2 k_{i} - 1} + \sum_{j \neq i} ϵ_{i j} {(x - \bar{x})}^{2 (k_{i} - 1)} (u_{j} - {\bar{u}}_{j}) \end{matrix}

\begin{matrix} + [{\hat{V}}_{i, x m} (t, m) - \int {\hat{V}}_{i, x m} (t, m) (x) m (d x)] b_{2 i}, \end{matrix}

(A3a)

\begin{matrix} 0 & = {\bar{r}}_{i} {\bar{u}}_{i}^{2 {\bar{k}}_{i} - 1} + {\bar{c}}_{i} {\bar{x}}^{2 {\bar{k}}_{i} - 1} + \sum_{j \neq i} {\bar{ϵ}}_{i j} {\bar{x}}^{2 ({\bar{k}}_{i} - 1)} {\bar{u}}_{j} + [\int {\hat{V}}_{i, x m} (t, m) (x) m (d x)] {\bar{b}}_{2 i} . \end{matrix}

(A3b)

By solving the previously mentioned conditions, one obtains the optimal control input in a closed-loop form. The linear state- and mean-field-type feedback strategy

u_{i} = - η_{i} (x - \int y m (d y)) - {\bar{η}}_{i} \int y m (d y), i \in I

solves the system if the coefficients satisfy

\begin{matrix} i & \in I, \end{matrix}

\begin{matrix} 0 & = - r_{i} η_{i}^{2 k_{i} - 1} - \sum_{j \neq i} ϵ_{i j} η_{j} + b_{2 i} α_{i} + c_{i}, \end{matrix}

(A4a)

\begin{matrix} 0 & = - {\bar{r}}_{i} {\bar{η}}_{i}^{2 {\bar{k}}_{i} - 1} - \sum_{j \neq i} {\bar{ϵ}}_{i j} {\bar{η}}_{j} + {\bar{b}}_{2 i} {\bar{α}}_{i} + {\bar{c}}_{i}, \end{matrix}

(A4b)

The integrand Hamiltonian of i becomes

\begin{matrix} H_{i} & = [q_{i} + r_{i} η_{i}^{2 k_{i}} - 2 k_{i} c_{i} η_{i} + 2 k_{i} \sum_{j \neq i} ϵ_{i j} η_{i} η_{j}] \frac{{(x - \int y m (d y))}^{2 k_{i}}}{2 k_{i}} + 2 k_{i} α_{i} [b_{1} - \sum_{j \in I} b_{2 j} η_{j}] \frac{{(x - \int y m (d y))}^{2 k_{i}}}{2 k_{i}} \\ + 2 k_{i} (2 k_{i} - 1) α_{i} \frac{1}{2} {\tilde{σ}}^{2} \frac{{(x - \int y m (d y))}^{2 k_{i}}}{2 k_{i}} + α_{i} \int_{Θ} [{(1 + \tilde{μ})}^{2 k_{i}} - 1 - 2 k_{i} \tilde{μ}] ν (d θ) \frac{{(x - \int y m (d y))}^{2 k_{i}}}{2 k_{i}} \\ + [{\bar{q}}_{i} + {\bar{r}}_{i} {\bar{η}}_{i}^{2 {\bar{k}}_{i}} - 2 {\bar{k}}_{i} {\bar{c}}_{i} {\bar{η}}_{i}] \frac{{(\int y m (d y))}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}} + [2 {\bar{k}}_{i} \sum_{j \neq i} {\bar{ϵ}}_{i j} {\bar{η}}_{i} {\bar{η}}_{j}] \frac{{(\int y m (d y))}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}} \\ + 2 {\bar{k}}_{i} {\bar{α}}_{i} [{\bar{b}}_{1} - \sum_{j} {\bar{b}}_{2 j} {\bar{η}}_{j}] \frac{{(\int y m (d y))}^{2 {\bar{k}}_{i} - 1}}{2 {\bar{k}}_{i}} + {\tilde{ϵ}}_{2} . \end{matrix}

(A5)

By identification the coefficients

α_{i}

solve the following ordinary differential equation:

\begin{matrix} 0 & = {\dot{α}}_{i} + q_{i} + r_{i} η_{i}^{2 k_{i}} - 2 k_{i} c_{i} η_{i} + 2 k_{i} \sum_{j \neq i} ϵ_{i j} η_{i} η_{j} + 2 k_{i} α_{i} [b_{1} - \sum_{j \in I} b_{2 j} η_{j}] + 2 k_{i} (2 k_{i} - 1) α_{i} \frac{1}{2} {\tilde{σ}}^{2} \end{matrix}

\begin{matrix} + α_{i} \int_{Θ} [{(1 + \tilde{μ})}^{2 k_{i}} - 1 - 2 k_{i} \tilde{μ}] ν (d θ), \end{matrix}

(A6a)

\begin{matrix} α_{i} (T) & = q_{i T}, \end{matrix}

(A6b)

\begin{matrix} 0 & = {\dot{\bar{α}}}_{i} + {\bar{q}}_{i} + {\bar{r}}_{i} {\bar{η}}_{i}^{2 {\bar{k}}_{i}} - 2 {\bar{k}}_{i} {\bar{c}}_{i} {\bar{η}}_{i} + 2 {\bar{k}}_{i} \sum_{j \neq i} {\bar{ϵ}}_{i j} {\bar{η}}_{i} {\bar{η}}_{j} + 2 {\bar{k}}_{i} {\bar{α}}_{i} [{\bar{b}}_{1} - \sum_{j} {\bar{b}}_{2 j} {\bar{η}}_{j}], \end{matrix}

(A6c)

\begin{matrix} {\bar{α}}_{i} (T) & = {\bar{q}}_{i T} . \end{matrix}

(A6d)

The aggregate mean-field term

\int y m (t, d y)

can be derived in a semi-explicit way by taking the expected value of the state dynamics. It follows that

\int y m (t, d y) = [\int y m (0, d y)] e^{\int_{0}^{t} [{\bar{b}}_{1} - \sum_{j} {\bar{b}}_{2 j} {\bar{η}}_{j}] d t} .

This completes the proof. □

Proof of Proposition 2.

For the data in (1), the integrand Hamiltonian

H_{j}^{r}

has a unique minimizer, denoted by

u_{j}^{*} = u_{j}^{*} (t, x, m, {({\hat{V}}_{j^{'}, m}, {\hat{V}}_{j^{'}, x m}, {\hat{V}}_{j^{'}, x x m})}_{j^{'} \in I_{F}}, {(u_{i})}_{i \in I_{L}}),

which provides the reaction strategies of the follower decision-makers. Following (1) with leaders in

I_{L}

and followers in

I_{F},

the first order optimality condition yields

\begin{matrix} j & \in I_{F}, \\ 0 & = r_{j} {(u_{j} - {\bar{u}}_{j})}^{2 k_{j} - 1} + c_{i} {(x - \bar{x})}^{2 k_{i} - 1} + \sum_{j^{'} \in I_{F} \ {j}} ϵ_{j j^{'}} {(x - \bar{x})}^{2 (k_{j} - 1)} (u_{j^{'}} - {\bar{u}}_{j^{'}}) \end{matrix}

\begin{matrix} + \sum_{i \in I_{L}} ϵ_{j i} {(x - \bar{x})}^{2 (k_{j} - 1)} (u_{i} - {\bar{u}}_{i}) + [{\hat{V}}_{j, x m} (t, m) - \int {\hat{V}}_{j, x m} (t, m) (x) m (d x)]] b_{2 j}, \end{matrix}

(A7a)

\begin{matrix} 0 & = {\bar{r}}_{j} {\bar{u}}_{j}^{2 {\bar{k}}_{j} - 1} + {\bar{c}}_{i} {\bar{x}}^{2 {\bar{k}}_{i} - 1} + \sum_{j^{'} \in I_{F} \ {j}} {\bar{ϵ}}_{j j^{'}} {\bar{x}}^{2 ({\bar{k}}_{j} - 1)} {\bar{u}}_{j^{'}} + \sum_{i \in I_{L}} {\bar{ϵ}}_{j i} {\bar{x}}^{2 ({\bar{k}}_{j} - 1)} {\bar{u}}_{i} + [\int {\hat{V}}_{j, x m} (t, m) (x) m (d x)] {\bar{b}}_{2 j}, \end{matrix}

(A7b)

and

\begin{matrix} j & \in I_{F}, \end{matrix}

\begin{matrix} \sum_{i \in I_{L}} ϵ_{j i} η_{i} & = - r_{j} η_{j}^{2 k_{j} - 1} - \sum_{j^{'} \in I_{F} \ {j}} ϵ_{j j^{'}} η_{j^{'}} + b_{2 j} α_{j} + c_{j}, \end{matrix}

(A8a)

\begin{matrix} \sum_{i \in I_{L}} {\bar{ϵ}}_{j i} {\bar{η}}_{i} & = - {\bar{r}}_{j} {\bar{η}}_{j}^{2 {\bar{k}}_{j} - 1} - \sum_{j^{'} \in I_{F} \ {j}} {\bar{ϵ}}_{j j^{'}} {\bar{η}}_{j^{'}} + {\bar{b}}_{2 j} {\bar{α}}_{j} + {\bar{c}}_{j}, \end{matrix}

(A8b)

which provides

{η_{j}, {\bar{η}}_{j}}_{j \in I_{F}}

as function of

{η_{i}, {\bar{η}}_{i}}_{i \in I_{L}}

and

α, \bar{α}

. Following (1) with leaders in

I_{L}

and followers in

I_{F},

the leaders’ integrand Hamiltonian can be rewritten as follows

\begin{matrix} H_{i}^{r} & = inf_{u_{i} \in U_{i}} {l_{i} + b {\hat{V}}_{i, x m}} + \frac{σ^{2}}{2} {\hat{V}}_{i, x x m} + J [{\hat{V}}_{i, m}], \\ = inf_{u_{i} \in U_{i}} q_{i} \frac{{(x - \bar{x})}^{2 k_{i}}}{2 k_{i}} + r_{i} \frac{{(u_{i} - {\bar{u}}_{i})}^{2 k_{i}}}{2 k_{i}} + c_{i} {(x - \bar{x})}^{2 k_{i} - 1} (u_{i} - {\bar{u}}_{i}) \\ + \sum_{i^{'} \in I_{L} \ {i}} ϵ_{i i^{'}} {(x - \bar{x})}^{2 (k_{i} - 1)} (u_{i} - {\bar{u}}_{i}) (u_{i^{'}} - {\bar{u}}_{i^{'}}) + \sum_{j \in I_{F}} ϵ_{i j} {(x - \bar{x})}^{2 (k_{i} - 1)} (u_{i} - {\bar{u}}_{i}) (u_{j}^{*} - {\bar{u}}_{j}^{*}) \\ + {\bar{q}}_{i} \frac{{\bar{x}}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}} + {\bar{r}}_{i} \frac{{\bar{u}}_{i}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}} + {\bar{c}}_{i} {\bar{x}}^{2 {\bar{k}}_{i} - 1} {\bar{u}}_{i} + \sum_{i^{'} \in I_{L} \ {i}} {\bar{ϵ}}_{i i^{'}} {\bar{x}}^{2 ({\bar{k}}_{i} - 1)} {\bar{u}}_{i} {\bar{u}}_{i^{'}} + \sum_{j \in I_{F}} {\bar{ϵ}}_{i j} {\bar{x}}^{2 ({\bar{k}}_{i} - 1)} {\bar{u}}_{i} {\bar{u}}_{j}^{*} \\ + \{b_{1} (x - \bar{x}) + \sum_{i^{'} \in I_{L}} b_{2 i^{'}} (u_{i^{'}} - {\bar{u}}_{i^{'}}) + \sum_{j \in I_{F}} b_{2 j} (u_{j}^{*} - {\bar{u}}_{j}^{*})\} {\hat{V}}_{i, x m} \\ + {{\bar{b}}_{1} \bar{x} + \sum_{i^{'} \in I_{L}} {\bar{b}}_{2 i^{'}} {\bar{u}}_{i^{'}} + \sum_{j \in I_{F}} {\bar{b}}_{2 j} {\bar{u}}_{j}^{*}} {\hat{V}}_{i, x m} + \frac{σ^{2}}{2} {\hat{V}}_{i, x x m} + J [{\hat{V}}_{i, m}] \end{matrix}

In view of (A7),

\{\begin{matrix} \frac{\partial (u_{j}^{*} - {\bar{u}}_{j}^{*})}{\partial (u_{i} - {\bar{u}}_{i})} = - \frac{ϵ_{j i}}{(2 k_{j} - 1) r_{j} η_{j}^{2 k_{j} - 2}}, \\ \frac{\partial {\bar{u}}_{j}^{*}}{\partial {\bar{u}}_{i}} = - \frac{{\bar{ϵ}}_{j i}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j} {\bar{η}}_{j}^{2 {\bar{k}}_{j} - 2}}, \end{matrix}

The optimal Stackelberg strategies of the leaders satisfy the following system:

\begin{matrix} 0 & = r_{i} {(u_{i} - {\bar{u}}_{i})}^{2 k_{i} - 1} + c_{i} {(x - \bar{x})}^{2 k_{i} - 1} + \sum_{i^{'} \in I_{L} \ {i}} ϵ_{i i^{'}} {(x - \bar{x})}^{2 (k_{i} - 1)} (u_{i^{'}} - {\bar{u}}_{i^{'}}) \\ + \sum_{j \in I_{F}} ϵ_{i j} {(x - \bar{x})}^{2 (k_{i} - 1)} (u_{j}^{*} - {\bar{u}}_{j}^{*}) - \sum_{j \in I_{F}} ϵ_{i j} {(x - \bar{x})}^{2 (k_{i} - 1)} (u_{i} - {\bar{u}}_{i}) \frac{ϵ_{j i}}{(2 k_{j} - 1) r_{j} η_{j}^{2 k_{j} - 2}} \\ + [b_{2 i} - \sum_{j \in I_{F}} b_{2 j} \frac{ϵ_{j i}}{(2 k_{j} - 1) r_{j} η_{j}^{2 k_{j} - 2}}] α_{i} {(x - \bar{x})}^{2 k_{i} - 1}, \\ 0 & = {\bar{r}}_{i} {\bar{u}}_{i}^{2 {\bar{k}}_{i} - 1} + {\bar{c}}_{i} {\bar{x}}^{2 {\bar{k}}_{i} - 1} + \sum_{i^{'} \in I_{L} \ {i}} {\bar{ϵ}}_{i i^{'}} {\bar{x}}^{2 ({\bar{k}}_{i} - 1)} {\bar{u}}_{i^{'}} + \sum_{j \in I_{F}} {\bar{ϵ}}_{i j} {\bar{x}}^{2 ({\bar{k}}_{i} - 1)} {\bar{u}}_{j}^{*} \\ - \sum_{j \in I_{F}} {\bar{ϵ}}_{i j} {\bar{x}}^{2 ({\bar{k}}_{i} - 1)} {\bar{u}}_{i} \frac{{\bar{ϵ}}_{j i}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j} {\bar{η}}_{j}^{2 {\bar{k}}_{j} - 2}} + [{\bar{b}}_{2 i} - \sum_{j \in I_{F}} {\bar{b}}_{2 j} \frac{{\bar{ϵ}}_{j i}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j} {\bar{η}}_{j}^{2 {\bar{k}}_{j} - 2}}] {\bar{α}}_{i} {\bar{x}}^{2 {\bar{k}}_{i} - 1}, \end{matrix}

whose solution provides the coefficients

{(η_{i}^{s s}, {\bar{η}}_{i}^{s s})}_{i \in L}

. □

Appendix A.1. I-th Hierarchical Level

Proof of Proposition 3.

We use a backward induction procedure to prove the statement.

When decision-maker I optimizes the preceding decision-makers have already chosen their strategy and that is known by

I .

Hence, integrand Hamiltonian of I is

\begin{matrix} H_{I} & = inf_{u_{I} \in U_{I}} {l_{I} + b {\hat{V}}_{I, x m}} + \frac{σ^{2}}{2} {\hat{V}}_{I, x x m} + J [{\hat{V}}_{I, m}] \\ = inf_{u_{I} \in U_{I}} q_{I} \frac{{(x - \bar{x})}^{2 k_{I}}}{2 k_{I}} + r_{I} \frac{{(u_{I} - {\bar{u}}_{I})}^{2 k_{I}}}{2 k_{I}} + c_{I} {(x - \bar{x})}^{2 k_{I} - 1} (u_{I} - {\bar{u}}_{I}) \\ + \sum_{i^{'} = 1}^{I - 1} ϵ_{I, i^{'}} {(x - \bar{x})}^{2 (k_{I} - 1)} (u_{I} - {\bar{u}}_{I}) (u_{i^{'}} - {\bar{u}}_{i^{'}}) + [b_{1} (x - \bar{x}) + \sum_{i^{'} = 1}^{I - 1} b_{2 i^{'}} (u_{i^{'}} - {\bar{u}}_{i^{'}})] {\hat{V}}_{I, x m} \\ + b_{2, I} (u_{I} - {\bar{u}}_{I}) {\hat{V}}_{I, x m} + {\bar{q}}_{I} \frac{{\bar{x}}^{2 {\bar{k}}_{I}}}{2 {\bar{k}}_{I}} + {\bar{r}}_{I} \frac{{\bar{u}}_{I}^{2 {\bar{k}}_{I}}}{2 {\bar{k}}_{I}} + {\bar{c}}_{I} {\bar{x}}^{2 {\bar{k}}_{I} - 1} {\bar{u}}_{I} + \sum_{i^{'} = 1}^{I - 1} {\bar{ϵ}}_{I, i^{'}} {\bar{x}}^{2 ({\bar{k}}_{I} - 1)} {\bar{u}}_{I} {\bar{u}}_{i^{'}} \\ + [{\bar{b}}_{1} \bar{x} + \sum_{i^{'} = 1}^{I - 1} {\bar{b}}_{2 i^{'}} {\bar{u}}_{i^{'}} + {\bar{b}}_{2, I} {\bar{u}}_{I}] {\hat{V}}_{I - 1, x m} + \frac{σ^{2}}{2} {\hat{V}}_{I, x x m} + J [{\hat{V}}_{I, m}] . \end{matrix}

It follows from strictly convex optimization above that the best response strategy can be expressed as:

\begin{matrix} u_{I}^{*} - {\bar{u}}_{I}^{*} & = - ξ_{1}^{\frac{1}{2 k_{I} - 1}}, \\ {\bar{u}}_{I}^{*} & = - ξ_{2}^{\frac{1}{2 {\bar{k}}_{I} - 1}}, \end{matrix}

where

\begin{matrix} ξ_{1} & = \frac{1}{r_{I}} (\sum_{i^{'} = 1}^{I - 1} ϵ_{I, i^{'}} {(x - \bar{x})}^{2 (k_{I} - 1)} (u_{i^{'}} - {\bar{u}}_{i^{'}}) + b_{2, I} {\hat{V}}_{I, x m} + c_{I} {(x - \bar{x})}^{2 k_{I} - 1}), \\ ξ_{2} & = \frac{1}{{\bar{r}}_{I}} (\sum_{i^{'} = 1}^{I - 1} {\bar{ϵ}}_{I, i^{'}} {\bar{x}}^{2 ({\bar{k}}_{I} - 1)} {\bar{u}}_{i^{'}} + {\bar{b}}_{2, I} {\hat{V}}_{I, x m} + {\bar{c}}_{I} {\bar{x}}^{2 {\bar{k}}_{I} - 1}) . \end{matrix}

In particular,

\begin{matrix} i \leq & I - 1 : \end{matrix}

\begin{matrix} \frac{\partial (u_{I}^{*} - {\bar{u}}_{I}^{*})}{\partial (u_{i} - {\bar{u}}_{i})} = & \frac{ϵ_{I, i}}{(2 k_{I} - 1) r_{I}} {(x - \bar{x})}^{2 (k_{I} - 1)} {(u_{I}^{*} - {\bar{u}}_{I}^{*})}^{- 2 (k_{I} - 1)}, \end{matrix}

(A9a)

\begin{matrix} \frac{\partial {\bar{u}}_{I}^{*}}{\partial {\bar{u}}_{i}} & = \frac{{\bar{ϵ}}_{I, i}}{(2 {\bar{k}}_{I} - 1) {\bar{r}}_{I}} {\bar{x}}^{2 ({\bar{k}}_{I} - 1)} {({\bar{u}}_{I}^{*})}^{- 2 ({\bar{k}}_{I} - 1)} . \end{matrix}

(A9b)

If the preceding decision-makers

{1, 2, \dots, I - 1}

have all used linear state-and-mean-field feedback strategies then the reaction of the I-th decision-maker who is at I-th level of hierarchy can be rewritten as

\begin{matrix} u_{I}^{s e q} & = - η_{I} (x - \int y m (d y)) - {\bar{η}}_{I} \int y m (d y), \\ η_{I} & = {(\frac{- \sum_{j = 1}^{I - 1} ϵ_{I, j} η_{j} + b_{2 I} α_{I} + c_{I}}{r_{I}})}^{\frac{1}{2 k_{I} - 1}}, \\ {\bar{η}}_{I} & = {(\frac{- \sum_{j = 1}^{I - 1} {\bar{ϵ}}_{I, j} {\bar{η}}_{j} + {\bar{b}}_{2 I} {\bar{α}}_{I} + {\bar{c}}_{I}}{{\bar{r}}_{I}})}^{\frac{1}{2 {\bar{k}}_{I} - 1}} . \end{matrix}

□

Appendix A.2. (I − 1)-th Hierarchical Level

At the hierarchical level

I - 1,

the preceding levels are

{1, 2, \dots, I - 2}

and the succeeding level is

I .

Having the expression of the optimal control strategies of the last layer I we can move to the preceding layer, that is,

I - 1 .

Decision-maker

I - 1

has

u_{1}, \dots, u_{I - 2}

and the reaction

u_{I}^{*}

of decision-maker

I .

Therefore, the integrand Hamiltonian of

I - 1

is, given by

\begin{matrix} H_{I - 1}^{r} = inf_{u_{I - 1} \in U_{I - 1}} {l_{i} + b {\hat{V}}_{I - 1, x m}} + \frac{σ^{2}}{2} {\hat{V}}_{I - 1, x x m} + J [{\hat{V}}_{I - 1, m}] \\ = inf_{u_{I - 1} \in U_{I - 1}} r_{I - 1} \frac{{(u_{I - 1} - {\bar{u}}_{I - 1})}^{2 k_{I - 1}}}{2 k_{I - 1}} + q_{I - 1} \frac{{(x - \bar{x})}^{2 k_{I - 1}}}{2 k_{I - 1}} + c_{I - 1} {(x - \bar{x})}^{2 k_{I - 1} - 1} (u_{I - 1} - {\bar{u}}_{I - 1}) \\ + \sum_{i^{'} = 1}^{I - 2} ϵ_{I - 1, i^{'}} {(x - \bar{x})}^{2 (k_{I - 1} - 1)} (u_{I - 1} - {\bar{u}}_{I - 1}) (u_{i^{'}} - {\bar{u}}_{i^{'}}) + ϵ_{I - 1, I} {(x - \bar{x})}^{2 (k_{I - 1} - 1)} (u_{I - 1} - {\bar{u}}_{I - 1}) (u_{I}^{*} - {\bar{u}}_{I}^{*}) \\ + [b_{1} (x - \bar{x}) + \sum_{i^{'} = 1}^{I - 2} b_{2 i^{'}} (u_{i^{'}} - {\bar{u}}_{i^{'}}) + b_{2, I - 1} (u_{I - 1} - {\bar{u}}_{I - 1}) + b_{2 I} (u_{I}^{*} - {\bar{u}}_{I}^{*})] {\hat{V}}_{I - 1, x m} \\ + {\bar{q}}_{I - 1} \frac{{\bar{x}}^{2 {\bar{k}}_{I - 1}}}{2 {\bar{k}}_{I - 1}} + {\bar{r}}_{I - 1} \frac{{\bar{u}}_{I - 1}^{2 {\bar{k}}_{I - 1}}}{2 {\bar{k}}_{I - 1}} + {\bar{c}}_{I - 1} {\bar{x}}^{2 {\bar{k}}_{I - 1} - 1} {\bar{u}}_{I - 1} + \sum_{i^{'} = 1}^{I - 2} {\bar{ϵ}}_{I - 1, i^{'}} {\bar{x}}^{2 ({\bar{k}}_{I - 1} - 1)} {\bar{u}}_{I - 1} {\bar{u}}_{i^{'}} \\ + {\bar{ϵ}}_{I - 1, I} {\bar{x}}^{2 ({\bar{k}}_{I - 1} - 1)} {\bar{u}}_{I - 1} {\bar{u}}_{I}^{*} + {{\bar{b}}_{1} \bar{x} + \sum_{i^{'} = 1}^{I - 2} {\bar{b}}_{2 i^{'}} {\bar{u}}_{i^{'}} + {\bar{b}}_{2, I - 1} {\bar{u}}_{I - 1} + {\bar{b}}_{2 I} {\bar{u}}_{I}^{*}} {\hat{V}}_{I - 1, x m} + \frac{σ^{2}}{2} {\hat{V}}_{I - 1, x x m} + J [{\hat{V}}_{I - 1, m}] \end{matrix}

In view of (A9), the terms with

{\bar{u}}_{I}^{*}

depend on

{\bar{u}}_{I - 1}

,

{\bar{u}}_{I - 2}

, …,

{\bar{u}}_{1} .

The first-order optimality condition for

u_{I - 1}^{*}

yields

\begin{matrix} 0 & = - r_{I - 1} η_{I - 1}^{2 k_{I - 1} - 1} + c_{I - 1} - \sum_{i^{'} = 1}^{I - 2} ϵ_{I - 1, i^{'}} η_{i^{'}} - ϵ_{I - 1, I} η_{I} + ϵ_{I - 1, I} η_{I - 1} \frac{ϵ_{I, I - 1}}{(2 k_{I} - 1) r_{I}} η_{I}^{- 2 (k_{I} - 1)} \\ + \{b_{2, I - 1} - b_{2 I} \frac{ϵ_{I, I - 1}}{(2 k_{I} - 1) r_{I}} η_{I}^{- 2 (k_{I} - 1)}\} α_{I - 1}, \\ 0 & = - {\bar{r}}_{I - 1} {\bar{η}}_{I - 1}^{2 {\bar{k}}_{I - 1} - 1} + {\bar{c}}_{I - 1} - \sum_{i^{'} = 1}^{I - 2} {\bar{ϵ}}_{I - 1, i^{'}} {\bar{η}}_{i^{'}} - {\bar{ϵ}}_{I - 1, I} {\bar{η}}_{I} + {\bar{ϵ}}_{I - 1, I} {\bar{η}}_{I - 1} \frac{{\bar{ϵ}}_{I, I - 1}}{(2 {\bar{k}}_{I} - 1) {\bar{r}}_{I}} {\bar{η}}_{I}^{- 2 ({\bar{k}}_{I} - 1)} \\ + \{{\bar{b}}_{2, I - 1} - {\bar{b}}_{2, I} \frac{{\bar{ϵ}}_{I, I - 1}}{(2 {\bar{k}}_{I} - 1) {\bar{r}}_{I}} {\bar{η}}_{I}^{- 2 ({\bar{k}}_{I} - 1)}\} {\bar{α}}_{I - 1}, \end{matrix}

where we have used (A9) for

i = I - 1

.

u_{I - 1}^{s e q} = - η_{I - 1} (x - \int y m (d y)) - {\bar{η}}_{I - 1} \int y m (d y) .

(A10)

Appendix A.3. i-th Hierarchical Level

For

i \in {2, \dots, I - 2},

\begin{matrix} H_{i}^{r} & = inf_{u_{i} \in U_{i}} q_{i} \frac{{(x - \bar{x})}^{2 k_{i}}}{2 k_{i}} + r_{i} \frac{{(u_{i} - {\bar{u}}_{i})}^{2 k_{i}}}{2 k_{i}} + c_{i} {(x - \bar{x})}^{2 k_{i} - 1} (u_{i} - {\bar{u}}_{i}) \\ + \sum_{i^{'} = 1}^{i - 1} ϵ_{i i^{'}} {(x - \bar{x})}^{2 (k_{i} - 1)} (u_{i} - {\bar{u}}_{i}) (u_{i^{'}} - {\bar{u}}_{i^{'}}) + \sum_{j = i + 1}^{I} ϵ_{i j} {(x - \bar{x})}^{2 (k_{i} - 1)} (u_{i} - {\bar{u}}_{i}) (u_{j}^{*} - {\bar{u}}_{j}^{*}) \\ + [b_{1} (x - \bar{x}) + \sum_{i^{'} = 1}^{i - 1} b_{2 i^{'}} (u_{i^{'}} - {\bar{u}}_{i^{'}}) + b_{2 i} (u_{i} - {\bar{u}}_{i}) + \sum_{j = i + 1}^{I} b_{2 j} (u_{j}^{*} - {\bar{u}}_{j}^{*})] {\hat{V}}_{i, x m} \\ + {\bar{q}}_{i} \frac{{\bar{x}}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}} + {\bar{r}}_{i} \frac{{\bar{u}}_{i}^{2 {\bar{k}}_{i}}}{2 {\bar{k}}_{i}} + {\bar{c}}_{i} {\bar{x}}^{2 {\bar{k}}_{i} - 1} {\bar{u}}_{i} + \sum_{i^{'} = 1}^{i - 1} {\bar{ϵ}}_{i i^{'}} {\bar{x}}^{2 ({\bar{k}}_{i} - 1)} {\bar{u}}_{i} {\bar{u}}_{i^{'}} + \sum_{j = i + 1}^{I} {\bar{ϵ}}_{i j} {\bar{x}}^{2 ({\bar{k}}_{i} - 1)} {\bar{u}}_{i} {\bar{u}}_{j}^{*} \\ + \{{\bar{b}}_{1} \bar{x} + \sum_{i^{'} = 1}^{i - 1} {\bar{b}}_{2 i^{'}} {\bar{u}}_{i^{'}} + {\bar{b}}_{2 i} {\bar{u}}_{i} + \sum_{j = i + 1}^{I} {\bar{b}}_{2 j} {\bar{u}}_{j}^{*}\} {\hat{V}}_{i, x m} + \frac{σ^{2}}{2} {\hat{V}}_{i, x x m} + J [{\hat{V}}_{i, m}] . \end{matrix}

By identification from the first-order optimality condition the coefficient functions

η_{i}, {\bar{η}}_{i}

satisfy the following equations

\begin{matrix} 0 & = - r_{i} η_{i}^{2 k_{i} - 1} + c_{i} - \sum_{i^{'} = 1}^{i - 1} ϵ_{I - 1, i^{'}} η_{i^{'}} - \sum_{j = i + 1}^{I} ϵ_{i, j} η_{j} + \sum_{j = i + 1}^{I} ϵ_{i, j} η_{i} \frac{ϵ_{j i}}{(2 k_{j} - 1) r_{j}} η_{j}^{- 2 (k_{j} - 1)} \\ + \{b_{2 i} - \sum_{j = i + 1}^{I} b_{2 j} \frac{ϵ_{j i}}{(2 k_{j} - 1) r_{j}} η_{j}^{- 2 (k_{j} - 1)}\} α_{i}, \\ 0 & = - {\bar{r}}_{i} {\bar{η}}_{i}^{2 {\bar{k}}_{i} - 1} + {\bar{c}}_{i} - \sum_{i^{'} = 1}^{i - 1} {\bar{ϵ}}_{I - 1, i^{'}} {\bar{η}}_{i^{'}} - \sum_{j = i + 1}^{I} {\bar{ϵ}}_{i, j} {\bar{η}}_{j} + \sum_{j = i + 1}^{I} {\bar{ϵ}}_{i, j} η_{i} \frac{{\bar{ϵ}}_{j i}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j}} {\bar{η}}_{j}^{- 2 ({\bar{k}}_{j} - 1)} \\ + \{{\bar{b}}_{2 i} - \sum_{j = i + 1}^{I} {\bar{b}}_{2 j} \frac{{\bar{ϵ}}_{j i}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j}} {\bar{η}}_{j}^{- 2 ({\bar{k}}_{j} - 1)}\} {\bar{α}}_{i}, \end{matrix}

Appendix A.4. 1-st Hierarchical Level

We now examine first level of the hierarchy. The integrand Hamiltonian of decision-maker 1 is

\begin{matrix} H_{1}^{r} & = inf_{u_{1} \in U_{1}} q_{1} \frac{{(x - \bar{x})}^{2 k_{1}}}{2 k_{1}} + r_{1} \frac{{(u_{1} - {\bar{u}}_{1})}^{2 k_{1}}}{2 k_{1}} + c_{1} {(x - \bar{x})}^{2 k_{1} - 1} (u_{1} - {\bar{u}}_{1}) \\ + \sum_{j = 2}^{I} ϵ_{1 j} {(x - \bar{x})}^{2 (k_{1} - 1)} (u_{1} - {\bar{u}}_{1}) (u_{j}^{*} - {\bar{u}}_{j}^{*}) + \{b_{1} (x - \bar{x}) + b_{21} (u_{1} - {\bar{u}}_{1})\} {\hat{V}}_{1, x m} \\ + \{\sum_{j = 2}^{I} b_{2 j} (u_{j}^{*} - {\bar{u}}_{j}^{*})\} {\hat{V}}_{1, x m} + {\bar{q}}_{1} \frac{{\bar{x}}^{2 {\bar{k}}_{1}}}{2 {\bar{k}}_{1}} + {\bar{r}}_{1} \frac{{\bar{u}}_{1}^{2 {\bar{k}}_{1}}}{2 {\bar{k}}_{1}} + {\bar{c}}_{1} {\bar{x}}^{2 {\bar{k}}_{1} - 1} {\bar{u}}_{1} + \sum_{j = 2}^{I} {\bar{ϵ}}_{1 j} {\bar{x}}^{2 ({\bar{k}}_{1} - 1)} {\bar{u}}_{1} {\bar{u}}_{j}^{*} \\ + \{{\bar{b}}_{1} \bar{x} + {\bar{b}}_{21} {\bar{u}}_{1} + \sum_{j = 2}^{I} {\bar{b}}_{2 j} {\bar{u}}_{j}^{*}\} {\hat{V}}_{1, x m} + \frac{σ^{2}}{2} {\hat{V}}_{1, x x m} + J [{\hat{V}}_{1, m}] . \end{matrix}

By identification from the first-order optimality condition the coefficient functions

η_{1}, {\bar{η}}_{1}

satisfy the following equations

\begin{matrix} 0 & = - r_{1} η_{1}^{2 k_{1} - 1} + c_{1} - \sum_{j = 2}^{I} ϵ_{1 j} η_{j} + \sum_{j = 2}^{I} ϵ_{1 j} η_{1} \frac{ϵ_{j 1}}{(2 k_{j} - 1) r_{j}} η_{j}^{- 2 (k_{j} - 1)} + \{b_{21} - \sum_{j = 2}^{I} b_{2 j} \frac{ϵ_{j 1}}{(2 k_{j} - 1) r_{j}} η_{j}^{- 2 (k_{j} - 1)}\} α_{1}, \\ 0 & = - {\bar{r}}_{1} {\bar{η}}_{1}^{2 {\bar{k}}_{1} - 1} + {\bar{c}}_{1} - \sum_{j = 2}^{I} {\bar{ϵ}}_{1 j} {\bar{η}}_{j} + \sum_{j = 2}^{I} {\bar{ϵ}}_{1 j} {\bar{η}}_{1} \frac{{\bar{ϵ}}_{j 1}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j}} {\bar{η}}_{j}^{- 2 ({\bar{k}}_{j} - 1)} + \{{\bar{b}}_{21} - \sum_{j = 2}^{I} {\bar{b}}_{2 j} \frac{{\bar{ϵ}}_{j 1}}{(2 {\bar{k}}_{j} - 1) {\bar{r}}_{j}} {\bar{η}}_{j}^{- 2 ({\bar{k}}_{j} - 1)}\} {\bar{α}}_{1} . \end{matrix}

Putting all together we arrive at the announced statement. This completes the proof.

References

Stackelberg, H.V. The Theory of the Market Economy; Peacock, A.J.; Hodge, W., Translators; Originally Published as Grundlagen der Theoretischen Volkswirtschaftlehre; Oxford University Press: Oxford, UK, 1948. [Google Scholar]
Simaan, M.; Cruz, J.B. On the Stackelberg strategy in nonzero-sum games. J. Optim. Theory Appl. 1973, 11, 533–555. [Google Scholar] [CrossRef]
Bagchi, A.; Basar, T. Stackelberg strategies in linear-quadratic stochastic differential games. J. Optim. Theory Appl. 1981, 35, 443–464. [Google Scholar] [CrossRef]
Bensoussan, A.; Chen, S.; Sethi, S.P. The maximum principle for global solutions of stochastic Stackelberg differential games. SIAM J. Control Optim. 2015, 53, 1956–1981. [Google Scholar] [CrossRef]
Pan, L.; Yong, J. A differential game with multi-level of hierarchy. J. Math. Anal. Appl. 1991, 161, 522–544. [Google Scholar] [CrossRef] [Green Version]
Simaan, M.; Cruz, J. A Stackelberg solution for games with many players. IEEE Trans. Autom. Control 1973, 18, 322–324. [Google Scholar] [CrossRef]
Cruz, J. Leader-follower strategies for multilevel systems. IEEE Trans. Autom. Control 1978, 23, 244–255. [Google Scholar] [CrossRef] [Green Version]
Gardner, B.; Cruz, J. Feedback Stackelberg strategy for M-level hierarchical games. IEEE Trans. Autom. Control 1978, 23, 489–491. [Google Scholar] [CrossRef]
Basar, T.; Selbuz, H. Closed-loop Stackelberg strategies with applications in the optimal control of multilevel systems. IEEE Trans. Autom. Control 1979, 24, 166–179. [Google Scholar] [CrossRef]
Lin, Y.; Jiang, X.; Zhang, W. An Open-Loop Stackelberg Strategy for the Linear Quadratic Mean-Field Stochastic Differential Game. IEEE Trans. Autom. Control 2019, 64, 97–110. [Google Scholar] [CrossRef]
Du, K.; Wu, Z. Linear-Quadratic Stackelberg Game for Mean-Field Backward Stochastic Differential System and Application. Math. Probl. Eng. 2019, 2019, 1798585. [Google Scholar] [CrossRef] [Green Version]
Moon, J.; Basar, T. Linear-quadratic stochastic differential Stackelberg games with a high population of followers. In Proceedings of the 54th IEEE Conference on Decision Control, Osaka, Japan, 15–18 December 2015; pp. 2270–2275. [Google Scholar]
Bensoussan, A.; Chau, M.H.M.; Yam, S.C.P. Mean-field Stackelberg games: Aggregation of delayed instructions. SIAM J. Control Optim. 2015, 53, 2237–2266. [Google Scholar] [CrossRef] [Green Version]
Bensoussan, A.; Chau, M.; Lai, Y.; Yam, S. Linear-quadratic mean field Stackelberg games with state and control delays. SIAM J. Control Optim. 2017, 55, 2748–2781. [Google Scholar] [CrossRef]
Averboukh, A.Y. Stackelberg solution for first-order mean-field game with a major player. Izv. Inst. Mat. Inform. Udmurt. 2018, 52. [Google Scholar] [CrossRef]
Moon, J.; Basar, T. Linear quadratic mean field Stackelberg differential games. Automatica 2018, 97, 200–213. [Google Scholar] [CrossRef]
Shi, J.; Wang, G.; Xiong, J. Leader-follower stochastic differential game with asymmetric information and applications. Automatica 2016, 63, 60–73. [Google Scholar] [CrossRef] [Green Version]
Nourian, M.; Caines, P.; Malhamé, R.P.; Huang, M. Mean Field LQG Control in Leader-Follower Stochastic Multi-Agent Systems: Likelihood Ratio Based Adaptation. IEEE Trans. Autom. Control 2012, 57, 2801–2816. [Google Scholar] [CrossRef] [Green Version]
Cai, H.; Hu, G. Distributed Tracking Control of an Interconnected Leader-Follower multi-agent System. IEEE Trans. Autom. Control 2017, 62, 3494–3501. [Google Scholar] [CrossRef]
Li, Y.; Shi, D.; Chen, T. False Data Injection Attacks on Networked Control Systems: A Stackelberg Game Analysis. IEEE Trans. Autom. Control 2018, 63, 3503–3509. [Google Scholar] [CrossRef]
Barreiro-Gomez, J.; Ocampo-Martinez, C.; Quijano, N. Partitioning for large-scale systems: A sequential distributed MPC design. In Proceedings of the 20th IFAC World Congress, Toulouse, France, 9–14 July 2017; pp. 8838–8843. [Google Scholar]
Sutter, M.; Rivas, M.F. Leadership, Reward and Punishment in Sequential Public Goods Experiments. In Reward and Punishment in Social Dilemmas; Lange, P.A.V., Rockenbach, B., Yamagishi, T., Eds.; Oxford University Press: Oxford, UK, 2014; pp. 1–39. [Google Scholar]
Andersson, D.; Djehiche, B. A Maximum Principle for SDEs of mean-field-type. Appl. Math. Optim. 2011, 63, 341–356. [Google Scholar] [CrossRef]
Buckdahn, R.; Djehiche, B.; Li, J. A General Stochastic Maximum Principle for SDEs of mean-field-type. Appl. Math. Optim. 2011, 64, 197–216. [Google Scholar] [CrossRef]
Tembine, H. Risk-sensitive mean-field-type games with L^p-norm drifts. Automatica 2015, 59, 224–237. [Google Scholar] [CrossRef]
Tcheukam, A.; Tembine, H. mean-field-type Games for Distributed Power Networks in Presence of Prosumers. In Proceedings of the 2016 28th Chinese Control and Decision Conference (CCDC), Yinchuan, China, 28–30 May 2016; pp. 446–451. [Google Scholar]
Djehiche, B.; Tcheukam, A.; Tembine, H. mean-field-type Games in Engineering. AIMS Electron. Electr. Eng. 2017, 1, 18. [Google Scholar] [CrossRef]
Tembine, H. mean-field-type games. AIMS Math. 2017, 2, 706–735. [Google Scholar] [CrossRef]
Duncan, T.; Tembine, H. Linear-Quadratic mean-field-type Games: A Direct Method. Games 2018, 9, 7. [Google Scholar] [CrossRef] [Green Version]
Barreiro-Gomez, J.; Duncan, T.E.; Tembine, H. Linear-Quadratic mean-field-type Games: Jump-Diffusion Process with Regime Switching. IEEE Trans. Autom. Control 2019, 64, 4329–4336. [Google Scholar] [CrossRef]
Barreiro-Gomez, J.; Duncan, T.E.; Tembine, H. Linear-Quadratic mean-field-type Games with Multiple Input Constraints. IEEE Control Syst. Lett. 2019, 3, 511–516. [Google Scholar] [CrossRef]
Beardsley, X.W.; Field, B.; Xiao, M. Mean-variance-skewness-kurtosis portfolio optimization with return and liquidity. Commun. Math. Financ. 2012, 1, 13–49. [Google Scholar]
Theodossiou, P.; Savva, C.S. Skewness and the Relation Between Risk and Return. Manag. Sci. 2016, 62, 1598–1609. [Google Scholar] [CrossRef] [Green Version]
Sun, J.; Yong, J. Linear Quadratic Stochastic Differential Games: Open-Loop and Closed-Loop Saddle Points. SIAM J. Control. Optim. 2014, 52, 4082–4121. [Google Scholar] [CrossRef] [Green Version]
Bensoussan, A.; Djehiche, B.; Tembine, H.; Yam, S.C.P. mean-field-type Games with Jump and Regime Switching. Dyn. Games Appl. 2020. [Google Scholar] [CrossRef]

Figure 1. Different hierarchical designs and their solution concepts considered in this paper.

Figure 2. Possible combinations in the hierarchical leadership design for two decision-makers. Ordered Bell number

B (2) = 3

.

Figure 2. Possible combinations in the hierarchical leadership design for two decision-makers. Ordered Bell number

B (2) = 3

.

Figure 3. Possible combinations in the hierarchical leadership design for three decision-makers. Ordered Bell number

B (3) = 13

.

Figure 3. Possible combinations in the hierarchical leadership design for three decision-makers. Ordered Bell number

B (3) = 13

.

Figure 4. Number of possible hierarchical structures for given set of decision-makers described by the ordered Bell number

B (I)

.

Figure 4. Number of possible hierarchical structures for given set of decision-makers described by the ordered Bell number

B (I)

.

Figure 5. Evolution of the differential equations

{\dot{\bar{α}}}^{leader / follower}

, and the corresponding initial values for the different number of leaders in the homogeneous scenario.

Figure 5. Evolution of the differential equations

{\dot{\bar{α}}}^{leader / follower}

, and the corresponding initial values for the different number of leaders in the homogeneous scenario.

Figure 6. Evolution of the differential equations

{\dot{\bar{α}}}^{leader / follower}

, and the corresponding initial values for the different number of leaders in the heterogeneous scenario.

Figure 6. Evolution of the differential equations

{\dot{\bar{α}}}^{leader / follower}

, and the corresponding initial values for the different number of leaders in the heterogeneous scenario.

Figure 7. Evolution of the sum of differential equations and the corresponding total cost for the heterogeneous scenario.

Figure 8. Evolution of the differential equations

\sum_{j \in I} {\dot{\bar{α}}}_{j} (t)

, and the corresponding initial values for different hierarchical structures in the heterogeneous scenario.

Figure 8. Evolution of the differential equations

\sum_{j \in I} {\dot{\bar{α}}}_{j} (t)

, and the corresponding initial values for different hierarchical structures in the heterogeneous scenario.

Table 1. Summary of

{\bar{α}}_{0}^{leader}

,

{\bar{α}}_{0}^{follower}

, and

S (| I_{L} |, δ_{x_{0}})

for the different number of leaders in the homogeneous scenario. bold—significant difference.

Table 1. Summary of

{\bar{α}}_{0}^{leader}

,

{\bar{α}}_{0}^{follower}

, and

S (| I_{L} |, δ_{x_{0}})

for the different number of leaders in the homogeneous scenario. bold—significant difference.

Leader(s)-Follower(s) Structure
Individual leader cost	3.132	3.37	9.772	3.107	2.968
Individual follower cost	1.217	0.2931	0.3481	2.933	3.562
Total cost	9.219	7.911	30.36	18.29	18.4

Table 2. Summary of

{\bar{α}}_{0}^{leader}

,

{\bar{α}}_{0}^{follower}

, and

S (| I_{L} |, δ_{x_{0}})

for the different number of leaders in the heterogeneous scenario. bold—significant difference.

Table 2. Summary of

{\bar{α}}_{0}^{leader}

,

{\bar{α}}_{0}^{follower}

, and

S (| I_{L} |, δ_{x_{0}})

for the different number of leaders in the heterogeneous scenario. bold—significant difference.

Leader(s)-Follower(s) Structure
Leaders	${1} {2}$	${1} {3}$	${2} {3}$	${1}$	${2}$	${3}$
Followers	${3}$	${2}$	${1}$	${2} {3}$	${1} {3}$	${1} {2}$
Total cost	17.14	16.96	16.99	17.04	17.13	16.92

Table 3. Total cost for the different hierarchical orders in a three-player case in the heterogeneous scenario. bold—significant difference.

Hierarchical Structure
Combination label	1	2	3	4	5	6
Hierarchical order	${1} {2} {3}$	${1} {3} {2}$	${2} {1} {3}$	${2} {3} {1}$	${3} {1} {2}$	${3} {2} {1}$
Total cost	6.124	7.464	5.864	8.757	6.894	8.433

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

El Oula Frihi, Z.; Barreiro-Gomez, J.; Eddine Choutri, S.; Tembine, H. Hierarchical Structures and Leadership Design in Mean-Field-Type Games with Polynomial Cost. Games 2020, 11, 30. https://doi.org/10.3390/g11030030

AMA Style

El Oula Frihi Z, Barreiro-Gomez J, Eddine Choutri S, Tembine H. Hierarchical Structures and Leadership Design in Mean-Field-Type Games with Polynomial Cost. Games. 2020; 11(3):30. https://doi.org/10.3390/g11030030

Chicago/Turabian Style

El Oula Frihi, Zahrate, Julian Barreiro-Gomez, Salah Eddine Choutri, and Hamidou Tembine. 2020. "Hierarchical Structures and Leadership Design in Mean-Field-Type Games with Polynomial Cost" Games 11, no. 3: 30. https://doi.org/10.3390/g11030030

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Hierarchical Structures and Leadership Design in Mean-Field-Type Games with Polynomial Cost

Abstract

1. Introduction

2. The Setup

2.1. Games with Polynomial Cost

2.2. Hierarchical Leader Design and Algorithmic Approach

3. Nash Mean-Field-Type Equilibrium

4. Multiple Leaders and Multiple Followers

4.1. No Control-Coupling within Classes

4.1.1. No Leader and All Followers

4.1.2. One Leader and Multiple Followers

4.1.3. Multiple Leaders and One Follower

4.1.4. All Leaders and No Follower

5. Fully Hierarchical Game

6. Numerical Investigation

6.1. Effect of the Number of Leaders on the Total Cost

6.1.1. Uniform Coupling and Homogeneous Players

6.1.2. Uniform Coupling and Heterogeneous Players

6.2. Impact of the Hierarchical Structures

7. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A

Appendix A.1. I-th Hierarchical Level

Appendix A.2. (I − 1)-th Hierarchical Level

Appendix A.3. i-th Hierarchical Level

Appendix A.4. 1-st Hierarchical Level

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI