Ultra-Dense Uplink UAV Lossy Communications: Trajectory Optimization Based on Mean Field Game

Yibo Ma; Shen Qian

doi:10.3390/electronics14112219

and

¹

Electronic Information Science and Technology, Northwest University, Xi’an 710127, China

²

Department of Information Systems, Faculty of Informatics, Tokyo City University, Kanagawa 224-8551, Japan

^*

Author to whom correspondence should be addressed.

Electronics2025, 14(11), 2219;https://doi.org/10.3390/electronics14112219

This article belongs to the Special Issue Innovations in Radio Frequency Technologies, Wireless Communication, and Signal Processing

Version Notes

Order Reprints

Abstract

This paper investigates a multiple unmanned aerial vehicle (UAV) enabled network for supporting emergency communication services, where each drone acts as a base station (also called the drone small cell (DSC)). The novelty of this paper is that a mean field game (MFG)-based strategy is conceived for jointly controlling the three-dimensional (3D) locations of these drones to guarantee the distortion requirement of lossy communications, while considering the inter-cell interference and the flight energy consumption of drones. More explicitly, we derive the Hamilton–Jacobi–Bellman (HJB) and Fokker–Planck–Kolmogorov (FPK) equations, and propose an algorithm where both the Lax–Friedrichs scheme and the Lagrange relaxation are invoked for solving the HJB and FPK equations with 3D control vectors and state vectors. The numerical results show that the proposed algorithm can achieve a higher access rate with a similar flight energy consumption.

Keywords:

unmanned aerial vehicle; ultra-dense communications; trajectory optimization; mean field game; energy efficiency

1. Introduction

With the rapid development of Internet-of-Things (IoT), it has brought severe challenges to the design of mobile wireless networks, mainly to provide high data rates and extremely low latency [1]. However, terrestrial base stations may not be available when destroyed by disasters, and cannot support the communication demands in emergency scenarios.

Due to the flexible maneuverability, easy deployment, low cost and miniaturization of unmanned aerial vehicle (UAV), it has been used in many emergency communication scenarios caused by disasters [2,3]. As one of the supporting technologies of sixth-generation (6G) wireless communication systems, the deployment of aerial base stations is also an efficient way to enhance wireless communication services. With the reduction in UAV cost, the scale of the UAV communication network can be significantly extended by introducing multiple UAVs to provide service [4]. UAVs can construct an aerial UAV swarm network through flexible networking, supplementing to the existing network architecture for wireless information transmission, which can realize rapid movement of the wireless network coverage area.

When the number of users increases in 6G networks, it is very difficult to provide satisfactory service for users [5]. Therefore, UAV communication networks will undoubtedly have to face the challenges of dense deployment scenarios [6]. A typical scenario of ultra-dense UAV communications is illustrated in Figure 1, where ground base stations are not deployed or unavailable in this area. People may encounter this situation when disasters destroy the base stations [7]. To satisfy the users’ communication demands, many UAVs carry communication devices to provide connection services to the ultra-dense users.

Figure 1. A basic scenario of ultra-dense UAV communications.

In the scenario where dense communication links coexist, mutual interference is a problem that network operators must face. How to achieve interference control in ultra-dense networks in a communication environment with restricted or more complicated channel conditions is still a research direction for academia and industry. By deploying UAVs at suitable positions, it can reduce the interference from the users in other cells, while maintaining a good coverage for the users in its cell.

In extreme cases, if the interference is too severe for the link to transmit information losslessly, the received information inevitably contains distortions. The conventional communication systems will discard the error-corrupted information since the lossy recovery cannot be further utilized. Nonetheless, artificial intelligence (AI)-enabled 6G networks have the capability to exploit the useful information from lossy recoveries. In task-oriented communication scenarios such as IoT [8], a certain degree of distortion is acceptable provided that the final decision is still correct. Furthermore, 6G networks may intentionally perform lossy communications, i.e., semantic communications [9,10,11,12,13] emphasize the reconstructed information having the same “meaning” rather than “bit sequences” as the original information. Thus, lossy communications have a bright future in the 6G era.

There is already some research focusing on UAV lossy communications [14,15,16]. The work in [14] proposes an optimization method for minimizing the age of information in UAV communications. The authors in [15] analyze the lossy communication performance of cooperative UAV networks. The work in [16] focuses on the adaptive communication protocol for transmitting critical video data by lossy compression. However, the scenario of these studies contains a limited number of UAVs and users. Although [17] investigates data sharing of a large-scale UAV swarm in lossy communication environments, the system objective is to reliably exchange information based on consensus algorithm instead of exploiting the lossy information. Therefore, an ultra-dense UAV network adopting lossy communications remains to be investigated. The key problem for optimizing the ultra-dense UAV network is the computing complexity of the algorithm. Especially in dynamic environment, UAVs face complex optimization constraints [18]. In ultra-dense networks, conventional optimization approaches face a huge amount of parameters to be optimized, which require incredible computing complexity. It is extremely hard to solve an optimization problem with a huge amount of parameters within a certain time. To solve this problem, mean field game (MFG) is a state-of-the-art tool which can significantly alleviate the curse of dimensionality. The so-called mean field theory is simply to average the effect of the environment on the object, by collectively processing the influence of the surrounding objects on the target, using the global average effect result to replace the effect caused by a large number of monomers. In the MFG, for a typical individual, the game with all other individuals is simplified to a game with a mean field. MFG has brought the possibility of modeling and solving such a dense network distributed by game strategies.

In recent years, MFG has gradually been used in communication scenarios [19,20]. The authors in [21] discussed the downlink interference management in dense UAVs networks using MFG theory, which modeled the interference control problem as an altitude control problem. The MFG was used to obtain the optimal altitude control strategy. In [22], a joint channel access and power control optimization problem was solved by formulating a multiple MFG for large-scale UAV networks. In [23], the authors studied a prediction-based charging policy and interference mitigation approach in wireless powered IoT networks. In these networks, it modeled the interference mitigation problem as an MFG system, where the drone powered the sensors through the appropriate path. The authors in [24] combined MFG with multi-agent deep reinforcement learning for resource allocation of UAV-assisted multi-access edge computing networks.

In summary, the numbers of UAVs and users were quite limited in many scenarios, which hardly satisfied the needs of ultra-dense scenes. Although there were some works adopting the MFG in drone communication to deal with the interference management and deployment problem, the system is designed for lossless communications, which do not fit the AI-enabled scenario in 6G.

Motivated by the aforementioned facts, this paper considers a large number of UAVs that serve multiple users in lossy communication networks, and proposes a 3D distributed dynamic flight strategy. In order to solve the problem of mutual interference minimization in the dense network of drones, we will deeply study the system optimization problem of the ultra-dense network composed of drone base stations and a large number of users. The MFG framework is used to find the best 3D position solution for drones. The main idea of the proposed algorithm is to average the interference from the mass of all users as a mean field, which significantly reduces the computing complexity for solving the optimization problem.

The main contributions of this paper are summarized as follows:

An MFG framework for dynamic emergency communication networks: We propose an MFG framework for dynamic communication networks. The framework contains a small number of base stations and a large number of UAVs and users. Among them, the UAVs assist the base stations to provide services for the users.
Energy consumption and location problem formulation: With the help of the proposed MFG framework, we optimize the trajectory of the UAV, and design the corresponding cost function. We formulate this problem as the problem of cost minimization, but the constraints of energy consumption and penalty must be considered.
Constructing a robust MFG: Considering the time-varying problem of the channel, a robust mean field framework is designed to solve the trajectory optimization problem.
Equilibrium solution of MFG: We obtain the equilibrium solution for ultra-dense uplink UAV lossy communications by alternately solving the Hamilton-Jacobi-Bellman (HJB) and Fokker-Planck-Kolmogorov (FPK) functions of MFG.

The rest of the paper is organized as follows. In Section 2, we introduce the system model and the related assumptions including the application scenarios, the channel model, the flight energy model as well as the cost function. In Section 3, the stochastic differential game and the MFG system framework are presented. In Section 4, the presented framework is derived and the iteration equations of UAV control and state are obtained. The simulation results and analyses are shown in Section 5. The conclusions are drawn in Section 6.

2. System Model

In this section, we introduce the system model of ultra-dense uplink UAV communications. Section 2.1 mathematically describes the basic scenario considered in this paper. Section 2.2 characterizes the channel model for UAV communications. Section 2.3 presents the cost function of each UAV based on the user interference and the flight energy consumption.

2.1. Basic Scenario

We consider the basic scenario illustrated in Figure 1, where the users stay in a large square region with a side length of R. To formulate the optimization problem, we make the following assumptions.

(1) The locations of the users are assumed to be independently and randomly distributed according to the Poisson point process (PPP) [25].

(2) We assume that the UAVs follow a uniform distribution in the horizontal plane, while all of the UAVs are at the same altitude. The density of users is denoted as

μ

, and hence, the number of users can be expressed as

Q = μ A

, where A is the area of the whole region. Moreover, the set of users is denoted as

Q

.

(3) We further assume that the positions of users are constant or changing slowly during the service time

[0, T]

. Thus, the number of users in the responsible area of each UAV remains constant, and each UAV only serves the users within its responsible area.

In the considered scenario, N UAVs form a set

N

and share the same time-frequency channel for receiving uplink data from the users assigned to them. The transmit power is denoted as

P_{t}

, and the users request access continuously during the time interval. Under this circumstance, we have to consider the interference from other users when optimizing the UAV positions.

Figure 2 depicts the cell served by a UAV

i \in N

. If we can guarantee the access quality of the farthest user, the access quality can also be satisfied for the other users. Without loss of generality, we assume that the user j is the farthest user in the cell. Therefore, the user j is located at the boundary of the optimal coverage area of the UAV i. The UAV i is located at the attitude

h_{i}

, and the radius of the coverage area is

r_{i}

. For simplicity, we neglect the altitude of the user and the antenna heights. Then, the distance between the UAV i and the boundary of the coverage area is

d_{i} = \sqrt{r_{i}^{2} + h_{i}^{2}}

, and the corresponding elevation angle is

θ_{i} = arctan (h_{i} / r_{i})

. Furthermore, we denote the range of the UAV flight altitude and the maximum UAV speed as

\tilde{H} = [h_{min}, h_{max}]

and

v_{max}

, respectively.

Figure 2. The coverage model of the UAV i.

2.2. Channel Model

In this paper, the air-to-ground (A2G) channel is assumed to follow the probabilistic line-of-sight (LoS) model, where the channels between the UAVs and the users are of either the LoS or of the non-line-of-sight (NLoS) nature and their occurrence probabilities are determined by the elevation angle of the transmission link [26]. Moreover, the multiple reflected signals which cause the multi-path fading [27] are also taken into consideration.

The path-loss between the UAV i and a user k for LoS and NLoS links is denoted as

P L_{LoS, i k}

and

P L_{NLoS, i k}

, respectively. The path-loss in dB can be expressed by [26]

\begin{matrix} P L_{LoS, i k}^{dB} & = 20 log (\frac{4 π f_{c} d_{i k}}{c}) + ζ_{LoS}, \end{matrix}

(1)

\begin{matrix} P L_{NLoS, i k}^{dB} & = 20 log (\frac{4 π f_{c} d_{i k}}{c}) + ζ_{NLoS}, \end{matrix}

(2)

where

d_{i k}

is the distance between the UAV i and the user k.

ζ_{LoS}

and

ζ_{NLoS}

represent the free space propagation loss of LoS and NLoS links, respectively, which depends on the environmental conditions.

f_{c}

represents the carrier frequency, and c stands for the speed of light. Moreover, the probability of LoS links is given by [27]

P_{LoS, i k} = \frac{1}{1 + α exp [- β (θ_{i k} - α)]},

(3)

where

α

and

β

are the constants which depend on the environment, the density and height of buildings, and the elevation angle.

θ_{i k}

is the elevation angle from the user k to the UAV i. Hence, the probability of NLoS links is

P_{NLoS, i k} = 1 - P_{LoS, i k}

. Then, the path-loss function can be represented as

P L_{i k}^{dB} = P_{LoS, i k} \cdot P L_{LoS, i k}^{dB} + P_{NLoS, i k} \cdot P L_{NLoS, i k}^{dB} .

(4)

By combining (1)–(4), the path-loss function can be rewritten as

\begin{matrix} P L_{i k}^{dB} & = \frac{ζ_{LoS} - ζ_{NLoS}}{1 + α \exp [- β (θ_{i k} - α)]} + 20 log (\frac{4 π f_{c} d_{i k}}{c}) + ζ_{NLoS}, \end{matrix}

(5)

which is a function of

θ_{i k}

and

d_{i k}

. Specifically, for the user located at the boundary of the coverage area, the optimal angle

θ_{opt}

equals

20 . 34^{\circ}

,

42 . 44^{\circ}

,

54 . 62^{\circ}

, and

75 . 52^{\circ}

for the suburban, urban, dense urban and high-rise urban environments, respectively [28]. Therefore, if the value of the path-loss is given, the optimal UAV attitude and coverage radius pair

(h, r)

can be obtained, and vice versa. Since MFG averages the mass of users’ interference, including the small-scale fading, this paper simplifies the small-scale fading by utilizing MFG.

2.3. Cost Function

For achieving higher energy efficiency while guaranteeing successful access for each user, the UAV should adjust its position jointly based on the user interference and its flight energy consumption.

The quality of user access is characterized by the distortion of the recovered information. According to the rate-distortion theory [29], the minimum rate for a Bern(0.5) source to satisfy the distortion requirement

D_{i}

for the i-th UAV is given by

\begin{matrix} R_{i} (D_{i}) = 1 - H_{b} (D_{i}), \end{matrix}

(6)

where

H_{b} (D_{i}) = - D_{i} {log}_{2} D_{i} - (1 - D_{i}) {log}_{2} (1 - D_{i})

denotes the binary entropy function.

Based on Shannon’s lossy source-channel separation theorem, the rate is constrained by the signal-to-interference-plus-noise ratio (SINR) of the received signals as

\begin{matrix} R_{i} (D_{i}) \leq \frac{C (γ_{i})}{R_{c, i}}, \end{matrix}

(7)

where

R_{c, i}

is the end-to-end coding rate,

γ_{i}

represents the SINR of the received signals, and

C (γ_{i}) = {log}_{2} (1 + γ_{i})

denotes the channel capacity with two-dimensional signalling. Therefore, the required SINR threshold

γ_{t h, i}

can be obtained as

\begin{matrix} γ_{t h, i} = C^{- 1} [R_{c, i} R_{i} (D_{i})], \end{matrix}

(8)

with

C^{- 1} (\cdot)

denoting the inverse function of

C (\cdot)

. For simplicity, this paper considers the case that all distortion requirements are the same, and hence, the SINR thresholds are also the same for all links, i.e.,

γ_{t h, i} = γ_{t h}

.

At the moment t, the external interference

I_{out, i} (t)

for the UAV i is caused by those users assigned to other UAVs, which can be formulated as

I_{out, i} (t) = \sum_{k \in Q ∖ Q_{in, i}} P_{t} G_{i k} (t),

(9)

where

Q_{in, i}

stands for the set of internal users served by the UAV i.

Q ∖ Q_{in, i}

denotes the complement set of

Q_{in, i}

in

Q

.

G_{i k} (t)

represents the geometric gain, which is given by

\begin{matrix} G_{i k} (t) = 10^{- P L_{i k}^{dB} (t) / 10} . \end{matrix}

(10)

For the internal user j located at the boundary of the coverage area, the internal interference

I_{in, i} (t)

from the other users in the responsible area of the UAV i is

I_{in, i} (t) = \sum_{k \in Q_{in, i} ∖ j} P_{t} G_{i k} (t) .

(11)

Consequently, the SINR from the farthest internal user to the UAV i is readily given by

γ_{i} (t) = \frac{P_{t} G_{i} (t)}{a_{in} I_{in, i} (t) + a_{out} I_{out, i} (t) + δ^{2}},

(12)

where

δ^{2}

is the power of the Gaussian white noise.

G_{i} (t)

is the geometric gain from the boundary of the coverage area to the UAV i.

a_{in}

and

a_{out}

are the weights distinguishing the internal and external interference, respectively.

By minimizing the gap between the required SINR threshold

γ_{t h}

and the SINR

γ_{i} (t)

of the signal received from the farthest internal user, the energy efficiency is optimized while guaranteeing the quality of user access.

Meanwhile, it is necessary to consider the flight energy consumption when planning the optimal UAV trajectory. The accumulative flight energy consumption of the UAV i at time t is given by [30]

\begin{matrix} E_{f, i} (t) & = D_{i} (t) [P_{I} {(\sqrt{1 + \frac{V_{f}^{4}}{4 V_{0}^{4}}} - \frac{V_{f}^{2}}{2 V_{0}^{4}})}^{\frac{1}{2}} + P_{B} (1 + \frac{3 V_{f}^{2}}{V_{tip}^{2}}) + \frac{1}{2} d_{f} ρ η A_{f} V_{f}^{3}], \end{matrix}

(13)

where

D_{i} (t)

stands for the flight distances.

V_{0}

,

V_{tip}

, and

V_{f}

represent the rotor induced speed when hovering, the speed of the blade tip, and the flight speed of the UAV, respectively.

η

and

ρ

are the rotor solidity and the air density.

d_{f}

and

A_{f}

represent the fuselage drag ratio and the total area of rotary wing, respectively. The constant parameters

P_{I}

and

P_{B}

denote the induced power and the blade profile power, respectively. For simplicity, we assume that each time unit is short enough, and hence, the acceleration of UAV is neglected, which means

V_{f}

is a constant.

For enhancing the energy efficiency, the optimization problem can be formulated as

\begin{matrix} P 1 : & min E_{f, i} (t) \end{matrix}

(14)

\begin{matrix} s . t . γ_{i} (t) \geq γ_{t h} . \end{matrix}

(15)

The challenge for solving

P 1

is that the received power in the target and the interference to others are coupled. Increasing the received power and SINR of one UAV results in the SINR reduction of other UAVs. To solve the problem,

P 1

is transformed to a problem of minimizing the cost of the system, i.e.,

\begin{matrix} P 2 : & min w_{1} {[γ_{i} (t) - γ_{t h}]}^{2} + w_{2} E_{f, i} (t), \end{matrix}

(16)

where

w_{1}

and

w_{2}

are the weight coefficients for the quality of user access and the energy consumption, respectively.

Moreover, two punishment functions are formulated to balance the locations between the UAV and the mass consisting of all users. One of the punishment functions is to restrain the overlap and the energy consumption caused by the height variation, which is expressed as

P_{i (1)} (t) = {(A_{C, i} (t) - \frac{A}{N})}^{2},

(17)

where

A_{C, i} (t)

is the coverage area of UAV i at time t. Clearly,

A_{C, i} (t) = π \cdot r_{i}^{2} (t)

. Another punishment function is caused by the horizontal movement of the UAV. In order to satisfy the SINR requirement

γ_{i} (t) \geq γ_{t h}

, the UAV i will try to increase the distance from the mass. Therefore, this penalty term

P_{i (2)} (t) = D_{i} (t)

is utilized to restrain the drop-out of the mass, where

D_{i} (t)

represents the distance between the UAV i and the mass at time t. Finally, the cost function of the UAV i is given by

\begin{matrix} C_{i} (t) & = w_{1} {[γ_{i} (t) - γ_{t h}]}^{2} + w_{2} E_{f, i} (t) + κ_{1} P_{i (1)} (t) + κ_{2} P_{i (2)} (t), \end{matrix}

(18)

where

κ_{1}

and

κ_{2}

are the coefficients of two punishment terms. The optimization problem is rewritten as

\begin{matrix} P 3 : & min C_{i} (t) . \end{matrix}

(19)

3. MFG for Trajectory Optimization

In this section, we establish the MFG framework of trajectory optimization for ultra-dense uplink UAV communications. To begin with, the control set and the state set of the UAVs are described in detail in Section 3.1. Then, to address a large number of distributed control of the UAVs, the MFG framework is proposed in Section 3.2.

3.1. Basic Elements in Differential Game

The problem of trajectory optimization for multiple UAV communications can be modeled as a differential game. In this game, we define the control set of the UAV i as

U_{i}

, which is the set of possible 3D motion velocity vectors. The velocity vector of the UAV i at time t is given by

u_{i} (t) = \{x_{i} (t), y_{i} (t), z_{i} (t)\} \in U_{i}

, where

x_{i} (t)

,

y_{i} (t)

and

z_{i} (t)

represent the velocity vector in transverse, longitudinal and vertical directions, respectively. The state set of the UAV i is formulated as

S_{i}

, which is the set of 3D positions. Then, the state of the UAV i at time t is expressed as

s_{i} (t) = {X_{i} (t), Y_{i} (t), h_{i} (t)}

. Based on the control and state mentioned above, the position evolution of the UAV i can be written as

\begin{matrix} d s_{i} (t) = u_{i} (t) d t + σ d W (t), \end{matrix}

(20)

where

σ

and

W (t)

are the volatility and the Brownian motion, respectively. The Brownian motion

W (t)

is a random 3D motion, which can be expressed as

W (t) = {W_{x} (t), W_{y} (t), W_{z} (t)}

. Clearly, from (20), we have

\begin{matrix} d X_{i} (t) & = x_{i} (t) d t + σ d W_{x} (t), \end{matrix}

(21)

\begin{matrix} d Y_{i} (t) & = y_{i} (t) d t + σ d W_{y} (t), \end{matrix}

(22)

\begin{matrix} d h_{i} (t) & = z_{i} (t) d t + σ d W_{z} (t) . \end{matrix}

(23)

In order to optimize the UAV trajectory, the problem can be equivalently solved by minimizing the cost function. Specifically, during the time interval

[0, T]

, the UAV i obtains the optimal control strategy

u_{i}^{*} (t)

by minimizing the cost function (18), i.e.,

u_{i}^{*} (t) = \arg \min_{u_{i} (t) \in U_{i}} E [\int_{0}^{T} C_{i} (t) d t + C_{i} (T)] .

(24)

Then, the value function

v_{i} (t)

indicating the minimum cost of the given dynamic system is defined as

\begin{matrix} v_{i} (t, s_{i} (t)) & = \min_{u_{i} (t) \in U_{i}} E [\int_{t}^{T} C_{i} (t) d t + v_{i} (T, s_{i} (T))], t \in [0, T], \end{matrix}

(25)

where

v_{i} (T, s_{i} (T))

represents the value function at time

t = T

. This value function should satisfy a partial differential equation, called the HJB equation [31], which is defined as

\begin{matrix} min_{u_{i} (t) \in U_{i}} [C_{i} (t, u_{i} (t), s_{i} (t)) - u_{i} (t) \cdot \frac{\partial v_{i} (t, s (t))}{\partial s}] + \frac{\partial v_{i} (t, s (t))}{\partial t} = 0, \end{matrix}

(26)

where the first term is known as the Hamiltonian, i.e.,

\begin{matrix} H (C, \frac{\partial v_{i} (t, s_{i} (t))}{\partial s}) & = min_{u_{i} (t) \in U_{i}} [C_{i} (t, u_{i} (t), s_{i} (t)) - u_{i} (t) \cdot \frac{\partial v_{i} (t, s_{i} (t))}{\partial s}] \\ = min_{u_{i} (t) \in U_{i}} [w_{1} {(γ_{i} (t) - γ_{t h})}^{2} + w_{2} E_{f, i} (t) + κ_{1} P_{i (1)} (t) \\ + κ_{2} P_{i (2)} (t) - u_{i} (t) \cdot \frac{\partial v_{i} (t, s_{i} (t))}{\partial s}] . \end{matrix}

(27)

Proof.

Refer to Appendix A. □

All order derivatives of the Hamiltonian

H (C, \frac{\partial v_{i} (t, s_{i} (t))}{\partial s})

exist due to the continuity of the cost function

C_{i} (t)

. Therefore, the Hamiltonian

H (C, \frac{\partial v_{i} (t, s_{i} (t))}{\partial s})

is smooth, which means that a solution to the HJB Equation (26) exists. By solving the HJB equation, we can obtain the equilibrium solution of the differential game. Consequently, the existence of Nash equilibrium for the differential game is proved.

Due to a large number of UAVs, N partial differential equations need to be solved to obtain equilibrium for N players’ game, which is unrealistic. Therefore, the MFG framework is proposed to significantly simplify the system based on the mean field term and two sets of equations.

3.2. MFG Framework

Hereafter, we introduce the MFG framework for the movement of N UAVs, which is composed of two coupled HJB and FPK [32] equations. We can obtain the optimal control of each UAV by solving the backward equation (HJB equation), while the forward equation (FPK equation) describes the evolution of the mass, which represents the evolution of the mean field.

In the MFG framework, each UAV would take rational control individually to minimize the cost function. Given a sufficiently large N, the continuity of the mean field can be guaranteed. Furthermore, the players just need to interact with the mean field, and the permutation of the states among the players would not affect the outcome of the game. The mean field term is defined as

m (t, s)

, which is the state probability distribution of N UAVs as follows.

Definition 1.

With the given state

s_{i} (t) = \{X_{i} (t), Y_{i} (t), h_{i} (t)\}

, the mean field term, called the normalized density function, is given by

m (t, s) = lim_{N \to \infty} \frac{1}{N} \sum_{i = 1}^{N} 1_{{s_{i} (t) = s}},

(28)

where

1

means that it will return 1 when

s_{i} (t) = s

, and otherwise, it returns 0.

To implement the MFG framework, it is necessary to transform the cost function of the general differential game to that of the MFG. Now, we derive the cost function under the MFG framework corresponding to (18). The first term in (18) represents the satisfaction of user access, which is constituted of the difference between the SINR of received signals and the SINR threshold. Thus, the SINR in (12) can be rewritten for MFG as:

{\bar{γ}}_{i} (t) = \frac{P_{t} G_{i} (t)}{a_{in} {\bar{I}}_{in, i} (t) + a_{out} {\bar{I}}_{out, i} (t) + δ^{2}},

(29)

where the

{\bar{I}}_{in, i} (t)

and

{\bar{I}}_{out, i} (t)

are the mean field interferences, which can be expressed as:

\begin{matrix} {\bar{I}}_{in, i} (t) & = P_{t} \int_{S} m (t, s) {\bar{G}}_{in, i} (t) d s, \end{matrix}

(30)

\begin{matrix} {\bar{I}}_{out, i} (t) & = P_{t} \int_{S} m (t, s) {\bar{G}}_{out, i} (t) d s . \end{matrix}

(31)

{\bar{G}}_{in, i} (t)

and

{\bar{G}}_{out, i} (t)

are the mean field geometric gains of the internal users and the external users, respectively.

Then, by substituting (29) into (18), the mean field cost function can be defined as

\begin{matrix} {\bar{C}}_{i} (t, m (t, s)) & = w_{1} {[{\bar{γ}}_{i} (t) - γ_{t h}]}^{2} + w_{2} E_{f, i} (t) + κ_{1} P_{i (1)} (t) + κ_{2} P_{i (2)} (t) . \end{matrix}

(32)

Based on the mean field cost function mentioned above, we can define the HJB equation of MFG as follows.

Definition 2.

The HJB equation of MFG can be expressed as

\frac{\partial v (t, s)}{\partial t} + min_{u (t, s) \in U} [\bar{C} (t, m (t, s)) - u (t) \cdot \frac{\partial v (t, s)}{\partial s}] = 0 .

(33)

Moreover, the Hamiltonian can be expressed as

\begin{matrix} H (\bar{C}, \frac{\partial v (t, s)}{\partial s}) = min_{u (t, s) \in U} [\bar{C} (t, m (t, s)) - u (t) \cdot \frac{\partial v (t, s)}{\partial s}] . \end{matrix}

(34)

The HJB equation is solved inversely in the time domain from

t = T

to

t = 0

, which can calculate the optimal control of each UAV. Then, we derive the FPK equation of the mean field framework defined above as:

\frac{\partial m (t, s)}{\partial t} - \frac{\partial}{\partial s} [m (t, s) \cdot u (t)] = 0,

(35)

which describes the evolution of the mass and evolves forward with time. In this MFG framework, the HJB and FPK equations evolve interactively and finally reach the mean field equilibrium (MFE).

4. Energy-Efficient Flight Strategy

In this section, we obtain the energy-efficient flight strategy by solving the HJB and FPK Equations (33) and (35). These two equations are coupled mutually and interact with each other, which can reach the MFE by resorting to the finite difference method [33].

In this finite difference framework, the time space

[0, T]

and the 3D vector space representing the location space, including 2D vector space in the horizontal direction

[0, X_{\max}]

,

[0, Y_{\max}]

and vector space in the vertical direction

[0, h_{\max}]

, respectively, are discretized into

W \times X \times Y \times Z

spaces. Then, we aim to find the optimal control policy in this four-dimensional discrete vector space including the time space and the location space. Hence, we define

\begin{matrix} Δ t : = \frac{T}{W}, Δ X : = \frac{X_{max}}{X}, Δ Y : = \frac{Y_{max}}{Y}, Δ h : = \frac{h_{max}}{Z}, \end{matrix}

(36)

which represent the iteration steps of the time, the transverse vector, the longitudinal vector, and the vertical vector, respectively.

Then, we use the Lax–Friedrichs schemes to solve the FPK equation in (35). Let n denote time index, j, k and l denote location coordinate indices in the discretized grid. Therefore, we have the iterative equation of mean field term as

\begin{matrix} m_{j, k, l}^{n + 1} & = \frac{1}{2} [(m_{j + 1, k, l}^{n} + m_{j - 1, k, l}^{n}) + (m_{j, k + 1, l}^{n} + m_{j, k - 1, l}^{n}) + (m_{j, k, l + 1}^{n} + m_{j, k, l - 1}^{n})] \\ - (Ξ + Ψ + H), \end{matrix}

(37)

where

Ξ

,

Ψ

, and

H

are given by

\begin{matrix} Ξ & = \frac{Δ t}{2 Δ X} (m_{j + 1, k, l}^{n} \cdot x_{j + 1, k, l}^{n} - m_{j - 1, k, l}^{n} \cdot x_{j - 1, k, l}^{n}), \end{matrix}

(38)

\begin{matrix} Ψ & = \frac{Δ t}{2 Δ Y} (m_{j, k + 1, l}^{n} \cdot y_{j, k + 1, l}^{n} - m_{j, k - 1, l}^{n} \cdot y_{j, k - 1, l}^{n}), \end{matrix}

(39)

\begin{matrix} H & = \frac{Δ t}{2 Δ h} (m_{j, k, l + 1}^{n} \cdot z_{j, k, l + 1}^{n} - m_{j, k, l - 1}^{n} \cdot z_{j, k, l - 1}^{n}) . \end{matrix}

(40)

m_{j, k, l}^{n}

,

x_{j, k, l}^{n}

,

y_{j, k, l}^{n}

, and

z_{j, k, l}^{n}

denote the value of the mean field, the transverse control, the longitudinal control and the vertical control, respectively.

In order to solve the HJB equation, we have to consider the constraints of the forward equation and mean field. Therefore, we use the Lagrange relaxation to solve the HJB equation. The Lagrangian

L (m (t, s), u (t, s), λ (t, s))

is defined as

\begin{matrix} L (m (t, s), u (t, s), λ (t, s)) \\ = \int_{t = 0}^{T} \int_{X = 0}^{X_{max}} \int_{Y = 0}^{Y_{max}} \int_{h = 0}^{h_{max}} [\partial_{t} m (t, s) - \partial_{S} (m (t, s) \cdot u (t, s))] \cdot λ (t, s) d t d X d Y d h \\ + E [\int_{0}^{T} {\bar{C}}_{n} (t) d t + {\bar{C}}_{n} (T)] \\ = \int_{t = 0}^{T} \int_{X = 0}^{X_{max}} \int_{Y = 0}^{Y_{max}} \int_{h = 0}^{h_{max}} [[\partial_{t} m (t, s) - \partial_{S} (m (t, s) \cdot u (t, s))] \cdot λ (t, s) + \bar{C} (t, s) m (t, s)] d t d X d Y d h, \end{matrix}

(41)

where

λ (t, s)

is the Lagrange multiplier. Here,

\partial_{S} (m (t, s) \cdot

u (t, s))

is defined as

\begin{matrix} \partial_{S} (m (t, s) \cdot u (t, s)) & = \partial_{X} (m (t, s) \cdot x (t, s)) + \partial_{Y} (m (t, s) \cdot y (t, s)) + \partial_{h} (m (t, s) \cdot z (t, s)) . \end{matrix}

(42)

We solve (41) by using the finite difference method. Similar to the previous method of solving the FPK equation, we discretize the Lagrangian as

\begin{matrix} L_{Δ s, Δ t} & = \sum_{n = 1}^{W + 1} \sum_{j = 1}^{X + 1} \sum_{k = 1}^{Y + 1} \sum_{l = 1}^{Z + 1} [{\bar{C}}_{j, k, l}^{n} \cdot m_{j, k, l}^{n} + λ_{j, k, l}^{n} \cdot Λ] \\ \cdot Δ X Δ Y Δ h Δ t, \end{matrix}

(43)

where

{\bar{C}}_{j, k, l}^{n}

and

λ_{j, k, l}^{n}

represent the value of the cost function and the Lagrange multiplier at time n location

(j, k, l)

on the discretized grid, respectively. Here,

Λ

is given by

\begin{matrix} Λ & = \frac{1}{Δ t} [m_{j, k, l}^{n + 1} - \frac{1}{2} (m_{j + 1, k, l}^{n} + m_{j - 1, k, l}^{n} + m_{j, k + 1, l}^{n} + m_{j, k - 1, l}^{n} + m_{j, k, l + 1}^{n} + m_{j, k, l - 1}^{n})] \\ - (\frac{Ξ + Ψ + H}{Δ t}) . \end{matrix}

(44)

In this model, the optimal decision variables include

u^{*} = \{x^{*}, y^{*}, z^{*}\}

,

m^{*}

and

λ^{*}

. To begin with, we update the value of the Lagrange multiplier by calculating

\frac{\partial L_{Δ s, Δ t}}{\partial m_{j, k, l}^{n}} = 0

. Therefore, we can obtain the iterative equation of variables

λ

as

\begin{matrix} λ_{j, k, l}^{n - 1} & = \frac{1}{2} (λ_{j + 1, k, l}^{n} + λ_{j - 1, k, l}^{n} + λ_{j, k + 1, l}^{n} + λ_{j, k - 1, l}^{n} + λ_{j, k, l + 1}^{n} + λ_{j, k, l - 1}^{n}) \\ + \frac{Δ t \cdot x_{j, k, l}^{n}}{2 Δ X} (λ_{j - 1, k, l}^{n} - λ_{j + 1, k, l}^{n}) + \frac{Δ t \cdot y_{j, k, l}^{n}}{2 Δ Y} (λ_{j, k - 1, l}^{n} - λ_{j, k + 1, l}^{n}) \\ + \frac{Δ t \cdot z_{j, k, l}^{n}}{2 Δ h} (λ_{j, k, l - 1}^{n} - λ_{j, k, l + 1}^{n}) - Δ t {\bar{C}}_{j, k, l}^{n}, \end{matrix}

(45)

where

n, j, k, l

are arbitrary on this discretized grid. Then, we update the value of the control by calculating

\frac{\partial L_{Δ s, Δ t}}{\partial x_{j, k, l}^{n}} = 0

,

\frac{\partial L_{Δ s, Δ t}}{\partial y_{j, k, l}^{n}} = 0

and

\frac{\partial L_{Δ s, Δ t}}{\partial z_{j, k, l}^{n}} = 0

, respectively. Therefore, the iterative equation of control

x (t)

can be expressed as follows:

\sum_{j = 1}^{X + 1} \sum_{k = 1}^{Y + 1} \sum_{l = 1}^{Z + 1} m_{j, k, l}^{n} \frac{\partial {\bar{C}}_{j, k, l}^{n}}{\partial x_{j, k, l}^{n}} - \frac{m_{j, k, l}^{n}}{2 Δ X} (λ_{j - 1, k, l}^{n} - λ_{j + 1, k, l}^{n}) = 0 .

(46)

Similarly, we can obtain the iterative equations of y and z as

\sum_{j = 1}^{X + 1} \sum_{k = 1}^{Y + 1} \sum_{l = 1}^{Z + 1} m_{j, k, l}^{n} \frac{\partial {\bar{C}}_{j, k, l}^{n}}{\partial y_{j, k, l}^{n}} - \frac{m_{j, k, l}^{n}}{2 Δ Y} (λ_{j, k - 1, l}^{n} - λ_{j, k + 1, l}^{n}) = 0,

(47)

\sum_{j = 1}^{X + 1} \sum_{k = 1}^{Y + 1} \sum_{l = 1}^{Z + 1} m_{j, k, l}^{n} \frac{\partial {\bar{C}}_{j, k, l}^{n}}{\partial z_{j, k, l}^{n}} - \frac{m_{j, k, l}^{n}}{2 Δ h} (λ_{j, k, l - 1}^{n} - λ_{j, k, l + 1}^{n}) = 0 .

(48)

Finally, the MFE is solved by (37) and (46)–(48) iteratively until they converge. The specific iteration step is displayed in Algorithm 1.

Algorithm 1 Obtaining the MFE

1:: Initialization:
2:: $m_{:}^{0}$ : initialize mean-field distribution;
3:: $λ_{:}^{W + 1}$ : initialize Lagrangian parameters;
4:: $u_{:}^{W + 1} : = x_{:}^{W + 1}, y_{:}^{W + 1}, z_{:}^{W + 1}$ : initial control.
5:: Repeat: Until the system obtains the MFE
6:: Compute update for the mean-field m:
7:: for $n = 1 : W$ , $j \in 1, . . ., X$ , $k \in 1, . . ., Y$ , and $l \in 1, . . ., h$ do
8:: Update $m_{n, j, k}^{n + 1}$ using (37).
9:: end for
10:: Compute update for the Lagrangian parameters $λ$ :
11:: for $n = W : 1$ , $j \in 1, . . ., X$ , $k \in 1, . . ., Y$ , and $l \in 1, . . ., h$ do
12:: Update $λ_{n, j, k}^{n - 1}$ using (45).
13:: end for
14:: Compute update for the control u:
15:: for $n = W : 1$ , $j \in 1, . . ., X$ , $k \in 1, . . ., Y$ , and $l \in 1, . . ., h$ do
16:: Update $x_{n, j, k}^{n - 1}$ using (46).
17:: end for
18:: for $n = W : 1$ , $j \in 1, . . ., X$ , $k \in 1, . . ., Y$ , and $l \in 1, . . ., h$ do
19:: Update $y_{n, j, k}^{n - 1}$ using (47).
20:: end for
21:: for $n = W : 1$ , $j \in 1, . . ., X$ , $k \in 1, . . ., Y$ , and $l \in 1, . . ., h$ do
22:: Update $z_{n, j, k}^{n - 1}$ using (48).
23:: end for

In Algorithm 1, we solve the FPK equation by iterating the mean field term m. During the iteration, if j equals 1 or X, we assume

\frac{1}{2} (m_{j + 1, :}^{n} + m_{j - 1, :}^{n}) = m_{j, :}^{n}

, and the term

m_{j - 1, k, l}^{n} \cdot x_{j - 1, k, l}^{n}

in (38) can be expressed as

m_{j, k, l}^{n} \cdot x_{j, k, l}^{n}

. k, l are similar to j. On the other hand, the HJB equation can be solved by reversely iterating the Lagrange multiplier

λ

and the control u. The end condition of iterations is the convergence point appearing or exceeding the number of iteration steps. We assume that the coordinates of the UAV in the area are positive. The values of control and the mean field term are positive for any time n and any state

(j, k, l)

. Therefore, the reformulated problem with the constraints mentioned above is a convex optimization problem. Meanwhile, the conditions of this algorithm (iterative equations) satisfy the necessary and sufficient conditions of the convex optimization problem. In other words, this convergence point is the MFE with the cost function

\bar{C} (t)

.

5. Numerical Results

In this section, we evaluate the system performance with the main simulation parameter settings listed in Table 1. We assume the coordinate origin of the UAV location is located in the lower left corner of the desired area, so that the state of the UAV is positive. The side length of the large square geographical region (desired area) is set as

R = 10

km, which means the maximum value of the horizontal area

X_{max} = Y_{max} = 10

km. In this model, the minimum and the maximum altitudes of UAVs are

h_{\min} = 1

km and

h_{max} = 2

km, respectively, which ensures the complete coverage of users in each small area.

Table 1. Parameters in simulations.

At the initial time, 100 UAVs are hovering at initial positions. The initial distributions of the UAVs and users are shown in Figure 3. The red circles represent the UAVs and the blue stars represent the users. In the horizontal direction, these 100 UAVs populate the desired area and each UAV has its responsible area, which is the small rectangle in Figure 3. The users are of random positions and numbers in each small rectangle. Meanwhile, each UAV has the same altitude

h_{0}

. We also illustrate the user and UAV distributions in Figure 4, where we assume that the initial altitude of users ranges from 0 to 5 m.

Figure 3. The 2D distribution of 100 and 1000 users at initial time.

Figure 4. The 3D distribution of users at initial time

t = 0

.

To illustrate the evolution of the mean field state under the control obtained by Algorithm 1 over a predefined period of time

t \in [0, T]

, we provide the distribution of the mass UAVs at different times, as shown in Figure 5. Here, T is set as 15 s. Figure 5 shows the changes in the distribution of the UAV at three times, namely, t = 5 s, 10 s, and 15 s. In Figure 5a–c, we show the 2D distribution of the UAVs (red circles) and users (blue stars) at those three moments. Meanwhile, we show the position changes of 100 UAVs in 3D space, which can be seen from Figure 5d–f. Compared with the initial time

t = 0

, the 100 UAVs find the optimal locations to ensure users’ access. The users’ access status is presented in Figure 6.

Figure 5. The changes in 100 UAVs’ 3D position at different time

t = 5, 10, 15

.

Figure 6. The ratio of successful access of 1000 users during time

t \in [0, T]

.

To verify the users’ access situation under these flight conditions, we show the access situation of all users in the predefined period of time

t \in [0, T]

in Figure 6. The black curve shows the rate of successful access of all users by adopting our proposed algorithm. At the initial time, the ratio of successful access of all users is at a lower level because the altitude of UAVs is lower and the initial position is certain, which corresponds to a smaller coverage area and the incorrect position. Then, all UAVs adjust their positions with the 3D optimal control. The ratio of successful access of all users rises until they all can access. Clearly, it is seen that this ratio reaches 1 at t = 14, which means that when the system reaches equilibrium, the user’s access target will be satisfied. For comparison, we assume that all UAVs increase access ratio in the same flight mode (rising at a constant speed), as shown in Figure 6. It can be observed that the ratio of user’s access is still at

0.8

and the growth is slow until the last time t = 15. Clearly, there is an obvious gap between the proposed algorithm and the benchmark scheme for the ratio of successful access. This is because the solution derived from the HJB equation follows the Bellman principle of optimality, which selects the optimal actions for system control.

To further illustrate the user’s access rate, we show the average SINR from time 0 to T, as shown in Figure 7. The red straight line represents the threshold of SINR. At the initial time, the average SINR is low enough because there are only a few users satisfying the access requirement. It can be seen that the average SINR reaches the threshold we set above after t = 6. Corresponding to Figure 6, at t = 6, the user’s access rate reaches

0.8

, which means 20% of the users still have lower SINR, which leads to the average SINR still rising. Moreover, MFE can be achieved until the optimal position is reached because the average SINR is basically unchanged. In addition, as another important term of the cost function, the flight energy consumption will be shown in Figure 8.

Figure 7. The average SINR of all users during time

t \in [0, T]

.

Figure 8. The average energy consumption of 100 UAVs during time

t \in [0, T]

.

In Figure 8, we present the average flight energy consumption of 100 UAVs. At the initial time, all UAVs hover at the same altitude (1000 m), and the initial energy consumption (hover energy consumption) is 60 J/s. Then, each UAV flies by adopting the optimal control, which leads to a higher average flight energy consumption at the beginning for achieving the SINR requirements. Subsequently, the average energy consumption decreases because the closer target area implies that more users are capable of reaching the threshold of SINR. The equilibrium emerges when all users arrive at the SINR threshold. At that time, each UAV keeps hovering, and the flight energy is restored to 60 J/s. The red straight line represents the average energy consumption of 100 UAVs by rising at a constant speed. The energy consumption is a constant at each time. Combining with Figure 6, when the proposed algorithm arrives at equilibrium, the total energy consumption by all UAVs is close to the energy consumption of all UAVs rising at the same speed by using the same time, but the users’ access rate is higher.

6. Conclusions

This paper has proposed a multi-UAV framework for emergency communication networks with ultra-dense ground users. In this framework, each UAV decides its own 3D flight trajectory for maximizing users’ communication qualities and the ratio of successful access under the flight energy consumption constraint. The system is modelled as an MFG, where N UAVs perform distributed controls based on their own local information. After designing the cost function and the value function of this system, we derive the HJB and FPK equations of the system. To obtain the solution, we adopt the Lax–Friedrichs scheme and the Lagrange relaxation, and demonstrate the existence of the MFE using strict proof. The numerical results show that the average SINR of all users arrives at the SINR threshold and the ratio of successful access of all users achieves 100% with the controls solved from MFG. Moreover, the comparison with the benchmark schemes verifies that the proposed algorithm has a higher access rate with a similar flight energy consumption. In the future, this work can be extended from the asymptotic convergence to a faster convergence approach, such as fixed-time convergence [34].

Author Contributions

Conceptualization, S.Q.; Methodology, S.Q.; Validation, Y.M.; Investigation, Y.M.; Writing—original draft, Y.M.; Supervision, S.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Derivation of HJB Equation

We first derive the value function of velocity vector

u_{i} (t)

and the state

s_{i} (t)

by Richard Bellman’s principle of optimality as

\begin{matrix} v_{i} (t) = min_{u_{i} (t) \in U_{i}} E [ & \int_{t}^{t + d t} c_{i} (t, s_{i} (t), u_{i} (t)) d t + v_{i} (t + d t, s_{i} (t + d t))] . \end{matrix}

(A1)

Then, we calculate the last term in (A1) by Taylor expansion as

\begin{matrix} v_{t} (t + d t, s_{t} (t + d t)) & = v_{t} (t, s_{i} (t)) + \frac{\partial v_{t} (t, s_{i} (t))}{\partial t} + \frac{\partial s_{t} (t)}{\partial t} \cdot \nabla v_{i} (t, s_{i} (t)) d t + o (d t), \end{matrix}

(A2)

where

\nabla v_{i} (t, s_{i} (t))

is the gradient of value function v and

o (d t)

is the term of higher-order Taylor expansion. By combining (A1) and (A2), we can obtain the HJB Equation (26).

References

Xiao, Y.; Du, Q.; Cheng, W.; Karagiannidis, G.K.; Zhao, Z. Model-ML Integrated Intelligence in URLLC Towards End-to-End Delay Fulfillment Over Vehicular Networks. IEEE Internet Things Mag. 2023, 6, 62–68. [Google Scholar] [CrossRef]
Li, C.; Zhu, S.; Sun, H.; Zhao, K.; Sun, L.; Zhang, S.; Wang, J.; Fang, L. Design and Implementation of an Emergency Environmental Monitoring System. Electronics 2025, 14, 287. [Google Scholar] [CrossRef]
Chen, S.; Li, W.; Zheng, W.; Liu, F.; Zhou, S.; Wang, S.; Yuan, Y.; Zhang, T. Application of Optical Communication Technology for UAV Swarm. Electronics 2025, 14, 994. [Google Scholar] [CrossRef]
Chen, R.; Cheng, W.; Ding, Y.; Wang, B. QoS-Guaranteed Multi-UAV Coverage Scheme for IoT Communications with Interference Management. IEEE Internet Things J. 2024, 11, 4116–4126. [Google Scholar] [CrossRef]
Zheng, K.; Fu, J.; Liu, X. Relay Selection and Deployment for NOMA-Enabled Multi-AAV-Assisted WSN. IEEE Sens. J. 2025, 25, 16235–16249. [Google Scholar] [CrossRef]
Wang, Y.; Yang, C.; Li, T.; Mi, X.; Li, L.; Han, Z. A Survey on Mean-Field Game for Dynamic Management and Control in Space-Air-Ground Network. IEEE Commun. Surv. Tutor. 2024, 26, 2798–2835. [Google Scholar] [CrossRef]
Yao, Z.; Cheng, W.; Zhang, W.; Zhang, H. Resource Allocation for 5G-UAV-Based Emergency Wireless Communications. IEEE J. Sel. Areas Commun. 2021, 39, 3395–3410. [Google Scholar] [CrossRef]
Zhao, Z.; Du, Q.; Song, H. Traffic Load Learning Towards Early Detection of Intrusion in Industrial mMTC Networks. IEEE Trans. Ind. Inform. 2023, 19, 8441–8451. [Google Scholar] [CrossRef]
Lin, W.; Yan, Y.; Li, L.; Han, Z.; Matsumoto, T. SemantIC: Semantic Interference Cancellation Toward 6G Wireless Communications. IEEE Commun. Lett. 2024, 28, 1810–1814. [Google Scholar] [CrossRef]
Fu, Y.; Cheng, W.; Zhang, W.; Wang, J. Scalable Extraction Based Semantic Communication for 6G Wireless Networks. IEEE Commun. Mag. 2024, 62, 96–102. [Google Scholar] [CrossRef]
Gimenez-Guzman, J.M.; Leyva-Mayorga, I.; Popovski, P. Semantic V2X Communications for Image Transmission in 6G Systems. IEEE Netw. 2024, 38, 48–54. [Google Scholar] [CrossRef]
Lin, W.; Yan, Y.; Li, L.; Han, Z.; Matsumoto, T. Semantic-Forward Relaying: A Novel Framework Toward 6G Cooperative Communications. IEEE Commun. Lett. 2024, 28, 518–522. [Google Scholar] [CrossRef]
Kim, K.; Lee, W.Y.; Ko, Y.J. Semantic Communications and Lossy Compression with Side Information. In Proceedings of the 15th International Conference on Information and Communication Technology Convergence (ICTC), Jeju Island, Republic of Korea, 16–18 October 2024; pp. 1638–1639. [Google Scholar] [CrossRef]
Lin, W.; Li, L.; Liu, Y.; He, Y.; Liu, Y. Timeliness optimization of unmanned aerial vehicle lossy communications for internet-of-things. Chin. J. Aeronaut. 2023, 36, 249–255. [Google Scholar] [CrossRef]
Lin, W.; Li, L.; Yuan, J.; Han, Z.; Juntti, M.; Matsumoto, T. Cooperative Lossy Communications in Unmanned Aerial Vehicle Networks: Age-of-Information with Outage Probability. IEEE Trans. Veh. Technol. 2021, 70, 10105–10120. [Google Scholar] [CrossRef]
Mehrabi, N.; Boroujeni, S.P.H.; Hofseth, J.; Razi, A.; Cheng, L.; Kaur, M.; Martin, J.; Amin, R. Adaptive Data Transport Mechanism for UAV Surveillance Missions in Lossy Environments. In Proceedings of the IEEE 22nd Consumer Communications & Networking Conference (CCNC), Las Vegas, NV, USA, 10–13 January 2025; pp. 1–4. [Google Scholar] [CrossRef]
Davis, D.T.; Chung, T.H.; Clement, M.R.; Day, M.A. Consensus-based data sharing for large-scale aerial swarm coordination in lossy communications environments. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Daejeon, Republic of Korea, 9–14 October 2016; pp. 3801–3808. [Google Scholar] [CrossRef]
Deng, W.; Feng, J.; Zhao, H. Autonomous Path Planning via Sand Cat Swarm Optimization with Multi-Strategy Mechanism for Unmanned Aerial Vehicles in Dynamic Environment. IEEE Internet Things J. 2025, 1. [Google Scholar] [CrossRef]
Mkiramweni, M.E.; Yang, C.; Li, J.; Zhang, W. A Survey of Game Theory in Unmanned Aerial Vehicles Communications. IEEE Commun. Surv. Tutor. 2019, 21, 3386–3416. [Google Scholar] [CrossRef]
Banez, R.A.; Li, L.; Yang, C.; Han, Z. Mean Field Game and Its Applications in Wireless Networks; Springer: Berlin/Heidelberg, Germany, 2021. [Google Scholar]
Li, L.; Zhang, Z.; Xue, K.; Wang, M.; Pan, M.; Han, Z. AI-Aided Downlink Interference Control in Dense Interference-Aware Drone Small Cells Networks. IEEE Access 2020, 8, 15110–15122. [Google Scholar] [CrossRef]
Chen, R.; Chen, J.; Wang, H.; Tong, X.; Xu, Y.; Qi, N.; Xu, Y. Joint Channel Access and Power Control Optimization in Large-Scale UAV Networks: A Hierarchical Mean Field Game Approach. IEEE Trans. Veh. Technol. 2023, 72, 1982–1996. [Google Scholar] [CrossRef]
Li, L.; Xu, Y.; Zhang, Z.; Yin, J.; Chen, W.; Han, Z. A Prediction-Based Charging Policy and Interference Mitigation Approach in the Wireless Powered Internet of Things. IEEE J. Sel. Areas Commun. 2019, 37, 439–451. [Google Scholar] [CrossRef]
Lin, W.; Ma, H.; Li, L.; Han, Z. Computing Assistance From the Sky: Decentralized Computation Efficiency Optimization for Air-Ground Integrated MEC Networks. IEEE Wirel. Commun. Lett. 2022, 11, 2420–2424. [Google Scholar] [CrossRef]
Srinivasa, S.; Haenggi, M. Distance Distributions in Finite Uniformly Random Networks: Theory and Applications. IEEE Trans. Veh. Technol. 2010, 59, 940–949. [Google Scholar] [CrossRef]
Al-Hourani, A.; Kandeepan, S.; Jamalipour, A. Modeling air-to-ground path loss for low altitude platforms in urban environments. In Proceedings of the IEEE Global Communications Conference, Austin, TX, USA, 8–12 December 2014; pp. 2898–2904. [Google Scholar] [CrossRef]
Mozaffari, M.; Saad, W.; Bennis, M.; Debbah, M. Efficient Deployment of Multiple Unmanned Aerial Vehicles for Optimal Wireless Coverage. IEEE Commun. Lett. 2016, 20, 1647–1650. [Google Scholar] [CrossRef]
Alzenad, M.; El-Keyi, A.; Lagum, F.; Yanikomeroglu, H. 3-D Placement of an Unmanned Aerial Vehicle Base Station (UAV-BS) for Energy-Efficient Maximal Coverage. IEEE Wirel. Commun. Lett. 2017, 6, 434–437. [Google Scholar] [CrossRef]
El Gamal, A.; Kim, Y.H. Network Information Theory; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
Zeng, Y.; Xu, J.; Zhang, R. Energy Minimization for Wireless Communication with Rotary-Wing UAV. IEEE Trans. Wirel. Commun. 2019, 18, 2329–2345. [Google Scholar] [CrossRef]
Bardi, M.; Dolcetta, I.C. Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations; Springer: Berlin/Heidelberg, Germany, 1997. [Google Scholar]
Bogachev, V.I.; Krylov, N.V.; Röckner, M.; Shaposhnikov, S.V. Fokker-Planck-Kolmogorov Equations; American Mathematical Society: Providence, RI, USA, 2015; Volume 207. [Google Scholar]
Burger, M.; Schulte, J.M. Adjoint Methods for Hamilton-Jacobibellman Equations; Westfälische Wilhelms-Universität Münster: Münster, Germany, 2010. [Google Scholar]
Ning, B.; Han, Q.L.; Zuo, Z.; Jin, J.; Zheng, J. Collective Behaviors of Mobile Robots Beyond the Nearest Neighbor Rules with Switching Topology. IEEE Trans. Cybern. 2018, 48, 1577–1590. [Google Scholar] [CrossRef]

Figure 1. A basic scenario of ultra-dense UAV communications.

Figure 2. The coverage model of the UAV i.

Figure 3. The 2D distribution of 100 and 1000 users at initial time.

Figure 4. The 3D distribution of users at initial time

t = 0

.

Figure 5. The changes in 100 UAVs’ 3D position at different time

t = 5, 10, 15

.

Figure 6. The ratio of successful access of 1000 users during time

t \in [0, T]

.

Figure 7. The average SINR of all users during time

t \in [0, T]

.

Figure 8. The average energy consumption of 100 UAVs during time

t \in [0, T]

.

Table 1. Parameters in simulations.

Parameters	Value	Parameters	Value	Parameters	Value
$μ$	$0.1$ Users/m²	$α$	$9 . 6$	$η$	$0.05$
R	10 km	$β$	$0.28$	$A_{f}$	$0.5$ m²
$P_{t}$	30 dBm	$θ$	$80^{\circ}$	$ζ_{LoS}$	1 dB
$γ_{t h}$	10 dB	$V_{f}$	30 m/s	$ζ_{NLoS}$	20 dB
$f_{c}$	2 GHz	$V_{tip}$	120 m/s	$a_{in}$	$0.6$
c	$3 \times 10^{8}$ m/s	$V_{0}$	4 m/s	$a_{out}$	$0.4$
$ρ$	$1.225$ kg/m³	$d_{f}$	$0.6$	$[h_{min}, h_{max}]$	$[1, 2]$ km

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Ultra-Dense Uplink UAV Lossy Communications: Trajectory Optimization Based on Mean Field Game

Abstract

1. Introduction

2. System Model

2.1. Basic Scenario

2.2. Channel Model

2.3. Cost Function

3. MFG for Trajectory Optimization

3.1. Basic Elements in Differential Game

3.2. MFG Framework

4. Energy-Efficient Flight Strategy

5. Numerical Results

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Derivation of HJB Equation

References

Article Metrics

Citations

Article Access Statistics