Joint Resource Allocation Optimization in Space–Air–Ground Integrated Networks

: A UAV-assisted space–air–ground integrated network (SAGIN) can provide communication services for remote areas and disaster-stricken regions. However, the increasing types and numbers of ground terminals (GTs) have led to the explosive growth of communication data volume, which is far from meeting the communication needs of ground users. We propose a mobile edge network model that consists of three tiers: satellites, UAVs, and GTs. In this model, UAVs and satellites deploy edge servers to deliver services to GTs. GTs with limited computing capabilities can upload computation tasks to UAVs or satellites for processing. Specifically, we optimize association control, bandwidth allocation, computation task allocation, caching decisions, and the UAV’s position to minimize task latency. However, the proposed joint optimization problem is complex, and it is difficult to solve. Hence, we utilize Block Coordinate Descent (BCD) and introduce auxiliary variables to decompose the original problem into different subproblems. These subproblems are then solved using the McCormick envelope theory, the Successive Convex Approximation (SCA) method, and convex optimization techniques. The simulation results extensively illustrate that the proposed solution dramatically decreases the overall latency when compared with alternative benchmark schemes.


Introduction
The rapid advancement of mobile communication has led to various emerging applications that have increased convenience in social life and production [1] and resulted in substantial changes [2].However, these novel services impose more stringent requirements on communication networks regarding latency, reliability, and speed [3].Simultaneously, the astronomical growth in data communication volume and energy consumption for computation and transmission put a heavy load on terminal devices with limited computing capabilities [4].Therefore, the explosive growth of data traffic and network applications presented significant challenges for the future development of mobile communication technology [5].Simultaneously, ground-based networks suffer challenges in providing coverage to remote areas [6]; in many prospective wireless communication situations, UAVs play a crucial role in meeting various communication needs.Moreover, UAVs can be integrated with low-earth orbit satellites for diverse domains [7], which can provide information services for different network applications in different spatial domains [8].Therefore, building a SAGIN to enable information from different spatial domains to serve various industries will be a future trend [9].
Recently, UAV-assisted MEC has been a focal point of research and is widely applied in various scenarios [10].In [11], the authors optimized key variables jointly between multiple drones and ground-based stations to decrease the cost of energy consumption and latency, which improves communication quality.Equipping UAVs with MEC servers enables the migration of ground-based computing tasks to drones.The work in [12] formulated a joint optimization problem with the limit of energy and latency constraints to minimize the energy consumption of ground user terminals.The authors in [13] utilized drones as relays, presented a dynamic NOMA/OMA strategy, and presented optimization problems to improve the minimum speed between vehicles and the total transmission rate.The security of the network is affected by various factors; the authors in [14] optimized the flight trajectory and power of the UAV and proposed an inequality iteration algorithm to improve the average secrecy rate of the network.Within the UAV-based MEC system network, UAVs can function as small-cell base stations to achieve scalable and secure video transmission through caching, allowing for the provision of services to mobile users [15].To balance the crucial metrics and the electrical energy consumption of the MEC system, the authors in [16] proposed a problem with the objective of maximizing weighted computational effectiveness.To address the heavy computational workload of UAVs, in [17], the authors employed game theory principles to enhance the computational offloading efficiency of UAVs by balancing energy consumption, latency, and cost.In [18], a data offloading framework is proposed wherein users can offload partial computational tasks to the MEC servers on UAVs, and the distinctiveness of the Pure Nash Equilibrium (PNE) is demonstrated through simulations.The authors in [19] studied a collaborative system with multiple UAVs, and it can utilize a multi-agent deep learning framework to tackle decreasing system latency and energy consumption.
UAV-assisted MEC systems can be integrated with satellites to become SAGIN, providing edge computing services to GTs in remote regions [20].The work in [21] presented a SAG hybrid cloud-edge computing framework.By jointly optimizing task assignment, transmission power, bandwidth allocation, and UAV computing resources, it achieves the minimization of the maximum computation latency among GTs.MEC services are frequently deployed in SAGIN networks; the authors in [22] presented an iterative optimization approach and utilized a greedy algorithm and the SCA method to minimize the total computing cost of all GTs.Due to the limited battery capacity, the authors in [23] decreased energy consumption by enhancing the offloading ratios and allocation of processing resources for terrestrial users, UAVs, and satellites.While ensuring energy constraints, Ref. [24] optimized offloading decisions to minimize network latency and proposed the BSUM algorithm for simulation verification.The network topology exhibits dynamics, and the authors in [25] presented an algorithm based on ADMM to get the optimal solution for network slicing problems.In [26], the authors proposed an I-SAT network (integrated satellite-aerial-terrestrial network), aiming to enhance the average throughput among users by addressing the joint optimization problem of association variables, power allocation, and UAV trajectories.In [27], the authors presented an integrated SAG network scheme incorporating terahertz; the scheme leveraged the abundant bandwidth of the terahertz spectrum and optimized variables such as terahertz frequency allocation to minimize the energy consumption of the network system.The work in [28] proposed a resource management scheme integrating Software-Defined Networking, which enhanced economic efficiency by analyzing variables, including priority, latency, energy consumption, and service level agreements.Additionally, due to the limitations of spectrum resources and atmospheric effects on free-space optical communication, in [29], the authors investigated a hybrid radio frequency and FSO scheme in a SAGIN and verified its performance through simulations.
Although there have been many studies on resource allocation in MEC systems, optimizing caching decisions is usually merely considered in the two-tier networks of MEC, as it is difficult to optimize caching decisions and other variables in a three-tier network.So, there is a limited amount of work considering the optimization of caching decision variables within the framework of a SAGIN.The interaction between computation and communication in mobile edge networks has a substantial influence.The problem of resource allocation for improving network performance continues to be a subject that merits extensive investigation.Therefore, this paper proposes a three-tier mobile edge network model and, through the proposed algorithm, jointly optimizes association control, bandwidth allocation, computation task allocation, caching decisions, and the UAV's position to minimize total task latency.The major contributions of this paper are as follows: (1) Considering the limited computing capabilities of ground users and aiming to improve the rationality of network resource allocation, we deploy edge servers on UAVs to provide caching content services.We formulate an original problem with the purpose of minimizing the total computation task delay.
(2) We propose an iterative optimization algorithm based on BCD, decomposing the joint optimization problem into different subproblems.We consider optimizing cache decision variables in SAGIN and use the McCormick envelope theory to address the coupling of association variables and cache decision variables.Additionally, we optimize UAV positions using the SCA method and convex optimization techniques.
(3) The experimental results demonstrate that the proposed method outperforms others, effectively enhancing the total computation task delay of the network system.

System Model
Figure 1 illustrates the SAGIN system, which comprises N GTs, M UAVs, and one low-Earth-orbit (LEO) satellite, where UAVs and satellite are equipped with edge servers.The UAVs can provide services to ground user terminals and satellite while also handling computation tasks from both ground and satellite sources.The sets of GTs and UAVs can be denoted by N = {1, 2, . . ., N} and M = {1, 2, . . ., M}, respectively.The computation task of the i-th GT in the current time slot is denoted by I i .When users generate demands, computation tasks will be generated, such as weather data processing, decoding data packets, handling network protocols, etc.In general, computation tasks typically involve data transmission.The computation tasks generated by ground terminals and satellite will be transmitted, and the total computation task delay consists of transmission and computation delays.The triplet is marked by <F i , D i , T i >, where (1) F i is defined as the computational complexity, representing the CPU resources required to compute one bit of data.(2) D i denotes the size of the computation task.Note that the computation task can be partitioned.(3) T i is the maximum tolerable delay of the task.The overall data volume of computational task D i can be separated into two distinct sections, denoted by D l i and D s i ; i.e., where D l i denotes the data produced by ground user terminals, and D s i represents the data generated by satellites, which can be transmitted to UAVs and GTs for processing.
Additionally, the data generated by ground user terminals can be divided into three parts, which can be processed on the GTs, UAVs, and satellites; i.e., Each UAV is outfitted with a server, allowing each UAV to offer services for GTs within its service area.The ground terminals can either process tasks locally or offload computation tasks to UAV-based MEC servers or satellite cloud servers.The essential notations are shown in Table 1.
Uplink transmission rate from i-th device to j-th UAV, and from j-th UAV to satellite Downlink transmission rate from j-th UAV to i-th device, and from satellite to j-th UAV

Channel Model
Consider representing the coordinates of ground terminals and UAVs in a threedimensional Cartesian coordinate system, denoted by u GT i = x GT i , y GT i , 0 and u U AV j = x U AV j , y U AV j , H , respectively.Here, ∥∥ 2 represents the square distance between the position vectors of ground user terminals and UAVs' positions.Thus, the distance between the two can be expressed in the form that follows.
According to [18], the channel coefficient can be represented as follows.
where ĥi,j denotes the small-scale fading, h 0 is the reference channel gain at the distance d 0 = 1, and α represents the path-loss exponent.According to [30], assuming that α = 2, the small-scale fading can be represented as follows: Note that the Rician fading factor is denoted by λ, and it is represented by h i,j in the Line of Sight (LoS) scenario, where | hi,j | = 1.In the Non-Line of Sight (NLoS) situation, it is denoted by hi,j , where hi,j ∼ CN(0, 1).
The ground terminals utilize Frequency Division Multiple Access (FDMA) to offload computation tasks to UAVs to prevent co-frequency interference.When the i-th GT offloads its tasks to the j-th UAV, the achievable uplink data rate for the GT will be where B U is the available bandwidth in the uplink, b i,j represents the allocated system bandwidth, P i is the transmission power for the i-th GT in the uplink, and N 0 is the noise power.Where P j is the transmission power of the j-th UAV, the downlink data rate for GTs will be where B D is the available bandwidth in the downlink.

Computation Model
We define a ij as the association variable between the i-th GT and the j-th UAV.In particular, each GT can be associated with, at most, one UAV; i.e., where c u j represents the cache variable, and c u j = 1 indicates that the data has been cached on the UAV, and c u j = 0 otherwise.
Considering the cache variable in the latency, in some scenarios, there are more computation tasks for the satellites and fewer tasks for ground user terminals.So, the paper temporarily considers caching variables only in the downlink.In addition, the satellite request latency can be ignored in certain situations, such as autonomously executed tasks or periodic data collection.The computation capability of a GT is denoted by f l i .When the GT processes task I i locally, the delay consists of two parts: transmission delay and computation delay.It is essential to consider whether the UAV caches computation tasks from the satellite.The local computation delay can be defined as When the GT processes computation task I i , the energy consumption can modeled as where k is a constant depending on the processor chip architecture.When ground user terminals offload part of the computation task to a UAV, we can assume that the latency can be divided into three stages: (1) the transmission time from the GTs to the UAV, (2) the time it takes to process the task on the UAV server, and (3) the time it takes for the UAV to transmit the computation results back to the ground user terminals.According to [31], the processed computation results in a data volume much smaller than before processing; therefore, this part can be omitted.Therefore, the latency when processing computing tasks at the UAV can be expressed as where f u i represents the computing capability of the UAV.Therefore, the consumption of energy for transmission used during the process of offloading computation tasks to the j-th UAV can be expressed as The consumption of energy for carrying out computation tasks I i by the UAV is given by We assumed that multiple UAVs hovered above the task area to provide relevant services for the GTs and neglected the propulsion energy of the UAVs for now [12,32].However, except for the energy use of computing and transmission, the flying energy of the UAVs also needs to be considered [33].
The thrust of the UAV's mass is represented by ζ, the power efficiency is denoted by η j , the number of rotors is expressed by q, the rotor diameter is r, and the air density is ρ.Therefore, the energy consumption of the UAV hovering can be represented as So, the sum of the energy consumption of the UAV for computation data, transmitting data, and hovering is represented as Similarly to [34], we consider fixed transmission rates for communication with the satellite.In addition, the delay in processing computing tasks in the satellite can be represented as where f s i represents the computing capacity of the satellite, and P s u denotes the transmission power between the UAV and the satellite.Therefore, the energy consumption during the process of transmitting tasks from the UAV to the satellite is given by In summary, the total delay of computation tasks I i during the processing can be expressed as where the total energy consumption will be

Problem Formulation and Optimization Solution
We focus on minimizing the total computation task delay while satisfying constraints on maximum energy consumption and latency.We formulated the association variable a = a i,j , the computation task D = {D l i , D s i }, the bandwidth allocation b = b i,j , the caching decision variables c = c u j , and the UAV coordinates . The formulated joint optimization problem is as follows: where T i represents the maximum completion time for computation tasks, E max i is the maximum available energy for the i-th GT, and C u represents the cache capacity of each UAV not being able to exceed its maximum capacity.The constraint conditions in Equation ( 22) can be interpreted as follows: C1 represents the processing time of computation tasks not being able to exceed the maximum delay; C2 represents the energy consumption constraint for each GT; C3 implies the constraints on the allocation of computation tasks for each ground user terminal; C4 is the ability of each GT to communicate with no more than one UAV; C5 represents the constraints on bandwidth allocation ratios; C7 implies the limited cache capacity of the UAV; C8 represents the the smallest distance between UAVs to avoid collisions; C9 denotes the constraints on the flight altitude.
We can observe that Equation ( 22) is non-convex, leading to difficulty obtaining a globally optimal solution.Based on the BCD method, we demonstrate an alternating algorithm that transforms Equation ( 22) into subproblems, including association control, task allocation, bandwidth allocation, and UAV position optimization, to obtain suboptimal solutions to problems with Equation (22).

Association Control and Caching Variables
Under a given {b, D, u U }, the association variable and the caching variable are tightly coupled in a multiplicative form, which makes the problem challenging to solve.We relax the integer variable a i,j ∈ {0, 1},c u j ∈ {0, 1} to a continuous variable 0 ≤ a i,j ≤ 1, 0 ≤ c u j ≤ 1 and introduce a new variable z = z i,j , ∀i ∈ N , ∀j ∈ M. Therefore, the original problem can be reformulated as follows: Then, according to the McCormick envelope theory, the non-convex constraint z i,j = 1 − c u j a i,j , ∀i ∈ N , ∀j ∈ M can be replaced by the McCormick convex relaxation condition, which can be expressed explicitly as z i,j ≥ 0, ∀i ∈ N , ∀j ∈ M (25) Therefore, Equation ( 23) can be re-expressed as min a,c,z By utilizing the McCormick envelope theory and convex relaxation techniques, the original problem can be transformed into a convex optimization problem, and we can find the optimal solution using standard optimization algorithms.

Computation Task Assignment
Given {a, b, c, u U }, the original problem can be reformulated as follows: It can be concluded that Equation (29), the above expression, is non-convex and can be solved using various classical algorithms, such as interior-point methods.

Bandwidth Allocation
Under a given {a, c, D, u U }, the original problem can be rewritten as follows: According to T u i ≤ T i and E i ≤ E max i , C1 3 and C2 3 can be inferred.According to [35], the perspective operation preserves convex (concave) shapes.It can be seen that b i,j B U log 1 . Hence, it can be proven that b i,j B U log 1 is a concave function of b i,j ; therefore, both C1 3 and C2 3 are convex.Combining other constraint conditions, we can conclude that Equation ( 30) is convex.

UAV Position
When {a, b, c, D} is given, the problem can be converted into the problem of the positions of UAVs.
In order to address the non-convex constraints, the auxiliary variables ε, λ, η and x i are introduced.We define that ≥ λ, and R D u,l ≥ η.Furthermore, C1 4 can be reformulated as follows: Based on the SCA approach, we perform a first-order Taylor expansion at the local point xi .The Equation (32) can be inferred as follows: Next, we need to address the non-convex constraint C8.By giving the local points Ĥj , ûUAV j and Ĥm , ûUAV m , it can be inferred that the lower bound is on the left-hand side of C9.The formula can be expressed as Equation ( 34): The lower bound at the given local point is replaced by the left-hand side of C1 4 , C2 4 , and C9, so we can obtain the following convex problem: The problem mentioned above can be resolved by utilizing the MATLAB CVX toolbox.At the m-th iteration, As the subset of (31) is the feasible set of Equation ( 35), the upper bound solution for Equation ( 31) can be obtained by solving an approximate problem of Equation (35).The solution process for problem (31) is depicted in Algorithm 1.
Algorithm 1 Optimization algorithm for the problem of Equation ( 31)  In conclusion, we propose a comprehensive alternating optimization algorithm using the BCD method to solve Equation (22).Specifically, we divide the variables in Equation ( 22) into different blocks, which are a,c,b, D, and u U .Subsequently, we alternately optimize association control, cache decision, computing task allocation, bandwidth allocation, and UAV position optimization by solving Equations ( 23) and ( 29)-( 31) while keeping the other variable blocks fixed.Algorithm 2 summarizes the detailed process of this method.
In the first step of Algorithm 2, initializing {a } is necessary.To guarantee the feasibility of the initial point, the initialization strategy should satisfy the constraints of Equation (23).Algorithm 3 provides the initialization method.
Algorithm 2 Optimization algorithm for Equation ( 22) The computation task data volume originated from both GTs and satellites.The data produced by GTs can be processed locally on the ground, UAVs, and satellites, respectively; i.e.,.
0] Initialize: Set the coordinates of the UAVs, and the height of the UAV is

Results and Analysis
In this section, we conducted extensive simulations to validate the proposed scheme's and algorithm's effectiveness.We consider a system with K = 20 that are located at the horizontal coordinates [0; 20], [20; 5], [20; 0], [5; 20], [20; 20] [21,36,37], the simulation parameters are shown in Table 2.We have set up the following three basic schemes for ease of comparison.

1.
Local Computation: The data volume of computing tasks is entirely processed by local GTs, and neither the UAVs nor the satellite (LEO) provide computing services for ground GTs.

2.
Satellite Computation: The data volume of the computation task is entirely processed by the satellite, and ground GTs and UAVs do not provide computation services for the satellite.

3.
Fixed UAV Positions: The data volume of computing tasks can be handled at ground GTs, UAVs, and the LEO satellite.In contrast with the proposed method, this method concurrently optimizes association control, bandwidth allocation, cache decision, and the distribution of computing tasks while maintaining fixed UAV positions.

4.
Fixed Bandwidth Allocation: All ground user terminals are allocated equal bandwidth, while the GTs, UAVs, and satellites provide services normally [20]. 5.
Ground-Air Cooperation: The computation tasks of ground user terminals can be offloaded to UAVs for processing, whereas the satellite does not provide services for GTs and UAVs [21].
Figure 2 illustrates the convergence of the algorithm.It can be observed that the proposed algorithm converges rapidly and obtains the optimal solution.Based on the convergence analysis conducted using the Block Coordinate Descent method [39], it can be observed that when the number of UAVs (M) remains constant and the number of ground user terminals (N) rises, there will be a corresponding increase in the computation delay of each ground user terminal.This is due to the inevitable decrease in the allocation of edge computing resources to each ground user terminal, which leads to an increase in the total computation task delay.When N remains constant and M increases, the total computation task delay decreases.The reason is that the increase in the number of UAVs leads to a corresponding increase in the computational capacity on the UAV side, which allows UAVs to process tasks quickly and reduces the latency for task processing.In addition, the ground user terminals have the capability to allocate tasks to more UAVs, thereby reducing the processing workload on each UAV and consequently reducing the total computation task delay.
As shown in Figure 3, we plot the relationship between the total computational latency and the local computation capacity.It can be observed that there is a negative correlation between the computing capability of ground user terminals and the total computation task delay.The reason is that the enhancement in computation capability reduces the local processing delay.However, the enhancement of local computation capability does not affect the situation in which all computation tasks are executed on the satellite.In addition, it is worth noting that, compared with local computation, satellite computation, fixed UAV positions, fixed bandwidth allocation, and ground-air cooperation schemes, the optimization algorithm proposed in this paper effectively reduces the delay.Local computation capability (Hz) As shown in Figure 4, we plot the relationship between the total computational latency and the satellite computation capacity.From Figure 4, it can be observed that the total computation task delay decreases with the increase in satellite computation capability.This is due to the increase in satellite computation capacity reducing the processing latency of the tasks handled by the satellite.The enhanced computation capacity of the satellites facilitates collaboration between space, air, and ground, thereby reducing the total computation task delay.In addition, we can conclude that the proposed optimization approach reduces the latency effectiveness compared with local computation, satellite computation, fixed UAV position, fixed bandwidth, and ground-air cooperation schemes.As depicted in Figure 5, we plot the relationship between the total computation task delay and the amount of the computation task data.It can be seen that the total computation task delay increases with the increase in the amount of computation task data in Figure 5.The increase in computation tasks leads to an increase in the transmission latency from ground user terminals to UAVs and from UAVs to satellites.In addition, it increases the number of computation tasks handled by ground user terminals, UAVs, and satellites, leading to an increase in the processing latency of these tasks.Consequently, the total computation task latency increases.The reason for this is that an increase in the amount of computation tasks leads to an increase in the transmission latency from ground user terminals to UAVs and from UAVs to satellites, which, at the same time, increases the computation tasks handled by ground user terminals, UAVs, and satellites, increasing the processing latency of these tasks, thus leading to an increase in the total computation task delay.Besides, it is evident that the optimization scheme proposed in this paper surpasses other baseline approaches.We investigate the relationship between the total computation latency and the transmission power under the condition of a Rician fading factor of λ = 3, as depicted in Figure 6.It is evident that when the transmission power increases, there is a corresponding decrease in the total computation task delay.The reason for this occurrence is that the augmentation in transmission power results in a corresponding augmentation in the transmission rate between ground user terminals and UAVs, hence diminishing the transmission latency both on the ground and in the air.Moreover, the paper's optimization algorithm effectively reduces the total computation task delay when compared with other baseline solutions.
Figure 7 depicts the correlation between the total computation task delay and the bandwidth of the system when the Rician fading factor λ = 3.From Figure 7, it can be inferred that there is a negative correlation between the total computation task delay and the system bandwidth.The reason for this phenomenon is that an increase in system capacity leads to a rise in the transmission rate between ground user terminals and UAVs, which reduces transmission delay.This has a similar effect on increasing transmission power, which fosters cooperation between terrestrial user terminals and aerial UAVs, further reducing the total computation task delay.

Discussion
In contrast with conventional research, the novel UAV-assisted SAGIN model proposed in this paper is quite interesting.In order to improve the precision of our findings, we partitioned the computation tasks generated by ground user terminals, enabling separate processing on the ground, UAVs, and satellites.In general, three-layer networks do not typically comprehensively consider optimizing cache variables [23,25].Moreover, to the best of our knowledge, the existing research mainly focuses on optimizing caching variables in two-tier mobile edge networks [15].We are the first to optimize caching variables in the three-tier network model of UAV-assisted SAGIN.The optimization of resource distribution to improve network performance has been an important topic of research for scholars globally in the field of MEC.Correspondingly, the proposed framework model in the paper effectively makes use of the flexibility of UAVs, which improves the performance of the MEC system.
In order to minimize the total computation task delay, we present an iterative algorithm that improves association control, bandwidth allocation, computation task assignment, caching decisions, and the UAV position.However, the present problem is mixed-integer nonlinear programming (MINLP), which is difficult to solve directly.The BCD is frequently utilized to resolve optimization problems with many variables.The core idea is to decompose the optimization problem into several subproblems, wherein all variables are partitioned into distinct blocks, each containing one or more variables.After that, these blocks are updated alternately.During each iteration, a specific block is selected, and the variables in the selected block are considered independent variables, forming a subproblem.The variables in the chosen block are updated, and this process is repeated iteratively until a convergence criterion is satisfied.Specifically, we decompose the proposed problem into four blocks using the BCD method.In the first block, we solve for the association control and caching decision variables while maintaining other variables constantly.The McCormick envelope theory is utilized to decouple the variables, followed by applying convex optimization techniques to convert them into convex problems to solve them.In the above manner, the process is repeated iteratively until the final block, where the only need is to solve for the positions of the UAVs.The problem is converted into a convex problem by introducing new variables for relaxation and conducting a Taylor expansion around local points to get the lower bounds, which can be solved using MATLAB.Finally, the efficacy of the proposed optimization algorithm is validated through simulations.
The results show that the proposed UAV-assisted SAGIN model presented in this paper is significant.Firstly, this model can be applied in various scenarios, including but not limited to emergency communication, post-disaster recovery, communication coverage in remote areas, and military operations.It has the capability to offer flexible, efficient, and dependable communication.The parameters that were studied in the model, including cache variables and UAV positions, are also instructive and can help designers make better MEC-related systems in the future.Comparing the proposed optimization method with different baseline schemes has shown that it works well, demonstrating the significance of the optimized variables in this paper.This indicates their applicability in various realworld scenarios.Finally, the efficacy of the proposed optimization algorithm is validated through simulations.
Future work still has many directions worthy of our exploration, such as (1) multi-agent systems, (2) reconfigurable intelligent surfaces, and (3) artificial intelligence.
(1) Multi-Agent Systems: Multi-agent systems can be deployed on MEC servers on UAVs or at base stations, where each ground user reports its parameters to the agents via UAVs.The agents can regularly update the state of the associated users, determine bandwidth allocation and transmission strategies, and perform other functions based on the current user status.Based on the Markov Decision Process, the actor learns the policy function and the critic learns the value function through interaction with the environment, aiming to maximize resource utilization.(2) Reconfigurable Intelligent Surfaces (RIS): Obstacles like trees and buildings may inevitably obstruct communication between ground users and UAVs.We can achieve the control and optimization of signals by appropriately configuring the phase and magnitude of the controllable units of RIS.An RIS can expand coverage and enhance signal transmission effectiveness, and compensate for signal coverage blind spots and shadow areas, which improve the coverage range and quality of UAV-assisted SAGIN networks.(3) Artificial Intelligence: In the UAV-assisted SAGIN model proposed by the paper, we can utilize artificial intelligence algorithms to design resource allocation schemes, which include deep neural networks and deep reinforcement learning, including but not limited to planning UAV paths, making decisions about navigation, searching for targets, sensing the surroundings, and avoiding obstacles.This enables the UAVassisted SAGIN to allocate communication resources dynamically based on real-time communication demands and environmental changes, thereby improving the performance and efficiency of the model in various application scenarios.

Conclusions
The paper presents a three-tier network model comprising satellites, UAVs, and ground user terminals.In the three-tier network model, the computation tasks of ground user terminals can be divided, and the tasks can be processed by ground stations, UAVs, and satellites.To be more specific, to reduce the total computation task delay, we proposed an iterative approach that jointly optimizes association control, bandwidth allocation, cache decision-making, computation task assignment, and UAV positioning.However, the proposed problem is an MINLP, which makes it difficult to solve directly.We utilize BCD to decompose it into four different blocks, each containing different variables.We keep the other variables constant to solve the variables we need in each block.The McCormick envelope theory separates the associated control and caching decision variables, thereby transforming the non-convex problem into a convex one.A Taylor expansion is conducted on local points to determine the lower bounds and introduce new variables for relaxation to solve the positions of the UAVs.Once all the different blocks have been transformed into convex problems, MATLAB is employed to solve them.Finally, it is shown through simulations that the approach suggested in this paper surpasses other baseline solutions and ensures convergence, effectively lowering the total computation task delay.

Figure 3 .
Figure 3.Total delay versus local computation capability.

Table 1 .
Description of important symbols.
Total delay versus maximum transmission power of GTs.