Service Cache-Based Offloading and Resource Optimization Algorithm for UAV-Assisted Computing

Zihao Li; Qi Zhu

doi:10.3390/electronics14081578

and

The Department of Communication and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210009, China

^*

Author to whom correspondence should be addressed.

Electronics2025, 14(8), 1578;https://doi.org/10.3390/electronics14081578

Version Notes

Order Reprints

Abstract

Edge computing minimizes data transmission delay and fulfills the requirements of real-time applications for the wireless network coverage problem in edge computing. This paper proposes a resource optimization algorithm based on service caching, and designs a hybrid computing model of UAV-assisted computation offloading and service programs, in which the UAV (Unmanned Aerial Vehicle) not only carries a server for computation offloading but also serves as a relay to assist in the downloading of service programs. The objective is to minimize the system’s total energy consumption while adhering to constraints such as delay, cache capacity, and other relevant factors. Because of the non-convex nature of the objective problem, it is divided into three sub-problems for more effective handling. Through analysis, the relationship between the resource allocation ratio and total system energy consumption is established, and the simulated annealing algorithm, greedy algorithm, and genetic algorithm are used to solve these sub-problems, respectively. Finally, the global optimal solutions for location deployment, service caching, resource allocation, and offloading decisions are obtained through iteration. The simulation results show that the proposed algorithm successfully reduces the total energy consumption of the system while maintaining task completion delay constraints.

Keywords:

UAV; service caching; computational offloading; simulated annealing; genetic algorithm

1. Introduction

Over the past few years, the swift advancement of mobile computing has been propelled by the growing use of different mobile devices, including smartphones and wearable devices, which enable computing and communication to be carried out anytime and anywhere. However, due to resource limitations, mobile devices often struggle to meet users’ quality of service expectations. Although cloud computing has significant advantages in resource management and computational performance compared to local computing, it usually relies on remote data centers with long data transmission latency [1,2], which can severely impact applications that demand high real-time performance such as multimedia or medical monitoring. In recent years, fog computing has emerged as a promising paradigm to address the limitations of traditional cloud-based architectures in smart city environments, such as high latency and limited context awareness. A recent systematic literature review by Rahman et al. [3] provides a comprehensive classification of fog computing research into service-based, resource-based, and application-based approaches, highlighting its crucial role in latency-sensitive urban applications. Additionally, by offloading computing tasks to servers closer to users or task sources, mobile edge computing (MEC) provides an efficient solution to these challenges, alleviating the computational burden on user devices, and effectively reducing the data transmission delay compared with cloud computing.

For the multi-user scenario, researchers have given solution ideas to reduce the energy consumption of the system. In [4], the authors introduced an energy-efficient beamforming scheme for downlink multi-user MISO systems, where a base station equipped with dynamic metasurface antennas (DMAs) simultaneously serves multiple users, and they used an efficient alternating optimization (AO) algorithm to solve it. Additionally, researchers have investigated computational offloading strategies and suggested various methods to optimize both energy and delay consumption. In [5], the authors combined energy and delay consumption into a single framework, reformulating the problem as the search for an optimal solution within a finite policy space, and applied a heuristic approach to make offloading decisions for mobile devices. In [6], the authors utilized an orthogonal frequency division multiplexing (OFDM) transmission mechanism to reduce inter-user interference while optimizing offloading decisions and bandwidth resources to lower the overall energy consumption of the MEC system. In [7], the authors considered the uncertainty of resource demand and the delay constraints of heterogeneous computing tasks for dynamic time-varying systems, and co-optimized the offloading decision, computational resources, and bandwidth resources in the MEC system to minimize the total system energy consumption. Yousefpour et al. [8] proposed a comprehensive IoT–fog–cloud architecture aimed at minimizing IoT service delay through fog offloading. Their framework allows IoT tasks to be adaptively offloaded among fog nodes based on real-time queue states and estimated processing times, effectively balancing load and improving response times. By developing both a Markovian queueing model and an event-driven simulator, they demonstrate that their fog-collaboration policy significantly lowers average service delay across different traffic patterns (e.g., light vs. heavy tasks). This work highlights the potential of collaborative fog offloading to meet stringent quality of service (QoS) requirements in large-scale IoT deployments, and it provides valuable insights for subsequent research on robust, low-latency fog-based architectures. The authors of [9] introduced a computational offloading algorithm that jointly considers task prioritization and partial offloading. This algorithm makes offloading decisions based on the task’s tolerable delay, the minimum computation delay, and the lowest energy consumption. The offloading task priorities and MEC server rankings are determined independently. Tasks with higher priority are offloaded firstly to MEC servers with greater computational power, alleviating the processing load on local devices. In [10], the authors proposed a mobile edge computing model consisting of multiple users and MEC servers, where each user has multiple independent tasks. Through optimizing task offloading, computation, and communication resource allocation, the goal is to achieve the overall best decision that minimizes the weighted total energy cost and latency for all users. However, the cache capacity of the MEC servers is not considered in the above studies, and all assumed that all service programs are cached in the servers.

In practice, the cache capacity of the MEC server’s storage is constrained, preventing it from caching all service programs. To address this problem, the authors of [11] proposed a cache-assisted computation offloading scheme and optimized the caching policy, offloading policy, and resource allocation to decrease task processing time and conserve energy on user devices. In [12], an integrated optimization approach for multi-user computation offloading and service caching was explored by the authors, modeling it as a mixed-integer nonlinear programming problem to reduce the system’s task cost. In [13], the authors designed a collaborative caching algorithm to assist computation offloading, which determined different cache contents for different tasks and designed corresponding update policies to optimize the usage of cache and computational resources in the system. The authors of [14] used game theory to develop a suboptimal offloading strategy that incorporates service caching and D2D communication in multi-access networks, aiming to reduce computational offloading overhead by optimizing the offloading decision. In [15], a cache-enhanced computational offloading system model was proposed, extending the local caching of a single region in a MEC network to collaborative caching across multiple regions. This approach improves the overall cache hit rate, with offloading decisions ultimately determined using deep reinforcement learning. The above studies assumed that users are within the mobile network coverage and did not consider the communication situation of users in complex environments such as when terrestrial infrastructure is scarce or unavailable.

Mobile networks cannot cover all areas due to complex terrain, infrastructure limitations, or natural disasters. The use of UAVs has become increasingly prevalent in wireless communication systems due to their flexibility to enable dynamic deployment [16]. Equipping MEC servers on UAVs to provide service to users who cannot be covered by wireless networks is an approach that can significantly improve the user experience compared to traditional terrestrial MEC networks. In [17], researchers have investigated the dominant barriers and key techniques of THz-ISAC-UAV from the transceiver design perspective. To enhance both energy and hardware efficiency while addressing challenges related to distance and mobility, the authors focused on three critical technologies: UM-MIMO-ISAC hybrid beamforming, THz-ISAC waveform design, and communication and sensing channel state information acquisition. Finally, they provided a comprehensive discussion of their underlying principles and the key challenges associated with each.

In [18], focusing on multi-UAV MEC systems, researchers examined service caching-based cooperative computation and resource allocation, proposing an optimization problem that minimizes the worst-case task completion delay under device and UAV energy constraints. In [19], the authors utilized UAV-assisted service caching to obtain an optimal offloading and resource allocation policy that minimizes energy consumption under delay constraints by determining the 3D location of the UAV and the deployment of its services in the edge servers. The authors of [20] proposed a model to minimize the total UAV endurance under delay constraints, which jointly optimizes the offloading and caching decisions, the UAV hovering trajectory, and its computational resource allocation to improve the communication and computational resource utilization.

Research on UAV-assisted service cache-based computational offloading has just begun, and the related literature and research results are relatively limited. Refs. [18,19] studied UAV-assisted service cache-based computation offloading, but all of them assumed that users cached the required services, but the service requirements of users are usually highly dynamic and diverse, it is difficult to match and cache suitable services for all users in advance, and some computational tasks not only rely on static code or data, but also require real-time back-end processing. In that circumstance, all users holding caches is inconsistent with reality and not feasible. In addition, the authors of [18] proposed a scheme to offload user tasks to the base station for computation when the UAV computational capability or cache resources do not meet the user’s needs, but it does not take into account the fact that the user is distant from the base station, which makes it difficult for the UAV to cover both users and the base station at the same time, and when the UAV does not cache the relevant service program, it only considers the download of service programs from the base station, which results in a waste of the user’s device service resources. In [19], the authors assumed that only the services cached by the UAV are possible to uninstall, and this scenario ignores the situation where the UAV can download programs from either the user or the base station, which underutilizes the computational resources of the UAV. In [20], the authors assumed that the user’s computing power and cache capacity are much smaller than those of the UAV. In addition, they did not study local computing and caching, and concentrated on offloading computing workloads and the processing of UAV-cached tasks at the current and previous moments, without considering service caching. To tackle the aforementioned problems, this paper presents a UAV-assisted resource optimization algorithm based on service caching and downloading, which introduces a relay UAV and a computing UAV in order to assist users in completing computation tasks. To reduce total system energy consumption, this paper jointly optimizes UAV positions, caching decisions, computational resource allocation for the computing UAV, and user offloading strategies. The key contributions are summarized as follows:

(1): A UAV-assisted cooperative computing model based on service caching and downloading is constructed, which consists of users, a relay UAV, a computing UAV, and a base station, and users are outside the coverage range of the base station. Among them, the computing UAV is able to cache part of the service programs and can provide computing and downloading services to users. The relay UAV has a wider communication coverage and can act as a relay to download programs from the base station and forward them to a user or the computing UAV when neither the user nor the computing UAV has cached the relevant service programs.
(2): Under the constraints of time delay and the computing UAV cache capacity, an optimization problem is designed to optimize and lower system-wide energy consumption. The problem is a mixed-integer nonlinear programming (MINLP) problem which is divided into three separate sub-problems. The UAV location deployment and service caching sub-problem involves the optimization of discrete and continuous variables, and is solved using a simulated annealing algorithm; the computational resource allocation sub-problem is examined to establish the association between the resource allocation ratio and the system overall energy consumption, and it is addressed by using a greedy algorithm; the optimization variables of the offloading strategy sub-problem are binary, and the offloading strategy is solved using a genetic algorithm as a genetic code for each individual.
(3): The objective problem’s optimal solution is determined through global iteration. Simulation results demonstrate that the proposed algorithm achieves substantial energy savings compared to benchmark algorithms. In comparison with the algorithm introduced in [18], the overall energy consumption is reduced by approximately 40%.

This paper proceeds as follows: The system model is described in Section 2, and the objective problem based on this system model is presented in Section 3. In Section 4, the optimization method for each sub-problem is given and the analysis of the simulation results is conducted. Finally, the full paper is summarized in Section 5.

2. System Model

As shown in Figure 1, the system model is presented, which consists of a relay UAV, a computing UAV,

M

ground users, and a base station. Considering the UAV’s constrained capacity, the computation function and the longer distance communication function are completed by two UAVs, respectively, the computing UAV carries a MEC server to provide computation and service program caching services for users, and the relay UAV is used to assist the communication between users, the computing UAV, and the base station. The set of user devices is denoted by

M = {1, 2, \dots, m}

; each device is assigned a computational task to complete, and execution of the task can take place on the local device or on the computing UAV.

Figure 1. System model.

Let

U_{m} = (S_{m}, L_{m}, T_{m}^{max})

signify the task assigned to device m, where

S_{m}

represents the size of the task data,

L_{m}

represents the amount of CPU cycles needed to process a single bit of task data, and

T_{m}^{max}

denotes the upper limit of permissible latency for processing the task. Let

α_{m} = {0, 1}

be the user device offloading strategy, where

α_{m} = 0

implies that the task is executed directly on the user device, and 1 means that the task is executed on the computing UAV, which can serve multiple user devices at the same time [21]. Each task requires a corresponding program to execute, the set of all executed programs is denoted as

P = {1, 2, . . . p}

, and the size of each program is

W_{p}

. Here,

ϕ (m) = p_{m}

is used to denote that the computational task of the device m requires a program p, and the tasks of multiple devices may require the same program to execute. The base station can cache the service programs required by all user devices because it is equipped with a high-performance server, and user devices and UAVs can only cache some of the programs due to their limited cache capacity.

A three-dimensional Cartesian coordinate system is used to specify the coordinates of the UAV and the user equipment, assuming that the positions of the UAV and the user equipment remain unchanged during the execution of the computational task, that the UAV hovers at a fixed altitude of

H_{ξ}, ξ \in {u_{r}, u_{c}}

, whose horizontal coordinates are represented by

Q_{ξ} = (x_{ξ}, y_{ξ}), ξ \in {u_{r}, u_{c}}

, that the user equipment and the base station coordinates are represented by

Q_{m} = (x_{m}, y_{m})

and

Q_{b} = (x_{b}, y_{b})

, respectively, and that the horizontal altitude of both can be neglected with respect to the UAV.

2.1. Communication Model

This model involves wireless communication between the user device, UAV, and base station. Assuming that the communication spectrum between the three is independent and there is no interference with each other, the bandwidth of the link between the computing UAV and the relay UAV is

B_{u_{r}, u_{c}}

, the bandwidth of the link between the base station and the relay UAV is

B_{b, u_{r}}

, the link bandwidths between user devices and the two UAVs are both

B_{m, u}

.

2.1.1. Communication Between Ground Equipment (User Equipment, Base Stations) and UAVs

Considering the presence of multiple scatterers or obstructions in the real-world environment, which results in signals not being able to propagate according to the free-space model and generate additional path loss, it is difficult to reflect the actual situation by using the free path loss model (FSPL) [22]. In this paper, a probability-based path loss model [23] is used, which integrates the occurrence probability of line-of-sight (LoS) and non-line-of-sight (NLoS) communications and their corresponding path loss characteristics. The LoS and NLoS communication probabilities between the base station and the relay UAV are, respectively,

P_{b, u_{r}}^{LoS} = \frac{1}{1 + a exp [- b ((\frac{180}{π})arcsin (\frac{H_{u_{r}}}{d_{b, u_{r}}}) - α)]} .

(1)

P_{b, u_{r}}^{NLoS} = 1 - P_{b, u_{r}}^{LoS} .

(2)

where

d_{b, u_{r}}

signifies the Euclidean distance between the base station and the relay UAV, and a and b are environment-dependent constants. The path loss between the base station and the relay UAV is

L_{b, u_{r}}^{v} = 20 lg (\frac{4 π f_{c} d_{b, u_{r}}}{c}) + λ_{v}, v \in {LoS, NLoS} .

(3)

where

f_{c}

is the signal’s carrier frequency, c is the speed of light, and

λ_{v}

is the additional path loss of line-of-sight and non-line-of-sight links. Therefore, the average path loss between the base station and the relay UAV is

{\bar{L}}_{b, u_{r}} = P_{b, u_{r}}^{LoS} L_{b, u_{r}}^{LoS} + P_{b, u_{r}}^{NLoS} L_{b, u_{r}}^{NLoS}

(4)

The channel gain between the two is

g_{b, u_{r}} = \frac{1}{{\bar{L}}_{b, u_{r}}}

(5)

Similarly, the channel gains for the computing UAV–user and relay UAV–user connections can be obtained, respectively,

g_{u_{r}, m} = \frac{1}{{\bar{L}}_{u_{r}, m}}

(6)

g_{m, u_{c}} = \frac{1}{{\bar{L}}_{m, u_{c}}}

(7)

where

{\bar{L}}_{u_{r}, m}

is the average path loss between the computing UAV and user devices, while

{\bar{L}}_{m, u_{c}}

denotes the average path loss between the relay UAV and user devices.

Accordingly, the communication rate between the base station and the relay UAV is

r_{b 2 u_{r}} = B_{b, u_{r}} {log}_{2} (1 + \frac{p_{b} g_{b, u_{r}}}{N})

(8)

where

p_{b}

is the transmission power of the base station, and N is the noise power.

The data transmission rate between the relay UAV and the user device is

r_{u_{r} 2 m} = B_{m, u} {log}_{2} (1 + \frac{p_{u} g_{u_{r}, m}}{N})

(9)

where

p_{u_{r}}

is the transmit power of the relay UAV.

If a user device offloads the computational task to the computing UAV, the data transfer rate is

r_{m 2 u_{c}} = B_{m, u} {log}_{2} (1 + \frac{p_{m} g_{m, u_{c}}}{N})

(10)

where

p_{m}

is the device’s transmission power.

If a user device downloads the relevant service program from the computing UAV, the data transfer rate is

r_{u_{c} 2 m} = B_{m, u} {log}_{2} (1 + \frac{p_{u_{c}} g_{m, u_{c}}}{N})

(11)

where

p_{u_{c}}

is the transmit power of the computing UAV.

2.1.2. Communication Between the Relay UAV and the Computing UAV

Considering that the UAV has sufficient hovering height, the communication environment can be assumed to be free space, so the path loss is

L_{u_{r}, u_{c}} = 20 lg (\frac{4 π f_{c} d_{u_{r}, u_{c}}}{c})

(12)

where

d_{u_{r}, u_{c}}

is the distance between the relay UAV and the computing UAV. The channel gain between the two is

g_{u_{r}, u_{c}} = 10^{- \frac{L_{u_{r}, u_{c}}}{10}}

(13)

Therefore, the data transfer rate between the relay UAV and the computing UAV is

r_{u_{r} 2 u_{c}} = B_{u_{r}, u_{c}} {log}_{2} (1 + \frac{p_{u_{r}} g_{u_{r}, u_{c}}}{N})

(14)

2.2. Cache Model

The computing UAV can cache some programs for use by the user devices. The caching decision of the computing UAV for a program p is denoted as

C_{p} \in {0, 1}

;

C_{p} = 1

denotes that it has cached the service program p, and 0 indicates that it has not cached the program. Since the computing UAV has a constrained cache capacity, the cumulative size of the cached programs cannot surpass its maximum storage limit; then,

\sum_{p \in P} C_{p} W_{p} \leq K

(15)

where K is the cache capacity of the computing UAV.

Each user device also has a cache tolerance, so there is

\sum_{p \in P} J_{p_{m}} W_{p} \leq K_{m}

(16)

where

K_{m}

indicates the cache capacity of the device m,

J_{p_{m}}

is a binary constant, and

J_{p_{m}} = 1

indicates that the device m has cached the program p.

2.3. Computational Model

The flowchart of the computation offloading model designed in this paper is shown in Figure 2, where the computation tasks for each device can be computed locally or executed remotely on the computing UAV. When a user chooses local computation, there will be the following three scenarios:

Figure 2. System model flowchart.

1.: If the user device m caches the relevant service program, the delay and power consumption during task execution are as follows:

$t_{1}^{local} = \frac{S_{m} L_{m}}{f_{m}}$

(17)

$E_{1}^{local} = η f_{m}^{2} S_{m} L_{m}$

(18)

where $η$ is an active capacitor switch, and $f_{m}$ denotes the local computing resources of the device.
2.: In the case where the user device does not cache the service program, but the computing UAV stores the relevant service program, then the device will download the service program from the computing UAV, and the task completion delay and energy consumption are, respectively,

$t_{2}^{local} = t_{1}^{local} + \frac{W_{ϕ (m)}}{r_{u_{c} 2 m}}$

(19)

$E_{2}^{local} = E_{1}^{local} + p_{u_{c}} \frac{W_{ϕ (m)}}{r_{u_{c} 2 m}}$

(20)
3.: If neither the user device m nor the computing UAV has cached the relevant service program, the device will download the service program from the base station through the relay UAV, and the task completion delay and energy consumption are, respectively,

$t_{3}^{local} = t_{1}^{local} + \frac{W_{ϕ (m)}}{r_{b 2 u_{r}}} + \frac{W_{ϕ (m)}}{r_{u_{r} 2 m}}$

(21)

$E_{3}^{local} = E_{1}^{local} + p_{b} \frac{W_{ϕ (m)}}{r_{b 2 u_{r}}} + p_{u_{r}} \frac{W_{ϕ (m)}}{r_{u_{r} 2 m}}$

(22)

When a task is offloaded by a user to the computing UAV for computation, the following three scenarios occur:

1.: If the corresponding service program for UAV caching is calculated, the task completion delay and energy consumption are, respectively,

$t_{1}^{edge} = \frac{S_{m}}{r_{m 2 u_{c}}} + \frac{S_{m} L_{m}}{β_{m} f_{u_{c}}}$

(23)

$E_{1}^{edge} = p_{m} \frac{S_{m}}{r_{m 2 u_{c}}} + η {(β_{m} f_{u_{c}})}^{2} S_{m} L_{m}$

(24)

where $β (m) \in {0, 1}$ is the computing resource allocation ratio for the user device by the computing UAV.
2.: If the computing UAV does not cache but the user device m caches the service program, then the computing UAV will download the service program from the device, and then the task completion delay and energy consumption are

$t_{2}^{edge} = \frac{S_{m} + W_{ϕ (m)}}{r_{m 2 u_{c}}} + \frac{S_{m} L_{m}}{β_{m} f_{u_{c}}}$

(25)

$E_{2}^{edge} = p_{m} \frac{S_{m} + W_{ϕ (m)}}{r_{m 2 u_{c}}} + η {(β_{m} f_{u_{c}})}^{2} S_{m} L_{m}$

(26)
3.: If neither the computing UAV nor the user device m has cached the service program, then the computing UAV will download the service program from the base station through the relay UAV, and the task completion delay and energy consumption are, respectively,

$t_{3}^{edge} = \frac{W_{ϕ (m)}}{r_{b 2 u_{c}}} + max (\frac{W_{ϕ (m)}}{r_{u_{r} 2 u_{c}}}, \frac{S_{m}}{r_{m 2 u_{c}}}) + \frac{S_{m} L_{m}}{β_{m} f_{u_{c}}}$

(27)

$E_{3}^{edge} = p_{b} \frac{W_{ϕ (m)}}{r_{b 2 u_{r}}} + p_{u_{r}} \frac{W_{ϕ (m)}}{r_{u_{r} 2 u_{c}}} + p_{m} \frac{S_{m}}{r_{m 2 u_{c}}} + η {(β_{m} f_{u_{c}})}^{2} S_{m} L_{m}$

(28)

Therefore, for the user device, the delay and system energy consumption required to accomplish its task are, respectively,

\begin{matrix} T_{m} & = (1 - J_{ϕ (m)}) (1 - C_{ϕ (m)}) \{α_{m} t_{3}^{edge} + (1 - α_{m}) t_{3}^{local}\} \\ + (1 - J_{ϕ (m)}) C_{ϕ (m)} \{α_{m} t_{1}^{edge} + (1 - α_{m}) t_{2}^{local}\} \\ + J_{ϕ (m)} (1 - C_{ϕ (m)}) \{α_{m} t_{2}^{edge} + (1 - α_{m}) t_{1}^{local}\} \\ + J_{ϕ (m)} C_{ϕ (m)} \{α_{m} t_{1}^{edge} + (1 - α_{m}) t_{1}^{local}\} \end{matrix}

(29)

\begin{matrix} E_{m} & = (1 - J_{ϕ (m)}) (1 - C_{ϕ (m)}) \{α_{m} E_{3}^{edge} + (1 - α_{m}) E_{3}^{local}\} \\ + (1 - J_{ϕ (m)}) C_{ϕ (m)} \{α_{m} E_{1}^{edge} + (1 - α_{m}) E_{2}^{local}\} \\ + J_{ϕ (m)} (1 - C_{ϕ (m)}) \{α_{m}^{edge} E_{2}^{edge} + (1 - α_{m}) E_{1}^{local}\} \\ + J_{ϕ (m)} C_{ϕ (m)} \{α_{m} E_{1}^{edge} + (1 - α_{m}) E_{1}^{local}\} . \end{matrix}

(30)

2.4. Description of the Problem

This research aims to optimize system-wide energy consumption, and jointly optimize the location deployment of the computing UAV

Q_{u_{c}}

, the relay UAV

Q_{u_{r}}

, the service caching of the computing UAV C, the resource allocation for computation

β

, and the offloading decision of user devices

α

in order to minimize the total system energy consumption. Since the mechanical energy consumption is basically constant in this model, this paper takes the mechanical energy consumption as a pre-condition and focuses on the optimization problem of communication energy consumption and computational energy consumption. The problem can be expressed as

\begin{matrix} P 1 : & min_{\begin{matrix} Q_{u_{c}}, Q_{u_{r}}, C, β, α \end{matrix}} \sum_{m \in M} E_{m} \\ s . t . & C 1 : α_{m} \in {0, 1} \\ C 2 : C_{ϕ (m)} \in {0, 1} \\ C 3 : β_{m} \in [0, 1] \\ C 4 : \sum_{\begin{matrix} m \in M \end{matrix}} α_{m} β_{m} \leq 1 \\ C 5 : \sum_{m \in M} C_{ϕ (m)} W_{ϕ (m)} \leq K \\ C 6 : T_{m} \leq T_{m}^{max} \\ C 7 : D_{u_{r}, u_{c}} \leq H_{u_{r}} \frac{1}{cos (\frac{θ}{2})} \\ C 8 : D_{b, u_{r}} \leq R_{b} \\ C 9 : α_{m} L_{m, u_{c}} + (1 - α_{m}) C_{ϕ (m)} (1 - J_{ϕ (m)}) L_{m, u_{c}} \leq H_{u_{c}} tan (\frac{θ}{2}) \\ C 10 : (1 - α_{m}) (1 - C_{ϕ (m)}) (1 - J_{ϕ (m)}) L_{m, u_{r}} \leq H_{u_{r}} tan (\frac{θ}{2}) . \end{matrix}

Constraint

C 1

enforces that the user device’s offloading decision must be a binary variable. Constraint

C 2

indicates that a binary variable is used to represent the computing UAV’s caching decision, and Constraint

C 3

indicates that the UAV’s computational resource allocation ratio is a continuous variable. Constraint

C 4

indicates that the computing UAV will only allocate resources to users who perform offloading, and the sum of the resource allocation ratios will not exceed 1. Constraint

C 5

indicates that the sum of the program sizes cached in the computing UAV must not exceed the maximum storage capacity of the UAV. Constraint

C 6

means that the maximum tolerable delay sets an upper limit on each task’s completion time. Constraint

C 7

limits the location of the two UAVs to ensure that they can communicate, where

D_{u_{r}, u_{c}} = \sqrt{{(x_{u_{r}} - x_{u_{c}})}^{2} + {(y_{u_{r}} - y_{u_{c}})}^{2} + {(H_{u_{r}} - H_{u_{c}})}^{2}}

,

θ

indicates the UAV antenna’s half-power beamwidth [24]. Constraint

C 8

guarantees that the relay UAV maintains communication with the base station, where

D_{b, u_{r}} = \sqrt{{(x_{b} - x_{u_{r}})}^{2} + {(y_{b} - y_{u_{r}})}^{2} + H_{u_{r}}^{2}}

,

R_{b}

denotes the base station’s communication range in 3D space. For user devices that perform computational offloading or need to download a service program from the computing UAV, the constraint

C 9

restricts the said devices to be within the coverage area of the computing UAV, where

L_{m, u c} = \sqrt{{(x_{m} - x_{u_{c}})}^{2} + {(y_{m} - y_{u_{c}})}^{2}}

. If user devices need to download service programs from the base station via the relay UAV, the constraint

C 10

restricts these devices to be within the coverage area of the relay UAV, where

L_{m, u_{r}} = \sqrt{{(x_{m} - x_{u_{r}})}^{2} + {(y_{m} - y_{u_{r}})}^{2}}

.

3. Joint Optimization Algorithm

Due to its non-convexity, the original problem is broken into three sub-problems that are interrelated but can be solved separately, and then the three decompositions are updated iteratively, so as to obtain an approximate or global optimal solution on the whole. The sub-problem of UAV location deployment and service caching decision contains both discrete variables (caching decision) and continuous variables (UAV location), and the whole problem is non-convex, which is difficult to solve by conventional analytic or gradient class methods. The simulated annealing algorithm, as a typical global heuristic algorithm, is able to search in a larger solution space and avoid falling into local optimums to a certain extent, which is suitable for these kinds of complex optimization problems with mixed discrete and continuous variables. In the sub-problem of computational resource allocation, when other variables are fixed, there is a monotonic relationship between the task delay and the computational resources allocated to the user, so the optimal allocation policy can be obtained by directly adopting the greedy idea of “allocating as few resources as possible” under the premise of satisfying the delay constraint. This sub-problem is relatively simple, and the greedy algorithm can efficiently obtain a closed-form or approximate closed-form solution. Finally, the offloading decision sub-problem is a purely discrete binary variable optimization problem; the dimension increases with the number of users and also has non-convex characteristics. Genetic algorithms can efficiently find an approximate global optimal solution in a large discrete search space through the mechanism of “selection-crossover-mutation”, which is very suitable for this kind of discrete combinatorial optimization. These three algorithms are chosen to solve the corresponding sub-problems separately, which can take advantage of their respective advantages on the problem characteristics, but also spread out the complexity of different sub-problems, and ultimately obtain the optimal solution of the overall problem through iterative solving.

3.1. UAV Position and Cache Decision Solving

When the variables

β

,

α

are determined, the problem

P 1

can be rewritten as

\begin{matrix} P 2 : & min_{\begin{matrix} Q_{u_{c}}, Q_{u_{r}}, C \end{matrix}} \sum_{m \in M} E_{m} \\ s . t . & C 2, C 5 - C 10 . \end{matrix}

(31)

The problem remains non-convex, and this paper employs the simulated annealing algorithm to solve it. The simulated annealing algorithm is a heuristic-based optimization approach that draws on the principle of solid matter annealing, in which the solid is able to make its internal particles gradually reach a more stable arrangement by slowly decreasing the temperature during the annealing process. The simulated annealing algorithm makes use of the above idea by gradually reducing the temperature parameter T when searching the solution space and accepting the worse solution with a certain degree of randomness, so that it can overcome the local optimum and eventually discover a superior solution in the solution space [25]. When the simulated annealing algorithm is used to solve the problem

P 2

, the solution space, the fitness function, and the Metropolis criterion are defined as follows:

1.: Solution space: In this paper, the solution space can be expressed as $π_{best n}^{N} = (Q_{u_{c}, n}^{N}, Q_{u_{r}, n}^{N}, C_{n}^{N}) .$ The solution space $π_{b e s t 0}^{1}$ can be generated by random values in the initial stage, and the optimal solution $π_{b e s t n}^{1}$ is obtained after n iterations of the simulated annealing algorithm. In the global iteration stage, the initial solution space $π_{b e s t 0}^{N}$ of the Nth simulated annealing algorithm is the best solution $π_{b e s t n}^{N - 1}$ of the $N - 1$ th algorithm.
2.: Fitness function: In order to improve the search efficiency of the algorithm and the quality of the solution, the fitness function consists of the objective function augmented by a weighted penalty value, i.e.,

$fitness = \sum_{m \in M} E_{m} + λ {log}_{2} (μ_{1} p_{1} + μ_{2} p_{2} + \dots + μ_{n} p_{n})$

(32)

$p_{n} = max (0, X_{n} - Y)$

(33)

where $p_{n}$ represents the penalty value accumulated during the iteration process when the solution does not satisfy the constraints in $P 2$ , $X_{n}$ is the value obtained in the nth iteration using the current solution, Y is the constraint value, and $μ_{n}$ corresponds to the different weights of the penalty value. The logarithmic form of the penalty value is used to control the growth of the penalty value, to avoid the explosion of the value.
3.: Metropolis criterion: This criterion is used to determine whether or not to accept a new solution at each iteration of the algorithm. By introducing a certain amount of randomness, it can help the algorithm prevent convergence to a local optimum, and thus explore the global optimum solution more efficiently. The chance of adopting a new solution is

$P_{accept} = \{\begin{matrix} 1, & Δ E \leq 0, \\ e^{- \frac{Δ E}{T}}, & Δ E > 0 \end{matrix}$

(34)

where $Δ E$ represents the difference between the new adaptation value and the current adaptation value, T denotes the current temperature, and the update equation is

$T = T_{0} ε^{n}, n \in N^{+}$

(35)

where $T_{0}$ is the initial temperature, $ε$ is the cooling rate, and n is the number of iterations in the algorithm.

Firstly, the relevant parameters are initialized, the total energy consumption and the penalty term under the current scheme are calculated, and the penalty term is added to the total energy consumption to obtain the fitness value. Then, the variables are optimized by the simulated annealing algorithm; specifically, a neighborhood search method is used to generate the new solution to obtain the new fitness value, and a decision is made on whether to accept the new solution according to the acceptance criterion. Finally, the temperature is lowered by continuous iteration until the temperature reaches a certain threshold value. The specific solution process is shown in Algorithm 1.

The algorithm’s computational complexity is structured into two primary layers. For the outer annealing loop, it takes

{log}_{\frac{1}{ε}} (\frac{T}{T_{min}})

iterations for the temperature to decrease from the initial T to

T_{m i n}

; the loop also exits when

c o u n t e r

reaches the specified

t i m e s

, so the outer loop time complexity is

O (min {{log}_{\frac{1}{ε}} (\frac{T}{T_{min}}), times})

. For the inner loop at each temperature, the main operations of generating the neighborhood solution and calculating the total energy consumption and adaptation value are performed. The generation of the neighborhood solution is achieved by randomly deciding the number of this neighborhood perturbation, which in the worst case is of the same order of magnitude as the number of service program types, so the time complexity is

O (N_{p})

and

N_{p}

is the number of service program types. Since the overall energy consumption and the adaptation value are calculated for each user, the time complexity of both operations is

O (M)

, i.e., the time complexity of the inner loop at each temperature is

O (M + N_{p})

. In summary, the time complexity of Algorithm 1 is

O (L \cdot min {{log}_{\frac{1}{ε}} (\frac{T}{T_{min}}), times} \cdot (M + N_{p}))

.

Algorithm 1 Cache and UAV position optimization algorithm based on simulated annealing algorithm

Require: initialization constants, offloading decisions

α

, calculating resource allocation ratios

β

, caching decisions C, UAV positions

Q_{u_{r}}

and

Q_{u_{c}}

Ensure: C,

Q_{u_{r}}

and

Q_{u_{c}}

1:: $c o u n t e r = 0$ , $b e s t_{s o l u t i o n} = (C, Q_{u_{r}}, Q_{u_{c}})$ , initial temperature $T_{0}$ , cooling rate $ε$ , minimum temperature threshold $T_{m} i n$ , number of iterations per temperature L, max maximum number of consecutive non-improvements allowed $t i m e s$ , penalty factor $λ$
2:: Calculate of the overall energy consumption of the system based on the current solution $c u r r e n t_{e n e r g y}$ with the adaptation value $c u r r e n t_{f i t n e s s}$
3:: Let $b e s t_{f i t n e s s} \leftarrow c u r r e n t_{f i t n e s s}$
4:: while $T > T_{m i n}$ and $c o u n t e r < t i m e s$ do
5:: $i m p \leftarrow 0$
6:: for $i t e r = 1$ to L do
7:: Calculate the overall energy consumption of the system $n e w_{e n e r g y}$ based on the neighborhood solution and the adaptation value $n e w_{f i t n e s s}$ , respectively.
8:: if $n e w_{f i t n e s s}$ or $rand < e^{\frac{c u r r e n t_{f i t n e s s} - n e w_{f i t n e s s}}{T}}$ then
9:: $c u r r e n t_{s o l u t i o n} \leftarrow n e w_{s o l u t i o n}$
10:: $c u r r e n t_{f i t n e s s} \leftarrow n e w_{f i t n e s s}$
11:: end if
12:: if $c u r r e n t_{f i t n e s s} < b e s t_{f i t n e s s}$ then
13:: $b e s t_{s o l u t i o n} \leftarrow c u r r e n t_{s o l u t i o n}$
14:: $b e s t_{f i t n e s s} \leftarrow c u r r e n t_{f i t n e s s}$
15:: $i m p \leftarrow i m p + 1$
16:: end if
17:: end for
18:: if $i m p = 0$ then
19:: $c o u n t e r \leftarrow c o u n t e r + 1$
20:: else
21:: $c o u n t e r \leftarrow 0$
22:: end if
23:: $T = T^{0} ε^{n}$
24:: end while
25:: return $r e s$

3.2. Optimization of Computing Resource Allocation

The problem

P 1

can be rewritten when the variables Q, C,

α

are determined:

\begin{matrix} P 3 : & min_{β} \sum_{m \in M} E_{m} \\ s . t . C 3, C 6 \end{matrix}

(36)

By observing Equation (29), it suggests that the total system energy usage required to accomplish the computational task is proportional to the computational resources allocated by the computing UAV when other variables are known, so the computational resources can be allocated as little as possible while satisfying the delay constraints. For the user device m, it can be obtained from Equation (28) and Constraint

C 6

that

β_{m} \geq \frac{G_{m}}{(T_{m}^{max} - H_{m}) f_{u_{c}}}

(37)

Among them,

G_{m} = S_{m} L_{m}

(38)

\begin{matrix} H_{m} = & (1 - J_{ϕ (m)}) (1 - C_{ϕ (m)}) \{(1 - α_{m}) t_{3}^{local} + α_{m} (\frac{W_{ϕ (m)}}{r_{b 2 u_{r}}} + max \{\frac{W_{ϕ (m)}}{r_{u_{r} 2 u_{c}}}, \frac{S_{m}}{r_{m 2 u_{c}}}\})\} \\ + (1 - J_{ϕ (m)}) C_{ϕ (m)} \{α_{m} \frac{S_{m}}{r_{m 2 u_{c}}} + (1 - α_{m}) t_{2}^{local}\} \\ + J_{ϕ (m)} C_{ϕ (m)} \{α_{m} \frac{S_{m}}{r_{m 2 u_{c}}} + (1 - α_{m}) t_{1}^{local}\} \\ + J_{ϕ (m)} (1 - C_{ϕ (m)}) \{α_{m} \frac{S_{m} + W_{ϕ (m)}}{r_{m 2 u_{c}}} + (1 - α_{m}) t_{1}^{local}\} \end{matrix}

(39)

The best resource allocation strategy is obtained to attain the lowest value of the objective function by converting Equation (34) into another equation. The specific implementation is shown in Algorithm 2.

Algorithm 2 Greedy-based UAV computational resource allocation algorithm

Require: initialization constants, offloading decisions

α

, caching decision C, UAV positions

Q_{u_{r}}

and

Q_{u_{c}}

Ensure:

β

1:: resource allocations $β \leftarrow []$
2:: for $m = 1$ to $M$ do
3:: $β_{m} = \frac{G_{m}}{(T_{m}^{max} - H_{m}) f_{u_{c}}}$
4:: $β \leftarrow β \cup β_{m}$
5:: end for
6:: return $β$

Since the number of iterations of the above algorithm is only related to the number of user devices, it has a linear time complexity and that complexity is

O (M)

.

3.3. Offloading Decision Optimization

The optimization problem

P 1

can can be rewritten when the variables Q, C, and

β

are determined:

\begin{matrix} P 4 : & min_{α} \sum_{m \in M} E_{m} \\ s . t . C 1, C 4, C 6, C 10 \end{matrix}

(40)

The problem involves the solution of discrete variables, which is suitable to be solved by the genetic algorithm. As a probabilistic approach, the genetic algorithm is used for search and optimization driven by concepts from natural selection and genetic evolution, which simulates the evolutionary process of natural selection in biological evolution, continuously optimizes the quality of the solution through the mechanisms of “selection”, “crossover” and “mutation”, and finally finds a near-optimal solution to the problem. Through “selection”, “crossover”, “mutation”, and other mechanisms, the quality of the solution is continuously optimized, ultimately yielding a near-optimal solution for the problem [26].

1.: Population formation and optimal individual selection: In this paper, it is assumed that $p o p$ populations are needed, and each individual in the population is the offloading strategy, so the size of the initialized population matrix is $p o p \times M$ . To prevent the inefficiency in optimization caused by a completely random population generation, the best solution of the previous optimization is used as a part of the initial solution of the current optimization. The fitness values of individuals are calculated according to Equations (32) and (33), and the tournament selection method is used to randomly select multiple individuals from the population each time and select the individual with the lowest fitness as the parent.
2.: Individual crossover and mutation: Two parents produce offspring by single-point crossover, and then mutation operation is performed on the offspring, i.e., randomly flipping the gene locus according to the probability $p_{m u t a t i o n}$ . Crossover and mutation processes are illustrated in Figure 3 and Figure 4.

Figure 3. Crossover operation.

Figure 4. Mutation operation.
3.: Population update: Replacing the current population with the offspring generated in Algorithm 2 ensures that the new population inherits superior genes and is also diverse.

Firstly, the relevant parameters are initialized, and the optimal individuals obtained previously are mutated to obtain the initial population. Secondly, the fitness value is obtained by calculating the energy consumption and task completion time of each individual in the current population, and the better individuals are retained by using the tournament selection method, which is shown in Algorithm 3. Then, the intersection is randomly selected for gene exchange, and the genes are flipped with a certain probability. Finally, the iteration is stopped when the number of times reaches the upper limit value to obtain the optimal offloading decision. Finally, upon reaching the maximum iteration count, the iteration is stopped to obtain the optimal offloading decision. Algorithm 4 illustrates the overall execution of the proposed algorithm.

Algorithm 3 Tournament selection method

Require: population matrix

p o p u l a t i o n

, fitness value for each individual in the population

f i t n e s s

, number of individuals randomly selected from the population

t_{s i z e}

Ensure:

b e s t_{i d x}

1:: Counting the number of individuals in the current population
2:: $p o p_{s i z e} = s i z e (p o p u l a t i o n, 1)$
3:: Within $1 \sim p o p_{size}$ randomly select $t_{s i z e}$ individual indices $i d x s$ that represent the individuals competing in the current round of the tournament.
4:: Find the individual with the best fitness among the selected individuals $b e s t_{i d x} = m i n (f i t n e s s (i d x s))$
5:: return $b e s t_{i d x}$

Algorithm 4 Genetic algorithm-based optimization algorithm for offloading decision

Require: initialization constants, compute resource allocation ratio

β

, caching decision C, UAV positions

Q_{u_{r}}

and

Q_{u_{c}}

, penalty factor

λ

, last offloading optimization strategy

α_{p r e v}

Ensure:

α

1:: Initialize population size $p o p$ , maximum number of iterations $m a x_{g e n}$ , crossover rate $p_{c r o s s o v e r}$ , mutation rate $p_{m u t a t i o n}$
2:: Generate initial populations:
3:: $p o p u l a t i o n \leftarrow [α_{prev}, rand ([0, 1]), pop - 1, M]$
4:: $b e s t_f i t n e s s \leftarrow \infty$
5:: $b e s t_s o l u t i o n \leftarrow []$
6:: for $g e n = 1$ to $m a x_{g e n}$ do
7:: for $p = 1$ to $p o p$ do
8:: $f i t n e s s (m) \leftarrow$ calculate the fitness value for each individual
9:: end for
10:: $[{current}_{fitness}, index] = min (fitness)$
11:: if $c u r r e n t_{f i t n e s s} < b e s t_{f i t n e s s}$ then
12:: $b e s t_{f i t n e s s} \leftarrow c u r r e n t_{f i t n e s s}$
13:: $b e s t_{s o l u t i o n} \leftarrow p o p u l a t i o n (i n d e x, 1 : M)$
14:: end if
15:: for $p = 1$ to $p o p$ do
16:: Two parents $P_{1}$ , $P_{2}$ were selected by Algorithm 3, and $c h i l d = P_{1}$
17:: if $r a n d < p_{c r o s s o v e r}$ then
18:: Randomly select the intersection point and perform a crossover operation on $P_{1}$ and $P_{2}$
19:: end if
20:: for $m = 1$ to $M$ do
21:: if $r a n d < p_{m u t a t i o n}$ then
22:: $c h i l d (m) = 1 - c h i l d (m)$
23:: end if
24:: end for
25:: $n e w_{p o p u l a t i o n} (p, 1 : M) = c h i l d$
26:: end for
27:: $p o p u l a t i o n = n e w_{p o p u l a t i o n}$
28:: end for
29:: $α = b e s t_{s o l u t i o n}$
30:: return $α$

Since the fitness of each individual in the population must be evaluated in every generation, the time complexity of this operation is

O (M \cdot pop)

; for each individual, crossover and mutation are constant operations, so the time complexity of both population crossover and mutation is

O (p o p)

. In summary, the time complexity of the algorithm is

O ({max}_{gen} \cdot M \cdot pop)

. The optimal solution for the aforementioned three algorithms is achieved via cyclic iteration. For any user device unable to obtain a feasible solution, the required energy consumption for task completion is assigned as

E_{p e n a l t y}

.

The global iteration is shown in Algorithm 5, and the overall time complexity of the algorithm is

O (n \cdot m a x {O_{1}, O_{2}, O_{3}})

, where

O_{1} = O (L \cdot min {{log}_{\frac{1}{ε}} (\frac{T}{T_{min}}), times} \cdot (M + N_{p}))

,

O_{2} = O (M)

, and

O_{3} = O ({max}_{gen} \cdot M \cdot pop)

.

Algorithm 5 Overall iterative algorithm

1:: Randomly initialize { $Q_{u_{r}}^{0}$ , $Q_{u_{c}}^{0}$ , $C^{0}$ , $β^{0}$ , $α^{0}$ }
2:: Initialize iteration number $n = 0$ and convergence threshold $ϵ$
3:: repeat
4:: Given ( $β^{n}$ , $α^{n}$ ), obtain the optimal ( $Q_{u_{r}}^{(n + 1)}$ , $Q_{u_{c}}^{(n + 1)}$ , $C^{(n + 1)}$ ) by Algorithm 1
5:: Given ( $Q_{u_{r}}^{(n + 1)}$ , $Q_{u_{c}}^{(n + 1)}$ , $C^{(n + 1)}$ , $α^{n}$ ), obtain the optimal $β^{(n + 1)}$ by Algorithm 2
6:: Given ( $Q_{u_{r}}^{(n + 1)}$ , $Q_{u_{c}}^{(n + 1)}$ , $C^{(n + 1)}$ , $β^{(n + 1)}$ ), obtain the optimal $α^{(n + 1)}$ by Algorithm 4
7:: $n = n + 1$
8:: until The objective value decreases by less than $ϵ$ or the maximum number of iterations is reached.

4. Simulation Results and Analysis

In this work, the performance of the proposed algorithm is assessed through simulations in MATLAB 2023a, and the corresponding simulation scenario is depicted in Figure 1, where user devices are presumed to be dispersed within the area of

100 \times 100

, the UAV computational resource is 7 GHz, and the environment-related parameters such as

a = 9.61

,

b = 0.16

, and the detailed simulation configurations are presented in Table 1 [20].

Table 1. Simulation parameters.

To assess and contrast the effectiveness of this algorithm, this paper uses three comparison algorithms:

1.: Algorithm 1 for computing the UAV cache decision without optimization, i.e., the random cache at initialization is used as the final decision, and other variables are optimized according to the algorithm presented in this paper.
2.: Algorithm 2 for computing UAV and relay UAV positions without optimization, i.e., fixing the UAV position, and other variables are optimized according to the algorithm presented in this paper.
3.: Algorithm 3 is the algorithm of [18], which adopts a single-UAV architecture, with limited UAV coverage; the UAV has both computing and relay functions, and it only considers downloading the service program from the base station when the UAV does not cache the related service program.

The convergence behavior of the proposed algorithm is demonstrated in Figure 5. As shown in the figure, the total energy consumption gradually decreases with an increasing number of iterations. Specifically, the proposed algorithm converges to the final solution within a maximum of 5 iterations across various conditions, demonstrating its fast convergence rate. Furthermore, when the UAV’s computing capacity is 7 GHz, the total energy consumption decreases by 36.1% and 33.6% for 25 and 30 users, respectively, which highlights the effectiveness of our proposed algorithm.

Figure 5. Convergence state of this paper’s algorithm.

Figure 6 illustrates the influence of UAV cache capacity on overall energy consumption. It can be observed that with the cache capacity increasing, the energy consumption decreases. This is because the increase in the UAV cache capacity indicates that the UAV can cache more services and saves the transmission energy for downloading the services from the base station, so the curve is a decreasing trend. The figure also demonstrates that the algorithm proposed in this paper outperforms the other three algorithms. The caching policy of Algorithm 1 is not determined based on the user device task type and offloading decision, resulting in the type of services cached on the UAV possibly not being required by the device, and the device and the computing UAV will download the service program from the base station through the relay UAV, which increases the transmission energy consumption. Algorithm 2 fixes the position of the two UAVs, and some devices may be outside the UAV coverage range, which has an impact on devices that are outside the range and need to download service programs for local computation, i.e., leading to an infeasible solution. In this case, the user device whose energy consumption is high is assigned a larger value as a penalty. In addition, the overall energy consumption of the algorithm is larger because the UAV is not on the optimal location, the task upload or service download delay is larger, and thus the transmission energy consumption will be larger as well. The UAV in Algorithm 3 has the functions of both computing and relaying, and subject to the coverage, it needs to balance the communication distance with the base station and the user, there will be more devices outside the UAV’s coverage, and the number of devices that appear to have an infeasible solution is increased compared to the algorithm in this paper.

Figure 6. Relationship between cache capacity and total energy consumption.

Figure 7 depicts the overall energy consumption trends of various algorithms as the maximum tolerable delay grows. For small tolerable delays, the task processing has less flexibility, and the device tends to complete the computation on the local device or UAV with larger computational resources to finish the acceptable delay limit. Thus, the computational energy consumption is larger at this time. In the process of gradually increasing the maximum tolerance delay, the task processing has more selectivity. When the overall delay of the task in the local computation or offloading to the UAV computation is within the tolerance range, the device opts for the option that consumes less energy to complete the task, resulting in a decrease in the device’s total energy consumption. The algorithm in this paper dynamically adjusts the offloading decision to prioritize local computing with lower energy consumption or low-power resource allocation (e.g., the lower bound of resource allocation in Equation (37)) when the delay margin increases. In contrast, Algorithms 1–3 are unable to flexibly utilize the delay margin due to rigid caching strategies or fixed UAV position.

Figure 7. Relationship between the maximum tolerable delay versus total energy consumption.

Figure 8 illustrates the variation in system overall energy consumption as the number of user devices changes. It is evident that with the number of user devices growing, a corresponding increase in total energy consumption is observed for all four algorithms. Since the algorithm in this paper optimizes the computing UAV cache decision, the cache hit rate is higher than that of Algorithm 1, which saves some of the transmission energy of the service programs. An increase in the number of computing tasks means an increase in the likelihood of needing a greater number of service types. But the computing UAV cache capacity is limited, so when the service programs corresponding to the computing tasks are not cached, they also need to be downloaded from the base station via the relay UAV, which increases the transmission energy. Therefore, the overall energy consumption increases. In Algorithms 2 and 3, the fixed UAV location or single-function UAV architecture is prone to the problem that some users are forced to use energy-intensive local computation due to being out of coverage when the number of users increases, which leads to the result that the total energy consumption is greater than the energy consumption of the algorithm in this paper.

Figure 8. Relationship between the number of user devices and total energy consumption.

Figure 9 presents the relationship between average task size and system overall energy consumption, demonstrating that larger task sizes result in higher energy consumption. Given a fixed CPU workload per bit, an increase in task size directly results in a higher total CPU cycle demand for task execution, consequently leading to an increase in computational latency. Under the condition of constant maximum tolerable latency, the device tends to complete the computation on a local device or UAV with larger computational resources, which leads to an increase in computational energy consumption. In addition, an increase in task size also leads to an increased likelihood that the task completion delay will exceed the tolerated delay, ultimately affecting the overall energy consumption. Additionally, a low cache hit rate for Algorithm 1 requires frequent downloads of service programs from the base station, which adds additional transmission energy. For Algorithms 2 and 3, some users may be at the edge or in low-coverage areas, thus adding additional delay and transmission energy consumption.

Figure 9. Relationship between the average task size and total energy consumption.

Figure 10 shows that the overall energy consumption of the system decreases as the UAV computational resources increase. For the algorithms in this paper and Algorithm 1, when the computational resources are small, the number of users who can receive the UAV service is small, and more devices take local computation, which leads to larger total energy consumption. As the computational resources increase, the UAV can provide sufficient computational services to the users in the coverage area, resulting in smaller computational energy consumption. For Algorithms 2 and 3, when the computational resources exceed 7 GHz, the computational resources are more sufficient, and when the resources continue to increase, no more user devices can be served. It is difficult to effectively extend the service coverage, and when the resources are increased, the users beyond the coverage cannot utilize the additional resources, resulting in a limited reduction in the overall energy consumption.

Figure 10. Relationship between computing resources and total energy consumption.

Figure 11 gives the mean value as well as the standard deviation of the energy consumption obtained from multiple tests (with different user locations each time) for a given computational resource. From the figure, it can be seen that the average energy consumption shows an overall decreasing trend as the UAV computational resources increase, while the length of the error bars showing the magnitude of fluctuations in the effect of different user locations on energy consumption under each computational resource.

Figure 11. Energy error bar plot.

5. Conclusions

This paper investigates a UAV-assisted computing offloading model based on service caching with the objective of lowering system-wide energy consumption. The simulated annealing algorithm is applied to optimize UAV location deployment and computing UAV caching decisions, while a greedy-based approach is used for UAV computational resource allocation. Additionally, a genetic algorithm is adopted for selecting the optimal offloading strategy for user devices. Finally, the obtained simulation results reinforce the proposed algorithm’s effectiveness and outstanding performance. When the number of users is small, the gap between each algorithm is relatively small; when the number of users increases to 30, the gap becomes significant. When the number of users reaches 30, the energy consumption of the algorithm in this paper is about 12 J, while the energy consumption of the worst algorithm is as high as 24 J, indicating that the algorithm has better scalability and energy-saving effects for large-scale user scenarios. When the delay is short (0.7 s), the energy consumption of each algorithm is relatively high, but with an increase in delay, the energy consumption of the algorithm in this paper decreases the most, only about 5 J at 1.1 s, while the energy consumption of the comparison algorithm remains above 7.5 J or even higher, indicating that the algorithm can better use the delay redundancy for energy consumption optimization. When the task size is small, the difference between each algorithm is relatively limited; when the task increases to

2.15 \times 10^{5}

bits, the gap increases. The energy consumption of the algorithm in this paper is kept at about 10 J, while the energy consumption of the worst algorithm is as high as 20 J, showing that the algorithm has better energy-saving advantages under large-task-load scenarios. Regarding the increase in cache capacity, the algorithm in this paper has the most obvious decrease, from about 8.5 J to 6.0 J, which saves about 29% of energy consumption, while the algorithm with the highest energy consumption remains above 13 J under the same caching condition, which indicates that the present algorithm more fully utilizes the cache resources to reduce the transmission overhead. In this paper, the energy consumption of the algorithm is about 9.0 J when the UAV’s computational resources are

1 \times 10^{9}

Hz, and decreases to 6.0 J when the UAV’s computational resources are

9 \times 10^{9}

Hz, which is a reduction of 33%. The energy consumption of the comparative algorithms is still higher than this paper’s algorithm by 20% to 50%, which indicates that this paper’s algorithm can better utilize the additional computational resources and further reduce the energy consumption. Future work will concentrate on improving latency performance, UAV trajectory optimization, and other related content in UAV-assisted computing offloading scenarios.

Author Contributions

Conceptualization, Z.L. and Q.Z.; methodology, Z.L. and Q.Z.; software, Z.L.; formal analysis, Z.L.; writing—original draft preparation, Z.L.; writing—review and editing, Q.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Jiangsu Provincial Key Research and Development Program (BE2022068-2) and the National Natural Science Foundation of China (92367302).

Data Availability Statement

Access to the experimental data presented in this article can be obtained by contacting the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Fu, Y.; Shan, Y.; Zhu, Q.; Hung, K.; Wu, Y.; Quek, T.Q. A distributed microservice-aware paradigm for 6G: Challenges, principles, and research opportunities. IEEE Netw. Mag. 2023, 38, 163–170. [Google Scholar] [CrossRef]
Gupta, D.; Moudgil, A.; Wadhwa, S.; Solanki, V. Efficient data caching and computation offloading strategy for edge network. In Proceedings of the 2022 International Conference on Emerging Smart Computing and Informatics (ESCI), Pune, India, 9–11 March 2022; pp. 1–5. [Google Scholar]
Maryam, S.; Morteza, R.; Zhang, H.; AmirMehdi, M.; Mostafa, H.K. Fog computing approaches in IoT-enabled smart cities. J. Netw. Comput. Appl. 2023, 211, 103557. [Google Scholar]
Chen, G.; Zhang, R.; Zhang, H.; Miao, C.; Ma, Y.; Wu, W. Energy-Efficient Beamforming for Downlink Multi-User Systems with Dynamic Metasurface Antennas. IEEE Commun. Lett. 2025, 29, 1517–1530. [Google Scholar] [CrossRef]
Wu, J.; Cao, Z.; Zhang, Y.; Zhang, X. Edge-cloud collaborative computation offloading model based on improved partical swarm optimization in MEC. In Proceedings of the 2019 IEEE 25th International Conference on Parallel and Distributed Systems (ICPADS), Tianjin, China, 4–6 December 2019; pp. 959–962. [Google Scholar]
Cheng, K.; Teng, Y.; Sun, W.; Liu, A.; Wang, X. Energy-efficient joint offloading and wireless resource allocation strategy in multi-MEC server systems. In Proceedings of the 2018 IEEE International Conference on Communications (ICC), Kansas City, MO, USA, 20–24 May 2018; pp. 1–6. [Google Scholar]
Zhou, H.; Jiang, K.; Liu, X.; Li, X.; Leung, V.C.M. Deep Reinforcement Learning for Energy-Efficient Computation Offloading in Mobile-Edge Computing. IEEE Internet Things J. 2022, 9, 1517–1530. [Google Scholar] [CrossRef]
Yousefpour, A.; Ishigaki, G.; Gour, R.; Jue, J.P. On Reducing IoT Service Delay via Fog Offloading. IEEE Internet Things J. 2018, 5, 998–1010. [Google Scholar] [CrossRef]
Pan, M.; Li, Z. Multi-user Computation Offloading Algorithm for Mobile Edge Computing. In Proceedings of the 2021 2nd International Conference on Electronics, Communications and Information Technology (CECIT), Sanya, China, 27–29 December 2021; pp. 771–776. [Google Scholar]
Zhang, K.; Gui, X.; Ren, D. Joint Optimization on Computation Offloading and Resource Allocation in Mobile Edge Computing. In Proceedings of the 2019 IEEE Wireless Communications and Networking Conference (WCNC), Marrakesh, Morocco, 15–18 April 2019; pp. 1–6. [Google Scholar]
Chen, Z.; Zhou, Z.; Chen, C. Code Caching-Assisted Computation Offloading and Resource Allocation for Multi-User Mobile Edge Computing. IEEE Trans. Netw. Serv. Manag. 2021, 18, 4517–4530. [Google Scholar] [CrossRef]
Zhang, Z.; Zhou, H.; Li, D. Joint optimization of multi-user computing offloading and service caching in mobile edge computing. In Proceedings of the 2021 IEEE/ACM 29th International Symposium on Quality of Service (IWQOS), Tokyo, Japan, 25–28 June 2021; pp. 1–2. [Google Scholar]
Song, Q.; Wang, J.; Liu, J. A cache-assisted computing offloading strategy based on deep Q network. In Proceedings of the 2023 7th International Conference on Management Engineering, Software Engineering and Service Sciences (ICMSS), Wuhan, China, 6–8 January 2023; pp. 80–85. [Google Scholar]
Li, J.; Zhang, H.; Ji, H.; Li, X. Joint computation offloading and service caching for MEC in multi-access networks. In Proceedings of the 2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), Istanbul, Turkey, 8–11 September 2019; pp. 1–6. [Google Scholar]
Yang, S.; Liu, J.; Zhang, F.; Li, F.; Chen, X.; Fu, X. Caching-enabled computation offloading in multi-region MEC network via deep reinforcement learning. IEEE Internet Things J. 2022, 9, 21086–21098. [Google Scholar] [CrossRef]
Zhang, S.; Zhang, H.; He, Q.; Bian, K.; Song, L. Service caching based aerial cooperative computing and resource allocation in multi-UAV enabled MEC systems. IEEE Commun. Lett. 2017, 22, 161–164. [Google Scholar] [CrossRef]
Zhang, R.; Wu, W.; Chen, X.; Gao, Z.; Cai, Y. Terahertz Integrated Sensing and Communication-Empowered UAVs in 6G: A Transceiver Design Perspective. IEEE Veh. Technol. Mag. 2025, 2–11. Available online: https://ieeexplore.ieee.org/document/10891254 (accessed on 28 March 2025.).
Zheng, G.; Xu, C.; Wen, M.; Zhao, X. Joint trajectory and power optimization for UAV relay networks. IEEE Trans. Veh. Technol. 2022, 71, 10934–10947. [Google Scholar] [CrossRef]
Tian, X.; Min, X.; Zhou, L. UAV-enabled Service Caching Edge Computing Optimal Computation Floading and Resource Allocation Strategy. J. Chin. Comput. Syst. 2023, 44, 1557–1562. [Google Scholar]
Bao, L.; Luo, J.; Bao, H.; Hao, Y.; Zhao, M. Cooperative computation and cache scheduling for UAV-enabled MEC networks. IEEE Trans. Green Commun. Netw. 2021, 6, 965–978. [Google Scholar] [CrossRef]
Hao, H.; Xu, C.; Zhang, W.; Yang, S.; Muntean, G.-M. Joint task offloading, resource allocation, and trajectory design for multi-uav cooperative edge computing with task priority. IEEE Trans. Mobile Comput. 2024, 23, 8649–8663. [Google Scholar] [CrossRef]
Zhang, X.; Zhang, J.; Xiong, J.; Zhou, L.; Wei, J. Energy-efficient multi-UAV-enabled multiaccess edge computing incorporating NOMA. IEEE Internet Things J. 2020, 7, 5613–5627. [Google Scholar] [CrossRef]
Alzenad, M.; El-Keyi, A.; Yanikomeroglu, H. 3-D placement of an unmanned aerial vehicle base station for maximum coverage of users with different QoS requirements. IEEE Wirel. Commun. Lett. 2017, 7, 38–41. [Google Scholar] [CrossRef]
Shakoor, S.; Kaleem, Z.; Do, D.T.; Dobre, O.A.; Jamalipour, A. Joint optimization of UAV 3-D placement and path-loss factor for energy-efficient maximal coverage. IEEE Internet Things J. 2020, 8, 9776–9786. [Google Scholar] [CrossRef]
Yao, X.; Chen, G. Simulated Annealing Algorithm and Its Applications. J. Comput. Res. Dev. 1990, 7, 1–6. [Google Scholar]
Li, Y.; Yuan, H.; Yu, J.; Zhang, G.; Liu, K. A Review of Genetic Algorithms in Optimization Problems. J. Shandong Ind. Technol. 2019, 12, 242–243. [Google Scholar]

Figure 1. System model.

Figure 2. System model flowchart.

Figure 3. Crossover operation.

Figure 4. Mutation operation.

Figure 5. Convergence state of this paper’s algorithm.

Figure 6. Relationship between cache capacity and total energy consumption.

Figure 7. Relationship between the maximum tolerable delay versus total energy consumption.

Figure 8. Relationship between the number of user devices and total energy consumption.

Figure 9. Relationship between the average task size and total energy consumption.

Figure 10. Relationship between computing resources and total energy consumption.

Figure 11. Energy error bar plot.

Table 1. Simulation parameters.

Parameter	Value
The Computing UAV’s Height $H_{u_{c}}$	60 m
The Relay UAV’s Height $H_{u_{r}}$	100 m
User Device Computing Resource	$0.5$ GHz
Types of Program	8
Noise Power	$- 100$ dBm
Tolerable Delay	$0.9$ s
The Capacitor Switch $η$	$10^{- 27}$
Antenna’s Half-power Beamwidth $θ$	$\frac{π}{2}$
Penalty $E_{p e n a l t y}$	1 J
Transmitter Power $p_{m}$ , $p_{u_{r}}$ , $p_{u_{c}}$ , $p_{b}$	1 W, 3 W, 3 W, 10 W
Bandwidth $B_{m, u}$ , $B_{u_{r}, u_{c}}$ , $B_{b, u_{r}}$	2 MHz, 3 MHz, 4 MHz

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Service Cache-Based Offloading and Resource Optimization Algorithm for UAV-Assisted Computing

Abstract

1. Introduction

2. System Model

2.1. Communication Model

2.1.1. Communication Between Ground Equipment (User Equipment, Base Stations) and UAVs

2.1.2. Communication Between the Relay UAV and the Computing UAV

2.2. Cache Model

2.3. Computational Model

2.4. Description of the Problem

3. Joint Optimization Algorithm

3.1. UAV Position and Cache Decision Solving

3.2. Optimization of Computing Resource Allocation

3.3. Offloading Decision Optimization

4. Simulation Results and Analysis

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics