1. Introduction
With the advent of sixth-generation (6G) communication technology, the Internet of Things (IoT) is evolving toward a more intelligent and efficient direction, with significant advancements in seamless connectivity, ultra-low latency, and ultra-high reliability [
1]. These developments have accelerated the progress of unmanned aerial vehicle (UAV) technology, particularly with the support of IoT and wireless sensor networks (WSN). In these domains, UAVs have gradually become one of the core technologies supporting emerging application scenarios [
2]. They are not only capable of serving dynamic data relays and mobile sensing platforms but also play a crucial role in distributed networks. UAVs provide essential data collection and transmission services in areas such as environmental monitoring and smart cities [
3].
However, with the expanding scope of UAV applications, particularly in complex environments, the efficient planning of flight paths has become a pressing issue [
4]. In mountainous or other topographically challenging regions, UAVs need to avoid obstacles and effectively exchange data with IoT nodes. At this point, optimizing UAV path planning becomes crucial, as the choice of path directly impacts data collection efficiency and the stability of communication links [
5]. For example, in mountainous areas, frequent obstacle avoidance may lead to overly meandering paths, potentially interrupting the communication connection with ground sensors, resulting in data loss or delays. Such situations undoubtedly affect the overall performance of IoT systems, particularly in real-time data transmission and task scheduling scenarios. Recent studies have proposed innovative solutions to improve UAV path planning in such environments. For instance, research on AoI-minimal clustering explores how to minimize the age of information in UAV-assisted networks, thereby enhancing the efficiency of data transmission in IoT systems [
6]. Additionally, transmission and trajectory co-design for UAV-assisted wireless powered communication networks (WPCNs) has been investigated to optimize both UAV transmission and movement, leading to more effective data relay and energy utilization [
7]. These works underscore the growing demand for integrated path planning algorithms that address both trajectory optimization and communication performance issues in UAV systems.
In this context, the design of efficient UAV path planning algorithms is crucial for IoT applications. Through efficient path planning, unnecessary detours can be minimized, energy consumption during flight can be reduced, and data collection efficiency can be improved, thereby enhancing the overall performance of the IoT system. Especially in complex terrains, good path planning not only ensures efficient obstacle avoidance by UAVs but also provides stable communication between ground sensors and UAVs. Therefore, an efficient path planning algorithm not only helps improve the operational efficiency of UAVs but also has a positive impact on the communication stability of the entire IoT system. Investigations have disclosed that the task of devising optimal path planning for UAVs is subsumed within the class of NP problems that are exceedingly challenging, a complexity that is observed to escalate exponentially in direct correlation with the augmentation of the problem’s dimensions [
8,
9,
10].
Recently, a significant surge in research has led to the development of numerous path planning algorithms specifically designed for UAV operations. Among these, metaheuristic algorithms based on swarm intelligence have gained prominence, playing a vital role in advancing UAV path planning methodologies. For instance, Cunjie Li et al. [
11] proposed an improved particle swarm optimization algorithm combined with a genetic algorithm, which strengthens the global search capability and enhances local optimization, enabling the UAV to find the shortest path and avoid all threats in both simple and complex terrain environments. Cao et al. [
12] proposed a UAV route planning method based on genetic algorithms. Experimental results demonstrate that the improved genetic algorithm, based on artificial intelligence, is both efficient and feasible for autonomous UAV route planning. Van Truong Hoang et al. [
13] proposed an optimization algorithm called multi-subject-TLBO, which enhances the original method through mutation, elite selection, and multi-agent training to improve solution quality and accelerate convergence. This approach effectively generates optimal, collision-free, and feasible flight paths for UAVs in complex operational environments. Although existing UAV path planning algorithms can meet basic flight requirements to some extent, there is an urgent need to develop more advanced planning algorithms as task complexity increases and environmental uncertainty intensifies.
The teaching–learning-based optimization (TLBO) algorithm, as initially proffered by Rao et al. in 2011 [
14], has been widely recognized as an efficient population-based method that mimics knowledge transfer mechanisms in classroom environments. While its parameter-free structure and rapid convergence characteristics have enabled successful applications in power dispatch and production systems, three fundamental limitations persist in complex optimization scenarios: premature convergence due to population diversity loss, inadequate environmental adaptation mechanisms, and insufficient collaborative search strategies [
15]. Recent advancements in TLBO variants have systematically addressed these challenges through two primary improvement pathways. First, TLBO algorithm operators are modified to enhance search flexibility, as demonstrated by Hong et al. [
16] through reliability-based teaching factors that enable phased convergence control. Wang et al. [
17] proposed an improved DSTLBO algorithm that uses dynamic opposition populations, a sorting mechanism for learning styles, and random information exchange to enhance exploration and avoid local optima, significantly improving the efficiency of photovoltaic parameter extraction. Second, hybrid strategies have been developed to combine TLBO with complementary optimization philosophies, for example, Santosh Kumari Meena et al. [
18] presented a hybrid TLBO-PSO method to minimize PMU installation while ensuring full system observability; Kundu and Garg [
19] presented the integration of Levy-flight mutation operators from SMA algorithms. Thanh Long Duong et al. [
20] proposed a hybrid approach DE-TLBO to simultaneously optimize reactive power planning and determine the optimal size of the thyristor-controlled series compensator (TCSC), thereby effectively minimizing power losses in the ACTEP problem. In summary, integrating various improvement strategies into the TLBO algorithm has significantly enhanced its optimization capabilities. As a result, both the original and improved versions of the algorithm have demonstrated widespread applicability across diverse engineering and operational challenges, encompassing though not limited to improvements in economic load dispatch [
21], smart city communication and energy management [
22], optimal distribution of reactive power within electric grids [
23], production systems [
24], logistics and supply chain management issues [
25], nonlinear equation problems [
26], shop scheduling problems [
27], high-dimensional expensive problems [
28], and optimal control systems [
29].
However, despite the advancements made in these enhanced algorithms, issues such as slow convergence and insufficient solution accuracy persist, especially in the rapidly evolving context of the IoT and increasingly complex environments. These limitations hinder the practical application and broader adoption of such algorithms. Furthermore, the application of TLBO to UAV path planning remains relatively underexplored, contributing to a significant technological bottleneck in this field. To address this gap, this study introduces an innovative solution—an innovative interactive cylindrical vector teaching–learning-based optimization (ICVTLBO) algorithm. Designed to tackle the challenges of 3D UAV path planning in complex environments, ICVTLBO aims to improve both the efficiency and accuracy of path planning. The main contributions of this paper are as follows:
The ICVTLBO algorithm is proposed in this study, which achieves high-precision 3D path planning in complex terrains while maintaining computational efficiency. Its low complexity architecture and adaptive search mechanism lay the foundation for future integration with communication-aware optimization, which is essential for UAV functioning as persistent mobile base stations in IoT networks.
Considering the dynamic nature of drone path planning, waypoints are represented using polar coordinates. The interaction strategies between the teacher and learner phases enhance group diversity and global search capability, accelerating convergence and improving solution accuracy, ensuring the ICVTLBO algorithm effectively plans smooth optimal paths in complex environments. When drones are deployed as IoT data relays, this method is crucial for maintaining network coverage in remote monitoring scenarios.
The effectiveness and robustness of the ICVTLBO algorithm are validated through experimental comparisons with several leading algorithms on the 20-dimensional problems from the CEC2022 benchmark function suite. Further validation across nine different terrain scenarios demonstrates its adaptability and superiority, showcasing its ability to plan high-quality paths for UAVs in complex 3D environments. The algorithm’s high performance ensures reliable UAV operation in diverse terrains, particularly in mountainous wireless sensor network clusters, without the need for environment-specific adjustments.
The subsequent parts of this paper are composed as follows:
Section 2 focuses on the problem of UAV path planning.
Section 3 introduces the teaching–learning-based optimization algorithm and proposes the improved ICVTLBO algorithm.
Section 4 validates the ICVTLBO algorithm through numerical verification experiments.
Section 5 presents an experimental analysis of UAV path planning. Finally,
Section 6 provides a summary of this paper.
3. Teaching–Learning-Based Optimization (TLBO) Algorithm
In this section, the fundamental TLBO algorithm is initially presented. Central to this algorithm are two primary roles: teachers and students. Teachers are tasked with instruction, employing a variety of methods to impart knowledge and skills, as well as guiding the growth and development of students. As educators, they are required not only to have a solid foundation in their subject matter but also to possess exceptional teaching techniques and communication abilities for effective instruction. Students, on the other hand, are the recipients of education; they acquire new knowledge and enhance their overall capabilities by learning from teachers, outstanding peers, and through interactive learning experiences.
Within the field of UAV path planning, maneuverability is crucial for flight efficiency and safety. The traditional Cartesian coordinate system fails to adequately describe the dynamic characteristics of drones and obstacle avoidance. To address this, a cylindrical coordinate system approach is proposed in this paper, utilizing its features to express drone motion and simplify spatial constraints. Building upon this, an enhanced version of the TLBO algorithm is presented, integrating a variety of targeted interactive strategies. These strategies are designed to amplify the speed and improve the quality of solutions, ultimately facilitating the attainment of superior flight paths for UAVs.
3.1. Teacher Phase
In the teacher phase, an emulation of the instructional processes is conducted to identify the solution with the most optimal objective function value within the cohort, a process that can be modeled using Equation (10):
where
and
are utilized to denote the positioning of an entity after and before learning, respectively. The term rand refers to a random number encompassed by the interval [0, 1].
represents the location of the teacher, which is ascribed to the most outstanding student entity within the classroom cohort.
signifies the mean level of the search agents present in the entirety of the population.
identified as a teaching factor, plays a pivotal role in modulating the value of
. The potential value of
can be either 1 or 2, determined stochastically based on the probability distribution stipulated
.
3.2. Learner Phase
In addition to gleaning knowledge from the teacher, learners can also enhance their understanding by learning from each other. In the context of collaborative learning, a learner can internalize knowledge from a randomly chosen peer who occupies a superior rank. The strategic approach of the learner may be formulated as follows:
where
denotes the position of the learner after learning,
is the position of a student randomly selected from the class, and
is a random number between
.
3.3. Improved TLBO Algorithm for Path Planning
In this section, a cylindrical coordinate system model is proposed first, and then, we endeavor to enhance the search efficiency of the TLBO algorithm when addressing optimization problems. To achieve this, we have meticulously refined and augmented the algorithm’s core components, namely, the teaching and learning phases—while preserving their inherent strengths such as simplicity and ease of implementation. Specifically, we have devised targeted interactive strategies, tailored to cater to the distinct requirements of the teaching and learning processes. These innovations have significantly bolstered the population’s exploration and exploitation capabilities, which are critical for avoiding local optima and pinpointing the global optimum. The dual-phase, interactive enhancements ensure that the algorithm navigates the search space more effectively, thereby increasing the likelihood of converging on solutions of superior quality.
3.3.1. UAV Path Planning Method Based on Cylindrical Vector
In the study of UAV path planning, maneuverability is of paramount importance as it affects the efficiency and safety of UAV flight in complex environments. The traditional Cartesian coordinate system has been identified to have limitations when describing the dynamic characteristics of UAVs and their obstacle avoidance capabilities. Consequently, a novel approach based on a cylindrical coordinate system is presented in this paper, which encodes each feasible path from the starting point to the endpoint as a series of vectors. Within this framework, an individual vector meticulously defines the UAV’s flight maneuver as it transitions from one waypoint to the subsequent one. In the UAV path planning method premised on cylindrical vectors, each vector in the set comprises three components:
, and
. Here,
signifies the magnitude,
denotes the flight elevation angle of the UAV, and
corresponds to the UAV’s azimuthal orientation. Consequently, any viable path
with
navigational nodes can be articulated by a compilation of 3D cylindrical vectors, as illustrated below:
The fundamental principle underlying the cylindrical vector-based path planning approach involves leveraging the interplay between the magnitude, elevation, and azimuth angles experienced during UAV flight and the corresponding velocity, turning angle, and climb angle of the UAV. This method facilitates the enhancement of both safety and efficiency in the planned path. By enabling metaheuristic algorithms to operate within the configuration space rather than Cartesian space, the cylindrical vector-based technique effectively curtails the search domain, thereby augmenting the algorithm’s capacity to discern superior flight paths for the UAV.
The projection
of the waypoint
on the
plane needs to be determined. Subsequently, the cylindrical coordinates can be transformed into Cartesian coordinates through the following equations.
where
and
are the waypoint
and
in the Cartesian coordinate system, and
is the cylinder vector.
3.3.2. Interactive Strategy at the Teacher Phase
In actual teaching, teachers fulfill a pivotal role. Their responsibilities extend beyond a profound command of subject matter to include exceptional pedagogical skills and communicative abilities, enabling them to efficiently impart knowledge to students. To cater to the diverse learning needs of their pupils, teachers must employ a versatile repertoire of instructional methods and strategies. Furthermore, timely assessment and feedback on student progress are essential, as they inform adjustments to the content and approach of teaching, thereby facilitating continuous improvement in educational practice.
Interaction between teachers and learners is a critical component of the educational process, fostering not only the transfer of knowledge but also enhancing student engagement and motivation. Nevertheless, individual differences among learners, such as variations in cognitive styles, learning habits, and prior knowledge, can impede the complete absorption and comprehension of the knowledge by learners. The dynamic process is defined as follows:
where
is the position of the individual after learning, and
is random numbers between (0,1).
is a constant of
,
denotes the current number of iterations, and
denotes the maximum number of iterations.
Figure 1 illustrates a nonlinear decline in the value of
as the number of iterations increases. This trend mirrors a fundamental aspect of the teaching process: initially, students possess a limited knowledge base, offering substantial room for improvement, which results in notably effective teaching interactions. As teaching progresses, and students develop a solid foundation, the potential for significant advancement through interaction diminishes. In response to this observation, the refined algorithm decelerates the performance value’s decay at the onset of iterations, enabling more extensive global exploration. Later in the iterative process, the algorithm accelerates convergence, enhancing the efficiency of the global search. This modified teaching phase effectively boosts the overall performance of the algorithm.
3.3.3. Enhanced Interactive Learner Phase
On the foundation of previous research into the learner phase, this paper further contemplates the significant impact of student interactions during the educational process on learning outcomes. In practical education, in-depth exchanges and communication with students who excel academically play an undeniable role in enhancing learning effectiveness. There is a positive correlation between the frequency of interaction and learning outcomes, which stems from the fact that by interacting with high-performing peers, students can acquire a richer reservoir of knowledge and valuable experience, and receive positive encouragement and guidance, thereby effectively boosting their motivation to learn and improving their academic performance. Moreover, spatial proximity is another crucial factor affecting learning efficacy. Maintaining proximity to top-performing students allows for more convenient seeking of help and advice, consequently gaining access to more learning resources and support. At the same time, close interaction with outstanding students helps ignite enthusiasm for learning and uncovering one’s potential. The above processes can be defined mathematically as follows:
The learning rate adjustment,
, is influenced by both the iteration number and random variables, ensuring that the algorithm adapts dynamically across iterations. This term can be described as follows:
where
represents a random number between 0 and 1; the value of
is [0, 1], affecting the exploration–exploitation balance of the algorithm; and
[
30] is used to denote the impact of the distance between students on the effectiveness of their interactions, as specified by the following equation:
where the
is a constant representing the scale of the Levy flight, and
and
are randomly selected numbers in the range of [0, 1]. This random walk mechanism, described by the Levy distribution, is crucial for balancing the global exploration and local exploitation aspects of the algorithm.
In summary, the flow chart of ICVTLBO is shown in
Figure 2. The specific processes are as follows:
Step 1: The environment model, including the terrain, obstacles, and any other environmental constraints, is input into the system. This model will be used throughout the optimization process to simulate and evaluate the paths.
Step 2: Key parameters of the ICVTLBO are initialized, including population size, maximum iterations, and cylindrical coordinate system parameters.
Step 3: Initial UAV routes are randomly generated in cylindrical coordinates, leveraging rotational maneuver characteristics of UAVs. Each path is represented as a vector with angular and radial components.
Step 4: The fitness value of each path is calculated based on criteria, including path length, collision risk with static obstacles, and terrain clearance safety margin.
Step 5: The population is ranked by fitness values, and the top-performing path is designated as the preliminary candidate solution.
Step 6: Step 6a: A teacher-guided challenger path is generated using Equation (13), which integrates terrain elevation constraints into the cylindrical vector update mechanism. Step 6b: The challenger’s fitness is compared with the current candidate solution. If superior, the challenger replaces the current solution; otherwise, it is rejected.
Step 7: The global best path is iteratively refined based on the teacher phase outcomes, ensuring alignment with static environmental features.
Step 8: Step 8a: A neighborhood-based challenger path is generated via Equation (15), incorporating dynamic vector adjustments between adjacent UAV trajectories to avoid local optima. Step 8b: The challenger is accepted only if it demonstrates improved fitness, emphasizing obstacle avoidance and path smoothness in static environments.
Step 9: All UAV paths are synchronously updated based on the interactive teacher–learner phases, preserving diversity through cylindrical vector rotations and radial scaling.
Step 10: The process repeats until predefined termination criteria.
Step 11: The algorithm outputs the globally optimal collision-free 3D path in Cartesian coordinates, optimized for static terrain constraints and IoT service requirements.
5. UAV Path Planning Experiments
To thoroughly assess the efficacy of the proposed algorithm, a series of simulations are executed upon authentic digital elevation model (DEM) maps that are sourced from LiDAR sensor-derived terrain data [
40]. These simulations entail the development of nine distinct UAV path planning scenarios, each presenting a unique level of terrain complexity, thereby allowing for a comprehensive evaluation of the algorithm’s performance. Within this experimental context, a series of comparative analyses are conducted, positioning ICVTLBO against well-established optimization methods, including PSO, ABC, TLBO, TLSBO, and COA. Central to these analyses is the optimization of a carefully devised cost function, which serves as the basis for quantifying and comparing the optimization proficiency of each algorithm in addressing path planning challenges. Throughout the optimization endeavor, all contending algorithms utilize a cylindrical vector planning methodology for pathway search and refinement. In this process, prospective flight paths are initially encoded into a collection of vectors—defined by their amplitude, elevation angle, and azimuth—utilizing a metaheuristic encoding technique. This is followed by an extensive search for the configuration space to explore these vectors. Upon completion of the search, the resultant vectors are then converted into a set of Cartesian coordinates, permitting a heightened precision in the delineation of three-dimensional pathways. Concluding the procedure, the cost function is applied to each candidate pathway, culminating in the identification of the path exhibiting the least cost as the most optimal solution.
5.1. Experimental Settings
A fixed parameter set is utilized across the experiments to ensure consistency. The population size is designated as N = 100 to facilitate iterative optimization, and the upper limit for iterations is set at MaxIter = 200, which strikes a balance between exhaustive search and computational efficiency. Moreover, the number of pathway nodes is established at
n = 10, forming the scaffold for the drone’s flight path. The starting coordinates are defined as (200,100,150), with the endpoint specified at (800,800,200), reflecting the unipoint-to-unipoint nature of the flight mission. Each simulation scenario is run independently 30 times, with outcomes systematically documented in tabular form for subsequent analytical purposes. Furthermore, nine distinct terrain scenarios have been meticulously crafted, each of which includes a series of obstacles, all represented as red cylinders. The three-dimensional coordinates and radii of these obstacles are meticulously listed in
Table 2, providing crucial information for understanding UAV flight path planning in diverse landscapes.
5.2. Comparison and Analysis of Results of Different Methods
The performance data of the six optimization methods in 30 rounds of independent testing are summarized in
Table 3, with the superior outcomes being emphasized in boldface type. The data reveal that ICVTLBO consistently exhibits superior accuracy across all tested scenarios, surpassing the performance of other comparative algorithms.
Within scenarios 1 to 4, characterized by fewer obstacles and threats, the average fitness values among the various methods do not deviate significantly. For instance, in scenario 2, ICVTLBO achieves an average fitness value of 4627.4116, whereas the corresponding values for TLBO, TLSBO, PSO, and ABC are 4645.3865, 4655.3958, and 4646.9727, respectively. Notably, ICVTLBO demonstrates remarkable stability as its standard deviation in fitness value is merely 0.8281, a figure distinctly lower than the standard deviations of the other compared methods, thereby affirming its significant advantage in terms of stability.
Enhanced scrutiny of the data encapsulated in
Table 3 indicates a decline in solution precision for the other algorithms, aside from ICVTLBO, as the complexity of scenarios escalates from 5 to 9, encompassing a greater number of obstacles. Within these intricate environments, although the COA algorithm maintains a certain level of solution quality, it fails to parallel ICVTLBO’s performance. Specifically, ICVTLBO records average fitness values of 5160.3514, 4888.1636, 5413.4242, 5131.2233, and 5209.7683 across scenarios 5 to 9, respectively. These figures unequivocally demonstrate ICVTLBO’s efficacy and stability in addressing complex path planning problems, even amidst a general decline in search capabilities.
Upon examining
Table 3, the superior performance of the ICVTLBO algorithm becomes evident. Specifically, in the third-to-last row (W|T|L), the algorithm consistently achieves the best performance across all nine scenarios. In contrast, the PSO algorithm demonstrates inferior performance across these scenarios, while the other algorithms tested exhibit moderate performance levels. Furthermore, as indicated by the Friedman rank sum test results presented in the last two rows of
Table 3, the ICVTLBO algorithm ranks first with a significantly lower mean rank compared to the other algorithms, affirming its leading position within the comparative group.
In summary, through a comparative analysis against other advanced algorithms, ICVTLBO proves to have a significant edge not only in the quality of solutions but also in the stability and reliability of the algorithm when tackling diverse path planning challenges.
In the current investigation, an exhaustive comparative analysis delves into the performance of various path-planning algorithms across nine distinct scenarios to elucidate the variances in solution accuracy and convergence rate among these algorithms. The empirical findings reveal that the ICVTLBO algorithm demonstrates superior performance metrics when benchmarked against other contending algorithms. Specifically,
Figure 4 illustrates the three-dimensional, top, and side views of routes generated by different algorithms in nine path planning scenarios. While all tested algorithms can produce effective obstacle-free paths, the trajectories derived from ICVTLBO not only exhibit shorter lengths but also surpass in terms of smoothness, culminating in the highest path quality. In contrast, the pathways orchestrated by TLBO, TLSBO, PSO, ABC, and COA exhibit greater tortuosity, unequivocally attesting to the pronounced superiority of ICVTLBO in path optimization. Particularly commendable is ICVTLBO’s performance in intricate terrains, where the algorithm adeptly aids UAVs in maintaining a safe distance from obstacles while simultaneously devising short and secure navigation routes. This capability holds significant relevance for autonomous UAV flight within complex environments, as it not only ensures flight safety but also amplifies operational efficiency.
In the convergence curve diagrams across nine distinct scenarios, the ICVTLBO algorithm remarkably surpasses the other competitive algorithms in addressing the three-dimensional path planning issues for drones. A meticulous examination of the convergence behaviors of various algorithms under diverse conditions yields a clear conclusion that ICVTLBO significantly outperforms its predecessor, the conventional TLBO algorithm, in terms of convergence rate and global search capability. This fact robustly validates that the improved operator proposed in this study effectively remedies the deficiencies inherent to the TLBO algorithm. As depicted in
Figure 5, aside from scenarios 4, 7, and 9, ICVTLBO rapidly converges to the optimal path within approximately 100 iterations in the remaining six scenarios. Among the six meta-heuristic algorithms compared, ICVTLBO exhibits the most rapid convergence speed. Further analysis reveals that as environmental complexity escalates, algorithms such as TLBO, TLSBO, PSO, and ABC are increasingly prone to becoming trapped in local optima, whereas ICVTLBO consistently approaches the global optimum unperturbed by the complexity of the path scenarios. This phenomenon strongly suggests that ICVTLBO possesses the most exceptional global exploration ability among the multitude of meta-heuristic algorithms. Synthesizing the analysis, ICVTLBO leads conspicuously ahead of the other comparative algorithms in convergence velocity, solution accuracy, and competence in navigating complex environments. Hence, ICVTLBO undoubtedly qualifies as a path planning algorithm replete with potential, meriting further research and widespread application.
5.3. Time Complexity Analysis
This section uses average execution time as the evaluation metric to verify the execution efficiency of ICVTLBO and the comparison algorithms, recording the total runtime of six comparison algorithms across nine scenarios. To ensure the reliability of the experimental data, each algorithm is independently run thirty times, and the average runtime is taken as the evaluation metric. The experimental results are shown in
Figure 6.
As shown in
Figure 6, when UAV path planning is performed in different scenarios, the average execution time of ICVTLBO is comparable to TLBO, while the time costs of TLSBO, PSO, ABC, and COA progressively increase. The average runtime of ICVTLBO exceeds TLBO by only 0.0333 s, and the time cost difference can be considered negligible, primarily due to the introduction of targeted interaction strategies in ICVTLBO during two stages. Although these strategies add some time cost, they significantly enhance convergence performance. Notably, ICVTLBO maintains efficiency like TLBO while demonstrating significant advantages. Through targeted interaction strategies, ICVTLBO more effectively explores the solution space, avoiding local optima and identifying better path planning solutions. Experimental results show that ICVTLBO converges quickly to high-quality solutions in various complex scenarios, significantly improving path planning efficiency. The paths planned by ICVTLBO are not only smooth and continuous but also better meet UAV flight performance constraints, enhancing flight safety and mission completion. In conclusion, ICVTLBO is comparable to TLBO in terms of time cost but shows significant advantages in global optimization, convergence speed, environmental adaptability, and path quality, providing an efficient and reliable solution to UAV path planning problems.
6. Conclusions
To address the challenging issue of three-dimensional drone path planning, this study proposes an innovative optimization algorithm, ICVTLBO, engineered to surpass the limitations of the original TLBO algorithm in terms of rapid convergence and avoidance of local optima. To augment the efficacy of the algorithm, ICVTLBO incorporates a diversified interactive strategy that not only bolsters its global exploration capabilities but also enhances population diversity, thereby circumventing the problem of premature convergence. Initially, by integrating a refined interactive strategy during the teacher phase within the iterative process, the ICVTLBO algorithm achieves a smooth transition from global exploration to local exploitation, significantly improving its search efficiency. Furthermore, the learner phase’s interactive strategy facilitates knowledge exchange among elite and average students within the population, hastening the algorithm’s convergence rate and enabling it to approach or attain the optimal solution in a shorter timeframe. Exhibiting its exceptional performance in the CEC2022 benchmark suite, the ICVTLBO algorithm ranked first across multiple metrics and significantly outperformed other comparative algorithms by minimizing mean values when solving functions. When applied to the 3D drone path planning problem, the algorithm consistently achieved the best fitness function values across nine diverse terrain scenarios, confirming its robust resilience and adaptability in complex environments. Additionally, the convergence curves of these algorithms indicate a significant advantage of ICVTLBO. Consistent experimental outcomes indicate that the ICVTLBO algorithm can secure highly competitive solutions under varying test environments, effectively resolving the 3D path planning problem for drones in intricate settings.
While the ICVTLBO algorithm demonstrates promising performance in current simulations, future research could extend from two key directions: (1) Cross-domain adaptive enhancement: Integrate deep reinforcement learning paradigms to strengthen dynamic decision-making under real-time environmental perturbations and explore federated learning frameworks for multi-UAV collaborative path planning in large-scale IoT deployments. (2) Resource-intelligence coupling: Develop energy-time-risk tri-objective optimization models that holistically consider 6G edge computing resource scheduling, UAV battery dynamics, and link stability thresholds, especially for time-sensitive sensing applications in smart cities.