Towards Flocking Navigation and Obstacle Avoidance for Multi-UAV Systems through Hierarchical Weighting Vicsek Model

: Flocking navigation and obstacle avoidance in complex environments remain challenging for multiple unmanned aerial vehicle (multi-UAV) systems, especially when only one UAV (termed as information UAV) knows the predetermined path and the communication range is limited. To this end, we propose a hierarchical weighting Vicsek model (HWVEM). In this model, a hierarchical weighting mechanism and an obstacle avoidance mechanism are designed. Based on the hierarchical weighting mechanism, all the UAVs are divided into different layers, and assigned with different weights according to the layer to which they belong. The purpose is to align the rest of UAVs with the information UAV more efﬁciently. Subsequently, the obstacle avoidance mechanism that utilizes only the local information is developed to ensure the system safety in an environment ﬁlled with obstacles differing in size and shape. A series of simulations have been conducted to demonstrate the high performance of HWVEM in terms of convergence time, success rate, and safety.


Introduction
Unmanned aerial vehicle (UAV) swarm is an inevitable trend for the aerospace industry [1], and flocking navigation is one of its spotlights. Flocking behavior refers to a group-level phenomenon that occurs through simple individual interactions, where individuals are guided by the local rules rather than central coordination [2][3][4]. Navigation requires the group to move along a preset path [5] while all of the members should be able to avoid obstacles they confront [6], as shown in Figure 1.
Recently, many approaches have been developed to realize flocking navigation for multi-UAV systems. For example, Yan et al. [7] considered the leader-followers flocking problem of fixed-wing UAVs in the context of deep reinforcement learning. The followers can always follow the leader closely. However, it assumes that all followers can obtain the position and velocity information of the leader no matter how large the group is. Chen et al. [8] addressed the coordinated path following problem of fixed-wing UAVs on a 2D plane and derived the sufficient conditions for the stability of the closed-loop system. However, in the system, every UAV is required to possess the ability to plan its own path. Hoang et al. [9] proposed a novel control strategy for a swarm of UAVs that adopts an adaptive weight allocation mechanism based on the current context. The scheme performs better than the conventional one, yet all the UAVs must know the position of the target point.
Most of the previous studies either predefine the path of every UAV, or give the information of the leader to them [7][8][9][10][11], both of which are hard to realize in practice. Firstly, the mechanism of receiving the path information remotely from the ground station requires a communication device to be equipped on each UAV, which in turn burdens the data transmission load. Secondly, the UAV can execute the path planning based on the perceived information, but in practice the capability varies a lot; maybe only a few of them can obtain the goal position and plan the path correctly. Thirdly, the non-cooperation goal location may not be interpreted precisely, and the information transmission may be blocked due to the communication interruption. Therefore, determining how to navigate the multi-UAV system with only one individual knowing the path information is of great importance. Additionally, determining how to realize path following and obstacle avoidance within a limited interaction radius is also a challenging task.
To realize flocking navigation, all of the flocking members must follow the trajectory that is created by the Information UAV (a UAV with the predetermined path information). As a classic flocking model, the Vicsek model (VEM) describes a system where the member aligns to the average movement direction of its neighbors, and all the members in the flocking are finally aligned [12]. Inspired by the VEM, it can be seen that as long as all the members align themselves with the information UAV within a short enough time, they can follow tightly with the information UAV, and hence the system can move along the desired path.

Predetermined Path
Obstacle Information UAV Ordinary UAV Figure 1. The scenario of flocking navigation with obstacle avoidance. The Information UAV (the red UAV) adjusts its heading angle according to the predetermined path information it learns in advance. The objective of this paper is to make sure that all the blue UAVs follow the red one closely and avoid the obstacle successfully.
There have been a lot of studies on how to improve the alignment speed of the Vicsek model [13][14][15]. Nevertheless, most of them are only applicable in converging to a random rather than a desired direction. To solve the problem, we proposed the weighting Vicsek model (WVEM) to induce all individuals to converge to the reference state [16]. In WVEM, the individuals are divided into three types according to whether they can directly interact with the information UAV. The individuals of the same type are assigned with the same constant contribution intensities. However, the convergence efficiency is not fast enough for large-scale multi-UAV systems. In this paper, we further propose a hierarchical weighting Vicsek model (HWVEM) to improve the convergence performance. Compared with WVEM, HWVEM distinguishes the influence of different individual in a more detailed way and sets the weight of each individual more flexibly and reasonably. WVEM can be regarded as the simplest form of HWVEM, and HWVEM is more general and normative.
The main contributions of this paper are twofold. Firstly, we propose the hierarchical weighting mechanism to induce all the individuals to align with the information UAV. All the individuals are divided into many layers based on their topological distance from the information UAV. The contribution intensity of each individual is assigned dynamically to ensure that the key information is always provided enough influence. Secondly, the obstacle avoidance mechanism is designed to ensure the system safety in an obstacleexisting environment with obstacles. The mechanism induces the UAVs to align with the edge of the obstacles in a tangential direction and it can be added into other models conveniently.
The rest of this paper is organized as follows. Section 2 proposes HWVEM and describes the involved parameters. Numerous experiments are carried out in Section 3 to verify and analyze the model performance. Section 4 summarizes all the points of this paper.

Flocking Navigation and Obstacle Avoidance with HWVEM
This section first proposes the hierarchical weighting mechanism. Then, according to that, a HWVEM is proposed to facilitate the rest of UAVs in aligning with the information UAV. Subsequently, an obstacle avoidance mechanism is designed to keep the UAV away from the obstacle. Finally, a HWVEM-based flocking algorithm is proposed to achieve both the flocking navigation and obstacle avoidance for a multi-UAV system.

Hierarchical Weighting Mechanism
In our work, the multi-UAV system assumes that only the information UAV masters the predetermined path. To describe that capability, this paper applies the enable matrix E N = [e i ] N×1 , in which the element follows the rule, where i ∈ Γ IU means that the individual i is the information UAV [16]. Based on the topology with the information UAV, the rest of UAVs are divided into different layers (As shown in Figure 2a). The information UAV belongs to the 1-st layer and i ∈ Λ n means that the individual i belongs to the n-th layer. There is i ∈ Λ n if there exists the shortest sequence of neighborhoods N q 1 , N q 2 , . . . , N q n−1 , such that q 2 ∈ N q 1 , q 3 ∈ N q 2 , . . . , i ∈ N q n−1 , where n − 1 denotes the length of the sequence and q 1 ∈ Γ IU . j ∈ N i means the individual j is the neighbourhood of the individual i and satisfies 0 < x i − x j ≤ r, where x i represents the position of the individual i and r represents the interaction radius. The layer matrix L N = [l i ] N×1 is proposed to denote the layer of every UAV and the element l i follows the rule, Transmitting the layer information between UAVs brings a time consumption, thereby the layer of the individual i at time step k is calculated based on the layers of its neighborhoods at time step k − 1. The layer of the UAV is one layer lower than the highest layer within its neighborhoods, which is updated by Accordingly, a hierarchical topology network is established (Figure 2a). The upperlayer UAVs are more directly and quickly influenced by the information UAV. They are assigned with higher weight to ensure enough influence from the crucial information and to improve the speed of alignment of the individuals with the information UAV.

Hierarchical Weighting Vicsek Model
The information UAV adjusts its movement direction according to the mastered path information, as its heading angle θ d becomes the reference state for the rest of UAVs. In the Vicsek model [12], N individuals move at a constant speed v 0 on the D × D plane with periodic boundary in discrete time, and the UAV density is denoted as ρ = N/D 2 . At the initial moment, the position of the individual evenly distributes on the plane and the heading angle obeys [0, 2π) uniform distribution. The heading angle of each individual at time step k + 1 is the average of the direction of its neighbours at time step k, which is updated by where N i (k) denotes the index set of the neighbourhoods of the individual i, and ∆θ i (k) ∈ [−η/2, η/2] represents the noise, η is the noise intensity. The individual j is the neighbor of the individual i if requiring 0 < x j (k) − x i (k) ≤ r, r is the interaction radius. The velocity of the individual i is updated by where e 1 and e 2 are mutually perpendicular unit vectors. The position of the individual i is updated by Considering that the system driven by VEM alone cannot achieve flocking navigation, and inspired by the hierarchical weighting mechanism, HWVEM is raised. In HWVEM, each individual has the same kinematic models as VEM, i.e., Equations (7) and (8). The heading angle of the individual is updated as follows, where H N = h ij N×N is the neighbor matrix that describes the neighbor relationships of UAVs, in which the element is defined by In the following, we will introduce the involved parameters i.e., a ij , b ij , c ij in more details.

Adjacency Matrix
The adjacency matrix A N = a ij N×N is used to describe the individuals within the interaction range of the given individual. Only the individuals within the interaction range can influence the given individual. The element a ij is expressed as

Dominance Matrix
In the swarm, the heading angle of the information UAV will not be influenced by any other individual. For the UAVs of other layers, all individuals can be influenced by others. The flows of information among UAVs are shown in the Figure 2a. Only when the information of an individual flows to the given UAV, can it contribute to the given UAV [17]. The dominance matrix B N = b ij N×N is defined to describe the information flow direction between each pair of UAVs. When the information of the individual j flows The UAV does not contribute to its own new preferred movement direction, b ii (k) = 0.

Contribution Matrix
The contribution matrix C N = c ij N×N is defined to describe the contribution intensity of every UAV during the decision-making process regarding the new preferred directions of the given UAV. The contribution intensity c ij of the individual j to the individual i consists of three parts: the absolute layer weight w abs j of the individual j, the relative layer weight w rel j of j, and the weight w nei ij of the neighborhood layer status d ij of the individual i. Thus, the contribution c ij can be expressed as follows: The Absolute Layer Weight Individuals of the same layer are assigned with the same absolute weight. The higher the layer of the UAV, the earlier it will be affected by the information UAV, and the closer its direction is to the reference state. The absolute layer weight is used to characterize this attribute. The closer the individual direction is to the reference state, the higher the absolute weight it is assigned, and following this principle the faster the UAVs align themselves to the reference state. to encourage all UAVs to align to the reference state faster. Moreover, the absolute layer weight value cannot appear negative. Based on the above two properties, a simple yet effective form of the absolute weight can be designed as follows, where p 1 ≥ 0 is the tuning parameter that is used to adjust the value of the absolute layer weight.

The Relative Layer Weight
The relative weight refers to the intensity that the given UAV contributes to the UAVs on its above, same, and beneath layer. As for the given UAV, considering its neighbors, among them the upper-layer individuals are the closest to the reference state, while the lower-layer is the opposite, and the individual from the same layer is the middle one. According to the relative topological distance between the individuals and the information UAV, the given individual possesses a higher weight to its lower layer individual and a lower weight to its upper layer. To simplify the model, in general, let the contribution intensity of the given individual to its same layer individual, and its upper layer and its lower layer be 1, 1/p 2 and p 2 , respectively, where p 2 ≥ 1. The relative layer weight of every UAV is the same. w rel i can be expressed as follows, The Weight of the Neighbourhood Layer Status The neighbourhood layer status d ij is proposed to represent the status of the number of individuals at the layer of the individual j in the neighbours of the individual i, which is calculated by where | · | represents the number of the elements of the set. When the individuals at the layer of the individual j is too few, j is assigned with higher weight to ensure that its layer can possess a certain contribution intensity. However, the influence of the neighbourhood layer status should not grow large nor become the dominant factor, so the natural logarithm is used to describe it. In this paper, when d ij > e, it means that there are too few individuals at the layer of the j. If the number of individuals at the layer of the j is enough, its contribution intensity is 1. Therefore, w nei ij is designed as follows,

Obstacle Avoidance Mechanism
The insight behind our obstacle avoidance mechanism is to make the UAV who detects the obstacle move along the tangent direction of the obstacle [6]. Once the UAV detects an obstacle, it will first determine whether to take action to avoid the obstacle or not. Specifically, when the obstacle is in the way of UAV's original movement direction, the UAV will align its heading angle to the tangent direction of that obstacle.
The genuine desired movement direction θ g i of the individual i represents the direction contributions of its neighborhoods, except the individuals who are avoiding the obstacle. The action of avoiding obstacles is both temporary and reactive, and is not a perpetual demand of the swarm. Ignoring the influence of the individual who is avoiding the obstacle, θ g i is updated by where the element of the state matrix i ∈ Γ avoidance means that the individual i is taking action to avoid the obstacle.
The occasion to avoid the obstacle refers to when the position of the individual i, the genuine desired movement direction of i and the position of the nearest obstacle point satisfy the following constraints, where x obs represents the position of the nearest obstacle point, abs(·) means the absolute value of the number,θ(·) denotes the angle between the vector and the x-axis of the inertial coordinate system, r avoid and θ obstruct represent the distance and the direction, respectively, that the obstacle can threaten the movement of the individual. If the vector is obtained by rotating x-axis counterclockwise, its heading angle takes a positive value, otherwise it is taken as a negative value. In this paper, r avoid = 0.5, θ obstruct = 11π/20. If the individual i satisfies the constraints above, it will take action to avoid the obstacle. The strategy is to adjust its heading angle to align to the tangent direction of the nearest obstacle point and be consistent with the original direction as much as possible. The tangent direction can denote two opposite angles, the individual chooses the angle that makes an acute angle with the previous heading angle (i.e., θ obs in Figure 3). The UAV will not take action based on the obstacle point where there is no tangent. If it only detects the obstacle point without tangent, it will move in the original direction. The heading angle of the individual i is updated by

HWVEM-Based Flocking Algorithm
To achieve flocking navigation, it is still necessary to add attraction term and repulsion term for the multi-UAV system [18,19]. The attraction force is used to maintain the swarm compactness and the repulsion force is to avoid the collision. The form of attraction term is as follows: where r att represents the range of the attraction force, r rep represents the range of the repulsion force and j ∈ N 1 i satisfies r rep < x ij ≤ r att . The form of repulsion term is as follows: where j ∈ N 2 i satisfies 0 < x ij ≤ r rep . The intensities of both terms of the UAVs of all layers to other individuals are the same and the information UAV is still not influenced by any other individual. The velocity of the UAV is updated according to the following equations: where c align , c rep , c att represent the coefficients of the alignment term, the repulsion term and the attraction term, respectively, and v max represents the max velocity magnitude of the UAV, v max = 0.1. Accordingly, the HWVEM-based flocking algorithm is summarized as shown in Algorithm 1.

Require:
The position of each neighborhood individual j, x j (k) The velocity of each neighborhood individual j, v j (k) The layer of each neighborhood individual j, l j (k − 1) The state of each neighborhood individual j, o j (k − 1) Ensure: The position of the individual i, x i (k + 1) The velocity of the individual i, v i (k + 1) The layer of the individual i, l i (k) 1: Calculate the adjacency matrix a ij (k) with (11) 2: Calculate the dominance matrix b ij (k) with (12) 3: Calculate the contribution matrix c ij (k) with (13) 4: Calculate the coefficient matrix h ij (k) with (10) 5: Update genuine direction θ g i (k + 1) with (18) 6: Update obstacle detection state o i (k) with (19) 7: Update heading angle θ i (k + 1) with (22) 8: Update the alignment term v align i (k + 1) with (7) 9: Update the attraction term v att i (k + 1) with (23) 10: Update the repulsion term v rep i (k + 1) with (24) 11: Update the velocity v i (k + 1) with (26) 12: Update the position x i (k + 1) with (8) 13: Update the layer matrix l i (k) with (4) 14: Return v i (k + 1), x i (k + 1), l i (k)

Numerical Simulation and Analysis
This section first conducts numerical simulation experiments to test the alignment performance and verify the effectiveness of the designed hierarchical weighting mechanism. Then, the proposed HWVEM-based algorithm is applied to a complex mission scenario to demonstrate the flocking navigation and obstacle avoidance.

The Verification of the Model Alignment Performance
This section verifies whether all UAVs can converge to a fixed direction, which is the foundation of achieving flocking navigation. The attraction term and repulsion term are ignored and the simulation is carried out in a periodic boundary environment. We use the convergence time k con as performance metric: k con denotes the minimum time steps required for the system to reach the desired consistency degree Deg d con . Deg con represents the consistency degree between all UAVs' heading angles and the reference state θ d , where The smaller the k con is, the shorter the time it is for all UAVs to converge to the reference state, the better the system performance is achieved.

Contribution of Each Weighting Term
We first evaluate the model performance with different layer weighting parameters. The reference state is set to π/4. An exclusive information UAV is set with a π/4 heading angle fixed, its initial position is at the lower left corner of the area. To stress that the feasibility of the alignment to the reference state is the focus of our studies, the noise will not be taken into account here. The following settings are the same: v 0 = 0.03, ρ = 4.102, r = 1, Deg d con = 0.9984, η = 0. Three sets of groups are discussed: (a)N = 100, D = 4.962; (b)N = 200, D = 7; (c)N = 500, D = 11.051. The initial position and velocity of UAV also influence the convergence performance, and thereby the 100 simulations with different initial states are carried out for each parameter to avoid the randomness and obtain the general results. Figure 4 shows the process of the multi-UAV system evolution. All of the individuals have transformed from the initial chaos and disorder to the final alignment, strongly testifying the reliability of the proposed model. Figure 5 shows the average convergence time of the system with different layer weighting parameter. Intuitively, the convergence time increases with the increase of p 1 while decreasing with the increase of p 2 . The relative weight of the upper layer UAV and the given UAV decreases, following the increase of p 1 while increasing with the growing of p 2 . The upper layer UAV is closer to the reference state and is assigned with higher relative weight, which can induce the given UAV to align with the upper layer UAV faster, and thus the given UAV can converge to the reference state faster. Furthermore, as shown in Figure 5b, when the value of p 2 is quite large, especially p 2 ≥ 50, the convergence time is no longer significantly reduced. The reason is that the system performance has reached saturation. On one hand, as long as the given individual is affected by the upper layer individual, it can take the fastest action to align with the upper individual. On the other hand, even if the influence of the upper-layer continues to be improved, the given individual cannot take faster action to align to the reference state. Therefore, when p 2 is large enough, the system performance will not be further promoted with the increase of p 2 .   Then, the ablation experiment is carried out to verify the contribution of the mechanism of weighting the neighborhood layer status. The ablation results are shown in Figure 6.  Figure 6. The box-plot for system performance under different application circumstances of the neighborhood layer status. The red boxes represent the system performance with the mechanism of weighting the neighborhood layer and the blue boxes without. The black rhombus are the extreme outliers.
As shown in Figure 6, the larger the group, the longer the convergence time. Obviously, weighting the neighborhood layer status brings better system performance. When the upper layer is lack of individuals, by this mechanism the contribution intensities can be improved to a certain extent compared to not applying the mechanism. The individuals can be induced to align with the upper layer individual and the reference state more quickly.
When there are too few individuals of other layers, the contribution intensities provided by the mechanism are limited. The individuals of the upper layer are sufficient to induce the given individual to align with the reference state with the shortest time. This result demonstrates that the proposed mechanism of weighting the neighborhood layer status can improve the system performance.

Comparisons of Different Models
To further evaluate the performance of HWVEM, we compare it with five baseline models, i.e., the classic Vicsek model (VEM), the classic Vicsek model with one information UAV (VEM-A), the improved Vicsek model (IVEM) [15], the improved Vicsek model with one information UAV (IVEM-A), the weighting Vicsek model (WVEM) [16]. In IVEM, the individual obeys the rule of moving toward the middle of two neighbors' motion directions with the maximum deviation. The updated model of the individual heading angle is as follows, Furthermore, to make IVEM adapt to the mission scenario and make sure all the individuals converge to the desired direction, a simple improved model named IVEM-A is proposed. In IVEM-A, the information UAV is introduced. If an individual can interact with the information UAV, it will take action based on the information of its neighbors, where the influence weight of the information UAV is 100 and ordinary UAV is 1. According to our previous work, the parameters k α = 30, k β = 18, k γ = 1 are purposely set for WVEM and the system can reach saturation (excellent) performance. k β means the weight of the UAVs who directly interact with the information UAV and k γ means the UAVs who not. For HWVEM, we simply set p 1 = 0.1, p 2 = 5. In these experiments, the reference state is set to π/4. The following settings are the same:  Table 1 shows the alignment performance of these models. Obviously, the convergence time of IVEM is shorter than VEM and the method of IVEM can enhance the convergence efficiency. As can be seen, both VEM-A and IVEM-A can induce the system to converge to the reference state π/4 while the system inspired by VEM or IVEM cannot. The only difference between VEM-A and VEM, and also between IVEM and IVEM-A, is the information UAV, which indicates that the information UAV is the foundation for the system to converge to the reference state. In VEM-A or IVEM-A, the individuals are constantly directly or indirectly affected by the information UAV, leading to the tendency that they align with the information UAV and eventually align to the reference state. However, the convergence time increases greatly.
Compared with VEM-A and IVEM-A, the convergence time of WVEM is significantly shorter. WVEM proposes a fairly simple weighting mechanism. When considering the contribution intensity of an individual to the new preferred movement direction of the given individual, the highest weight is assigned to the information UAV, the second is assigned to the individuals directly interacting with the information UAV, and the least is assigned to other UAVs. By assigning the higher weight to the individual who is closer to the reference state, the whole system is induced to converge in a faster way to the reference state. Therefore, the system performance of WVEM is more superior, which demonstrates that an appropriate weighting mechanism can shorten the convergence time. The time required by HWVEM is drastically shorter than WVEM. That is because HWVEM occupies the following improvements. Firstly, the exploration of the identity information of the UAV in HWVEM is more comprehensive and precise. The influence of the different individuals can be distinguished in a more detailed way and the weight of each individual can be set more flexibly. Secondly, the designed contribution intensity in HWVEM is no longer constant but dynamic. The designed rule is more reasonable and convincing. It can determine the intensity easily and flexibly and adapt to the large-scale multi-UAV system with lots of layers. Therefore, HWVEM performs better than WVEM and the weighting mechanism of HWVEM is more valid than WVEM.

Flocking Performance in Complex Scenario
The UAVs driven by VEM and IVEM cannot converge to the reference state, so they cannot accomplish flocking navigation. Other models acquire different convergence time, and parts of UAVs cannot keep up with the information UAV in time in complex mission scenarios and thus lose track and fail to complete the task. This section will remove periodic boundary and add the attraction term and repulsion term to evaluate the task performance of the flocking algorithms based on VEM-A, IVEM-A, WVEM, and HWVEM, respectively. The influence of weighting parameters setting for HWVEM is also considered and two sets of parameters whose alignment performances vary greatly are chosen: p 1 = 0.1, p 2 = 5 for HWVEM-I and p 1 = 0, p 2 = 100 for HWVEM-II. In Figure 5 in Section 3.1.1, we can simply conclude that the alignment performance of HWVEM-II is much better than HWVEM-I.
We define the success rate P s as metric to evaluate the task performance. P s represents the proportion of the number of individuals that complete the task to the total individuals, where N s represents the number of the UAVs that achieve the task successfully. The larger the P s and N s , the better the task performance. If a UAV follows the information UAV closely from the beginning to the end, it is considered to be successful. Besides, to evaluate the safety, two metrics are proposed: the minimum distance between individuals D ind and the minimum distance from the individual to the obstacle D obs during the entire process. The larger the D ind and D obs , the safer the swarm.
In the experiments, a certain degree of noise is introduced to test the system robustness. The following settings are the same: v 0 = 0.1, ρ = 3.265, r = 1, c align = 1, r align = 1, c rep = 0.1, r rep = 0.7, c att = 0.003, r att = 1, η = 0.1. To reduce computational burden and take the actual group into account, three sets of groups are discussed: (a)N = 5, D = 1.2375; (b)N = 20, D = 2.4749; (c)N = 50, D = 3.9132. One-hundred simulations with different initial states are carried out for each model. Figure 7 shows the entire mission scenario and task execution of HWVEM and Figure 8 displays the snapshots. As shown in Figure 7, the mission scenario is comprehensive, including traveling through narrow areas, and bypassing tiny or huge obstacles. The shapes of the obstacles include spheres, rectangles, diamonds, and triangles. In this experiment, all UAVs align with the information UAV within a short time and follow closely with the information UAV during the entire process, which illustrates that the swarm moves along the predetermined path successfully. Besides, the safety is demonstrated because no collision happened during the whole process. These results verified the feasibility and adaptability of HWVEM. The system can still complete the mission in the presence of noise, which shows that the model is robust and can be used to guide the actual flocking.
As shown in Table 2, when N = 5, all five models possess a relatively high success rate. Since the group is not large, every model can induce the UAVs to align with the information UAV within a short enough time. Nevertheless, VEM-A and IVEM-A acquire the lowest success rate, which is consist with their alignment performances.   When N = 20, the task performances of VEM-A and IVEM-A decline sharply. Once the system reaches a certain large group, the difference of task performance between the models begins to manifest. The task performance of HWVEM is the best, which is consistent with the alignment performance. It implies the rationality of our idea.
When N = 50, the performance of HWVEM-II is significantly higher than HWVEM-I. The better the parameters for the system convergence performance, the better the performance of the multi-UAV system executing the mission, which emphasizes the importance of the weighting parameter configurations.
Additionally, the values of D ind and D obs of different models are always similar, which shows the same system safety. The values are always positive and the minimal values are D ind = 0.0427, D obs = 0.0492. It shows that a certain degree of security of the multi-UAV system can be guaranteed.

Conclusions
This paper proposed a HWVEM based on VEM to achieve the flocking navigation and obstacle avoidance for the multi-UAV system in which only one UAV knows the predetermined path. According to the hierarchical weighting mechanism, all UAVs are divided into different layers based on their topological distance with the information UAV. The contribution intensity of each individual is assigned reasonably and flexibly, which is composed of three parts: the absolute layer weight, the relative layer weight, and the weight of the neighborhood layer status. The rationality of each part has been verified through numerous experiments. The proposed obstacle avoidance mechanism consists of two steps: occasion and behavior. Specifically, when the obstacle obstructs the genuine desired movement direction of the UAV, the UAV will align its heading angle to the tangent direction of the obstacle. The feasibility and reliability of HWVEM have been verified by comparing it with several baseline models in comprehensive mission scenario. Additionally, the system can still achieve mission in the presence of noise, which demonstrates the robustness of HWVEM and its potential in the real world. In the future, we will further evaluate our proposed model in the real-world environment. Another promising direction is to search the optimal weights for HWVEM via intelligent optimization algorithms, such as genetic algorithm (GA) and particle swarm optimization algorithm (PSO).

Data Availability Statement:
The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest:
The authors declare they have no conflict of interest.