Hierarchical Weighting Vicsek Model for Flocking Navigation of Drones

: Flocking navigation, involving alignment-guaranteed path following and collision avoidance against obstacles, remains to be a challenging task for drones. In this paper, we investigate how to implement ﬂocking navigation when only one drone in the swarm masters the predetermined path, instead of all drones mastering their routes. Speciﬁcally, this paper proposes a hierarchical weighting Vicsek model (WVEM), which consists of a hierarchical weighting mechanism and a layer regulation mechanism. Based on the hierarchical mechanism, all drones are divided into three layers and the drones at different layers are assigned with different weights to guarantee the convergence speed of alignment. The layer regulation mechanism is developed to realize a more ﬂexible obstacle avoidance. We analyze the inﬂuence of the WVEM parameters such as weighting value and interaction radius, and demonstrate the ﬂocking navigation performance through a series of simulation experiments.


Introduction
In recent years, researchers are showing growing interests in drone swarms, among which flocking navigation is particularly eye-catching. Flocking behaviors refer to the emergence of group-level phenomena through simple individual interactions and probably arise from local rules without the central coordination [1][2][3]; navigation refers to the swarm tracking a predetermined path [4].
There are a great deal of studies that can achieve flocking navigation successfully by assuming that all drones have access to the predetermined path information [5][6][7]. If all drones need to master the path information, the data transmitting load is heavy, or it is simply difficult to achieve. On the one hand, the drones receive the predetermined path from the ground station in remote control mode. Then, all drones should be equipped with communicators, which in turn, causes a heavy data transmitting load. On the other hand, the drones are designed to plan their path based on the perceived goal position in autonomous control mode, but the capability of path planning varies a lot; thus, maybe only few of them can accomplish the planning task by obtaining the goal information and selecting the path accordingly and correctly. Furthermore, the noncooperation goal position may not be accurately expressed and the path information may not be transmitted for the interruption of communication. Therefore, it is a meaningful task to navigate the flocking with only one drone mastering the path information. At the same time, it is also a challenging one due to the requirements of rapid convergence and obstacle avoidance under the limited interaction radius.
To achieve flocking navigation, we need to ensure that all drones align to the path of the information drone who means the drone that knows the predetermined path. As a classic flocking model, the Vicsek model (VEM) describes the system where individuals align their movement directions to those individuals within a local neighborhood, and all individuals in the flock eventually align [8]. Taking the advantage of the alignment properties of VEM, as long as all the drones successfully make themselves align with the information drone within a short enough time, they can follow with the information drone closely; thus, the swarm can move along the desired path.
To achieve alignment as soon as possible, the topology relationship is established according to the information held by the drone and the interaction radius. Related studies have suggested that if the hierarchy exists and if several drones are not easily influenced by other drones, they can induce of the movement of the Vicsek-model swarm [9][10][11]. A hierarchical structure-dividing all drones into α drones, β drones, and γ drones-is established in this paper to guarantee enough influence of the crucial information, as shown in Figure 1. Besides, the information is no longer limited to the path information and can also include information about the perceived obstacles. A layer-regulation mechanism is also designed to ensure rapid alignment with the drones that detect the obstacle, which leads to the more flexible obstacle avoidance.
Accordingly, this paper proposes the hierarchical weighting Vicsek model (WVEM), including the hierarchical weighting mechanism and the layer regulation mechanism. The hierarchical weighting mechanism is designed to improve the convergence speed of alignment with the information drone, and the layer regulation mechanism is to improve the performance of obstacle avoidance.  The drones' movements are chaotic and disordered at the beginning. Only the α drone masters the path information and it adjusts its heading angle accordingly. The drone whose perception range covers the α drone (β drone with blue color) can perceive the information of the α drone, while the drone whose range does not cover the α drone (γ drone with purple color) cannot perceive it directly.
The rest of this paper is organized as follows. In Section 2, we introduce some background work including VEM and several matrixes. Section 3 proposes the WVEM and the WVEM-based flocking algorithm. Numerous simulations are carried out in Section 4 to verify and analyze the model performance. The last section summarizes all the points of this paper.

Related Works
This section firstly introduces the classic Vicsek Model and a detailed description of the involved parameters. Then, we introduce several coefficient matrixes to describe the relationship among drones in the flocking.

Vicsek Model
VEM describes N drones moving on the L × L plane with periodic boundaries at a constant speed v 0 in discrete time, and the drone density is denoted as ρ = N/L 2 . At time step k, the position of the drone i is noted as x i (k) ∈ R 2 and the velocity is v i (k) ∈ R 2 , i ∈ 1, . . . , N. At the initial moment, both the horizontal coordinate and vertical coordinate of each drone obey [0, L] uniform distribution, the direction obeys [0, 2π] uniform distribution. The heading angle of each drone at time step k + 1 is the average of the directions of its neighbors at time step k, which is updated by where N i (k) represents the index set of the neighbors of drone i, ∆θ i (k) denotes the noise and is evenly distributed in [−η/2, η/2], and η is the noise intensity. Drone j is the neighbor of drone i if it requires 0 < x j (k) − x i (k) ≤ r, and r is the interaction radius. The velocity of the drone i is updated by where e 1 and e 2 are mutually perpendicular unit vectors, e 1 is parallel to the x-axis, and e 2 is parallel to the y-axis. The position of drone i is updated by

Coefficient Matrixes
In this paper, some matrixes will be adjusted to describe our proposed model and characterize the relationship among drones clearly, including the contribution matrix, dominance matrix, and adjacency matrix [12].
The dominance matrix B N = b ij N×N is defined to describe the information flow direction between each pair of drones. The information flow specifies the set of drones whose behavior influences the decision of the given drone. For drone i and drone j, only if the information of j flows to i can its behavior influence drone i and b ij = 1; otherwise, b ij = 0. For VEM, each pair of drones have the same status and their behaviors can be influenced by each other-that is, b ij = 1, ∀i, j = 1, . . . , N.
The contribution matrix C N = c ij N×N is defined to describe the contribution intensity of every drone during the decision-making process regarding the new preferred directions of the drones. The element value in the matrix corresponds to the weight value assigned to every drone, respectively, and c ij represents the contribution of drone j to drone i. For VEM, every drone has the same contribution intensity, and it is reasonable to assume that c ij = 1, ∀i, j = 1, . . . , N.
The adjacency matrix A N = a ij N×N is defined to describe the drones within the perception range of the given drone. Only the drones within the perception range of the given drone can influence the given drone. The element of A N is expressed as where N i (k) = j | 0 < x i (k) − x j (k) ≤ r and r denotes the interaction radius. Therefore, the movement direction of the drone in VEM is given by where l ij (k) = c ij · b ij · a ij (k), ∀i, j = 1, . . . , N. These matrixes mentioned above will be applied later when describing our model.

Hierarchical Weighting Vicsek Model
This section introduces our hierarchical weighting Vicsek model. Firstly, the properties of the α drone are presented. Then, all drones are divided into three layers based on the interaction topology and assigned different weights. Next, the layer regulation mechanism is developed for the flock to adapt to a variety of scenarios. Finally, a WVEM-based flocking algorithm is proposed to achieve flocking navigation for the drone swarm.

Hierarchical Weighting Mechanism
As long as each drone follows the information drone within a permitted time, flocking navigation can be achieved. The goal of the hierarchical weighting mechanism is to align drones with the information drone as soon as possible.
A so-called α drone is proposed to represent the drone that knows the path information or detects the obstacle. The heading angle θ i (k) of the α drone is only determined by the scenario-specific actual demand and is not affected by other drones, which is the reference state of all the other drones. To describe the ability of the α drone to access the reference state, this paper proposes the enable matrix E N = [e i ] N×1 , in which the elements follow the rule, where i ∈ Λ α means that drone i is the α drone. The approach to define the α drone is introduced in the next section. Based on the interaction topology relationship with the α drone, other drones are divided into two layers: β drones and γ drones. The drone that can directly interact with the α drone is called a β drone, namely, the drone whose perception range covers the α drone. Correspondingly, the drones that do not directly interact with the α drone are called γ drones. The definition of drones is shown in Figure 2. The definition process of drones is dynamic during the flocking motion. As the system evolves, some drones enter or leave the interaction range of the α drone. The β drone changes to a γ drone once failing to perceive the α drone. Instead, the γ drone becomes a β drone. The drones in the entire system are divided into three layers: α drone, β drone, and γ drone, which follows the rule, The definition matrix D N = [d i ] N×1 is proposed to denote the layer of every drone, and the element d i follows the rule, where i ∈ Λ β means that drone i is a β drone and i ∈ Λ γ means that drone i is a γ drone.
In the model, the movement direction of the α drone will not be affected by any other drone. For drones of other layers, all members excluding themselves have the ability to influence their movement directions. The flows of direction information among drones are shown in the Figure 2a. Only when the information of a certain drone flows to a given drone, can it contribute to the given drone. WVEM has its dominance matrix B N , in which the element b ij follows the rule, Besides, b ii = 0, thereby B N is also called a strictly dominance matrix. Furthermore, we hope to increase the system convergence speed by weighting drones. The same weight is assigned to the drones of the same layer, and the drones of different layers are assigned with different weights. The α drone is assigned with weight k α , β drones are assigned with k β , and γ drones are assigned with k γ . WVEM has its contribution matrix C N , in which the element c ij follows the rule, The value of weighting plays a significant role in system performance. In the system evolution, the heading angle of the α drone is exactly the reference state. A β drone is directly influenced by the α drone, so that it aligns with the α drone more quickly and its direction is closer to the reference state than the γ drone. The closer the drone movement direction is to the reference state, the higher the assigned weight value to encourage all drones to align with the reference state faster. Therefore, there should be Its rationality is verified, and more beneficial weighting guidelines have been researched by experiments since. To simplify the model, without loss of generality, let k γ = 1 in this paper, which means that the contribution intensity of the γ drone is regarded as a benchmark for other drones.
The drones in WVEM have the same kinematic model as VEM. The updated model of drones heading angles is as follows and is shown in Figure 2b: where and θ d i (k) refers to the angle obtained by the α drone after adjusting its angle according to the path information that it masters.

Layer Regulation Mechanism
As long as the α drone is identified, the layers of other drones can also be determined, which denotes that the identification of the α drone is the most important. Whoever masters the information conducive to the flock's survival becomes the α drone. The setup of the α drone is related to the specific scenario encountered by the flocking. This paper mainly focuses on two actual scenarios: • Scenario (I), Flocking Navigation. This refers to the process where the flocking movement is prompted by the drone that masters the predetermined trajectory. • Scenario (II), Obstacle Avoidance. This refers to the process where the flocking movement is prompted by the drones that detect the obstacle.
The scenario matrix S N = [s i ] N×1 represents the scenario information mastered by each drone, and the matrix element s i follows the rule, where i ∈ Γ navigation means that drone i knows the trajectory information, and i / ∈ Γ navigation means that i does not know. Thereby, the definition of the α drone is as follows: The behavior of the α drone is to adjust its heading angle θ i (k), which is exactly the reference state of the flocking. For specific scenarios, the α drone has different behavior strategies:

•
In scenario (I), the α drone acts to move along the trajectory In scenario (II), the α drone acts to stay away from the obstacle θ i (k) = θ d i,s i =2 . Different scenarios possess different degrees of importance. When the α drone is in comprehensive scenarios, it takes the action that should be taken assuming the most important scenario. For the flocking survival, obstacle avoidance is more important, and the α drone takes action to stay away from the obstacle if finding the objective and obstacle at the same time, During the flocking movement, the abrupt scenario appears randomly, while an immutable layer of the drone cannot address it appropriately. Hence, the layer regulation mechanism for the drone is proposed in response to the abruptness and randomness of the scenario. The layer regulation mechanism only refers to the mutual transformation of non-α drones and the α drone, and the layers of other drones are transformed accordingly. For a navigating flock who suddenly detects obstacle, the regulation mechanism includes two rules: • If a non-α drone detects the objective or obstacle, it immediately transforms into an α drone and takes corresponding actions required by the specific scenario. For the α drone, only if it detects the obstacle during the process of moving to the objective does it change its action to stay away from the obstacle; otherwise, it maintains the previous behavior.

•
For the α drone that previously detected the obstacle and is no longer detecting the obstacle, if it does not master the objective location, it is transformed into a non-α drone and its identity is determined with (7) and (8), otherwise it is also an α drone and only changes behavior.

WVEM-Based Flocking Algorithm
VEM is the solution to the rule of velocity matching and provides the alignment term. To achieve complete flocking navigation, it is still necessary to add attraction term and repulsion term for the drone swarm [13,14]. The attraction force is used to maintain the swarm compactness and the repulsion force is to avoid the collision. The form of the attraction term is as follows: where r att represents the range of the attraction force, r rep represents the range of the repulsion force, and j ∈ N 1 i (k) satisfies r rep < x ij ≤ r att . The form of the repulsion term is as follows: where j ∈ N 2 i (k) satisfies 0 < x ij ≤ r rep . The intensities of both terms of the drones of all layers to other drones are the same and the α drone is still not influenced by any other drone. The velocity of the drone is updated according to the following equations: where c align , c rep , c att represent the coefficients of the alignment term, the repulsion term, and the attraction term, respectively, and v max represents the max velocity magnitude of the drone, v max = 0.1. Accordingly, the process of the WVEM-based flocking algorithm is summarized as Algorithm 1.

Numerical Simulation and Analysis
This section firstly verifies whether all drones converge to the reference state with the α drone in a fixed movement direction. Next, the factors affecting the model alignment performance are analyzed in detail. Then, the flocking navigation and better avoidance of abrupt are realized in the presence of noise, which verifies the robust performance and practical value of the model.

The Verification of Alignment with the α Drone
This section mainly observes whether all drones can converge to the fixed direction, which is the foundation of achieving flocking navigation. The attraction term and repulsion term are ignored and the experiments are conducted in the periodic boundary environment. We use convergence time k con as a performance metric to evaluate system performance. k con denotes the minimum time steps required for the system to reach the desired consistency degree Deg d θ . Deg θ represents the consistency degree between all drones' heading angles and the reference state, where θ d represents the reference state and N α represents the quantity of the α drone. Accordingly, k con is expressed as follows; where 0 < Deg d θ < 1; the higher Deg d θ , the more severe consistency degree required by the system. The shorter k con , the faster convergence speed and the better performance.
In these experiments, the reference state is set to π/4. There is one α drone whose heading angle is always π/4, with an initial position that is in the lower-left corner of the area. To stress the feasibility of the alignment to the reference state as the focus of our study, this section dose not consider any noise temporarily. The parameter settings are as follows: v 0 = 0.03, ρ = 4.102, r = 1, Deg d θ = 0.9984, η = 0, k α = 10, k β = 5, N = 201, L = 7. The process of system evolution is shown in Figure 3.   Figure 3 shows the drones transforming from the initial chaos and disorder to the final alignment with the α drone, vividly illustrating the feasibility. As shown in Figure 4, the movement directions of all drones converge to the reference state and the consistency degree gradually tends to 1, which also demonstrates the reliability of WVEM.  Table 1.
As shown in Table 1, VEM-α can induce the swarm to converge to the reference state π/4 while the swarm inspired by VEM cannot. The only difference between VEM-α and VEM is α drone, which suggests that the α drone is the foundation for the swarm to converge to the reference state. The direction of the α drone is not affected by other drones. Other drones are constantly directly or indirectly affected by the α drone, leading to the tendency that they align with the α drone and eventually align to the reference state.
Compared with VEM-α, the convergence time of WVEM with k α = 10, k β = 5 is significantly shorter. VEM-α is a special case of WVEM, specifically referring to the case of k α = k β = 1, which means that in the swarm, all drones posses the same contribution intensities. The superior system performance of WVEM demonstrates that the weighting mechanism can improve the speed of the alignment of all drones to the reference state.

Analysis of Factors Affecting Alignment Performance
This section analyses the factors affecting the alignment performance. There are many factors that affect the system convergence time due to numerous parameters involved in WVEM-these include, but not limited to, weighting values, quantity and location of the α drone, drone interaction radius, drone density, noise intensity and type, consistency degree requirement, limited field of view, drone velocity, and other factors. Among these factors, weighting is regarded as the highlight of this paper. We firstly explore the influence of weighting values on the model performance. Then, we discuss how the drone interaction radius influences the performance. The attraction term and repulsion term are also ignored and the experiments are conducted in the periodic boundary environment. There is one α drone whose heading angle is always π/4 and initial position is the lower-left corner of the area in the system.

Impacts of Weighting Values
To initially analyze the impact of weighting values on the performance with k α changing, let k β = 1. The other parameter settings are as follows: v 0 = 0.03, ρ = 4.102, r = 1, Deg d θ = 0.9984, η = 0, N = 201, L = 7. Experiments of different initial states are conducted 100 times for different k α to obtain the general conclusions. The results are shown in Figure 5.
As shown in Figure 5, the general trend is that the convergence time decreases with k α increasing. The α drone holds larger contribution intensity to induce other drones to align with it more quickly, which allows better system performance. Before k α = 16, the system convergence time decreases strictly. From k α = 16, the system convergence time begins to fluctuate. When k α = 66, the system has the shortest convergence time and optimal performance. From k α = 41 to k α = 100, none of the convergence time exceeds the 5% of optimal convergence and the performance does not change significantly, which means the influence of the α drone on surrounding drones has reached saturation at k α = 41. When the surrounding drones receive the information of the α drone, although there is interference from γ drones and other β drones, they can take the fastest action to align with the α drone. Even if the weight of the α drone is further increased, the surrounding drones cannot take faster action. The existence of fluctuation is because the flocking is probabilistic but not too large-as mentioned, it does not exceed 5%. The more numerical simulations, the smaller the fluctuation. On the other hand, the number of drones within the interaction range of the α drone is limited due to the limitation of drone density, which constrains the α drone from influencing more drones in a given time. Consequently, although k α continues to increase, the model's performance cannot be further improved. At this moment, it is essential to adjust the value of k β to improve model performance.
To further study the influence of weighting values on model performance, we simultaneously adjust the values of k α and k β to observe system performance. The results are as shown in Figure 6 and Table 2.   Figure 6a shows that appropriate parameter settings reduce the convergence time greatly compared with the case of k α = 1, k β = 1. Weighting distinctly improves the system performance, which demonstrates its feasibility and significance. Besides, for any fixed k α , the system performance with k β ≥ 1 is superior to k β < 1, which verifies the rationality of k β > k γ .
For any fixed k β , the general trend is that the convergence time decreases as k α increases, which is consistent with the result shown in Figure 5. The movement direction of the α drone is the reference state, and the increase of its contribution intensity does not produce any interference information to the system or increase the weight of interference. The continuing increase of k α only becomes meaningless due to the existence of saturated performance, but does not negatively affect system performance.
For fixed k α , when k β is relatively small, the convergence time decreases as k β increases. Compared with γ drones, the proximity of the heading angle of β drones to the reference state is higher. When a β drone owns higher contribution intensity, it induces other drones to converge to the reference state more quickly. When b continues to increase, the system performance begins to fluctuate and there is no significant increase. Similar to the α drone, performance fluctuation and saturated performance are also adopted by the β drone. As shown in Figure 6b, when k α = 90, k β = 63, the system possesses global optimal performance, while when k α = 30, k β = 18, the system has achieved global saturation performance. For any fixed value of k α , the system has obtained local saturation performance when k β = k α . Therefore, it is not necessary for the α drone and β drones to be assigned with extremely large values because of the existence of the saturation performance, and it is sufficient that k α ≥ k β .
Furthermore, when k β k α -specifically, k β ≥ 4k α -the system convergence time increases sharply with k β increasing. For a given β drone, besides the α drone, other β drones in the interaction range also affect its movement direction. The movement direction of a β drone may be far from the reference state, and the increase of k β implies the improvement of the contribution intensity of possible interference information. The influence of the movement information of the α drone on the system becomes relatively insignificant due to the larger intensity of other interference information, and given that β drones and α drones align more slowly. Although the β drone induces the γ drone to align with it faster, the current movement direction of the β drone is not necessarily the exact reference state, which is not only meaningless but also may seriously reduce the system performance. Therefore, it is inappropriate that k β k α . Accordingly, to a certain extent, the rationality of k α > k β > k γ has been successfully proved.

Impacts of Interaction Radius
This section mainly discusses the influence of interaction radius on the model. Let   Figure 7 presents many clusters instead of a whole flocking under r = 0.1, r = 0.2, and r = 0.3, although after a long enough operation time. The α drone induces only a few drones to converge to the reference state, because too small an interaction radius limits the influence range of the α drone and the system is not globally connected. Most drones do not establish direct or indirect interaction with the α drone from start to finish, thereby they do not align with the α drone. The system only spontaneously forms several small clusters with their own unpredictable direction. Therefore, the whole system is unable to converge to the reference state.
As shown in Figure 8, when r ≤ 2.3, the convergence time decreases as the interaction radius increases. The drones that align with the α drone equivalently master the reference state. With the interaction radius increasing, more drones are capable of interacting with the α drone or the drones that master the reference state. This means that information of the reference state holds a larger influence range and a rapider transmission efficiency, and all drones are able to interact with the reference state in a shorter time. Therefore, the drones converge to the reference state more quickly.
Furthermore, when r > 2.3, the larger the interaction radius, the longer the convergence time, which is contrary to our ordinary perception. This is because the correction efficiency of the drone angle is reduced. Correction efficiency refers to the time taken for a drone to align to the reference state after interacting with drones that have mastered the reference state and depends on the intensity of the reference state relative to other interference information. The movement directions of β drones and γ drones are determined by the movement direction of the surrounding drones. It is impossible for all the surrounding drones to align with the α drone; the larger the drone interaction radius, the more interference information the drone may receive. If the contribution intensity of the reference state is not large enough, the drones are more likely to be induced by interference information, thus, increasing the convergence time. Considering this, the system performance can be further improved by adjusting k α and k β .
Therefore, an optimal interaction radius exists when other settings are fixed. r = 2.3 is the optimal radius under the current settings.

Flocking Navigation and Obstacle Avoidance
This section mainly verifies whether the swarm can achieve flocking navigation according to the WVEM-based flocking algorithm and if better obstacle avoidance can be realized.

The Performance of Flocking Navigation
As long as the movement direction of each drone aligns with the α drone within a short enough time, the drone swarm can move along the predetermined trajectory. This paper considers the flock composed of 15 drones, among which there is one α drone. The attraction term and repulsion term are added and the periodic boundaries removed. In this section, a certain amount of noise is added; let v 0 = 0.1, ρ = 4.102, r = 1, N = 15, c align = 1, r align = 1, c rep = 0.05, r rep = 0.7, c att = 0.001, r att = 1, η = 0.1. At the initial moment, the coordinates of the α drone are (0.9585,0.9585); both abscissa and ordinate of each drone obey a [0,1.9170] uniform distribution, and the heading angle obeys [0, 2π] uniform distribution. The α drone moves along a rectangular trajectory. The results are shown in Figure 9.
In Figure 9a, k α = 50, k β = 1; in Figure 9b, k α = 50, k β = 16. Figure 9a shows that some drones detach from the flocking and do not move along the predetermined trajectory, exposing the failure of flocking navigation. Figure 9b shows that in the presence of noise, all drones always align with and follow the α drone closely and move along the predetermined trajectory, which means the system achieves flocking navigation successfully and acquires a certain degree of robustness. To eliminate the influence of noise on system performance and obtain the general conclusions, ten trials of experiments are carried out for each set of parameter settings. For k α = 50, k β = 1, the average number of drones that can move along the predetermined trajectory is 10; for k α = 50, k β = 16, the average number is 15; all drones can achieve flocking navigation. It is obvious that a suitable weighting value plays a significant role for WVEM. The flocking navigation can be achieved based on WVEM if the parameter settings are appropriate.

The Performance of Obstacle Avoidance
In this paper, the layer regulation mechanism is an approach to improve the survival of the swarm when detecting the obstacle, rather than a specific avoidance mechanism. Therefore, the simple obstacle and corresponding avoidance mechanism are set, which are enough to verify the effectiveness of the layer regulation mechanism.
In this section, there is an infinitely long obstacle parallel to the y-axis at x = 13. The detection radius of the drone is equal to the interaction radius, r = 1. For drone i, when x i ≥ 12, it can detect the obstacle and adjust its movement direction to θ i = π/2. When the layer regulation mechanism is applicable, if β drones or γ drones detect the obstacle, they become α drones and own the reference state θ de i,s i =2 = π/2; if α drones detect the obstacle, they adjust reference state θ de i,s i =3 = π/2. When the layer regulation mechanism is not applicable, if β drones or γ drones detect the obstacle, their layers remain the same and they are still influenced by repulsion terms and attraction terms of other drones. Let v 0 = 0.1, ρ = 4.102, r = 1, N = 15, c align = 1, r align = 1, c rep = 0.05, r rep = 0.7, c att = 0.001, r att = 1, η = 0.1, k α = 50, k β = 16. At the initial moment, the coordinates of the α drone are (0.0959,0.0959), both abscissa and ordinate of other drones obey a [0,1.9170] uniform distribution, and the heading angles of all drones are 0. The results are shown in Figure 10. Figure 10 presents, in the presence of noise, the success of obstacle avoidance, during which no drone appears x i ≥ 13. When drones detect the obstacle, the faster the movement direction of the flocking converges to the reference state, the further the drones are away from the obstacle, and the safer the drones are. To evaluate the system performance in detail, the average distance between all drones and the obstacle within a certain period of time is proposed, which is calculated as follows: where x obs = 13, T dis = 20. The last 20 time-steps are selected for calculations. In Figure 10a, the layer regulation mechanism is applied, Dis = 1.562; in Figure 10b, the layer regulation is not applied, Dis = 1.314. It is obvious that the safe distance with the layer regulation mechanism is larger than without the mechanism. More drones becoming α drones can possess greater contribution intensity to induce the whole flocking to move in the direction conducive to survival. Therefore, the appropriate application of the layer regulation mechanism can improve system performance, and its robustness and practical value are verified. 13 (a) 13 (b) Figure 10. The trajectories of all drones under different conditions. (a) Applying layer regulation mechanism. (b) Not applying layer regulation mechanism.

Conclusions
This paper proposes WVEM based on VEM to achieve flocking navigation with only one drone mastering the path information. First, WVEM introduces the α drone, which occupies two characteristics: the specific movement direction and immunity to any other drone. It is verified that the existence of the α drone is the fundamental condition to achieve flocking navigation. Then, all drones are divided into three layers dynamically based on the interaction topology and the drones at different layers are assigned with different weights. The weighting rule is put forward to improve the alignment performance and has been proved through experiments. In addition, a detailed analysis about the influence of interaction radius on model performance is conducted. Then, the layer regulation mechanism is proposed to achieve the flexible obstacle avoidance. Thereby, the flocking navigation and obstacle avoidance composed of 15 drones are achieved, which confirm the reliability of the proposed model and the effectiveness of proposed mechanisms. In the future, we will map the settings to the real world, design a better approach to define the layers and assign weights to improve the model performance, and propose better order parameters to analyze flocking performance.