Research on Formation Recovery Strategy for UAV Swarms Based on IVYA-Nash Algorithm

Li, Junfang; Gu, Zexin; Zhang, Lei; Wang, Junchi

doi:10.3390/electronics14183653

Open AccessArticle

Research on Formation Recovery Strategy for UAV Swarms Based on IVYA-Nash Algorithm

by

Junfang Li

^1,2,

Zexin Gu

^1,2,

Lei Zhang

^3,* and

Junchi Wang

³

¹

School of Electrical Engineering and Automation, Tianjin University of Technology, Tianjin 300384, China

²

Tianjin Key Laboratory of New Energy Power Conversion, Transmission and Intelligent Control, Tianjin 300384, China

³

School of Artificial Intelligence and Data Science, Hebei University of Technology, Tianjin 300401, China

^*

Author to whom correspondence should be addressed.

Electronics 2025, 14(18), 3653; https://doi.org/10.3390/electronics14183653

Submission received: 12 August 2025 / Revised: 8 September 2025 / Accepted: 12 September 2025 / Published: 15 September 2025

Download

Browse Figures

Versions Notes

Abstract

Contemporary multi-UAV formations face dual challenges of obstacle avoidance and rapid formation recovery. To enable UAV swarms to efficiently restore their predefined configurations post-obstacle navigation, a formation recovery strategy grounded in Nash equilibrium game theory is proposed in this paper. By integrating the IVY optimization algorithm, a collaborative control model that systematically balances individual UAV interests with swarm-level objectives through carefully designed optimization criteria is established. Comparative experimental results demonstrate that, compared to traditional formation obstacle-avoidance algorithms, Improved Particle Swarm Optimization (IPSO), Ant Colony Optimization (ACO), and Genetic Algorithm (GA), our method exhibits superior performance across multiple key metrics, including average path length, formation accuracy rate, recovery time, and total time consumption. Real-flight tests on a multi-UAV platform confirm IVYA-Nash surpasses improved APF in formation accuracy and aerodynamic disturbance resistance, proving robustness in dynamic multi-agent scenarios. The work provides an efficient and reliable solution for coordinated control of UAV formations in complex environments.

Keywords:

UAV swarms; formation recovery; Nash equilibrium; IVY optimization; collaborative control; obstacle negotiation

1. Introduction

The formation control of multi-UAV systems has remained a research hotspot for decades. With expanding applications in military reconnaissance, logistics transportation, and disaster rescue [1,2,3,4,5], UAV formations face escalating challenges in obstacle-rich environments. These systems must not only execute efficient path planning but also maintain formation stability and mission coordination during obstacle avoidance requirements that impose extreme demands on cooperative control.

With the rapid development of UAV swarm applications, their use cases extend far beyond military reconnaissance and logistics. UAV swarms are increasingly employed in industrial inspections, agricultural monitoring, environmental sensing, disaster response, and UAV light shows, demonstrating the need for reliable formation control in diverse scenarios [6]. However, state-of-the-art (SOTA) swarm intelligence approaches such as PSO, ACO, GA, and reinforcement learning-based methods suffer from notable drawbacks when operating in highly dynamic and high-dimensional environments. For instance, PSO and GA typically face slow convergence and premature local optima, while APF-based methods often trap UAVs in local minima. These gaps directly motivate the development of our proposed IVYA-Nash framework [7,8,9].

Existing research on UAV formations has mainly focused on path planning, obstacle avoidance, and cooperative control using approaches such as game theory [6] and swarm intelligence algorithms [10,11,12,13,14]. For instance, Zhang et al. [13] demonstrated that an improved PSO (IPSO) can effectively handle 3D obstacle avoidance in multi-UAV formations. However, traditional PSO frameworks face inherent limitations in formation recovery: static inertia weights restrict adaptability to dynamic environments, which undermines both avoidance efficiency and the stability of formation restoration. Moreover, they often fail to converge to globally optimal solutions in real time.

Beyond PSO, other strategies attempt to enhance specific capabilities or adapt to specialized environments. Yu’s evolutionary RL improves exploration efficiency [14], Li’s hybrid SAQPSO-SA enhances target tracking [15], Wu’s DSCA-IRRT addresses urban path planning efficiency [16], and Kabore introduces onboard perception for GPS-denied control [17]. Zhu’s integration of APF with consensus control [18] directly tackles obstacle avoidance during recovery, while Wu’s HPIO [19] and Sun’s compensation look-ahead algorithm [20] improve cooperative avoidance and heading optimization in complex scenarios. Despite these advances, three critical gaps remain: (i) Nash-based methods often assume complete information, making them less effective under communication constraints [21]; (ii) path planning and formation control are frequently treated as separate problems, ignoring their dynamic coupling in obstacle-rich environments, which leads to issues such as GA’s slow convergence [22] or APF’s susceptibility to local minima [13,23]; and (iii) while some methods improve global search (e.g., IVY [10]), they do not systematically integrate dynamic obstacle avoidance, reducing robustness in multi-conflict scenarios [24].

Classic swarm intelligence algorithms such as GA, HS, ABC, and GWO have also been applied to UAV path planning. Yet, in high-dimensional, dynamic, cooperative game problems, they suffer from high computational overhead, parameter sensitivity, and a tendency to fall into local optima. For example, GA converges too slowly for real-time decision-making, and GWO struggles with complex constraints. These limitations underscore the need for an algorithm that can balance global exploration with local exploitation, while maintaining computational efficiency for real-time multi-UAV recovery.

To address these challenges, the paper proposes an IVYA-Nash cooperative optimization framework for UAV formation recovery, featuring three key innovations: First, the Ivy Algorithm (IVYA) solves non-cooperative game Nash equilibria through biomimetic growth mechanisms, balancing global exploration and local exploitation to overcome convergence bottlenecks in dynamic optimization [25]. Second, a multi-objective cost function fusion mechanism is developed that transforms the virtual structure method’s rigid constraints into elastic potential fields, enabling dynamic trade-offs between formation stability and obstacle avoidance flexibility. Third, a hierarchical decision architecture enhances swarm coordination through IVYA’s adaptive perturbation strategies.

Recent advances further highlight emerging directions in UAV swarm coordination. Bu et al. [26] reviewed formation control methods and outlined key challenges. Chen et al. [27] demonstrated that DRL can enhance swarm coverage and tracking through prior experience, while Zhao et al. [28] developed a distributed formation planning method emphasizing scalability. These works confirm the importance of reinforcement learning and distributed optimization. Building on this trend, our IVYA-Nash framework integrates game-theoretic decision-making with a metaheuristic optimizer to achieve robust and efficient formation recovery in dynamic environments.

The Ivy algorithm mimics Hedera helix’s phototropism and climbing growth patterns. Its stochastic perturbation vectors help escape local optima, while nonlinear convergence mechanisms excel in solving high-dimensional non-convex Nash equilibrium problems. Compared to PSO’s inertia updates and ACO’s genetic operations, IVYA demonstrates higher search efficiency in obstacle environments (as shown in Section 3), providing novel theoretical tools for real-time decision-making in complex game scenarios.

The key innovations of this work are threefold, each directly addressing limitations in existing methods, as illustrated in Figure 1.

(i): IVYA introduces biomimetic growth mechanisms that balance global exploration with local exploitation, overcoming convergence bottlenecks in PSO and ACO.
(ii): By integrating Nash equilibrium into formation recovery, the proposed framework systematically couples obstacle avoidance with formation maintenance, addressing the limitations of decoupled designs in prior works.
(iii): An elastic potential field mechanism is developed to enhance adaptability during obstacle negotiation, ensuring both recovery speed and formation stability in complex environments.

Overall, existing UAV swarm formation recovery methods can be classified into four categories. Classical control approaches are simple and efficient but often lack robustness. Swarm intelligence algorithms (PSO, ACO, GA, HS, GWO) offer global search but suffer from slow convergence. Game-theoretic methods enable distributed decision-making but usually assume perfect communication. Reinforcement learning approaches provide adaptability in dynamic settings but require heavy training and face generalization challenges. In contrast, our proposed IVYA-Nash framework combines the distributed optimization of game theory with the convergence efficiency of the Ivy Algorithm, achieving faster recovery and stronger robustness while remaining computationally feasible for real-time swarm operations.

2. Problem Formulation and Model Architecture

2.1. Problem Formulation

The Multi-UAV formation game problem studied in this paper can be described as: UAVs evaluate their expected costs based on environmental information, individual status and overall formation requirements in a UAV formation mission. And then determine the optimal action plans for all participants through game-theoretic interactions, aiming to rapidly reform the formation and reaching the designated destination.

For each UAV in the formation, its state can be represented as:

X_{i} = [\begin{array}{l} x_{i} & y_{i} & z_{i} \\ v_{x i} & v_{y i} & v_{z i} \end{array}],

(1)

where

x, y, z

denote the 3D coordinates of the

i - th

UAV, and

v_{x}, v_{y}, v_{z}

represent its velocity components along three axes. Let

N

denote the UAV swarm set. The action library for the next moment is constructed as:

Ω_{i} = [\begin{array}{l} x_{i}^{*} & y_{i}^{*} & z_{i}^{*} \\ v_{x i}^{*} & v_{y i}^{*} & v_{z i}^{*} \end{array}]

(2)

where

x_{i}^{*}, y_{i}^{*}, z_{i}^{*}

represent the 3D coordinates of the

i - th

UAV at the next moment, and

v_{x i}^{*}, v_{y i}^{*}, v_{z i}^{*}

denote its velocity values at the next moment. The drone’s action is as shown in Figure 2.

2.2. Construction of Formation Game Model

To establish the UAV formation recovery game model, we first construct the UAV action library, as illustrated in Figure 3. This study designs a three-dimensional UAV action library based on attitude and velocity control methods. The action library design is formulated as follows:

\{\begin{matrix} v_{x} = v \cos θ \cos φ \\ v_{y} = v \cos φ \sin θ \\ \begin{array}{l} v_{z} = v \sin φ \end{array} \end{matrix}

(3)

where

θ

is the angle between the

x y - p l a n e

and the projection of the UAV on the

x y

-plane,

φ

is the angle between the

z - a x i s

and the velocity of the UAV, and

v

is the velocity of the UAV. To prevent sudden changes in velocity caused by strategy selection issues, the following smoothing function is defined:

v_{s} (t) = α v_{s} (t) + (1 - α) \cdot v_{s} (t - Δ t)

(4)

where

v_{s} (t)

denotes the magnitude of velocity vector at time

t

,

v_{s} (t - Δ t)

is the velocity value at the previous moment, and

α \in [0, 1]

is the parameter value. The schematic diagram of the UAV action library is illustrated as follows.

2.3. Construction of Formation Game Model

The UAV formation restoration requires restoring the UAVs to their initial formation, as illustrated in Figure 4. The target points of UAVs after the overall movement of the formation are calculated using the virtual structure method, with the formula as follows:

P_{i} = P_{a} + o f f s e t \cdot r_{i}

(5)

where

P_{i} = (x_{i}, y_{i}, z_{i})

is the target position of the

i - th

UAV,

P_{a}

is the position of the virtual UAV after the change in the formation position, and

r_{i}

is the distance between the

i - th

UAV and the virtual UAV.

Constructing the UAV situation cost function is a crucial step in realizing the UAV formation game. The objective of the game problem proposed in this paper is to make the cost function converge to the Nash equilibrium point through a heuristic optimization algorithm. For each UAV participating in the game, the target cost function is as follows:

J_{i} = \frac{η_{1}}{N} \cdot ‖Ω_{i} - ω_{i} \cdot X_{i}‖ + \frac{η_{2}}{N} \cdot ‖P_{i} - ω_{i} \cdot \partial_{i}‖

(6)

where

P_{i}

denotes the target (desired) position UAV

i

,

\partial_{i}

is the current position of the UAV

i

,

Ω_{i}

is the action plan of the UAV

i

at the next time step,

X_{i}

is the action of the UAV i at the current moment, and

η_{1}

and

η_{2}

are both weight coefficients.

η_{1}

adjusts the weight of the action difference term, and

η_{2}

adjusts the weight of the position deviation term.

ω_{i}

is the weight coefficient of the UAV

i

, which adjusts the influence of the current action and position.

For each player participating in the game, the goal is to minimize the above-mentioned cost function. However, in this process, it is inevitable that the cost functions of other players will increase. To optimize the overall formation efficiency, it is necessary to construct the Nash equilibrium condition as the optimization objective. The Nash equilibrium in the UAV formation restoration problem proposed in this paper is defined as follows:

For the cost function

J_{i}

of each UAV, it depends not only on its own behavior but also on the behaviors of other UAVs. The main meaning is that no player can unilaterally change its behavior under the Nash equilibrium to reduce its cost. It is defined as:

η^{'} = col \{η_{1}^{*}, \dots, η_{k}^{*}\} \in Ω

is a Nash equilibrium solution if for any

i \in Ω_{i}

, the following condition is satisfied:

J_{i} (η_{i}^{*}, η_{- i}^{*}) ⩽ J_{i} (η_{i}, η_{- i}^{*}), \forall η_{i} \in Ω_{i}

(7)

where

Ω_{i}

is the set of admissible strategies for the

i - th

UAV. From the definition of the Nash equilibrium, it can be seen that once the strategies of UAVs form a Nash equilibrium, no UAV will unilaterally deviate from the Nash equilibrium, and the UAV formation will maintain flight in the achieved equilibrium state.

From the above, the formation game problem can be transformed into a non-cooperative game Nash equilibrium solving problem. The formation game can be described by a triple

Λ = {T, J (η), Ω}

, where

T

is the set of players,

Ω = \forall Ω_{i}

is the strategy space of the players participating in the game (

η \in Ω_{i} \subseteq R^{n}

is the strategy space of the

i - th

player),

η \in Ω_{i} \subseteq R^{n}

is the current strategy of the UAVs, and

J (η)

is the overall pay-off matrix, as shown below:

J (η) = [\begin{matrix} j_{1} (η_{1}) & \dots & j_{1} (η_{k}) \\ \dots & j_{i} (η_{j}) & \dots \\ j_{n} (η_{1}) & \dots & j_{n} (η_{k}) \end{matrix}]

(8)

where

j_{i} (η_{j})

is the cost function of the

i - th

player participating in the formation game.

Based on the above model construction, the Nash equilibrium is further solved. The next-step solutions of all UAVs participating in the game are the next-step actions of the UAVs. The construction process of the game model is shown in the following figure:

In Figure 5, d1, d2, and d3 represent the position differences of the drone from its position at the next moment, the position difference between the drone and the target point, and the distance between drones, respectively.

2.4. Nash Equilibrium Solution and Problem Transformation

The Ivy Algorithm was selected due to its superior ability to escape local optima and handle non-convex optimization problems. Inspired by Hedera helix’s phototropism and climbing behavior, IVYA achieves nonlinear convergence and high-dimensional optimization efficiency. In this study, we conducted comparative evaluations with GA, PSO, and ACO, which confirm the advantages of IVYA in terms of convergence speed, robustness, and recovery accuracy.

To solve this non-linear function, this paper introduces the chaos-enhanced IVY algorithm (Chaotic Ivy Optimizer, IVYA-C) to find the Nash equilibrium solution of the game problem. The algorithm mimics the growth process of ivy plants with chaotic dynamics. By combining chaotic perturbation, random exploration, and global-optimal guidance, it achieves an improved balance between local exploitation and global optimization. The state of each UAV is described by an optimization function incorporating chaotic factors. The IVYA-C algorithm dynamically adjusts the positions of UAVs using chaotic mapping, enabling individual actions to meet local requirements while effectively optimizing the overall objective. Its fitness function

f (l_{i})

is:

\min f (l_{i}) \leftarrow \min \sum_{j \neq i} {[J (η) - E (J (η))]}^{2}

(9)

where

J (η)

is the pay-off matrix,

E (•)

is the expected payoff of the payoff matrix. The steps of the Ivy optimization algorithm are as follows (Algorithm 1):

Algorithm 1 IVYA-C

1 Begin

2 Initialize the parameters of the Ivy algorithm:

I_{\min}, I_{\max}, I t e r_{\max}, N_{p o p},

and set the initial iteration number:

I t e r = 1

3 Initialize chaotic parameters:

μ = 4.0, p_{c h a o s} = 0.3

4 Initialize the population

\vec{I} = (I_{1}, I_{2}, \dots, I_{i}, \dots, I_{Npop})

using chaotic initialization:

for each individual

i

x = rand ()

for each dimension:

x = μ \cdot x \cdot (1 - x)

I_{i, d} = I_{\min} + rand \cdot (I_{\max} - I_{\min})

Store chaotic state

C_{i} = x

5 Calculate fitness values

f (l_{i})

for all individuals. Sort by fitness (high to low. Set optimal individual (

I_{b e s t} = I_{1}

)

6 while

I t e r \leq I t e r_{\max}

do

7 for

i = 1

to

N_{p o p}

do:

8 Chaotic perturbation with probability

p_{c h a o s}

if

rand () < p_{c h a o s}

:

C_{i} = μ \cdot C_{i} \cdot (1 - C_{i})

I_{i} = I_{i} + 0.1 \cdot (I_{\max} - I_{\min}) \cdot (C_{i} - 0.5)

I_{i} = \min (\max (I_{i}, I_{\min}), I_{\max})

end if

9 Calculate growth vector:

Δ g_{v_{i}} = |N (1, D)| \times (I_{i} - I_{best})

10 Calculate:

Δ g_{v_{i}}^{'} = r a n d \times Δ g_{v_{i}}

11 Select random neighbor

I_{i i}

from population

12 if (

f (I_{i i}) < f (I_{i})

) then

13

β = (2 + C_{i}) / 2 [Originally : (2 + rand) / 2]

14

I_{i}^{new} = I_{i} + β \times Δ g_{v_{i}}^{'} \times (I_{i i} - I_{i})

15 else

16

I_{i}^{new} = I_{i} + Δ g_{v_{i}}^{'} \times (C_{i} - 0.5) \times (I_{\max} - I_{\min})

17 end if

18 Evaluate new fitness

f (I_{i}^{n e w})

19 Calculate

Δ g_{v_{i}}^{''} = | N (1, D) | \times (I_{i}^{new} - I_{i})

20 Add

I_{i}^{n e w}

to merged population:

{\vec{I}}_{merged} = \{\vec{I}, {\vec{I}}^{new}\}

21 end for

22 Merge populations:

{\vec{I}}_{t o t a l} = \vec{I} \cup {\vec{I}}^{new}

23 Sort

{\vec{I}}_{t o t a l}

by fitness.

24 Select top

N_{p o p}

individuals.

25 Update

I_{b e s t} = I_{1}

(current best individual)

26

I t e r = I t e r + 1

27 end while

28 Output global optimal solution (drone optimal strategy)

29 End

In the described algorithm,

I_{i}

denotes the position of the

i - th

individual, where

rand

represents a uniformly distributed random number within the interval

[0, 1]

. The search space boundaries are defined by

[I_{\min}, I_{\max}]

. The term

Δ g_{v_{i}}

corresponds to the growth vector of the

i - th

individual, which guides its directional evolution during optimization. The operator

N (1, D)

generates normally distributed random numbers with mean 1 and dimension D, while

⊙

signifies element-wise multiplication between vectors. Through iterative optimization, this algorithm converges to the Nash equilibrium solution, which determines the optimal velocity strategy for each UAV. The resultant velocity control vector is formulated as:

v_{f o r m a t i o n, i} (t) = δ_{i}^{*}

(10)

where

δ_{i}^{*}

represents the optimized velocity strategy derived from the Ivy algorithm, directing UAVs toward their target positions while maintaining swarm coordination.

2.5. Obstacle Avoidance Controller Implementation

During UAV formation flight, sudden obstacles may emerge. Without timely obstacle avoidance and trajectory replanning, the mission may fail. The artificial potential field method is a common local path planning approach with advantages including low control requirements, high computational efficiency, good robustness, and smooth trajectory generation. However, it tends to trap UAVs in local optima. To address this limitation, this paper adopts an improved potential field method that combines the strengths of artificial potential fields and non-potential vector field methods. The key innovation lies in making the overall repulsive field orthogonal to the gravitational field under specific conditions, thereby resolving obstacle avoidance challenges and preventing UAV stagnation in local potential field minima.

The conventional APF method defines two fundamental forces, as shown in Figure 6a. Attractive Force towards the target:

Φ_{a, i} = - k_{a} (q_{i} - q_{goal})

(11)

where

k_{a} > 0

tunes attraction strength, and

q_{goal}

denotes the target position. Repulsive Force from obstacles:

Φ_{r, i} = \{\begin{array}{l} k_{r} (\frac{1}{d_{obs}} - \frac{1}{d_{safe}}) \frac{q_{i} - q_{obs}}{{‖q_{i} - q_{obs}‖}^{3}} & if d_{obs} \leq d_{safe} \\ 0 & otherwise \end{array}

(12)

where

q_{i}

and

q_{obs}

denote the 3D position vectors of the UAV and obstacle, respectively. The term

d_{o b s} = ‖q_{i} - q_{obs}‖

calculates their Euclidean separation distance. The repulsive force comprises three key components:

The non-potential orthogonal vector field method modifies the conventional potential field approach by retaining only the orthogonal components between repulsive and attractive forces when their angle exceeds 90 degrees, as shown in Figure 6a. For the

i - th

UAV, the angle

χ_{i} \in [0, π]

between attractive force

Φ_{a, i}

and repulsive force

Φ_{r, i}

is defined as:

\cos χ_{i} = \frac{Φ_{a, i}^{T} Φ_{r, i}}{‖Φ_{a, i}‖ ‖Φ_{r, i}‖}

(13)

According to the above, based on the angle function and the conventional potential field method function, the following potential field force function

Φ_{ovf, i}

can be obtained and is defined as:

Φ_{ovf, i} = \{\begin{cases} - sat (Φ_{a, i} + P_{a, i} \times sat (Φ_{r, i}, v_{m, i}^{'}), v_{m, i}) \cos χ_{i} \in [μ_{i}, 0] \\ - sat (Φ_{a, i} + sat (Φ_{r, i}, v_{m, i}^{'}), v_{m, i}) \cos χ_{i} \in [- 1, μ_{i}) \cup (0, 1] \end{cases}

(14)

P_{a, i} = I_{n} - \frac{Φ_{a, i} Φ_{a, i}^{T}}{{‖Φ_{a, i}‖}^{2}}

(15)

where

v_{m, i}

is the maximum speed of the UAV;

sat

is a limiting function to prevent the speed output value from exceeding

v_{m, i}

. It can be seen from the above formula that the field-free orthogonal vector field has a defect, that is, it is discontinuous at

\cos χ_{i} = μ

, which will lead to the non-existence of

Φ_{ovf, i}

. For UAVs, it will cause a sudden change in the desired speed command. Therefore, a smooth function is proposed and defined as

τ (x, d_{1}, d_{2}) = \{\begin{matrix} 0 & x \leq d_{1} \\ A x^{3} + B x^{2} + C x + D & d_{1} \leq x \leq d_{2} \\ 1 & d_{2} \leq x \end{matrix}

(16)

where

A = 2 / {(d_{1} - d_{2})}^{3}

,

B = - 3 (d_{1} + d_{2}) / {(d_{1} - d_{2})}^{3}

,

C = 6 d_{1} d_{2} / {(d_{1} - d_{2})}^{3}

,

D = (d_{1}^{3} - 3 d_{1}^{2} d_{2}) / {(d_{1} - d_{2})}^{3}

are parameters. Thus, an improved field-free orthogonal vector field method with a continuous transition process is proposed as

Φ_{movf, i} = \{\begin{matrix} - sat (Φ_{a, i} + P_{a, i} \times sat (Φ_{r, i}, v_{m, i}^{'}), v_{m, i}) & \cos χ_{i} \in [- 1, 0] \\ - sat (Φ_{a, i} + sat (Φ_{r, i}, v_{m, i}^{'}), v_{m, i}) & \cos χ_{i} \in (0, 1] \end{matrix}

(17)

where

P_{ma, i} = I_{n} - γ_{i} \frac{Φ_{a, i} Φ_{a, i}^{T}}{{‖Φ_{a, i}‖}^{2}}

,

γ_{i} = τ (\cos χ_{i}, μ_{i}, - \frac{2 \sqrt{5} μ_{i}}{5 α_{i}})

.

By using this orthogonal vector field method, obstacles in the formation path can be effectively avoided. Combined with the UAV formation methods proposed in the previous sections, the formation restoration and formation transformation of UAVs can be efficiently completed. Thus, the speed control input for obstacle avoidance can be obtained as:

v_{obs, i} = k_{v} \cdot Φ_{movf, i}

(18)

where

v_{obs, i}

is the obstacle-avoidance speed input,

Φ_{movf, i}

is the output of the potential field force function, and

k_{v}

is the speed adjustment gain, which adjusts the mapping intensity of the potential field force on the speed.

2.6. Implementation of the Total Controller

To achieve the efficient execution of UAV formation restoration and obstacle-avoidance tasks, this paper designs a comprehensive total controller that integrates formation restoration control and obstacle-avoidance control. By using an optimization algorithm to solve the Nash equilibrium of the multi-UAV system, the optimal control strategy for each UAV is generated. The core objective of the total controller is to calculate the optimal control input for each UAV, enabling them to move coordinately towards the target position in a dynamic environment and quickly avoid obstacles when encountered, thereby maintaining the stability and safety of the formation.

The control input of the total controller for each UAV is:

u_{i} (t) = α v_{f o r m a t i o n, i} + (1 - α) v_{obs, i}

(19)

where

v_{formation, i}

is the control input for formation restoration;

v_{obstacle, i}

is the control input for obstacle avoidance; and

α \in [0, 1]

is an adjustment factor that controls the priority of the formation restoration and obstacle-avoidance tasks. The proof of the controller’s stability is detailed in Appendix A. The algorithm flow is shown in Figure 7.

3. Experimental Comparison

The method proposed in this paper is verified by digital simulation platforms and real-flight tests. The first step is the digital simulation verification. Multiple programming languages such as Python3.11, MATLAB R2024a, and C++ on the Windows platform are used to conduct digital simulation verification of the designed algorithm. In combination with the RflySim3D simulation platform, the verification results are fully demonstrated.

The Rflysim simulation platform consists of flight control modules, Coptersim modules, 3D display modules, swarm control modules, and vision control modules. This platform can provide UAV flight simulations based on the UE engine, and can simulate the flight control controller environment and control command transmission in real time. The architecture of the Rflysim platform and its 3D simulation are shown in the following Figure 8.

In the process of solving the Nash equilibrium, the parameter settings of the solution algorithm in this paper are as follows (Table 1):

The key parameters of the IVYA-C algorithm, including the population size, maximum iterations, and chaos probability, were determined through a series of preliminary experiments to balance solution quality and computational cost. For instance, the chaos probability was set to a small value (e.g., 0.03) to introduce sufficient randomness for escaping local optima without disrupting the convergence process. The number of iterations was set to ensure convergence across all benchmark algorithms, providing a fair basis for comparison. A detailed sensitivity analysis of these parameters is a valuable direction for future work to fully characterize their impact on performance.

This paper conducts multiple types of formation method comparison experiments, mainly comparing the formation recovery ability and formation transformation ability of UAVs. The algorithm comparisons include the leader–follower method and the virtual structure method. The experiments include comparisons of path length, time, and convergence speed.

The formation accuracy rate

γ

in this paper is defined by the following formula:

γ = (1 - \frac{1}{N} \sum_{i = 1}^{N} (\frac{∥ P_{i} - P_{i}^{*} ∥}{P_{i}^{*}})) \times 100 %

(20)

where

P_{i}

is the position of the

i - th

UAV, and

P_{i}^{*}

is its target position.

3.1. Simulation Comparison Experiment of Formation Restoration

To comprehensively verify the effectiveness of the UAV formation restoration strategy proposed in this paper, digital simulation experiments were carried out using MATLAB. The experiments involved multiple comparative tests, benchmarking our proposed method against the traditional improved APF method and three other widely used swarm intelligence algorithms: the Genetic Algorithm (GA), Improved Particle Swarm Optimization (IPSO), and Ant Colony Optimization (ACO). In addition, 3D simulation experiments and real-flight verifications were conducted to evaluate the adaptability and robustness of the algorithm.

The experimental parameter settings and experimental results are as follows (Table 2):

The experimental results indicate that the algorithm proposed in this paper outperforms other algorithms in terms of formation recovery speed. Not only is its convergence speed slightly faster than other algorithms, but it can also fully meet the computing power requirements of conventional UAV equipment. Overall, compared with other algorithms, the IVYA-Nash algorithm better aligns with the operational needs of current UAV formations. Comparisons of fitness values and experimental simulation results are presented in Figure 9 and Figure 10.

As illustrated in the convergence comparison (Figure 9), the proposed IVYA-Nash algorithm achieves the fastest convergence speed and the lowest steady-state error among all tested methods. While PSO, ACO, and GA converge slowly and often stagnate near local optima, IVYA-Nash rapidly reduces the objective function value within the first 10 iterations and maintains superior stability thereafter. This demonstrates its strong global search capability and efficiency in avoiding local minima.

Figure 9b presents the objective function landscape. The results show that IVYA-Nash converges closest to the global optimum with concentrated solutions, while the other algorithms exhibit more scattered distributions around suboptimal regions. This highlights the robustness and precision of IVYA-Nash in complex optimization spaces.

As can be seen from Table 2 and Figure 10, to verify the performance of the method proposed in this paper in UAV formation obstacle-avoidance tasks, the experimental results were compared against the improved APF method, the Genetic Algorithm (GA), Particle Swarm Optimization (IPSO), and Ant Colony Optimization (ACO). A comprehensive evaluation was carried out based on four indicators: average path length, formation accuracy rate, recovery time, and total time consumption.

The proposed algorithm demonstrates strong overall advantages across several key performance metrics. In terms of Recovery Time, the proposed method is the fastest at 6.4 s, significantly outperforming GA (7.1 s), IPSO (7.2 s), Improved APF (7.6 s), and ACO (7.8 s), reflecting its highly efficient real-time decision-making capability. Similarly, its Total Time Consumption of 15.42 s is considerably lower than all other algorithms, proving its excellent computational efficiency.

Regarding Formation Accuracy Rate, the proposed method achieves 95.63%, second only to the Improved APF (96.51%) but markedly better than ACO (94.51%) and significantly higher than IPSO (92.91%) and GA (92.42%), indicating its superior ability to maintain formation integrity during dynamic recovery.

As for the Average Path Length, the 27.05 m of the proposed method is slightly longer than that of Improved APF (25.70 m), GA (26.10 m), and IPSO (25.91 m). This can be seen as a reasonable trade-off: the algorithm sacrifices the shortest possible path in exchange for much faster decision-making and recovery speed, while also ensuring a higher degree of formation accuracy. As can also be observed from the trajectories in Figure 10, the path of the proposed method is smoother, avoiding sharp turns and formation disturbances that might result from an excessive focus on the shortest path.

Overall, the experimental data demonstrates that, compared to other benchmark algorithms including GA, the proposed IVYA-Nash method strikes the best balance between recovery speed, computational efficiency, and formation robustness, resulting in superior overall performance.

On the basis of completing the simulation experiments, this study further conducted 3D simulation and real-flight experiments to verify the feasibility and effectiveness of the proposed formation cooperative control strategy based on non-cooperative games in actual scenarios. The experiments were carried out in an indoor flight laboratory. The experimental environment was similar to that of the simulation experiments, with multiple static and dynamic obstacles set up to simulate UAV formation tasks in complex environments.

This paper first simulated an industrial warehousing environment where the passage width was narrow, and UAVs needed to maintain a relatively close distance to complete the obstacle-avoidance task. The design intention of this scenario was to verify the inter-UAV obstacle-avoidance ability of UAVs in high-density environments and the stability of the formation during group flight.

The UAVs needed to maintain a column formation to pass through the narrow passage while avoiding mutual interference. During this process, individual UAVs might need to make minor altitude or speed adjustments to ensure the overall passage of the formation. In a narrow space, a minor deviation of an individual UAV might lead to the instability of the overall formation structure. Therefore, strict path control and dynamic obstacle-avoidance strategies were required. The simulation diagram is shown as follows in Figure 11.

3.2. Verified by Real Flight Experiments

The experiment was conducted with 20 small quad-rotor UAVs. Each UAV was equipped with a high-precision positioning system and a communication module to ensure real-time acquisition of position and status information in an indoor environment. The scene was set up according to the MATLAB simulation experiment, with obstacles added in the middle. During the real-flight tests, the multi-UAV system executed the designated formation obstacle-avoidance task. In this process, more than ten UAVs successfully avoided obstacles and subsequently re-established the formation. The real-flight experiment was carried out with the following parameters and initial conditions: the radius of each UAV was 0.15 m, the row-to-row distance of the UAV formation was 1.2 m, the distance between adjacent UAVs in each row was 1 m, and the moving speed was 0.3 m/s. At the beginning of the real-flight, all UAVs took off in place and entered the target positions after the formation started.

In the real-flight experiments, the communication delay among UAVs was maintained within 15 ms, which ensured reliable real-time interaction for formation recovery. All experiments were conducted in an indoor environment without electromagnetic interference, thereby providing stable communication links. Under these conditions, the proposed IVYA-Nash framework demonstrated effective obstacle avoidance and rapid formation recovery.

During the real-flight process, the UAVs were initially set to take off at different positions and automatically form a square formation. As can be seen from the following group of figures, the UAV formation could perform adaptive obstacle-avoidance when encountering obstacles and could adaptively restore the original formation and continue to move after obstacle-avoidance. This shows that the method proposed in this paper can be applied to actual UAV flights.

This study constructed a narrow channel experimental environment based on the Rflysim3D simulation platform, with field flight tests validating the formation compression capability of the proposed algorithm. As demonstrated in Figure 12c, the algorithm successfully achieved a dual-layer formation flight mode.

Figure 13 provides a detailed comparison between the proposed IVYA-Nash method and the conventional Artificial Potential Field (APF) approach. Subplots (a) and (b) illustrate the 3D flight trajectories of 20 UAVs under both methods. With APF (subplot a), multiple UAVs exhibit unstable flight and collision tendencies, leading to eventual crashes due to aerodynamic interference, especially among vertically stacked drones. In contrast, our method (subplot b) ensures smoother trajectories and stable convergence toward the target points, maintaining the swarm formation without collisions. Subplots (c) and (d) further present the individual flight paths of UAV #20 under APF and our method, respectively. The APF-based trajectory (subplot c) shows significant oscillations and instability, while the IVYA-Nash trajectory (subplot d) remains smooth and successfully reaches the target. These results highlight the superior robustness, safety, and autonomous formation recovery capability of the proposed approach in cooperative multi-UAV control.

Figure 14 quantitatively characterizes the terminal positioning accuracy during formation assembly through box plots, where the box spans reveal error distribution characteristics. Experimental results demonstrate that our method achieves an average total-distance positioning error of 5.39 cm, representing a 94.5% accuracy improvement over the conventional potential field method’s 99.1 cm. Error distribution analysis shows our method maintains a stable median error of 5.12 cm, while the baseline exhibits significant outliers. This validates the proposed method’s millimeter-level precision and robust anti-interference capability in terminal positioning phases.

During practical flight operations, environmental uncertainties-including intermittent communication delays, stochastic aerodynamic disturbances, and illumination-dependent visual positioning artifacts-introduce systematic errors in formation control. To rigorously evaluate performance, we define the formation accuracy rate η as the percentage of UAVs satisfying positioning error threshold

γ

:

γ = (\frac{\sum_{i = 1}^{N} I (d_{i} \leq T)}{N}) \times 100 %

(21)

where

d

denotes the minimum approach distance (in meters) for the

i - t h

UAV,

T

represents the allowable maximum error threshold (in meters), and

N

is the total number of UAVs.

I (•)

represents the indicator function, which equals 1 if the specified condition is satisfied and 0 otherwise.

As evidenced by Table 3, the proposed algorithm demonstrates superior performance over the APF method across three distinct error threshold scenarios (0.3 m/0.15 m/0.1 m). Remarkably, it achieves 100% formation accuracy rate when the positioning error threshold is set at 0.3 m. Field experiments further confirm the methodology’s practical efficacy, showing greater performance advantages compared with other simulation-tested approaches, thereby substantiating its engineering applicability.

4. Discussion

While the experimental results validate the effectiveness of the IVYA-Nash algorithm for a 20-UAV swarm, it is important to discuss the computational complexity and scalability for larger formations. The time complexity of the proposed algorithm is primarily determined by the IVYA-C optimization process, which is on the order of O(

T \times M \times D

), where T is the number of iterations,

M

is the population size, and

D

is the problem dimension (

D = 3 N

for a swarm of

N

UAVs).

For larger swarms (e.g.,

N > 50

), the linear increase in dimension

D

will lead to a corresponding increase in computation time. However, the distributed nature of the game-theoretic model allows for potential parallelization, where subsets of UAVs could compute their strategies concurrently, mitigating the scalability challenge. The current implementation on standard hardware demonstrates real-time feasibility for the tested swarm size, but deployment on larger swarms would likely require more powerful onboard processors or edge computing resources to maintain the same rapid decision-making performance.

5. Conclusions

This paper proposes a UAV formation restoration strategy based on the Nash equilibrium game theory and constructs a UAV collaborative control model by combining with the IVY optimization algorithm. By constructing a reasonable game model to balance the interests of individual UAVs and the overall formation, this method can ensure that the UAV formation can quickly restore its formation and efficiently complete obstacle avoidance tasks in complex environments. The Ivy algorithm avoids the problem of local optimal solutions through global search during the optimization process, enabling the UAV formation to better adapt to changes in obstacles and target adjustments in dynamic environments.

The experimental results verify the superiority of this method in multiple performance indicators. Especially in terms of path planning, formation restoration accuracy, restoration speed, and obstacle avoidance efficiency, it performs significantly better than improved APF, the Particle Swarm Optimization (PSO) algorithm, and the Ant Colony Optimization (ACO). In particular, the combination of the game model based on the Nash equilibrium and the Ivy algorithm enables UAVs to effectively improve task execution efficiency and robustness while ensuring the stability of the formation.

In the real flight experiment, real-flight comparative tests conducted on a multi-UAV collaborative platform demonstrated that the proposed LVYA-Nash algorithm significantly outperforms the conventional Artificial Potential Field (APF) method in core performance metrics, including formation accuracy maintenance and resistance to aerodynamic disturbances from neighboring aircraft. These results validate its robustness advantages in highly dynamic multi-agent interaction scenarios.

Future work will focus on further optimizing the computational efficiency of the Ivy algorithm and exploring its applications in larger-scale and multi-task complex environments. To further enhance the external validity of our findings, future experiments should include outdoor trials in more challenging, unstructured environments, particularly under GPS-denied or windy conditions. In addition, the real-time performance and adaptability of the algorithm will also be the key points for further improvement, aiming to provide a more efficient solution for multi-UAV collaborative operations.

Limitations and Future Work: While the proposed IVYA-Nash algorithm demonstrates strong performance in both simulations and real-flight experiments, several limitations remain. First, the current tests were conducted in controlled indoor environments, and large-scale outdoor experiments have not yet been implemented. Second, potential collisions with dynamic external entities such as other UAVs or birds were not modeled.

Future work will therefore focus on (i) optimizing the computational efficiency of IVYA, (ii) conducting large-scale outdoor swarm experiments, (iii) incorporating dynamic obstacle modeling, and (iv) integrating reinforcement learning methods to enhance adaptability.

Author Contributions

Writing—review and editing, J.L.; writing—review and editing, Z.G.; methodology, Z.G.; Conceptualization, L.Z.; visualization, J.W. and Z.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Algorithmic Convergence Analysis

To theoretically ensure the stability and reliability of the proposed IVYA-C algorithm in solving for the Nash equilibrium, this section will demonstrate, through a series of mathematical theorems, that the algorithm converges to the global optimum with probability 1.

The proof primarily relies on the theory of stochastic functional analysis. First, we define the solution space and its mathematical properties.

Theorem A1.

Let

S

be the solution space of the algorithm. For any two solutions

I_{i}, I_{j} \in S

, let the distance metric be defined as

S (I_{i}, I_{j}) = |f (I_{i}) - f (I_{j})|

(A1)

where f is the objective function. Then,

(S, d)

constitutes a complete and separable metric space.

Proof.

To prove that

(S, d)

is a metric space, it must satisfy the axioms of positive-definiteness, symmetry, and the triangle inequality. The distance metric

S (I_{i}, I_{j})

is inherently non-negative and equals zero if and only if the solutions are equivalent in the objective space

f_{i} = f_{j}

thus fulfilling positive-definiteness. Symmetry is also evident from the properties of the absolute value, as

S (I_{i}, I_{j}) = |f (I_{i}) - f (I_{j})| = |f (I_{j}) - f (I_{i})| = d (I_{j}, I_{i})

(A2)

For any third solution

I_{k}

, the triangle inequality

S (I_{i}, I_{k}) \leq d (I_{i}, I_{j}) + d (I_{j}, I_{k})

(A3)

also holds. Therefore, (S, d) is a metric space. Since the solution space S is a bounded and closed set in Euclidean space

ℝ^{n}

and

f

is a continuous function, any Cauchy sequence generated by the elitist strategy must converge to a point within

S

; thus, the space is complete. As

S

is bounded, a countable dense subset can be found, making the space separable. This concludes the proof of Theorem A1. □

Theorem A2.

Let

ϕ (ω, I_{k})

be the operator representing one iteration update of the IVYA-C algorithm on an individual

I_{k}

under a random event

w

. Due to the elitist selection strategy, this operator is a random compression operator.

Proof.

The IVYA-C algorithm, in the final stage of each iteration (Algorithm 1, Steps 22–24), merges the parent and offspring populations and selects the individuals with the best fitness for the next generation. Let

I_{b e s t}^{t}

be the best solution at iteration t. The best solution of the next generation

I_{b e s t}^{t + 1}

will necessarily satisfy

f (I_{best}^{t + 1}) \leq f (I_{best}^{t})

(A4)

This implies that the expected distance from the population to the global optimum

I^{*}

does not increase. This property conforms to the definition of a random compression operator. This concludes the proof of Theorem A2. □

Theorem A3.

Let

\{I_{best}^{t}\}

be the sequence of best solutions generated by the IVYA-C algorithm, and let

S^{*}

be the set of global optimal solutions. Then, the algorithm converges to the global optimum with probability 1, i.e.,

P (\lim_{t \to \infty} I_{best}^{t} \in S^{*}) = 1

(A5)

Proof.

According to random search theory, an algorithm converges globally with probability 1 if it satisfies two assumptions: elitism and global reachability. Assumption 1 (Elitism) is satisfied by Theorem 2, as the algorithm ensures

f (I_{best}^{t + 1}) \leq f (I_{best}^{t})

(A6)

Assumption 2 (Global Reachability) requires that the algorithm must be able to explore any region A (where

v (A) > 0

) of the solution space

S

after a sufficient number of iterations. IVYA-C ensures this through two mechanisms: the chaotic perturbation mechanism (Step 8) leverages the ergodicity of chaotic maps, allowing an individual to escape any local optimum with a non-zero probability; and the growth vector (Step 9) incorporates randomness, permitting exploration in any direction. These mechanisms jointly guarantee that for any region A, the probability of generating a new solution within it,

μ_{t} (A)

, is always greater than zero. According to random search theory, this condition ensures the algorithm will not be permanently confined to any subset of the solution space, thus satisfying the assumption. Since the IVYA-C algorithm satisfies both assumptions, this concludes the proof of Theorem A3. □

This theoretical analysis provides strong mathematical support for the stability and reliability of the proposed algorithm.

Appendix A.2. Stability Analysis

Consider a swarm of n UAVs governed by double-integrator dynamics:

\{\begin{array}{l} {\dot{p}}_{i} = v_{i} \\ {\dot{v}}_{i} = u_{i} \end{array}

(A7)

where

p_{i} = {[x_{i}, y_{i}, z_{i}]}^{T}

and

v_{i}

denote position and velocity vectors. Under the composite controller:

u_{i} (t) = α v_{f o r m a t i o n, i} + (1 - α) v_{obs, i}

(A8)

with

v_{f o r m a t i o n, i}

being the Nash equilibrium solution from IVYA algorithm (Algorithm 1) and

v_{obs, i} = k_{v} \cdot \overset{•}{Φ_{movf, i}}

the obstacle avoidance vector (Equations (17) and (18)), the closed-loop system is asymptotically stable at the desired formation configuration.

Proof.

The paper constructs the candidate Lyapunov function:

V = \sum_{i = 1}^{n} J_{i} + \sum_{i = 1}^{n} \sum_{o \in O_{i}} Ψ (d_{i o}) + \frac{1}{2} \sum_{i = 1}^{n} {∥ v_{i} - v_{formation, i} ∥}^{2}

(A9)

Formation potential

v_{formation, i}

based on objective function (Equation (6)):

J_{i} = \frac{η_{1}}{N} \cdot ‖Ω_{i} - ω_{i} \cdot X_{i}‖ + \frac{η_{2}}{N} \cdot ‖P_{i} - ω_{i} \cdot \partial_{i}‖

(A10)

which satisfies

J_{i} \geq 0

with

J_{i} = 0

if and only if UAV

i

\cos χ_{i} \in [μ_{i}, 0]

reaches its target position

P_{i}

with consistent action

Ω_{i}

. The obstacle potential term

v_{obs, i}

utilizes a repulsive potential field:

Ψ (d_{i o}) = k_{o} {(\frac{1}{d_{i o} - r_{safe}} - \frac{1}{d_{\max} - r_{safe}})}^{2}, d_{i o} < d_{\max}

(A11)

where

d_{i o} = ∥ p_{i} - p_{o} ∥

denotes the distance to obstacle o ∈ Oi,

r_{safe}

is the safety margin, and

k_{o} > 0

. The velocity tracking term

\sum_{i = 1}^{n} {∥ v_{i} - v_{formation, i} ∥}^{2}

penalizes deviations from the formation velocity.

In summary,

V > 0

holds for all non-equilibrium states, and

V (x^{*}) = 0

if and only if the system is in the desired formation configuration.

The time derivative of

V

along system trajectories is given by:

\dot{V} = \sum_{i = 1}^{n} [\nabla_{p_{i}} J_{i}^{T} v_{i} + \nabla_{v_{i}} J_{i}^{T} {\dot{v}}_{i}] + \sum_{i = 1}^{n} \sum_{o \in O_{i}} Ψ ˙ (d_{i o}) + \sum_{i = 1}^{n} {(v_{i} - v_{formation, i})}^{T} ({\dot{v}}_{i} - {\dot{v}}_{formation, i})

(A12)

for the formation potential derivative, the Nash equilibrium property of IVYA ensures

\nabla_{v_{i}} J_{i}^{T} v_{formation, i} = 0

, leading to

{\dot{J}}_{i} \leq - k_{1} ∥ \nabla J_{i} ∥^{2} + \nabla_{v_{i}} J_{i}^{T} u_{i}

.

The obstacle potential derivative follows:

Ψ ˙ (d_{i o}) = - \frac{2 k_{o}}{{(d_{i n} - r_{rsf})}^{3}} {(p_{i} - p_{o})}^{T} v_{i}

(A13)

while the fundamental orthogonal property of NPOVF (Equation (17)) ensures

{(\nabla_{p_{i}} Ψ)}^{T} v_{obs, i} = 0

when. Velocity smoothing (Equation (4)) provides bounded formation acceleration

∥ {\dot{v}}_{formation, i} ∥ \leq k_{2} ∥ v_{i} - v_{formation, i} ∥

.

Combining these results and substituting the controller yields:

\begin{array}{l} \dot{V} \leq & - \sum_{i = 1}^{n} k_{1} ∥ \nabla J_{i} ∥^{2} + (1 - α) \sum_{i = 1}^{n} [\nabla_{v_{i}} J_{i}^{T} v_{obs, i} + {(\nabla_{p_{i}} V_{obstacle})}^{T} v_{obs, i}] \\ - \sum_{i = 1}^{n} k_{3} {∥ v_{i} - v_{formation, i} ∥}^{2} \end{array}

(A14)

The orthogonal decomposition property of NPOVF nullifies the cross terms, resulting in:

\dot{V} \leq - \sum_{i = 1}^{n} (k_{1} {∥ \nabla J_{i} ∥}^{2} + k_{3} {∥ v_{i} - v_{formation, i} ∥}^{2}) \leq 0

(A15)

with equality holding exclusively at equilibrium.

Since

V

is positive definite and radially unbounded while

V

is negative semi-definite, LaSalle’s invariance principle guarantees asymptotic convergence to the desired formation.

To verify whether the Lyapunov stability guarantees still hold under non-ideal conditions, we inject observation noise and communication delay into the double-integrator model. The following perturbations are considered:

Position measurement noise:

n_{p} \sim N (0, {0.05}^{2} m^{2})

Velocity measurement noise:

n_{p} \sim N (0, {0.05}^{2} m^{2})

One-step communication delay:

τ = 20 ms

The measured states become:

{\hat{p}}_{i} (t) = p_{i} (t) + n_{p}, {\hat{v}}_{i} (t) = v_{i} (t) + n_{v}

(A16)

Substituting these into the Lyapunov function (22) yields:

{\dot{V}}_{noise} = {\dot{V}}_{ideal} + Δ V_{noise}

(A17)

where

∥ Δ V_{noise} ∥ \leq ε = λ_{\max} (A_{v}) (0.05 + 0.02 ∥ K_{v} ∥) \approx 0.18

. If the gain matrices

K_{p}, K_{v}

are chosen such that

λ_{\min} (Q) - ε > 0

(A18)

then

{\dot{V}}_{noise} \leq 0

holds, ensuring practical stability under noisy and delayed environments. □

References

Cheng, C.; Sha, Q.; He, B.; Li, G. Path planning and obstacle avoidance for AUV: A review. Ocean Eng. 2021, 235, 109355. [Google Scholar] [CrossRef]
Chiriatti, G.; Palmieri, G.; Scoccia, C.; Palpacelli, M.C.; Callegari, M. Adaptive obstacle avoidance for a class of collaborative robots. Machines 2021, 9, 113. [Google Scholar] [CrossRef]
Duhé, J.F.; Victor, S.; Melchior, P. Contributions on artificial potential field method for effective obstacle avoidance. Fract. Calc. Appl. Anal. 2021, 24, 421–446. [Google Scholar] [CrossRef]
Cao, X.; Ren, L.; Sun, C. Research on obstacle detection and avoidance of autonomous underwater vehicle based on forward-looking sonar. IEEE Trans. Neural Netw. Learn. Syst. 2022, 34, 9198–9208. [Google Scholar] [CrossRef] [PubMed]
Qi, J.; Guo, J.; Wang, M.; Wu, C.; Ma, Z. Formation tracking and obstacle avoidance for multiple quadrotors with static and dynamic obstacles. IEEE Robot. Autom. Lett. 2022, 7, 1713–1720. [Google Scholar] [CrossRef]
Sangaiah, A.K.; Anandakrishnan, J.; Meenakshisundaram, V.; Rahman, M.A.A.; Arumugam, P.; Das, M. Edge-IoT-UAV Adaptation Toward Precision Agriculture Using 3D-LiDAR Point Clouds. IEEE Internet Things Mag. 2025, 8, 19–25. [Google Scholar] [CrossRef]
Sun, X.; Pan, S.; Bao, N.; Liu, N. Hybrid ant colony and intelligent water drop algorithm for route planning of unmanned aerial vehicles. Comput. Electr. Eng. 2023, 111, 108957. [Google Scholar] [CrossRef]
Li, Z.; Luo, Y. Deep reinforcement learning for Nash equilibrium of differential games. IEEE Trans. Neural Netw. Learn. Syst. 2024, 36, 2747–2761. [Google Scholar] [CrossRef]
Wen, Y.; Yang, Y.; Luo, R.; Wang, J.; Pan, W. Probabilistic recursive reasoning for multi-agent reinforcement learning. arXiv 2019, arXiv:1901.09207. [Google Scholar] [CrossRef]
Ghasemi, M.; Zare, M.; Trojovský, P.; Rao, R.V.; Trojovská, E.; Kandasamy, V. Optimization based on the smart behavior of plants with its engineering applications: Ivy algorithm. Knowl.-Based Syst. 2024, 295, 111850. [Google Scholar] [CrossRef]
He, H.; Quan, S.; Sun, F.; Rao, R.V.; Trojovská, E.; Kandasamy, V. Model predictive control with lifetime constraints based energy management strategy for proton exchange membrane fuel cell hybrid power systems. IEEE Trans. Ind. Electron. 2020, 67, 9012–9023. [Google Scholar] [CrossRef]
Ahmed, G.; Sheltami, T.; Ghaleb, M.; Hamdan, M.; Mahmoud, A.; Yasar, A. Energy-efficient internet of drones path-planning study using meta-heuristic algorithms. Appl. Sci. 2024, 14, 2418. [Google Scholar] [CrossRef]
Zhang, H.; Gan, X.; Li, S.; Chen, Z. UAV safe route planning based on PSO-BAS algorithm. J. Syst. Eng. Electron. 2022, 33, 1151–1160. [Google Scholar] [CrossRef]
Zhu, K.; Han, B.; Zhang, T. Multi-UAV distributed collaborative coverage for target search using heuristic strategy. Guid. Navig. Control 2021, 1, 2150002. [Google Scholar] [CrossRef]
Kabore, K.M.; Güler, S. Distributed formation control of drones with onboard perception. IEEE/ASME Trans. Mechatron. 2021, 27, 3121–3131. [Google Scholar] [CrossRef]
Yu, J.; Zhang, Y.; Sun, C. Balance of exploration and exploitation: Non-cooperative game-driven evolutionary reinforcement learning. Swarm Evol. Comput. 2024, 91, 101759. [Google Scholar] [CrossRef]
Wu, Y.; Low, K.H. Discrete space-based route planning for rotary-wing UAV formation in urban environments. ISA Trans. 2022, 129, 243–259. [Google Scholar] [CrossRef] [PubMed]
Li, K.; Han, Y.; Yan, X. Distributed multi-UAV cooperation for dynamic target tracking optimized by an SAQPSO algorithm. ISA Trans. 2022, 129, 230–242. [Google Scholar] [CrossRef] [PubMed]
Zhu, L.; Ma, C.; Li, J.; Lu, Y.; Yang, Q. Connectivity-maintenance UAV formation control in complex environment. Drones 2023, 7, 229. [Google Scholar] [CrossRef]
Wu, H.; Duan, H. Hierarchical Pigeon Inspired Optimization Based Multi-UAV Obstacle Avoidance Control. Aerosp. Sci. Technol. 2025, 159, 109963. [Google Scholar] [CrossRef]
Ding, Z.; Su, D.; Liu, Q.; Jin, C. A deep reinforcement learning approach for finding non-exploitable strategies in two-player atari games. arXiv 2022, arXiv:2207.08894. [Google Scholar] [CrossRef]
Dewangan, R.K.; Saxena, P. Three-dimensional route planning for multiple unmanned aerial vehicles using Salp Swarm Algorithm. J. Exp. Theor. Artif. Intell. 2023, 35, 1059–1078. [Google Scholar] [CrossRef]
Avanzato, R.; Beritelli, F.; Raciti, F.; Spataro, E. A Game Theory-based Flight Strategy to Energy Efficient UAV-Femtocell Geolocation Systems. IEEE Trans. Aerosp. Electron. Syst. 2024, 60, 7903–7916. [Google Scholar] [CrossRef]
Li, S.; Fang, X.A. modified adaptive formation of UAV swarm by pigeon flock behavior within local visual field. Aerosp. Sci. Technol. 2021, 114, 106736. [Google Scholar] [CrossRef]
Battiston, A.; Sharf, I.; Nahon, M. Attitude estimation for collision recovery of a quadcopter unmanned aerial vehicle. Int. J. Robot. Res. 2019, 38, 1286–1306. [Google Scholar] [CrossRef]
Bu, Y.; Yan, Y.; Yang, Y. Advancement Challenges in UAV Swarm Formation Control: A Comprehensive Review. Drones 2024, 8, 320. [Google Scholar] [CrossRef]
Chen, Y.; Chen, R.; Huang, Y.; Xiong, Z.; Li, J. DRL-Based Improved UAV Swarm Control for Simultaneous Coverage and Tracking with Prior Experience Utilization. Drones 2024, 8, 784. [Google Scholar] [CrossRef]
Zhao, Z.; Zhang, X.; Fang, H.; Yang, Q. Distributed Formation Planning for Unmanned Aerial Vehicles. Drones 2025, 9, 306. [Google Scholar] [CrossRef]

Figure 1. Flowchart of Game-Theoretic Formation Control for UAV Swarms.

Figure 2. UAV Situational Awareness Diagram.

Figure 3. Action Library: (a) 2D Simple Action Library; (b) 3D Simple Action Library.

Figure 4. Virtual Structure Method.

Figure 5. The Construction Process of the Payoff Matrix.

Figure 6. Analysis of improved potential field method: (a) Force analysis of improved potential field method; (b) Analysis of non-potential orthogonal vector field.

Figure 7. Implementation of the Total Controller.

Figure 8. Rflysim Architecture.

Figure 9. Comparison of Fitness Values. (a) Convergence Comparison on Drone Objective Function; (b) Drone Objective Function Landscape.

Figure 10. Comparison of Fitness Values: (a) Method of This Paper; (b) Improved Artificial Potential Field; (c) Improved Particle Swarm Optimization; (d) Ant Colony Optimization; (e) Genetic Algorithm.

Figure 11. UAV Formation Obstacle Avoidance in a Factory Environment, the numbers represent the UAV IDs. (a) Industrial Warehousing Environment; (b) Square Formation; (c) Triangular Formation; (d) Obstacle-Avoiding Movement; (e) Formation Restoration.

Figure 12. Formation Restoration Experiment in a Simple Environment: (a) Industrial Warehousing Environment; (b) Obstacle-Avoiding Movement; (c) Formation Restoration; (d) Recovery Finished.

Figure 13. Actual Flight Path. (a) The Global Path of the Method in this Article; (b) The Global Path of the APF; (c) The Path of the 20th Drone of The Method in this Article; (d) The Path of the 20th Drone of the APF.

Figure 14. The Target Spread Between the Actual Formation and the Expected Formation.

Table 1. Experimental Parameters.

Algorithm	Value
IVYA-C	$N = 20, M a x I t e r = 100, I_{\min} = 0, I_{\max} = 3, Δ g_{v_{i}} = 0.5, μ = 4.0, p_{c h a o s} = 0.3$
IVYA	$N = 20, M a x I t e r = 100, I_{\min} = 0, I_{\max} = 3, Δ g_{v_{i}} = 0.5, μ = 4.0$
IPSO [11]	$N = 20, M a x I t e r = 100, η_{1} = 1.5, η_{2} = 1.5, w_\max = 0.9, w_\min = 0.4$
GA	$N = 20, M a x I t e r = 100, p_{c} = 0.8, p_{m} = 0.1$ .
ACO	$N = 20, α = 1.0, β = 2.0, ρ = 0.5, Q = 100$

Table 2. Experimental Comparisons.

Evaluation Index	Method of This Paper	Improved APF	IPSO	ACO	GA
Average Path Length/m	27.05	25.70	25.91	29.21	26.10
Formation Accuracy Rate	95.63%	96.51%	92.91%	94.51%	92.42%
Recovery Time/s	6.4	7.6	7.2	7.8	7.1
Total Time Consumption/s	15.42	17.89	17.51	16.80	17.22

Table 3. Formation Accuracy at Different Thresholds.

Threshold Method	Method of This Paper	Improved APF
T = 0.1	85%	45%
T = 0.15	95%	60%
T = 0.3	100%	75%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, J.; Gu, Z.; Zhang, L.; Wang, J. Research on Formation Recovery Strategy for UAV Swarms Based on IVYA-Nash Algorithm. Electronics 2025, 14, 3653. https://doi.org/10.3390/electronics14183653

AMA Style

Li J, Gu Z, Zhang L, Wang J. Research on Formation Recovery Strategy for UAV Swarms Based on IVYA-Nash Algorithm. Electronics. 2025; 14(18):3653. https://doi.org/10.3390/electronics14183653

Chicago/Turabian Style

Li, Junfang, Zexin Gu, Lei Zhang, and Junchi Wang. 2025. "Research on Formation Recovery Strategy for UAV Swarms Based on IVYA-Nash Algorithm" Electronics 14, no. 18: 3653. https://doi.org/10.3390/electronics14183653

APA Style

Li, J., Gu, Z., Zhang, L., & Wang, J. (2025). Research on Formation Recovery Strategy for UAV Swarms Based on IVYA-Nash Algorithm. Electronics, 14(18), 3653. https://doi.org/10.3390/electronics14183653

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Research on Formation Recovery Strategy for UAV Swarms Based on IVYA-Nash Algorithm

Abstract

1. Introduction

2. Problem Formulation and Model Architecture

2.1. Problem Formulation

2.2. Construction of Formation Game Model

2.3. Construction of Formation Game Model

2.4. Nash Equilibrium Solution and Problem Transformation

2.5. Obstacle Avoidance Controller Implementation

2.6. Implementation of the Total Controller

3. Experimental Comparison

3.1. Simulation Comparison Experiment of Formation Restoration

3.2. Verified by Real Flight Experiments

4. Discussion

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

Appendix A

Appendix A.1. Algorithmic Convergence Analysis

Appendix A.2. Stability Analysis

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI