Next Article in Journal / Special Issue
MICFOA: A Novel Improved Catch Fish Optimization Algorithm with Multi-Strategy for Solving Global Problems
Previous Article in Journal
Development and Evaluation of Fucoidan-Loaded Electrospun Polyvinyl Alcohol/Levan Nanofibers for Wound Dressing Applications
Previous Article in Special Issue
Enhanced Multi-Strategy Slime Mould Algorithm for Global Optimization Problems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Trajectory Optimization to Enhance Observability for Bearing-Only Target Localization and Sensor Bias Calibration

1
School of Automation, Southeast University, Nanjing 210096, China
2
School of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
3
Unmanned Aerial Vehicles Research Institute, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
*
Author to whom correspondence should be addressed.
Biomimetics 2024, 9(9), 510; https://doi.org/10.3390/biomimetics9090510
Submission received: 23 July 2024 / Revised: 19 August 2024 / Accepted: 20 August 2024 / Published: 23 August 2024

Abstract

This study addresses the challenge of bearing-only target localization with sensor bias contamination. To enhance the system’s observability, inspired by plant phototropism, we propose a control barrier function (CBF)-based method for UAV motion planning. The rank criterion provides only qualitative observability results. We employ the condition number for a quantitative analysis, identifying key influencing factors. After that, a multi-objective, nonlinear optimization problem for UAV trajectory planning is formulated and solved using the proposed Nonlinear Constrained Multi-Objective Gray Wolf Optimization Algorithm (NCMOGWOA). Simulations validate our approach, showing a threefold reduction in the condition number, significantly enhancing observability. The algorithm outperforms others in terms of localization accuracy and convergence, achieving the lowest Generational Distance (GD) (7.3442) and Inverted Generational Distance (IGD) (8.4577) metrics. Additionally, we explore the effects of the CBF attenuation rates and initial flight path angles.

1. Introduction

In recent years, the popularity of Unmanned Aerial Vehicles (UAVs) has increased because of their affordability and versatility [1,2,3]. These characteristics have facilitated the utilization of UAVs across multiple domains and applications, including infrastructure inspection [4], environmental monitoring [5], rescue missions and research [6], mapping [7], surveillance [8], and remote sensing [9]. In these sensing applications, target localization plays a critical role, which involves determining the position of a target via relative information between the UAV and the target, obtained from airborne sensors equipped on UAVs [10]. However, target localization based on passive sensors faces two main challenges. The first is that the bearing measurement bias introduces substantial errors in target localization [11]. The second challenge is that the estimation accuracy may be reduced because of the reliance solely on bearing information [12].
Target localization, which is usually aimed at obtaining a target’s inertial position, necessitates obtaining bearing information relative to the world frame [13,14]. Consequently, attitude sensors and onboard cameras, which provide the UAV’s attitude and the target’s line of sight (LOS) angle relative to the UAV’s fuselage frame, respectively, are generally integrated into the bearing-only target localization problem. According to references [15,16], targets can be accurately localized when the bearing measurements are perturbed only by Gaussian noise. However, in practice, this assumption is not valid. The outputs from both the attitude sensor and the onboard camera are biased [17]. Neglecting the bias can notably impair the target localization performance [11,17]. Considerable research efforts have focused on calibrating sensor biases. The method proposed in [18] constructs the bias pseudo measurement exclusively through the manipulation of local tracks, covariances, and the equivalent bias measurement matrices to estimate the sensor bias in sensor registration. The authors of [19] integrated sensor calibration and trajectory fusion within a multi-target tracking framework to mitigate the effect of bearing measurement bias on target tracking. The integration of data is crucial, particularly when a target needs to be observed simultaneously by two radar devices. In terms of state estimation, the EKF is a useful method for dealing with diffuse white noise models [20]. It also has the advantage of high computational efficiency compared with the Unscented Kalman Filter (UKF), which has been widely used in recent years for nonlinear filtering problems [21]. Given that the observability of the system directly impacts the performance of the state estimation [22,23], it is necessary to maintain and enhance the observability of the target localization system.
Observability is a fundamental property of a system that indicates its ability to uniquely determine an initial state from its outputs. Only if the system is observable can the states at any time be determined by a state estimator such as the Kalman Filter. The Fisher Information Matrix (FIM) is commonly used as a metric for assessing system observability [24,25]. The inverse of FIM corresponds to the Cramer–Rao Lower Bound (CRLB), which sets a theoretical lower limit on the covariance matrix of an ideal estimator, thereby representing the best achievable performance in state estimation. As highlighted in [12], maximizing system observability typically involves maximizing the determinant of the FIM, which in turn minimizes the estimation error covariance of the filter, enhancing its overall performance.
The observability of a bearing-only measurement system is dynamic and depends on the relative positions of the observer and the target [26,27,28,29,30]. Therefore, it is necessary to enhance the system’s observability by trajectory optimization. The authors of [31] utilized the rank of the observability matrix as a criterion to assess the system’s observability and to determine the reliability of different sensor locations. In [24,32,33], the determinant of the FIM was employed as a metric. It is maximized to generate the optimal trajectories that enhance the system’s observability. Unfortunately, there has been no investigation in the literature into UAV trajectory optimization for target localization based on directly enhancing system observability.
This work first analyzes the observability of a target localization system with biased bearing measurements via the Lie derivative method. It derives the conditions necessary to maintain system observability. To ensure observability, inspired by the phototropism of plants, a control barrier function was designed. This function restricts UAV motion, allowing it to avoid areas that may affect observability, with adjustable avoidance levels. Additionally, the condition number of the system observability matrix was employed as a metric to quantify the system observability, helping to identify the factors that contribute to system observability. Based on this analysis, a multi-objective, nonlinear programming problem was established to maintain and enhance system observability. To effectively solve the multi-objective, nonlinear programming problem, a penalty function was integrated into the Multi-Objective Gray Wolf Optimization Algorithm to address nonlinear constraints. Simulations confirmed the effectiveness of the proposed method. The UAV operated at a fixed altitude, modeled on a 2D, obstacle-free map, with constraints on the speed and turn rate of the UAV to limit its turning radius. The root-mean-square error (RMSE) of localization was used as a performance metric indicator for localization accuracy. The effectiveness of the proposed Nonlinear Constrained Multi-Objective Gray Wolf Optimization Algorithm (NCMOGWOA) was verified through comparisons with the Multi-Objective Particle Swarm Optimization Algorithm (MOPSOA) [34], Multi-Objective Arithmetic Optimization Algorithm (MOAOA) [35], and Sequential Quadratic Programming (SQP) method [36]. While the MOPSOA and MOAOA are both heuristic and neglect the fitness among the nondominated solutions, the SQP method requires a suitable starting point. To address these limitations, we propose the NCMOGWOA, which shows faster convergence and lower localization error in simulations. It outperforms the other methods in terms of the convergence metrics GD and IGD. Additionally, we explore the impact of the CBF attenuation rates and initial flight path angles on trajectory optimization.
The main contributions of this paper include the following:
(1)
Observability Analysis: Deriving necessary conditions for maintaining the observability of the target localization system using only biased bearing measurements. A control barrier function is designed to ensure system observability by restricting UAV motion.
(2)
Optimization Metric: Utilizing the condition number of the observability matrix as a metric. A multi-objective optimization algorithm is proposed to enhance system observability.
(3)
Algorithm Improvement: To address the limitations of the MOPSOA and MOAOA, the NCMOGWOA incorporates nondominated sorting and a crowding distance mechanism to improve the solution accuracy. A penalty function is constructed to manage nonlinear constraints, and random starting points increase adaptability.
The remaining sections are organized as follows. The kinematic model of the UAV and the target localization system with biased bearing measurement information are detailed in the subsequent section. Then, we conduct both qualitative and quantitative analyses of the system’s observability and introduce the designed trajectory optimization method. The simulation results are provided in the penultimate section, and the conclusions are presented in the final section.

2. System Models

This paper explores a two-dimensional stationary target localization problem. As illustrated in Figure 1, the inertial reference is denoted as ( X , Y ). The variables with subscripts U and T indicate those of the UAV and target, respectively. The speed of the UAV is represented by V; α and R denote the bearing angle and the relative distance between the UAV and target, respectively. θ represents the flight path angle of the UAV defined in the inertial reference frame. The separation angle λ is defined as the angle between the longitudinal axis of the UAV and the line of sight of the UAV, which can be expressed as the bearing angle and flight path angle of the UAV: λ = α θ. To ensure the uniqueness of each angle, let α , λ , θ π , π.
Assuming that the UAV moves at the same horizontal altitude as the target, the kinematics of the UAV can be formulated as follows:
p ˙ U ( t ) = g ( θ ( t ) ) V ( t ) γ ˙ ( t ) = ω ( t )
where g ( θ ( t ) ) = cos θ ( t ) , sin θ ( t ) T, while P U ( t ) = x U ( t ) , y U ( t ) T and ω ( t ) represent the position vector and turn rate of the UAV, respectively.
P T ( t ) = x t ( t ) , y t ( t ) T is defined as the position vector of the target at time t in the inertial reference frame, which the UAV cannot directly obtain. The objective of target localization is to calculate P T ( t ) with the UAV’s position, speed, and bearing measurement h ( t ) (measured by the UAV onboard sensor). According to the literature [15], target localization with bearing information can be accurate only when affected by Gaussian noise. However, in practice, the bearing information is collected by the attitude sensor and onboard camera of the UAV, which have errors in their outputs. As shown in Figure 1, b is defined as the bias in the bearing measurement. To achieve better performance in target localization, it is necessary to estimate b to compensate for the measured bearing information; thus, the state vector X to be estimated is defined as
X ( t ) = x t ( t ) , y t ( t ) , b T
The discrete-time system dynamics can subsequently be formulated as
X ( t + 1 ) = Φ t + 1 , t X ( t ) + w ( t ) h ( t ) = α ( t ) + b + v ( t )
where Φ t + 1 , t denotes the state transition matrix from t to t + 1. w ( t ) and v ( t ) are the white Gaussian noise with corresponding covariances Q k and L k, respectively.
r ( t ) = P T ( t ) P U t = r x ( t ) , r y ( t ) T R 2 is defined as the relative position vector; then, in the form of the relative position, the biased measurement function can be represented as
h ( t ) = arctan ( r y ( t ) r x ( t ) ) + b + v ( t )
and the dynamic system (3) can be rewritten as
X ( t + 1 ) = Φ t + 1 , t X ( t ) + w ( t ) h ( t ) = arctan ( r y ( t ) r x ( t ) ) + b + v ( t )
Since observability directly influences the accuracy of target estimation, this paper aims to maintain the observability of the dynamic system (5) and subsequently enhance its performance in the target localization system with biased bearing measurements.

3. Bio-Inspired Observability Enhancement Optimization Model

In this section, the qualitative and quantitative analyses of the system are provided, as are the observability conditions and influencing factors.

3.1. Qualitative Analysis of System Observability

Within the continuous-time framework, the dynamic system (5) can be replaced by
X ˙ = f ( X ) h = arctan ( r y r x ) + b + v
Definition 1.
The system (6) is observable in the time interval  [ 0 , M ] if the initial state  X 0 can be uniquely determined from  h m, m [ 0 , M ].
Defining H ( t ) = [ h ( t ) , h ˙ ( t ) , , h n 1 ( t ) ] T as the cumulative measurement vector, the arbitrary initial state X ( t ) and its corresponding measurement H ( t ) are related by
H ( t ) = g ( X ( t ) )
According to the implicit function theorem, the unique determination of the initial state X ( t ) from the measurement H ( t ), if and only if the observability matrix, denoted by
O ( t ) = H ( t ) X ( t )
is nonsingular.
Theorem 1.
The dynamic system (6) is observable only if the observability  O ( t ) has full rank, i.e., rank  r a n k ( O ( t ) ) = n, where  n is the order of the system.
To compute the observability matrix O ( t ), the Lie derivative is employed [37]. For simplicity, the state and time symbols are both ignored. The Lie derivative of h with respect to f is expressed as L f h. The j-th order Lie derivative can be calculated via
L f j h = L f j 1 h X · f .
Consequently, the relationship between states and measurement can be represented by
Y ( t ) = [ L f 0 h , L f 1 h , , L f n 1 h ] T
and its corresponding observability matrix can be computed by
O ( t ) = L f 0 h X , L f 1 h X , , L f n 1 h X T
where
L f 0 h X = h X , L f j h X = ( L f i 1 h X · f ) X
In the dynamic system (6), the bias b remains constant, providing no additional information in the computation of the Lie derivative vector. Therefore, when computing the observability matrix, only the first two orders of O ( h ) are considered:
O = [ L f 0 ( h ) ; L f 1 ( h ) ] = r y r x 2 + r y 2 r x r x 2 + r y 2 v y r x 2 + 2 v x r x r y + v y r y 2 r x 2 + r y 2 2 v x r x 2 + 2 v y r x r y v x r y 2 r x 2 + r y 2 2
For simplicity, let R = r , V = v. Hence, in polar coordinates, we have
r x = R cos α r y = R sin α , v x = V cos θ v y = V sin θ
By substituting (14) into (13), the observability matrix can be simplified as
O = sin a R cos a R V sin 2 a θ R 2 V cos 2 a θ R 2
and its corresponding determinant can be computed as
det ( O ) = R V sin a θ R 4
According to Theorem 1, the system (6) is observable only when the observability matrix O has full rank, which is equivalent to the fact that the determinant of O is nonzero, i.e.,
det ( O ) 0 λ = α θ 0 V 0
When V = 0, the UAV remains stationary, and the bearing measurement cannot be updated; when α θ = 0, r ( t + 1 ) = k 1 r ( t ) , k 1 +. Regardless of the angle measurement, distinguishing the next step relative position vector from the current step relative position vector is impossible. Hence, to ensure system observability, the separation angle λ and the relative speed between the UAV and the target must remain nonzero.

3.2. Bio-Inspired Unobservable Area Avoidance Based on the CBF Method

The observability matrix O is determined by the bearing angle α, the relative distance R, the speed V, and the flight path angle θ of the UAV. When the UAV flight parameters are constant, the observability matrix O can be uniquely determined by the relative positional relationship between the UAV and the target.
Definition 2.
The observability of the dynamic system can be determined by the geometric relationship between the UAV and the target.
To guarantee the observability of the dynamic system, it is crucial to modify the geometric relationship between the UAV and the target. In this work, inspired by phototropism and plants’ dark avoidance behavior, the CBF was employed to categorize the observable and unobservable areas. Additionally, restrictions were imposed to ensure UAV movement within the observable areas and maintain the system’s observability.
Phototropism refers to the phenomenon in which plants bend toward light when they are exposed to it [38,39]. It is considered to be a mechanism by which plants adapt to low-light environments, as illustrated in Figure 2.
Definition 3.
For a smooth function  h : D R, define  C as a superlevel set of  h, its boundary as  C, and its interior as  int ( C ):
C = x D R n : B ( x ) 0 C = x D R n : B ( x ) = 0 int ( C ) = x D R n : B ( x ) > 0
Considering a general nonlinear system
x ˙ = f ( x ) + g ( x ) u
if there exists a constant  δ > 0 such that for all  x D, satisfying
sup L f h ( x ) + L g h ( x ) u + δ h ( x ) 0 i . e . , sup h ˙ ( x ) u + δ h ( x ) 0
then,  h is a control barrier function of (19).
The control barrier function is frequently employed for addressing safety analysis and control issues in nonlinear systems [40,41]. In this work, the control barrier function was employed to delineate observable and unobservable areas and restrict UAV motion, thereby ensuring system observability.
From (1), the kinematics of the UAV can be rewritten as
X ˙ U = f ( X U ) + g ( X U ) u = sin ( θ ) cos ( θ ) 0 V + 0 0 1 w
where X U = [ x u , y u , θ ] T.
By utilizing the system observability conditions provided in (16) and (17), the control barrier function can be formulated as
B ( x ) = ( R V sin ( α θ ) ) 2
The CBF observability constraints can subsequently be established through the following inequality:
B ˙ ( x ) + δ B ( x ) 0 i . e . , L f B + L g B u + δ B ( x ) 0
where δ ( 0 , 1 ) represents the attenuation rate of B ( x ). L f B and L g B denote the Lie derivatives along the vector fields f and g, respectively:
L f B = B ( x ) f , L g B = B ( x ) g
B ( x ) = 2 r V 2 sin ( α θ ) sin θ 2 r V 2 sin ( α θ ) cos θ 0
Substituting (24) and (25) into (23) yields the observability constraints of the system:
R 2 V 2 sin ( 2 θ 2 α ) w + δ ( R V sin ( α θ ) ) 2 0

3.3. Influence of δ on System Observability

By integrating (23) from t to t + Δ t, we have
B ( x ( t + Δ t ) ) B ( x ( t ) ) t t + Δ t δ B ( x ( τ ) ) d τ
Subsequently, the superlevel set S Δ t that satisfies the CBF observability constraint at x ( t + Δ t ) can be defined as
S Δ t = x n : B ( x ( t + Δ t ) ) B ( x ( t ) ) t t + Δ t δ B ( x ( τ ) ) d τ
Let R Δ t denote the set of states reachable by the system after Δ t when the constraints are satisfied. Hence, a solution exists at x ( t ) if the UAV’s motion at x ( t ) fulfills both the control input constraints and the observability CBF constraints, i.e., R Δ t S Δ t 0. The impact of δ on system observability is illustrated through the geometric relationship between R Δ t and S Δ t, as shown in Figure 3.
When δ = δ 1, as illustrated in Figure 3a, and S Δ t R Δ t = R Δ t, the UAV has the flexibility to either approach or move away from the unobservable area, while ensuring the system’s observability at x ( t + Δ t ). However, the possibility of the UAV moving closer to the unobservable area increases the risk of the system becoming unobservable.
When δ = δ 2, S Δ t R Δ t 0, as shown in Figure 3b, the UAV is restricted to moving far from the unobservable area, ensuring observability. However, as the coverage area of S Δ t R Δ t decreases, the feasible area for UAV motion diminishes.
From Figure 3a,b, it is evident that with a certain value of Δ t, as δ decreases, as indicated by (28), the area of S Δ t diminishes. This compression reduces the area covered by S Δ t R Δ t, thereby decreasing the likelihood of the UAV moving closer to the unobservable area and ensuring system observability.

3.4. Quantitative Analysis of System Observability

In addition to maintaining the system’s observability, we aim to enhance it to improve the state estimation performance. To analyze system observability quantitatively, C was defined as the observability metric:
C = ξ max ( O ) ξ min ( O )
where C represents the condition number of the observability matrix O. ξ min ( O ) and ξ max ( O ) denote the minimum and maximum singular values, respectively.
Definition 4.
A system is considered to have weak observability if the condition number of the observability matrix is extremely large or infinite.
Definition 5.
A system is considered to have strong observability if the condition number of the observability matrix is close to one.
Substituting (15) into (29), we have
C = V 2 + R 2 + V 2 + R 2 2 4 V 2 R 2 sin a θ 2 V 2 + R 2 V 2 + R 2 2 4 V 2 R 2 sin a θ 2
(30) indicates that C 1, C = 1 if and only if the following equation is satisfied:
V 2 + R 2 2 = 4 V 2 R 2 sin a θ 2 1 + R 2 V 2 = 2 R V sin a θ
Typically, the relative distance between the UAV and target significantly exceeds the speed of the UAV, i.e., R V. Consequently, 1 + R 2 / V 2 is generally much larger than 2 R / V · sin a θ, and Equation (31) can be satisfied only when R / V = 1 and α θ = ± π / 2.
By defining c = 1 / C , a heatmap was generated to visualize the distribution of c. As demonstrated in Figure 4, when R / V is closer to 1 and λ is closer to ± π / 2, C is closer to 1, indicating improved system observability. Consequently, to enhance the observability, the UAV must approach the target closely, while also adjusting the flight path angle θ to make the separation angle λ approach ± π / 2.

3.5. Trajectory Optimization Model

Based on the observability analysis above, a multi-objective optimization model was formulated to enhance the observability of the dynamic system while ensuring it. The object functions were defined as f 1 = R / V 1 and f 2 = a b s ( λ ) π / 2, respectively. Typically, the distance from the UAV to target R is significantly greater than the speed of the UAV; the proximity of R / V and 1 is comparable to the proximity of R and 1. Hence, considering a minimum safe distance R s a f e between the UAV and the target, the first objective function can be redefined as f 1 = R R s a f e.
min f 1 = R R s a f e min f 2 = a b s ( λ ) π / 2 subject to r 2 V 2 sin ( 2 θ 2 α ) ω + δ ( r V sin ( α θ ) ) 2 0 v min v v max ω min ω ω max R s a f e R
where v min and v max denote the minimum and maximum UAV speeds, respectively; ω min and ω max denote the minimum and maximum UAV turn rates, respectively.
Although the optimization problem in (32) can be solved via traditional linear optimization techniques, heuristic methods provide greater flexibility and adaptability, especially when searching for local optimal solutions. To improve the convergence speed, we utilized the NCMOGWOA algorithm in this study. As will be shown in Section 5, our simulations highlight the superior convergence speed of the NCMOGWOA algorithm compared with other methods, demonstrating its effectiveness for the given optimization problem.

4. Nonlinear Constrained Multi-Objective Gray Wolf Optimization Algorithm (NC-MOGWOA)

The optimization model presented in the last section constitutes a multi-objective, nonlinear programming problem. As the number of candidates in the state space increases, the potential combinations available for selection rise exponentially, posing a challenge for solutions using conventional methods [42,43].
The Gray Wolf Optimization Algorithm (GWOA) is a bio-inspired algorithm that simulates the predatory actions of gray wolf populations in nature [44]. It efficiently tracks the optimal solution’s iterative direction and finds the optimal solution, enabling quick discovery. In this paper, a Nonlinear Constrained Multi-Objective Gray Wolf Optimization Algorithm (NCMOGWOA) is employed to address the presented multi-objective, nonlinear programming problem.

4.1. Gray Wolf Optimization Algorithm (GWOA)

The GWOA is a meta-heuristic algorithm inspired by the predatory behavior observed in gray wolf populations. It combines the hierarchy and distribution patterns observed within these populations to simulate the hunting and encircling process of gray wolves when they pursure their prey. This process includes four steps: establishing social hierarchies, searching for prey, encircling prey, and attacking prey.
The wolves are classified into 4 distinct classes: 1, 2, 3, and 4, and each class has unique responsibilities within the pack. Wolf 1 possesses managerial skills and oversees decisions regarding food acquisition and location; wolf 2 aids in decision making and serves as a communicator; wolf 3 follows the directives of wolf 1 and wolf 2, undertaking tasks such as scouting and guarding; and wolf 4 complies with the pack’s hierarchy, maintaining social equilibrium. The GWO model can be expressed as
D = C · X
X ( t + 1 ) = X p ( t ) A · D
A = 2 a · r 1 a
C = 2 · r 2
D 1 = C 1 · X 1 X D 2 = C 2 · X 2 X D 3 = C 3 · X 3 X
X 1 = X 1 A 1 · D 1 X 2 = X 2 A 2 · D 2 X 3 = X 3 A 3 · D 3
X ( t + 1 ) = ( X 1 + X 2 + X 3 ) / 3
where D represents the distance between the wolf and the prey, while X p and X denote the positions of the prey and the wolf, respectively. Both A and C are coefficient vectors. The parameter a denotes the convergence factor, which linearly decreases with each iteration, and r 1 and r 2 are randomly selected values within the range [0, 1].
In the hunting process, wolves 1, 2, and 3 initially make a random estimation of the prey’s location, since it is unknown. They then guide the other wolves to assess and update the estimated location iteratively until an optimal solution is achieved.
The coefficient vector A, with a range of [ 2 a , 2 a ], influences the wolf’s decision making regarding its current position relative to that of the prey. More precisely, when A > 1, the algorithm exhibits a robust search capability, causing the wolf to move farther away from the prey. Conversely, when A 1, the algorithm shows a strong developmental ability, prompting the wolf to move closer to the prey.

4.2. Multi-Objective Gray Wolf Optimization Algorithm

To adapt the GWOA to multi-objective problems, two enhancements were introduced to the algorithm.

4.2.1. External Stock Archive

An external population archive was introduced to store nondominated solutions. At each iteration, the algorithm generates a new position for the gray wolf. The new gray wolves are compared to the original gray wolves stored in the archive when their eligibility for the archive is assessed. If the new gray wolf is dominated by all the original wolves in the archive, it cannot join the pack. Conversely, if the new gray wolf dominates one or more gray wolves, it joins the pack, displacing any dominated wolf. If neither dominates the other, the new gray wolf can join only if the archive has not reached its maximum capacity.

4.2.2. Decision-Making Wolf Selection

In the literature [45], a roulette method was employed to choose the decision-making wolf from the archive. This method involves identifying the least crowded grid in the archive and randomly selecting three solutions corresponding to wolves 1, 2, and 3, without any perceived superiority or inferiority. If the number of segments is insufficient, the selection is deferred to the grid with the second lowest crowdedness.

4.3. Nonlinear Constraint Penalty Function

Multi-objective optimization problems frequently involve nonlinear inequality and equation constraints, rendering them challenging to solve. However, the Multi-Objective Gray Wolf Optimization Algorithm (MOGWOA) does not account for these nonlinear constraints, potentially causing it to exceed the allowable boundaries during wolf location updates [45]. In this paper, based on the MOGWOA, we exclusively consider nonlinear inequality and equation constraints. A general model for multi-objective programming problems can be expressed as
min f 1 ( x ) , f 2 ( x ) , f n ( x ) s . t x i L x i x i U g m ( x ) = 0 , m = 1 , 2 , , j f q ( x ) 0 , q = 1 , 2 , , k
where g m ( x ) and f m ( x ) represent the nonlinear equational and inequality constraints, respectively. The constraint penalty function ρ ( x ) can be defined as follows:
ρ ( x i ) = m = 1 j p g · g m ( x i ) + q = 1 k p f · f q ( x i ) · H ( x i )
where p f and p f are constants, and where H ( x ) is a judgment function of inequality constraint f ( x ):
H ( x ) = 0 , f ( x ) 0 1 , f ( x ) > 0
With the introduction of the constraint penalty function, the nonlinear constraints can be converted into the objective function, thereby simplifying the problem-solving process. The multi-objective programming problem can be reformulated as
min f 1 ( x ) + ρ ( x ) , f 2 ( x ) + ρ ( x ) , f n ( x ) + ρ ( x ) s . t x i L x i x i U g m ( x ) = 0 , m = 1 , 2 , , j f q ( x ) 0 , q = 1 , 2 , , k
As represented in (43), if the updated position of the wolves breaches the nonlinear constraints, the values of each objective function significantly increase, rendering the wolves’ location nonoptimal. In other words, the optimal solution in each iteration must satisfy all the constraints.
Algorithm 1 demonstrates the process of the proposed NCMOGWOA. In line 1, the process begins with the random initialization of the wolf population through the following equation:
X i = X min + r a n d ( 0 , 1 ) · ( X max X min )
where i = 1 , 2 , , N, r a n d ( 0 , 1 ) represents a uniformly distributed random number in the range [0, 1]; X min and X max denote the lower and upper bounds of the dimension, respectively.
Algorithm 1: Nonlinear Constrained Multi-Objective Gray Wolf Optimization Algorithm (NCMOGWOA)
1:begin
2:Selected the gray wolf population Xi randomly selected within the feasible region (i = 1, 2, …, n)
3:Initialize a, A, and C using Equations (35) and (36).
4:Calculate the objective values for each search agent using Equation (40)
5:The initial archive Ar0 ← The nondominated solutions
6:X∂1, X∂2, X∂3 ← The initial archive A0 (select the initial three wolves with the lowest objective
7:function values)
8:for t = 1,2, …, Max number of iterations
9:  Update the positions of each current search agent using Equations (37), (38) and (39)
10:  Update a, A, and C using Equations (35) and (36)
11:  Calculate [f1(x), f2(x), …fn(x)] of all search agents using Equation (40)
12:  Calculate ρ(x) for all search agents using Equation
13:  [f1(x), f2(x), …fn(x)] ← [f1(x) + ρ(x), f2(x) + ρ(x), …fn(x) + ρ(x)]
14:  The archive A ← The nondominated solutions
15:  If the archive is full
16:   Omit one of the current archive members
17:The archive Ar ← The archive Ar + The new solution
18:end if
19:If solutions are outside the hypercubes
20:Update the grids to cover the new solutions
21:end if
22:X∂1, X∂2, X∂3 ← The archive Ar
23:end for
24:end
25:return archive Ar
The proposed algorithm uses the archive for storing the nondominated solution. For each iteration in the loop, after generating a new wolf population, the penalty function is calculated along with all the objective functions (lines 11, 12). The objective function is updated by adding the penalty function to the original objective function values (line 13).
All the current new gray wolves are compared with the original gray wolves stored in the archive in terms of the updated objective functions. If the new gray wolf is dominated by all the original wolves in the archive, it cannot join the pack. Conversely, if the new gray wolf dominates one or more gray wolves, it joins the pack, displacing any dominated wolf (lines 14–18).
After the search process, wolves 1, 2, and 3 are replaced by the three best wolves in the current archive, respectively (line 22). When the algorithm finishes, the updated archive is stored as the output, which stores all the nondominated solutions.

4.4. Selection of the Optimal Solution

As the NCMOGWOA yields a Pareto optimal solution, it becomes challenging to simultaneously obtain optimal solutions for both objectives f 1 and f 2, thus complicating the selection of optimal UAV inputs.
Definition 6.
Given a cost-based, multi-objective optimization problem with a feasible solution set  Ω:  min f ( X ),  X Ω, if  X Ω satisfies.
Λ i I ( f i ( X ) f i ( X ) )
Consequently, X is denoted as the optimal solution of min f ( X ).
Definition 7.
Given a cost-based, multi-objective optimization problem with a feasible solution set  Ω:  min f ( X ),  X Ω, if there exists  X Ω and no other  X ¯ Ω, then (46) holds.
f j ( X ) f j ( X ¯ ) , ( j = 1 , 2 , , r )
and at least one of (46) is a strict inequality; then X is denoted as the Pareto optimal solution of min f ( X ).
The set containing all the Pareto optimal solutions of min f ( X ) is referred to as the Pareto optimal solution set of min f ( X ). The graphical representation of the Pareto optimal solution set in the space of the objective function is defined as the Pareto front. These concepts are demonstrated in Figure 5.
Figure 5 illustrates the Pareto solution of a two-objective optimization problem. Each black dot within the feasible solution set Ω represents a Pareto solution, collectively forming the Pareto solution set. The red line indicates this front, highlighting solutions where neither objective f 1 ( x ) nor f 2 ( x ) can be improved without sacrificing the other. Thus, x j is not a Pareto solution.
To provide a comprehensive evaluation, a synthesized evaluation methodology that combines Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) and the Criteria Importance Through Intercriteria Correlation (CRITIC) method (TCM) was proposed. Applying the TCM to the Pareto front solution set obtained by the NCMOGWOA enables it to determine the optimal solution to the problem.
In this paper, the CRITIC method is employed to assign weights to each objective function in every iteration. CRITIC is usually used to weight indicators. It considers the differences and similarities among evaluation indicators, assigning smaller weights to indicators with high horizontal similarity and larger weights to those with significant vertical differences [46].
With the weights generated by the CRITIC method, the TOPSIS method was employed to synthesize the assessments of various objective function values. TOPSIS chooses the optimal solution by establishing the positive and negative ideal solutions of the evaluation problem and the optimal and worst solutions for each index. It ranks the solutions based on their relative closeness to the ideal solution, considering their proximity to the positive and negative ideal solutions [47].
The TCM process is illustrated in Figure 6. Once the Pareto front solution set was obtained by the NCMOGWOA in each iteration, the weight of each objective function could be computed using the CRITIC method. Based on these weights, the synthesized evaluation index was subsequently constructed using the TOPSIS method, and the final scores were derived for all Pareto solutions. We selected the optimal solution and the corresponding optimized variables from these scores.
Upon examining the distribution of c in Figure 4, it is evident that during the initial phase, the impact of R on the condition number is almost negligible due to its large magnitude. Consequently, it is imperative to prioritize the optimization of f 1 to ensure a decrease in the value of C to an appropriate level.
However, solely concentrating on reducing the distance R can lead the UAV to violate the CBF constraint. Therefore, the optimization objectives of this problem are divided into two stages:
S o l u t i o n o p t = S o l u t i o n R , R > R s w i t c h S o l u t i o n T C M , R R s w i t c h
where S o l u t i o n o p t represents the optimal solution, S o l u t i o n R represents the Pareto optimal solution with the smallest value in f 1, S o l u t i o n T C M indicates the optimal solution selected by the TCM, and R s w i t c h is a constant.
Figure 7 demonstrates an algorithm for biased bearing information-only target localization. The aim was to optimize the UAV trajectory to improve the localization performance. First, by employing the NCMOGWOA, the optimal Pareto set S p a r e t o was obtained, as was its corresponding objective function value vector f 1 ,   f 2. Afterward, utilizing the optimal Pareto set S p a r e t o with the TCM, we selected the optimal solution according to (47), which was then used to update the position of the UAV. Furthermore, employing the Extended Kalman Filter (EKF) facilitated obtaining both P ^ T and b ^, where P ^ T and b ^ denote the estimation of the target’s position and bias in the bearing measurement h ( t ), respectively. In closed-loop mode, b ^ was employed to compensate for the bearing measurement h ( t ), enhancing its accuracy.

5. Experiments and Results

In this section, the bearing-only stationary target localization problem is solved by the proposed UAV trajectory optimization algorithm based on observability enhancement. We analyze the differences between open-loop and closed-loop modes within the algorithm and explore the effects of varying δ and initial flight path angles on trajectory optimization. Finally, we evaluate the proposed NCMOGWOA’s localization accuracy and convergence by comparing it with the MOPSOA [34], MOAOA [35], and SQP methods [36]. The comparison is conducted through state estimation and the GD and IGD metrics. The simulation platform for each experiment in this study is the Windows 11 AMD Ryzen 5 5600H chip system, and the MATLAB version is R2021b.

5.1. Trajectory Optimization Results for Target Localization

In the case of a stationary target, the state transition matrix Φ t + 1 , t can be represented by an identity matrix. The relevant parameters are summarized in Table 1.
In this study, the target position and bias bearing measurement bias were estimated using the Extended Kalman Filter (EKF). The initial value for state estimation was set as follows:
X 0 = 470   m 470   m 3 ° T
The covariance Q and initial values of the error covariance matrix were set to
Q = diag ( ( 0.1   m ) 2 ( 0.1   m ) 2 ( 0.001 ° ) 2 )
P 0 = diag ( ( 25   m ) 2 ( 25   m ) 2 ( 0.1 ° ) 2 )
The target localization system, which relies on biased bearing measurements, operates in closed-loop mode. In this mode, the estimation of the bearing measurement bias b at each moment compensates for the subsequent bearing measurement, reducing its impact. Accurate estimation of bias b improves the precision of bearing measurements. In closed-loop mode, based on (4), the measurement function at each time instant t is described as follows:
h c l o s e ( t ) = h ( t ) b ^ ( t 1 )
where b ^ ( t 1 ) denotes the estimation of bias b at time instant t 1.
The trajectory optimization results are illustrated in Figure 8. In Figure 8a, the UAV trajectory forms a decreasing radius circle around the stationary target. When R > R s w i t c h, the UAV approaches the target, increasing the separation angle λ. As R rapidly decreases to R s w i t c h, there is a sharp decline in the condition number, as shown in Figure 8l. At R s w i t c h, S o l u t i o n T C M is selected as the optimal solution. To maintain the separation angle λ near π / 2, the UAV continues a circular path with a diminishing radius, enhancing system observability. When R reaches the safe distance R s a f e, the UAV slows down to maintain this distance, orbiting the target at R s a f e.
Figure 8c,d,k,l demonstrates that the observability matrix remained full rank throughout. The localization and bearing bias estimation errors converged to zero as the condition number decreased. Moreover, the estimated position of the target eventually converged to its real position, as shown in Figure 8b. As R stabilizes, the speed of the UAV must decrease, causing a slight increase in the condition number. However, these fluctuations were minimal, and had a negligible effect on estimation accuracy. It is evident that the proposed trajectory algorithm had a significant effect on observability enhancement, resulting in accurate localization of the target and calibration of the bearing measurement.

5.2. Comparison between the Open-Loop and Closed-Loop Modes

This section analyzes the performance differences in trajectory optimization between the open-loop and closed-loop modes, focusing on whether the estimation of bias b is used to compensate for the target bearing measurement. The simulation parameters for both modes remain consistent with the values provided in Table 1.
For simplicity, we denoted the open-loop mode as M o d e o p e n and the closed-loop mode as M o d e c l o s e d. Figure 9 shows that the trends in the condition number were similar for both modes. However, a significant difference exists in the estimation of bias b: the error can converge to nearly zero at M o d e c l o s e d but not at M o d e o p e n. Figure 9a,c and Table 2 demonstrate the superior performance of M o d e c l o s e d in target localization, where the estimated target location converged to the actual position, unlike in M o d e o p e n. Moreover, except for the initial period, the localization error in M o d e c l o s e d remained lower than M o d e o p e n and could converge to zero.

5.3. Effect of δ on Trajectory Optimization

To explore the effect of δ on trajectory optimization, the attenuation rates δ of the CBF were divided into three groups, namely, δ 1 = 0.06 , δ 2 = 0.08 , δ 3 = 0.1, instead of being optimized as a parameter by the NCMOGWOA. To exclude other factors, the optimal solution was chosen by the TCM throughout the process. The simulation results are presented in Figure 10.
Figure 10a,b illustrates that as δ decreased, the constraints imposed by the CBF on the UAV were reduced. A smaller δ allowed the UAV to approach the target more rapidly, leading to a quicker reduction in both R and the condition number, thus enhancing observability. When R > R s a f e, a smaller δ resulted in a faster decrease in the condition number, as shown in Figure 10d. After R reached R s a f e, the condition number stabilized across different cases. Figure 10c shows that, although it eventually converges to nearly zero in all the scenarios, the localization error was significantly lower with a smaller δ. To exclude the effects of stochastic factors such as Gaussian noise, a Monte Carlo simulation was conducted. The mean and standard deviation (Std) of the root-mean-square error (RMSE) of the localization error over 100 runs are summarized in Table 3, clearly demonstrating that a smaller δ improves target localization performance.
Although a smaller δ implied better observability, it is apparent that δ was not as small as it should be. When δ became too small, the CBF constraints on the UAV became excessively stringent, as illustrated in Figure 10a, resulting in a severely limited feasible area for the UAV. This limitation makes it challenging for UAVs to fulfill certain mission requirements, such as cruising. Therefore, it is crucial to optimize δ to effectively constrain UAV motion based on mission objectives.

5.4. Effect of the Initial Flight Path Angle on Trajectory Optimization

To analyze the effect of the initial flight path angle on trajectory optimization, the initial flight path angles θ were divided into six groups: θ 1 = 15 , θ 2 = 25 , θ 3 = 35 , θ 4 = 55 , θ 5 = 65 , and θ 6 = 75 . The optimal solution was chosen by the TCM throughout the process.
Figure 11a shows the UAV trajectory for different initial angles. The trajectory was divided into two categories based on a 45° flight path angle: For angles less than 45°, the UAV followed a counter-clockwise circular path around the target with a continuously decreasing radius. Conversely, for angles greater than 45°, the UAV followed a clockwise circular path. Similarly, as shown in Figure 11b, the convergence of the UAV separation angle was categorized into two groups. For initial flight path angles greater than 45°, the separation angle converged to −90°. When the initial flight path angle was less than 45°, it converged to 90°.
When the flight path angle was exactly 45°, the UAV moved directly toward the target, resulting in a separation angle of zero. In this case, the target localization system became unobservable, meaning that the target’s position could not be accurately estimated.

5.5. Comparison between the NCMOGWOA and Other Methods

To evaluate the effectiveness of the proposed NCMOGWOA, it was compared with the MOPSOA, MOAOA, and SQP methods [34,35,36]. The same experimental parameters (Table 1) were selected for the NCMOGWOA. The MOPSOA, MOAOA, and SQP parameters used in this study are summarized in Table 4.
The results are illustrated in Figure 12. To exclude stochastic effects such as Gaussian noise, a Monte Carlo simulation was conducted. The mean and standard deviation (Std) of the root-mean-square error (RMSE) are summarized in Table 5. As shown in Figure 12 and Table 5, although each method achieved a low localization error, the proposed NCMOGWOA converged more rapidly. It also achieved a lower mean and Std for the target position and bearing bias errors, indicating a superior localization performance.
To assess algorithm performance quantitively, the Generational Distance (GD) and Inverted Generational Distance (IGD) were used to evaluate convergence and comprehensive performance. They are expressed as follows:
G D = i = 1 u d i u 2 u , I G D = i = 1 v d i v 2 v
where u and v denote the number of obtained Pareto solutions and true Pareto solutions, respectively; d i u denotes the Euclidean distance between the ith obtained Pareto solution and the closest true Pareto solution; and d i v denotes the Euclidean distance between the ith true Pareto solution and the closest true Pareto solution. A smaller value of the GD means better convergence and the lower the value of the IGD is, the better the comprehensive performance of the algorithm.
Table 6 compares the proposed NCMOGWOA with the other methods via the GD and IGD metrics. The NCMOGWOA has slightly lower values, indicating better performance in terms of convergence property. This improvement is due to the effective leader selection strategy, which involves choosing wolves 1, 2, and 3 from the current Pareto front, optimizing the process by leveraging the best results, and exploring diverse solutions across the objective space.
Figure 13 illustrates the Pareto solutions from different algorithms over 50 runs. Although the Pareto solution sets of the NCMOGWOA, MOPSOA, and SQP were similar when t = 10 s, the NCMOGWOA performed better in both objective functions, initially showing good exploration ability. Additionally, Figure 13a,b shows that the Pareto optimal solutions converged as the simulation time increased, which was consistent with the localization error trend in Figure 12a, where all algorithms eventually optimized the objectives, reducing the localization error to nearly zero.

6. Discussion

Most previous studies focused on system observability and maintenance. This study used the condition number of the observability matrix to numerically analyze system observability. We formulated a multi-objective, nonlinear optimization problem for UAV trajectory planning to enhance observability. The proposed NCMOGWOA can efficiently solve this problem.
The simulation demonstrated that optimizing UAV trajectories significantly improved localization system observability and target localization performance. Compared with the other algorithms, NCMOGWOA achieved better performance in both target localization and convergence.
This study focused on a two-dimensional observability matrix. As the dimensions increased, computing the condition number became challenging, limiting its use as a metric. Future work will explore alternative metrics to quantify system observability.

7. Conclusions

In this work, we developed a control barrier function inspired by plant phototropism to ensure observability in a target localization system with biased bearing information. This prevents the UAV from entering unobservable areas. By analyzing the condition number of the observability matrix, we identified two influencing factors. We then formulated a multi-objective, nonlinear optimization problem for UAV trajectory planning to enhance system observability. To solve this problem, a penalty function was introduced into the Multi-Objective Gray Wolf Optimization Algorithm (MOGWOA) to manage nonlinear constraints. Simulations confirmed the effectiveness of this trajectory optimization for stationary target localization. Additionally, we analyzed the effects of different CBF attenuation rates and initial flight path angles. Finally, the proposed NCMOGWOA was compared with other algorithms using GD and IGD metrics, which yielded competitive results.

Author Contributions

Conceptualization, K.L. and J.P.; methodology, K.L., J.P. and Q.W.; software, J.P.; validation, K.L. and J.P.; formal analysis K.L. and J.P.; investigation, K.L. and J.P.; writing—original preparation, J.P.; writing—review and editing, K.L., J.P., B.J., Y.Z. and Q.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (grant number Nos. 62373101, and 62073075).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CBFcontrol barrier function
UAVUnmanned Aerial Vehicle
LOSline of sight
GWOAGray Wolf Optimization Algorithm
MOGWOAMulti-Objective Gray Wolf Optimization Algorithm
NCMOGWOANonlinear Constrained Multi-Objective Gray Wolf Optimization Algorithm
TOPSISTechnique for Order Preference by Similarity to Ideal Solution
CRITICCriteria Importance Through Intercriteria Correlation
TCMTOPSIS and CRITIC Method
EKFExtended Kalman Filter
NSGANondominated Sorting Genetic Algorithms
SQPSequential Quadratic Programming
RMSEroot-mean-square error
Stdstandard deviation
MOPSOAMulti-Objective Particle Swarm Optimization Algorithm
MOAOAMulti-Objective Arithmetic Optimization Algorithm
SQPSequential Quadratic Programming
GD; IGDGenerational Distance; Inverted Generational Distance
Nomenclature
α, θ, λBearing angle, flight path angle, and separation angle of the UAV
P U, P TPosition vector of the UAV and target
v, ωSpeed and turn rate of the UAV in m/s and rad/s
R, rRelative distance and its corresponding vector
r x, r yx- and y-axis components of relative distance
hTarget bearing information measured by the UAV
bBias in target bearing measurement
X, Φ t + 1 , tState vector and state transition matrix from t to t + 1
L f j hThe jth order Lie derivative of h with respect to f
S Δ tThe set satisfies control barrier function constraints
R Δ tThe reachable set of states after Δ t
δAttenuation rate of control barrier function
C , cCondition number and its inverse of system observability matrix
f nThe nth objective function
R s a f eThe minimum safe distance between the UAV and target
R s w i t c hThe distance for switching solutions
ρNonlinear constrain penalty function
S o l u t i o n RPareto optimal solution with smallest value in distance
S o l u t i o n T C MOptimal solution selected by the TCM

References

  1. Austin, R. Unmanned Aircraft Systems: UAVS Design, Development and Deployment; John Wiley & Sons: Hoboken, NJ, USA, 2011. [Google Scholar]
  2. Beard, R.W.; McLain, T.W. Small Unmanned Aircraft: Theory and Practice; Princeton University Press: Princeton, NJ, USA, 2012. [Google Scholar]
  3. Valavanis, K.P.; Vachtsevanos, G.J. Handbook of Unmanned Aerial Vehicles; Springer: Berlin/Heidelberg, Germany, 2015; Volume 1. [Google Scholar]
  4. Kocer, B.B.; Tjahjowidodo, T.; Pratama, M.; Seet, G.G.L. Inspection-while-flying: An autonomous contact-based nondestructive test using UAV-tools. Autom. Constr. 2019, 106, 102895. [Google Scholar] [CrossRef]
  5. Aleotti, J.; Micconi, G.; Caselli, S.; Benassi, G.; Zambelli, N.; Bettelli, M.; Zappettini, A. Detection of nuclear sources by UAV teleoperation using a visuo-haptic augmented reality interface. Sensors 2017, 17, 2234. [Google Scholar] [CrossRef]
  6. Atif, M.; Ahmad, R.; Ahmad, W.; Zhao, L.; Rodrigues, J.J. UAV-assisted wireless localization for search and rescue. IEEE Syst. J. 2021, 15, 3261–3272. [Google Scholar] [CrossRef]
  7. Singla, A.; Padakandla, S.; Bhatnagar, S. Memory-based deep reinforcement learning for obstacle avoidance in UAV with limited environment knowledge. IEEE Trans. Intell. Transp. Syst. 2019, 22, 107–118. [Google Scholar] [CrossRef]
  8. Abdallah, A.; Ali, M.Z.; Mišić, J.; Mišić, V.B. Efficient security scheme for disaster surveillance UAV communication networks. Information 2019, 10, 43. [Google Scholar] [CrossRef]
  9. Yao, H.; Qin, R.; Chen, X. Unmanned aerial vehicle for remote sensing applications—A review. Remote Sens. 2019, 11, 1443. [Google Scholar] [CrossRef]
  10. Yang, X.; He, S.; Lin, D.; Wang, Y. Distributed unmanned aerial vehicle positioning for multitarget tracking with bearing-only information. J. Aerosp. Inf. Syst. 2022, 19, 355–365. [Google Scholar] [CrossRef]
  11. Whitacre, W.; Campbell, M.; Wheeler, M.; Stevenson, D. Flight results from tracking ground targets using seascan uavs with gimballing cameras. In Proceedings of the 2007 American Control Conference, New York, NY, USA, 9–13 July 2007; pp. 377–383. [Google Scholar]
  12. Anjaly, P.; Ratnoo, A. Observability enhancement of maneuvering target with bearings-only information. J. Guid. Control Dyn. 2018, 41, 184–198. [Google Scholar] [CrossRef]
  13. He, S.; Shin, H.-S.; Tsourdos, A. Trajectory optimization for target localization with bearing-only measurement. IEEE Trans. Robot. 2019, 35, 653–668. [Google Scholar] [CrossRef]
  14. Yang, X.; Lin, D.; Zhang, F.; Song, T.; Jiang, T. High accuracy active stand-off target geolocation using UAV platform. In Proceedings of the 2019 IEEE International Conference on Signal, Information and Data Processing (ICSIDP), Chongqing, China, 11–13 December 2019; pp. 1–4. [Google Scholar]
  15. Frew, E.W. Sensitivity of cooperative target geolocalization to orbit coordination. J. Guid. Control Dyn. 2008, 31, 1028–1040. [Google Scholar] [CrossRef]
  16. Corbets, J.; Langelaan, J. Parameterized optimal trajectory generation for target localization. In Proceedings of the AIAA Guidance, Navigation and Control Conference and Exhibit, Hilton Head, SC, USA, 20–23 August 2007; p. 6750. [Google Scholar]
  17. Whitacre, W.; Campbell, M.E. Decentralized geolocation and bias estimation for uninhabited aerial vehicles with articulating cameras. J. Guid. Control Dyn. 2011, 34, 564–573. [Google Scholar] [CrossRef]
  18. Shu, K.; Wan, X.; Yi, J.; Gong, Z. A New Bias Pseudo-Measurement Method for Distributed Sensor Bias Estimation. In Proceedings of the 2021 CIE International Conference on Radar (Radar), Haikou, China, 15–19 December 2021; pp. 2595–2599. [Google Scholar]
  19. Okello, N.N.; Challa, S. Joint sensor registration and track-to-track fusion for distributed trackers. IEEE Trans. Aerosp. Electron. Syst. 2004, 40, 808–823. [Google Scholar] [CrossRef]
  20. Ribeiro, M.I. Kalman and extended kalman filters: Concept, derivation and properties. Inst. Syst. Robot. 2004, 43, 3736–3741. [Google Scholar]
  21. St-Pierre, M.; Gingras, D. Comparison between the unscented Kalman filter and the extended Kalman filter for the position estimation module of an integrated navigation information system. In Proceedings of the IEEE Intelligent Vehicles Symposium, Parma, Italy, 14–17 June 2004; pp. 831–835. [Google Scholar]
  22. Liu, Y.-Y.; Slotine, J.-J.; Barabási, A.-L. Observability of complex systems. Proc. Natl. Acad. Sci. USA 2013, 110, 2460–2465. [Google Scholar] [CrossRef]
  23. Kim, D.; Lee, B.; Sung, S. Observability-Driven Path Planning Design for Securing Three-Dimensional Navigation Performance of LiDAR SLAM. Aerospace 2023, 10, 492. [Google Scholar] [CrossRef]
  24. Skaloud, J.; Lichti, D. Rigorous approach to bore-sight self-calibration in airborne laser scanning. ISPRS J. Photogramm. Remote Sens. 2006, 61, 47–59. [Google Scholar] [CrossRef]
  25. James, M.R.; Robson, S.; d’Oleire-Oltmanns, S.; Niethammer, U. Optimising UAV topographic surveys processed with structure-from-motion: Ground control quality, quantity and bundle adjustment. Geomorphology 2017, 280, 51–66. [Google Scholar] [CrossRef]
  26. Sen, A.; Sahoo, S.R.; Kothari, M. Cooperative Formation Control Strategy to Maximize Target Information. In Proceedings of the 2018 AIAA Information Systems-AIAA Infotech @ Aerospace, Kissimmee, FL, USA, 8–12 January 2018. [Google Scholar]
  27. Yang, B.; Huang, H.; Cao, L. Centered error entropy-based sigma-point Kalman filter for spacecraft state estimation with non-Gaussian noise. Space Sci. Technol. 2022, 2020, 9854601. [Google Scholar] [CrossRef]
  28. Stachura, M.; Frew, E.W. Cooperative target localization with a communication-aware unmanned aircraft system. J. Guid. Control Dyn. 2011, 34, 1352–1362. [Google Scholar] [CrossRef]
  29. Sen, A.; Kothari, M.; Sahoo, S.R. Information-rich formation tracking: A unified scheme of cooperative control and localization. J. Aerosp. Inf. Syst. 2020, 17, 390–406. [Google Scholar] [CrossRef]
  30. Klein, I.; Diamant, R. Observability analysis of DVL/PS aided INS for a maneuvering AUV. Sensors 2015, 15, 26818–26837. [Google Scholar] [CrossRef] [PubMed]
  31. Jin, B.; Xu, H.; Peng, J.; Lu, K.; Lu, Y. Derivative-Free Observability Analysis for Sensor Placement Optimization of Bioinspired Flexible Flapping Wing System. Biomimetics 2022, 7, 178. [Google Scholar] [CrossRef] [PubMed]
  32. Moranduzzo, T.; Melgani, F. Detecting cars in UAV images with a catalog-based approach. IEEE Trans. Geosci. Remote Sens. 2014, 52, 6356–6367. [Google Scholar] [CrossRef]
  33. Habib, A.; Zhou, T.; Masjedi, A.; Zhang, Z.; Flatt, J.E.; Crawford, M. Boresight calibration of GNSS/INS-assisted push-broom hyperspectral scanners on UAV platforms. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 1734–1749. [Google Scholar] [CrossRef]
  34. Cui, Y.; Meng, X.; Qiao, J. A multi-objective particle swarm optimization algorithm based on two-archive mechanism. Appl. Soft Comput. 2022, 119, 108532. [Google Scholar] [CrossRef]
  35. Bahmanyar, D.; Razmjooy, N.; Mirjalili, S. Multi-objective scheduling of IoT-enabled smart homes for energy management based on Arithmetic Optimization Algorithm: A Node-RED and NodeMCU module-based technique. Knowl. Based Syst. 2022, 247, 108762. [Google Scholar] [CrossRef]
  36. Lin, X.; Wang, Z.; Zeng, S.; Huang, W.; Li, X. Real-time optimization strategy by using sequence quadratic programming with multivariate nonlinear regression for a fuel cell electric vehicle. Int. J. Hydrogen Energy 2021, 46, 13240–13251. [Google Scholar] [CrossRef]
  37. Hermann, R.; Krener, A. Nonlinear controllability and observability. IEEE Trans. Autom. Control 1977, 22, 728–740. [Google Scholar] [CrossRef]
  38. Liscum, E.; Askinosie, S.K.; Leuchtman, D.L.; Morrow, J.; Willenburg, K.T.; Coats, D.R. Phototropism: Growing towards an understanding of plant movement. Plant Cell 2014, 26, 38–55. [Google Scholar] [CrossRef]
  39. Izzo, L.G.; Aronne, G. Root tropisms: New insights leading the growth direction of the hidden half. Plants 2021, 10, 220. [Google Scholar] [CrossRef]
  40. Ames, A.D.; Coogan, S.; Egerstedt, M.; Notomista, G.; Sreenath, K.; Tabuada, P. Control barrier functions: Theory and applications. In Proceedings of the 2019 18th European Control Conference (ECC), Naples, Italy, 25–28 June 2019; pp. 3420–3431. [Google Scholar]
  41. Ames, A.D.; Xu, X.; Grizzle, J.W.; Tabuada, P. Control barrier function based quadratic programs for safety critical systems. IEEE Trans. Autom. Control 2016, 62, 3861–3876. [Google Scholar] [CrossRef]
  42. Wang, H.; Olhofer, M.; Jin, Y. A mini-review on preference modeling and articulation in multi-objective optimization: Current status and challenges. Complex Intell. Syst. 2017, 3, 233–245. [Google Scholar] [CrossRef]
  43. Li, H.; Deb, K.; Zhang, Q.; Suganthan, P.N.; Chen, L. Comparison between MOEA/D and NSGA-III on a set of novel many and multi-objective benchmark problems with challenging difficulties. Swarm Evol. Comput. 2019, 46, 104–117. [Google Scholar] [CrossRef]
  44. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey wolf optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef]
  45. Mirjalili, S.; Saremi, S.; Mirjalili, S.M.; Coelho, L.d.S. Multi-objective grey wolf optimizer: A novel algorithm for multi-criterion optimization. Expert. Syst. Appl. 2016, 47, 106–119. [Google Scholar] [CrossRef]
  46. Fan, W.; Xu, Z.; Wu, B.; He, Y.; Zhang, Z. Structural multi-objective topology optimization and application based on the criteria importance through intercriteria correlation method. Eng. Optim. 2022, 54, 830–846. [Google Scholar] [CrossRef]
  47. Ozturk, D.; Batuk, F. Technique for order preference by similarity to ideal solution (TOPSIS) for spatial decision problems. In Proceedings of the ISPRS, Munich, Germany, 5–7 October 2011. [Google Scholar]
Figure 1. Model of the relative motion of the UAV and the target.
Figure 1. Model of the relative motion of the UAV and the target.
Biomimetics 09 00510 g001
Figure 2. Phototropism in plants, where the bulb represents the light source.
Figure 2. Phototropism in plants, where the bulb represents the light source.
Biomimetics 09 00510 g002
Figure 3. Effect of δ ( 0 < δ 2 < δ 1 < 1) on system observability: (a) δ = δ 1; (b) δ = δ 2. The gray oval with a solid outline denotes the unobservable area. The gray oval with a dashed outline and the white oval with a solid outline represent S Δ t and R Δ t, respectively. The black dot represents x ( t ).
Figure 3. Effect of δ ( 0 < δ 2 < δ 1 < 1) on system observability: (a) δ = δ 1; (b) δ = δ 2. The gray oval with a solid outline denotes the unobservable area. The gray oval with a dashed outline and the white oval with a solid outline represent S Δ t and R Δ t, respectively. The black dot represents x ( t ).
Biomimetics 09 00510 g003
Figure 4. Heatmap of the inverse of condition number. The horizontal coordinate represents the ratio of the relative distance between the UAV and the target to the speed of the UAV. The vertical coordinate represents the angle of separation.
Figure 4. Heatmap of the inverse of condition number. The horizontal coordinate represents the ratio of the relative distance between the UAV and the target to the speed of the UAV. The vertical coordinate represents the angle of separation.
Biomimetics 09 00510 g004
Figure 5. The Pareto solution of the two objective optimization problems. The black dots denote the Pareto solution while the blue dots do not. The red line denotes the Pareto front.
Figure 5. The Pareto solution of the two objective optimization problems. The black dots denote the Pareto solution while the blue dots do not. The red line denotes the Pareto front.
Biomimetics 09 00510 g005
Figure 6. Process of the TOPSIS and CRITIC methods (TCM).
Figure 6. Process of the TOPSIS and CRITIC methods (TCM).
Biomimetics 09 00510 g006
Figure 7. Flowchart of the algorithm for biased bearing information-only target localization and UAV trajectory optimization based on observability enhancement.
Figure 7. Flowchart of the algorithm for biased bearing information-only target localization and UAV trajectory optimization based on observability enhancement.
Biomimetics 09 00510 g007
Figure 8. UAV trajectory optimization results for a stationary target: (a) trajectory of UAV; (b) estimated position of target; (c) localization error; (d) estimation error of bias b; (e) speed of UAV; (f) turn rate of UAV; (g) bearing and flight path angle; (h) separation angle; (i) attenuation rate of CBF; (j) relative distance; (k) rank of observability matrix; and (l) condition number.
Figure 8. UAV trajectory optimization results for a stationary target: (a) trajectory of UAV; (b) estimated position of target; (c) localization error; (d) estimation error of bias b; (e) speed of UAV; (f) turn rate of UAV; (g) bearing and flight path angle; (h) separation angle; (i) attenuation rate of CBF; (j) relative distance; (k) rank of observability matrix; and (l) condition number.
Biomimetics 09 00510 g008aBiomimetics 09 00510 g008b
Figure 9. UAV trajectory optimization results between the open-loop and closed-loop mode: (a) estimated location of the target; (b) condition number; (c) localization error; and (d) estimation error of bias b.
Figure 9. UAV trajectory optimization results between the open-loop and closed-loop mode: (a) estimated location of the target; (b) condition number; (c) localization error; and (d) estimation error of bias b.
Biomimetics 09 00510 g009
Figure 10. Trajectory optimization results for different δ values: (a) trajectory of UAV; (b) relative distance; (c) localization error; and (d) condition number.
Figure 10. Trajectory optimization results for different δ values: (a) trajectory of UAV; (b) relative distance; (c) localization error; and (d) condition number.
Biomimetics 09 00510 g010aBiomimetics 09 00510 g010b
Figure 11. UAV trajectory optimization for different initial flight path angles: (a) trajectory of the UAV; (b) separation angle.
Figure 11. UAV trajectory optimization for different initial flight path angles: (a) trajectory of the UAV; (b) separation angle.
Biomimetics 09 00510 g011
Figure 12. State estimation errors between the NCMOGWOA, MOPSOA, MOAOA, and SQP Methods: (a) localization errors; (b) estimation error of b.
Figure 12. State estimation errors between the NCMOGWOA, MOPSOA, MOAOA, and SQP Methods: (a) localization errors; (b) estimation error of b.
Biomimetics 09 00510 g012
Figure 13. Comparison of the Pareto optimal solution sets over 50 runs: (a) t = 10 s; (b) t = 50 s.
Figure 13. Comparison of the Pareto optimal solution sets over 50 runs: (a) t = 10 s; (b) t = 50 s.
Biomimetics 09 00510 g013
Table 1. UAV trajectory optimization parameters for a stationary target.
Table 1. UAV trajectory optimization parameters for a stationary target.
ParameterValue
Position for stationary target(500 m, 500 m)
Initial position for UAV(20 m, 20 m)
Bearing measurement bias b 3 °
Range of UAV speed [ 5   m / s , 30   m / s ]
Range of UAV turn rate [ 15 ° / s , 15 ° / s ]
Smallest safe distance R s a f e 50 m
Sampling time Δ t 0.1 s
Switch distance R s w i t c h 300 m
Parameters in Equation (35) α = 0.1
Population size M P 100
Iteration times T P 10
p g , p f p g = p f = 1000
Range of δ [ 0.01 , 0.2 ]
Grid number N g r i d 30
β in NCMOGWOA4
γ in NCMOGWOA2
Table 2. RMSEs for the open-loop and closed-loop modes over 100 Monte Carlo experiments.
Table 2. RMSEs for the open-loop and closed-loop modes over 100 Monte Carlo experiments.
ModeOpen-Loop ModeClosed-Loop Mode
Localization error mean, m14.708211.6686
Localization error Std, m13.424611.0246
Table 3. RMSEs for different δ values over 100 Monte Carlo experiments.
Table 3. RMSEs for different δ values over 100 Monte Carlo experiments.
δ Value δ 1 = 0.06 δ 2 = 0.08 δ 3 = 0.1
Localization error mean, m10.042710.855411.7662
Localization error Std, m11.022911.927913.3564
Table 4. Simulation parameters for the MOPSOA, MOAOA, and SQP.
Table 4. Simulation parameters for the MOPSOA, MOAOA, and SQP.
Common ParametersMOPSOAMOAOASQP
Population
Size
Iteration
Time
Archive
Size
w c 1 c 2 β γ α ρ
10010500.61.222210.5
Table 5. RMSE for different algorithms over 100 Monte Carlo experiments.
Table 5. RMSE for different algorithms over 100 Monte Carlo experiments.
AlgorithmNCMOGWOAMOPSOAMOAOASQP Method
Localization error mean, m13.164614.112519.481913.5077
Localization error Std, m10.564512.611415.069610.3645
Bearing bias mean, °1.34461.46371.59031.3921
Bearing bias Std, °1.15441.29381.40861.1544
Table 6. GD and IGD metrics over 100 Monte Carlo experiments.
Table 6. GD and IGD metrics over 100 Monte Carlo experiments.
AlgorithmNCMOGWOAMOPSOAMOAOASQP Method
GD mean7.34427.99678.69067.7604
GD Std5.40624.96326.22755.0342
GD median6.90887.76468.83647.8621
IGD mean8.45778.89959.65548.6613
IGD Std7.86546.68778.22497.2145
IGD median8.51248.64669.43368.6027
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Peng, J.; Wang, Q.; Jin, B.; Zhang, Y.; Lu, K. Trajectory Optimization to Enhance Observability for Bearing-Only Target Localization and Sensor Bias Calibration. Biomimetics 2024, 9, 510. https://doi.org/10.3390/biomimetics9090510

AMA Style

Peng J, Wang Q, Jin B, Zhang Y, Lu K. Trajectory Optimization to Enhance Observability for Bearing-Only Target Localization and Sensor Bias Calibration. Biomimetics. 2024; 9(9):510. https://doi.org/10.3390/biomimetics9090510

Chicago/Turabian Style

Peng, Jicheng, Qianshuai Wang, Bingyu Jin, Yong Zhang, and Kelin Lu. 2024. "Trajectory Optimization to Enhance Observability for Bearing-Only Target Localization and Sensor Bias Calibration" Biomimetics 9, no. 9: 510. https://doi.org/10.3390/biomimetics9090510

APA Style

Peng, J., Wang, Q., Jin, B., Zhang, Y., & Lu, K. (2024). Trajectory Optimization to Enhance Observability for Bearing-Only Target Localization and Sensor Bias Calibration. Biomimetics, 9(9), 510. https://doi.org/10.3390/biomimetics9090510

Article Metrics

Back to TopTop