Autonomous Trajectory Planning and Control of Anti-Radiation Loitering Munitions under Uncertain Conditions

Bai, Linyuan; Luo, Hongchuan; Ling, Haifeng

doi:10.3390/electronics10192399

Open AccessArticle

Autonomous Trajectory Planning and Control of Anti-Radiation Loitering Munitions under Uncertain Conditions

by

Linyuan Bai

,

Hongchuan Luo

^* and

Haifeng Ling

Field Engineering College, Army Engineering University of PLA, Nanjing 210022, China

^*

Author to whom correspondence should be addressed.

Electronics 2021, 10(19), 2399; https://doi.org/10.3390/electronics10192399

Submission received: 26 August 2021 / Revised: 28 September 2021 / Accepted: 29 September 2021 / Published: 30 September 2021

(This article belongs to the Special Issue Guidance, Navigation, and Fault-Tolerant Control of Autonomous Systems)

Download

Browse Figures

Versions Notes

Abstract

:

As an autonomous system, an anti-radiation loitering munition (LM) experiences uncertainty in both a priori and sensed information during loitering because it is difficult to accurately know target radar information in advance, and the sensing performance of the seeker is affected by disturbance and errors. If, as it does in the state of the art, uncertainties are ignored and the LM travels its planned route, its battle effectiveness will be severely restricted. To tackle this problem, this paper studies the method of autonomous planning and control of loitering routes using limited a priori information of target radar and real-time sensing results. We establish a motion and sensing model based on the characteristics of anti-radiation LMs and use particle filtering to iteratively infer the target radar information. Based on model predictive control, we select a loitering path to minimize the uncertainty of the target information, so as to achieve trajectory planning control that is conducive to the acquisition of target radar information. Simulation results show that the proposed method can effectively complete the autonomous trajectory planning and control of anti-radiation LMs under uncertain conditions.

Keywords:

anti-radiation loitering munition; loiter; control; uncertainty; Bayesian inference; particle filter

1. Introduction

An anti-radiation loitering munition (LM) can use the enemy’s electromagnetic radiation to guide, suppress, and destroy its air defense warning and guidance radar [1], and it is mainly used to perform the suppression of enemy air defense (SEAD) missions, creating conditions for its own air combat forces to penetrate defenses. Since the mid-1980s, many militaries have developed or introduced anti-radiation LMs, which have become an important force to suppress and destroy an opponent’s air defense. In the armed conflict between Azerbaijan and Armenia in the Naka region in 2020, Harop, as the improved version of Israeli’s Harpy, received great attention for destroying S-300PS long-range surface-to-air missile systems [2].

During battle, operational parameters such as the transit and loitering paths of an anti-radiation LM are input to its navigation controller. After launch, it relies on the autopilot and positioning and navigating equipment to perform automatically along the pre-programmed path. Only when the target radar signal is intercepted during the search will it switch from loitering to tracking and attacking mode. Since there is no need for humans to participate in guidance and control in the loop during this process, the anti-radiation LM is regarded as a “fire and forget” weapon [3], as shown in Figure 1. Its path during the loitering phase is planned according to the known target radar position and the characteristics of the seeker’s field of view [4,5], which is generally in a fixed figure-eight or runway pattern. However, in an actual battle with fierce confrontation and a complex information environment, it may not be possible to accurately obtain the target radar information in advance. At the same time, the anti-radiation seeker’s sensing results still have uncertainties such as missed signal detections, false alarms, and random errors. Therefore, it is necessary to study the trajectory planning and control of an anti-radiation LM under uncertain conditions.

Studies on the planning and control of unmanned systems have been increasing for environmental monitoring [6] and search and rescue [7] under uncertain conditions. Schlotfeldt et al. [8] studied the problem of searching for uncertain targets of a Gaussian distribution by ground robots with sensing with Gaussian additive noise; they used invariant extended Kalman filter (IEKF) to iteratively estimate the target position and the variance of the target estimate as the path cost to plan the direction of a robot’s motions. The feasibility of information entropy, mutual information, and KL divergence was discussed from the perspective of sensor deployment as an indicator of target information uncertainty, and it was noted to be equivalent to variance measures when the prior and noise follow Gaussian distributions [9]. However, when the distribution does not have a unimodal characteristic, to use variance as a metric will misdirect the uncertainty estimation of the target information.

Therefore, based on the modeling of the anti-radiation LM’s motion and sensing characteristics, we use particle filtering to perform Bayesian iterative inferencing of target radar information to solve the problem of estimating the target radar state under the conditions of a priori and sensing uncertainty. Then, we minimize the uncertainty of the target information measured by conditional entropy as the control optimization objective. Using a model predictive control method, we select the heading of the anti-radiation LM for the airborne autopilot to execute and achieve the loitering trajectory planning control that is conducive to the acquisition of target radar information. The general idea of planning control is shown in Figure 2. The effectiveness of the method is verified by simulation experiments.

The rest of this paper is consisted as follows. Firstly, the description and modeling of the problem are formulated in Section 2. Then, the inference of target radar position is described in Section 3. Section 4 shows how the model predictive control method is applied to the control of loitering under uncertainty. Simulation results and conclusions are mentioned in Section 5 and Section 6, respectively.

2. Problem Description and Modeling

2.1. Mission Scenario

Assume a fixed target radar station in a known battle airspace

A = W \times H

. An anti-radiation unmanned aerial vehicle has arrived and entered the autonomous patrol mode.

2.2. Anti-Radiation LM Trajectory Planning Model

The motion planning models used in LM trajectory planning usually include free mass point [4], direction-limited mass point [10], and first-order coordinated turn [11]. The more accurate the trajectory planning model, the easier it is for the flight controller to complete the planned trajectories or selected control actions. The anti-radiation LM seeker is mounted on the nose and is limited by the field of view. During loitering, by default, the airborne autopilot uses coordinated turns to control the heading, keeping the nose direction consistent with the flight direction. Some models of anti-radiation LMs are specially designed with side-enforced plates to assist in accurate and coordinated turning [1].

Referencing the coordinated turning motion model, assuming the anti-radiation LM is flying in the battle airspace at a constant speed and height during the loitering phase, the control action space is a finite discrete yaw angle, and the factor of flight height is ignored, we build a two-dimensional trajectory planning model on the flight plane of the LM,

\begin{matrix} s_{k + 1} & = {[\begin{matrix} s_{k + 1}^{E} & s_{k + 1}^{N} & φ_{k + 1} \end{matrix}]}^{T} \\ = [\begin{matrix} s_{k}^{E} + v c o s (φ_{k + 1}) Δ t \\ s_{k}^{N} + v s i n (φ_{k + 1}) Δ t \\ φ_{k} \end{matrix}] + [\begin{matrix} 0 \\ 0 \\ 1 \end{matrix}] u_{k}, \end{matrix}

(1)

where

s_{k}^{E}

and

s_{k}^{N}

are the east and north coordinates, respectively, of the anti-radiation LM relative to a certain origin of the battle airspace at time

k

in the northeast coordinate system;

v

is a constant speed;

φ_{k}

is the yaw angle; and

u_{k} \in U = \{0, - ω_{0}, ω_{0}\}

denotes the control actions of the anti-radiation LM at time

k

, i.e., level flight, and yawing to the left and right.

2.3. Radar Target Modeling

The state of the target radar consists of its position coordinates and transmission power, which is represented by a vector,

x = {[\begin{matrix} x^{E} & x^{N} & T_{0} \end{matrix}]}^{T}

, where

x^{E}

and

x^{N}

are the east and north coordinates, respectively, of the radar target in the northeast coordinate system of a certain origin in the battle airspace, and

T_{0} > 0

is the radar transmission power reference constant, which is determined by the logarithmic transmission power at a fixed distance

d_{0}

from the radar transmitter. For the anti-radiation LM,

x

is an unknown vector, in the form of an a priori probability distribution based on pre-battle intelligence information and the commander’s judgment. When the target radar position information is unknown, it is considered uniformly distributed in the battle airspace.

2.4. Seeker Sensing Modeling

The target radar information sensed by an anti-radiation LM consists mainly of the signal amplitude and angle of arrival measured by the seeker on the signal within the main lobe of the antenna. Due to the complex electromagnetic environment of the battlefield, in addition to the real signal of the target radar are environmental background noise and impact clutter [12], which lead to uncertainties such as missed signal detection, false alarms, and random errors in the seeker’s sensing results.

Under noise disturbance, the sensing result

y_{k}

obtained by the anti-radiation LM seeker at time

k

can be expressed as [13]

y_{k} = h_{n} (h (s_{k}, x), ν_{k} (s_{k}, x)),

(2)

where

y_{k} \in ℝ^{n_{y}}

is the sensing result of the seeker at time

k

,

ν_{k}

is the noise related to the LM’s pose and environment,

h (\cdot)

is the theoretical sensing result without noise disturbance, and

h_{n} (\cdot)

is a nonsingular vector transformation that characterizes the sensing result.

Assuming that the noise level

r

obeys a zero-mean Gaussian distribution with variance

σ

, and the consistant false-alarm rate is

P_{f}

, the radar signal radiation intensity

r s s

reaching the receiver can be modeled as

r s s = T_{0} - 10 T \log (d) + r,

(3)

where

T > 1

is a constant related to the electromagnetic propagation mode, and

d = \frac{\sqrt{{(s^{E} - x^{E})}^{2} + {(s^{N} - x^{N})}^{2}}}{d_{0}}

is the relative distance between the LM and the radar target.

Affected by internal device thermal noise and environmental background noise, the seeker’s detection of radar signals depends on setting the detection threshold and filtering out signals below it. Too high a detection threshold will decrease the sensitivity of detection and miss the target, while too low a threshold will fail to filter out the noise and will generate false signals. The consequences of failure due to too many false alarms caused by saturation of the back-end signal processing module are more serious than those of missed detection. Therefore, modern anti-radiation LM seekers usually adopt a constant false-alarm detection system [14] whose detection threshold is set according to noise intensity and a given false-alarm probability. The adaptive detection threshold

t h

satisfies

P (r \geq t h) = P_{f}

; hence,

t h = q_{σ} (1 - P_{f})

, where

q_{σ} (\cdot)

is the Gaussian distribution quantile. Then, the signal detection probability determined by a given false-alarm rate, noise, and target radar radiation signal strength is

P_{d} = 1 - P (r s s \leq t h)

.

The arrival angle of a signal is obtained by the direction measurement of the antenna array mounted on the front of the LM, whose zero direction is consistent with the direction of the LM fuselage. The signal arrival angle measurement can be modeled as

b e a r = atan 2 (\frac{x^{N} - s^{N}}{x^{E} - s^{E}}) + ω,

(4)

where

atan 2

is the four-quadrant inverse tangent function, and

ω

is the angle measurement noise introduced by random distortion of the target radar waveform, whose distribution usually has a long tail [15]. We use the symmetric alpha-stable distribution [16],

ω \sim S α S (γ),

(5)

where

α, γ

are the parameters of the symmetric alpha-stable distribution.

So far, the target radar signal

y_{k}

sensed by the anti-radiation LM seeker can be modeled as a set of unordered vectors,

y_{k} = \{\begin{matrix} {[\begin{matrix} r s s \\ b e a r \end{matrix}]}^{j} & r s s \geq t h and |b e a r - φ_{k}| \leq φ_{f o v} \\ \emptyset & otherwise \end{matrix}

(6)

where

φ_{f o v}

is the effective angle measurement range determined by the antenna directional characteristics of the seeker, and

j

is the number of detected signals in the sensing result, which may contain real signals coming from the target radar, or all false signals caused by noise. Assuming that the false-alarm signals are uniformly distributed over the direction measurement range of the seeker, when the seeker’s angular resolution is high (

N_{r e s} ≫ 1

) and the false-alarm rate is low (

P_{f} ≪ 1

), the appearance of false-alarm signals in sensing can be considered a Poisson point process,

P o (λ, φ_{f o v})

, where

λ = P_{f} \cdot N_{r e s}

is the rate parameter.

3. Inferencing Target Radar Position

The anti-radiation LM continuously uses the seeker’s sensing results to update the information of the target radar and perform Bayesian inference in the air. Assuming that the a priori distribution of the target radar state at time

k

is

P_{k} (x) = P (x | y_{k}, s_{k})

, and knowing that the radar target is stationary, the result of the seeker’s sensing at time

k + 1

is

y_{k + 1}

. Using Bayes’ theorem, the probability distribution of the radar target state at time

k + 1

is updated as

\begin{matrix} P_{k + 1} (x) & = \frac{P_{k} (x) P (y_{k + 1} ∣ x, s_{k + 1})}{\int_{x \in X} P_{k} (x) d x} \end{matrix},

(7)

where the finite term summation in the integral part can be performed in the finite state hidden Markov model. Under the linear system and prior Gaussian distribution, the closed-form solution of the parameters of the conjugate distribution can be obtained recursively. However, for a non-Gaussian nonlinear observation, such as Equation (6), Equation (7) has no closed-form recursive solution.

To solve this problem, we use a sampling-importance resampling (SIR) particle filter and implement Bayesian recursion through Monte Carlo simulation. The SIR particle filter obtains a set of weighted random samples (particles) by performing sampling using the prior distribution as the proposed distribution, and it uses the sampled particle likelihood function to update the weights to approximate the posterior probability density. The SIR particle filter uses resampling to reduce particle degradation and improve computational efficiency [17]. Let

〈 {\hat{x}}_{p, k}, w_{p, k} 〉

be the weights of a group of

n_{p}

particles at time

k

, where

w_{p, k} \in ℝ_{+}

and

\sum_{p = 1}^{n_{p}} w_{p, k} = 1

. The probability distribution of the radar target states at time

k

can be approximated by particle weights and positions as

P (x ∣ y_{k}) \approx \sum_{p = 1}^{n_{p}} w_{p, k} δ (x - {\hat{x}}_{p, k})

(8)

where

δ (x - {\hat{x}}_{p, k})

is the unit impulse function at

{\hat{x}}_{p, k}

[17].

SIR particle filtering has prediction and update steps. Since the target radar is stationary, the positions of new states of the randomly sampled particles in a prediction step predicted by the target motion model do not change. In addition, resampling decreases particle diversity (particle depletion) and reduces estimation accuracy. Random disturbances with bandwidths inversely proportional to particle weights are introduced to maintain particle diversity,

{\hat{x}}_{i, k + 1} = {\hat{x}}_{i, k + 1} + \frac{c}{w_{i, k}} ρ

, where

ρ

obeys a standard normal distribution, and

c ≪ 1

is a selected constant [18].

An update step uses the likelihood function of the sensing result to update the particle weight as

w_{i, k + 1} \propto w_{i, k} P (y_{k + 1} ∣ {\hat{x}}_{i, k + 1}, s_{k + 1})

(9)

In the presence of false-alarm signals, the likelihood function of the sensing result

P (y_{k + 1} | x, s_{k})

is obtained from the joint correlation probability of the radar target and the sensing result. Let

𝓎^{i}

be the

i

-th element in the unordered vector set

y_{k + 1}

, which can be obtained according to the LM seeker’s sensing characteristic; see Equation (6). When

∥ y_{k + 1} ∥_{0} > 0

,

\begin{matrix} P (y_{k + 1} | x, s_{k + 1}) & = \end{matrix} \frac{λ^{j - 1} e^{- λ}}{j!} P_{d} (x, s_{k + 1}) \sum_{i = 1}^{j} P (𝓎^{i} | x, s_{k + 1}) (1 - P_{d} (x, s_{k + 1})) \frac{λ^{j} e^{- λ}}{j!},

(10)

and when

∥ y_{k + 1} ∥_{0} = 0

, i.e.,

y_{k + 1} = \emptyset

,

P (y_{k + 1} | x, s_{k + 1}) = 1 - P_{d} (x, s_{k + 1}) .

(11)

At this time, the estimate of the target radar state is

\bar{x} = \sum_{i} w_{i, k} {\hat{x}}_{i, k + 1} .

(12)

4. Loitering Control by Minimizing Target Uncertainty

According to the sensing characteristics of the seeker, the sensing result has uncertain factors due to environmental noise disturbance and measurement errors, and certain factors that are determined by the relative position and orientation between the anti-radiation LM and the target radar. Autonomous anti-radiation LMs can plan and control the selection of loitering paths, and they can fly to a position in an orientation that is conducive to the acquisition of target radar information. This improves the probability of obtaining high-quality sensing and optimizes loitering trajectories.

4.1. Measure of Target Information Uncertainty

The motion and sensing of an anti-radiation LM are highly nonlinear. The sensor noise is not Gaussian. The prior information of the target radar must be extracted from prior intelligence, which cannot be guaranteed to be Gaussian. Therefore, we use information entropy to measure the uncertainty of target radar information.

Information entropy is a widely used measure of the uncertainty of random variables. The higher the information entropy, the greater the uncertainty. Considering target radar states and sensing results as random variables

X, Y

, the information entropy of

X

is

H (X) = - \int_{x \in X} P (x) \log P (x) d x .

(13)

In radar position estimation, omitting the symbol

s

, the conditional probability of using the sensing data to update the estimation of the target radar state is

P (x | y)

. We substitute this in Equation (13) to obtain the conditional entropy of target radar information

X

about sensing variable

Y

,

H (X | Y) = - \int_{y \in Y} \int_{x \in X} P (x, y) \log P (x | y) d x d y .

(14)

Conditional entropy reflects the uncertainty of the target radar information after Bayesian inference to obtain the sensing variable

Y

, and it determines the lower limit of the accuracy of estimating X with Y [19]. The lower the conditional entropy, the higher the achievable estimation accuracy, which satisfies

E [(X - \bar{X} (Y))^{2}] \geq \frac{1}{2 π e} e^{2 H (X | Y)} .

(15)

Let

n

be the time when loitering ends, and let the subscript

1 : n

represent the set of all subscripts

i = 1, 2, \dots, n

. We seek to minimize the conditional entropy of the target radar information on sensing and establish an optimal control model for the choice of loitering path of the anti-radiation LM,

u_{1 : n}^{*} = \underset{u_{1 : n}}{argmin} H (X_{n} | Y_{1 : n}), s . t . \{\begin{matrix} (1), (6) \\ u_{i} \in U, i = 1, 2, \dots, n \end{matrix}

(16)

4.2. Conditional Entropy Calculation Based on Particle Position Weight

Since particle filtering uses a set of discrete weighted particles sampled from the proposed distribution to approximate the target posterior distribution, we cannot directly calculate the conditional entropy [20]. Many conditional entropy approximation algorithms been proposed based on particle filtering [6,21,22,23]. A line segment fitting method was used to approximate conditional entropies [21], which is equivalent to the number of nearest neighbors being one [22]. This method has no parameters to be adjusted, but its accuracy decreases as dimensionality increases. A Gaussian kernel function was used to smooth the posterior distribution represented by weighted particles [6]. This requires the selection of a reasonable kernel bandwidth. First-order historical information of particles was used to approximate the conditional entropy in gradient calculation [22,23], requiring the storage of only the states of particles at the previous moment, with complexity

O (N^{2})

. In this paper, the target radar states are static with

P ({\hat{x}}_{i, k} {\hat{x}}_{i, k - 1}) = 1

. Substituting

P ({\hat{x}}_{i, k} {| \hat{x}}_{i, k - 1})

with constant 1 in Equation (52) in [23] yields an approximation of the conditional entropy,

H (X_{k} | Y_{k}) \approx \ln \sum_{i = 1}^{n_{p}} w_{i, k - 1} P (y_{k} {| \hat{x}}_{i, k - 1}) - \sum_{i = 1}^{n_{p}} w_{i, k} \ln (P (y_{k} {| \hat{x}}_{i, k})) .

(17)

4.3. Model Predictive Optimal Control

As an optimal control problem, the full time-domain optimal closed-loop feedback control law in Equation (16) is usually difficult to obtain. Model predictive control adopts the idea of rolling-horizon optimization, and it only considers optimization in the finite time domain

τ

. It solves the finite time-domain optimization problem in an open loop, taking the control action sequence of the solution, applying it to the current moment, and seeking a solution according to the state information of the next moment. Model predictive optimal control has the following advantages. As a method to solve open-loop optimization problems, it ensures the feasibility of solving problems with complex constraints. Using the system’s state feedback to periodically seek a solution, it improves execution reliability in the presence of environmental disturbances. State prediction takes into account future benefits and can improve system performance.

Expanding Equation (16) according to the step length

τ

yields the model prediction form

u_{k : k + τ}^{*} = \underset{u_{k : k + τ}}{argmin} H (X_{k + τ} | Y_{k : k + τ}), s . t . \{\begin{matrix} (1), (6), \\ u_{i} \in U, i = k, k + 1, \dots, k + τ \end{matrix}

(18)

Since the maneuvering has limited options, based on the motion element graph search method [24], letting

s_{k}

be the root node, we select

u_{i}

from

U

to expand the candidate node

{\hat{s}}_{k + 1}^{i}

, calculate the conditional entropy of the group of weighted particles at time

k

, construct a motion element graph with LM states as nodes, control actions as edges, and consider the conditional entropy as the node score. Traversal search is used to find the optimal solution, and the optimal control action sequence is obtained in Equation (18).

5. Experimental Verification by Simulation

To verify the effectiveness of the proposed method, we compared it to two others in a simulated scenario of an anti-radiation LM arriving in a designated airspace for patrol and searching for a target radar with unknown location and radiation power [4,8].

5.1. Experimental Conditions

The simulated battle airspace was the plane area

A = [- 1000, 1000] \times [- 1000, 1000]

(in units of 100 m). The target radar coordinates were

(300, 300)

. The radar radiation-related coefficient was

T_{0} = 30.88

. The initial state model parameters of the anti-radiation LM were loitering speed

v = 30

(180 km/h), initial heading angle

φ ~ U (0, 2 π)

, yaw angle

ω_{0} = 15 °

, and initial position coordinates

(- 500, - 480)

. The anti-radiation LM airborne seeker sensing model parameters were

T

= 2,

P_{f}

= 5%,

σ

= 3,

φ_{f o v}

= 60°,

α

= 0.75,

γ

= 0.8, and

N_{r e s}

= 120. For a seeker at 350 distance units (3.5 km), the probability that the target radar signal exceeds the detection threshold is 95%. We assumed no advance intelligence information. The target radar position was considered to be uniformly distributed in the battle airspace. Other parameters were as follows: model prediction step

τ = 3

and number of particle filter particles

n_{p}

= 200. The simulation was coded with Octave [25] and ran on a computer with a 3.8-GHz i9 processor with 32 GB memory.

5.2. Benchmark

5.2.1. Stochastic Decision-Making

In a purely stochastic strategy, each LM randomly selects a command that will not fly out of the battle airspace to execute in the optional action space at each decision moment.

5.2.2. Re-Planning Based on Field-of-View Coverage

Under the uncertainty conditions in this study, there is no known actual position information of the radar target, and the method in [4] cannot be used directly. Therefore, the path planning method of [4] was used only after a particle filter with the same parameters as ours was used to estimate the target position. When the sensing result of the anti-radiation LM seeker is updated, the target position estimate is updated, the route is re-planned, and the flight path of the anti-radiation LM is controlled according to the new route. Parameters used are

p o p s i z e = 200

,

p c = 0.9

,

p m = 0.05

, and

M A X G E N = 50

. Refer to [4] for details.

5.2.3. Method Assuming Gaussian Noise

The IEKF method [8] was used to replace the particle filter algorithm used in this study to test the effectiveness of the particle filter in nonlinear and non-Gaussian cases. In this method, the nonlinear motion and sensing models are approximated by a first-order linear expansion, and the non-Gaussian noise component in the angle measurement is approximated by Gaussian noise

ω ~ N (0, 0.4)

. Since the prior distribution of the target radar information in the IEKF method must also be Gaussian, uniform distributions cannot be used. Each simulation randomly selects a Gaussian distribution with a center uniformly distributed within

[0, 600] \times [0, 600] \times [20, 40]

and variance

D i a g (33^{2}, 33^{2}, 20^{2})

as the prior distribution of target radar information.

5.3. Simulation Results and Analysis

The first simulation experiment is conducted in a fixed total number of simulation steps (

n_{s t e p}

=150) to see the difference of performance among the methods to be compared.

Table 1 shows the statistical results of target radar conditional entropy

H (X | Y_{1 : n})

and position RMSE at the end of 100 repeated simulations of the four methods. A target radar position RMSE is the Euclidean distance between the filtering-estimated position coordinates at time

k

and the true values, i.e.,

R M S E_{k} = \sqrt{{({\bar{x}}_{k}^{E} - x_{k}^{E})}^{2} + {({\bar{x}}_{k}^{N} - x_{k}^{N})}^{2}}

. RMSE reflects the absolute accuracy of the system’s estimation of the target radar position. The smaller the RMSE, the higher the estimation accuracy. The data in Table 1 show that at the end of the simulation, the target radar conditional entropy and position RMSE obtained by the proposed method were the lowest, which were about 30% of those obtained by the “field-of-view coverage” method with the re-planning mechanism.

Figure 3 shows the variations of target radar mean conditional entropy and position RMSE with the number of steps during 100 simulations. The rates of decrease in conditional entropy and RMSE of the proposed method are greater than those of the other methods, indicating that the planning control method with conditional entropy as the optimization objective can better deal with uncertainty during loitering, achieves better final estimation accuracy, and reduces the effect of uncertainty.

Compared with stochastic decision making, the method of constantly re-planning based on the sensing of field-of-view coverage is beneficial to the estimation of the target radar position, but the performance is not as good as that of the IEKF method based on the Gaussian assumption and the proposed method. This is because of the following: (1) the method of constantly re-planning based on the sensing of field-of-view coverage makes advance planning in the prediction step without considering the motion characteristics of anti-radiation LMs; and (2) in the stage of high uncertainty, this method is still directly based on the estimated target radar position to plan the route, and the use of low-precision estimation results degrades performance.

Based on the Gaussian assumption, the IEKF method has a slightly lower initial conditional entropy due to the different selection of prior distributions. As the simulation progresses, the rates of decrease in RMSE and conditional entropy are not as great as those of the proposed method because noise with an alpha-stable state distribution has a higher tail probability than noise with the Gaussian distribution, and there are more outliers. Furthermore, with the loss of accuracy due to the linear approximation of IEKF, its estimation accuracy under nonlinear non-Gaussian noise scenarios is less than that of the particle filter method used in this paper. This lack of accuracy affects the choice of control actions and further affects the acquisition of target radar information. Therefore, the particle filter used in this paper can improve performance in nonlinear dynamic and non-Gaussian noise sensing scenarios.

As shown in Figure 3, the RMSE and conditional entropies obtained by different methods change with the simulation steps in three stages. At the beginning, while the conditional entropy decreases, the RMSE does not decrease. At this stage, the anti-radiation LM quickly reduces the uncertainty of target radar information through sensing, but because the estimated distribution of the target radar still is not unimodal, the estimated deviations are always relatively large. Subsequently, the decrease in conditional entropy is basically consistent with that of RMSE, indicating that the anti-radiation LM continues to reduce uncertainty through sensing while simultaneously improving estimation accuracy. Finally, the decline in conditional entropy and RMSE slows down, marginal information gains during loitering begin to decrease, and improvement of estimation accuracy slows down. At this time, according to the mission objective, the seeker can be switched from the wide-area search mode to the tracking mode, and the anti-radiation LM ends the loitering stage and executes a diving attack.

Figure 4 shows the statistical results of total control inputs during the simulation. The total control

t c

inputs are accumulated by control inputs

c i_{k}

at each time step as:

t c = \sum_{k} c i_{k}, \{\begin{matrix} c i_{k} = 0; if u_{k} = 0 \\ c i_{k} = 1; else \end{matrix}

(19)

where total control inputs

t c

reflects the total control efforts consumed during the patrolling process and related to energy consumption of the flight trajectory. The smaller the total control inputs are, the less energy is consumed during the patrolling process. It can be inferred from Figure 4 that the total control inputs of the four methods compared here, including the proposed method, are similar in average. However, the total control inputs of the “Coverage of Field”, “Iterative EKF”, and “Proposed Method” are slightly higher than the “Random Strategy” in that “Random Strategy” ignores the perception results and choses the control action randomly. That the total control inputs of the proposed method are similar to the other methods shows that the performance improvement shown in Figure 3 depends on the cost of control efforts or energy consumption.

Figure 5 shows the trajectories and final particle distributions of the anti-radiation LM in a randomly selected simulation. Although, due to the uncertainty of a priori and sensing information, the planned path and inferred particle distribution of the anti-radiation LM are different in each simulation, it can be seen that the trajectory generated by our method circles more around the target radar, and it is closer to the planned trajectory when the conditions are certain; at the end of the inference, the distribution of particles is also more concentrated near the true position.

Table 2 shows the statistical results of the time step to reach the desired RMSE during the simulation. The lower the desired RMSE needed, the faster it is achieved. The specific time to reach the desired RMSE for anti-radiation LM differs from the different methods. Among these methods, the proposed method performs best, especially in low RMSE scenarios. “Random Strategy” and “Coverage of Field” cannot reach the desired RMSE of 100 m in 2000 time steps, which is more than 10 times what the proposed method does.

6. Conclusions

We studied the autonomous trajectory planning and control of anti-radiation LMs under conditions of uncertain a priori information of the target radar and the seeker’s sensing. By modeling the problem as an optimal control problem that minimizes the uncertainty of target information, combined with particle filtering and model predictive control methods, better loitering tracking and higher quality target information than those of previous methods were obtained. The results can provide a reference for research and development and for the operational use of anti-radiation LMs.

It should be noted that we did not discuss trajectory planning control when the marginal information gains decrease in the later phase, when the LM shifts from the cursing stage to tracking and attacking, nor trajectory planning control when target decoys exist. Future research can be carried out in the following areas: (1) reducing computational cost and improving real-time performance by improving the proposed distribution, increasing the efficiency of particle sampling, and introducing box-type and adaptive particle filtering; (2) based on information entropy, studying the strategy of autonomous switching from loitering to tracking and attacking when information return declines; (3) studying the anti-decoy loitering method with decoy information sensing incorporated in the presence of decoy radar; (4) testing the proposed method in a high fetidity simulation environment or real-world experiment.

Author Contributions

Conceptualization, L.B. and H.L. (Hongchuan Luo); methodology, L.B.; software, L.B.; validation, L.B., H.L. (Hongchuan Luo) and H.L. (Haifeng Ling); investigation, L.B.; resources, H.L. (Hongchuan Luo); data curation, H.L. (Hongchuan Luo); writing—original draft preparation, L.B.; visualization, H.L. (Hongchuan Luo); supervision, L.B.; project administration, L.B.; funding acquisition, H.L. (Haifeng Ling). All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Defence Science and Technology Innovation Special Zone of the Military Science and Technology Commission of China (17-163-11-ZT-001-XXX-XX); Military Graduate Student Fund (KYGYJAJY20XX).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

John, C. Harop Loitering Munitions UCAV System—Airforce Technology. 1 September 2018. Available online: https://www.airforce-technology.com/projects/haroploiteringmuniti/ (accessed on 30 October 2020).
Gettinger, D.; Michel, A.H. Loitering Munitions. Cent. Study Drone 2017, 2, 1–3. [Google Scholar]
IAI Group, A. Fire and forget: Harpy is an Autonomous Weapon for All Weather. 13 August 2017. Available online: https://www.iai.co.il/p/harpy (accessed on 17 August 2020).
Liu, P.; Sheng, H. The Route Planning of Anti-radiation UAV Search Based on Improved Artificial Potential Field Method. J. Proj. Rocket. Missiles Guid. 2020, 1, 1–5. [Google Scholar]
Liu, P.; Sheng, H. A Search Route Optimization of Anti-Radiation UAV in SEAD Mission. J. Air Force Eng. Univ. (Nat. Sci. Ed.) 2020, 21, 87–91. [Google Scholar]
Zhu, H.; Wang, Y.; Du, C.; Zhang, Q.; Wang, W. A novel odor source localization system based on particle filtering and information entropy. Robot. Auton. Syst. 2020, 132, 103619. [Google Scholar] [CrossRef]
Park, M.; Oh, H. Cooperative information-driven source search and estimation for multiple agents. Inf. Fusion 2020, 54, 72–84. [Google Scholar] [CrossRef]
Schlotfeldt, B.; Thakur, D.; Atanasov, N.; Kumar, V.; Pappas, G.J. Anytime Planning for Decentralized Multirobot Active Information Gathering. IEEE Robot. Autom. Lett. 2018, 3, 1025–1032. [Google Scholar] [CrossRef]
Aoki, E.H.; Bagchi, A.; Mandal, P.; Boers, Y. A theoretical look at information-driven sensor management criteria. In Proceedings of the 14th International Conference on Information Fusion, Chicago, IL, USA, 5–8 July 2011. [Google Scholar]
Jin, T.; Chen, Y.; Shen, L. Cooperative Search Algorithm for Multi-UAVs in Uncertainty Environment. J. Electron. Inf. Technol. 2007, 29, 2325–2328. [Google Scholar]
Han, X.; Sheng, H.; Chen, M. Anti-radiation UAV Cluster Cooperative Search Based on D-S Evidence Theory. J. Detect. Control. 2018, 40, 80–87. [Google Scholar]
Liu, X. Electromagnetic Echo Environment Modeling, Platform Location and Weak Target Detection in Complex Random Scattering Environment. Master’s Thesis, Shanghai Jiao Tong University, Shanghai, China, 2019. [Google Scholar]
Mahler, R.P.S. Statistical Multisource-Multitarget Information Fusion; Artech House: Norwood, MA, USA, 2007. [Google Scholar]
Li, L.; Wang, X.; Gui, J.; Zhou, Z. Processing method of constant false alarm rate technology. Laser J. 2018, 39, 8–13. [Google Scholar]
Middleton, D. Non-Gaussian noise models in signal processing for telecommunications: New methods an results for class A and class B noise models. IEEE Trans. Inf. Theory 1999, 45, 1129–1149. [Google Scholar] [CrossRef]
Liu, M.; Zhang, J.; Tang, J.; Jiang, F.; Liu, P.; Gong, F.; Zhao, N. 2-D DOA Robust Estimation of Echo Signals Based on Multiple Satellites Passive Radar in the Presence of Alpha Stable Distribution Noise. IEEE Access 2019, 7, 16032–16042. [Google Scholar] [CrossRef]
Arulampalam, M.S.; Maskell, S.; Gordon, N.; Clapp, T. A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans. Signal Process. 2002, 50, 174–188. [Google Scholar] [CrossRef] [Green Version]
Banerjee, A.; Burlina, P. Efficient Particle Filtering via Sparse Kernel Density Estimation. IEEE Trans. Image Process. 2010, 19, 2480–2490. [Google Scholar] [CrossRef] [PubMed]
Cover, T.M.; Thomas, J.A. Elements of Information Theory; Wiley: Hoboken, NJ, USA, 1991. [Google Scholar]
Ajgl, J.; Šimandl, M. Differential entropy estimation by particles. IFAC Proc. Vol. 2011, 44, 11991–11996. [Google Scholar] [CrossRef]
Ryan, A.; Hedrick, J.K. Particle filter based information-theoretic active sensing. Robot. Auton. Syst. 2010, 58, 574–584. [Google Scholar] [CrossRef]
Skoglar, P.; Orguner, U.; Gustafsson, F. On information measures based on particle mixture for optimal bearings-only tracking. In Proceedings of the 2009 IEEE Aerospace Conference, Big Sky, MT, USA, 7–14 March 2009; pp. 1–14. [Google Scholar] [CrossRef] [Green Version]
Boers, Y.; Driessen, H.; Bagchi, A.; Mandal, P. Particle filter based entropy. In Proceedings of the 2010 13th International Conference on Information Fusion, Edinburgh, Scotland, 26–29 July 2010; pp. 1–8. [Google Scholar] [CrossRef] [Green Version]
Jarin-Lipschitz, L.; Paulos, J.; Bjorkman, R.; Kumar, V. Dispersion-Minimizing Motion Primitives for Search-Based Motion Planning. arXiv 2021, arXiv:robotics/2103.14603. Available online: http://arxiv.org/abs/2103.14603 (accessed on 10 September 2021).
Eaton, J.W.; Bateman, D.; Hauberg, S.; Wehbring, R. GNU Octave Version 5.2.0 Manual: A High-Level Interactive Language for Numerical Computations. 2019. Available online: https://www.gnu.org/software/octave/doc/v5.2.0/ (accessed on 2 December 2020).

Figure 1. Schematic diagram of anti-radiation LM battle stages.

Figure 2. Trajectory planning control to minimize the uncertainty of target information.

Figure 3. Changes of target conditional entropies (a) and estimation accuracy (b) with number of simulation steps in 100 simulations.

Figure 4. Total control inputs of statistic of 100 runs.

Figure 5. Trajectories in simulation. (a–d) shows one of the trajectory generated in the 100 simulations by the random strategy method, coverage of field method, iterative EKF method and the proposed method respectively. The second simulation experiment is conducted in a variant number of simulation steps, and the simulation stops once the desired RMSE is reached. This experiment examines the time required for different methods to achieve the desired RMSE for anti-radiation LM tracking and attacking under a combat scenario.

Table 1. Means and variances of target conditional entropy and RMSE at completion of 100 simulations.

	Mean of Conditional Entropy (Nats)	Variance of Conditional Entropy	Mean of RMSE (m)
Random Strategy	10.18	0.724	301.80
Coverage of Field	10.03	1.059	256.10
Iterative EKF	9.85	0.983	176.05
Proposed Method	9.69	0.481	74.29
Minimum numbers are underlined

Table 2. Means and variances of time to reach the desired RMSE of 100 runs.

	Desired RMSE 400 (m)	Desired RMSE 250 (m)	Desired RMSE 100 (m)
Random Strategy	mean: 19.34 std: 5.92	mean: 406.72 std: 59.43	>2000
Coverage of Field	mean: 18.63 std: 5.74	mean: 112.32 std: 39.52	>2000
Iterative EKF	mean: 16.77 std: 5.68	mean: 71.26 std: 37.23	mean: 276.15 std: 79.24
Proposed Method	mean: 14.81 std: 5.42	mean: 32.41 std: 22.32	mean: 87.31 std: 44.39
Minimum numbers are underlined

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bai, L.; Luo, H.; Ling, H. Autonomous Trajectory Planning and Control of Anti-Radiation Loitering Munitions under Uncertain Conditions. Electronics 2021, 10, 2399. https://doi.org/10.3390/electronics10192399

AMA Style

Bai L, Luo H, Ling H. Autonomous Trajectory Planning and Control of Anti-Radiation Loitering Munitions under Uncertain Conditions. Electronics. 2021; 10(19):2399. https://doi.org/10.3390/electronics10192399

Chicago/Turabian Style

Bai, Linyuan, Hongchuan Luo, and Haifeng Ling. 2021. "Autonomous Trajectory Planning and Control of Anti-Radiation Loitering Munitions under Uncertain Conditions" Electronics 10, no. 19: 2399. https://doi.org/10.3390/electronics10192399

APA Style

Bai, L., Luo, H., & Ling, H. (2021). Autonomous Trajectory Planning and Control of Anti-Radiation Loitering Munitions under Uncertain Conditions. Electronics, 10(19), 2399. https://doi.org/10.3390/electronics10192399

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Autonomous Trajectory Planning and Control of Anti-Radiation Loitering Munitions under Uncertain Conditions

Abstract

1. Introduction

2. Problem Description and Modeling

2.1. Mission Scenario

2.2. Anti-Radiation LM Trajectory Planning Model

2.3. Radar Target Modeling

2.4. Seeker Sensing Modeling

3. Inferencing Target Radar Position

4. Loitering Control by Minimizing Target Uncertainty

4.1. Measure of Target Information Uncertainty

4.2. Conditional Entropy Calculation Based on Particle Position Weight

4.3. Model Predictive Optimal Control

5. Experimental Verification by Simulation

5.1. Experimental Conditions

5.2. Benchmark

5.2.1. Stochastic Decision-Making

5.2.2. Re-Planning Based on Field-of-View Coverage

5.2.3. Method Assuming Gaussian Noise

5.3. Simulation Results and Analysis

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI