1. Introduction
Missiles are modern weapons guided by guidance systems that actively fly and ultimately destroy the targets according to the control systems. Compared to the traditional weapons, missiles have guidance systems that can autonomously capture targets, measure relevant information, plan flight paths, and complete predetermined strike missions. Generating the missile’s flight path is the primary role of the missile guidance system. The missile guidance system comprises a detection equipment and a guidance command-forming device [
1]. The detection equipment measures the relative position and motion information or the deviation between the flight trajectory and the predetermined trajectory of the missile. The command-forming device calculates and generates guidance commands according to the deviation and measured information to determine the trajectory required for the missile to attack the target successfully [
2]. Obviously, missile guidance law plays an essential role in missile guidance and control.
There have been many research results on missile guidance laws so far. Since the 1950s, optimal guidance theory has been extensively studied with the development of aerospace technology. The main problem of optimal guidance research is to design the optimal guidance law, which minimizes the given performance index and satisfies specific constraints according to the system model and the guidance objective. A penalty term was added to the performance index in [
3] to constrain the change in guidance command and to avoid the acceleration saturation. The concept of ZEM was first proposed by Newman [
4], which was used to characterize the miss between the missile and the target during guidance without control. In [
5], a weighted ZEM guidance law with direct applicability to various operational objectives was proposed. For the long-range interception problem, a new ZEM prediction method was derived in [
6] by considering gravitational acceleration at different stages, using modern control theory and Newton’s binomial theorem. In [
7], a data-driven online estimation algorithm based on ZEM and remaining flight time was proposed to estimate the ZEM effectively and accurately. In [
8], a ZEM-based guidance was employed in the highly nonlinear orbital transfer and raising problems, indicating that the designed ZEM guidance was more suitable for dealing with uncertainties and perturbations. For Mars precision landing, a new two-phase ZEM feedback guidance strategy was proposed in [
9], which directly accommodated a variety of constraints and requirements, such as retrorocket thrust magnitude limits, obstacles over the surface of Mars, and abnormal initial conditions.
In the context of optimal missile guidance, the performance function is generally taken as a weighted function of miss, guidance energy, and flight time. However, there is no analytical solution to the corresponding optimal guidance problem for general guidance systems. Namely, the optimal guidance problem can only be approximately solved by numerical methods. In order to avoid solving the Hamilton–Jacobi–Bellman (HJB) equation directly, the inverse optimal guidance method has been proposed. The so-called inverse optimal guidance determines the performance function reversely, which makes the specific form of guidance law optimal. The inversely determined performance function generally has practical physical meaning. In [
10], an inverse optimal control method was studied for the spacecraft rotational motion. The technique solves an HJB equation and leads to the obtaining of a particular form for the stable control law and for the significant performance index. For the helicopter longitudinal state’s stabilization problem with external disturbances, an inverse optimal control law based on the nonlinear DO was proposed in [
11], and the stability of the closed-loop system was proved, which avoided the difficulty of solving the HJB equation directly. By introducing inverse optimal control into the design of guidance law, the efficiency of solving guidance law can be improved.
The concept of ZEM was introduced in the field of guidance. The ZEM-based optimal guidance technology was widely used in guidance law design. An optimal quadratic performance index with an exponential time-varying gain was designed in [
12]. The convergence speed of ZEM and the robustness of time-varying gain optimal guidance law were improved. In [
13], an optimal interception angle guidance law using gravity for extra-atmospheric interception in the case of ballistic terminal processes was proposed. A finite-time optimal regulation problem was established by taking the instantaneous ZEM and cut-off angle error, as the system states. Inspired by the works above, it is promising to introduce inverse optimal control and ZEM into the missile guidance law design.
For the practical engineering application of missiles, it is not enough considering only the guidance under ideal conditions. Guidance is affected by various factors such as model uncertainties, target maneuvers, crosswinds, etc. These factors affect the guidance accuracy and the guidance efficiency to a certain extent. Therefore, researchers have developed disturbance estimate techniques based on feedforward compensation to estimate the disturbances that can not be measured directly. For a class of uncertain systems with unknown frequency sinusoidal disturbances, a disturbance suppression method based on frequency factor observer and full-dimensional state observer was proposed in [
14]. A new finite time extended state observer was proposed in [
15] for nonlinear systems affected by external disturbances, one has also proved that all error signals can converge to zero in finite time. A control scheme based on DO and backstepping control was proposed in [
16] for strictly feedback nonlinear systems to suppress disturbances. Considering a nonlinear uncertain system affected by external disturbances, a controller based on adaptive fuzzy DO was designed in [
17] to improve the robustness of the system. Therefore, it is necessary to consider the influence of disturbance in the design of the guidance law.
Some scholars have applied the disturbance estimate technology to the missile guidance field and proposed a series of effective guidance laws. Dwivedi [
18] proposed a method to accurately estimate the states and ZEM of non-maneuvering or maneuvering targets, which can ensure fewer misses and lower guidance energy loss. Liu [
19] designed a composite guidance law based on DO for the missile guidance system with input saturation. The improved saturation function was used to deal with saturation, while the DO was used to estimate unknown target acceleration and model uncertainty. For a strong nonlinear missile system with external disturbance, Yang [
20] developed an anti-disturbance composite guidance controller based on DO in three channels of missile pitch, roll, and yaw, respectively. For the three-dimensional large maneuvering target interception problem, the unknown target acceleration was regarded as an external disturbance in [
21], and a DO was designed to estimate it. Combined with the sliding mode guidance method, it can achieve accurate target interception. Since the DO technology is efficient, it is instrumental to reducing the adverse effect of target acceleration by designing the DO for the guidance method proposed in this paper.
Motivated by the disturbance estimate technique and inverse optimal guidance, a backstepping method with DO and inverse optimal guidance is designed in this article. The DO is designed to estimate the acceleration of the maneuvering target. The convergence of ZEM and its integral is confirmed by inverse optimal guidance and Lyapunov stability analysis.
The main contribution of this paper is to design a new robust inverse optimal guidance law based on disturbance observer and ZEM. The simulation experiment is designed to compare it with two guidance laws, and it is proved that the guidance law designed in this paper has advantages in guidance accuracy and guidance time. By changing the system parameters, changing the initial conditions, and adding feedback noise, the robustness of the guidance law designed in this paper is further demonstrated.
The following structure of this paper is divided into five sections.
Section 2 introduces the two-dimensional guidance model, linearizes the model, and gives the guidance target.
Section 3 treats an unknown target maneuver as an external disturbance and designs a DO for obtaining an accurate target maneuver estimate. In
Section 4, the ZEM-based inverse optimal guidance law is derived in detail.
Section 5 designs simulations to verify the effectiveness of the guidance method.
Section 6 summarizes the full paper.
2. Problem Statement and Modeling
The relative kinematics model of the missile-target interception system is introduced in this section, which forms the foundation for designing inverse optimal ZEM guidance law. To simplify the analysis and derivation in the below, the following assumptions are given [
22]:
Assumption 1. The missile and the target are both assumed as ideal point-mass;
Assumption 2. The guidance process takes place in a two-dimensional plane;
Assumption 3. The flight speed of the missile and the target remains unchanged;
Assumption 4. Both the target acceleration and the derivative of the target acceleration are bounded.
Figure 1 demonstrates the relative motion schematic diagram for the missile-target interception system in a two-dimensional plane [
23];
M denotes the missile and
T denotes the target,
denotes the initial inertial frame,
stands for the line-of-sight (LOS) angle,
represents the missile’s flight path angle,
represents the target’s flight path angle,
r stands for the relative distance between the missile and the target,
stands for the normal acceleration of the missile and
stands for the normal acceleration of the target,
represents the missile’s flight speed, while
represents the target’s flight speed. According to the principle of relative kinematics, the missile-target interception system relative kinematics equation can be obtained as [
24].
In terms of the principle of kinematics, the kinematic equation of the missile is written as [
1]
where
denotes the missile’s position coordinate in the initial inertial frame
.
According to the principle of kinematics, the kinematic equation of the target is obtained as [
1]
where
represents the target’s position coordinate in the initial inertial frame
.
To obtain the ZEM in real-time, the reference inertial frame is obtained by rotating the initial inertial frame by a constant angle counterclockwise. During the actual guidance process, the change in the LOS angle is small. Therefore, is generally selected as the initial value of LOS angle such that .
The relative kinematic equation can be linearized in the
direction in the reference inertial frame as [
22]
where
y stands for the component of relative distance
r in the
direction,
v denotes the component of closing velocity
in the
direction,
and
stand for the components of the normal acceleration of the missile and of the target in the direction perpendicular to the LOS, respectively, being defined as
Remark 1. It can be demonstrated that (5) is unavailable when . Hence, the feasible region S is given as .
We define
z as the ZEM of the linearized model (
4). According to [
5], the ZEM of system (
4) can be expressed as
where
stands for the remaining guidance time and
represents the guidance end time. More generally, we call
the time-to-go.
Since the speeds of the missile and of the target remain unchanged, the remaining guidance time can be approximately determined as [
24]
Remark 2. In the actual guidance process, if the relative distance r is less than a specific constant value (), then the guidance process can be considered to be over [1]. Therefore, the time-to-go has a lower bound, namely, there exists a positive constant such that . Since ZEM is a predictor, ZEM guidance has the property of prediction. The ZEM of the guidance system is the difference between the predicted longitudinal distance and the actual longitudinal distance produced by the current longitudinal velocity during the current remaining time-to-go. The purpose of ZEM guidance is to make z approaches zero, namely, to make the prediction difference approaches zero, which ensures the effectiveness of the guidance. A ZEM-based feedback guidance command can be designed according to the property of ZEM. The convergence of ZEM can be guaranteed, thereby ensuring the accuracy and the effectiveness of the ZEM guidance.
Remark 3. Regarding the effectiveness of ZEM guidance, the following analysis is given. Due to the small change in the LOS angle, the LOS angle in Figure 1 can be linearized as [22] Considering (4) and (7), the derivative of α can be expressed as Invoking (9) and (6), we can obtain that In the guidance phase, we hope to make z approach zero through the guidance method, namely, . Thus, in the guidance stage, the ZEM guidance is similar to the parallel approach guidance.
Considering (
4), (
6),
and taking the derivative of
z with respect to time, one yields
In the guidance process, it is hoped that ZEM should quickly converge to the neighborhood of zero within the time-to-go so that the relative distance
r converges to zero, ensuring the effectiveness of ZEM guidance. Equation (
11) indicates that the unknown acceleration of the target will affect the dynamics of ZEM, thereby affecting the convergence of ZEM.
To enhance the robustness of the ZEM-based inverse optimal guidance law, referring to the idea of PI controller, the integral of ZEM is introduced as a new state. We define the state variables of the augmented system as [
25]
We define the disturbance
, the composite guidance law
, and we derive the state variables
and
of the augmented system. Then the dynamics of the augmented system can be written as
The idea of this article is to regard the target maneuver as an external disturbance. A disturbance observer is designed to estimate the unknown target acceleration. Then, the estimated values are feedforwarded into the guidance channel to suppress the adverse effects of the unknown target acceleration on the guidance system. On this basis, the integral of the ZEM is regarded as a new system state of augmented system. A ZEM-based inverse optimal robust guidance law based on the augmented system is designed. The guidance accuracy is guaranteed, and the system robustness is improved.
In order to design the composite ZEM-based inverse optimal guidance law, firstly, two lemmas are given.
Lemma 1 (See the work in [
10])
. Consider a nonlinear affine in the control system aswhere contains the system states, contains the system inputs, is a smooth nonlinear vector-valued function with , and is a nonlinear matrix-valued function. Then, the specific form of state feedback control law iswhere is optimal with respect to the following performance index functionwhere is a constant parameter to be designed, is a positive definite matrix, represents a positive definite radially unbounded Lyapunov function of the nonlinear affine system (14), while is given by Lemma 2. If there exists a continuous and positive definite Lyapunov function such that [26]for a control system with bounded initial conditions withwhere are class K functions, and and are positive constants, then the solution of state is uniformly bounded. 3. Design of the Disturbance Observer
For the guidance augmented system (
13) with unknown target maneuvering, assuming that the states of the system can be measured, the disturbance observer is designed as [
16]
where
denotes the internal state of the disturbance observer,
is a parameter to be designed, and
is the observed value of
d.
We define the estimated error
of the disturbance
d as
Considering (
13) and (
20), the derivative of
can be obtained as
The derivative of
d can be written as
Because , and are all bounded, the derivative of d is also bounded. Namely, there exists a constant such that .
Defining the Lyapunov function
and taking the derivative of
, we have
It can be concluded from (
24) and Lemma 2 that the observed error
is uniformly bounded.
We define the estimation error associated with the target’s acceleration as
; in terms of the definition of disturbance
d, the following relationships hold
Since the time-to-go has a lower bound and the estimation error of the disturbance observer is uniformly bounded, the estimation error associated with the target’s acceleration is also uniformly bounded.
4. Design of the Inverse Optimal Guidance
In terms of Lemma 1, it is essential to know the Lyapunov function of the system (
13) in order to design an inverse optimal guidance law based on disturbance observer and ZEM. Considering the particularity of the system (
13), its Lyapunov function is constructed by using the backstepping guidance method given in [
27].
Step 1. To design the inverse optimal guidance law based on DO and ZEM, we define
where
and
is a virtual control law that should be designed.
Taking the derivative of
and invoking (
13), we can obtain
Considering (
27), Equation (
28) can be written as
The virtual control law
is designed as
where
is a parameter to be computed.
By substituting (
30) into (
29),
can be obtained as
Selecting the Lyapunov function of the system (
31) as
, differentiating
with respect to time, one yields
Step 2. Considering (
27) and (
30), the derivative of
is written as
To ensure the stability of the error system, the composite guidance law
is designed as
where
denotes the inverse optimal guidance law that should be designed, while
represents the feedforward guidance law.
Substituting (
34) into (
33), the derivative of
can be obtained as
Obviously, the feedforward guidance law term
can be designed as
Invoking (
36) and (
21), Equation (
35) becomes
The composite guidance law
is to be designed to make the error systems (
37) and (
31) boundedly stable; for this purpose, the candidate Lyapunov function
V is chosen as
Taking the derivative of
V with respect to time,
can be written as
The specific form of the optimal guidance law (
15) is designed according to Lemma 1. Considering that the composite guidance law is designed into two parts, the derivative of
V can also be divided into three parts, which can be written as
Considering (
37) and (
39), the coefficient term of the inverse optimal guidance law
can be expressed as
In terms of the specific form of (
15),
can be designed as
where
is a design function.
Substituting (
22), (
31), and (
37) into (
39),
can be computed as
The design, the analysis and the stability proof of the inverse optimal guidance law based on a disturbance observer and ZEM can be summarized as the following theorem.
Theorem 1. Considering that the guidance system (13) under external disturbance satisfies Assumptions 1–4, the disturbance observer is designed as (20), the composite guidance law is designed as (34), the feedforward guidance law is designed as (36), and the Lyapunov function is choosen as (38). We choose , and the inverse optimal guidance law is designed aswhere will be designed by minimizing the following performance functionalwhere Considering (17), we have Then, the composite guidance lawcan make the system (13) boundedly stable. Proof of Theorem 1. Bringing the inverse optimal guidance law (
44) into (
43), one yields
where
According to Lemma 2, the equilibrium point is boundedly stable. □