Based on the previous sections, first of all, this section integrates the maneuver unit prediction model and the maneuver trajectory prediction model through a hierarchical strategy to obtain a hierarchical tactical maneuver trajectory prediction model. Second, the above-mentioned maneuver trajectory prediction model is embedded into the UAV autonomous maneuver decision-making model to verify the effectiveness and superiority of the proposed method.
  5.1. Tactical Maneuver Trajectory Prediction Process Based on Hierarchical Strategy
The hierarchical tactical maneuver trajectory prediction process mainly involves the model training process and the online maneuver trajectory prediction process. 
Figure 13 shows the above process. The black arrows represent the training process of the maneuver unit prediction model and the maneuver trajectory prediction model, and the red arrows represent the online maneuver trajectory prediction process. The specific steps are as follows:
Step 1: Utilize the simulation data of air-to-air confrontation to extract situation characteristic parameters and identify tactical maneuver units. Subsequently, train the tactical maneuver unit prediction model based on DeepESN-AE-AM and construct the maneuver unit prediction layer. The trajectory characteristic parameters of different maneuver units are extracted by using the maneuver trajectory simulation data. On this basis, 21 maneuver trajectory prediction models based on TSO-GRU-AM are trained to construct the maneuver trajectory prediction layer.
Step 2: Obtain air-to-air confrontation information through airborne sensors. Extract the characteristic parameters of the situation at historical moments and the characteristic parameters of the historical trajectory of the target aircraft and identify the sequence of tactical maneuver units at past moments.
Step 3: Input the time series of situation characteristic parameters and the maneuver unit sequence into the DeepESN-AE-AM model to obtain the tactical maneuver unit sequence at future moments;
Step 4: Determine whether the maneuver unit at the previous moment is the same as that at the most recent moment in the future. If they are the same, the maneuver trajectory prediction model is adopted to predict the trajectory characteristic parameters of the target aircraft at the next moment. Otherwise, the trajectory characteristic parameters output by the maneuver unit prediction model are adopted as the trajectory characteristic parameters of the predicted target aircraft (directly output the trajectory characteristic parameters at the next moment);
Step 5: When the maneuver units at the previous and subsequent moments are the same, select the maneuver trajectory prediction model of the corresponding maneuver unit, and input the time series of historical trajectory characteristic parameters of the target aircraft into the TSO-GRU-AM model to obtain the maneuver trajectory characteristic parameters at future moments.
Step 6: Calculate the trajectory status information and output the predicted trajectory of the target aircraft.
In the fusion module shown in 
Figure 13, in order to determine whether the target’s tactical maneuver intent has changed, the maneuver unit categories of the target at the current time and future time are compared, and a reasonable maneuver trajectory prediction model is selected based on the comparison results. Based on the output results of the maneuver trajectory prediction model, the predicted maneuver trajectory feature parameters related to the target state calculation are selected and substituted into Equation (1) for calculation and solution, ultimately obtaining the predicted trajectory state of the target.
  5.2. Tactical Maneuver Trajectory Prediction Simulation Based on Hierarchical Strategy
To further verify the effectiveness and robustness of the tactical maneuver trajectory prediction method based on the hierarchical strategy, in this section, a section of air-to-air confrontation trajectory is selected from the air-to-air confrontation simulation database for prediction. The above-mentioned selected confrontation scenario is called case 1. The initial situations of both sides are as follows: The height of the red side is 8 km, and that of the blue side is 12 km. The speed of both sides is 250 m/s. The trajectory inclination angle is 0 degrees. Both sides are in a head-on position. The adopted maneuver unit prediction algorithms are SVR, BP, ESN, DeepESN, DeepESN-AE, DeepESN-AM, and DeepESN-AE-AM, and their parameter settings refer to 
Table 2. The adopted maneuver trajectory prediction algorithms are Elman, LSTM, GRU, TSO-LSTM, TSO-GRU, and TSO-GRU-AM, and their parameter settings refer to 
Table 4. The specific simulation experiment results are as follows:
Figure 14 presents the prediction statistical results of the combined model of different maneuver unit prediction algorithms and maneuver trajectory prediction algorithms in the case 1 scenario. It can be seen from the RMSE error values in 
Figure 14a that the prediction error of DeepESN-AE-AM is the smallest compared with other maneuver unit prediction methods, and the prediction error of TSO-GRU-AM is the smallest compared with other maneuver trajectory prediction methods. As can be seen from the single-step prediction time in 
Figure 14b, DeepESN-AE-AM has the longest prediction time compared to other maneuver unit prediction algorithms, and Elman has the longest time compared to other maneuver trajectory prediction algorithms. In addition, the prediction time of TSO-GRU-AM is approximately the same as that of LSTM, GRU, TSO-LSTM, and TSO-GRU. Based on the above analysis, on the premise of meeting the real-time requirements of maneuver decision-making, the prediction error of the combination of DeepESN-AE-AM and TSO-GRU-AM is the smallest.
 To validate the superiority of the method proposed in this paper, a model combining DeepESN-AE-AM and TSO-LSTM is selected as the comparison method, and the predicted trajectories of the comparison algorithms are compared with the actual trajectories in the demonstrated motion trajectories. The first three figures in 
Figure 15 show the prediction results of the hierarchical tactical maneuver trajectory prediction model proposed in this paper for case 1. The last figure in 
Figure 15 shows the prediction results of the proposed hierarchical tactical maneuver trajectory prediction model and the combined model of DeepESN-AE-AM and TSO-LSTM in case 1. The predicted enemy trajectory-1 in 
Figure 15d represents the prediction results of the proposed algorithm, while the predicted enemy trajectory-2 represents the prediction results of the comparison algorithm. 
Figure 15a gives the target tactical maneuver unit predicted by DeepESN-AE-AM, which is consistent with the trajectory maneuver state in 
Figure 15d. As shown in 
Figure 15b, due to the input step size limitation of the prediction model, the first 5 s primarily utilize the trajectory recursion method to collect target state information. During this timeframe, the hierarchical tactical maneuver trajectory prediction method is not employed to predict the target’s maneuver trajectory. Therefore, the first 5 s of 
Figure 15a–c do not display target trajectory prediction information. The trajectory prediction method refers to modeling the target’s motion model based on prior knowledge and assuming that the target exhibits inertial motion characteristics in the short-term domain. This allows the target’s state variables at the next time step to be calculated using the maneuver control variables at the current time step. Additionally, 
Figure 15b shows the comparison curves between the target’s tactical maneuver trajectory prediction features and the target’s actual maneuver trajectory features, including the comparison curves between the actual trajectory deflection angle change and the predicted trajectory deflection angle change, the actual trajectory inclination angle change and the predicted trajectory inclination angle change, the actual altitude change and the predicted altitude change, and the actual velocity change and the predicted velocity change. The maximum deviation values for the above comparison curves are 5°, 5°, 1 m, and 1 m/s, respectively. At 40 s, due to a change in the target’s tactical maneuver unit category, this change indicates that the target’s maneuver trajectory change pattern has altered, and the aforementioned change in maneuver trajectory pattern significantly affects the change in trajectory deflection angle. Therefore, the deviation between the actual trajectory deflection angle change and the predicted trajectory deflection angle change reaches its maximum value. Since the deviation between the other actual maneuver trajectory features and the predicted maneuver trajectory features in 
Figure 15b is small, the change patterns of the other comparison curves in the figure are similar, meaning that the actual maneuver trajectory feature change curve and the predicted maneuver trajectory feature change curve are basically overlapping. From 
Figure 15c, it can be seen that the maneuver trajectory positional state x, y, and z axis errors do not exceed 6 m, 6 m, and 10 m, respectively. As can be seen from 
Figure 15d, the predicted trajectory of the algorithm proposed in this paper is close to the actual trajectory of the enemy, while the error between the predicted trajectory of the comparison algorithm and the actual trajectory of the enemy is greater. In summary, the trajectory prediction error based on the combination of DeepESN-AE-AM and TSO-GRU-AM is small and meets the accuracy requirements.
  5.3. Simulation Analysis of Air-to-Air Confrontation
To test the online prediction performance of the hierarchical tactical maneuver trajectory prediction model, first of all, the algorithm in literature [
28] is used as the maneuver decision-making method for both sides of the air-to-air confrontation in this section. Second, one side of the confrontation uses the hierarchical trajectory prediction model to obtain the trajectory state of the target aircraft, while the other side does not use the trajectory prediction model. Then, the confrontation simulation experiment is carried out. Finally, the experimental results are statistically analyzed, and a conclusion is drawn.
- (1)
- xperimental setup 
Based on the hierarchical tactical maneuver trajectory prediction model, to further verify the superiority of the algorithm combination proposed in this paper, the red side selects the algorithm proposed in this paper and the combination of DeepESN-AE-AM and TSO-LSTM to predict the maneuver trajectories of the enemy, while the blue side does not use maneuver trajectory prediction methods. Both sides used the algorithm in reference [
28] as the maneuver decision-making method, and the situation advantage functions are the same as those in the above-mentioned references. Count the winning situation, the number of decision-making steps, and the average prediction time for each step. Each adversarial simulation runs independently 20 times, with a simulation step size of 0.1 s. The following three initial scenarios are adopted: In case 1, the red and blue sides fly in the same direction; in case 2, the blue side follows the red side. In case 3, the red and blue sides fly away from each other. The specific initial settings of the air-to-air confrontation scenarios are shown in 
Table 6. The experimental simulation environment is the same as in 
Section 3.3.
- (2)
- Simulation results analysis 
Table 7 gives the statistical results of the confrontation between the two sides under the three initial scenarios. Due to the limited space, this section gives the air-to-air confrontation trajectories and predicted trajectories, tactical maneuver unit prediction results, maneuver trajectory characteristics prediction results, trajectory position state errors, and single-step prediction times for case 1 and case 2, as shown in 
Figure 16 and 
Figure 17. The first three-dimensional trajectory diagram in the above figures shows the predicted trajectories of the algorithm proposed in this paper, the comparison algorithm, the actual trajectory of the enemy aircraft, and the UAV trajectory. The predicted enemy trajectory-1 represents the prediction result of the algorithm proposed in this paper, and the predicted enemy trajectory-2 represents the prediction result of the comparison algorithm. The other figures show the prediction results of the proposed algorithm for the enemy aircraft’s maneuver units and trajectory features, as well as the predicted enemy aircraft position state error and single-step prediction time. From 
Figure 16a, it can be seen that at the beginning, the red side has a height disadvantage, and in order to prevent rushing forward, it adopts the pull-up dive tactic to get the angle advantage and track the blue side from the tailback; then, the blue side pulls up to get the speed advantage through the low-speed yoyo tactic to avoid the opponent’s threat as soon as possible; at last, the red side allows the blue side to rush forward by decelerating the speed and then steadily intercepts the blue side. In addition, comparing the predicted enemy aircraft trajectories in the figure with their actual trajectories shows that the predicted enemy trajectory-1 basically coincides with its actual trajectory, while the predicted enemy trajectory-2 has a large error compared to its actual trajectory. 
Figure 16b gives the prediction results of the tactical maneuver trajectory unit of the target aircraft. According to the prediction results of the maneuver unit, the red side will predict the fixed trajectory characteristics of the blue side, as shown in 
Figure 16c. The trajectory deflection angle change amount, trajectory inclination angle, altitude change amount, and speed change amount errors are not more than 5°, 5°, 10 m, and 5 m/s, respectively. From 
Figure 16d, it can be seen that the maneuver trajectory positional state x, y, and 
z-axis errors are not more than 10 m, 5 m, and 8 m, respectively. The results of single-step prediction time variation are given in 
Figure 16e, and the single-step prediction time satisfies the real-time requirement.
 From 
Figure 17a, it can be seen that in order to obtain a favorable position, first, the red side adopts a dive tactic to obtain a speed advantage and avoid being tracked steadily by the blue side; then the blue side obtains an angular advantage by descending and turning maneuver to avoid the opponent’s threat as soon as possible; finally, the red side aims the nose at the blue side through a turning maneuver and shoots down the blue side. In addition, comparing the predicted enemy aircraft trajectories in the figure with their actual trajectories shows that the predicted enemy trajectory-1 basically coincides with its actual trajectory, while the predicted enemy trajectory-2 has a large error compared to its actual trajectory. 
Figure 17b gives the prediction results of the tactical maneuver trajectory unit of the target aircraft. According to the prediction results of the maneuver unit, the red side will predict the fixed trajectory characteristics of the blue side, as shown in 
Figure 17c. The trajectory deflection angle change amount, trajectory inclination angle, altitude change amount, and speed change amount errors are not more than 6°, 6°, 12 m, and 6 m/s, respectively. From 
Figure 17d, it can be seen that the maneuver trajectory positional state 
x, 
y, and 
z-axis errors are not more than 9 m, 9 m, and 12 m, respectively. The results of single-step prediction time variation are given in 
Figure 17e, and the single-step prediction time satisfies the real-time requirement.
As can be seen from 
Table 7, the algorithms with the smallest number of decision steps in the three scenarios are Elman, LSTM, and TSO-GRU-AM; the algorithms with the smallest amount of time spent on single-step prediction in the three scenarios are GRU, TSO-GRU, and TSO-GRU. The highest number of wins in scenario 1 are GRU, TSO-LSTM, TSO-GRU, and TSO-GRU-AM; in scenario 2, they are Elman, LSTM, GRU, TSO-GRU, and TSO-GRU-AM; in scenario 3, it is TSO-GRU-AM.
In Scenario 1, although Elman has the smallest average decision steps, its standard deviation is relatively large. Moreover, this method requires the most average single-step prediction time and has the lowest average winning rate. The average single-step prediction time required by GRU is the least, and its winning rate is the same as that of TSO-GRU-AM. However, the standard deviation of the average decision step of this method is the largest, which is prone to cause significant fluctuations in the solution process of the optimized maneuver strategy. The standard deviation of the average single-step prediction time of TSO-GRU is the smallest, but the average number of decision steps of this method is the largest. The winning rate of TSO-LSTM is the same as that of TSO-GRU-AM, but the standard deviation of its average decision steps is larger. Although the average decision step of TSO-GRU-AM is not the smallest, the standard deviation of its average decision step is the smallest, indicating that the solution process of its optimized maneuvering strategy is stable, and the average single-step prediction time of this method meets the single-step decision-making time requirements of the simulation system. The comparison and analysis of the results of the performance indicators of each algorithm in the above scenario 1 show that, on the premise of meeting the real-time requirements of the simulation system’s decision-making, TSO-GRU-AM can stably help the UAV obtain the optimal maneuvering strategy. Driven by the above maneuvering strategy, the red side UAV achieves all victories in the 20 air-to-air confrontation simulation experiments.
In Scenario 2, although the average decision steps of LSTM are the smallest, its standard deviation is relatively large, and the standard deviation of the average single-step prediction time required by this method is also relatively large. The average single-step prediction time required by TSO-GRU is the least, but its standard deviation is large, and the average number of decision steps of this method is relatively large. The standard deviation of the average single-step prediction time of TSO-LSTM is the smallest, but the average winning rate of this method is also the smallest. Elman has a relatively high winning rate, but it requires the most average single-step prediction time. The average number of decision steps of GRU is slightly more than that of TSO-GRU-AM. The comparison and analysis of the results of the performance indicators of each algorithm in the above scenario 2 show that, on the premise of meeting the real-time requirements of the simulation system’s decision-making, TSO-GRU-AM helps the red side UAV achieve a higher winning rate.
In Scenario 3, the average decision step of TSO-GRU-AM is the smallest, and its standard deviation is small. With the assistance of this method, the red side UAV can obtain the optimal maneuver strategy relatively stably with the minimum step. Although the standard deviation of the average decision steps of LSTM is the smallest, its average decision steps are the largest. This is not conducive to the red side UAV quickly obtaining the optimal strategy in the intense air-to-air confrontation scenario, resulting in a poor winning rate of this method. The average single-step prediction time and its standard deviation of TSO-GRU are both the smallest. However, the average number of decision steps of this method is relatively large, which makes the win rate of the red side UAV based on this method not good. The comparison and analysis of the results of the performance indicators of each algorithm in the above scenario 3 show that, on the premise of meeting the real-time requirements of the simulation system’s decision-making, TSO-GRU-AM helps the red side quickly obtain the optimal maneuvering strategy in the air-to-air confrontation. This is beneficial for the red side to implement faster attacks and avoid maneuvers. Eventually, the red side’s UAV also achieved the highest winning rate.
In conclusion, combining the target tactical maneuver intention with the maneuver trajectory prediction model helps to improve the prediction speed and accuracy of TSO-GRU-AM, thereby assisting UAVs in quickly occupying favorable offensive positions or implementing evasive maneuvers in air-to-air confrontation scenarios. Because the autonomous maneuver decision-making model of the red side UAV uses the target prediction state variables obtained by the above-mentioned hierarchical tactical maneuver trajectory prediction method, it can, on the one hand, identify the tactical maneuver intention of the target, and on the other hand, accurately assess the battlefield situation and select the optimized maneuver strategy. Eventually, it has stably achieved a relatively high winning rate. In conclusion, the hierarchical tactical maneuver trajectory prediction method proposed in this paper is effective and has better performance than other comparison methods.