1. Introduction
In recent years, with the rapid advancement of science and technology, unmanned aerial vehicles (UAVs) have progressively matured and emerged as a focal point in the new wave of global technological and industrial revolution. Currently, there exists a diverse range of UAV types available in the market including fixed-wing, multi-rotor, and hybrid models, among others. As a reusable aircraft operated by autonomous control or radio remote control [
1], UAVs are widely used in environmental monitoring, aerial photography, disaster detection, power inspection, express transportation, and other fields due to their advantages of small size, lightweight, strong mobility, low cost, and easy use [
2,
3]. In the field of agricultural and forestry plant protection, UAVs can be utilized for plant health detection and pesticide spraying. Additionally, UAV aerial photography has gained significant popularity among photography enthusiasts in the realm of aerial photography. Moreover, they are employed for power line inspection and substation equipment monitoring. UAV is a complex system that integrates multiple disciplines such as electronics, control, sensors, and information, and it is usually designed with low or no redundancy [
4,
5]. Since there is no pilot on-site operation in the process of carrying out the task, the UAV does not have the real-time observation and response-ability of the pilot, and it will not be able to take emergency measures in time in case of failure, which leads to a higher accident rate of the UAV compared with that of the man–machine. The expansion of the UAV industry and the diversification of application scenarios have led to a frequent occurrence of safety accidents during UAV flights, thus prompting widespread concerns regarding the reliability and safety of UAVs.
UAV flight data refer to a series of flight parameters related to UAV performance and status collected by onboard sensors during flight, usually including various attitudes, speed, altitude, position, and other information. The common parameters are shown in
Table 1.
Currently, anomaly detection based on UAV flight data is a powerful approach for UAV condition monitoring and potential anomaly mining. This data-driven method eliminates the reliance on specific physical models or expert knowledge, allowing for the comprehensive utilization of flight data to achieve effective anomaly detection [
6]. It holds more promise compared to prior knowledge and model-based methods, making it highly significant in reducing the risk of UAV flight accidents and enhancing operational safety. Many researchers are using machine learning algorithms to conduct anomaly detection research on UAV flight data due to the rapid development of machine learning technology. Wang et al. [
7] proposed an online anomaly detection method for UAV flight status using the least squares support vector machine (LS-SVM) prediction model. The proposed algorithm was verified using simulated flight data, and the experimental results demonstrated its effectiveness in detecting online anomalies for UAVs. Bronz et al. [
8] used SVM algorithms based on UAV flight logs to categorize UAV behavior during normal and fault phases and achieved a high level of Accuracy in detecting uncontrolled UAV faults. Yaman et al. [
9] developed a lightweight fault detection algorithm for UAV engine anomalies using the SVM algorithm to classify the audio signals, which can work in real time in an embedded system. González-Etchemaite et al. [
10] developed a supervised learning-based fault detection and identification module for multi-rotor UAVs, which realized real-time detection based on random forest and support vector machine. Experiments on simulated and real data verified the effectiveness of the solution. Thanaraj et al. [
11] proposed a hybrid FDI model for a quadrotor UAV that integrates an extreme learning neuro-fuzzy algorithm with a model-based extended Kalman filter (EKF). The results show that the Fuzzy-ELM FDI model exhibits greater efficiency and sensitivity, while Fuzzy-ELM and R-EL-ANFIS FDI models demonstrate better performance than a conventional neuro-fuzzy algorithm. Li et al. [
12] detected anomalies in flight data based on the density similarity between the data using the DBSCAN clustering algorithm and achieved better results. Altinors et al. [
13] performed fault diagnosis based on data received by the UAV motor. Signal preprocessing, feature extraction, and machine learning methods were applied to the obtained dataset. Decision tree (DT), Support Vector Machines (SVM), and K-Nearest Neighbor algorithms are used for experiments, and the results showed that the three machine learning algorithms all achieved high Accuracy. Aiming at the high-dimensional characteristics of UAV sensor data, Duan [
14] et al. proposed an anomaly detection method for UAV sensor data based on Kernel Principal Component Analysis (KPCA) and experimented with simulated data, and the results showed that the method can achieve satisfactory results. Ahn et al. [
15] proposed a detection algorithm based on clustering and a convolutional neural network for the anomaly detection problem of UAV swarm flight data. First, the flight data were labeled by principal component analysis and K-means clustering, and then a one-dimensional convolutional neural network classifier was trained. This method achieved good results on real flight data. It can be seen from the above relevant studies that machine learning methods are widely used in anomaly detection of UAV.
In recent years, deep learning technology has developed rapidly, and it has been widely used in many research fields, such as lithium battery health prediction [
16,
17,
18], bearing fault diagnosis [
19,
20,
21,
22], and air quality prediction [
23,
24,
25,
26]. Therefore, some researchers have started to study the anomaly detection of UAV flight data based on deep learning technology to ensure the safety and reliability of UAV flights. Xiao et al. [
27] proposed a scheme based on recurrent neural networks (RNNs) for the detection of abnormal behavior of UAVs. Firstly, the RNN was used to train and build the normal behavior model of UAV, and then an appropriate threshold was selected through extensive experiments. If the normalized root mean square error between the true position and the model output position is greater than the threshold, the UAV is considered to have abnormal behavior. Finally, the proposed scheme achieves an average Accuracy of 98% on both experimental data. Wang et al. [
28] proposed a long-short-term memory recurrent neural network method for UAV anomaly detection and used real UAV flight data to verify the method. The experimental results show that the method can effectively detect point anomalies. Jeon et al. [
29] proposed a method for detecting structural abnormalities of quadcopter UAVs, which uses a long short-term memory (LSTM) network and an autoencoder model to learn complex features from routine flight data. The experimental results show that the proposed method has a good effect on detecting structural abnormalities of UAVs. Yang et al. [
30] proposed a method for detecting anomalous states in UAVs using timestamp slicing and multiple separable convolutional neural networks (TS-MSCNN) and conducted experiments with real data. This method solves the problem that the traditional abnormal state detection model ignores the difference of POS data frequency domain in the process of feature learning. Wang et al. [
31] proposed a method based on long and short-term memory residual filtering. Firstly, the method involves extracting the spatiotemporal characteristics of the flight data using an LSTM network to obtain the estimated values of the monitoring parameters. Then the residuals between the real data and the estimated values are smoothed by the filter. Finally, the smoothed residuals are compared with a statistical threshold to achieve fault detection of UAV flight data. Wang et al. [
32] proposed a multivariate sequence anomaly detection model based on an improved graph neural network with a transformer, a graph attention mechanism, and a multi-channel fusion mechanism. The combination of a multi-channel transformer structure and a graph attention mechanism enables intrinsic pattern extraction from different data and better captures the features of the time series. Finally, the multi-channel data fusion module is used to integrate the global information and improve the Accuracy of anomaly detection. The experimental results demonstrate that the average Accuracy of the proposed model is 92.83% and 96.59% on the two datasets of UAV, respectively. Zhong et al. [
33] conducted research on anomaly detection and recovery prediction of UAV flight data by combining artificial neural networks and long-short-term memory networks based on spatiotemporal correlation. Artificial neural networks were used to mine the spatiotemporal correlation of flight data and screen out relevant parameter sets. Subsequently, a long-term and short-term memory network model was trained based on these parameter sets to achieve anomaly detection and recovery prediction. Anidjar et al. [
34] collected audio information from the UAV flight via a Bluetooth earphone fixed on top of the UAV and then converted the audio signal into a graphical representation using the Wav2Vec2 model based on the transformer structure. Next, a modified VGG-16 convolutional neural network is used to train the image classification model to achieve anomaly detection of the UAV. Finally, the authors further improved the approach for real-time detection and verified the effectiveness of the approach in real-time scenarios. In addition, the establishment of UAV real-time anomaly detection systems based on deep learning technology is also a popular research field. At present, many anomaly detection algorithms have high accuracy, but the calculation process is complicated and takes a long time, which makes it difficult to meet the needs of real-time scenarios. Therefore, the research of high-accuracy and lightweight algorithms based on deep learning technology is the future development direction.
Autoencoder is a typical unsupervised learning model that enables the automatic learning of a representation function of a dataset from a large number of data samples, which can be used for feature extraction and data dimensionality reduction. In order to quickly achieve anomaly detection from system log files, Cavallaro et al. [
35] analyzed log files from data centers. First, they used modern Natural Language Processing (NLP) methods to map the log files’ words to a high-dimensional metric space; then, they used an invariant mining model and autoencoder to cluster and classify various system events. Finally, they carried out experiments and obtained an average of F-measure metric over 86%. With the development of deep learning technology, autoencoder models have attracted the attention of many researchers and have been applied in many fields [
36,
37,
38]. Meanwhile, researchers have proposed improved models for traditional autoencoders, such as noise reduction autoencoders, convolutional autoencoders, and stacked autoencoders. Autoencoder models have been successfully applied in anomaly detection and fault diagnosis [
39,
40] due to their ability to learn effective data representations for downstream classification or regression tasks. Zhang et al. [
41] proposed a method for anomaly detection in high-dimensional data based on an autoencoder and least squares support vector machine. The experimental results on real high-dimensional datasets demonstrate the method’s excellent performance. Yang et al. [
42] proposed the STC-LSTM-AE method, a spatiotemporal correlation neural network based on LSTM and autoencoder, for unsupervised anomaly detection and the recovery of UAV flight data. The model uses the Savitzky–Golay filtering technique to reduce sensitivity to data noise. The effectiveness of the method is verified using real UAV flight data.
However, although traditional autoencoder models are commonly used in UAV anomaly detection research, stacked autoencoders, a deep model, have received less attention in this field. Due to the relative simplicity of the structure of the traditional self-encoder model, it may have insufficient feature extraction ability and overfitting problems when dealing with high-dimensional data. UAV flight data are typical high-dimensional data, so it is of great significance to investigate the application of stacked self-encoders in UAV flight data anomaly detection. In addition, as the difficulty of UAV missions increases and the harsh environment in which they operate increases, the Accuracy of UAV status information acquisition decreases, and the collected flight data contain a large amount of noisy data, which reduces the effectiveness of the anomaly detection algorithm to some extent. Therefore, it is necessary to study noise reduction processing for UAV flight data and combine it with deep learning models to improve the Accuracy of the anomaly detection algorithm in detecting noisy data. Wavelet decomposition is a signal processing technology based on wavelet transform, which can be used to remove noise in signals and has been widely used in many research fields [
43,
44,
45,
46,
47,
48].
This paper proposes a UAV flight data anomaly detection method based on wavelet decomposition and stacked denoising autoencoder. The wavelet decomposition technology can effectively denoise the original data. As an efficient representation learning model, the stacked denoising autoencoder can well complete the feature extraction of data, which can then be input to the softmax classifier to achieve the classification and identification of the anomalous state. A series of experiments were conducted on an actual dataset to verify the effectiveness of the proposed method. It was also compared with other common anomaly detection algorithms, and the comparative results demonstrate its superior performance. The innovations of this study are outlined below:
This study proposes a deep learning method based on wavelet decomposition and stacked denoising autoencoder for detecting anomalies in noisy UAV data.
The adoption of wavelet decomposition can effectively filter the noise information in the original data and improve the signal-to-noise ratio of the data.
As a deep representation learning model, the stacked denoising autoencoder can effectively learn the feature representation of high-dimensional UAV data, reduce the data dimension, and overcome the difficulty of insufficient abnormal data in the dataset to a certain extent.
By improving the reconstruction loss function of the stacked autoencoder model, selecting the PReLU activation function, adding the batch normalization layer, and setting the learning rate dynamic adjustment strategy CosineAnnealingLR, the method gives better performance on real datasets.
This study is divided into five parts.
Section 1 outlines the background significance of the study while providing an overview of UAV anomaly detection research and the shortcomings of the current research.
Section 2 describes the data and outlines the selected data parameters. It also presents various models and methods, such as wavelet decomposition, autoencoder, denoising autoencoder, stacked denoising autoencoder models, and evaluation metrics for model performance.
Section 3 introduces the experimental conditions, research methods, and results, and it conducts a model comparison experiment.
Section 4 is the experimental study of some important factors that affect the performance of the model. Finally,
Section 5 summarizes the research of this paper.
4. Discussion
To assess the effectiveness and importance of wavelet decomposition denoising, we conducted a repeated experiment using the stacked denoising autoencoder without wavelet decomposition. We kept all data preprocessing operations and parameter settings the same as in the original experiment. The results of the two experiments are shown in
Table 6.
Table 6 shows that the stacked noise reduction autoencoder model achieves better classification after wavelet denoising. This indicates that wavelet decomposition effectively reduces the negative impact of high-frequency noise on feature extraction, and the SDAE extracts important feature representations.
To explore the influence of wavelet decomposition layers on subsequent models, we tested the data filtered by different wavelet layers. In the test, other experimental settings were identical except for wavelet decomposition layers. The wavelet decomposition layers are set as 1, 2, 3, and 4, respectively, for the experiment, and the results are shown in
Table 7.
It can be seen from
Table 7 that when the number of wavelet decomposition layers is 1, 2, 3, and 4, the Accuracy of the model is 95.73%, 96.71%, 97.53%, and 95.18%, respectively. The model works best when the number of wavelet decomposition layers is 3.
In addition, the Accuracy of the model increases as the number of wavelet decomposition layers increases, which may be since as the number of wavelet decomposition layers increases, the greater the filtering of high-frequency noise. However, when the number of wavelet decomposition layers is 4, the model’s Accuracy does not increase but decreases, which may be that excessive filtering destroys the important features of the data.
The random noise added to the network is an important factor affecting the performance of the stacked denoising autoencoder. Excessive noise interference will increase the difficulty of model training and reconstruction output, and affect the result of feature learning. Too little noise may lead to the inability to extract robust feature representation. To determine the appropriate noise and make the model extract more robust feature expression, this paper carried out test experiments with different proportions of Gaussian noise and selected the appropriate noise ratio according to their training loss results. Under the same model structure, the noise ratios of 0.05, 0.1, 0.15, 0.2, 0.25, and 0.3 are selected for training. The training loss results under different noise ratios are shown in
Figure 10.
It can be seen from
Figure 10 that under the same training round, when the noise ratio is 0.1, the training loss value is the smallest, which indicates that the feature extraction and data reconstruction ability of the model is the best at this time. In addition,
Figure 10c–f show that if a larger proportion of Gaussian noise is added, the loss value of the model will increase. Therefore, it is necessary to add an appropriate proportion of noise in combination with the actual situation of the data, which will help the model extract robust data features and better reconstruct the data.