1. Introduction
Hypersonic targets (HTs), achieving significant space access and prompt global striking ability, have attracted much attention over the past few decades [
1]. The classification, recognition and detection of the observed HTs are of great significance in aerospace security defense [
2]. Their observed features are degraded owing to the high moving speed of the HTs, leading to difficulty in detection and recognition with radar and infrared detectors [
3]. Fortunately, the HT surface will generate high temperatures from intense friction with the surrounding atmosphere during the flight state. It ionizes air molecules around the vehicle, developing a certain thickness of “plasma sheath” [
4,
5,
6,
7,
8]. The generated plasma sheath usually shows a thermodynamic nonequilibrium state and will continuously emit strong spectral radiation, which can be used as a fingerprint feature to realize the detection and classification of HTs [
9,
10,
11,
12,
13].
When using the hyperspectral features (HFs) of HTs obtained by space-based detectors for actual observations, greater emphasis should be placed on the unknown military threats of the new class of HTs that the detectors have not observed [
14]. An algorithm should be proposed to distinguish whether the detected samples belong to previously observed classes or a newly emerged class [
15]. Developing an annotated data set is difficult even if a certain number of HFs of the HTs observed with space-based hyperspectral detectors is accumulated to determine the unique value of the HTs [
16]. Therefore, we propose the detection of the new class of hypersonic targets (DNHT) method to be completed in an unsupervised manner.
Many industrial fields are also facing the problem of new class target detection. Some methods have been formulated for the detection of the new class as a two-class recognition problem (unknown versus known) where both positive and negative samples are relevant [
17,
18,
19]. The common approach for detecting the new class is to collect both real and fake data and try to learn a suitable two-class classifier, employing a supervised or semi-supervised approach [
20,
21]. However, the collection of new HTs is difficult, and the detection methods based on two classes are not suitable for DNHT. There are also novelty detection methods that mainly identify data with unsupervised methods. Liu et al. proposed iForest [
22], and Yang studied a KNN-based approach for unsupervised novelty detection [
23,
24]. Rettig et al. proposed online anomaly detection over big data streams [
25], and the unsupervised real-time anomaly detection for streaming data was subsequently developed [
26]. However, unsupervised new class detection requires a large difference between the features of the new class and the existing class samples, that is, anomaly characteristics. The interclass variability of the HFs of HTs might be smaller than the intraclass variability, which results in a high error rate for the new class detection method based on anomaly detection. Another line of work focuses on open-set recognition (OSR) [
27], which aims to classify known classes and reject novel ones. OpenMax utilized Weibull-based calibration to augment the softmax layer and detect novel classes [
28]. Proser proposed reserving the probability for novel classes during close-set training and transformed closed-set training into open-set training [
29]. CPL optimized the embedding with margin-based classification loss for better feature extraction [
30]. However, these OSR models must be trained with a large data set, and annotated observed HT data sets are currently unavailable. Additionally, the imbalanced hypersonic observations often degrade the network performance [
31].
In general, the above algorithms will encounter two challenges when applied to DNHT. First, the accumulated observation samples are not labeled, and the new class HTs are difficult to obtain, which causes the learning of the two-class classifier and OSR framework to fail. Second, detecting the new class based on anomaly detection requires a large feature difference between the new class and previous observations. However, the intraclass variability of HTs is greater than the interclass difference, and the anomaly characteristic of the new class is not apparent compared with globally observed samples. To detect the new class hypersonic vehicle under streaming emerging HTs, we propose using the unsupervised subclass definition and the efficient isolation forest based on anomalous hyperspectral feature selection (USD-EiForest) method for DNHT based on unsupervised classification and anomaly detection, which has the following distinguishing features:
Existing algorithms train classifiers in a supervised or semi-supervised manner to achieve new class detection. In contrast, the proposed USD-EiForest method employs an unsupervised learning model to solve the problem of DNHT in the case where the HFs data set of the HTs has not yet been constructed.
The phenomenon is revealed, whereby the HFs of new class HTs have prominent anomaly characteristics relative to the local subclasses of observed samples. To take full advantage of the local anomaly characteristic for new class detection, we use the unsupervised subclass definition method based on density peak clustering (DPC) to achieve subclass division.
Because the features of the observed targets are more concentrated and more conducive to anomaly detection, an efficient isolation forest algorithm based on anomalous HFs selection is proposed with high detection efficiency and accuracy for new class detection with respect to subclass samples.
The rest of this paper is organized as follows.
Section 2 describes the intuition of the proposed algorithm.
Section 3 describes the related definitions and details of the proposed algorithm. We report the experimental results in
Section 4, which show that the proposed method can accurately detect the new types of HTs. The conclusion is provided in
Section 5.
2. Interclass and Intraclass Variation of HFs of HTs
The plasma sheath (thermochemical nonequilibrium flow field) is generated while an HT is flying in near space [
2], which usually exhibits the stated thermodynamic nonequilibrium [
32]. Research has revealed that the gas in the plasma sheath of the HT (O
2, N
2, NO, N, O, NO
+, N
2+, O
2+, N
+, O
+ and e
−) has a strong spectral radiation effect [
33,
34]. Although the HFs of HTs based on actual space-based detectors are unavailable for the development of HTs and are not comprehensive enough, acquiring HT samples will be straightforward when HTs are put into practical applications in the future. To alleviate the spatial redundancy of the spectral characteristics of the object, this section provides the spectral radiation characteristics at the place with the strongest spectral intensity as fingerprint features for classification and detection based on the numerical calculation results of the surface flow field of the RAM C-II and HTV-2 re-entry HTs. This provides data support for subsequent proposal verification and reveals the distribution of intraclass and interclass HFs of HTs.
The process of obtaining the spectral radiation characteristics of the HTs using the space-based spectral detector is shown in
Figure 1. According to various gas kinetic models, the air plasma sheath can contain several different compositions, such as 5, 7 and 11 species. To consider the computational complexity and simulation accuracy of spectral radiation characteristics at the same time, this paper adopted the gas model of seven species (N
2, O
2, N, O, NO, NO
+, e
−). First, the emission and absorption coefficients that describe the radiative properties are of great importance to the calculations of radiative transfer in the thermochemical nonequilibrium flow field, which has been solved in the literature [
13]. Second, the spectral radiation intensity on the surface of the plasma sheath that envelopes the HT was calculated with the line of sight (LOS) method, assuming that each layered medium is isotropic and isothermal; this was used to compute the radiation transfer equation with known emission and absorption coefficients [
35]. Finally, the transmission model was established based on MODTRAN5 software to reflect the atmospheric attenuation effect of spectral transmission [
36]. The brief process of the software is depicted in
Figure 2, including atmospheric data input, aerosol data input, geometric data input and spectral information input. The 1976 American standard atmospheric model [
37] was used to simulate atmospheric transmission. The spectral radiation characteristics of HTs picked up by the space-based detector after atmospheric attenuation were used for unsupervised classification. The details of the whole process can be found in Ref [
38]. However, owing to the scarcity of plasma sheath flow field data of hypersonic targets, this paper only simulated the HFs of RAM-C and HTV 2 hypersonic targets under different typical flight states.
The spectral radiation characteristics of RAM-C II and HTV-2 HTs observed by space-based low earth orbit (LEO) detectors under different flight states were calculated based on the geometric model and the flow field distribution of these two HTs [
38,
39]. The results are shown in
Figure 3, indicating the intensities of the calculated spectral features not of the same order of magnitude. Moreover, it is impracticable for space-based detectors to obtain all the spectral bands of radiation characteristics of HTs. Therefore, 120 wavebands were selected and normalized at the radiation intensity peak, as shown in
Figure 4. To more clearly reflect the intraclass and interclass variation of the HFs of HTs, principal component analysis (PCA) was carried out on HFs, and the result is shown in
Figure 5 and
Table 1. The pink represents the HTV-2 HTs, and the blue represents the RAM-C II HTs. We can see in
Figure 5 that the HFs of the same type of HTs were different when the flight states changed. When combined with the specific values in
Table 1, this shows that different types of HTs flying in the same state have small HF differences.
Anomaly detection techniques can be successfully used in the area of new class detection if it is understood that the instance of any developing new classes is distant from the known classes or at the edge of the data cloud of known classes. When RAM-C II was viewed as the known class and HTV-2 as novel class, it is evident from
Figure 4 and
Figure 5 that the spectral radiation characteristics of HTV-2 did not differ significantly from those of the global RAM-C II samples. The interclass variability between the two classes of HTs was far lower than the intraclass variability, making it difficult to detect the new class using anomaly detection methods.
5. Conclusions
This paper addressed the HTs’ new class detection problem under challenging conditions without new class samples and data sets. For this purpose, a USD-EiForest algorithm based on unsupervised subclass definition and efficient iForest was presented. Benefiting from the local anomaly characteristics of the new class HT sample relative to the subclasses, the proposal determined whether the sample to be detected was a new class for each subclass to realize the DNHT. Experimental evaluation of the proposed algorithm on simulated HFs of RAM-C II and HTV-2 HTs illustrated that the proposal could detect a new class of HT with high detection accuracy and efficiency. The unsupervised subclass division algorithm divided the observed HT samples into multiple subclasses, and the Nf and Nt of EiForest were significantly reduced compared with the traditional iForest, improving the detection efficiency. However, the algorithm proposed in this paper can only determine whether a sample is a new class at present. With the construction of HFs data sets of HTs, it will be necessary to develop an algorithm that enables both new class detection and known class classification in the future.
Moreover, the research contribution of this paper lays the theoretical foundation for the new class detection of HTs based on HFs. Because of the limitation of the military value of hypersonic targets, we could only verify the algorithm on a simulation data set. However, the spectral radiation characteristics of the simulated data set were calculated based on the real measured flow field data, allowing the simulated hyperspectral data set and the real observed data set to maintain a high consistency. Therefore, with the development of hypersonic targets in the future, the proposed method in this paper may be applied to the actual scene with a suitable space-based detector. The research in this paper is based on the unlabeled observation data set to detect the new types of samples that have not been observed. The classification and labeling of observed samples, construction corresponding to the data set, and supporting type recognition of known hypersonic targets are the work that we carried out and will be published later.