Speciﬁc Emitter Identiﬁcation Based on Multi-Domain Feature Fusion and Integrated Learning

: Speciﬁc Emitter Identiﬁcation (SEI) is a key research problem in the ﬁeld of information countermeasures. It is one of the key technologies required to be solved urgently in the target reconnaissance system. It has the ability to distinguish between different individual radiation sources according to the varying individual characteristics of the emitter hardware within the transmitted signals. In response to the lack of scarcity among labeled samples in speciﬁc emitter identiﬁcation, this paper proposes a method combining multi-domain feature fusion and integrated learning (MDFFIL). First, the received signal is preprocessed to obtain segmented time domain signal samples. Then, the signal is converted to time–frequency distribution using wavelet transform. Afterwards, an integrated learning two-stage recognition classiﬁcation method is designed to extract data features of 1D time domain signals and 2D time–frequency distribution signals using the symmetry network structures of CVResNet and ResNet. Finally, fused features are fed into the complex-valued residual network classiﬁer to obtain the ﬁnal classiﬁcation results. We demonstrate through the analysis results of the measured data that the proposed method has a higher accuracy as compared with the classical feature extraction method, and that this can improve the identiﬁcation of communication radiation sources with fewer labeled samples.


Introduction
Specific Emitter Identification (SEI) is the process of identifying individual emitters by matching the characteristics of the received signal with the emitter for correlation [1]. The production and manufacturing process determines the defects of the hardware. This makes the emitter features unique and difficult to imitate [2]. Therefore, with the help of the fingerprint features of the emitter, SEI is widely used in military and civilian wireless applications [3][4][5].
SEI technology is the fusion of signal processing technology and pattern recognition technology, which can be divided into three parts: data preprocessing, analysis of subtle features, and design of the classifier. Data and processing are used to modify the received signal into data that is suitable for feature extraction through certain methods of processing (such as filtering, normalization, mathematical transformation, etc.). The analysis of subtle features is a process that can effectively and reliably extract the subtle features from the signal by using the existing signal processing methods, while taking into consideration the individual information of the radiation source. Then, the subtle features that can effectively identify the individual radiation source are selected. Designing a classifier is a process that takes into account the quality and efficiency of classification according to the characteristics of the fine features, which are obtained through an analysis of those fine features. The problem of individual recognition of communication radiation sources can be attributed to the classification problem in machine learning. Classification accuracy, on the other hand, is mainly determined by feature extraction methods and classification algorithms.
The electromagnetic environment is becoming more and more complex, which brings more challenges to SEI technology. There are two main types of SEI methods: manual feature extraction-based methods and deep learning-based methods. Artificial feature extraction-based methods rely on signal processing algorithms, and require expert knowledge support. The artificial feature extraction-based methods mainly use the time domain features and transform domain features of the signal. For example, for the time domain characteristics of the signal, N. Scrinken [6] used sliding windows to extract the information of dimensional constructive features of the signal for the recognition of transient signals. Similarly, L. Wu [7] extracted the box dimension and variance dimension for the recognition of individuals. Furthermore, G. Huang [8] used the nonlinear dynamics to extract the alignment entropy for the matching recognition of transmitters. However, these methods are susceptible to noise and have certain limitations. Thus, more mainstream research is based on transform domain features. Time-frequency analysis can be used to reflect subtle differences through the time-frequency joint domain information of the signal. For example, G. Lopez-Risueno proposed a digital channelized receiver based on Short-Time Fourier Transform (STFT) [9], and C. Bertoncini used dynamic wavelet fingerprints to extract features [10]. Hilbert-Huang Transform (HHT) is a well-known method for processing nonlinear and non-stationary signals [11]. Pan [12] converts Hilbert spectra into grayscale images to represent features, and Zhang considers single-hop scenarios and relay scenarios, and proposes three SEI algorithms based on the Hilbert spectrum [13]. In addition, higher-order spectrum-based methods are also a hot research topic. The use of higher-order spectra can maintain the signal amplitude and phase information in order to identify fingerprint features [14,15]. Additionally, it can maintain the method of graphical representation [16] and geometric features [17,18].
The method based on deep learning can use the nonlinear activation function to extract the subtle features of the deep-level radiation source through multiple hidden layers [19], which makes it different from the method based on artificial feature extraction. Deep learning-based methods take the original data-or the image converted from the original data-as the input information, and then uses the input information to train the deep neural network for fingerprint feature learning. Wong [20] inputs the original I/Q signals directly into the neural network and uses the convolutional neural network to measure each transmitter's in-phase/quadrature imbalance parameters. Merchant [21] proposed a method based on time domain complex baseband error signals for transmitter device identification. Pan [12] inputted the Hilbert spectrum into the deep residual network and found that it has better identification under various channel conditions. Sa [22] converted I/Q signals into Contour Stella images and inputted them into a Convolutional Neural Network (CNN) for classification. Wu [23] proposed a Recurrent Neural Network (RNN) recognition algorithm based on long and short-term memory, and found that it achieved high recognition accuracy under the condition of low signal-to-noise ratio. Ding [15] used the bispectral feature of the received signal as the input to CNN, and proved that it has higher accuracy than conventional methods. G. Baldini [24] compared various methods for training CNN by converting signals into images, and concluded that wavelet-based methods outperformed other methods. All of the above deep learning methods require adequate datasets, which can learn the network model more effectively and obtain better recognition accuracy as a result.
Feature extraction methods based on time domain features and transform domain features use a higher-order spectra, which mostly uses a single signal processing method to extract one of the subtle features. On the other hand, the actual communication signal is complex and variable. A single signal feature is not enough to fully and accurately represent the nuances between the radiation source signals. As a result, the final recognition accuracy is limited. In order to enhance the recognition accuracy, most models use deep neural networks as strong classifiers. Data-driven deep learning methods require a large amount of data. Actual communication is limited by time and labor costs. It is difficult to obtain sufficient radiation source signal data for deep learning training. Moreover, if the amount Symmetry 2021, 13, 1481 3 of 12 of data is too small, deeper and more complex neural networks are easily overfitted, which in turn seriously affects the final recognition effect. For the task of individual recognition under the condition of scarcity of labeled samples, some scholars have also proposed the transfer learning method, which first uses a large number of labeled datasets from other domains to train the neural network, and then uses the source domain to migrate to the sample set of the target domain. However, such a method requires obtaining a large number of matching datasets from other domains.
We combine intelligent learning with signal processing technology and divide the SEI task into two steps. The first step is data preprocessing and feature extraction: the one-dimensional complex-valued residual network is trained using time domain signal data, and the two-dimensional residual network is trained using wavelet-transformed time-frequency image data. The second step is the classifier design and training: the features extracted from one-dimensional time domain signal data and two-dimensional time-frequency variation domain data are fused to train the complex-valued neural network classifier. The data set to be recognized is then inputted into to the trained complexvalued neural network classifier to obtain the final classification results. The analysis results of the measured data prove that the proposed method has a higher accuracy rate as compared with the classical feature extraction method, and can improve the identification of communication radiation sources when the labeled samples are small.
The contribution of this paper can be summarized as follows: (1) A symmetric feature extraction architecture is designed, which integrates the time domain and time-frequency variation domain features of limited data to enhance the utilization of the data. (2) An ensemble learning approach is proposed to make full use of the performance of residual networks and complex-valued residual networks. (3) The combination of intelligent learning and signal processing technology greatly improves the ability to recognize radiation sources under the condition of sparse labeled samples.
The structure of this paper is composed as follows: in Section 2, the time-frequency analysis method and neural network model utilized in this paper are introduced; in Section 3, the classification method combining multi-domain features and integrated learning is described in detail; in Section 4, the identification results and discussion for both fixed-frequency and frequency-hopping sample sets are presented; and in Section 5, the conclusions are given.

Wavelet Transform
Since CNN are more suitable for extracting features from images, the time domain signal samples are converted into an image-like two-dimensional time-frequency matrix [12]. Among several two-dimensional representations of the signals, the time-frequency energy distribution shows better performance as compared to the recursive map and bispectrum forms [24]. In the time-frequency energy distribution, the STFT has a fixed resolution as compared to the wavelet transform. The HHT has sparse time-frequency distribution, and thus is not suitable for training neural networks. Wavelet Transform (WT) through localized analysis of time (space) frequency can highlight the subtle regional characteristics of the signal, and has a unique advantage in the field of individual identification. The Morlet wavelet is a complex-valued wavelet which has good aggregation in both time and frequency domains. The measured data collected in this paper are all complex signals. Therefore, the Morlet wavelet is selected as the fundamental wavelet function for time-frequency analysis, and the continuous wavelet transform is used to generate the time-frequency distribution.
where ψ is the Gaussian window, a = 0 is the scale factor, which changes the scaling of the wavelet function. Besides, τ is the time shift factor, which changes the translation of the wavelet function.

Residual Network
Theoretically, the deeper the layers of the neural network, the stronger the nonlinear mapping ability between hidden layers, and the richer the extracted features. However, as the depth of the network model continues to increase, the deeper layers do not improve the accuracy of the model, but instead cause the accuracy to be saturated or to rapidly decline. Residual Network [25] (ResNet) is one of the classical models of deep convolutional networks, which creatively introduces the residual structure, and transfers the results of the previous layer directly to the next layer of the network, so that the error will not further increase. It also serves to alleviate the gradient dispersion problem caused by too many layers. It is a good solution to the problem of "degradation" of the model. Residual learning is much easier than learning overall features. In this way, the entire neural network can be expanded, so that hundreds of layers of convolutional neural networks can be designed.
In the task scenario where labeled samples are scarce, we use a simplified version of ResNet-18 as the feature extractor for the time-frequency transformed two-dimensional image data set, which contains four residual learning units (Table 1). A residual learning unit is made up of two building blocks, so the simplified version of ResNet has nine convolutional layers.

Complex-Valued Residual Network
There are few articles about the basic structure of complex neural networks. In the following, we will give a brief description of the complex-valued convolution layers, the complex-valued weight initialization method, the complex-valued activation function, and the complex-valued normalization layer. Based on these structures, we use the complexvalued residual neural network.

•
Complex-valued convolution layer: Suppose the filter is W = A + iB and input h = x + iy. We use the real number network to simulate the complex-valued operation to obtain the complex-valued convolution output form as follows: expressed in matrix form as: where R(W * h) is the real part of the result of convolution W * h, V(W * h) is the imaginary part of the result of convolution W * h. Combining the real and imaginary parts into a new complex-value is the result of the complex-valued convolution.
• Complex-valued weight initialization method A suitable weight initialization method can speed up the convergence of the network and find the optimal solution quickly. To some extent, it can avoid the gradient disappearance or explosion when the network propagates in the reverse gradient. Define the form of complex weights as: where |w| and θ denote the magnitude and phase of the weights w, respectively, and the variance of the complex weights is expressed as follows: Assuming that w obeys the Gaussian distribution, and at this time its magnitude obeys the Rayleigh distribution, the following formula can be obtained according to the variance calculation of |w|.
Therefore, Var(|w|) = Var(w) − (E |w| 2 ) 2 , which is Var(w) = Var(|w|) + (E |w| 2 ) 2 . Obtaining the variance of the weights only requires calculating the mean and variance of the magnitude |w|. According to the properties of the Rayleigh distribution, it is known that the mean and variance of |w| can be expressed by the parameter σ of the Rayleigh distribution.
Thus, the complex-valued weight is •

Complex-valued activation function
The three main complex-valued activation functions are modReLU, CReLU, and zReLU. •

Complex-valued normalization layer
We use the reciprocal of the square root of the covariance matrix V to scale the data at the center of 0. The derivation process of the bulk normalization of the complex-value is based on the derivation of the normalization of the real numbers, which is derived as follows: where γ rr and γ ii are initialized to 1/ √ 2, γ ri and γ ir are the real part of the imaginary part of β, which are initialized to 0. The mean offset of V rr and V ii is initialized to 1/ √ 2, the mean offset of V ri , V ir , and β is initialized to 0, and the momentum of the mean offset is 0.9.
For the one-dimensional time domain signal dataset, we use the one-dimensional complex-valued residual network as the feature extractor, which contains four complex residual units (Figure 1). Similar to the real residual element, each complex residual element also has the structure of residual connection. Where X r and X i are the in-phase and quadrature components of the signal, H(X r , X i ) is the output through the complex residual unit. X is the envelope of the output X r and X i .

Specific Emitter Identification Based on Integrated Learning
The MDFFIL algorithm is mainly divided into two parts: feature extraction and classifier recognition. In the stage of feature extraction, we use ResNet and Complex-Valued ResNet algorithms to extract the time domain and time-frequency transform domain features of signals. In the stage of classifier recognition, we use complex network classifiers to comprehensively utilize the multi-domain features of signals. The specific operation process is shown in Figure 2 below.

Specific Emitter Identification Based on Integrated Learning
The MDFFIL algorithm is mainly divided into two parts: feature extraction and classifier recognition. In the stage of feature extraction, we use ResNet and Complex-Valued ResNet algorithms to extract the time domain and time-frequency transform domain features of signals. In the stage of classifier recognition, we use complex network classifiers to comprehensively utilize the multi-domain features of signals. The specific operation process is shown in Figure 2 below. plex-Valued ResNet algorithms to extract the time domain and time-frequency transform domain features of signals. In the stage of classifier recognition, we use complex network classifiers to comprehensively utilize the multi-domain features of signals. The specific operation process is shown in Figure 2 below.

Classifier Model Framework
In order to make better use of the features of different domains extracted from the feature extractor, we did not simply splice them together, but designed a classifier based on complex network architecture to comprehensively utilize multi-domain features. The classifier network model structure is shown in Table 2 below: We designed a complex-valued neural network classifier with data features extracted from the time domain and transform domain as inputs to the classifier. Then, the two data features are considered as real and imaginary inputs to the classifier to train the classifier network, which can make full use of the intrinsic connection between the data.

Network Model Setting
The network parameters of the feature extractor and classifier are set, as shown in Table 3 below. Among them, the batchsize of the residual network model is set to 16, the batchsize of the one-dimensional complex-valued residual network model is set to 64, and the network model of the complex-valued neural network classifier is also set to 64. In addition, the initial learning rate of all three network models is set to 1 × 10 −3 , but with the increase of the number of iterations, the learning rate decays by 0.01 for every 100 training sessions.

Baseline Methodology
We used the residual network and the complex-valued residual network [26,27] as comparison experiments. The input to the residual network is the transform domain data, and the input to the complex-value residual network is the one-dimensional time domain signal.

Experimental Data
The experimental data was generated by eight communication radiation source stations of the same model, and collected by the same receiving device. The model of the Symmetry 2021, 13, 1481 9 of 12 receiver is RSA6120A. The baseband signals generated by the radio station include both in-phase signals and quadrature signa. We acquired fixed frequency data and hopping frequency data from radio stations in both voice and digital modes of operation, and obtained four types of data. All of the signals sent by the radio stations are random. Among them, the fixed data frequency is 400 MHz, the hopping frequency range is 450-460 MH, and every 1 MHz is a frequency point. We introduce the signal parameters of the radio station and the parameters of different data sets in Tables 4 and 5.

Experimental Result
From the preprocessed time domain signal data sample set, we randomly selected a different number (100, 200, . . . , 500) of samples as a training sample set. These samples were then put through the wavelet transform to get the corresponding change domain data. Moreover, from each station, we randomly selected 500 samples through the same data process in order to get 4000 identified radiation source test sample sets. The following three identification algorithms were used for testing, and the experiment was repeated five times in order to obtain the average accuracy of the identification results. The identification effect is compared as follows (Figures 3-6). For the two-dimensional change domain signal, we use a real residual neural network, which is able to take advantage of the convolutional network for image processing. In the classifier recognition stage, we again consider the features extracted by the two residual network models as the real and imaginary inputs to the classifier, aiming at mining their correlation. The experimental results demonstrate the superiority of the method, which shows that some more informative parts behind the original data are not exploited. The identification accuracy of various identification algorithms increases with the increase of the input labeled samples, which shows the impact of sufficient numbers of samples on identification classification. It also reflects the importance of mining the information inherent in limited data.

Conclusions
In the actual complex and changeable electromagnetic environment, the non-partner cannot detect and receive a large number of radiation source data. Learning algorithms that rely on deep neural networks generally require a large amount of training data. When the training data is too small, these methods are difficult to achieve the desired results in the test data. In order to address the problem of the scarcity of label samples in real scenes, leading to the difficulty of convergence of deep learning models, this paper proposes a fusion classification and identification method for communicating radiation sources based on MDFFIL. By wavelet transform processing of the original signal, time domain data and two-dimensional time-frequency data are obtained. In addition, this

Conclusions
In the actual complex and changeable electromagnetic environment, the non-partner cannot detect and receive a large number of radiation source data. Learning algorithms that rely on deep neural networks generally require a large amount of training data. When the training data is too small, these methods are difficult to achieve the desired results in the test data. In order to address the problem of the scarcity of label samples in real scenes, leading to the difficulty of convergence of deep learning models, this paper proposes a fusion classification and identification method for communicating radiation sources based on MDFFIL. By wavelet transform processing of the original signal, time domain data and two-dimensional time-frequency data are obtained. In addition, this paper designs an integrated learning two-stage classification identification method. This In our proposed two-stage classification algorithm for integrated learning, we use a one-dimensional complex-valued residual network that is able to take advantage of the intrinsic connection between the in-phase component and the quadrature component for the feature extraction stage for time domain signals. For the two-dimensional change domain signal, we use a real residual neural network, which is able to take advantage of the convolutional network for image processing. In the classifier recognition stage, we again consider the features extracted by the two residual network models as the real and imaginary inputs to the classifier, aiming at mining their correlation. The experimental results demonstrate the superiority of the method, which shows that some more informative parts behind the original data are not exploited. The identification accuracy of various identification algorithms increases with the increase of the input labeled samples, which shows the impact of sufficient numbers of samples on identification classification. It also reflects the importance of mining the information inherent in limited data.

Conclusions
In the actual complex and changeable electromagnetic environment, the non-partner cannot detect and receive a large number of radiation source data. Learning algorithms that rely on deep neural networks generally require a large amount of training data. When the training data is too small, these methods are difficult to achieve the desired results in the test data. In order to address the problem of the scarcity of label samples in real scenes, leading to the difficulty of convergence of deep learning models, this paper proposes a fusion classification and identification method for communicating radiation sources based on MDFFIL. By wavelet transform processing of the original signal, time domain data and two-dimensional time-frequency data are obtained. In addition, this paper designs an integrated learning two-stage classification identification method. This method learns the time domain data by training with a one-dimensional complex-valued residual network, then learns the two-dimensional time-frequency data by training with a residual network, and finally obtains the final classification results by fusing two different features to train a complex-valued neural network classifier. Compared with other comparative algorithms, this method effectively improves the identification of Specific Emitter Identification in sparse labeled sample scenarios.