Next Article in Journal
Performance Evaluation of an Autonomously Driven Agricultural Vehicle in an Orchard Environment
Next Article in Special Issue
The Design and Development of a Ship Trajectory Data Management and Analysis System Based on AIS
Previous Article in Journal
Cloud Platforms for Context-Adaptive Positioning and Localisation in GNSS-Denied Scenarios—A Systematic Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Stage Feature Extraction and Classification for Ship-Radiated Noise

1
Department of Information and Communication, School of Informatics, Xiamen University, Xiamen 316005, China
2
Electrical Engineering Department, Faculty of Engineering, Aswan University, Aswan 81542, Egypt
3
China Electronics Technology Avionics Co., Ltd., Chengdu 610100, China
4
School of Electronic Science and Engineering, Xiamen University, Xiamen 361005, China
5
Department of Information and Communication Engineering, School of Electrical and Electronic Engineering, Tianjin University of Technology, Tianjin 300383, China
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(1), 112; https://doi.org/10.3390/s22010112
Submission received: 2 November 2021 / Revised: 21 December 2021 / Accepted: 22 December 2021 / Published: 24 December 2021
(This article belongs to the Special Issue Recent Advances in Underwater Signal Processing)

Abstract

:
Due to the complexity and unique features of the hydroacoustic channel, ship-radiated noise (SRN) detected using a passive sonar tends mostly to distort. SRN feature extraction has been proposed to improve the detected passive sonar signal. Unfortunately, the current methods used in SRN feature extraction have many shortcomings. Considering this, in this paper we propose a new multi-stage feature extraction approach to enhance the current SRN feature extractions based on enhanced variational mode decomposition (EVMD), weighted permutation entropy (WPE), local tangent space alignment (LTSA), and particle swarm optimization-based support vector machine (PSO-SVM). In the proposed method, first, we enhance the decomposition operation of the conventional VMD by decomposing the SRN signal into a finite group of intrinsic mode functions (IMFs) and then calculate the WPE of each IMF. Then, the high-dimensional features obtained are reduced to two-dimensional ones by using the LTSA method. Finally, the feature vectors are fed into the PSO-SVM multi-class classifier to realize the classification of different types of SRN sample. The simulation and experimental results demonstrate that the recognition rate of the proposed method overcomes the conventional SRN feature extraction methods, and it has a recognition rate of up to 96.6667%.

1. Introduction

Ships are playing an increasingly important role in many military and civilian applications. For example, in military field applications, an effective prediction for enemy ships helps us to take the correct action and activate our countermeasure to avoid enemy attacks and defeat them. For civilian applications, a logical comprehensive analysis of different port noise, including ship-radiated noise (SRN) can help researchers support the reproduction of marine life [1]. For improving the passive sonar operation in ship applications, SNR feature extraction has been proposed [1]. However, marine environmental diversity provides a rich noise environment, and that increases the difficulty of extracting features reflecting the intrinsic characteristics of the ships [1]. In recent years, studies of the SRN feature extraction have increased. Unfortunately, the current SRN feature extraction schemes have many drawbacks. For example, Fourier transform (FT) [1] is only useful in only estimating the signal spectral information, but it is unsuccessful at time-varying representation. To address this drawback, short-time Fourier transform (STFT) has been proposed to indicate the time-varying signal traits. However, a fixed STFT window width makes STFT unable to consider good representation for the time domain and frequency domain at the same time [1]. To overcome the STFT drawbacks wavelet transform (WT) has been proposed by using the unfixed length of the window) [2]. The time-frequency decomposition is improved based on WT, but the wavelet basis function and decomposition layers are required to be set in advance, which practically limits the further application for WT in practice.
Empirical mode decomposition (EMD) has been proposed in [3], to decompose the signal into a group of limited intrinsic mode functions (IMFs), the core of the Hilbert transform (HT). The main purpose of the HT is to snaffle the component of the instantaneous frequency. Therefore, EMD can carefully and rapidly describe the instantaneous frequency of multi-components, hence EMD is more suitable for analyzing non-linear and non-stationary signals. However, the main EMD drawback is the mode mixing problem and many researchers have tried hard to address this issue. To this end, the ensemble empirical mode decomposition (EEMD) [4] has been offered as an amended EMD method [4]. EEMD has been proposed to solve the mode mixing problem by adding Gaussian white noise to the construed signal, and at the end averaging the obtained multiple decompositions results to obtain the IMFs. However, EEMD introduces and has additional problems, firstly, as, in the EMD, the EEMD decomposition results include residual components. Secondly, due to the randomness of Gaussian white noise, the outcome of EEMD is diverse between each decomposition time. Hence, the EEMD lacks a solid mathematical foundation to be distributed and widely accepted. Unlike the EMD and EEMD, variational mode decomposition (VMD) has been proposed in [5], which assumes that each mode is concerning a central frequency with restricted bandwidth. Thus, to obtain the center frequency and bandwidth of each component, VMD constantly searches for the modes and center frequency of each mode by using an alternating direction method of a multiplier, thereby solving the variational problem. In recent years, many works tried to extend the current EMD, EEMD, and VMD methods and applied these schemes in the fields of biomedical engineering [6,7,8,9], mechanical fault diagnosis [10,11,12], and acoustic signal processing [13,14]. In [13], the VMD is firstly performed to decompose the SRN signal, and the permutation entropy (PE) of each IMF with the highest energy is then extracted, achieving a recognition rate of 94%. In [14,15] the VMD decomposition of the SRN signal was performed and the fluctuation-based dispersion entropy (FDE) of each IMF was studied; the obtained IMF with the smallest difference from the FDE of the prime signal was then chosen to describe the raw signal with a recognition rate of 97.5%. In the EMD-EIMF-PE method proposed in [16], the signal-dominant IMF by EMD was chosen based on the energy gauge, and its PE was regarded as the feature parameter effectively distributing the SRN. Although both [13,14] have high recognition rates, there are still flaws in these methods. The mode VMD number has specified the EMD results in [13,14], which will certainly influence the VMD decomposition accuracy. Since the pertinent studies failed to strictly obtain the VMD model number, this paper suggests a new enhanced VMD (EVMD) method with the mode number close to the variance of IMFs’ center frequency.
Entropy is an important indicator to measure the uncertainty of time series and can consider the implicit system dynamics. When the system dynamics vary, the time series complexity will be varied as well. PE [15] has been used in mechanical fault diagnosis [16], agricultural commodity analysis [17], financial sequence analysis [18] due to its fast operation speed and excellent stability. Nevertheless, PE does not consider the case of neighboring vectors having the same ordinal patterns with different amplitudes, which will lead to the estimated value higher than the actual [19]. To this end, due to the preamble weights, the weighted permutation entropy (WPE) [20] is further critical to the amplitude-coded information in the signal and has outperformed the PE in combating the distortion caused by noise. To the best of the authors’ knowledge, WPE has been widely used in uncertainty measurements in many fields [21,22], but is rarely used in the SRN feature extraction.
With current computer technology development and the fast growth of data, it is of great necessity to further process the original high-dimensional data. Feature extraction can be divided into two parts; extracting the feature vectors can reflect the essence of analyzing signals through suitable signal processing methods. The other is selecting an appropriate measure decreases algorithm to minimize the increases, and in that way support the confession execution. Up to now, researchers have suggested different dimension relief algorithms, such as principal component analysis (PCA) [23,24], independent component analysis (ICA) [24], and linear discriminant analysis (LDA) [25]. PCA recognizes measurement saving by realizing optimal variance without dropping the creative data. The PCA and ICA are linear unsupervised dimension reduction algorithms. LDA is also a linear projection method that achieves dimension reduction by making the most of the ratio of the discrete matrix among classes and the discrete matrix within the class. Due to the SRN non-linear characteristics, linear dimension reduction algorithms are scarce in removing the intrinsic features of SRN signals. As a non-linear manifold learning algorithm, local tangent space alignment (LTSA) has been extensively useful in dimension reduction, thanks to its fast process speed and selfishness to selected parameters [26,27,28,29]. To the best of our knowledge, there is no study combining EVMD, WPE, and LTSA to classify underwater acoustic targets. Each method has its pros and cons, motivating us to combine all methods to have the maximum benefits.
To that end, this paper puts forward a novel multistage feature extraction method proposing an EVMD method and combining it with the WPE, and LTSA for SRN samples classification. In this paper, the proposed EVMD method uses the variance of the IMFs’ center frequency to calculate the mode number of VMD and enhance its operation. Next, the new EVMD algorithm is used to decompose the SRN signals plus calculate the WPE of each IMF. Then high-dimensional features are reduced to two-dimensional ones by the LTSA method. Finally, the feature vectors obtained are fed into the PSO-SVM multi-class classifier to recognize the different types of SRN samples.
The structure of the paper is presented as follows: the fundamental theories of the relevant algorithms are described in Section 2. In Section 3, the basic steps of the proposed method are presented. Section 4 applies the proposed method to the analysis of simulated signals. In Section 5, the proposed method is utilized for the feature extraction of SRN. Finally, the paper is concluded in Section 6.

2. Basic Theory

In this section, the theories of the related methods such as VMD, PE, WPE, and LTSA will be presented.

2.1. Variational Mode Decomposition (VMD)

The VMD defines the IMF in the function of the instantaneous amplitude A m k ( t ) and phase P h k ( t ) as an amplitude-modulated-frequency-modulated (AM-FM) signal, given as below:
I k ( t ) = A m k ( t ) cos ( P h k ( t ) )
where the change of the A m k ( t ) and P h ˙ k ( t ) are slower than P h k ( t ) . Each I k ( t ) is compacted around a respective center frequency with limited bandwidth obtained by Gaussian smoothing demodulation. In the VMD algorithm, decomposing the raw signal s ( t ) into a finite group of IMFs to find the variational problem can be expressed as follows:
min { I k , f k } { k | | t [ ( δ ( t ) + j π t ) I k ( t ) ] e j 2 π f k t | | 2 2 } ,                   s . t . k I k = s ( t )
where t , δ ( t ) and f k represent the partial derivative, impulse function, and center frequency of I k ( t ) , respectively. The constrained variation problem in Equation (2) is addressed using the quadratic penalty term and the Lagrange multipliers below:
L ( { I k } , { f k } , λ ) = α k | | t [ ( t + j π t ) I k ( t ) ] e j 2 π f k t | | 2 2 + | | s ( t ) k I k ( t ) | | 2 2 + λ ( t ) , s ( t ) k I k ( t )
where α and λ denote the penalty factor and Lagrange multiplier, respectively. I k n + 1 , f k n + 1 and λ n + 1 are updated as follows:
I ^ k n + 1 ( f ) = s ^ ( f ) i k I ^ i ( f ) + λ ^ ( f ) 2 1 + 2 α ( f f k ) 2
f k n + 1 = 0 2 π f | I ^ k ( f ) | 2 d f 0 | I ^ k ( f ) | 2 d f
λ ^ n + 1 ( f ) = λ ^ n ( f ) + ε ( s ^ ( f ) k I ^ k n + 1 ( f ) )
where ε represents the update parameter. In this method, the stop condition is given by:
k | | I ^ k n + 1 I ^ k n | | 2 2 / | | I ^ k n | | 2 2 < a
where a denotes the convergence accuracy. The VMD algorithm can be summarized in: (1) initialize { I ^ k 1 } ,   { f k 1 } , λ ^ 1 and n = 0 ; (2) update the values of { I ^ k n + 1 } ,   { f k n + 1 } and λ ^ n + 1 based on Equations (4)–(6); (3) check the covariance condition based on Equation (7), and the details about the VMD algorithm are published in [5].

2.2. Permutation Entropy (PE)

PE [15] can not only characterize the randomness of the time series but also detect its dynamic changes. In addition, PE does not consider the amplitude value, but only compares the neighboring values, which makes its operation speed faster. For the given time series x = { x j } j = 1 N , the PE can be reconstructed as:
X i = { x ( i ) ,   x ( i + τ ) ,   , x ( i + ( m 1 ) τ ) } ,   i = 1 ,   2 ,   ,   N ( m 1 ) τ
where m is the embedding dimension, τ is the time delay and i = 1 , 2 , , N ( m 1 ) τ . The elements in X i can be rearranged in increasing order as:
x ( i + ( j 1 1 ) τ ) x ( i + ( j 2 1 ) τ ) x ( i + ( j m 1 ) τ )
If two of the rearranged elements are equal, then,
x ( i + ( j 1 1 ) τ ) = x ( i + ( j 2 1 ) τ )
hence, the new order can be denoted as:
x ( i + ( j 1 1 ) τ ) x ( i + ( j 2 1 ) τ ) ( j 1 j 2 )
Therefore, the symbols group can be obtained as:
S ( g ) = ( j 1 , j 2 , , j m )
where S ( g ) represents one of m ! symbol sequences in phase space, g = 1 , 2 , , k , and k m ! . If the probability distribution of the symbol sequence is P 1 , P 2 , P k , for convergence, the normalized PE is defined as follows:
H p ( m ) = ( ln ( m ! ) ) 1 g = 1 k P g ln ( P g )
From Equation (13), we can observe that the value of PE ranges from 0 to 1. H p indicates the randomness of time sequence, a larger H p value means higher complexity of the time series; a smaller H p value means lower uncertainty of the time series.

2.3. Weighted Permutation Entropy (WPE)

In the PE the neighboring vectors having the same ordinal patterns but with different amplitude values are unreasonably ignored. The WPE [20] has been proposed to take such a situation into account and overcome the PE shortcomings. In the WPE, forgiven embedding dimension m and time delay τ , first, the weight w i of neighbouring vectors X i is calculated as:
w i = k = 1 m [ x j + ( k 1 ) τ X ¯ j m , τ ] 2
X ¯ j m , τ = 1 m k = 1 m x j + ( k 1 ) τ
Then, the weighted relative frequency is calculated as:
p w ( π i m , τ ) =   j N 1 u : type ( u ) = π i ( X ¯ j m , τ )   j N 1 type ( u )   ( X ¯ j m , τ ) w j
Finally, the WPE definition is described below:
H w ( m , τ ) = i : π i m , τ   p w ( π i m , τ ) ln ( p w ( π i m , τ ) )

2.4. Local Tangent Space Alignment (LTSA)

Due to the advantages of insensitivity to parameter selection and fast operation, the local tangent space alignment (LTSA) method has been widely used in dimension reduction in multiple fields [29,30]. The basic idea of the LTSA algorithm is constructing the local tangent space by using the sample neighborhood and mapping the coordinates of the local tangent space corresponding to the global low-dimensional coordinates through the local radiological transformation matrix. Given the data X = { x 1 , x 1 , , x m } R M × N , the principle of LTSA can be briefly described as below:
(1)
Determine the K nearest neighbors of x i to form the set of X i , and centralize X ^ i ,
X i = [ x i , 1 ,   x i , 2 , , x i k ]
X ^ i = X i x ¯ i l k T
where x ¯ i = 1 k j = 1 k x i j and l k is a unit vector with dimension K .
(2)
Calculate the eigenvalues and eigenvectors of a matrix X ^ i by singular value decomposition. The eigenvectors corresponding to the first d largest singular values are the tangent space H i .
θ i j = H i T ( x i j x ¯ i )
(3)
Construct the transformation matrix L i = θ i + , to retain as much information as possible, and the following conditions must be met,
min μ ( Y ) = min i = 1 M | Y i ( I 1 k l l T ) L i θ i |
where θ i + represents the generalized inverse matrix of θ i , Y i represents the set of nearest neighbors of Y after dimension reduction, that is, Y i = ( y i 1 , y i 2 , , y i k ) .
(4)
Solve the optimization problem of Equation (21) by calculating the eigenvalues and eigenvectors of the matrix, and then the embedding matrix Y can be obtained. Equation (21) can be equivalent to the following equation:
min μ ( Y ) = min ( Y H W ) = min t r ( Y H W T H T Y T ) ,
{ H = ( H 1 , H 2 , , H M )                             W = d i a g   ( W 1 ,   W 2 , W M )       W i = ( I 1 k l l T ) ( I θ i + θ i ) I = Y Y T
The low-dimensional embedding matrix Y can be obtained by calculating the eigenvectors corresponding to the second to d -th smallest eigenvalues of the alignment matrix B and it can be calculated as:
B = H W W T H T

3. Proposed Feature Extraction Method Based on Enhanced Variational Mode Decomposition (EVMD), Weighted Permutation Entropy (WPE), and Local Tangent Space Alignment (LTSA)

In Section 2 the details of the basic theories of VMD, WPE, and LTSA are presented. However, the model number of VMD needs to be determined in advance, so this paper proposes the EVMD for SRN signal processing. A multi-stage feature extraction method fully inheriting the advantages of EVMD, WPE, and LTSA is proposed in this paper. The flowchart of the proposed method is shown in Figure 1. The main steps of the proposed method can be summarized as follows:
(1)
Based on the center frequency of the intrinsic mode functions, the variational mode decomposition mode number K will be calculated firstly.
(2)
The VMD algorithm based on the optimum mode number K obtained in the first step is applied to the decomposition of the SRN signals.
(3)
The WPE of each IMF by EVMD is then calculated.
(4)
The feature vectors obtained will be randomly divided into two groups, the first group represent the training data used in classification information while the second one is used for evaluating the testing data and measuring its ability for classification.
(5)
Definitively, PSO-SVM multiclass classifier uses the analysis data that are recognized and detects the underwater acoustic object.
To extract features of SRN, this paper classifies three types of SRN using a combination of VMD, WPE, LTSA, and PSO-SVM multi-class classifier. In the proposed method, the VMD mode number range is first set according to the EEMD decomposition results, and the variance of the IMFs’ center frequency after each decomposition is calculated. The mode number corresponding to the maximum variance is used as an optimum value for VMD. Then, VMD is performed on three SRN signal types. The WPE value of each IMF for each VMD decomposition is calculated. The high-dimensional features obtained are reduced to be two-dimensional features using LTSA. Finally, the obtained feature vectors are input to the PSO-SVM multi-class classifier to achieve classification and recognition of the samples.

4. Simulation Signals Analysis

4.1. Analysis of Simulated Signals Based on EVMD

In this paper, as the SRN signals contain the rich line spectral components, the simulated signals composed of the single-frequency components are introduced. In addition, the simulated signals are feasible as long as they satisfy the measurement signal conditions explained in [12], namely, their frequency interval should not be too small (like 1 Hz, 2 Hz, and 3 Hz). Following the same restriction conditions of [12] and verifying the effectiveness and feasibility of the proposed EVMD method. The simulated signals used in this paper are as follows:
{ f 1 ( t ) = cos ( 10 π t )                                                     f 2 ( t ) = cos ( 60 π t )                                                     f 3 ( t ) = cos ( 110 π t )                                               f ( t ) = f 1 ( t ) + f 2 ( t ) + f 3 ( t ) + η
where the data length is set to be 5000 with a sampling frequency of 1 kHz and η denotes the Gaussian white noise with C N ( 0 , 0.5 ) .
Following the literature [5], before performing the VMD algorithm, the model number K needs to be adjusted, which serves as a main factor affecting the VMD performance. Other parameters are set as constants, namely the balancing parameter of the data-fidelity constraint is α = 2000 , the convergence tolerance level is tol = 1 × 10−7 and the update mode of the center frequency is i n i t = 0 , 1 , 2 for center frequency iterated with 0, uniform distribution, or randomly. Too-large a K will lead to the occurrence of over-decomposition, which means undesirable spurious components will be generated during the decomposition process; too-small a K will cause under-decomposition, which discards some IMFs carrying useful information during the decomposition process. Hence, a properly chosen K value is crucial to the VMD method. Although the VMD method in [13,14] can successfully decompose the SRN signals to some extent, the method for determining the mode number of VMD has not been reasonably demonstrated, making it unacceptable scientifically. In general, the mode number of VMD will not be greater than the mode number of EMD and EEMD, so conventional EMD and EEMD methods are first employed to analyze the simulated signals above. Figure 2 shows the decomposition results and the time domain waveforms of the simulated signals.
Figure 2, show modeled indicator decomposed by using the EMD, one remaining part is obtained in addition to 10 IMFs. While in the EEMD decomposed method there are one remaining part and 10 IMFs, and they are separated. Hence, based on setting the variables for the mode number range to be ( 2 ~ 12 ) , and following calculation of every decomposition IMFs’ center frequency, the good mode number K has the maximum variance which maximizes the IMFs center frequency difference. Figure 3, shows the IMFs’ center frequency at different mode number K when the VMD decomposing method is used, and Figure 3 shows that K = 9 is the best choice.
When the mode number K is set to be more than the good, estimated value K = 9 , the variance begins to reduce and that shows irrelevant variation among the IMFs’ center frequency and the existence of the done decomposition. Figure 4, shows the decomposition results of the modeled signals based on the EVMD procedure. The IMFs’ center frequency distribution at different mode numbers K are listed in Table 1. Table 1 shows how components are unseparated for K = ( 2 ~ 6 ) , and for K 7 the modeled signal is separated. For K = ( 10 ~ 12 ) further false components are created to recognize the split of the modeled signals.
Based on the analysis above, the proposed EVMD method using the variance of the IMFs’ center frequency is feasible in calculating the mode number of VMD. To further verify this method in the VMD algorithm, the VMD method in [4,20] is also introduced for comparison. The correlation coefficients between the corresponding IMF and simulated signals are calculated and the results are listed in Table 2. As shown in Table 2, the corresponding three components of the proposed EVMD have the highest correlation coefficients with the simulated signals, and its decomposition performance is significantly better than the conventional methods of the EMD, EEMD, and VMD. This confirms the validity and feasibility of the method proposed in this paper for determining the VMD mode number.

4.2. Analysis of the Properties Concerning Weighted Permutation Entropy (WPE) and Permutation Entropy (PE)

According to the basic principles of WPE and PE explained in Section 2, when the signal is mutated, it is difficult for PE to detect this state, while WPE should be more sensitive to this mutation. To validate the conjecture, we generate a standard Gaussian white noise series with a length of 5000. As the pulse series means a larger fluctuation, we add the pulse series to these Gaussian white noise series. As in [31], the time delay and the embedding dimension are set as 1 and 6, respectively. The PE and WPE are calculated using a window function with a length of 500 and a sliding step of 50. The time-domain waveforms of the Gaussian white noise series and the signal plus additive pulse series are shown in Figure 5. The results of the calculated entropy are shown in Figure 6.
As shown in Figure 6, the WPE values of two signals of the SRN are smaller than the corresponding PE ones. This indicates how Gaussian white noise has a higher complexity and contains more information. There is no difference in the PE of the two signals in the pulse region, which means the inability of PE to distinguish between these two signals. This can be attributed to the neglect of the amplitude difference between neighboring vectors having the same ordinal patterns in PE calculation. As a result, PE performs poorly in effectively detecting fluctuations caused by noise. In contrast, the WPE value of the signal after superimposed pulses decreases significantly in the pulse region, indicating that the WPE can effectively detect the amplitude-encoded information contained in the signal due to the introduction of weights, which outperforms the PE in noise detection and fluctuation observation.
For further comparison between the performance of the PE and WPE, 50 1/f noise samples with a length of 500 are generated and the pulse series is superimposed on the raw 1/f noise series. For a fair comparison, PE and WPE calculations concern the two signals. The time-domain waveforms of the analyzed signals and the scatter plots of the calculation results are given in Figure 7 and Figure 8, respectively.
As shown in the results, thanks to the good anti-noise ability of WPE, making the estimated WPE value of the two signals is lower than the PE value. Also, sudden change detection is a hard task in the PE method as it neglects the amplitude information. In contrast, except for the significant difference in WPE values between the two signals, the WPE fluctuation trends are more dramatic than PE. Larger fluctuation means stronger discrimination ability. In short, the analysis of experimental results proves the advantages of WPE over PE.

5. Feature Extraction of Ship-Radiated Noise Based on the Proposed Method

5.1. Parameter Selection

The WPE calculation is dependent on the time delay τ and embedding dimension m . If m is too small, the reconstructed sequence will contain less state information, and the WPE algorithm cannot adequately detect the dynamic change of the time series. However, larger m indicates that the time series is homogenized by the reconstructed phase space and cannot detect the time series slight change. Also, if τ is too small, a strong correlation will occur between different delay vector elements, resulting in information redundancy. The phase space trajectory cannot be fully expanded when τ is too large. Therefore, such parameters should be adjusted first before WPE calculation.
To study the influence of m and τ on PE and WPE, the three types of SRN are randomly selected from the data set used in [32]. The samples were recorded on the Atlantic coast in north-western Spain ( 42 °   14   N ,   008 °   43.4   W ) at a depth of 10 m. The sampling frequency is 52.734   kHz and the data length is set to be 5000. The three types of SRN signals are named class A , B , and C , respectively. Figure 9 shows the time-domain waveforms of the normalized signals.
As in [30], the PE and WPE of the three types of SRN signals are calculated under the condition m = 3, 4, 5, 6, 7, and the time delay ranges from 1 to 20. The results are shown in Figure 10. As seen in Figure 10, the class C SRN signal has the maximum entropy value and thus the highest complexity.
Thanks to the excellent anti-noise performance of WPE, the WPE value of the same time series is less than the PE. WPE is more sensitive to the time delay compared to the PE, as the PE fluctuates subtly with the time delay increasing. The WPE and PE begin to separate when m = 6. However, when m is increased to 7, the computational complexity will be increased without improving the accuracy of calculation results. Based on the results obtained in Figure 10, a significant difference occurs between the WPE and PE when the time delay is equal to 1. Therefore, in the proposed method when calculating the WPE and PE, the delay and embedding dimensions are set to be 1 and 6, respectively. These parameters are consistent with the recommendations given by [15,31]. The PE and WPE of the signals versus the time delay are shown in Figure 11. As shown in Figure 11, the time delay has a greater influence on PE and WPE in some ranges, while less in others as the embedding dimension increases. Compared to PE, the trends of WPE fluctuate relatively sharply, as the WPE is more sensitive to the pattern extracted from signals containing amplitude information.

5.2. Decomposition of Ship-Radiated Noise Using VMD

As described in Section 3, to calculate the VMD model number, the EEMD algorithm is first employed to decompose the SRN signals. The decomposition results are presented in Figure 12. Figure 12 shows that 12 IMFs and one residual component are obtained after the EEMD of each type of SRN signal. For the sake of observation, K is set as ( 2 ~ 15 ) to calculate the variance of the IMFs’ center frequency. The results are given in Figure 13. It can be observed from Figure 13 that the optimal K is 12, and when K is higher than 12, the variance starts to decrease sharply, implying the occurrence of over-decomposition. The decomposition results by EVMD are given in Figure 14.

5.3. Classification of Ship-Radiated Noise (SRN)

In this section, the proposed method is applied to the SRN samples classification. 100 samples are randomly selected from each type of SRN sample and thus a total of 300 samples can be obtained. The EVMD is first performed to decompose these samples and the WPE of each IMF is calculated. Figure 15 shows the WPE mean and standard deviation.
Figure 15 shows, when the mode is IMF3 or IMF4, that the three types of samples can be distinguished, and the WPE values of class C samples are higher than that of the other two classes. This can be due to the dynamic behavior changes at different signals. Hence, WPE can effectively reflect the dynamic changes of the time series. However, when the mode is IMF2, the class A and B samples cannot be identified; when IMF5, only class A can be identified. In other modes, all three types of sample are indistinguishable. Therefore, not every IMF can fully characterize the raw SRN signal after the proposed EVMD decomposition. The results can be attributed to two points.
Firstly, due to the pollution of marine environmental noise, some IMFs belong to noise or noise-dominant components. Secondly, the occurrence of over-decomposition in the VMD algorithm allows different IMFs to share the same spectrum information. In this way, the dimension reduction algorithms are introduced to avoid dimensional disasters, thus preparing for the classification below. Next, the PCA, MDS, LLE, and LTSA methods are utilized for the low-dimensional feature extraction. The number of neighboring points and the target dimension are set as 15 and 2, respectively. The results are given in Figure 16. As shown in Figure 16, overall, most of the three types of samples can be distinguished by the four algorithms despite the partially overlapping samples between classes A and B. From a qualitative point of view, the combined EVMD-WPE-PCA algorithm has shown the worst performance with a small number of samples crossing between class A and B, and more samples overlapping between class B and C, while the performance of EVMD-WPE-MDS and EVMD-WPE-LLE has been significantly improved compared to EVMD-WPE-PCA with only a few samples of class A and B crossed. Despite the good performance of EVMD-WPE-PCA, EVMD-WPE-MDS, and EVMD-WPE-LLE in roughly identifying the three types of samples, the degree of clustering within the class is still small, especially for class B. In contrast, the proposed combined EVMD-WPE-LTSA method has the best clustering performance. The three kinds of SRN sample can be separated well, and the samples of classes B and C are better clustered. For a fair comparison, the scatter plots of WPE, EMD-WPE-LTSA and EEMD-WPE-LTSA are represented in Figure 17.
As shown in Figure 17a, affected by the oceanic environment and the ambient noise, we cannot recognize the majority of the three forms of samples as they will be opposed together. Hence in such a case, the algorithms of the decomposition should be used. For the EMD-WPE-LTSA shown in Figure 16b, a large proportion of samples of classes A and B are overlapped, which cannot be separated well. The samples in the C category are also intersected with the others. Hence, recognition of the three samples becomes difficult. While concerning the EEMD-WPE-LTSA, the samples in categories A and B can be completely separated, but the degree of separation between categories B and C is still small. From a qualitative point of view, there is no significant difference between EEMD-WPE-LTSA and EVMD-WPE-LTSA, and EEMD-WPE-LTSA even looks more clustered within the class, but we cannot conclude on the merits of both algorithms. Data visualization is only a qualitative tool rather than a quantitative one. In this situation, the PSO-SVM multi-class classifier is introduced to compare the algorithms above more accurately. Next, 60 samples are randomly selected from each class to be used as a training classifier and the remaining samples are left to be used in testing the performance. Also, both EMD-EIMF-PE [21] and VMD-WPE-LTSA (the mode number of VMD is set as [4,20]) methods are considered. The classification outputs are shown in Figure 18. Table 3 lists the classification accuracy and computational time for SRN feature extraction under different algorithms.
As shown in Figure 18 and Table 3, the direct calculation of the WPE for the samples fails to achieve the SRN samples identification as it has only a 62.5% recognition rate and that is far from the classification standard. The multistage classification techniques based on the proposed EVMD have the highest classification accuracy of 95.8333% and 96.6667% in EVMD-WPE-LLE and EVMD-WPE-LTSA, respectively. The proposed multistage classification techniques based on EVMD outperform the other conventional algorithms with the additional computational time cost. In general, computational complexity and classification accuracy are determined by the configuration of the hardware and algorithm design.

6. Conclusions

A novel multi-stage feature extraction method for underwater acoustic signals is proposed in this paper based on combining the new EVMD method with WPE and LTSA. The main innovations and contributions of this work can be summarized as follows:
(1)
The proposed EVMD method mode number was calculated based on the variance of IMFs’ center frequency.
(2)
WPE and LTSA were combined with the new EVMD and applied to underwater SRN signals for the first time.
(3)
The proposed EVMD-WPE-LTSA was successfully applied, and extracted the features of the underwater acoustic signals. Compared with other algorithms, EVMD-WPE-LTSA had a higher recognition rate and better classification performance at the cost of computational time.
The EVMD algorithm proposed in this paper to overcome the shortage of the VMD performed accurately in the field of underwater acoustic communication. Nevertheless, this paper only addresses the VMD mode number. In future, the optimization between both the model number and the quadratic penalty term will be considered to obtain high decomposition accuracy for the EVMD. Also, we will try to reduce the computation complexity.

Author Contributions

Conceptualization, H.E. and D.X.; methodology, H.E. and D.X.; software, D.X.; validation, Z.A.H.Q. and H.S.; formal analysis, J.Q. and J.W.; investigation, J.Q. and J.W.; resources, H.S.; data curation, D.X.; writing—original draft preparation, H.E. and D.X.; writing—review and editing, H.E. and D.X.; visualization, Z.A.H.Q. and H.S.; supervision, J.Q. and J.W.; project administration, J.Q. and J.W.; funding acquisition, H.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China (61671394), The Fundamental Research Funds for the Central Universities (20720170044), and the Natural Science Foundation of Tianjin (16JCQNJC01100).

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Xie, D.; Esmaiel, H.; Sun, H.; Qi, J.; Qasem, Z.A. Feature Extraction of Ship-Radiated Noise Based on Enhanced Variational Mode Decomposition, Normalized Correlation Coefficient and Permutation Entropy. Entropy 2020, 22, 468. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Rao, R. Wavelet Transforms. In Encyclopedia of Imaging Science and Technology; Wiley: New York City, NY, USA, 2002. [Google Scholar]
  3. Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.-C.; Tung, C.C.; Liu, H.H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 1998, 454, 903–995. [Google Scholar] [CrossRef]
  4. Wu, Z.; Huang, N.E. Ensemble empirical mode decomposition: A noise-assisted data analysis method. Adv. Adapt. Data Anal. 2009, 1, 1–41. [Google Scholar] [CrossRef]
  5. Dragomiretskiy, K.; Zosso, D. Variational mode decomposition. IEEE Trans. Signal Process. 2013, 62, 531–544. [Google Scholar] [CrossRef]
  6. Zhang, X.; Wang, J.; Lin, L. Feature extraction of ship radited noises based on wavelet transform. Acta Acust. Peking 1997, 22, 139–144. [Google Scholar]
  7. Shih, M.-T.; Doctor, F.; Fan, S.-Z.; Jen, K.-K.; Shieh, J.-S. Instantaneous 3D EEG signal analysis based on empirical mode decomposition and the hilbert–huang transform applied to depth of anaesthesia. Entropy 2015, 17, 928–949. [Google Scholar] [CrossRef] [Green Version]
  8. Xie, P.; Yang, F.-M.; Li, X.-X.; Yang, Y.; Chen, X.-L.; Zhang, L.-T. Functional coupling analyses of electroencephalogram and electromyogram based on variational mode decomposition-transfer entropy. Acta Phys. Sin. 2016, 65, 118701. [Google Scholar]
  9. Taran, S.; Bajaj, V. Clustering variational mode decomposition for identification of focal EEG signals. IEEE Sens. Lett. 2018, 2, 1–4. [Google Scholar] [CrossRef]
  10. Hsieh, N.-K.; Lin, W.-Y.; Young, H.-T. High-speed spindle fault diagnosis with the empirical mode decomposition and multiscale entropy method. Entropy 2015, 17, 2170–2183. [Google Scholar] [CrossRef] [Green Version]
  11. Zhang, X.; Liang, Y.; Zhou, J. A novel bearing fault diagnosis model integrated permutation entropy, ensemble empirical mode decomposition and optimized SVM. Measurement 2015, 69, 164–179. [Google Scholar] [CrossRef]
  12. Lei, Y.; He, Z.; Zi, Y. Application of the EEMD method to rotor fault diagnosis of rotating machinery. Mech. Syst. Signal Process. 2009, 23, 1327–1338. [Google Scholar] [CrossRef]
  13. Li, Y.; Li, Y.; Chen, X.; Yu, J. A novel feature extraction method for ship-radiated noise based on variational mode decomposition and multi-scale permutation entropy. Entropy 2017, 19, 342. [Google Scholar] [CrossRef] [Green Version]
  14. Yang, H.; Zhao, K.; Li, G. A new ship-radiated noise feature extraction technique based on variational mode decomposition and fluctuation-based dispersion entropy. Entropy 2019, 21, 235. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Bandt, C.; Pompe, B. Permutation entropy: A natural complexity measure for time series. Phys. Rev. Lett. 2002, 88, 174102. [Google Scholar] [CrossRef] [PubMed]
  16. Tian, Y.; Wang, Z.; Lu, C. Self-adaptive bearing fault diagnosis based on permutation entropy and manifold-based dynamic time warping. Mech. Syst. Signal Process. 2019, 114, 658–673. [Google Scholar] [CrossRef]
  17. de Araujo, F.H.A.; Bejan, L.; Rosso, O.A.; Stosic, T. Permutation entropy and statistical complexity analysis of Brazilian agricultural commodities. Entropy 2019, 21, 1220. [Google Scholar] [CrossRef] [Green Version]
  18. Henry, M.; Judge, G. Permutation entropy and information recovery in nonlinear dynamic economic time series. Econometrics 2019, 7, 10. [Google Scholar] [CrossRef] [Green Version]
  19. Li, R.; Wang, J. Interacting price model and fluctuation behavior analysis from Lempel–Ziv complexity and multi-scale weighted-permutation entropy. Phys. Lett. A 2016, 380, 117–129. [Google Scholar] [CrossRef]
  20. Fadlallah, B.; Chen, B.; Keil, A.; Príncipe, J. Weighted-permutation entropy: A complexity measure for time series incorporating amplitude information. Phys. Rev. E 2013, 87, 022911. [Google Scholar] [CrossRef] [Green Version]
  21. Wu, E.Q.; Zhu, L.-M.; Zhang, W.-M.; Deng, P.-Y.; Jia, B.; Chen, S.-D.; Ren, H.; Zhou, G.-R. Novel Nonlinear Approach for Real-Time Fatigue EEG Data: An Infinitely Warped Model of Weighted Permutation Entropy. IEEE Trans. Intell. Transp. Syst. 2019, 21, 2437–2448. [Google Scholar] [CrossRef]
  22. Wu, P.; Guo, L.; Duan, Y.; Zhou, W.; He, G. Control loop performance monitoring based on weighted permutation entropy and control charts. Can. J. Chem. Eng. 2019, 97, 1488–1495. [Google Scholar] [CrossRef]
  23. Wold, S.; Esbensen, K.; Geladi, P. Principal component analysis. Chemom. Intell. Lab. Syst. 1987, 2, 37–52. [Google Scholar] [CrossRef]
  24. Reza, M.S.; Ma, J. ICA and PCA Integrated Feature Extraction for Classification. In Proceedings of the IEEE 13th International Conference on Signal Processing (ICSP), Chengdu, China, 6–10 November 2016. [Google Scholar]
  25. Tharwat, A.; Gaber, T.; Ibrahim, A.; Hassanien, A.E. Linear discriminant analysis: A detailed tutorial. AI Commun. 2017, 30, 169–190. [Google Scholar] [CrossRef] [Green Version]
  26. Zhang, Z.; Zha, H. Principal Manifolds and Nonlinear Dimension Reduction via Local Tangent Space Alignment; Tech Rep CSE-02-019; Department of Computer Science, Pennsylvania State University: University Park, PA, USA, 2002. [Google Scholar]
  27. Ma, P.; Zhang, H.; Fan, W.; Wang, C. Early fault detection of bearings based on adaptive variational mode decomposition and local tangent space alignment. Eng. Comput. 2019, 36, 509–532. [Google Scholar] [CrossRef]
  28. Li, S.; Qian, P.; Gu, H. Fault Detection of Discriminant Improved Local Tangent Space Alignment. In Proceedings of the International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS), Shanghai, China, 21–24 November 2019. [Google Scholar]
  29. Ye, Y.; Zhang, Y.; Wang, Q.; Wang, Z.; Teng, Z.; Zhang, H. Fault diagnosis of high-speed train suspension systems using multiscale permutation entropy and linear local tangent space alignment. Mech. Syst. Signal Process. 2020, 138, 106565. [Google Scholar] [CrossRef]
  30. Zhang, N.; Sun, Y.; Zhang, Y.; Yang, P.; Lin, A.; Shang, P. Distinguishing Stock Indices and Detecting Economic Crises Based on Weighted Symbolic Permutation Entropy. Fluct. Noise Lett. 2019, 18, 1950026. [Google Scholar] [CrossRef]
  31. Zheng, J.; Pan, H.; Yang, S.; Cheng, J. Generalized composite multiscale permutation entropy and Laplacian score based rolling bearing fault diagnosis. Mech. Syst. Signal Process. 2018, 99, 229–243. [Google Scholar] [CrossRef]
  32. Santos-Domínguez, D.; Torres-Guijarro, S.; Cardenal-López, A.; Pena-Gimenez, A. ShipsEar: An underwater vessel noise database. Appl. Acoust. 2016, 113, 64–69. [Google Scholar] [CrossRef]
Figure 1. The flowchart of the proposed method.
Figure 1. The flowchart of the proposed method.
Sensors 22 00112 g001
Figure 2. Modeled signals and decomposition results based on empirical mode decomposition (EMD) and ensemble empirical mode decomposition (EEMD) methods. (a) the initial modeled signals; (b) EMD procedure; (c) EEMD procedure.
Figure 2. Modeled signals and decomposition results based on empirical mode decomposition (EMD) and ensemble empirical mode decomposition (EEMD) methods. (a) the initial modeled signals; (b) EMD procedure; (c) EEMD procedure.
Sensors 22 00112 g002
Figure 3. The variance curve of intrinsic mode functions (IMFs) center frequency with mode number K after simulated signals decomposition using variational mode decomposition (VMD).
Figure 3. The variance curve of intrinsic mode functions (IMFs) center frequency with mode number K after simulated signals decomposition using variational mode decomposition (VMD).
Sensors 22 00112 g003
Figure 4. The EVMD decomposition results for the simulated signals.
Figure 4. The EVMD decomposition results for the simulated signals.
Sensors 22 00112 g004
Figure 5. The time domain diagrams of Gaussian white noise and the signal plus additive pulse series.
Figure 5. The time domain diagrams of Gaussian white noise and the signal plus additive pulse series.
Sensors 22 00112 g005
Figure 6. The curve of PE and WPE with window.
Figure 6. The curve of PE and WPE with window.
Sensors 22 00112 g006
Figure 7. The time domain waveforms for; (a) 1/f noise (b) the signal plus additive pulse series.
Figure 7. The time domain waveforms for; (a) 1/f noise (b) the signal plus additive pulse series.
Sensors 22 00112 g007
Figure 8. The scatter plots of calculation results for PE and WPE.
Figure 8. The scatter plots of calculation results for PE and WPE.
Sensors 22 00112 g008
Figure 9. The time-domain waveforms for the three types of normalized ship−radiated noise (SRN).
Figure 9. The time-domain waveforms for the three types of normalized ship−radiated noise (SRN).
Sensors 22 00112 g009
Figure 10. PE, WPE comparison of the three types of SRN under different embedding dimensions.
Figure 10. PE, WPE comparison of the three types of SRN under different embedding dimensions.
Sensors 22 00112 g010
Figure 11. PE and WPE of the signals varying with the time delay.
Figure 11. PE and WPE of the signals varying with the time delay.
Sensors 22 00112 g011
Figure 12. The EEMD results for SRN signals.
Figure 12. The EEMD results for SRN signals.
Sensors 22 00112 g012
Figure 13. The results of variance analysis for the SRN signals.
Figure 13. The results of variance analysis for the SRN signals.
Sensors 22 00112 g013
Figure 14. The EVMD results for SRN signals.
Figure 14. The EVMD results for SRN signals.
Sensors 22 00112 g014aSensors 22 00112 g014b
Figure 15. The mean and standard deviation of WPE.
Figure 15. The mean and standard deviation of WPE.
Sensors 22 00112 g015
Figure 16. The clustering results of the four algorithms.
Figure 16. The clustering results of the four algorithms.
Sensors 22 00112 g016aSensors 22 00112 g016b
Figure 17. The scatter plots of the three algorithms.
Figure 17. The scatter plots of the three algorithms.
Sensors 22 00112 g017
Figure 18. The outputs of classification under different algorithms.
Figure 18. The outputs of classification under different algorithms.
Sensors 22 00112 g018
Table 1. The mode number corresponding to the distribution of IMF’s center frequency by VMD.
Table 1. The mode number corresponding to the distribution of IMF’s center frequency by VMD.
K Center Frequency/Hz
27.7952.50
37.7752.50310.22
47.7352.51193.88358.92
57.7152.52178.13306.10429.27
67.6352.53132.93231.63325.67438.33
75.0455.1430.02178.15281.03365.89468.10
85.0455.1230.02150.48230.17304.96377.06470.06
95.0455.1130.02133.53202.41280.99349.96413.83476.09
105.0455.0930.01122.45181.76240.15300.08359.29418.56477.08
115.0355.0930.01118.11173.42227.27276.65321.81369.88424.40478.29
125.0355.0530.0199.61146.79192.70237.79283.66325.96372.20425.66468.56
Table 2. Correlation coefficients between corresponding IMF with simulated signals.
Table 2. Correlation coefficients between corresponding IMF with simulated signals.
EMDEEMDVMDEVMD
f1IMF6: 0.9244IMF7: 0.9665IMF1: 0.9938IMF1: 0.9944
f2IMF4: 0.8959IMF5: 0.9806IMF3: 0.9914IMF3: 0.9925
f3IMF3: 0.8875IMF4: 0.9460IMF2: 0.9872IMF2: 0.9880
Table 3. The classification accuracy and computational complexity under different algorithms.
Table 3. The classification accuracy and computational complexity under different algorithms.
MethodNUMBER OF MISCLASSIFIED SAMPLESACCURACY RATE (%)COMPUTATIONAL
TIMES (SECOND)
CLASS ACLASS BCLASS C
WPE1172762.5787.35041
EMD-WPE-LTSA119281.66676956.81681
EEMD-WPE-LTSA01209023,643.2661
EVMD-WPE-PCA2559022,254.0133
EVMD-WPE-MDS4209522,255.2634
EVMD-WPE-LLE14095.833322,255.0276
EVMD-WPE-LTSA30196.666722,255.0041
EMD-EIMF-PE50889.16671342.94618
VMD-WPE-LTSA35093.333312,404.5999
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Esmaiel, H.; Xie, D.; Qasem, Z.A.H.; Sun, H.; Qi, J.; Wang, J. Multi-Stage Feature Extraction and Classification for Ship-Radiated Noise. Sensors 2022, 22, 112. https://doi.org/10.3390/s22010112

AMA Style

Esmaiel H, Xie D, Qasem ZAH, Sun H, Qi J, Wang J. Multi-Stage Feature Extraction and Classification for Ship-Radiated Noise. Sensors. 2022; 22(1):112. https://doi.org/10.3390/s22010112

Chicago/Turabian Style

Esmaiel, Hamada, Dongri Xie, Zeyad A. H. Qasem, Haixin Sun, Jie Qi, and Junfeng Wang. 2022. "Multi-Stage Feature Extraction and Classification for Ship-Radiated Noise" Sensors 22, no. 1: 112. https://doi.org/10.3390/s22010112

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop