A Novel Underdetermined Blind Source Separation Method and Its Application to Source Contribution Quantitative Estimation

To identify the major vibration and radiation noise, a source contribution quantitative estimation method is proposed based on underdetermined blind source separation. First, the single source points (SSPs) are identified by directly searching the identical normalized time-frequency vectors of mixed signals, which can improve the efficiency and accuracy in identifying SSPs. Then, the mixing matrix is obtained by hierarchical clustering, and source signals can also be recovered by the least square method. Second, the optimal combination coefficients between source signals and mixed signals can be calculated based on minimum redundant error energy. Therefore, mixed signals can be optimally linearly combined by source signals via the coefficients. Third, the energy elimination method is used to quantitatively estimate source contributions. Finally, the effectiveness of the proposed method is verified via numerical case studies and experiments with a cylindrical structure, and the results show that source signals can be effectively recovered, and source contributions can be quantitatively estimated by the proposed method.


Introduction
Vibration and radiation noise have a significant effect on the safety and stability of some mechanical systems [1,2], for example, excessive noise of underwater vehicle will interfere with its own detection accuracy. Independently acquiring information from each source of the mechanical system can help to quickly judge its running state. However, in practice, the information measured by sensors is the superposition of some sources, because different components of the mechanical system will interfere with each other, which makes it difficult to directly measure the source information [3]. Therefore, some supplementary signal processing methods are needed to further process the collected information to obtain the expected source signals [1]. Among post-processing approaches, blind source separation (BSS) has demonstrated its usefulness in separating sources from mixed signals, due to its simplicity and effectiveness. More importantly, BSS can be utilized without the structure models and the transmission paths that are difficult to be obtained, and therefore BSS has been widely used in practice [4][5][6][7][8][9]. However, most of these methods are mainly designed for (over)determined BSS where the number of sensors is no smaller than that of sources, and thus they may fail when dealing with underdetermined cases. Therefore, we mainly address the problem of underdetermined BSS (UBSS) where the number of mixed signals is smaller than that of sources in this study. In addition, reducing the vibration of the major sources rather than all sources can achieve satisfactory results with smaller cost [1,10]. Therefore, how to evaluate source contribution quantitatively is another problem to be addressed in our study.

Basic Theory
The linear instantaneous mixed model of UBSS can be expressed as where x(t) = [x 1 (t), x 2 (t), . . . , x N (t)] T and s(t) = [s 1 (t), s 2 (t), . . . , s M (t)] T are the mixed vector and the source vector in the time domain, respectively, and · T represents the transpose operation; N and M (N < M) are the number of mixed signals and source signals, respectively; A = [a 1 , a 2 , . . . , a M ] is the mixing matrix with a i as its column. The aim of UBSS is to estimate source signals without any prior information of s(t) or A, except that N < M.
To increase the sparsity of source signals, the above linear instantaneous mixed model can be transformed into the TF domain as Equation (2) or Equation (3) by STFT.

Proposed Mixing Matrix Estimation Method
The ideal goal of UBSS is to estimate source signals without any prior information, except that N < M. Actually, it seems almost impossible to obtain an effective estimation of source signals if we know nothing about s(t) or A. Therefore, some assumptions are given first.

Assumption 1.
For each source signal s i (t), there are some TF points (t, f ) where only S i (t, f ) is dominant, i.e., |S i (t, f )| S j (t, f ) , ∀ j = i.

Assumption 2.
Source signals are mutually independent. Assumption 3. Any N × N submatrix of the mixing matrix A is of full rank.
These three assumptions could hold in many practical cases and have been widely used in recent UBSS methods. Assumption 1 is used to guarantee the existence of SSPs in the process of mixing matrix estimation. However, Assumption 1 is not necessary for recovering source signals, that is, if the mixing matrix is known or can be estimated by other methods, Assumption 1 can be removed. Assumption 2 and Assumption 3 are used to increase the stability of the SSPs identification method. Assumption 2 is also used to guarantee that there is no cross energy among source signals, which will be used in the source contribution estimation. Besides, Assumption 3 is also used to guarantee that all source signals could be correctly recovered. Now, at any TF point, say (u, v), if only one source is active, say S i (u, v), i.e., (u, v) is an SSP corresponding to s i (t), then Equation (3) can be rewritten as Equation (4) shows that the TF vector of mixed signals at TF point (u, v) is collinear with the i-th column of the mixing matrix. It can be also obtained that all TF vectors of mixed signals at SSPs corresponding to s i (t) will be collinear with a i , that is, all SSPs corresponding to the same active source could be linearly represented by each other. Assume that S i (ψ, ω) is also an SSP corresponding to s i (t), then we will obtain where r is a real coefficient. Now, the problem is how to identify the TF vectors that satisfy Equation (5) from all TF vectors of mixed signals.
From Equation (4), X(u, v) can be normalized as where represents the normalized vector and 2 is the 2-norm. Similarly, the normalized vector of X(ψ, ω) can also be written as X(ψ, ω) = a i As shown in Equations (6) and (7), all the normalized TF vectors at the SSPs corresponding to the i-th dominant source will be equal to the normalized vector of a i . Therefore, SSPs can be identified by searching the identical normalized TF vectors of mixed signals, i.e., Equations (5) and (8) are equivalent. Therefore, SSPs can be identified by checking whether normalized TF vectors are identical or not. As all vectors have been normalized, they will be identical if the directions of vectors are the same. The cosine of the angle between X(u, v) and X(ψ, ω) can be calculated by where X(u, v), X(ψ, ω) is the scalar product of X(u, v) and X(ψ, ω). Therefore, Equation (8) will hold if Noise effect is not considered in the above derivation. In noisy environments, we cannot find SSPs that exactly satisfies Equation (10). Instead, we can get SSPs from the following criterion: where |·| is the absolute value of ·, and δ 1 is the SSPs threshold close to zero. Therefore, both X(u, v) and X(ψ, ω) are regarded as SSPs if they satisfy Equation (11). As stated in [20], most of the signal energy will be concentrated in nearly 10% of the frequency bins. Therefore, in our study, the frequency bins are sorted in descending order according to their variances and only N f frequency bins with larger variance are selected for identifying SSPs. Moreover, we recommend that the data could be segmented when the sampling length is very large, and the results obtained in different segments can be combined by the similarity of the signal itself.
SSPs threshold δ 1 has a large effect on the accuracy of SSPs and we now discuss how to choose the SSPs threshold. If δ 1 is too small, the accuracy of SSPs will increase, however, the number of identified SSPs each time will decrease so that the efficiency of identifying enough SSPs will decline. Too small thresholds will even lead to insufficient SSPs. Otherwise, if δ 1 is too large, the criterion becomes loose and too many outliers will be misjudged as SSPs, which will reduce the accuracy of mixing matrix estimation and source recovery. Since δ 1 is related to the property of source signals, it is hard to give a unified range for all kinds of signals. A feasible approach that considers both efficiency and accuracy is to set a smaller threshold δ 1 and a minimum number N min-SSPs of identified SSPs. If the number N SSPs of extracted SSPs is smaller than N min-SSPs , the threshold will be doubled. When source signals contain large noise or source signals are not very sparse in the TF domain, the threshold will gradually increase, which can reduce the effect of using an unsuitable threshold.
In general, TF vectors with negligible energy are greatly influenced by noise, which will easily lead to the misjudgment of SSPs. To obtain effective SSPs, these vectors should be removed before identifying SSPs if they satisfy where δ 2 is a threshold close to zero and X 2 represents the average of 2-norm of all TF vectors. After identifying SSPs, the next stage is to estimate the mixing matrix. It can be found from Equations (6) and (7) that the identified SSPs are the set of normalized column vectors of the mixing matrix. Therefore, the mixing matrix can be estimated by clustering these TF vectors and the hierarchical clustering technique [21,22] is used here. It should be noted that this clustering technique may not be the best algorithm to cluster SSPs as other algorithms can also be used [23]. More details on adjusting the cluster number can be found in [17]. As studied in [17], the mixing matrix estimation error can be further reduced by removing the points which are away from the mean direction of the cluster. This strategy is also used in our study and the outlier detection rule is the same as [17]. By re-clustering SSPs after elimination of the outliers, each column ofÂ can be obtained via calculating the center of each cluster.

Source Recovery
Assumption 4. At each TF point, the number of source signals is smaller than that of mixed signals.
Even thoughÂ is known, the solution of the system in Equation (1) is not unique. Actually, source signals can be recovered by a series of least square problems [19] with Assumption 3 and Assumption 4, which minimizes the error function by selecting the optimal N × (N − 1) submatrix ofÂ. Let A be a set composed of all N × (N − 1) submatrices ofÂ, that is Then for each TF point (t, k), there exists A * = â φ 1 ,â φ 2 , . . . ,â φ N−1 that satisfies where † is the pseudo-inverse of a matrix. Then, source signals can be estimated bŷ where e = [e 1 , e 2 , . . . , e N−1 ] T = A † * X(t, k), and A * can be obtained by Finally, the time domain of the estimated source signalsŜ(t) can be easily obtained by inverse STFT.

Proposed Source Contribution Estimation Method
Unlike the determined BSS, mixed signals in UBSS usually cannot be linearly represented by the estimated source signals, due to noise and separation errors, i.e., there exists a residual between mixed signals and the estimated source signals. Therefore, the i-th mixed signal x i can be expressed as where w i = [w i1 , w i2 , . . . , w iM ] T represents the coefficients and z i represents the residual. It should be noted that 1×T represents the whole discrete sequence of the i-th mixed signal andŜ = s T 1 , s T 2 , . . . , s T M T is also the whole discrete sequences of M estimated source signals, 1×T represents the i-th residual signal that has the same dimension as x i . Then, a problem arises that how muchŜ is contained in x i . This problem can be addressed based on minimum redundant error energy, i.e., Therefore, the problem can be transformed into how to find the optimal coefficients in Equation (18).
Let f (w i ) = z i 2 2 , it is easy to see that f (w i ) is a continuous differential function. From Equation (17), we can obtain The derivative of f (w i ) with respect to w i can be calculated by Let ∇ w i [ f (w i )] = 0, we will have From Assumption 2, source signals are mutually independent, therefore, the estimated source signals are also approximately mutually independent, that is, the rank ofŜ is M. Therefore, the rank ofŜŜ T is also M, that is,ŜŜ T is a matrix with full rank. Thus, f (w i ) has only one stationary point. The The Hessian matrix of f (w i ) is nonnegative definite. Therefore, from Equations (21) and (22), the minimum value of f (w i ) can be obtained at w i * = ŜŜ T −1Ŝ x T i . Based on the above analysis, the optimal combination coefficients ofŜ to all mixed signals can be obtained. Then, source contributions can be quantitatively estimated using w * . The following equation can be obtained.
x i−j = x i − w i * , jŝj , i = 1, 2, . . . , N and j = 1, 2, . . . , M, where x i−j represents the vector of x i that subtracts the contribution ofŝ j , and w i * , j is the j-th element of w i * . From Assumption 2, there is no cross energy among source signals, then the contribution C ij of the j-th estimated source signalsŝ j to the i-th mixed signals x i are calculated according to Generally, due to the noise and estimation error, the sum of the contributions of all estimated source signals to a mixed signal is not equal to 1, which is different from complete BSS.
, which implies that the j-th estimated source signals can decrease the overall vibration energy.
The flowchart of the proposed UBSS-based source contribution estimation method is shown in Figure 1.
Based on the above analysis, the optimal combination coefficients of Ŝ to all mixed signals can be obtained. Then, source contributions can be quantitatively estimated using  w . The following equation can be obtained. ,ˆ, 1, 2, , and 1, 2, , Generally, due to the noise and estimation error, the sum of the contributions of all estimated source signals to a mixed signal is not equal to 1, which is different from complete BSS. From xx , which implies that the j-th estimated source signals can decrease the overall vibration energy.
The flowchart of the proposed UBSS-based source contribution estimation method is shown in Figure 1.

Performance of the Proposed UBSS method
In this section, we mainly evaluate the separation performance of the proposed UBSS method with different sample sizes and different numbers of mixed signals. Some numerical case studies

Performance of the Proposed UBSS Method
In this section, we mainly evaluate the separation performance of the proposed UBSS method with different sample sizes and different numbers of mixed signals. Some numerical case studies are conducted using five artificial source signals: s 1 (t) is a low frequency sinusoidal wave; s 2 (t) is a high frequency sinusoidal wave; s 3 (t) is a periodic wave with amplitude modulation; s 4 (t) is a shock attenuation signal wave; s 5 (t) is also a periodic wave with amplitude modulation. The generating functions of the source signals are listed as follows: Two, three and four mixed signals are generated by these five source signals. In each case, the averages of 50 Monte Carlo simulations are used to evaluate the performance of the proposed method, and in each simulation, Gaussian white noise with SNR = 10 dB is independently added to each source signal. The sampling frequency is 10 kHz. In the proposed method, the window length is 1024 and window overlap is 256, the number of selected frequency bins N f = 80, initial SSPs threshold δ 1 = 0.0001, minimum number of SSPs N min-SSPs = 300 and energy threshold δ 2 = 0.1.
To quantitatively verify the better performance of the proposed method, SNRs ofÂ andŝ(t) are calculated by Equations (26) and (27), respectively.
where a i andâ i are the i-th column of A andÂ, respectively.
where ς is a scalar that reflects the scalar indeterminacies. The average SNRs of the estimated mixing matrix and the estimated source signals are shown in Figure 2a,b, respectively. From Figure 2a, the average SNRs of the estimated mixing matrix will increase with the increase in sample sizes. However, from Figure 2b, the average SNRs of the estimated source signals remained nearly unchanged with the increase in sample size. This is because the average SNRs of the estimated mixing matrix have been more than 40 dB when the sample size is 10,000, which means the mixing matrix is nearly the same as true mixing matrix. It can also be seen from Figure 2 that the separation performance also improves with the increase in the number of mixed signals. Though the average SNRs of the estimated mixing matrix with two mixtures are nearly the same as that with three mixtures, the average SNRs of the estimated source signals differ a lot in these two cases. That is because the number of source signals must be smaller than that of mixed signals at each TF point according to Assumption 4, which means that at most one source exists at each TF point in the case of two mixtures. This restriction is too strict, leading to worse separation performance in source signals.

Performance of the Proposed Source Contribution Estimation Method
In order to validate the effectiveness of the proposed source contribution quantitative estimation method, the following simulations are conducted. Source signals are the first four signals in Equation (25) The sampling frequency and sampling length is 10 kHz and 1 s, respectively. One hundred Monte Carlo simulations are conducted to evaluate the performance of the proposed method. In each simulation, Gaussian white noise is independently added into each source signal and each mixed signal with SNR = 10 dB and SNR = 15 dB, respectively.
The performance of the proposed UBSS method is compared with Reju's method [17] and Zhen's method [19]. Since Reju's method is designed only for mixing matrix estimation, it cannot recover source signals. Therefore, the mixing matrix estimated by Reju's method is then inputted into Zhen's method to estimate source signals. The parameters in different methods are as follows. In all methods, the Hanning window is used in STFT, and the window length is 1024 and window overlap is 256. In Zhen's method, regularization parameter 0.001 λ = and energy threshold

Performance of the Proposed Source Contribution Estimation Method
In order to validate the effectiveness of the proposed source contribution quantitative estimation method, the following simulations are conducted. Source signals are the first four signals in Equation (25). The mixing matrix are The sampling frequency and sampling length is 10 kHz and 1 s, respectively. One hundred Monte Carlo simulations are conducted to evaluate the performance of the proposed method. In each simulation, Gaussian white noise is independently added into each source signal and each mixed signal with SNR = 10 dB and SNR = 15 dB, respectively.
The performance of the proposed UBSS method is compared with Reju's method [17] and Zhen's method [19]. Since Reju's method is designed only for mixing matrix estimation, it cannot recover source signals. Therefore, the mixing matrix estimated by Reju's method is then inputted into Zhen's method to estimate source signals. The parameters in different methods are as follows. In all methods, the Hanning window is used in STFT, and the window length is 1024 and window overlap is 256. In Zhen's method, regularization parameter λ = 0.001 and energy threshold δ 2 = 0.1. In Reju's method, the parameter ∆θ is set as 1.5 • and the number of selected frequency bins N f = 80. The parameters in the proposed method are the same as those in Section 4.1.
One example of the separation results is as follows. Waveforms and Fourier spectrums of source signals are displayed in Figure 3, while the major frequencies of the source signals can be easily obtained from Figure 3b. The major frequencies of s 1 (t), s 2 (t), and s 4 (t) are 23 Hz, 281 Hz, and 43 Hz, respectively, while the major frequencies of s 3 (t) are 95 Hz and 115 Hz. Waveforms and Fourier spectrums of mixed signals are shown in Figure 4. From Figure 4a, mixed signals are the superposition of source signals, therefore, we cannot directly obtain the waveforms of source signals. From Figure 4b, the major frequencies of source signals can be found in each Fourier spectrum of mixed signals, and the frequencies of s 4 (t) is overwhelmed by those of other source signals. Therefore, signal processing methods are needed to estimate all source signals.
The absolute differences between A and Â are calculated in Equation (30), illustrating that the mixing matrix has been well estimated because each of the absolute differences is very small.
The absolute differences between A and Â are calculated in Equation (30), illustrating that the mixing matrix has been well estimated because each of the absolute differences is very small.
The absolute differences between A andÂ are calculated in Equation (30), illustrating that the mixing matrix has been well estimated because each of the absolute differences is very small. Source signals estimated by the proposed method, Zhen's method and Reju's method are displayed in Figures 5-7, respectively. The order ofŝ(t) has been adjusted according to s(t). Comparing Figure 5a with Figure 3a, we could find that the waveforms ofŝ(t) are quite similar to those of s(t). From the Fourier spectrums ofŝ(t), the major frequencies of s(t) have been well recovered, which can validate the effectiveness of the proposed UBSS method. As revealed by Figure 6a, it seems that the waveforms of s(t) are also well recovered by Zhen's method. However, as shown in Figure 6b, there is interference frequency 23 Hz in the Fourier spectrums ofŝ 3 (t), and interference frequencies 95 Hz, 115 Hz and 281 Hz in the Fourier spectrums ofŝ 4 (t), which indicates that source signalŝ 3 (t) andŝ 4 (t) were not well estimated. It could be seen from Figure 7 that s 4 (t) is not estimated by Reju's method.
Average SNRs of 100 Monte Carlo simulations ofÂ estimated by different methods are listed in Table 1, from which we can see that SNRs ofÂ estimated by the proposed method are larger than those estimated by Zhen's method and Reju's method. Average SNRs of all columns of the mixing matrix estimated by Zhen's method, Reju's method and the proposed method is 18.12 dB, 32.41 dB and 40.65 dB, respectively, which implies that the proposed method could estimate the mixing matrix more accurately. Table 2 shows the average SNRs of 100 Monte Carlo simulations ofŝ(t) estimated by different methods. As can be seen in Table 2, all SNRs ofŝ(t) estimated by the proposed method are also larger than those estimated by Zhen's method and Reju's method. Average SNRs of all sources of Zhen's method, Reju's method and the proposed method are 8.41 dB, 9.17 dB and 11.66 dB, that is, the average SNR increments of all sources estimated by the proposed method are 38.72% and 27.18% when compared with Zhen's method and Reju's method, respectively. The above results tend to validate that the proposed UBSS method performs more effectively than Zhen's method and Reju's method.
Source signals estimated by the proposed method, Zhen's method and Reju's method are displayed in Figures 5-7, respectively. The order of ˆ( ) t s has been adjusted according to ( ) t s .
Comparing Figure 5a with Figure 3a, we could find that the waveforms of ˆ( ) t s are quite similar to those of ( ) t s . From the Fourier spectrums of ˆ( ) t s , the major frequencies of ( ) t s have been well recovered, which can validate the effectiveness of the proposed UBSS method. As revealed by Figure 6a, it seems that the waveforms of ( ) t s are also well recovered by Zhen's method. However, as shown in Figure 6b, there is interference frequency 23 Hz in the Fourier spectrums of 3 ( ) s t , and interference frequencies 95 Hz, 115 Hz and 281 Hz in the Fourier spectrums of 4 ( ) s t , which indicates that source signal 3 ( ) s t and 4 ( ) s t were not well estimated. It could be seen from Figure 7 Table 1, from which we can see that SNRs of Â estimated by the proposed method are larger than those estimated by Zhen's method and Reju's method. Average SNRs of all columns of the mixing matrix estimated by Zhen's method, Reju's method and the proposed method is 18.12 dB, 32.41 dB and 40.65 dB, respectively, which implies that the proposed method could estimate the mixing matrix more accurately.

SNR (dB)
Average SNR of All Sourceŝ s 1 (t)ŝ 2 (t)ŝ 3 (t)ŝ 4 (t) Zhen The running time is used to evaluate the efficiency of the methods. CPU of the computer is Inter Core i5-4590 of 3.30 GHz and RAM is 1333 MHz DDR3 of 16 GB. Average time costs of the proposed method, Zhen's method and Reju's method are 1.79 s, 14.22 s and 0.17 s, respectively. The main difference between these three methods is the process of SSPs identification, which is the main cause for a significant difference in time cost. Reju's method can identify SSPs according to single SSP, and only TF vectors in some frequency bins with a larger variance are selected for SSPs identification, therefore, time cost of Reju's method is the least. SSPs must be identified between two TF vectors in Zhen's method and the proposed method, which means more time consumption. However, SSPs are also identified in some frequency bins with a larger variance in the proposed method and they can be directly identified by searching the identical normalized TF vectors, instead of finding the sparsest coefficients. Therefore, the time cost of the proposed method is shorter than that of Zhen's method. Table 3 shows the average results of source contributions quantitative estimation using different methods, including also the real contributions. It can be clearly seen that source contributions of the proposed method are closer to the real source contributions than those of Zhen's method and Reju's method. The average absolute errors of source contributions are also calculated and listed in Table 4. As revealed by the data in Table 4, most of the average contribution errors of the proposed method are the smallest among these three methods, implying that the proposed method has higher accuracy in source contribution. All contribution errors of the proposed method are less than 1.80%, however, three contribution errors are larger than 10% in Zhen's method and three contribution errors are larger than 4% in Reju's method. Actually, the accurate source estimation is the premise for correct contribution estimation. Therefore, it can be concluded that the proposed method performs more effectively in recovering source signals and quantitatively estimating source contributions. Table 3. Average contribution comparison of estimated source signals.  Table 4. Comparison of average contribution errors of estimated source signals.

Experimental Study with Cylindrical Structure
Some practical mechanical systems or their partial sections have the shape of cylindrical shells, such as the underwater vehicles. Generally, the sound radiation of underwater vehicles strongly interferes with their performance and safety. Therefore, it is quite important for underwater vehicles to reduce their radiation noise to accomplish tasks successfully. Before that, it is necessary to estimate sound sources in advance. When the number of sensors is smaller than that of sources, UBSS is an excellent method to estimate sources in these cases. Therefore, a test bed with a cylindrical shell structure is used to examine the effectiveness of the proposed method.
In the experiments, an adjustable speed motor is used as a vibration source and an eccentric mass disc is driven by the motor to simulate the unbalanced vibration. Two loudspeakers are also used to simulate two radiated noise sources and two arbitrary waveform generators are used to produce two different source signals which are the inputs of these two loudspeakers, respectively. Besides, mixed signals are collected by four sound pressure sensors and are recorded by GEN2i high-speed data recorder. Schematic diagram and photos of the test site are displayed in Figures 8 and 9, respectively.

Experimental Study with Cylindrical Structure
Some practical mechanical systems or their partial sections have the shape of cylindrical shells, such as the underwater vehicles. Generally, the sound radiation of underwater vehicles strongly interferes with their performance and safety. Therefore, it is quite important for underwater vehicles to reduce their radiation noise to accomplish tasks successfully. Before that, it is necessary to estimate sound sources in advance. When the number of sensors is smaller than that of sources, UBSS is an excellent method to estimate sources in these cases. Therefore, a test bed with a cylindrical shell structure is used to examine the effectiveness of the proposed method.
In the experiments, an adjustable speed motor is used as a vibration source and an eccentric mass disc is driven by the motor to simulate the unbalanced vibration. Two loudspeakers are also used to simulate two radiated noise sources and two arbitrary waveform generators are used to produce two different source signals which are the inputs of these two loudspeakers, respectively. Besides, mixed signals are collected by four sound pressure sensors and are recorded by GEN2i high-speed data recorder. Schematic diagram and photos of the test site are displayed in Figure 8 and Figure 9, respectively.   The motor is running at 1740 r/m. Inputs of two loudspeakers, denoted by loudspeaker 1 and loudspeaker 2, respectively, are sine waves of 713 Hz and 917 Hz, respectively. The sampling length and the sampling frequency are 10 s and 5000 Hz, respectively. The second and the fourth mixed signals are selected to estimate three source signals and only a section of data from 4 s to 6 s is used. Waveforms and Fourier spectrums of mixed signals are displayed in Figure 10. From Figure 10a, mixed signals are the superposition of source signals, therefore, we cannot directly obtain waveforms of source signals from mixed signals. From Figure 10b, the major frequencies of source signals can be found in each Fourier spectrum of mixed signals. Therefore, mixed signals need to be further processed to obtain source signals. Source signals (displayed only from 4.5 s to 5 s) estimated by the proposed method, Zhen's method and Reju's method are illustrated in Figure 11-13, respectively. As revealed in Figure 11b, the major frequencies of source signals estimated by the proposed method are 29 Hz, 937 Hz and 713 Hz, respectively, which are consistent with the frequencies set in the experiment. However, from Figure 12, both the major frequency 29 Hz of the motor and the major frequency 713 Hz of the loudspeaker 1 is mis-estimated in the same signal, as shown in the Fourier spectrum of the first separated signal. Therefore, 29 Hz and 713 Hz will be mistaken for coming from the same source using Zhen's method. The major frequencies of the first signal estimated by Reju's method are also The motor is running at 1740 r/m. Inputs of two loudspeakers, denoted by loudspeaker 1 and loudspeaker 2, respectively, are sine waves of 713 Hz and 917 Hz, respectively. The sampling length and the sampling frequency are 10 s and 5000 Hz, respectively. The second and the fourth mixed signals are selected to estimate three source signals and only a section of data from 4 s to 6 s is used. Waveforms and Fourier spectrums of mixed signals are displayed in Figure 10. From Figure 10a, mixed signals are the superposition of source signals, therefore, we cannot directly obtain waveforms of source signals from mixed signals. From Figure 10b, the major frequencies of source signals can be found in each Fourier spectrum of mixed signals. Therefore, mixed signals need to be further processed to obtain source signals. The motor is running at 1740 r/m. Inputs of two loudspeakers, denoted by loudspeaker 1 and loudspeaker 2, respectively, are sine waves of 713 Hz and 917 Hz, respectively. The sampling length and the sampling frequency are 10 s and 5000 Hz, respectively. The second and the fourth mixed signals are selected to estimate three source signals and only a section of data from 4 s to 6 s is used. Waveforms and Fourier spectrums of mixed signals are displayed in Figure 10. From Figure 10a, mixed signals are the superposition of source signals, therefore, we cannot directly obtain waveforms of source signals from mixed signals. From Figure 10b, the major frequencies of source signals can be found in each Fourier spectrum of mixed signals. Therefore, mixed signals need to be further processed to obtain source signals. Source signals (displayed only from 4.5 s to 5 s) estimated by the proposed method, Zhen's method and Reju's method are illustrated in Figure 11-13, respectively. As revealed in Figure 11b, the major frequencies of source signals estimated by the proposed method are 29 Hz, 937 Hz and 713 Hz, respectively, which are consistent with the frequencies set in the experiment. However, from Figure 12, both the major frequency 29 Hz of the motor and the major frequency 713 Hz of the loudspeaker 1 is mis-estimated in the same signal, as shown in the Fourier spectrum of the first separated signal. Therefore, 29 Hz and 713 Hz will be mistaken for coming from the same source using Zhen's method. The major frequencies of the first signal estimated by Reju's method are also Source signals (displayed only from 4.5 s to 5 s) estimated by the proposed method, Zhen's method and Reju's method are illustrated in Figures 11-13, respectively. As revealed in Figure 11b, the major frequencies of source signals estimated by the proposed method are 29 Hz, 937 Hz and 713 Hz, respectively, which are consistent with the frequencies set in the experiment. However, from Figure 12, both the major frequency 29 Hz of the motor and the major frequency 713 Hz of the loudspeaker 1 is mis-estimated in the same signal, as shown in the Fourier spectrum of the first separated signal. Therefore, 29 Hz and 713 Hz will be mistaken for coming from the same source using Zhen's method. The major frequencies of the first signal estimated by Reju's method are also 29 Hz and 713 Hz, as shown in Figure 13. The results tend to illustrate that source signals have been well estimated by the proposed method. 29 Hz and 713 Hz, as shown in Figure 13. The results tend to illustrate that source signals have been well estimated by the proposed method. Reju's method can identify SSP based on the character of single SSP, and the performance of this method will be degraded in noisy cases. In Zhen's method, to increase the computational efficiency, SSPs are identified only in some TF vectors that are randomly selected from TF vectors of mixed signals. If no or very few SSPs corresponding to a source are included in selected TF vectors, this source will be estimated with large error. And this may be the main reason why the performance of Zhen's method is not so good as that of the proposed method.   Reju's method can identify SSP based on the character of single SSP, and the performance of this method will be degraded in noisy cases. In Zhen's method, to increase the computational efficiency, SSPs are identified only in some TF vectors that are randomly selected from TF vectors of mixed signals. If no or very few SSPs corresponding to a source are included in selected TF vectors, this source will be estimated with large error. And this may be the main reason why the performance of Zhen's method is not so good as that of the proposed method.  After obtaining the estimated source signals, their contributions to the mixed signals can be calculated and are presented in Table 5. Real source contributions also need to be obtained by the experiment [1]. When one source is stopped, the decreased amount in vibration energy of mixed signals is observed by the sensors. The decreased amount is regarded as the source contribution of the stopped source. The real source contributions are also given in Table 5. We can find the source contributions of the proposed method are closer to the real source contributions than those of Zhen's method and Reju's method.
The absolute errors of source contributions are calculated and listed in Table 6. All contribution errors of the proposed method are smaller than those of Zhen's method and Reju's method, implying that the proposed method has higher accuracy in source contribution. The largest contribution error of the proposed method is only 6.44%, however, three contribution errors are larger than 12% in Zhen's method and four contribution errors are larger than 15% in Reju's method. Actually, accurate estimation of source signals is the precondition for accurate estimation of computational complexity. To some extent, estimation accuracy of contribution increases with the increase in source signal estimation accuracy. As shown in Figures 12 and 13, the first separated signals of Zhen's method and Reju's method contains major frequencies of two sources (motor and loudspeaker one). Therefore, the contributions of their first separated signal contain contributions of two real sources, which will lead to an increase in their source contributions. Besides, since part of the contributions of loudspeaker 1 is mis-assigned to their first separated signals, the contribution of their third separated signal will be smaller than the real contributions. Based on a more accurate estimation of source signals, the contribution errors of the proposed method are smaller than those of contrast methods.  Reju's method can identify SSP based on the character of single SSP, and the performance of this method will be degraded in noisy cases. In Zhen's method, to increase the computational efficiency, SSPs are identified only in some TF vectors that are randomly selected from TF vectors of mixed signals. If no or very few SSPs corresponding to a source are included in selected TF vectors, this source will be estimated with large error. And this may be the main reason why the performance of Zhen's method is not so good as that of the proposed method.
After obtaining the estimated source signals, their contributions to the mixed signals can be calculated and are presented in Table 5. Real source contributions also need to be obtained by the experiment [1]. When one source is stopped, the decreased amount in vibration energy of mixed signals is observed by the sensors. The decreased amount is regarded as the source contribution of the stopped source. The real source contributions are also given in Table 5. We can find the source contributions of the proposed method are closer to the real source contributions than those of Zhen's method and Reju's method. Table 5. Contribution comparison of estimated source signals. The absolute errors of source contributions are calculated and listed in Table 6. All contribution errors of the proposed method are smaller than those of Zhen's method and Reju's method, implying that the proposed method has higher accuracy in source contribution. The largest contribution error of the proposed method is only 6.44%, however, three contribution errors are larger than 12% in Zhen's method and four contribution errors are larger than 15% in Reju's method. Actually, accurate estimation of source signals is the precondition for accurate estimation of computational complexity. To some extent, estimation accuracy of contribution increases with the increase in source signal estimation accuracy. As shown in Figures 12 and 13, the first separated signals of Zhen's method and Reju's method contains major frequencies of two sources (motor and loudspeaker one). Therefore, the contributions of their first separated signal contain contributions of two real sources, which will lead to an increase in their source contributions. Besides, since part of the contributions of loudspeaker 1 is mis-assigned to their first separated signals, the contribution of their third separated signal will be smaller than the real contributions. Based on a more accurate estimation of source signals, the contribution errors of the proposed method are smaller than those of contrast methods. Table 6. Comparison of contribution errors of estimated source signals.

Mixed Signals Methods
Contribution Errors (%) Running time is also used to evaluate the efficiency of the methods. Time costs of the proposed method, Zhen's method and Reju's method are 1.86 s, 6.42 s and 0.32 s, respectively. Reju's method could identify SSPs by the property of single SSP, which can have higher efficiency than Zhen's method and the proposed method. The proposed method can identify SSPs only at some optimal frequency bins and SSPs are identified by directly searching the identical TF vectors in the selected frequency bins, which could the reason why the efficiency of the proposed method is higher than that of Zhen's method.
It should be noted that the proposed method is designed for off-line processing system because it needs some data to identify SSPs. However, for a real-time monitoring system, we can process the data in a piecewise way by the proposed method. From the experiment, the running time of the proposed method is only 1.86 s to process the data with a length of 2 s. Therefore, we can split the data into a fixed length segment and analyze them by the proposed method.
After source signals are well recovered and source contributions are calculated, the influences of sources on mixed signals can be determined. The vibration sources estimated by the proposed method can be used to machinery condition monitoring and fault diagnosis when source signals are difficult to be directly obtained. The main vibration sources can also be determined according to their contributions. Therefore, some measures can be taken to reduce the impact of the main vibration sources.

Conclusions
To identify the major vibration and noise sources of the mechanical systems, a novel source contribution quantitative estimation method is proposed for UBSS. The accuracy of the source contribution results relies largely on the accuracy of source recovery. Only by recovering source signals more accurately can we obtain higher accuracy of source contribution estimation. From the results of numerical case studies, the proposed method can not only estimate source signals from their mixtures in underdetermined cases, but also quantitatively estimate the source contributions with average deviations <2%. The results of experimental studies with a cylindrical structure also show the effectiveness of the proposed method in sources restoration and quantitative contribution estimation. The comparative results tend to validate that the proposed method performs more