A Comparative Study of Information-Based Source Number Estimation Methods and Experimental Validations on Mechanical Systems

Cheng, Wei; Zhang, Zhousuo; Cao, Hongrui; He, Zhengjia; Zhu, Guanwen

doi:10.3390/s140507625

by Wei Cheng *, Zhousuo Zhang, Hongrui Cao, Zhengjia He and Guanwen Zhu

State Key Laboratory for Manufacturing Systems Engineering, Xi'an Jiaotong University, Xi'an 710049, China

* Author to whom correspondence should be addressed.

Sensors 2014, 14(5), 7625-7646; https://doi.org/10.3390/s140507625

Submission received: 21 February 2014 / Revised: 17 April 2014 / Accepted: 18 April 2014 / Published: 25 April 2014

(This article belongs to the Section Physical Sensors)

Abstract

This paper investigates one eigenvalue decomposition-based source number estimation method and three information-based source number estimation methods, namely the Akaike Information Criterion (AIC), Minimum Description Length (MDL) and Bayesian Information Criterion (BIC), and improves BIC as Improved BIC (IBIC) to make it more efficient and easier to calculate. The performances of these methods are compared in numerical case studies, which comprise a linear superposition case and a case with both linear superposition and nonlinear modulation mixing. A test bed with three sound sources is constructed to test the performances of these methods on mechanical systems, and source separation is carried out to validate the experimental results. This work can benefit model order selection, complexity analysis of a system, and applications of source separation to mechanical systems for condition monitoring and fault diagnosis purposes.

Keywords: source number estimation; Akaike information criterion; minimum description length; improved Bayesian information criterion; eigenvalue decomposition

1. Introduction

In many physical systems, the measured signals can be modeled as a superposition of a finite number of sources with additive environmental noise, and many signal processing methods such as principal component analysis (PCA) [1], blind source separation (BSS) [2,3] and independent component analysis (ICA) [4–6] have benefited from this model and achieved wide use in engineering applications. A key preliminary issue for these methods is estimating the number of unknown sources from the mixed signals before an effective source separation. Furthermore, estimating the source number for mechanical systems remains challenging because of the complicated mixing of the sources and the transmission effects of the mechanical structures.

In the past decades, many researchers have studied source number estimation methods and their engineering applications, and have proposed many approaches to the problem. Ye et al. [7] studied the general BSS problem satisfying m ≥ n and validated the approach by computer simulations on artificially synthesized data. Fishler et al. [8] studied an MDL-type estimator that is robust against deviation from the assumption of equal noise level across the array. Bai et al. [9] proposed an information-based method to estimate the number of independent dipole sources from electroencephalograms (EEGs). Jiang et al. [10] proposed a new source number estimation method called the beam eigenvalue method (BEM). Huang et al. [11,12] proposed to utilize the minimum mean square error (MMSE) of the multistage Wiener filter to calculate the required description length for encoding the observed data, instead of relying on the eigenvalues of the data covariance matrix. Hu et al. [13] proposed a method for estimating the number and directions of sound sources in a multisource reverberant environment and gave experimental validations. Ma et al. [14] proposed a source number estimation method based on modified K-means clustering. Cheng et al. [15] proposed an independent component analysis-based source number estimation method and applied it to mechanical systems. Han et al. [16] proposed a source number estimation method based on uniform linear arrays (ULAs) and the newly proposed nested arrays. Dosso et al. [17] considered localizing an unknown number of ocean acoustic sources when the properties of the environment are poorly known. Sadhu et al. [18] proposed a decentralized model identification method utilizing the concepts of sparse blind source separation and parallel factor decomposition, which can solve underdetermined blind source separation problems.
All the above studies have provided effective ways to estimate the number of sources for different types of sources. However, studies on source number estimation for mechanical systems are very few, and much work remains before source number estimation methods can be applied effectively to mechanical systems.

Unlike the eigenvalue decomposition-based source number estimation method, which requires a threshold, information criteria-based methods need no user-set parameters to adaptively estimate the number of sources from the mixed signals, and the algorithms are also easier to calculate and perform efficiently in applications. The key issue for the information-based methods is to find the extremum of an objective function constructed from an information criterion, such as the Akaike Information Criterion (AIC) [19,20], Minimum Description Length (MDL) [21–23], or Bayesian Information Criterion (BIC) [24,25]. However, it has been shown that AIC suffers from computational problems [26]. MDL also suffers from a high computational load and performs well only in the presence of spatially and temporally white noise [27,28]. Furthermore, the estimation performance of both is sensitive to the signal-to-noise ratio (SNR) and the sampling length, and thus the results may be unreliable. Studies on source number estimation by BIC are rare, and the traditional BIC overflows in the calculation if the sampling length is too large. Furthermore, all three information-based methods are rarely applied to estimate the source number of mechanical systems, whose sources are normally mixed through linear superposition and nonlinear modulation.

Therefore, this paper compares the performances of both eigenvalue decomposition-based and information-based source number estimation methods on mechanical sound signals, and improves BIC as IBIC to make it easier to calculate and efficient for data with a large sampling length. Both linear superposition and nonlinear modulation are considered in the numerical case studies, and a test bed with three sources is constructed to test the performances of the eigenvalue decomposition-based and information-based methods on source number estimation for mechanical systems. This study can benefit model order selection, complexity analysis of a system, and applications of source separation to mechanical systems for condition monitoring and fault diagnosis purposes.

The remainder of this paper is organized as follows: in Section 2, we introduce the theoretical background and investigate the mathematical mechanisms of the eigenvalue decomposition-based source number estimation method and of the information-based source number estimation methods, namely AIC, MDL, and IBIC. In Section 3, we test the performances of these methods on typical mechanical signals with both a linear superposition and a nonlinear modulation. In Section 4, a test bed with three sound sources is constructed to further test the performances of these methods on real mechanical systems, and the experimental results are validated by source separation and spectral analysis. Finally, Section 5 summarizes the conclusions.

2. Source Number Estimation Methods

Consider m observed mixed signals X(t) = [x₁(t), ⋯, x_m(t)]^T measured at different locations and composed of n source signals S(t) = [s₁(t), ⋯, s_n(t)]^T. Assume that the observed signals can be described by the following model with white noise N(t) = [n₁(t), n₂(t), ⋯, n_m(t)]^T and mixing matrix A = {a_ij}_m×n:

\begin{array}{ll} x_{i} (t) = \sum_{j = 1}^{n} a_{i j} s_{j} (t) + n_{i} (t), & i = 1, \dots, m, \; j = 1, \dots, n \\ X (t) = A S (t) + N (t) & \text{(in matrix form)} \end{array}

(1)

As the source signals and the mixing mode are normally unknown for many physical systems, a crucial problem associated with this model is to estimate the number n of source signals from a finite set of N observations of x₁(t), ⋯, x_m(t) before an effective source separation.

Form the data matrix X = [X(t₁), ⋯, X(t_N)] from a finite set of N observations. The covariance matrix of X is then given by R = E[XX^T], where E[·] denotes the expectation operator. Denoting the eigenvalues of R by λ₁ ≥ λ₂ ≥ ⋯ ≥ λ_m, the likelihood function L(n), whose logarithm is used to find the maximum likelihood estimate of the source number n, is defined as follows:

L (n) = \frac{{(λ_{n + 1} λ_{n + 2} \dots λ_{m})}^{\frac{1}{m - n}}}{\frac{1}{m - n} (λ_{n + 1} + λ_{n + 2} + \dots + λ_{m})}

(2)
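As an illustration of how the quantities in Equation (2) can be obtained from data, the following is a minimal sketch, not the authors' implementation. It assumes NumPy, estimates R by a mean-centered sample covariance, and reads L(n) as the ratio of the geometric mean to the arithmetic mean of the m − n smallest eigenvalues. The function names, the random mixing matrix and the noise level are hypothetical.

```python
import numpy as np

def covariance_eigenvalues(X):
    """Eigenvalues of the sample covariance of X (shape m x N), largest first."""
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=1, keepdims=True)   # assumption: remove each channel's mean
    R = Xc @ Xc.T / X.shape[1]               # sample estimate of E[X X^T]
    return np.linalg.eigvalsh(R)[::-1]       # eigvalsh returns ascending order

def likelihood_ratio(eigvals, n):
    """L(n): geometric mean over arithmetic mean of the m - n smallest eigenvalues."""
    tail = np.asarray(eigvals, dtype=float)[n:]
    geometric = np.exp(np.mean(np.log(tail)))  # requires strictly positive eigenvalues
    arithmetic = np.mean(tail)
    return geometric / arithmetic

# Hypothetical data: 2 sources, 4 sensors, weak white sensor noise.
rng = np.random.default_rng(0)
S = rng.standard_normal((2, 5000))
A = rng.standard_normal((4, 2))
X = A @ S + 0.1 * rng.standard_normal((4, 5000))

lam = covariance_eigenvalues(X)
ratios = [likelihood_ratio(lam, n) for n in range(4)]
print(lam)
print(ratios)
```

In this sketch L(n) stays close to 1 once n reaches the true number of sources, because the remaining eigenvalues are nearly equal noise eigenvalues, and it is much smaller while signal eigenvalues are still included in the tail.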

(1) Source number estimation based on eigenvalue decomposition:

n^{*} = \max \{ n : λ_{n} \geq γ \}, \quad n = 1, 2, \dots, m

(3)

where γ is a threshold.

The benefit of eigenvalue decomposition is that the source number can be estimated from the distribution of the eigenvalues alone, and the only crucial step is choosing a reasonable threshold γ. However, different types of mixed signals have different eigenvalue distributions, so no single threshold γ suits all applications.
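The threshold rule can be sketched in a few lines. This is only an illustration: the eigenvalues below are hypothetical numbers, not values from the paper, and the function name is an assumption. It counts the eigenvalues that are at least γ, which for eigenvalues sorted in descending order is the largest n with λ_n ≥ γ.

```python
def estimate_by_threshold(eigvals, gamma):
    """Number of eigenvalues that are at least gamma."""
    return sum(1 for v in eigvals if v >= gamma)

# Hypothetical eigenvalues, largest first (illustrative only).
eigvals = [50.0, 30.0, 12.0, 4.0, 0.0101, 0.0099]

n_mid = estimate_by_threshold(eigvals, gamma=1.0)     # threshold between signal and noise
n_low = estimate_by_threshold(eigvals, gamma=0.001)   # threshold set too low
n_high = estimate_by_threshold(eigvals, gamma=20.0)   # threshold set too high
print(n_mid, n_low, n_high)  # 4 6 2
```

The three calls show why the choice of γ matters: the same eigenvalues give 4, 6 or 2 sources depending on where the threshold is placed.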

We now introduce and compare three information-based source number estimation methods, which can determine the source number adaptively.

(2) Akaike Information Criterion (AIC)

The information theoretic criterion for model order selection or source number estimation introduced by Akaike [19] selects the number of signals that gives the minimum AIC, defined by:

n^{*} = \underset{n}{arg min} [AIC (n)] = \underset{n}{arg min} [- 2 N (m - n) lg L (n) + 2 n (2 m - n)]

(4)

The first term, −2N(m−n)lgL(n), is the well-known log-likelihood of the maximum likelihood estimator of the parameters of the model. The second term, 2n(2m−n), is the bias correction term inserted so as to make AIC an unbiased estimator.
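A minimal sketch of Equation (4) is given below. It is an illustration under stated assumptions rather than the authors' code: "lg" is read here as the natural logarithm (a different base would change the weight of the first term relative to the penalty), L(n) is taken as the ratio of the geometric mean to the arithmetic mean of the m − n smallest eigenvalues, the search runs over n = 1, …, m − 1, and the eigenvalues and N are hypothetical.

```python
import math

def log_l(eigvals, n):
    """log L(n) for eigenvalues sorted largest first (all must be positive)."""
    tail = eigvals[n:]
    k = len(tail)
    log_geometric = sum(math.log(v) for v in tail) / k
    log_arithmetic = math.log(sum(tail) / k)
    return log_geometric - log_arithmetic

def aic(eigvals, n, num_samples):
    """AIC(n) = -2 N (m - n) log L(n) + 2 n (2 m - n)."""
    m = len(eigvals)
    return -2.0 * num_samples * (m - n) * log_l(eigvals, n) + 2.0 * n * (2 * m - n)

def estimate_aic(eigvals, num_samples):
    """Source number that minimizes AIC over n = 1, ..., m - 1."""
    return min(range(1, len(eigvals)), key=lambda n: aic(eigvals, n, num_samples))

# Hypothetical eigenvalues (largest first) and sample count.
eigvals = [50.0, 30.0, 12.0, 4.0, 0.0101, 0.0099]
N = 1000
aic_n_star = estimate_aic(eigvals, N)
print(aic_n_star)  # 4
```

With these numbers the likelihood term at n = 4 is about 0.2, so AIC(4) is about 64.2 against AIC(5) = 70. The margin is small, which illustrates how easily a slightly larger spread in the smallest eigenvalues can push the minimum to n = 5.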

(3) Minimum Description Length (MDL) [19]

Inspired by Akaike's work, Rissanen [21] proposed a different approach to select the model or estimate the source number based on the concept of the shortest code length for the data. It can be achieved by the following criterion:

n^{*} = \underset{n}{arg min} (MDL (n)) = \underset{n}{arg min} (- N (m - n) lg L (n) + \frac{1}{2} n (2 m - n) lg N)

(5)

Note that apart from a factor of 2, the first term is identical to the corresponding one in AIC, while the second term has an extra factor of ½ lgN.
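Equation (5) can be sketched in the same way. As before, this is a hedged illustration: "lg" is read as the natural logarithm, L(n) is taken as the geometric-to-arithmetic mean ratio of the m − n smallest eigenvalues, the search range is n = 1, …, m − 1, and the eigenvalues and N are hypothetical.

```python
import math

def log_l(eigvals, n):
    """log L(n) for eigenvalues sorted largest first (all must be positive)."""
    tail = eigvals[n:]
    k = len(tail)
    return sum(math.log(v) for v in tail) / k - math.log(sum(tail) / k)

def mdl(eigvals, n, num_samples):
    """MDL(n) = -N (m - n) log L(n) + 0.5 n (2 m - n) log N."""
    m = len(eigvals)
    fit = -num_samples * (m - n) * log_l(eigvals, n)
    penalty = 0.5 * n * (2 * m - n) * math.log(num_samples)
    return fit + penalty

def estimate_mdl(eigvals, num_samples):
    """Source number that minimizes MDL over n = 1, ..., m - 1."""
    return min(range(1, len(eigvals)), key=lambda n: mdl(eigvals, n, num_samples))

# Hypothetical eigenvalues (largest first) and sample count.
eigvals = [50.0, 30.0, 12.0, 4.0, 0.0101, 0.0099]
N = 1000
mdl_n_star = estimate_mdl(eigvals, N)
print(mdl_n_star)  # 4
```

Because the penalty grows with log N, the gap between MDL(4) and MDL(5) in this example (roughly 110.6 against 120.9) is wider than the corresponding AIC gap for the same eigenvalues.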

(4) Bayesian information criterion (BIC) [29]

Minka [24] proposed another criterion for estimating the dimensionality of the data (the source number) on the basis of Bayesian model selection. It is referred to as the Minka Bayesian model selection (MIBS) and is given by the following expression, where σ̃ is the positive scale parameter, λ denotes the eigenvalues of the covariance matrix of the mixed signals X, m is the number of mixed signals X, n is the number of sources S, and N is the data length:

MIBS (n) \approx p_{n} {(\prod_{j = 1}^{n} λ_{j})}^{- N / 2} {\tilde{σ}}_{n}^{- N (m - n)} {| A_{n} |}^{- 1 / 2} {(2 π)}^{(d_{n} + n) / 2} N^{- n / 2}

(6)

where:

\begin{array}{l} p_{n} = 2^{- n} ∏_{i = 1}^{n} Γ (\frac{m - i + 1}{2}) π^{- (m - i + 1) / 2} \\ | A_{n} | = ∏_{i = 1}^{n} ∏_{j = i + 1}^{m} ({\hat{λ}}_{j}^{- 1} - {\hat{λ}}_{i}^{- 1}) (λ_{i} - λ_{j}) N \\ {\hat{σ}}_{n}^{2} = (∑_{j = n + 1}^{m} λ_{j}) / (m - n) \\ d_{n} = m n - n (n + 1) / 2 \end{array}

(7)

and λ̂_j is identical to λ_j except for j > n, where λ̂_j = σ̂_n². In order to estimate the latent dimensionality of the data (the source number), we choose the value of n that maximizes Equation (6). The simplification of MIBS is the BIC approximation, which drops all terms that do not grow with N:

BIC (n) = {(\prod_{j = 1}^{n} λ_{j})}^{- N / 2} {\tilde{σ}}_{n}^{- N (m - n) / 2} N^{- (d_{n} + n) / 2}

(8)

In practice, evaluating Equation (8) overflows when N is large (and N is normally a large number). Therefore, we take the logarithm to overcome the overflow problem and call the result the improved BIC (IBIC), which significantly decreases the calculation time without loss of accuracy. The objective function of the IBIC is as follows:

\begin{array}{rl} n^{*} & = \underset{n}{arg max} (IBIC (n)) = \underset{n}{arg max} (lg (BIC (n))) \\ & = \underset{n}{arg max} (- \frac{N}{2} lg (∏_{j = 1}^{n} λ_{j}) - \frac{N (m - n)}{2} lg ({\tilde{σ}}_{n}) - \frac{(d_{n} + n)}{2} lg (N)) \end{array}

(9)
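The log-domain objective in Equation (9) can be sketched as follows. This is an illustration under explicit assumptions, not the authors' implementation: "lg" is read as the natural logarithm, the scale parameter σ̃_n is taken to be the mean of the m − n smallest eigenvalues (the noise-variance estimate of Equation (7)), the search runs over n = 1, …, m − 1, and the eigenvalues and N are hypothetical. The logarithm of the product is computed as a sum of logarithms, so no large powers are ever formed.

```python
import math

def ibic(eigvals, n, num_samples):
    """Log-domain BIC for eigenvalues sorted largest first (all must be positive)."""
    m = len(eigvals)
    head, tail = eigvals[:n], eigvals[n:]
    sigma_n = sum(tail) / (m - n)                # assumed noise-scale estimate
    d_n = m * n - n * (n + 1) / 2.0
    signal_term = -0.5 * num_samples * sum(math.log(v) for v in head)
    noise_term = -0.5 * num_samples * (m - n) * math.log(sigma_n)
    penalty = -0.5 * (d_n + n) * math.log(num_samples)
    return signal_term + noise_term + penalty

def estimate_ibic(eigvals, num_samples):
    """Source number that maximizes IBIC over n = 1, ..., m - 1."""
    return max(range(1, len(eigvals)), key=lambda n: ibic(eigvals, n, num_samples))

# Hypothetical eigenvalues (largest first) and sample count.
eigvals = [50.0, 30.0, 12.0, 4.0, 0.0101, 0.0099]
N = 1000
ibic_n_star = estimate_ibic(eigvals, N)
print(ibic_n_star)  # 4
```

For these numbers the direct form would need factors on the order of 0.01 raised to the power −1000, which is far outside double-precision range, whereas every term of the log-domain sum is of moderate size.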

3. Numerical Case Study

In this section, we numerically generate typical signals of mechanical systems to comparatively study the effectiveness of the different source number estimation methods. These generated source signals consider the modulation effects of mechanical systems, and the mixed signals are composed of the sources through a linear superposition and a weak nonlinear mixing. The generating functions of the source signals are listed below:

S (t) = [\begin{array}{l} s_{1} (t) \\ s_{2} (t) \\ s_{3} (t) \\ s_{4} (t) \end{array}] = [\begin{array}{l} 3 \times sin (10 π t) \\ 5 \times sin (4 \times cos (8 π t)) \\ 7 \times sin (5 π t) cos (80 π t) \\ 9 \times n (t) \end{array}]

(10)

In the numerical case study, s₁(t) is a sinusoidal signal that simulates the simple harmonic vibration of mechanical systems; s₂(t) is a frequency modulation signal that simulates the frequency modulation effects of mechanical systems; s₃(t) is an amplitude modulation signal that simulates the amplitude modulation effects of mechanical systems; s₄(t) is a white noise signal that simulates the noises produced by the structural transmission and environment. The waveforms of the source signals are shown in Figure 1.
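The four sources of Equation (10) can be generated with a short script. The sketch below assumes NumPy; the sampling rate, the duration and the use of standard normal samples for n(t) are assumptions chosen for illustration, since they are not specified in the text above.

```python
import numpy as np

def generate_sources(num_samples=2000, fs=1000.0, seed=0):
    """Return the time axis and a 4 x num_samples array of the sources in Equation (10)."""
    t = np.arange(num_samples) / fs                                   # assumed sampling grid
    rng = np.random.default_rng(seed)
    s1 = 3.0 * np.sin(10.0 * np.pi * t)                               # harmonic vibration
    s2 = 5.0 * np.sin(4.0 * np.cos(8.0 * np.pi * t))                  # frequency modulation
    s3 = 7.0 * np.sin(5.0 * np.pi * t) * np.cos(80.0 * np.pi * t)     # amplitude modulation
    s4 = 9.0 * rng.standard_normal(num_samples)                       # white noise (assumed Gaussian)
    return t, np.vstack([s1, s2, s3, s4])

t, S = generate_sources()
print(S.shape)
```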

3.1. Source Number Estimation for Linearly Mixed Signals

Since the number of mixed signals should be no less than the number of source signals for an accurate source separation or system identification, and the source number estimation methods based on information criteria also require more mixed signals than sources, in the numerical case study we provide six mixed signals composed of the given source signals with a linear superposition matrix A given by:

A = [\begin{matrix} 0.58 & 0.36 & - 0.29 & 0.89 \\ 0.33 & - 0.65 & 0.49 & - 0.93 \\ 0.77 & 0.83 & - 0.72 & - 0.85 \\ 0.18 & 0.51 & 0.83 & 0.79 \\ 0.25 & - 0.42 & 0.65 & - 0.59 \\ 0.43 & - 0.27 & - 0.14 & 0.32 \end{matrix}]

Figure 2 displays the waveforms and spectra of the mixed signals, which indicates that it is a difficult task to directly estimate the source number for complex waveforms and complex Fourier spectra with many major components. Therefore, source number estimation methods are required to reveal the complexity of the mixed signals.

Table 1 lists the eigenvalue distribution of the covariance matrix of the mixed signals. The eigenvalues decrease markedly from λ₁ to λ₅, and λ₅ = λ₆ = 0, which means that the mixed signals contain 4 principal components (from the definition of principal component analysis [1]). Therefore, the threshold can be chosen as any γ ∈ (0, 4.42), which gives n* = 4. The result also shows that the threshold γ can be easily determined and that source number estimation based on eigenvalue decomposition is effective for the given linear superposition case.
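The rank argument behind this observation can be checked numerically. The sketch below mixes four sources of the form given in Equation (10) with the matrix A and inspects the covariance eigenvalues. The sampling rate, the number of samples, the noise generator and the relative threshold are assumptions, so the eigenvalues it prints will not reproduce Table 1 exactly; only the pattern of four nonzero and two numerically zero eigenvalues is expected to carry over.

```python
import numpy as np

A = np.array([
    [0.58,  0.36, -0.29,  0.89],
    [0.33, -0.65,  0.49, -0.93],
    [0.77,  0.83, -0.72, -0.85],
    [0.18,  0.51,  0.83,  0.79],
    [0.25, -0.42,  0.65, -0.59],
    [0.43, -0.27, -0.14,  0.32],
])

fs, num_samples = 1000.0, 2000            # assumed sampling parameters
t = np.arange(num_samples) / fs
rng = np.random.default_rng(0)
S = np.vstack([
    3.0 * np.sin(10.0 * np.pi * t),
    5.0 * np.sin(4.0 * np.cos(8.0 * np.pi * t)),
    7.0 * np.sin(5.0 * np.pi * t) * np.cos(80.0 * np.pi * t),
    9.0 * rng.standard_normal(num_samples),
])

X = A @ S                                  # six noise-free linear mixtures
Xc = X - X.mean(axis=1, keepdims=True)
eigvals = np.linalg.eigvalsh(Xc @ Xc.T / num_samples)[::-1]   # largest first

gamma = 1e-9 * eigvals[0]                  # hypothetical relative threshold
n_linear = int(np.sum(eigvals >= gamma))
print(eigvals)
print(n_linear)
```

Because X = AS has rank at most four, the two smallest eigenvalues differ from zero only by rounding error, which is why almost any small positive threshold separates them from the rest.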

Figure 3 shows the source number estimation by the information-based methods: as the source number n increases, the normalized objective functions of AIC and MDL decrease rapidly from n = 1 to n = 4, and reach their minimum values of −0.0809 and −0.0772 at n = 4, while the normalized objective function of IBIC changes greatly from n = 3 to n = 5, and reaches its maximum value of 1.0000 at n = 4. By the definitions of the information-based methods, all of them accurately estimate the source number n* = 4 for the given numerical case with linear superposition. It should be noted that AIC and MDL give very similar results from n = 1 to n = 5, and their normalized values for n = 4 are very close to those for n = 5, while the normalized value of IBIC for n = 4 is clearly separated from those for n = 3 and n = 5, which means that IBIC is more robust and reliable than AIC and MDL for the given case.

Therefore, it can be concluded that all four source number estimation methods are effective for the given numerical case study, and that the eigenvalue decomposition-based method and IBIC are more robust and reliable than AIC and MDL, as they have very wide margins for accurately determining the source number. However, the eigenvalue decomposition-based method requires a reasonable threshold γ, normally determined from prior knowledge of the system or from experience, while the other three methods can estimate the source number adaptively.

3.2. Source Number Estimation for Weakly Nonlinearly Mixed Signals

In this section, a nonlinearity mixing factor σ on the modulation sources is introduced into the mixing process, and the performances of all the source number estimation methods on the nonlinearly mixed signals are compared. The mixed signals are composed of the sources in Figure 1 with both a linear superposition and a nonlinear modulation mixing, and their generating functions are shown in Equation (12):

X = [\begin{array}{l} 0.58 s_{1} + 0.36 s_{2} - 0.29 s_{3} + 0.89 s_{4} + σ s_{1} s_{2} \\ 0.33 s_{1} - 0.65 s_{2} + 0.49 s_{3} - 0.93 s_{4} + σ s_{1} s_{3} \\ 0.77 s_{1} + 0.83 s_{2} - 0.72 s_{3} - 0.85 s_{4} + σ s_{2} s_{3} \\ 0.18 s_{1} + 0.51 s_{2} + 0.83 s_{3} + 0.79 s_{4} + σ s_{3} s_{2} \\ 0.25 s_{1} - 0.42 s_{2} - 0.65 s_{3} - 0.59 s_{4} + σ s_{2} s_{1} \\ 0.43 s_{1} - 0.27 s_{2} - 0.14 s_{3} + 0.32 s_{4} + σ s_{1} s_{3} \end{array}]

(12)

The nonlinearity mixing factor σ represents the modulation effects of mechanical systems with many sources, and the nonlinear mixing mode is generally regarded as a technical bottleneck for source separation and source number estimation. Therefore, the factor σ is used as a parameter to test the performances of the source number estimation algorithms. It should be noted that σ is given an initial value of 0.0001 so that the logarithm functions in Equations (4), (5) and (9) are well defined, and the information-based methods are compared as σ increases.
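The effect of σ on the eigenvalue spectrum can be explored with the sketch below, which builds the six mixtures with their cross-product terms and tracks the fifth eigenvalue. It is an illustration only: the first coefficient of row one is read as multiplying s₁ (in line with matrix A), the remaining coefficients are taken as printed in the mixing equation, and the sampling parameters and noise generator are assumptions, so the numbers will not match Table 2.

```python
import numpy as np

def sources(num_samples=2000, fs=1000.0, seed=0):
    """Sources of Equation (10) on an assumed sampling grid."""
    t = np.arange(num_samples) / fs
    rng = np.random.default_rng(seed)
    return np.vstack([
        3.0 * np.sin(10.0 * np.pi * t),
        5.0 * np.sin(4.0 * np.cos(8.0 * np.pi * t)),
        7.0 * np.sin(5.0 * np.pi * t) * np.cos(80.0 * np.pi * t),
        9.0 * rng.standard_normal(num_samples),
    ])

def nonlinear_mixture(S, sigma):
    """Six mixtures: linear superposition plus sigma-weighted source products."""
    s1, s2, s3, s4 = S
    return np.vstack([
        0.58 * s1 + 0.36 * s2 - 0.29 * s3 + 0.89 * s4 + sigma * s1 * s2,
        0.33 * s1 - 0.65 * s2 + 0.49 * s3 - 0.93 * s4 + sigma * s1 * s3,
        0.77 * s1 + 0.83 * s2 - 0.72 * s3 - 0.85 * s4 + sigma * s2 * s3,
        0.18 * s1 + 0.51 * s2 + 0.83 * s3 + 0.79 * s4 + sigma * s3 * s2,
        0.25 * s1 - 0.42 * s2 - 0.65 * s3 - 0.59 * s4 + sigma * s2 * s1,
        0.43 * s1 - 0.27 * s2 - 0.14 * s3 + 0.32 * s4 + sigma * s1 * s3,
    ])

def fifth_eigenvalue(sigma, S):
    """Fifth-largest eigenvalue of the sample covariance of the mixtures."""
    X = nonlinear_mixture(S, sigma)
    Xc = X - X.mean(axis=1, keepdims=True)
    eigvals = np.linalg.eigvalsh(Xc @ Xc.T / X.shape[1])[::-1]
    return float(eigvals[4])

S = sources()
lam5_small = fifth_eigenvalue(0.0013, S)
lam5_large = fifth_eigenvalue(0.0974, S)
print(lam5_small, lam5_large)
```

The product terms add signal components outside the four-dimensional subspace spanned by the linear mixing, so the fifth eigenvalue is no longer zero and grows as σ increases.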

The accuracy rates of the three information-based source number estimation methods are displayed in Figure 4, which shows that AIC, MDL and IBIC fail to correctly estimate the source number once the factor σ reaches 0.0013, 0.0015, and 0.0974, respectively. The comparison indicates that AIC and MDL give similar performances and are both sensitive to the nonlinearity mixing factor (they remain accurate only for σ < 0.0013 and σ < 0.0015, respectively); MDL performs slightly, but not significantly, better than AIC, while IBIC is much more robust toward σ (it remains accurate for σ < 0.0974). Therefore, IBIC performs more robustly and reliably under the nonlinear mixing effects of mechanical systems, and this property supports wider engineering applications of IBIC, as most physical systems have nonlinear mixing effects. The waveforms of the mixed signals for σ = 0.0974 are shown in Figure 5, which also shows that the source number cannot be estimated directly from the complicated waveforms.

Figure 6 displays the performance of AIC for the nonlinearity mixing factor σ equal to 0.0013, 0.0015, and 0.0974: AIC decreases significantly from n = 1 to n = 5 and reaches its minimum value of −0.2014 at n = 5 for the σ = 0.0974 case. Furthermore, AIC has similar values for the σ = 0.0013 and σ = 0.0015 cases, with minimum values of −0.0742 and −0.0749 at n = 5. However, for the σ = 0.0012 case, AIC reaches its minimum value of −0.0756 at n = 4, while AIC = 0.7322 at n = 3 and AIC = −0.0739 at n = 5. Therefore, AIC fails to correctly estimate the source number once σ increases to 0.0013, and the objective function decreases rapidly for the σ = 0.0974 case, which means that the nonlinear modulation effects greatly influence the accuracy of AIC.

Figure 7 displays the performance of MDL for the nonlinearity mixing factor σ equal to 0.0013, 0.0015, and 0.0974: MDL decreases significantly from n = 1 to n = 5 and reaches its minimum value of −0.1898 at n = 5 for the σ = 0.0974 case.

For the σ = 0.0013 case, MDL reaches its minimum value of −0.0703 at n = 4, and this remains true until σ = 0.0015, where the minimum of MDL moves to n = 5. Comparing Figure 6 with Figure 7, it can be seen that AIC and MDL perform similarly in all three cases, and the only difference is that MDL is slightly more robust to the nonlinearity mixing factor σ than AIC.

Figure 8 displays the performance of IBIC for the nonlinearity mixing factor σ equal to 0.0013, 0.0015, and 0.0974: IBIC reaches its maximum values of −0.1490 and −0.1106 at n = 4 for the σ = 0.0013 and σ = 0.0015 cases, and its maximum value of −0.8935 at n = 1 for the σ = 0.0974 case.

Furthermore, the maximum values of IBIC for the σ = 0.0013 and σ = 0.0015 cases are clearly separated from their neighbors, which supports a more accurate estimation of the source number. However, the IBIC values are close to each other for the σ = 0.0974 case, which means that the maximum of IBIC is difficult to determine and IBIC becomes less robust and reliable.

Table 2 displays the eigenvalues of the covariance matrix for different factors σ: the corresponding eigenvalues are very close to each other for the σ = 0.000, 0.0013, and 0.0015 cases, while λ₅ rises to 0.9791 for the σ = 0.0974 case. From the definition of the eigenvalue decomposition-based source number estimation method, λ₅ = 0.0002 causes AIC to fail for the σ = 0.0013 case, and λ₅ = 0.0003 causes MDL to fail for the σ = 0.0015 case. However, IBIC begins to fail only for the σ = 0.0974 case, which also indicates that IBIC is much more robust to the modulation effects than AIC and MDL. Furthermore, the distributions of all the eigenvalues show that it is not difficult to set a threshold γ ∈ (0.9791, 4.37) for the eigenvalue decomposition-based source number estimation method. However, it is normally very difficult to set a reasonable γ without any prior knowledge of the sources and their distributions.

Therefore, it can be concluded that the eigenvalue decomposition-based source number estimation method is difficult to carry out without any prior knowledge of the sources, while the information-based methods can adaptively and accurately estimate the source number for the linear superposition cases. However, for the cases with nonlinear modulation effects, IBIC performs more robustly and reliably than AIC and MDL, which suggests wider engineering applications for IBIC.

4. Experimental Study

In general, it is difficult to directly measure the source signals in most mechanical systems due to limited accessibility, and thus signal processing is often required to separate and recover the source information from the mixed signals, which are normally measured by remote sensors. These separated source signals can then be used for further purposes such as condition monitoring and fault diagnosis of mechanical systems. However, the source number should first be estimated from the measured mixed signals, as prior knowledge for source separation or complexity analysis of the system. In this section, we apply the source number estimation methods described above to the mechanical system shown in Figure 9 to demonstrate and benchmark their performance on mechanical systems.

4.1. Introduction of the Test Bed

For vibration and noise source number estimation in mechanical systems, this study designs a test bed based on a shell structure, which is composed of an end cover, a shell, clapboards, and supports. The whole test bed is supported by four rubber air springs, which reduce the influence of environmental noise. There are three sound sources: two of them are loudspeakers controlled by signal generators, and the third is a motor controlled by a frequency converter. The structure and a photo of the test bed are shown in Figure 9. Since the vibration and noise of thin shell structures can approximately be governed by linear differential equations [30], we treat the test bed as predominantly linear with weakly nonlinear features.

Six sound pressure sensors are used to measure the sound information, and they are installed in different directions around the test bed at a distance of 0.5 m. An HBM Gen2i data acquisition system is used to collect the sound data from these six sensors. The framework of the measuring system is shown in Figure 10, and the testing parameters are listed in Table 3.

4.2. Source Number Estimation

Each sound source signal is measured with only that source working at the parameters given in Table 4, so that the three sound sources can be used as references to test the source number estimation methods. The waveforms of the source signals are shown in Figure 11, which clearly displays the periodic features of the source signals. When all three sources are working together, the signals from all the surrounding sound pressure sensors are mixed signals, and their waveforms are shown in Figure 12. Evidently, it is very difficult to correctly estimate the source information from the mixed signals because of their complicated waveforms.

Both the eigenvalue decomposition-based source number estimation method and the information-based source number estimation methods are applied to estimate the sound source number of the given test bed. The eigenvalues of the covariance matrix of the mixed signals are shown in Table 4. It is difficult to determine the source number from the distribution of the eigenvalues alone, as λ₄, λ₅ and λ₆ are very close to each other.

The results of source number estimation by the information-based methods are shown in Figure 13, which shows that AIC and MDL reach their minimum values of −0.9806 and −0.8171 at n = 4, while IBIC reaches its maximum value of 1.0000 at n = 4. Therefore, all the information-based source number estimation methods give an estimated source number of 4 for the given experimental study. However, there are only three sources in the test bed. To explain this apparent inconsistency, we provide further evidence based on source separation by independent component analysis (ICA).

The fast ICA algorithm [6,31] is applied to separate the sources from the given six mixed signals, with the source number parameter set to 4. We thus obtain four independent components from the mixed signals, and their waveforms are shown in Figure 14. The waveforms of three separated components are similar to those of the corresponding source signals in Figure 11, while the remaining separated signal has a complex waveform and is clearly different from the given source signals.

The spectra of the source signals and the separated components are displayed in Figure 15 and Figure 16. Comparing Figure 15 with Figure 16, the spectrum of separated component 1 is similar to that of source signal 1, with the same dominant frequency of 3,000 Hz, and the spectrum of separated component 2 is similar to that of source signal 2, with the same dominant frequency of 1,600 Hz. Furthermore, the spectrum of separated component 3 is close to that of source signal 3, with most of the energy contained in 500–1,500 Hz. However, the spectrum of separated component 4 is very complicated from 0 to 5,000 Hz, which is similar to the spectrum of white noise. Therefore, separated component 4, which we consider to be environmental noise, has considerable energy compared with the other three sources, and causes the estimated source number to be 4 rather than 3. After the source separation and spectral analysis, the correct source number of the given test bed should be 4 rather than 3, which provides convincing evidence for the accuracy of all the information-based methods.
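The kind of spectral comparison described here rests on locating the dominant frequency of each signal. A minimal sketch with NumPy's real FFT is shown below. The test tone, noise level and sampling rate are hypothetical stand-ins, since the measured signals and acquisition settings are not reproduced in the text, and the function name is an assumption.

```python
import numpy as np

def dominant_frequency(x, fs):
    """Frequency (Hz) of the largest peak in the one-sided amplitude spectrum."""
    x = np.asarray(x, dtype=float)
    spectrum = np.abs(np.fft.rfft(x - x.mean()))     # remove the mean so DC does not dominate
    freqs = np.fft.rfftfreq(x.size, d=1.0 / fs)
    return float(freqs[np.argmax(spectrum)])

# Hypothetical example: a 3,000 Hz tone in weak white noise, sampled at 20 kHz.
fs = 20000.0
t = np.arange(2000) / fs
rng = np.random.default_rng(0)
tone = np.sin(2.0 * np.pi * 3000.0 * t) + 0.1 * rng.standard_normal(t.size)

f_peak = dominant_frequency(tone, fs)
print(f_peak)
```

A component whose spectrum has no single dominant peak and is spread across the whole band, as reported for separated component 4, would instead call for comparing the overall spectral shape rather than one peak frequency.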

In general, when the eigenvalues have similar values, as they do here, it is very difficult for the eigenvalue decomposition-based method to accurately and robustly estimate the source number without any prior information about the sources. However, all the information-based methods correctly and adaptively estimate the source number of the test bed, which further suggests that the mixing mode of the sound sources tends to be a linear superposition, and thus that the information-based methods are effective for the sound signals.

5. Conclusions

This paper investigates both eigenvalue decomposition-based and information-based source number estimation methods, and improves BIC as IBIC to make it more efficient and easier to calculate. Furthermore, their performances on nonlinear modulation effects of mechanical systems are studied comparatively with numerical case studies and experimental studies.

In the numerical case study with a linear superposition, the eigenvalue decomposition-based method has a wide band for the threshold, γ ∈ (0, 4.42), and all three information-based methods accurately estimate the source number. Furthermore, IBIC performs more robustly and effectively because its objective function values are clearly separated. For the weakly nonlinear mixing case, AIC and MDL are very sensitive to the modulation factor σ and become inaccurate even for a very small σ, while IBIC performs more robustly and reliably toward σ, which means that IBIC is more effective for mechanical systems. In the experimental studies, a test bed with three sound sources is constructed to test the performances of the above methods. For the eigenvalue decomposition-based method, the threshold is difficult to determine because the last three eigenvalues are very close to each other. However, all three information-based methods reach their extremum values at n = 4. Using source separation, four independent components are extracted from the mixed signals, and the additional source, whose energy is comparable to that of the three given sources, is considered to be environmental noise. Therefore, all the information-based methods are effective for sound source number estimation on the given test bed.

Generally, IBIC performs more robustly and reliably toward the nonlinear modulation effects than AIC and MDL, while eigenvalue decomposition-based methods normally require prior information about the sources and become unreliable when the eigenvalues are very close to each other. Furthermore, the results of the information-based methods for the test bed also indicate that the mixing mode of the sound sources tends to be a linear superposition. This study can benefit model order selection, complexity analysis of a system, and applications of source separation to mechanical systems for condition monitoring and fault diagnosis purposes.

Acknowledgments

This work is supported by the projects of the National Natural Science Foundation of China (No. 51305329, 51035007), the China Postdoctoral Science Foundation (No. 2013M532032), the Doctoral Foundation of Education Ministry of China (No. 20130201120040), and the Shaanxi Postdoctoral Scientific Research Project.

Conflicts of Interest

The authors declare no conflicts of interest.

Figure 1. The waveforms of the source signals.

Figure 2. The waveforms and spectra of the mixed signals.

Figure 3. Source number estimation by information-based methods.

Figure 4. Accuracy rates of information-based source number estimation methods.

Figure 5. The waveforms of the mixed signals for σ = 0.0974.

Figure 6. Source number estimation by AIC.

Figure 7. Source number estimation by MDL.

Figure 8. Source number estimation by IBIC.

Figure 9. The structure (I) and photo (II) of the test-bed: (a) End cover. (b) Loudspeaker I. (c) Left clapboard. (d) Loudspeaker II. (e) Shell. (f) Motor. (g) Right clapboard. (h) Rubber springs. (i) Supports.

Figure 10. The measuring system of the test bed.

Figure 11. Waveforms of the source signals.

Figure 12. Waveforms of the mixed signals.

Figure 13. Source number estimation by information-based methods.

Figure 14. The waveforms of the components separated by the FastICA algorithm.

Figure 15. The spectra of the source signals.

Figure 16. The spectra of the separated components.

**Table 1.** The eigenvalues of the covariance matrix for the mixed signals.
Eigenvalues	λ₁	λ₂	λ₃	λ₄	λ₅	λ₆
Values	281.31	31.21	10.43	4.42	0.00	0.00
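
The threshold rule of the eigenvalue decomposition-based method can be sketched as counting the eigenvalues that exceed γ. The helper name `count_above` and the trial thresholds below are hypothetical; the eigenvalues are those of Table 1.

```python
def count_above(eigvals, gamma):
    """Estimate the source number as the count of eigenvalues strictly above gamma."""
    return sum(1 for v in eigvals if v > gamma)

table1 = [281.31, 31.21, 10.43, 4.42, 0.00, 0.00]

# Any threshold inside the band (0, 4.42) gives the same estimate.
estimates = [count_above(table1, g) for g in (0.01, 1.0, 4.0)]
print(estimates)                     # [4, 4, 4]

# A threshold above the band starts to discard genuine sources.
print(count_above(table1, 5.0))      # 3
```

The estimate is stable only because a clear gap separates λ₄ from λ₅; without such a gap the result depends directly on the choice of γ.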

**Table 2.** Eigenvalues of covariance matrix with different factor σ.
Eigenvalues	λ₁	λ₂	λ₃	λ₄	λ₅	λ₆
σ = 0.0000	281.31	31.21	10.43	4.42	0.0000	0.0000
σ = 0.0013	289.56	31.25	10.48	4.37	0.0002	0.0000
σ = 0.0015	271.56	31.27	10.41	4.37	0.0003	0.0001
σ = 0.0974	280.06	31.63	12.66	5.26	0.9791	0.0206
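
One way to see why likelihood-based criteria can react to a very small σ is to look at the ratio of the geometric mean to the arithmetic mean of the eigenvalues left over after four sources are assumed. In the classical Wax-Kailath forms of AIC and MDL (an assumption here; the expressions used in this paper may differ), the logarithm of this ratio is multiplied by the sample count, so even a modest departure from 1 can outweigh the penalty term. The sketch below uses the Table 2 rows whose trailing eigenvalues are both nonzero as printed; `tail_ratio` is a hypothetical helper.

```python
import math

def tail_ratio(eigvals, k):
    """Geometric-to-arithmetic mean ratio of the eigenvalues after the k largest."""
    tail = sorted(eigvals, reverse=True)[k:]
    if min(tail) <= 0.0:
        return None                  # printed as zero: ratio not recoverable
    geo = math.exp(sum(math.log(v) for v in tail) / len(tail))
    return geo / (sum(tail) / len(tail))

table2 = {
    0.0015: [271.56, 31.27, 10.41, 4.37, 0.0003, 0.0001],
    0.0974: [280.06, 31.63, 12.66, 5.26, 0.9791, 0.0206],
}
ratios = {sigma: tail_ratio(lam, k=4) for sigma, lam in table2.items()}
for sigma, r in ratios.items():
    print(f"sigma = {sigma}: tail ratio = {r:.3f}")
```

The ratio is about 0.87 for σ = 0.0015 and about 0.28 for σ = 0.0974. Because the printed eigenvalues are rounded, these values are only indicative.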

**Table 3.** The testing parameters of the measuring system.
Parameters	Values and Units
Sound pressure sensors	6
HBM Gen2i Data acquisition system	1
Sampling frequency	10,240 Hz
Sampling length	10 s
Frequency of Loudspeaker I with sine wave	f₁ = 1,600 Hz
Frequency of Loudspeaker II with triangle wave	f₂ = 3,000 Hz
Rotational speed of motor	900 r/min (f₃ = 15 Hz)

**Table 4.** The eigenvalues of the covariance matrix for the mixed signals.
Eigenvalues	λ₁	λ₂	λ₃	λ₄	λ₅	λ₆
f₁ = 3000 Hz, f₂ = 1600 Hz, f₃ = 15 Hz	0.0306	0.0076	0.0047	0.0022	0.0010	0.0009
f₁ = 3000 Hz, f₂ = 1600 Hz, f₃ = 20 Hz	0.0386	0.0252	0.0097	0.0067	0.0049	0.0038
f₁ = 3000 Hz, f₂ = 1600 Hz, f₃ = 25 Hz	0.0450	0.0121	0.0116	0.0082	0.0061	0.0045
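
To illustrate why a threshold is hard to choose for these measured eigenvalues, the sketch below applies a simple largest-ratio gap heuristic to the three rows of Table 4. This heuristic is a stand-in chosen for illustration, not the method used in the paper, and `largest_gap` is a hypothetical helper. The dictionary keys are the f₃ values of the rows.

```python
def largest_gap(eigvals):
    """Return (k, ratios): k eigenvalues lie above the largest consecutive ratio."""
    ratios = [a / b for a, b in zip(eigvals, eigvals[1:])]
    k = max(range(len(ratios)), key=ratios.__getitem__) + 1
    return k, ratios

table4 = {
    15: [0.0306, 0.0076, 0.0047, 0.0022, 0.0010, 0.0009],
    20: [0.0386, 0.0252, 0.0097, 0.0067, 0.0049, 0.0038],
    25: [0.0450, 0.0121, 0.0116, 0.0082, 0.0061, 0.0045],
}
picks = {}
for f3, lam in table4.items():
    k, ratios = largest_gap(lam)
    picks[f3] = k
    print(f3, k, [round(r, 2) for r in ratios])
```

The heuristic picks 1, 2 and 1 for the three rows, so it matches neither the three installed sources nor the n = 4 obtained by the information-based methods, and it changes from row to row. This is consistent with the observation that the trailing eigenvalues are too close to separate by inspection.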

© 2014 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
