Composite Interpolation-Based Multiscale Fuzzy Entropy and Its Application to Fault Diagnosis of Rolling Bearing

Multiscale fuzzy entropy (MFE), as an enhanced multiscale sample entropy (MSE) method, is an effective nonlinear method for measuring the complexity of time series. In this paper, an improved MFE algorithm termed composite interpolation-based multiscale fuzzy entropy (CIMFE) is proposed by using cubic spline interpolation of the time series over different scales to overcome the drawbacks of the coarse-grained MFE process. The proposed CIMFE method is compared with MSE and MFE by analyzing simulation signals and the result indicates that CIMFE is more robust than MSE and MFE in analyzing short time series. Taking this into account, a new fault diagnosis method for rolling bearing is presented by combining CIMFE for feature extraction with Laplacian support vector machine for fault feature classification. Finally, the proposed fault diagnosis method is applied to the experiment data of rolling bearing by comparing with the MSE, MFE and other existing methods, and the recognition rate of the proposed method is 98.71%, 98.71%, 98.71%, 98.71% and 100% under different training samples (5, 10, 15, 20 and 25), which is higher than that of the existing methods.


Introduction
Rolling bearings are some of the most common mechanical parts in rotating machinery and their running state often affects the performance of the whole machine [1,2]. Therefore, it is of importance to study the incident fault diagnosis method of rolling bearings. Although the linear and stationary signal analysis methods have been widely used for machine monitoring, the vibration signals often present complexity and non-stationarity caused by the variable load operating conditions or nonlinear friction factors. Therefore, the traditional linear and stationary signal analysis methods inevitably have their limitations. With the development of nonlinear dynamic theory, many nonlinear parameters, such as fractal dimension [3][4][5], approximate entropy [6,7], sample entropy [8,9] and multiscale sample entropy [10][11][12] have been used in mechanical fault diagnosis. For example, Yang et al. [4] proposed an intelligent fault diagnosis of rolling element bearings based on support vector machines (SVMs) and fractal dimensions. Han and Pan proposed a fault diagnosis method combined with local mean decomposition (LMD), sample entropy and energy ratio for rolling bearing [8]. Zhang et al. [12] proposed a bearing fault diagnosis method by using multiscale entropy (MSE) and adaptive neuro-fuzzy inference. However, the coarse graining in MSE usually results in the 'flying wing' phenomenon at the end with the increase of scale factor. In fact, MSE mainly has the (1) For original data {X i } = {x 1 , x 2 , . . . , x N } with length N, the coarse-grained sequence is established as: where τ is scale factor. y j (1) is the original time series and when τ > 1, {X i } is divided into τ coarse-grained sequence y j (τ) with length [N/τ] (no greater than the positive integer N/τ). (2) Sample entropy of each coarse-grained sequence y j (τ) are calculated under the same similar tolerance as follows: MSE(X, τ, m, n, r) = SampEn(y τ , m, n, r) where, m is the embedding dimension, n is the exponential function gradient parameter, and r is the tolerance. Then all obtained entropies can be seen as a function at scale factor τ and this process is called multiscale entropy analysis.
MSE overcomes the defects of sample entropy in single scale for measuring the complexity of time series. However, the coarse-grained process greatly depends on the length of time sequence, which possibly results in the increasing of entropy deviation with the increasing of scale factor in the calculation of MSE. Taking scale factor 2 and 3 as examples and Figure 1 shows the coarse-grained process.   (2) where, m is the embedding dimension, n is the exponential function gradient parameter, and r is the tolerance. Then all obtained entropies can be seen as a function at scale factor τ and this process is called multiscale entropy analysis.
MSE overcomes the defects of sample entropy in single scale for measuring the complexity of time series. However, the coarse-grained process greatly depends on the length of time sequence, which possibly results in the increasing of entropy deviation with the increasing of scale factor in the calculation of MSE. Taking scale factor 2 and 3 as examples and Figure 1 shows the coarse-grained process. In the coarse-grained process, when τ equals 2 the coarse-grained sequence beginning with the average of 1 x and 2 x , is solely considered and the time sequence beginning with average of 2 x and 3 x is ignored, which also owns the same scale factor 2. Similarly, when τ equals 3, only the coarse-grained sequence beginning with the average of 1 x , 2 x and 3 x is used while that beginning with the average of 2 x , 3 x and 4 x , or 3 x , 4 x and 5 x are ignored. Besides, the mean way in original coarse-grained process is seen as a linear interpolation which will cause the nonlinearity and non-stationarity of the vibration signal ignored. The steps of CMSE can be simply described as follows [14]: (  In the coarse-grained process, when τ equals 2 the coarse-grained sequence beginning with the average of x 1 and x 2 , is solely considered and the time sequence beginning with average of x 2 and x 3 is ignored, which also owns the same scale factor 2. Similarly, when τ equals 3, only the coarse-grained sequence beginning with the average of x 1 , x 2 and x 3 is used while that beginning with the average of x 2 , x 3 and x 4 , or x 3 , x 4 and x 5 are ignored. Besides, the mean way in original coarse-grained process is seen as a linear interpolation which will cause the non-linearity and non-stationarity of the vibration signal ignored. The steps of CMSE can be simply described as follows [14]: (1) For the time series {x(i), i = 1, 2, · · · , N} with data length N, the coarse grained series y τ k , y τ k,j = y τ k,1 , y τ k,2 , · · · , y τ k,p can be calculated as: (2) For each scale factor τ and k, the SampEns of each coarse grained sequence y τ k (1 ≤ k ≤ τ) are calculated and then the average of all the k SampEn values for each τ are computed. The CMSE for scale factor τ is obtained by: The composite multiscale way is displayed in Figure 2 with scale factor equals 2.
The composite multiscale way is displayed in Figure 2 with scale factor equals 2.

Composite Interpolation-Based Multiscale Fuzzy Entropy
In this paper, CIMFE is proposed to solve the deficiencies of MSE. In CIMFE, firstly, fuzzy entropy is employed to replace sample entropy and overcome the mutation of similarity measurement of two vectors. Secondly, the cubic spline method is used to interpolate the coarsegrained time series over different scales for retaining the non-stationarity and nonlinearity of signal. Lastly, the composite multiscale way is used to construct the final entropy values in each scale. The calculation steps of CIMFE are given as follows: (1) For a given normalized time series { } (2) FuzzyEn of each coarse graining time sequence in each scale factor τ is calculated, and then the mean value of all τ entropies is computed. The CIMFE under this scale factor is described as:  (5) where the detailed steps for FuzzyEn calculation can be found in literature [27,28].
, the cubic spline function is applied to interpolate the coarse graining time series over different scales. As shown in Figure 3, when τ equals 1,

Composite Interpolation-Based Multiscale Fuzzy Entropy
In this paper, CIMFE is proposed to solve the deficiencies of MSE. In CIMFE, firstly, fuzzy entropy is employed to replace sample entropy and overcome the mutation of similarity measurement of two vectors. Secondly, the cubic spline method is used to interpolate the coarse-grained time series over different scales for retaining the non-stationarity and nonlinearity of signal. Lastly, the composite multiscale way is used to construct the final entropy values in each scale. The calculation steps of CIMFE are given as follows: (1) For a given normalized time series {x(i), i = 1, 2, · · · , N}, the coarse-grained sequence y τ k,j = y τ k,1 , y τ k,2 , · · · , y τ k,p is expressed as Equation (3). (2) FuzzyEn of each coarse graining time sequence y τ k (1 ≤ k ≤ τ) in each scale factor τ is calculated, and then the mean value of all τ entropies is computed. The CIMFE under this scale factor is described as: where the detailed steps for FuzzyEn calculation can be found in literature [27,28].
(3) For {x(i), i = 1, 2, · · · , N}, the cubic spline function is applied to interpolate the coarse graining time series over different scales. As shown in Figure 3, when τ equals 1, k 1 1,j is the original time series {x i }. When τ equals 2, k 2 2,j is the middle point between x i and x i+1 (i = 1, 2, 3, . . . ) calculated by the cubic spline interpolation function and is given as is the middle point between y 2 N−2 and y 2 N−1 (N = 3,4,5 . . . ) calculated by the cubic spline interpolation function and is expressed as k 3 3,j = k 2 (i), 1 ≤ i ≤ N − 2 . The rest can be done in the same ways. (4) Then FuzzyEn of k τ p (1 ≤ p ≤ τ) is calculated and the mean value of all τ FuzzyEn entropies is computed. The CIMFE 2 under scale factor τ is obtained by: FuzzyEn(k τ p , m, n, r) The obtained entropies are expressed as a function of scale factor and this process is called composite interpolation multiscale fuzzy entropy analysis. The obtained entropies are expressed as a function of scale factor and this process is called composite interpolation multiscale fuzzy entropy analysis.

Parameter Selection
The calculation of CIMFE is related with data length N, embedding dimension m, the tolerance r and the exponential function gradient parameter n. Firstly, with the increase of m, more detailed information can be obtained, however, more data length is required ( ) m m N 30 10 = . Therefore, we generally set m as 2. Secondly, the similar tolerance r controls the width of fuzzy function. A smaller r results in more statistical information, which will increase the sensitivity of CIMFE to noise. However, a larger r will lose some statistical information. Thus, r is selected as (SD is the standard deviation of original data) in this paper, and 0.15SD is used. Thirdly, n generally controls the gradient of fuzzy function, and the used fuzzy function will turn into a unit step function when n tends to infinity. According to literature [27,28], n is set as 2. Lastly, the data length N also has some influence on sample entropy and fuzzy entropy, and we keep m N 10 ≥ τ , where τ is scale factor [14].

Comparison Analysis
Without loss of generality, we take 20 groups of Gaussian white noises and 1/f noises with length 3000 as examples. Their mean-standard deviation curves of MSEs, CMSEs, MFEs and CIMFEs are shown in Figure 4a,b. First of all, from Figure 4, we can find that with the increase of scale factor, the CIMFE curve is relatively smoother and has small fluctuations. Secondly, comparing MSEs, CMSEs and MFEs at each scale factor, CIMFEs fluctuate less slightly, which shows that CIMFEs are more stable than the existing MSE, MFE and CMSE methods. Finally, with the increase of scale factor, the entropies of Gaussian white noise decrease gradually, which shows that compared with 1/f noise, the Gaussian white noise signal is simpler and the significant information can be extracted at smaller

Parameter Selection
The calculation of CIMFE is related with data length N, embedding dimension m, the tolerance r and the exponential function gradient parameter n. Firstly, with the increase of m, more detailed information can be obtained, however, more data length is required ( N = 10 m ∼ 30 m ). Therefore, we generally set m as 2. Secondly, the similar tolerance r controls the width of fuzzy function. A smaller r results in more statistical information, which will increase the sensitivity of CIMFE to noise. However, a larger r will lose some statistical information. Thus, r is selected as 0.1 ∼ 0.25SD (SD is the standard deviation of original data) in this paper, and 0.15SD is used. Thirdly, n generally controls the gradient of fuzzy function, and the used fuzzy function will turn into a unit step function when n tends to infinity. According to literature [27,28], n is set as 2. Lastly, the data length N also has some influence on sample entropy and fuzzy entropy, and we keep N τ ≥ 10 m , where τ is scale factor [14].

Comparison Analysis
Without loss of generality, we take 20 groups of Gaussian white noises and 1/f noises with length 3000 as examples. Their mean-standard deviation curves of MSEs, CMSEs, MFEs and CIMFEs are shown in Figure 4a,b. First of all, from Figure 4, we can find that with the increase of scale factor, the CIMFE curve is relatively smoother and has small fluctuations. Secondly, comparing MSEs, CMSEs and MFEs at each scale factor, CIMFEs fluctuate less slightly, which shows that CIMFEs are more stable than the existing MSE, MFE and CMSE methods. Finally, with the increase of scale factor, the entropies of Gaussian white noise decrease gradually, which shows that compared with 1/f noise, the Gaussian white noise signal is simpler and the significant information can be extracted at smaller scales. However, with the increase of scale factor, the entropy curve of 1/f noise changes slowly and is basically stable in the vicinity of a constant value. The above analysis indicate that the 1/f noise is more complex than white noise and contain important pattern information at all scales, which is consistent with the physical sense of Gaussian white noise and 1/f noise signals. scales. However, with the increase of scale factor, the entropy curve of 1/f noise changes slowly and is basically stable in the vicinity of a constant value. The above analysis indicate that the 1/f noise is more complex than white noise and contain important pattern information at all scales, which is consistent with the physical sense of Gaussian white noise and 1/f noise signals.

The Proposed Fault Diagnosis Method
The above analysis of simulation signals indicates that CIMFE as a new complexity measure method for time series that can get much better performance than the existing methods, i.e., MSE, MFE and CMSE. Thus, CIMFE as the method for fault feature extraction is applied to diagnosis for rolling bearings. At present, monitoring technologies have been widely developed and more fault samples can be collected easily, however it is difficult to mark all the fault classes of the collected samples. Actually, a too small marked samples input into classifiers will result in a worse performance in default recognition. Therefore, it is of significance to promote and improve identification rate by using few marked samples and many unmarked samples [29]. LapSVM, a kind of semi-supervised learning method, can effectively improve classification performance by applying few labeled samples and many unlabeled samples to design the internal structure of the classifier. The procedures of the proposed fault diagnosis method for rolling bearings are given as follows: (1) For the given K states of rolling bearing, each state has mk samples and thus the number of whole samples is (4) The selected sensitive fault feature sets that consists of the first several elements from are input to the LapSVM-based multi-classifier for fault recognition and the fault locations can be diagnosed according to the outputs of multi-classifier.

Experiment Data Analysis
Case 1: In general, the vibration signals, which contain noise signals, express non-stationarity and nonlinearity. However, it is a key step to extract fault pattern information from vibration signal in the

The Proposed Fault Diagnosis Method
The above analysis of simulation signals indicates that CIMFE as a new complexity measure method for time series that can get much better performance than the existing methods, i.e., MSE, MFE and CMSE. Thus, CIMFE as the method for fault feature extraction is applied to diagnosis for rolling bearings. At present, monitoring technologies have been widely developed and more fault samples can be collected easily, however it is difficult to mark all the fault classes of the collected samples. Actually, a too small marked samples input into classifiers will result in a worse performance in default recognition. Therefore, it is of significance to promote and improve identification rate by using few marked samples and many unmarked samples [29]. LapSVM, a kind of semi-supervised learning method, can effectively improve classification performance by applying few labeled samples and many unlabeled samples to design the internal structure of the classifier. The procedures of the proposed fault diagnosis method for rolling bearings are given as follows: (1) For the given K states of rolling bearing, each state has m k samples and thus the number of whole samples is M = ∑ K k=1 m k . (2) CIMFE of all M samples are calculated with the largest scale factor τ m and then the CIMFE based feature sets T k ∈ R m k ×τ max can be obtained for the K classes (k = 1,2, . . . K), where R m k ×τ max stands real matrix with row m k and column τ max .
(3) For each class, m k samples are randomly divided into h marked sample sets, i.e., T k 1 ∈ R h×τ max for training and (m k − h) unmarked sample sets, i.e., T k 2 ∈ R (m k −h)×τ max for testing; (4) The selected sensitive fault feature sets that consists of the first several elements from T k 1 ∈ R h×τ max and T k 2 ∈ R (m k −h)×τ max are input to the LapSVM-based multi-classifier for fault recognition and the fault locations can be diagnosed according to the outputs of multi-classifier.

Experiment Data Analysis
Case 1: In general, the vibration signals, which contain noise signals, express non-stationarity and non-linearity. However, it is a key step to extract fault pattern information from vibration signal in the procedure of fault diagnosis. Both sample entropy and fuzzy entropy have two similar advantages: anti-noise and anti-interference. Therefore, a new fault diagnosis method for rolling bearings based on CIMFE and LapSVM is proposed in this paper. Firstly, CIMFEs of vibration signals of rolling bearings under different states are calculated and analyzed at different scale factors, if the difference of CIMFE curves of vibration signals of rolling bearings under different states is obvious, they would be used to directly recognize the state of rolling bearings. Otherwise, they would be taken as characteristic vectors and input to LapSVM classifier to conduct the fault recognition for rolling bearings. By using LapSVM, the obtained samples are utilized to train and test the fault classifier, and the diagnosis recognition for rolling bearings are conducted.
The adopted experimental data are derived from the rolling bearing test data supported by Case Western Reserve University in USA [30], and the experimental device is shown in Figure 5. In the test, the tested rolling bearing is 6205-2RS JEM of SKF with fault diameter 0.5334 mm, failure depth 0.2794 mm, speed 1797 r/min and sampling frequency 12 kHz. The vibration signals of rolling bearing with four states, i.e., inner ring fault (IR), outer ring fault (OR), ball element fault (BE) and normal state (Norm) are collected, which are exhibited in Figure 6a-d. As shown in Figure 6, it is difficult to directly distinguish all the states of rolling bearing. procedure of fault diagnosis. Both sample entropy and fuzzy entropy have two similar advantages: anti-noise and anti-interference. Therefore, a new fault diagnosis method for rolling bearings based on CIMFE and LapSVM is proposed in this paper. Firstly, CIMFEs of vibration signals of rolling bearings under different states are calculated and analyzed at different scale factors, if the difference of CIMFE curves of vibration signals of rolling bearings under different states is obvious, they would be used to directly recognize the state of rolling bearings. Otherwise, they would be taken as characteristic vectors and input to LapSVM classifier to conduct the fault recognition for rolling bearings. By using LapSVM, the obtained samples are utilized to train and test the fault classifier, and the diagnosis recognition for rolling bearings are conducted.
The adopted experimental data are derived from the rolling bearing test data supported by Case Western Reserve University in USA [30], and the experimental device is shown in Figure 5. In the test, the tested rolling bearing is 6205-2RS JEM of SKF with fault diameter 0.5334 mm, failure depth 0.2794 mm, speed 1797 r/min and sampling frequency 12 kHz. The vibration signals of rolling bearing with four states, i.e., inner ring fault (IR), outer ring fault (OR), ball element fault (BE) and normal state (Norm) are collected, which are exhibited in Figure 6a-d. As shown in Figure 6, it is difficult to directly distinguish all the states of rolling bearing.   procedure of fault diagnosis. Both sample entropy and fuzzy entropy have two similar advantages: anti-noise and anti-interference. Therefore, a new fault diagnosis method for rolling bearings based on CIMFE and LapSVM is proposed in this paper. Firstly, CIMFEs of vibration signals of rolling bearings under different states are calculated and analyzed at different scale factors, if the difference of CIMFE curves of vibration signals of rolling bearings under different states is obvious, they would be used to directly recognize the state of rolling bearings. Otherwise, they would be taken as characteristic vectors and input to LapSVM classifier to conduct the fault recognition for rolling bearings. By using LapSVM, the obtained samples are utilized to train and test the fault classifier, and the diagnosis recognition for rolling bearings are conducted.
The adopted experimental data are derived from the rolling bearing test data supported by Case Western Reserve University in USA [30], and the experimental device is shown in Figure 5. In the test, the tested rolling bearing is 6205-2RS JEM of SKF with fault diameter 0.5334 mm, failure depth 0.2794 mm, speed 1797 r/min and sampling frequency 12 kHz. The vibration signals of rolling bearing with four states, i.e., inner ring fault (IR), outer ring fault (OR), ball element fault (BE) and normal state (Norm) are collected, which are exhibited in Figure 6a-d. As shown in Figure 6, it is difficult to directly distinguish all the states of rolling bearing.   In addition, 58 samples of rolling bearing vibration signals at each state are selected. The mean and square deviation curves of them is shown in Figure 7a-d, where the data length is set to 2048. Firstly, it can be indicated that when the rolling bearings have different faults, the entropies are different. With the increase of scale factors, MSE, MFE and CMSE curves decrease gradually, and the fluctuation of entropy curves under inner ring fault, outer ring fault and rolling element fault conditions are relatively consistent. However, with the increase of scale factor, the fluctuations of CIMFE curves are smaller and smoother than that of other methods, which is beneficial to improve and promote the recognition performance. Secondly, the vibration signal of rolling bearings at normal state are random and irregular, hence the arising tendency of its entropy curve is expressed with the increase of scale factor, which demonstrates that the normal bearing signals include crucial information both in the lower scales and higher scales. conditions are relatively consistent. However, with the increase of scale factor, the fluctuations of CIMFE curves are smaller and smoother than that of other methods, which is beneficial to improve and promote the recognition performance. Secondly, the vibration signal of rolling bearings at normal state are random and irregular, hence the arising tendency of its entropy curve is expressed with the increase of scale factor, which demonstrates that the normal bearing signals include crucial information both in the lower scales and higher scales. Eventually, due to the working conditions of rolling bearings where the ball element rolls and the inner ring rotates with the rotational motion of axis and the outer ring is fixed on the bearing pedestal. Those result in that the entropies of the rolling bearing vibration signals with ball element faults is larger than that with inner ring faults, which is also larger than that with outer ring faults. In order to effectively conduct an intelligent and efficient fault recognization of rolling bearing, 58 samples of each state i.e., Norm, IR, OR and BE, are divided into 25 marked samples and 33 unmarked samples and then input the LapSVM classifier where the radial basis function is selected as kernel function whose parameter is set to 0.35. Based on the partial two binary tree method, LapSVM multi-fault classifier is established, which is exhibited in Figure 8 and Table 1 where LapSVM1, LapSVM2, LapSVM3, LapSVM4 represent the normal state, inner ring fault, outer ring fault and ball element fault, respectively. Eventually, due to the working conditions of rolling bearings where the ball element rolls and the inner ring rotates with the rotational motion of axis and the outer ring is fixed on the bearing pedestal. Those result in that the entropies of the rolling bearing vibration signals with ball element faults is larger than that with inner ring faults, which is also larger than that with outer ring faults.

Case 2:
In order to effectively conduct an intelligent and efficient fault recognization of rolling bearing, 58 samples of each state i.e., Norm, IR, OR and BE, are divided into 25 marked samples and 33 unmarked samples and then input the LapSVM classifier where the radial basis function is selected as kernel function whose parameter is set to 0.35. Based on the partial two binary tree method, LapSVM multi-fault classifier is established, which is exhibited in Figure 8 and Table 1 where LapSVM1, LapSVM2, LapSVM3, LapSVM4 represent the normal state, inner ring fault, outer ring fault and ball element fault, respectively.  Meanwhile the sample feature vector T is input into LapSVM fault classifier when classification test is conducted. The sample outputs are shown in Figure 9, where all the samples are effectively identified, hence the recognition rate is 100%. Next, the above obtained 58 groups of CIMFE of four states will also be input to SVM classifier to implement fault recognition for rolling bearing. As before, 25 × 4 groups at four states are taken as marked samples and 32 × 4 groups under four states are regarded as unmarked samples all the samples are input to SVM classifier. Hence the outputs are shown in Table 2, where the LibSVM program is utilized. Gaussian radial basis function is selected as kernel function of SVM. Meanwhile, based on particle swarm optimization algorithm [31], the penalty parameter c and kernel function parameter g are determined and optimized. As shown in Table 2 and Figure 10, the recognition rate is 96.97% in which four samples with ball element fault are wrongly treated as that with outer ring fault in SVM3, and four samples with ball element fault are not recognized in SVM4. Eventually, comparing Table 1 and Table 2, when much unmarked samples are adopted, the fault type of rolling bearing can be effectively classified by LapSVM, which indicates that the proposed method has a better recognition performance on fault diagnosis of rolling bearing.   Meanwhile the sample feature vector T is input into LapSVM fault classifier when classification test is conducted. The sample outputs are shown in Figure 9, where all the samples are effectively identified, hence the recognition rate is 100%.  Meanwhile the sample feature vector T is input into LapSVM fault classifier when classification test is conducted. The sample outputs are shown in Figure 9, where all the samples are effectively identified, hence the recognition rate is 100%. Next, the above obtained 58 groups of CIMFE of four states will also be input to SVM classifier to implement fault recognition for rolling bearing. As before, 25 × 4 groups at four states are taken as marked samples and 32 × 4 groups under four states are regarded as unmarked samples all the samples are input to SVM classifier. Hence the outputs are shown in Table 2, where the LibSVM program is utilized. Gaussian radial basis function is selected as kernel function of SVM. Meanwhile, based on particle swarm optimization algorithm [31], the penalty parameter c and kernel function parameter g are determined and optimized. As shown in Table 2 and Figure 10, the recognition rate is 96.97% in which four samples with ball element fault are wrongly treated as that with outer ring fault in SVM3, and four samples with ball element fault are not recognized in SVM4. Eventually, comparing Table 1 and Table 2, when much unmarked samples are adopted, the fault type of rolling bearing can be effectively classified by LapSVM, which indicates that the proposed method has a better recognition performance on fault diagnosis of rolling bearing.  Next, the above obtained 58 groups of CIMFE of four states will also be input to SVM classifier to implement fault recognition for rolling bearing. As before, 25 × 4 groups at four states are taken as marked samples and 32 × 4 groups under four states are regarded as unmarked samples all the samples are input to SVM classifier. Hence the outputs are shown in Table 2, where the LibSVM program is utilized. Gaussian radial basis function is selected as kernel function of SVM. Meanwhile, based on particle swarm optimization algorithm [31], the penalty parameter c and kernel function parameter g are determined and optimized. As shown in Table 2 and Figure 10, the recognition rate is 96.97% in which four samples with ball element fault are wrongly treated as that with outer ring fault in SVM3, and four samples with ball element fault are not recognized in SVM4. Eventually, comparing Tables 1 and 2, when much unmarked samples are adopted, the fault type of rolling bearing can be effectively classified by LapSVM, which indicates that the proposed method has a better recognition performance on fault diagnosis of rolling bearing.  To further illustrate the superiority of the proposed method and the influence of the number of marked samples on recognition performance, comparisons of CIMFE with MSE, MFE and CMSE, or LapSVM with SVM are conducted. Firstly, to illustrate the superiority of CIMFE, MSE, CMSE and MFE are also applied to extract feature information of vibration signals of rolling bearings. Then MSEs, CMSEs, MFEs and CIMFEs are taken as sensitive feature sets and respectively input to LapSVM classifier with the same parameters for state recognition of rolling bearings. When the number of marked samples is set to 5, 10, 15, 20 and 25 separately, the recognition results are illustrated in Table 3. As shown in Table 3, when the number of marked samples is determined, the recognition rate of the MSE and LapSVM-based methods are different from that of other methods which are almost a same constant, i.e., when the number of marked samples is fifteen, the recognition rate of the MSE and LapSVM-based method is 97.41% and those of the other methods are 98.71%. Secondly, with the increase of the number of marked samples, the identification rate of all methods fluctuates slowly and almost tends to be constant, however, the identification rates of the MSE and LapSVM-based method are always smaller than those of other methods. The above analysis indicates that compared to MSE, MFE and CMSE, CIMFE has a better performance when extracting fault features of rolling bearings. In particular, when the number of marked samples is 25, the recognition rate of the proposed method can reach 100%. Next, to illustrate the superiority of LapSVM to SVM, the above entropies of MSE, MFE, CMSE and CIMFE are also input to SVM to conduct the default recognition of rolling bearing and the recognition results are shown in Table 4 where the number of training samples is set to 5, 10, 15, 20 and 25 respectively. As shown in Table 4, when the number of trained samples is determined, the recognition rate of all the SVM-based methods vary. With the increase of the number of trained samples, the recognition rate of all the SVM-based methods fluctuate slightly and in particular, the recognition rate of the CIMFE and SVM-based methods for fault recognition of rolling bearing is always higher than that of the other methods using MSE, MFE and CMSE to extract fault features of rolling bearings. The above analysis indicate that the number of marked samples has some influence on recognition performance. As a measuring complexity method for time series, CIMFE is superiority To further illustrate the superiority of the proposed method and the influence of the number of marked samples on recognition performance, comparisons of CIMFE with MSE, MFE and CMSE, or LapSVM with SVM are conducted. Firstly, to illustrate the superiority of CIMFE, MSE, CMSE and MFE are also applied to extract feature information of vibration signals of rolling bearings. Then MSEs, CMSEs, MFEs and CIMFEs are taken as sensitive feature sets and respectively input to LapSVM classifier with the same parameters for state recognition of rolling bearings. When the number of marked samples is set to 5, 10, 15, 20 and 25 separately, the recognition results are illustrated in Table 3. As shown in Table 3, when the number of marked samples is determined, the recognition rate of the MSE and LapSVM-based methods are different from that of other methods which are almost a same constant, i.e., when the number of marked samples is fifteen, the recognition rate of the MSE and LapSVM-based method is 97.41% and those of the other methods are 98.71%. Secondly, with the increase of the number of marked samples, the identification rate of all methods fluctuates slowly and almost tends to be constant, however, the identification rates of the MSE and LapSVM-based method are always smaller than those of other methods. The above analysis indicates that compared to MSE, MFE and CMSE, CIMFE has a better performance when extracting fault features of rolling bearings. In particular, when the number of marked samples is 25, the recognition rate of the proposed method can reach 100%. Next, to illustrate the superiority of LapSVM to SVM, the above entropies of MSE, MFE, CMSE and CIMFE are also input to SVM to conduct the default recognition of rolling bearing and the recognition results are shown in Table 4 where the number of training samples is set to 5, 10, 15, 20 and 25 respectively. As shown in Table 4, when the number of trained samples is determined, the recognition rate of all the SVM-based methods vary. With the increase of the number of trained samples, the recognition rate of all the SVM-based methods fluctuate slightly and in particular, the recognition rate of the CIMFE and SVM-based methods for fault recognition of rolling bearing is always higher than that of the other methods using MSE, MFE and CMSE to extract fault features of rolling bearings. The above analysis indicate that the number of marked samples has some influence on recognition performance. As a measuring complexity method for time series, CIMFE is superiority to MSE, MFE and CMSE. Finally, comparing Tables 3 and 4, when the method for extracting fault feature is determined, the recognition rate of the LapSVM-based methods has little fluctuation and is always higher than that of the SVM-based methods. By using both SVM and LapSVM to conduct the fault recognition of rolling bearings, the CIMFE-based methods always obtain a better recognition rate. In particular, when the number of marked samples is suitable, the recognition rate of the proposed method can reach 100% which can be seen in Table 3. The comparison analysis shown in Tables 3  and 4, validate the superiority of the proposed method for the fault diagnosis of rolling bearing.

Conclusions
In this paper, a new method for measuring complexity of time series named CIMFE is proposed. The influence of the main parameters i.e., similar tolerance and data length on CIMFEs are researched by analyzing the simulation signals i.e., 1/f noised and Gaussian white noise signal. The results suggest that compared to existing entropy algorithms, i.e., MSE, MFE and CMSE, CIMFE embodies more stability and consistency. Meanwhile, the fluctuations of CIMFE curves are smaller and smoother than those of other methods. In addition, combing CIMFE and LapSVM, a new fault diagnosis method for rolling bearings is proposed and applied to analyzing rolling bearing experimental data. The recognition rates of the proposed CIMFE and LapSVM method are 98.71%, 98.71%, 98.71%, 98.71% and 100% under different training samples (5, 10, 15, 20 and 25), which is higher than that of the existing methods, i.e., MSE and LapSVM, CMSE and LapSVM, MFE and LapSVM. At the same time, the recognition rate of the CIMFE and LapSVM method is higher than that of CIMFE and SVM method under different training samples (5, 10, 15, 20 and 25). Now, the entropy theories have not been widely used on extracting feature of mechanical vibration signal. In this paper, we tried to apply the proposal into practice and both simulation and experimental data analysis were conducted. The following work will be emphasized on the cost-time decrease of the calculation, further optimization of the proposed methods and application on the on-line predicting fault.