An Intelligent Diagnosis Method for Rotating Machinery Using Least Squares Mapping and a Fuzzy Neural Network

Li, Ke; Chen, Peng; Wang, Shiming

doi:10.3390/s120505919


by Ke Li ^1,2, Peng Chen ^1,* and Shiming Wang ^2

^1 Graduate School of Bioresources, Mie University, 1577 Kurimamachiya-cho, Tsu, Mie 514-8507, Japan

^2 College of Engineer Science and Technology, Shanghai Ocean University, No. 999 Hucheng Ring Road, Lingang New City, Shanghai 201306, China

^* Author to whom correspondence should be addressed.

Sensors 2012, 12(5), 5919-5939; https://doi.org/10.3390/s120505919

Submission received: 27 March 2012 / Revised: 2 May 2012 / Accepted: 3 May 2012 / Published: 8 May 2012

(This article belongs to the Section Physical Sensors)


Abstract

This study proposes a new condition diagnosis method for rotating machinery developed using least squares mapping (LSM) and a fuzzy neural network. Non-dimensional symptom parameters (NSPs) in the time domain are defined to reflect the features of the vibration signals measured in each state. A sensitivity evaluation method for selecting good symptom parameters using a detection index (DI) is also proposed for detecting and distinguishing faults in rotating machinery. In order to raise the diagnosis sensitivity of the symptom parameters, synthetic symptom parameters (SSPs) are obtained by LSM. Moreover, possibility theory and the Dempster & Shafer theory (DST) are used to process the ambiguous relationship between symptoms and fault types. Finally, a sequential diagnosis method, using sequential inference and a fuzzy neural network realized by the partially-linearized neural network (PLNN), is also proposed, by which the conditions of rotating machinery can be identified sequentially. Practical examples of fault diagnosis for a roller bearing are shown to verify that the method is effective.

Keywords:

condition diagnosis; least squares mapping; possibility theory; Dempster & Shafer theory; fuzzy neural network

1. Introduction

In the field of machinery diagnosis, vibration signals are often used for fault detection and state discrimination. Machinery diagnosis depends largely on the feature analysis of vibration signals measured for condition diagnosis, because the signals carry dynamic information about the machine state [1–3]. The vibration signals in different states show different features; that is, when plant machinery is in an abnormal state, it outputs signal sets that correspond to the particular fault. However, in most cases of condition diagnosis for rotating machinery, the values of symptom parameters calculated from vibration signals for condition monitoring and fault diagnosis are ambiguous. The main reasons are as follows: (1) When the rotation speed and load of rotating machinery vary while vibration signals are being measured and a fault is at an early stage, the signal contains strong noise, stronger than the actual failure signal, which may lead to misrecognition of useful diagnostic information; (2) The statistical objectivity of the measured signal cannot always be guaranteed because of the measurement techniques and the manner of the inspectors [4]. Therefore, it is important to resolve this ambiguity in fault diagnosis.

Roller bearings are important components that are widely used in rotating machinery. The failure of a rolling bearing may cause the breakdown of a rotating machine, and serious consequences may follow from that failure. Therefore, fault diagnosis of rolling bearings is extremely important for guaranteeing production efficiency and plant safety. Although fault diagnosis of rolling bearings is often carried out manually using time or frequency analysis of vibration signals, a reliable, fast, automated diagnosis method is needed. Neural Networks (NN) have potential applications in automated detection and diagnosis of machine failure [5–9]. However, a conventional NN cannot adequately reflect the possibility of ambiguous diagnosis problems, and never converges when the symptom parameters input to the 1st layer of the NN have the same values in different states [4].

For the above reasons, this paper proposes a novel condition diagnosis method for rotating machinery developed using LSM and a fuzzy neural network realized by the PLNN. The NSPs in the time domain are defined to reflect the vibration signal features measured in each state. To raise the diagnosis sensitivity of the symptom parameters, the SSPs are obtained by LSM. Using statistical theory, a detection index (DI) is also defined to evaluate the applicability of the SSPs. The DI can be used to indicate the fitness of an SSP for the PLNN. A sequential diagnosis approach is also proposed through the PLNN to sequentially identify the fault types of rotating machinery. Diagnostic knowledge for the PLNN is acquired by possibility theory and the DST to solve the problem of ambiguous fault diagnosis. A practical example of condition diagnosis for a roller bearing verifies that the method is effective. The flowchart of the condition diagnostic procedure proposed in this paper is shown in Figure 1.

2. Experimental System for Fault Diagnosis

Figure 2 shows the experimental system for the roller bearing fault diagnosis test. The most commonly occurring faults in a roller element bearing are the outer-race defect, the inner-race defect, and the roller element defect. These faulty bearings are shown in Figure 3; the defects were created artificially using a wire-cutting machine. The specifications of the test bearings, the size of the faults, and other necessary information are listed in Table 1.

In this work, an accelerometer (PCB MA352A60) with a bandwidth from 5 Hz to 60 kHz and a 10 mV/g output was used to measure the vibration signals in the vertical direction in the normal (N), outer-race defect (O), inner-race defect (I), and roller element defect (R) states. The vibration signals measured by the accelerometer were transferred to a signal recorder (Scope Coder DL750) after being amplified by a sensor signal conditioner (PCB ICP Model 480C02). The original vibration signals in each state were measured at a constant speed (1,500 rpm), and a 150 kg load was applied to the rotating shaft by the loading equipment (RCS2-RA13R) while the vibration signals were being measured. A high-pass filter with a 5 kHz cut-off frequency was used to cancel noise in the vibration signals for fault diagnosis. Examples of vibration signals measured in each state after filtering are shown in Figure 4. The sampling frequency of the signal measurement was 50 kHz, and the sampling time was 20 s.
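The measurement settings above imply a few derived quantities that are useful when reading the later sections. The short sketch below only does arithmetic on the stated values; the variable names are illustrative and do not come from the paper.

```python
# Values stated in the text.
SAMPLING_RATE_HZ = 50_000
RECORD_LENGTH_S = 20
SHAFT_SPEED_RPM = 1_500
HIGH_PASS_CUTOFF_HZ = 5_000

# Derived quantities (plain arithmetic, no signal processing).
samples_per_record = SAMPLING_RATE_HZ * RECORD_LENGTH_S          # samples in one 20 s record
shaft_frequency_hz = SHAFT_SPEED_RPM / 60                        # shaft rotations per second
samples_per_revolution = SAMPLING_RATE_HZ / shaft_frequency_hz   # samples in one shaft turn
revolutions_per_record = shaft_frequency_hz * RECORD_LENGTH_S    # shaft turns in one record
nyquist_hz = SAMPLING_RATE_HZ / 2                                # highest representable frequency

# After high-pass filtering, the usable band runs from the cut-off to Nyquist.
usable_band_hz = (HIGH_PASS_CUTOFF_HZ, nyquist_hz)
```

Each record therefore holds one million samples covering 500 shaft revolutions, and the filtered signal occupies the 5 kHz to 25 kHz band.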

3. Non-Dimensional Symptom Parameters and Sensitivity Evaluation

3.1. Non-Dimensional Symptom Parameters for Fault Diagnosis

When a computer is used for condition diagnosis of plant machinery, symptom parameters (SPs) are required to express the information indicated by a signal measured for diagnosing machinery faults. A good symptom parameter can correctly reflect states and the condition trends of plant machinery [10–12]. Many symptom parameters have been defined in the pattern recognition field [13]. Here, eight NSPs in the time domain, commonly used for the fault diagnosis of plant machinery, are considered:

P_{1} = \frac{σ}{\bar{x}}

(1)

P_{2} = \frac{\sum_{i = 1}^{N} x_{i}^{2}}{N σ^{2}}

(2)

P_{3} = \frac{| \sum_{i = 1}^{N} {(x_{i} - \bar{x})}^{3} |}{N σ^{3}}

(3)

P_{4} = \frac{\sum_{i = 1}^{N} {(x_{i} - \bar{x})}^{4}}{N σ^{4}}

(4)

where x_i is the digital data of the vibration signal, x̄ is the mean value of x_i,

\bar{x} = \frac{\sum_{i = 1}^{N} x_{i}}{N}

σ is the standard deviation of x_i,

σ = \sqrt{\frac{\sum_{i = 1}^{N} {(x_{i} - \bar{x})}^{2}}{N - 1}}

and N is the number of x_i:

P_{5} = \frac{| \sum_{i = 1}^{N_{p}} {(x_{p i} - \bar{x_{p}})}^{3} |}{N_{p} {σ_{p}}^{3}}

(5)

P_{6} = \frac{| \sum_{i = 1}^{N_{p}} {(x_{p i} - \bar{x_{p}})}^{4} |}{N_{p} {σ_{p}}^{4}}

(6)

where x_pi are the peak values of x_i, x̄_p and σ_p are the mean value and standard deviation of x_pi, respectively, and N_p is the number of x_pi:

P_{7} = \frac{| \sum_{i = 1}^{N_{v}} {(x_{v i} - \bar{x_{v}})}^{3} |}{N_{v} {σ_{v}}^{3}}

(7)

P_{8} = \frac{| \sum_{i = 1}^{N_{v}} {(x_{v i} - \bar{x_{v}})}^{4} |}{N_{v} {σ_{v}}^{4}}

(8)

where x_vi is the valley value of x_i. x̄_v and σ_v are the mean value and standard deviation of x_vi, respectively. N_v is the number of x_vi.
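Equations (1)–(8) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, peaks and valleys are assumed to be interior local maxima and minima of the sampled signal (the text does not specify how they are detected), and P₁ follows Equation (1) literally, so it is undefined for a zero-mean signal.

```python
import math


def _mean(values):
    return sum(values) / len(values)


def _std(values):
    """Standard deviation with the N - 1 denominator used in the text."""
    m = _mean(values)
    return math.sqrt(sum((v - m) ** 2 for v in values) / (len(values) - 1))


def _normalized_moment(values, order):
    """|sum((v - mean)^order)| / (n * std^order), the form of Eqs. (3)-(8)."""
    m, s = _mean(values), _std(values)
    return abs(sum((v - m) ** order for v in values)) / (len(values) * s ** order)


def local_extrema(x):
    """Assumption: peaks/valleys are strict interior local maxima/minima."""
    peaks, valleys = [], []
    for prev, cur, nxt in zip(x, x[1:], x[2:]):
        if cur > prev and cur > nxt:
            peaks.append(cur)
        elif cur < prev and cur < nxt:
            valleys.append(cur)
    return peaks, valleys


def symptom_parameters(x):
    """Return P1..P8 for one signal segment x (a list of floats)."""
    m, s = _mean(x), _std(x)
    peaks, valleys = local_extrema(x)
    return {
        "P1": s / m,                                       # Eq. (1)
        "P2": sum(v * v for v in x) / (len(x) * s ** 2),   # Eq. (2)
        "P3": _normalized_moment(x, 3),                    # Eq. (3)
        "P4": _normalized_moment(x, 4),                    # Eq. (4)
        "P5": _normalized_moment(peaks, 3),                # Eq. (5)
        "P6": _normalized_moment(peaks, 4),                # Eq. (6)
        "P7": _normalized_moment(valleys, 3),              # Eq. (7)
        "P8": _normalized_moment(valleys, 4),              # Eq. (8)
    }


# Synthetic test signal with a non-zero mean; not data from the experiment.
signal = [1.0 + 0.5 * math.sin(0.7 * i) + 0.2 * math.sin(2.3 * i) for i in range(200)]
params = symptom_parameters(signal)
```

Because every parameter is a ratio of quantities with the same units, multiplying the signal by a positive constant leaves all eight values unchanged, which is the sense in which the parameters are non-dimensional.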

3.2. Detection Index

Suppose that x₁ and x₂ are values of a symptom parameter (SP) calculated from the signals measured in state 1 and state 2, respectively, and that they conform to the normal distributions N(μ₁,σ₁) and N(μ₂,σ₂), respectively. Here, μ and σ are the mean and the standard deviation of the SP. The larger the value of |x₂−x₁|, the higher the sensitivity of the SP in distinguishing the two states. Because z = x₂ − x₁ (with x₁ and x₂ assumed independent) also conforms to a normal distribution, N(μ₂ − μ₁, √(σ₁² + σ₂²)), z has the following density function:

f (z) = \frac{1}{\sqrt{2 π (σ_{1}^{2} + σ_{2}^{2})}} exp {- \frac{{(z - (μ_{2} - μ_{1}))}^{2}}{2 (σ_{1}^{2} + σ_{2}^{2})}}

(9)

where, μ₂ ≥ μ₁ (the same conclusion can be drawn when μ₁ ≥ μ₂). The probability can be calculated with the following formula:

P_{0} = \int_{- \infty}^{0} f (z) d z

(10)

where 1 − P₀ is called the “Discrimination Rate (DR)”. With the substitution:

μ = \frac{z - (μ_{2} - μ_{1})}{\sqrt{σ_{1}^{2} + σ_{2}^{2}}}

(11)

into Equations (9) and (10), the P₀ can be obtained by:

P_{0} = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{- D I} exp (- \frac{μ^{2}}{2}) d μ

(12)

where, the DI (Detection Index) is calculated by:

D I = \frac{μ_{2} - μ_{1}}{\sqrt{σ_{1}^{2} + σ_{2}^{2}}} or D I = \frac{\bar{x_{2}} - \bar{x_{1}}}{\sqrt{σ_{1}^{2} + σ_{2}^{2}}}

(13)

It is obvious that the larger the value of the DI, the larger the value of the “Discrimination Rate (DR = 1 − P₀)” will be, and therefore, the better the SP will be. Thus, the DI can be used as a quality index to evaluate the distinguishing sensitivity of the SP. When M symptom parameters are used to diagnose N fault types, the synthetic detection index (SDI) is defined as follows:

SDI = \sum_{i = 1}^{N - 1} \sum_{j = i + 1}^{N} \sum_{k = 1}^{M} \frac{| μ_{i k} - μ_{j k} |}{\sqrt{σ_{i k}^{2} + σ_{j k}^{2}}}

(14)

Table 2 lists the diagnosis sensitivity standard for condition diagnosis.
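The relationship between the DI, the DR and the SDI can be checked numerically. The sketch below assumes the normal model of this section and evaluates the standard normal integral of Equation (12) with the error function; the function names and the layout of the `stats` argument are illustrative choices, not part of the paper.

```python
import math
from itertools import combinations


def detection_index(mu1, sigma1, mu2, sigma2):
    """DI of Eq. (13); the absolute value covers both orderings of the means."""
    return abs(mu2 - mu1) / math.sqrt(sigma1 ** 2 + sigma2 ** 2)


def discrimination_rate(di):
    """DR = 1 - P0, where P0 is the standard normal integral up to -DI (Eq. 12)."""
    p0 = 0.5 * (1.0 + math.erf(-di / math.sqrt(2.0)))
    return 1.0 - p0


def synthetic_detection_index(stats):
    """SDI of Eq. (14).

    stats[i][k] is the (mean, standard deviation) of parameter k in state i.
    """
    total = 0.0
    for state_a, state_b in combinations(stats, 2):
        for (mu_a, sd_a), (mu_b, sd_b) in zip(state_a, state_b):
            total += detection_index(mu_a, sd_a, mu_b, sd_b)
    return total


# Example: means 0 and 5 with standard deviations 3 and 4 give DI = 5 / 5 = 1.
di = detection_index(0.0, 3.0, 5.0, 4.0)
dr = discrimination_rate(di)
```

A DI of zero gives a DR of 0.5 (no better than chance), and the DR rises monotonically toward 1 as the DI grows.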

4. Synthesizing Symptom Parameter by Least Squares Mapping

In order to raise the diagnosis sensitivity of the symptom parameter, a method for obtaining the new synthetic symptom parameter is proposed as follows. The least squares mapping (LSM) technique aims to increase class separability and consists of the transformation of pattern vectors around arbitrary pre-selected points in the R^C space (where C is the number of states), called the decision space, in such a way that the least squares transformation error is minimized [14,15]. In this section, we propose a method to raise the diagnosis sensitivity by projecting the SPs into the discrimination space using least squares mapping. Let K be the number of SP types (Y_k) and M the number of state categories. In the K-dimensional coordinate space, the endpoint of the vector Y_ij represents state i. Y_ij is given as follows:

Y_{ij} = {y_{i j 1}, y_{i j 2}, \dots, y_{i j K} | i = 1 ~ M, j = 1 ~ N}^{T}

(15)

where N is the number of SP vectors obtained in each state, which is the same for every state.

Y_ij can be projected into a new space L, and the new vector L_ij in the space L can be calculated as follows:

L_{ij} = A Y_{ij}

(16)

where:

L_{ij} = {l_{i j 1}, l_{i j 2}, \dots, l_{i j K} | i = 1 ~ M, j = 1 ~ N}^{T}

(17)

The transformation matrix A is determined by minimizing the least squares error (ε) between the vectors L_ij and V_i for all states, where V_i is an arbitrarily selected vector point in the L space. The selection of the vector V_i is critical for enhancing the sensitivity of the synthetic symptom parameter. In the present work, V_i is chosen as a unit orthogonal vector based on experience.

Figure 5 shows an illustration of the projection by the LSM, where K = 2 and M = 2. Namely, the two states (state 1 and state 2) should be classified using two SP series.

The mean squared error is:

ɛ = \frac{1}{N} \sum_{j = 1}^{N} {‖ L_{ij} - V_{i} ‖}^{2}

(18)

ε minimization is performed by solving the following equation over A:

\nabla_{A} ɛ = 0

(19)

which, in conjunction with (18), leads to:

A = [\sum_{j = 1}^{N} {V_{i} Y_{ij}^{'}}] {[\sum_{j = 1}^{N} {Y_{ij} Y_{ij}^{'}}]}^{- 1}

(20)

When M ≥ 2, A is decided as follows:

A = [\sum_{i = 1}^{M} \sum_{j = 1}^{N} {V_{i} Y_{ij}^{'}}] {[\sum_{i = 1}^{M} \sum_{j = 1}^{N} {Y_{ij} Y_{ij}^{'}}]}^{- 1}

(21)

For diagnosis, the new synthetic symptom parameter can be obtained as follows:

SSP = A \cdot SP

(22)

where SP indicates symptom parameter (here P₁∼P₈).

According to the projected results shown in Figure 5(b), the points in state 1 and state 2 are congregated to vector V₁ and V₂, respectively. The two states in the space L can be distinguished more easily than in the space Y.

To explain the efficiency of the LSM method, some examples are given. In the present example, we used two symptom parameters (P₁ and P₂) to distinguish the inner race defect (I) and roller element defect (R) states of the bearing. SSP₁ and SSP₂ are the new synthetic parameters obtained by the LSM. Tables 3 and 4 show the parameters and the values of the DI and the DR before and after projection by the LSM, respectively. According to those examples, the states can be clearly distinguished by the SSPs, and the sensitivity of the SSPs obtained by the LSM is higher than that of the original SPs. In Tables 3 and 4, μ_p1, μ_p2, μ_ssp1 and μ_ssp2 are the mean values of P₁, P₂, SSP₁ and SSP₂, respectively. σ_p1, σ_p2, σ_ssp1 and σ_ssp2 are the standard deviations of P₁, P₂, SSP₁ and SSP₂, respectively.
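Equations (21) and (22) can be illustrated with a small pure-Python sketch for K = 2 and M = 2, the case drawn in Figure 5. The helper names are hypothetical, the sample vectors are made-up numbers rather than values from Tables 3 and 4, and the matrix inverse uses a simple Gauss-Jordan routine that is only meant for small matrices.

```python
def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    cols = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def invert(m):
    """Gauss-Jordan inverse with partial pivoting; adequate for small K."""
    n = len(m)
    aug = [list(map(float, row)) + [float(i == j) for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("sum of Y Y' is singular; A is not defined")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def lsm_matrix(samples_by_state, targets):
    """A = [sum_i sum_j V_i Y_ij'] [sum_i sum_j Y_ij Y_ij']^-1, as in Eq. (21)."""
    k = len(samples_by_state[0][0])          # number of SPs per vector
    c = len(targets[0])                      # dimension of the decision space
    vy = [[0.0] * k for _ in range(c)]
    yy = [[0.0] * k for _ in range(k)]
    for samples, v in zip(samples_by_state, targets):
        for y in samples:
            for r in range(c):
                for s in range(k):
                    vy[r][s] += v[r] * y[s]
            for r in range(k):
                for s in range(k):
                    yy[r][s] += y[r] * y[s]
    return matmul(vy, invert(yy))


def project(a, y):
    """SSP = A . SP, as in Eq. (22)."""
    return [sum(w * v for w, v in zip(row, y)) for row in a]


# Made-up SP vectors (K = 2) for two states (M = 2); not data from the paper.
state_1 = [[2.0, 1.0], [2.2, 0.9], [1.8, 1.1]]
state_2 = [[1.0, 2.0], [0.9, 2.2], [1.1, 1.8]]
targets = [[1.0, 0.0], [0.0, 1.0]]           # unit orthogonal V_1 and V_2

A = lsm_matrix([state_1, state_2], targets)
ssp_1 = [project(A, y) for y in state_1]
ssp_2 = [project(A, y) for y in state_2]
```

With these numbers, every projected vector of state 1 lands closer to V₁ than to V₂ and vice versa, which is the clustering effect the section describes.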

5. Sequential Diagnosis Method Based on Fuzzy Inference and Dempster & Shafer Theory

5.1. Sequential Condition Diagnosis Approach

In many cases of condition diagnosis, symptom parameters are defined to reflect the features of vibration signals measured in each state in order to diagnose faults. However, it is difficult to find one symptom parameter or a few symptom parameters that can identify all of the faults simultaneously. By contrast, symptom parameters that distinguish between two states are easy to find [16]. To exploit this, a sequential diagnosis method is proposed. In the first step, the normal state (N) can be distinguished from abnormal states using the corresponding possibility of the symptom parameter. In the second step, the outer-race defect (O) can be distinguished from the other abnormal states using the corresponding possibility of the symptom parameter. In the last step, the inner-race defect (I) and the roller element defect (R) states can be distinguished using the corresponding possibility of the symptom parameter. Figure 6 shows the flowchart of sequential condition diagnosis proposed in this study.

As mentioned in Section 3.2, the larger the value of the DI, the better the SP. Therefore, the two SSPs with the highest sensitivity at each diagnostic step are selected by the DI. As an example, parts of the DI values of each SSP and the selection results are shown in Table 5. In the first step, SSP₁ and SSP₅ can distinguish the normal (N) and the abnormal states (O, I and R) more easily than the other SSPs, because all of the DI values of SSP₁ and SSP₅ for distinguishing these states are larger than those of the other SSPs. The SSPs for the other diagnostic steps are selected similarly: SSP₁ and SSP₅ for the second step, and SSP₁ and SSP₂ for the last step. All of those DIs are larger than 2.12, and therefore all of the discrimination rates approach 98.5%.
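The selection of the two best SSPs per step can be sketched as a simple ranking. The criterion below, ranking each SSP by its smallest DI over the state pairs of the step, is an assumption made for illustration (the text only says that the selected SSPs have larger DI values than the others), and the DI values are invented placeholders, not the entries of Table 5.

```python
def select_best_ssps(di_table, count=2):
    """Rank SSPs by their worst-case (minimum) DI and keep the top `count`.

    di_table maps an SSP name to the DI values it achieves for the state
    pairs that must be separated in one diagnostic step.
    """
    ranked = sorted(di_table, key=lambda name: min(di_table[name]), reverse=True)
    return ranked[:count]


# Placeholder DI values for one step (N vs O, N vs I, N vs R); not from Table 5.
step_1 = {
    "SSP1": [5.1, 4.2, 6.0],
    "SSP2": [1.0, 3.0, 2.0],
    "SSP5": [4.0, 4.5, 3.9],
}
selected = select_best_ssps(step_1)
```

Ranking by the minimum rather than the mean guards against an SSP that separates most state pairs well but fails on one of them.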

5.2. Fuzzy Inference by Possibility Theory

In most cases of condition diagnosis for rotating machinery, knowledge of distinguishing faults is ambiguous, because the definite relationships between symptom parameters and fault types, even for a single fault, cannot be easily identified. The values of symptom parameters calculated from vibration signals for fault diagnosis are also ambiguous because of the dispersion in the same state. Therefore, it is necessary to solve the ambiguous problem of fault diagnosis and to express uncertainty about the interpretation of the observable.

Possibility theory is a mathematical theory for dealing with certain types of uncertainty and is an alternative to probability theory. Zadeh first introduced possibility theory in 1978 as an extension of his theory of fuzzy sets and fuzzy logic [17]. Dubois and Prade further contributed to its development [18,19]. Recently, possibility theory has been used for fault diagnosis [16,20]. More details about possibility theory were introduced in references [21–23]. In the present work, possibility theory is applied to solving the ambiguous relationship between the symptom parameters and fault types.

For fuzzy inference, membership functions of SP are necessary. These can be obtained from probability density functions of the symptom parameters using possibility theory. When the probability density function of symptom parameters conforms to the normal distribution, it can be changed to a possibility function P(x_i) using the following formula:

P (x_{i}) = \sum_{k = 1}^{N} min {λ_{i}, λ_{k}}

(23)

where λ_i and λ_k can be calculated as follows:

λ_{i} = \int_{x_{i - 1}}^{x_{i}} \frac{1}{σ \sqrt{2 π}} exp {- \frac{{(x - \bar{x})}^{2}}{2 σ^{2}}} d x

(24)

λ_{k} = \int_{x_{k - 1}}^{x_{k}} \frac{1}{σ \sqrt{2 π}} exp {- \frac{{(x - \bar{x})}^{2}}{2 σ^{2}}} d x

(25)

where σ and x̄ are the standard deviation and the mean value of the SP, respectively, and x = x̄ − 3σ ∼ x̄ + 3σ.
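The conversion in Equation (23) can be sketched numerically. The code below reads λ_i as the normal probability mass of the i-th sub-interval of [x̄ − 3σ, x̄ + 3σ]; that reading, the choice of 60 equal-width sub-intervals, and the function names are assumptions of this sketch rather than details given in the text.

```python
import math


def normal_cdf(x, mean, std):
    return 0.5 * (1.0 + math.erf((x - mean) / (std * math.sqrt(2.0))))


def possibility_function(mean, std, bins=60):
    """Return (bin centres, possibility values) over mean - 3 std .. mean + 3 std."""
    width = 6.0 * std / bins
    edges = [mean - 3.0 * std + n * width for n in range(bins + 1)]
    # lam[i]: probability mass of sub-interval i under the normal density.
    lam = [normal_cdf(b, mean, std) - normal_cdf(a, mean, std)
           for a, b in zip(edges, edges[1:])]
    centres = [(a + b) / 2.0 for a, b in zip(edges, edges[1:])]
    # Eq. (23): P(x_i) = sum over k of min(lam_i, lam_k).
    poss = [sum(min(li, lk) for lk in lam) for li in lam]
    return centres, poss


centres, poss = possibility_function(mean=0.0, std=1.0)
```

The resulting curve is symmetric about the mean and peaks at about 0.997 rather than exactly 1, because the ±3σ range holds only about 99.7% of the probability mass.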

Figure 7 shows an illustration of the possibility function and the probability density function. Figure 8 shows matching examples of the possibility function. In the present example, we used the symptom parameter (x_i) to distinguish state 1, state 2 and the unknown state. P₁(x_i) and P₂(x_i) are the possibility functions for state 1 and state 2, respectively. The possibility function of the unknown state can be calculated as follows:

P_{un} (x_{i}) = max {0, 1 - [P_{1} (x_{i}) + P_{2} (x_{i})]}

(26)

If x_t is the symptom parameter calculated from the data in the state to be diagnosed, the matching degrees with a relevant level are calculated as follows:

State 1 level : W_{1} = P_{1} (x_{i}) \cap x_{t}

(27)

State 2 level : W_{2} = P_{2} (x_{i}) \cap x_{t}

(28)

Unknown state level : W_{u n} = P_{u n} (x_{i}) \cap x_{t}

(29)

where W₁, W₂ and W_un express the possibilities of state 1, state 2 and the unknown state, respectively. These degrees are normalized by

W_{1} + W_{2} + W_{un} = 1

(30)

Fuzzy systems rely on a set of rules. In this study, to correctly and effectively identify the condition and the fault type of rotating machinery, we have obtained the following “if-then” rules for condition diagnosis.

Rule 1: if x_i < x̄_1i − 3σ₁ and x_i < x̄_2i − 3σ₂ then W₁ = 0, W₂ = 0, W_un = 1;
Rule 2: if x_i > x̄_1i + 3σ₁ and x_i > x̄_2i + 3σ₂ then W₁ = 0, W₂ = 0, W_un = 1;
Rule 3: if x̄_1i − 3σ₁ ≤ x_i ≤ x̄_1i + 3σ₁ then 0 ≤ W₁ ≤ 1, 0 ≤ W₂ ≤ 1, 0 ≤ W_un < 1;
Rule 4: if x̄_2i − 3σ₂ ≤ x_i ≤ x̄_2i + 3σ₂ then 0 ≤ W₁ ≤ 1, 0 ≤ W₂ ≤ 1, 0 ≤ W_un < 1;

where x̄_1i and x̄_2i are the mean values of symptom parameter x_i in states 1 and 2, respectively; σ₁ and σ₂ are the standard deviations of symptom parameter x_i in states 1 and 2, respectively. In rules 3 and 4, the possibilities W₁, W₂ and W_un can be obtained by Equations (27)–(29), respectively.
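Equations (26)–(30) and the four rules can be condensed into one small function. In this sketch the matching in Equations (27)–(29) is read as evaluating each possibility function at the observed value x_t, and the normalization of Equation (30) is done by dividing by the sum; both readings are assumptions, as is the function name.

```python
def matching_degrees(p1, p2):
    """Return (W1, W2, W_un) from the possibility values P1(x_t) and P2(x_t).

    p1 and p2 are expected in [0, 1] and are taken to be 0 outside the
    mean +/- 3 sigma range of their state, which reproduces rules 1 and 2.
    """
    p_un = max(0.0, 1.0 - (p1 + p2))      # Eq. (26)
    total = p1 + p2 + p_un                # always >= 1, so never zero
    return p1 / total, p2 / total, p_un / total   # Eq. (30)


outside_both = matching_degrees(0.0, 0.0)   # rules 1 and 2
partial = matching_degrees(0.6, 0.2)        # rules 3 and 4, no overlap excess
overlapping = matching_degrees(0.9, 0.6)    # P1 + P2 > 1, so W_un = 0
```

When P₁ + P₂ does not exceed 1, the three degrees already sum to 1 and the division changes nothing; the rescaling only matters where the two possibility functions overlap strongly.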

5.3. Dempster & Shafer Theory

Dempster & Shafer theory (DST) provides a rational inference mechanism for the combination relation in the diagnosis problems with uncertainty [24–28]. To obtain the results of the condition diagnosis by fuzzy inference, the combination functions of the symptom parameters are necessary. In the present work, the combining possibility function of the symptom parameters (SP_i and SP_j) can be obtained by the Dempster & Shafer theory (DST).

Suppose that W_i(A_m) is the possibility of SP_i in state A_m and W_j(A_k) is the possibility of SP_j in state A_k, where A_m and A_k are state sets and m, k ∈ {1, 2, …, n}. W(S)′ is the combination possibility function of SP_i and SP_j, where S = A_m ∩ A_k. Then W(S)′ can be obtained by:

W {(S)}^{'} = \frac{\sum_{A_{m} \cap A_{k} = S} W_{i} (A_{m}) \cdot W_{j} (A_{k})}{1 - \sum_{A_{m} \cap A_{k} = Φ} W_{i} (A_{m}) \cdot W_{j} (A_{k})}

(31)

where Φ expresses an empty set.

As mentioned above, the combination possibility functions of SSPs in each sequential diagnosis step are obtained as follows. In the first step of the sequential diagnosis, the normalized combination possibility functions of the normal state possibility W(N)′, bearing fault state possibility W(B)′ and unknown state possibility W(U)′ can be obtained through the possibilities W_i(…) and W_j(…) of SSP_i and SSP_j (here i = 1, and j = 5), respectively, as follows:

W {(N)}^{'} = \frac{W_{i} (N) \cdot W_{j} (N) + W_{i} (N) \cdot W_{j} (U) + W_{i} (U) \cdot W_{j} (N)}{1 - W_{i} (N) \cdot W_{j} (B) - W_{i} (B) \cdot W_{j} (N)}

(32)

W {(B)}^{'} = \frac{W_{i} (B) \cdot W_{j} (B) + W_{i} (B) \cdot W_{j} (U) + W_{i} (U) \cdot W_{j} (B)}{1 - W_{i} (N) \cdot W_{j} (B) - W_{i} (B) \cdot W_{j} (N)}

(33)

W {(U)}^{'} = \frac{W_{i} (U) \cdot W_{j} (U)}{1 - W_{i} (N) \cdot W_{j} (B) - W_{i} (B) \cdot W_{j} (N)}

(34)

where W_i(N), W_i(B) and W_i(U) are possibilities of normal state (N), bearing fault state (B) and unknown state (U) obtained by SSP_i, respectively. W_j(N), W_j(B) and W_j(U) are possibilities of normal state (N), bearing fault state (B) and unknown state (U) obtained by SSP_j, respectively.

In the second step of the sequential diagnosis, the normalized combination possibility functions of the outer-race defect possibility W(O)′, other bearing defects possibility W(IR)′, and the unknown state possibility W(U)′ can be obtained through the possibilities W_i(…) and W_j(…) of SSP_i and SSP_j (here, i = 1, j = 5), respectively, as follows:

W {(O)}^{'} = \frac{W_{i} (O) \cdot W_{j} (O) + W_{i} (O) \cdot W_{j} (U) + W_{i} (U) \cdot W_{j} (O)}{1 - W_{i} (O) \cdot W_{j} (I R) - W_{i} (I R) \cdot W_{j} (O)}

(35)

W {(I R)}^{'} = \frac{W_{i} (I R) \cdot W_{j} (I R) + W_{i} (I R) \cdot W_{j} (U) + W_{i} (U) \cdot W_{j} (I R)}{1 - W_{i} (O) \cdot W_{j} (I R) - W_{i} (I R) \cdot W_{j} (O)}

(36)

W {(U)}^{'} = \frac{W_{i} (U) \cdot W_{j} (U)}{1 - W_{i} (O) \cdot W_{j} (I R) - W_{i} (I R) \cdot W_{j} (O)}

(37)

where W_i(O), W_i(IR) and W_i(U) are possibilities of outer-race defect (O), other bearing defects (IR) and unknown state (U) obtained by SSP_i, respectively. W_j(O), W_j(IR) and W_j(U) are possibilities of outer-race defect (O), other bearing defects (IR) and unknown state (U) obtained by SSP_j, respectively.

In the last step of the sequential diagnosis, the normalized combination possibility functions of the inner race defect possibility W(I)′, rolling element defect possibility W(R)′, and unknown state possibility W(U)′ can be obtained through the possibilities W_i(…) and W_j(…) of SSP_i and SSP_j (here, i = 1, j = 2), respectively, as follows:

W {(I)}^{'} = \frac{W_{i} (I) \cdot W_{j} (I) + W_{i} (I) \cdot W_{j} (U) + W_{i} (U) \cdot W_{j} (I)}{1 - W_{i} (I) \cdot W_{j} (R) - W_{i} (R) \cdot W_{j} (I)}

(38)

W {(R)}^{'} = \frac{W_{i} (R) \cdot W_{j} (R) + W_{i} (R) \cdot W_{j} (U) + W_{i} (U) \cdot W_{j} (R)}{1 - W_{i} (I) \cdot W_{j} (R) - W_{i} (R) \cdot W_{j} (I)}

(39)

W {(U)}^{'} = \frac{W_{i} (U) \cdot W_{j} (U)}{1 - W_{i} (I) \cdot W_{j} (R) - W_{i} (R) \cdot W_{j} (I)}

(40)

where W_i(I), W_i(R) and W_i(U) are possibilities of inner race defect (I), rolling element defect (R) and unknown state (U) obtained by SSP_i, respectively. W_j(I), W_j(R) and W_j(U) are possibilities of inner race defect (I), rolling element defect (R) and unknown state (U) obtained by SSP_j, respectively.
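All three steps use the same two-state form of Equation (31), so one function covers Equations (32)–(40). The sketch below treats the unknown state U as the whole frame (so that a state intersected with U is that state), which is what the numerators of the equations imply; the function name, the dictionary layout and the numbers are illustrative assumptions.

```python
def combine(w_i, w_j, a, b, u="U"):
    """Combine two possibility assignments over states a, b and unknown u.

    w_i and w_j map the three labels to values that each sum to 1.
    """
    # Conflict: one parameter supports a while the other supports b.
    conflict = w_i[a] * w_j[b] + w_i[b] * w_j[a]
    if conflict >= 1.0:
        raise ValueError("the two parameters are in total conflict")
    norm = 1.0 - conflict

    def support(s):
        # Products whose intersection is s: (s, s), (s, U) and (U, s).
        return w_i[s] * w_j[s] + w_i[s] * w_j[u] + w_i[u] * w_j[s]

    return {
        a: support(a) / norm,
        b: support(b) / norm,
        u: w_i[u] * w_j[u] / norm,
    }


# First step (N versus B) with made-up possibilities for two SSPs.
w_first = {"N": 0.7, "B": 0.1, "U": 0.2}
w_second = {"N": 0.6, "B": 0.2, "U": 0.2}
combined = combine(w_first, w_second, "N", "B")
```

For these inputs the combined possibility of the normal state rises to 0.85, higher than either parameter gives alone, because the two parameters agree; the second and third steps reuse the function with the labels ("O", "IR") and ("I", "R").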

6. Fuzzy Neural Network for Fault Diagnosis

The main mathematic symbols used in Section 6 are:

N_m: the number of neurons in the m-th layer of an NN, m = 1 to M.

$X^{(1)} = {X_{i}^{(1, j)}}$ : the pattern input to the 1st layer. Here, $X_{i}^{(1, j)}$ is the value input to the j-th neuron in the input (1st) layer, i = 1 to P, j = 1 to N₁.

$X^{(M)} = {X_{i}^{(M, k)}}$ : the training (teaching) data for the last layer (M-th layer). Here, $X_{i}^{(M, k)}$ is the output value of the k-th neuron in the output (M-th) layer; k = 1 to N_M.

$X^{(1) *} = {X_{i}^{(1, j) *}}$ and $X^{(M) *} = {X_{i}^{(M, k) *}}$ : new data that has not yet been learnt by the NN.

$X_{i}^{(m, t)}$ : the value of the t-th neuron in the hidden (m-th) layer; t = 1 to N_m.

$W_{u v}^{(m)}$ : the weight between the u-th neuron in the m-th layer and the v-th neuron in the (m+1)-th layer, m = 1 to M − 1; u = 1 to N_m; v = 1 to N_{m+1}.

The fuzzy neural network is applied to diagnose the fault types of a rolling bearing by the sequential diagnosis algorithm, and is realized with a back propagation neural network variant called “the partially-linearized neural network” (PLNN). The back propagation neural network is used only for training on the data, and the PLNN is used for testing the trained NN. The basic principle of the PLNN for fault diagnosis is described as follows.

The number of neurons in the m-th layer of an NN is N_m. The set $X^{(1)} = {X_{i}^{(1, j)}}$ represents the pattern input to the 1st layer and the set $X^{(M)} = {X_{i}^{(M, k)}}$ is the training data for the last layer (M-th layer). Here, i = 1 to P, j = 1 to N₁, and k = 1 to N_M; $X_{i}^{(1, j)}$ is the value input to the j-th neuron in the input (1st) layer, and $X_{i}^{(M, k)}$ is the output value of the k-th neuron in the output (M-th) layer.

Even if the NN converges by learning X⁽¹⁾ and X^(M), it cannot adequately deal with the ambiguous relationship between the new X⁽¹⁾* and X^(M)*, which has not been learnt. In order to predict X^(M)* according to the probability distribution of X⁽¹⁾*, partial linear interpolation of the NN is introduced as shown in Figure 9.

In the NN that has converged with the data X⁽¹⁾ and X^(M), the following symbols are used:

$X_{i}^{(m, t)}$ : the value of the t-th neuron in the hidden (m-th) layer; t = 1 to N_m.

$W_{u v}^{(m)}$ : the weight between the u-th neuron in the m-th layer and the v-th neuron in the (m+1)-th layer, m = 1 to M − 1; u = 1 to N_m; v = 1 to N_{m+1}.

If all these values are memorized by the computer, when new values $X_{j}^{(1, u) *}$ ( $X_{i}^{(1, u)} < X_{j}^{(1, u) *} < X_{i + 1}^{(1, u)}$ ) are input into the first layer, the predicted value of the v-th neuron (v = 1 to N_{m+1}) in the (m+1)-th layer (m = 1 to M − 1) can be estimated by:

X_{j}^{(m + 1, v)} = X_{i + 1}^{(m + 1, v)} - \frac{[\sum_{u = 1}^{N_{m}} W_{u v}^{(m)} (X_{i + 1}^{(m, u)} - X_{j}^{(m, u)})] (X_{i + 1}^{(m + 1, v)} - X_{i}^{(m + 1, v)})}{\sum_{u = 1}^{N_{m}} W_{u v}^{(m)} (X_{i + 1}^{(m, u)} - X_{i}^{(m, u)})}

(41)

Using the operation above, the sigmoid function is partially linearized, as shown in Figure 9. If a function must be learned, the PLNN will learn the points indicated by the ● symbols shown in Figure 8. When new data (s₁′, s₂′) are input into the converged PLNN, the values depicted by the ■ symbols corresponding to the data (s₁′, s₂′) will quickly be identified as P_e. Thus, the PLNN can be used to deal with ambiguous diagnosis problems.
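One layer-to-layer step of Equation (41) can be sketched as follows. The function interpolates the next-layer value between two learned patterns (i and i + 1) in proportion to the weighted input difference, instead of passing the new input through the sigmoid. The argument names are hypothetical, the weights and values are made-up numbers, and the handling of a zero denominator is an assumption of this sketch.

```python
def plnn_interpolate(weights, x_lo, x_hi, x_new, y_lo, y_hi):
    """Estimate layer-(m+1) values for a new pattern, following Eq. (41).

    weights[u][v]: weight from neuron u in layer m to neuron v in layer m+1.
    x_lo, x_hi:    layer-m values of the learned patterns i and i+1.
    x_new:         layer-m values of the new pattern j.
    y_lo, y_hi:    layer-(m+1) values of the learned patterns i and i+1.
    """
    n_in = len(x_new)
    y_new = []
    for v in range(len(y_lo)):
        num = sum(weights[u][v] * (x_hi[u] - x_new[u]) for u in range(n_in))
        den = sum(weights[u][v] * (x_hi[u] - x_lo[u]) for u in range(n_in))
        if den == 0.0:
            # Assumption: identical weighted inputs give identical outputs,
            # so there is nothing to interpolate for this neuron.
            y_new.append(y_hi[v])
        else:
            y_new.append(y_hi[v] - num * (y_hi[v] - y_lo[v]) / den)
    return y_new


# Made-up numbers: two neurons in layer m, two neurons in layer m+1.
W = [[1.0, 0.5], [-0.5, 2.0]]
x_i, x_i1 = [0.0, 0.0], [1.0, 1.0]
y_i, y_i1 = [0.2, 0.1], [0.6, 0.9]
midpoint = plnn_interpolate(W, x_i, x_i1, [0.5, 0.5], y_i, y_i1)
```

At the two learned patterns the function returns the memorized outputs unchanged, and an input halfway between them returns the halfway output, which is the piecewise-linear behaviour the section describes.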

As shown in Figure 10, the new data (s₁′, s₂′) input into the converged PLNN, and which are not learnt by the PLNN for recognizing, must satisfy the following condition:

S_{1 (min)} < S_{1}' < S_{1 (max)} and S_{2 (min)} < S_{2}' < S_{2 (max)}

(42)

where S_1(min), S_2(min) and S_1(max), S_2(max) are the minimum values and the maximum values of S₁ and S₂, respectively, which have been learned by the PLNN. Therefore, in this work, the values (P_i* and P_j*) of symptom parameters input to the PLNN for fault diagnosis must satisfy the following condition:

P_{i (min)} < {P_{i}}^{*} < P_{i (max)} and P_{j (min)} < {P_{j}}^{*} < P_{j (max)}

(43)

where P_i(min), P_j(min) and P_i(max), P_j(max) are the minimum values and the maximum values of P_i and P_j, respectively.
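The range condition of Equations (42) and (43) amounts to a guard that can run before any value is passed to the trained network. A minimal sketch, with a hypothetical function name and made-up learned values:

```python
def within_learned_range(value, learned_values):
    """True if value lies strictly between the learned minimum and maximum."""
    return min(learned_values) < value < max(learned_values)


# Made-up learned values for two symptom parameters.
learned_p_i = [0.12, 0.35, 0.80]
learned_p_j = [1.5, 2.1, 2.9]

new_p_i, new_p_j = 0.50, 2.0
can_diagnose = (within_learned_range(new_p_i, learned_p_i)
                and within_learned_range(new_p_j, learned_p_j))
```

Values outside the learned range would require extrapolation, which the partial linear interpolation is not designed to provide.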

7. Diagnosis and Verification

Figure 11 shows the PLNNs constructed for the condition diagnosis, each of which consists of the first layer, the hidden layer and the last layer. The SSPs selected by the DI are input into the neurons in the first layer. The number of neurons in the hidden layer is eighty. The outputs in the last layer are W(N)′, W(B)′, W(O)′, W(IR)′, W(I)′, W(R)′ and W(U)′, which are the possibility grades of the normal state, bearing fault state, outer race defect state, other bearing defect state, inner race defect state, rolling element defect state and unknown state, respectively.

In this study, the diagnosis knowledge for training of the PLNN is acquired by the possibility theory and the Dempster & Shafer theory (DST). The possibility functions of the SSPs used for each diagnostic step, as examples, are shown in Figures 12–14, respectively.

In Figure 12, P(N), P(B) and P(U) are the possibility functions of the normal, bearing defect and unknown states, respectively. Using the matching method explained in Section 5.2, the possibilities W₁(N), W₁(B) and W₁(U) of SSP₁ in the normal, bearing defect and unknown states can be obtained, respectively; the possibilities W₅(N), W₅(B) and W₅(U) of SSP₅ in the same states can be obtained likewise.

In Figure 13, P(O), P(IR) and P(U) are the possibility functions of the outer-race defect, other bearing faults (the rolling element defect and the inner-race defect), and the unknown states, respectively. Using the matching method explained in Section 5.2, the possibilities W₁(O), W₁(IR) and W₁(U) of SSP₁ in the outer-race defect, other bearing faults and unknown states can be obtained, respectively; the possibilities W₅(O), W₅(IR) and W₅(U) of SSP₅ in the same states can be obtained likewise.

In Figure 14, P(I), P(R) and P(U) are the possibility functions of the inner-race defect, the rolling element defect and the unknown states, respectively. Using the matching method explained in Section 5.2, the possibilities W₁(I), W₁(R) and W₁(U) of SSP₁ in the inner-race defect, rolling element defect and unknown states can be obtained, respectively; the possibilities W₂(I), W₂(R) and W₂(U) of SSP₂ in the same states can be obtained likewise.

After obtaining the possibilities of the SSPs for each diagnostic step, the combination possibility function of each state W(N)′, W(B)′, W(O)′, W(IR)′, W(I)′, W(R)′ and W(U)′ can be obtained by the Dempster & Shafer theory. As an example, parts of training data for each diagnosis step are shown in Tables 6–8.

In order to verify the diagnostic capability of the PLNN, we used data measured in each state that had not been learned by the PLNN. When the test data were input into the trained PLNNs, the networks correctly and quickly diagnosed the faults with the possibility grades of the corresponding states. The diagnosis results are shown in Tables 9–11.

According to the diagnosis results above, the normal (N), the outer-race defect (O), the inner-race defect (I), and the roller element defect (R) states of roller bearing can be automatically and correctly identified using the diagnosis methods proposed in this paper.

8. Conclusions

In order to solve the problem of ambiguity between the symptom parameters and fault types, effectively diagnose faults and automatically identify the condition of a rotating machine, an intelligent diagnosis method was proposed on the basis of the least squares mapping (LSM) and a fuzzy neural network. The main conclusions can be summarized as follows:

A sequential diagnosis method was proposed through which the fuzzy neural network, realized by the partially-linearized neural network (PLNN), can sequentially distinguish fault types.
Knowledge for training the PLNN was acquired using possibility theory and the Dempster & Shafer theory (DST). A method was proposed for establishing the membership function by converting the probability distribution function of the symptom parameters into a possibility function, and the combined possibility functions of several symptom parameters were obtained by the DST.
Eight non-dimensional symptom parameters in the time domain were defined to reflect the features of the vibration signals measured in each state. To raise the diagnosis sensitivity of the symptom parameters, new synthetic symptom parameters (SSPs) were obtained by the LSM method.
The detection index (DI), based on statistical theory, was defined to evaluate the applicability of the SSPs. The DI can be used to select better SSPs for the PLNN.
Practical examples of fault diagnosis for a roller bearing verified the effectiveness of the proposed method. The diagnosis results showed that the faults were sequentially and automatically diagnosed on the basis of the possibilities of the symptom parameters.

Figure 1. Flowchart of the condition diagnosis.

Figure 2. Experimental setup for rolling bearing fault diagnosis.

Figure 3. Bearing defects. (a) Outer-race defect; (b) Inner-race defect; (c) Roller defect.

Figure 4. Vibration signals of bearings after filtering.

Figure 5. Example of projection by the LSM: (a) before projection; (b) after projection.

Figure 6. Flowchart of sequential condition diagnosis.

Figure 7. Possibility function and the probability density function.

Figure 8. Matching examples of possibility function.

Figure 9. The partial linearization of the sigmoid function.

Figure 10. Interpolation by the PLNN.

Figure 11. Partially-linearized neural network for condition diagnosis.

Figure 12. Possibility functions of (a) SSP₁ and (b) SSP₅ for the first diagnostic step.

Figure 13. Possibility functions of (a) SSP₁ and (b) SSP₅ for the second diagnostic step.

Figure 14. Possibility functions of (a) SSP₁ and (b) SSP₂ for the third diagnostic step.

Table 1. Bearing information for verification.

Contents	Parameters
Bearing outer diameter	52 mm
Bearing inner diameter	25 mm
Bearing width	15 mm
Bearing roller diameter	7 mm
The number of the rollers	11
Contact angle	0 rad
Outer-race defect	0.3 × 0.25 mm (width × depth); Early stage
Inner-race defect	0.3 × 0.25 mm (width × depth); Early stage
Rolling element defect	0.3 × 0.25 mm (width × depth); Early stage

Table 2. Diagnosis sensitivity for condition diagnosis.

Detection Index	Discrimination Rate	Sensitivity
<0.85	<80%	Low
0.85–1.30	80%–90%	Slightly low
1.30–1.65	90%–95%	Middle
1.65–2.33	95%–99%	High
>2.33	>99%	Very high

Table 3. Values of DR and DI before projection.

	P₁			P₂		
State	μ_P1	σ_P1	DI_P1 (DR_P1)	μ_P2	σ_P2	DI_P2 (DR_P2)
I	2.38	0.35	1.12 (86.9%)	0.72	0.17	1.19 (87.3%)
R	3.12	0.56	1.12 (86.9%)	0.435	0.168	1.19 (87.3%)
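The DI values in Tables 3 and 4 can be checked with a short calculation. The sketch below assumes that the detection index of Section 3.2 takes the form DI = |μ_a − μ_b| / sqrt(σ_a² + σ_b²) for two states a and b; the function name `detection_index` is hypothetical. The inputs are the means and standard deviations of the inner-race (I) and rolling element (R) states listed in the two tables.

```python
import math


def detection_index(mu_a, sigma_a, mu_b, sigma_b):
    """Separation of two states: distance between means over the pooled spread (assumed form)."""
    return abs(mu_a - mu_b) / math.sqrt(sigma_a ** 2 + sigma_b ** 2)


# (mu_I, sigma_I, mu_R, sigma_R) taken from Table 3 (before projection)
before = {"P1": (2.38, 0.35, 3.12, 0.56), "P2": (0.72, 0.17, 0.435, 0.168)}
# (mu_I, sigma_I, mu_R, sigma_R) taken from Table 4 (after projection)
after = {"SSP1": (3.99, 0.37, 5.13, 0.32), "SSP2": (1.025, 0.0022, 1.032, 0.0022)}

for name, stats in {**before, **after}.items():
    print(name, round(detection_index(*stats), 2))
```

Under this assumption the computed values come out close to the tabulated DI values, with small differences attributable to rounding of the tabulated means and standard deviations. The increase from roughly 1.1–1.2 before projection to above 2.2 after projection is the improvement in sensitivity obtained by the LSM.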

Table 4. Values of DR and DI after projection.

	SSP₁			SSP₂		
State	μ_SSP1	σ_SSP1	DI_SSP1 (DR_SSP1)	μ_SSP2	σ_SSP2	DI_SSP2 (DR_SSP2)
I	3.99	0.37	2.34 (99.04%)	1.025	0.0022	2.25 (98.8%)
R	5.13	0.32	2.34 (99.04%)	1.032	0.0022	2.25 (98.8%)
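The relation between the detection index and the discrimination rate can be sketched numerically. The code below assumes that DR is the standard normal cumulative distribution evaluated at DI, which reproduces the class boundaries of Table 2 (for example, DI = 1.65 gives about 95%); the function names and the choice of inclusive lower bounds for the classes are assumptions made for illustration.

```python
import math


def discrimination_rate(di):
    """Standard normal CDF evaluated at the detection index (assumed DR-DI relation)."""
    return 0.5 * (1.0 + math.erf(di / math.sqrt(2.0)))


def sensitivity_level(di):
    """Map a DI value to the classes of Table 2 (lower bounds taken as inclusive)."""
    if di < 0.85:
        return "Low"
    if di < 1.30:
        return "Slightly low"
    if di < 1.65:
        return "Middle"
    if di < 2.33:
        return "High"
    return "Very high"


# Class boundaries of Table 2, followed by DI values from Tables 3 and 4.
for di in (0.85, 1.30, 1.65, 2.33, 1.12, 2.25, 2.34):
    print(di, f"{discrimination_rate(di):.2%}", sensitivity_level(di))
```

In these terms, the projection moves the I:R pair from the "Slightly low" class (DI of about 1.1–1.2) to the "High" or "Very high" class.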

Table 5. DI values of SSPs for each sequential diagnosis step.

DI Values of Each SSP
	SSP₁	SSP₂	SSP₃	SSP₄	SSP₅	SSP₆	SSP₇	SSP₈
For first step
N:O	13.86	3.11	1.43	2.10	10.38	4.93	9.48	7.11
N:I	2.92	2.20	1.11	2.39	3.08	2.76	2.72	2.56
N:R	4.81	3.37	0.77	1.06	3.43	1.23	2.27	1.06
For second step
O:I	4.69	0.70	0.88	2.05	3.62	2.52	3.31	2.31
O:R	3.01	2.41	1.56	0.80	2.35	1.04	1.00	0.80
For third step
I:R	2.34	2.12	1.22	1.63	1.03	0.70	1.45	1.11
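One way to read Table 5 is as a selection problem: for each step, choose the SSPs that separate all state pairs of that step well. The sketch below assumes a worst-case criterion, ranking each SSP by its smallest DI over the pairs of the step and keeping the two best; this criterion and the names `di_table` and `select_ssps` are assumptions for illustration, although the result coincides with the SSPs used in Tables 6–8.

```python
# DI values copied from Table 5; columns are SSP1..SSP8.
di_table = {
    "first": {
        "N:O": [13.86, 3.11, 1.43, 2.10, 10.38, 4.93, 9.48, 7.11],
        "N:I": [2.92, 2.20, 1.11, 2.39, 3.08, 2.76, 2.72, 2.56],
        "N:R": [4.81, 3.37, 0.77, 1.06, 3.43, 1.23, 2.27, 1.06],
    },
    "second": {
        "O:I": [4.69, 0.70, 0.88, 2.05, 3.62, 2.52, 3.31, 2.31],
        "O:R": [3.01, 2.41, 1.56, 0.80, 2.35, 1.04, 1.00, 0.80],
    },
    "third": {
        "I:R": [2.34, 2.12, 1.22, 1.63, 1.03, 0.70, 1.45, 1.11],
    },
}


def select_ssps(rows, count=2):
    """Pick the `count` SSPs whose smallest DI over all state pairs is largest."""
    worst = [min(column) for column in zip(*rows.values())]
    ranked = sorted(range(len(worst)), key=lambda i: worst[i], reverse=True)
    return sorted(f"SSP{i + 1}" for i in ranked[:count])


selection = {step: select_ssps(rows) for step, rows in di_table.items()}
for step, names in selection.items():
    print(step, names)
```

Ranking by the smallest DI guards against an SSP that separates one pair of states very well but fails on another, such as SSP₂ in the second step (DI of 0.70 for O:I).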

Table 6. Training data for first step of sequential diagnosis.

SSP₁	SSP₅	W(N)′	W(B)′	W(U)′
1.245	0	0	0	1
2.76	38.7	0.5	0.02	0.48
5.35	0.665	0.333	0.38	0.287
4.52	6.18	0	1	0
…	…	…	…	…

Table 7. Training data for second step of sequential diagnosis.

SSP₁	SSP₅	W(O)′	W(IR)′	W(U)′
3.15	6.17	0	0	1
4.13	6.42	0.333	0.333	0.333
6.08	6.5	0.978	0	0.022
5.04	15.1	0	1	0
…	…	…	…	…

Table 8. Training data for third step of sequential diagnosis.

SSP₁	SSP₂	W(I)′	W(R)′	W(U)′
2.5	1.01	0	0	1
3.835	1.021	0.75	0	0.25
5.332	1.021	0.333	0.333	0.333
5.66	1.032	0.057	0.943	0
…	…	…	…	…

Table 9. Verification result of first step.

SSP₁	SSP₅	W(N)′	W(B)′	W(U)′	Judge
3.025	1.854	0.811	0.112	0.105	N
2.882	1.615	0.796	0.157	0.138	N
4.260	26.05	0.0002	0.8405	0.1691	B
4.961	15.53	0.0002	0.8561	0.1462	B
1.579	30.56	0.036	0.0928	0.9075	U
…	…	…	…	…	…

Table 10. Verification result of second step.

SSP₁	SSP₅	W(O)′	W(IR)′	W(U)′	Judge
6.10	6.33	0.8607	0.0021	0.1511	O
6.104	6.84	0.9105	0.0059	0.1023	O
4.22	18.44	0.1265	0.8365	0.0732	I or R
5.36	9.93	0.0671	0.8012	0.1747	I or R
2.01	25.5	0.1011	0.0936	0.8228	U
…	…	…	…	…	…

Table 11. Verification result of third step.

SSP₁	SSP₂	W(I)′	W(R)′	W(U)′	Judge
3.81	1.025	0.9541	0.0035	0.1231	I
4.09	1.029	0.9027	0.0071	0.1096	I
5.26	1.031	0.0082	0.8974	0.1217	R
4.73	1.033	0.0047	0.9127	0.1056	R
6.69	0.83	0.0767	0.0458	0.9279	U
…	…	…	…	…	…

© 2012 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
