Fault Diagnosis of Loader Gearbox Based on an ICA and SVM Algorithm

When a part of the loader’s gearbox fails, this can lead to equipment failure due to the complex internal structure and the interrelationship between the parts. Therefore, it is imperative to research an efficient strategy for transmission fault diagnosis. In this study, the non-contact characteristics of noise diagnosis using sound intensity probes were used to collect noise signals generated under gear breaking conditions. The independent component analysis (ICA) technique was applied for feature extraction from the original data and to reduce the correlation between the signals. The correlation coefficient between the independent components and the source data was used as the input parameters of the support vector machine (SVM) classifier. The separation of the independent components was achieved by MATLAB simulation. The misdiagnosis rate was 5% for 40 sets of test data. A 13-point test platform for noise testing of the loader gearbox was built according to Chinese national standards. Source signals under the normal and fault conditions were analyzed separately by ICA and SVM algorithms. In this case, the misdiagnosis rate was 7.5% for the 40 sets of experimental test data. This proved that the proposed method could effectively realize fault classification and recognition.


Introduction
As the most widely used type of engineering machinery, a loader is commonly utilized in construction sites such as roads, railways, and ports. Due to poor working conditions, it is particularly important to detect the failure of this construction machinery, especially through online detection. The gearbox is an important transmission component in loaders. Statistics show that the gear is the most common part causing failure: approximately sixty percent of failures are caused by the gear [1]. At present, the diagnosis methods for gearbox faults include temperature detection, optical fiber sensing, and vibration detection. These methods require that the position of the measuring instrument be changed when examining different fault locations at the same or different gearboxes. This is time-consuming and inefficient. Therefore, it is particularly important to propose a reasonable method for fault diagnosis.
Independent component analysis (ICA) is a new signal analysis technology developed in recent decades from the blind signal separation problem. It has been successfully applied in fields such as bioengineering [2], communication [3], speech recognition [4], and fault diagnosis [5]. In the field of neuroimaging, Melissant et al. used the ICA algorithm to preprocess the EEGs of patients with Alzheimer's disease, which improved the accuracy of diagnosis [6]. Gohel effectively repressed signal leakage by using the orthogonality of the ICA to reconstruct the source signal [7]. Back was the first to introduce ICA to the field of financial analysis, and many scholars have since used ICA to analyze and process financial data [8]. Independent component analysis was applied to the financial time series data belonging to a retail chain, where it detected a few factors that affect the cash flow [9]. Independent component analysis has also been used in machine condition monitoring and fault diagnosis. For example, an approach involving dynamic independent component analysis in fault detection and diagnosis, proposed by George, is able to accurately detect and isolate the root causes for each individual fault [10]. Principal component analysis (PCA) and ICA techniques were applied to the diagnosis of turbines for fault detection, and the validity and effectiveness of the approach were verified [11]. The French scholar Badaoui used ICA in combination with a Wiener filter to separate combustion noise and piston slap in a diesel engine [12]. Many improved algorithms such as constrained independent component analysis (cICA) [13,14] and kernel independent component analysis (KICA) [15,16] have been developed in order to improve the practical range and precision of the ICA algorithm.
The application of an intelligent system for fault diagnosis has been widely researched in recent years. Support vector machines (SVMs) and neural networks have been employed to solve classification problems [17]. In the field of economics, Jae identified the optimal parameter values of the kernel function of the SVM to predict bankruptcy and obtained better performance than multiple discriminant analysis (MDA), Logit, and back-propagation neural networks (BPNs) [18]. Linear SVM classifiers were introduced for credit risk evaluation [19]. In signal processing, Salvati used an SVM to locate near-field sound sources and improved the accuracy of the beam former in a reverberation environment [20]. Haik built an optimized SVM-based probability model to select a window from a continuous data, and the effectiveness of the approach was proven [21]. Many scholars regard SVM as an intelligent classifier for machine fault classification, especially bearing faults [22,23], gear faults [24,25], wind turbine faults [26,27], and induction motors [28,29]. When applied in the current study, SVM has high computational complexity and low precision when directly classifying using raw data with redundant characteristics. Therefore, the ICA technique was applied for feature extraction from the original data and to reduce the correlation between the signals. The correlation coefficients between the feature vector and the source data were then used as the input parameters of the SVM classifier.

Independent Component Analysis
Independent component analysis is a technique that has been demonstrated to be capable of separating an independent component from the source signal by utilizing its independence and non-Gaussian property. The process used by ICA is illustrated in Figure 1.
Mach. Learn. Knowl. Extr. 2019, 1 FOR PEER REVIEW 2 leakage by using the orthogonality of the ICA to reconstruct the source signal [7]. Back was the first to introduce ICA to the field of financial analysis, and many scholars have since used ICA to analyze and process financial data [8]. Independent component analysis was applied to the financial time series data belonging to a retail chain, where it detected a few factors that affect the cash flow [9]. Independent component analysis has also been used in machine condition monitoring and fault diagnosis. For example, an approach involving dynamic independent component analysis in fault detection and diagnosis, proposed by George, is able to accurately detect and isolate the root causes for each individual fault [10]. Principal component analysis (PCA) and ICA techniques were applied to the diagnosis of turbines for fault detection, and the validity and effectiveness of the approach were verified [11]. The French scholar Badaoui used ICA in combination with a Wiener filter to separate combustion noise and piston slap in a diesel engine [12]. Many improved algorithms such as constrained independent component analysis (cICA) [13,14] and kernel independent component analysis (KICA) [15,16] have been developed in order to improve the practical range and precision of the ICA algorithm. The application of an intelligent system for fault diagnosis has been widely researched in recent years. Support vector machines (SVMs) and neural networks have been employed to solve classification problems [17]. In the field of economics, Jae identified the optimal parameter values of the kernel function of the SVM to predict bankruptcy and obtained better performance than multiple discriminant analysis (MDA), Logit, and back-propagation neural networks (BPNs) [18]. Linear SVM classifiers were introduced for credit risk evaluation [19]. In signal processing, Salvati used an SVM to locate near-field sound sources and improved the accuracy of the beam former in a reverberation environment [20]. Haik built an optimized SVM-based probability model to select a window from a continuous data, and the effectiveness of the approach was proven [21]. Many scholars regard SVM as an intelligent classifier for machine fault classification, especially bearing faults [22,23], gear faults [24,25], wind turbine faults [26,27], and induction motors [28,29]. When applied in the current study, SVM has high computational complexity and low precision when directly classifying using raw data with redundant characteristics. Therefore, the ICA technique was applied for feature extraction from the original data and to reduce the correlation between the signals. The correlation coefficients between the feature vector and the source data were then used as the input parameters of the SVM classifier.

Independent Component Analysis
Independent component analysis is a technique that has been demonstrated to be capable of separating an independent component from the source signal by utilizing its independence and non-Gaussian property. The process used by ICA is illustrated in Figure 1.  In Figure 1, the source signal is composed of a few independent acoustic sources, s n (t). Considering the effect of path delay in the signal transmission process, each signal can take multiple channels to reach the receivers, x m (t). The process of unmixing the signals is performed by the ICA algorithm. Assuming that n known independent source signals were transmitted by an unknown channel and eventually accepted by m receivers, there are n independent components y n (t), which constitute the estimates of the source signal. The received signal can be expressed as [30] x where x i (t) is the mixed signal received by receiver i, b ij is the response function between the signal from the ith receiver and the jth source signal, and s j (t − p) is the delay function from the jth source signal to the ith receiver. Its matrix form is where B(t) is a mixed matrix in the time domain. Equation (2) is subjected to Fourier transform: where In the processing of the mixed solution, a linear matrix W is determined to obtain the estimated signal from the source signal. Therefore, the separation of the ICA is given by where Y( f ) is the estimation of S( f ) in the frequency domain, and the inverse Fourier transform is used to transform the frequency to the time domain, A is an unknown mixing matrix.

Support Vector Machines
Given the set of training samples D = {(x 1 , y 1 ), (x 2 , y 2 ), . . . , (x n , y n )} y i ∈ {−1, 1}, the most basic objective of the SVM is finding a division hyperplane from the sample space according to the training samples D to separate data that have different characteristics. The following linear equation describes the hyperplane function [31] w T x + b = 0 where w = (w 1 , w 2 , . . . , w d ) is the normal vector that determines the direction of the hyperplane, and b is a constant term that decides the distance between the hyperplane and the coordinate origin. Obviously, the divided hyperplane is constrained by w and b. The distance from point x belonging to the sample space to the plane (w,b) can be written as Suppose the plane (w,b) can classify the training samples correctly and has the following constraints for (x i , y i ) ∈ D.
As shown in Figure 2, the training sample point that is nearest to the hyperplane and gives a true statement when substituted into Equations (2)-(7) is called the "support vector." The sum of the distances of the two types of heterogeneous support vectors to the hyperplane is As shown in Figure 2, the training sample point that is nearest to the hyperplane and gives a true statement when substituted into Equations (2)-(7) is called the "support vector." The sum of the distances of the two types of heterogeneous support vectors to the hyperplane is ‖ ‖ . The plane of the "maximum margin" is equivalent to finding the parameter w and b, which must satisfy Equation (7). , 2 max w b w (10) . . s t ( ) 1 , should take the maximum value in order to maximize the interval. Therefore, Equations (2)-(9) is rewritten as  The plane of the "maximum margin" is equivalent to finding the parameter w and b, which must satisfy Equation (7).
Obviously, w −1 should take the maximum value in order to maximize the interval. Therefore, Equations (2)-(9) is rewritten as

Algorithm Simulation
To test the classification and recognition of the fault signal through the combined algorithms of ICA and SVM, MATLAB software was used to simulate three types of vibration signals, namely, the signal from the constant meshing gear S 1 (t) , the output gear pair signal S 2 (t) , and the signal from a broken tooth S cp (t). In order to simplify the simulation process, the other vibration sources in the gearbox were not considered when simulating the gearbox conditions. There were two different working conditions, which were convolutions of the three kinds of signals. Two hundred and forty sets of data were collected separately under different conditions. Two hundred groups of data were applied to train the classifier, and forty groups of data were used to test the accuracy of the classifier.
The noise signals were replaced by vibration signals due to difficulty in the simulation of the noise signals and the correlation between noise and vibration. The description of the simulated signals is shown in Table 1. Table 1. Description of simulation signals.

Signals Expression
Normal The parameters of the simulation are consistent with the gearbox parameters to make the simulation closer to the actual situation. In this study, the gearbox of a loader from Shandong Lingong Construction Machinery Co., Ltd., that had three gears, was used, as shown in Figure 3.
ICA and SVM, MATLAB software was used to simulate three types of vibration signals, namely, the signal from the constant meshing gear t ′, the output gear pair signal t ′, and the signal from a broken tooth t . In order to simplify the simulation process, the other vibration sources in the gearbox were not considered when simulating the gearbox conditions. There were two different working conditions, which were convolutions of the three kinds of signals. Two hundred and forty sets of data were collected separately under different conditions. Two hundred groups of data were applied to train the classifier, and forty groups of data were used to test the accuracy of the classifier. The noise signals were replaced by vibration signals due to difficulty in the simulation of the noise signals and the correlation between noise and vibration. The description of the simulated signals is shown in Table 1. Table 1. Description of simulation signals.

Signals Expression
The parameters of the simulation are consistent with the gearbox parameters to make the simulation closer to the actual situation. In this study, the gearbox of a loader from Shandong Lingong Construction Machinery Co., Ltd., that had three gears, was used, as shown in Figure 3.  Aiming at the noise signal of gearbox failure, this paper designs a diagnostic algorithm based on ICA and SVM. The FastICA principle is used to separate the independent sources in the signal, and the correlation coefficients of the noise signal and the independent source are used to construct the eigenvector of the noise signal. The algorithm flow is shown in Figure 5. Gaussian kernel is used. The function SVM model trains the classifier to diagnose the fault of the gearbox. The algorithm flow is shown in Figure 4. Aiming at the noise signal of gearbox failure, this paper designs a diagnostic algorithm based on ICA and SVM. The FastICA principle is used to separate the independent sources in the signal, and the correlation coefficients of the noise signal and the independent source are used to construct the eigenvector of the noise signal. The algorithm flow is shown in Figure 4. Gaussian kernel is used. The function SVM model trains the classifier to diagnose the fault of the gearbox. The algorithm flow is shown in Figure 5.

Simulation of the Original Signal
The second gear was selected, and the output speed of the torque converter was 1200 rpm in this working condition. The influence of the vibration signal of the torque converter was ignored.
In the simulation, the signal of the constant meshing gear is as follows: where and are the amplitude and phases of the ith order vibration of the constant meshing gear pair, respectively, and is the gear meshing frequency.

Simulation of the Original Signal
The second gear was selected, and the output speed of the torque converter was 1200 rpm in this working condition. The influence of the vibration signal of the torque converter was ignored.
In the simulation, the signal of the constant meshing gear s 1 (t) is as follows: where A i and ϕ i are the amplitude and phases of the ith order vibration of the constant meshing gear pair, respectively, and f m1 is the gear meshing frequency. In this paper, the frequency of the input shaft is f n1 = 20Hz. The torque converter output speed is 1200 rpm. The gear meshing frequency is: Assuming the second-order vibration response of the gear pair is known, The output gear pair signal s 2 (t) is as follows: where B i and θ i are the amplitude and phases of the ith order vibration of the output gear pair, respectively, and f m2 is the frequency of the output gear pair in the second gear condition.
Output shaft frequency f n2 is: The frequency of the output gear pair f m2 is: In addition, the amplitudes B 1 = 0.7 um, B 2 = 1.4 um, and phases θ 1 = θ 2 = 0.
The faulty gear was selected from the driving wheel and the broken tooth fault was applied. A broken tooth in the gearbox will generate a local impact phenomenon, and the vibration signals show the impact given by Equations (3)-(5), where s cp (t) is defined as where n is the number of shocks in the signal, m is the order of the natural frequency, and A cji is the amplitude of the ith impact when the machine has the jth order of natural frequency. To simplify the simulation process, A cji was assigned a random number from 0 to 10. The variables ε dj and f dj represent the jth order damping ratio and natural frequency, respectively; τ ci is the moment of the ith impact, which is given by τ ci = (i − 1)T c + τ c0 ; T c is the cycle of the gear failure; and τ c0 is the initial time of the impact response, i.e., τ c0 = 0.3T c . It was assumed that the two natural frequencies were 1100 and 3500 Hz. According to the actual working condition of the gearbox, ε d1 = 0.04, ε d2 = 0.03, The time and frequency diagrams of the three source signals are plotted in Figure 6.

Simulation of Working Condition Signals
Six data collectors were used in the simulation. Suppose that each source signal from two different channels reach the same collector. Therefore, the data from each collector can be expressed as (20) where ( ) is the data collected by the ith collector. The calculation method is as follows: ( ) = ( , : ) where is an arbitrary 6 × 6 mixed matrix, and is the number of FFT calculations. A 0.2-s signal was collected at each collector to show the difference between the data collected by each collector. The vibration signal of the gearbox under the normal condition is shown in Figure 7. The fault signal corresponding to the broken gear from six data collectors is shown in Figure 8.

Simulation of Working Condition Signals
Six data collectors were used in the simulation. Suppose that each source signal from two different channels reach the same collector. Therefore, the data from each collector can be expressed as where x i (t) is the data collected by the ith collector. The calculation method is as follows: where b is an arbitrary 6 × 6 mixed matrix, and N is the number of FFT calculations. A 0.2-s signal was collected at each collector to show the difference between the data collected by each collector. The vibration signal of the gearbox under the normal condition is shown in Figure 7. The fault signal corresponding to the broken gear from six data collectors is shown in Figure 8.

Feature Extraction
Due to the source signal emitted under the fault condition being generated by the convolution of three independent source signals in the simulation process, three independent components were selected to estimate the source signal. The FastICA algorithm was applied to separate the data for different conditions. First, the independent matrix was calculated. Second, the results of the signal

Feature Extraction
Due to the source signal emitted under the fault condition being generated by the convolution of three independent source signals in the simulation process, three independent components were selected to estimate the source signal. The FastICA algorithm was applied to separate the data for different conditions. First, the independent matrix was calculated. Second, the results of the signal

Feature Extraction
Due to the source signal emitted under the fault condition being generated by the convolution of three independent source signals in the simulation process, three independent components were selected to estimate the source signal. The FastICA algorithm was applied to separate the data for different conditions. First, the independent matrix was calculated. Second, the results of the signal were estimated by the matrix in the frequency domain, and the amplitude of the signal was corrected. Finally, the independent source was obtained through the iterative feedback tuning (IFT) criterion.
The results separated by FastICA are shown in Figure 9, and the three separated independent source signals have obvious periodic characteristics. By comparing the independent source signals in Figure 9 and the source signals in Figure 6, it can be seen that the separated signals have a certain degree of reduction of the normally meshing gear pair signal s 1 (t) and the output gear pair signal s 2 (t). Numerous differences in the sequence, amplitude, and shape can be found when comparing the results of the separation of the source signals. This can be attributed to the correlations between the signals used for the convolution. However, the independent components, once separated by FastICA, have obvious periodic characteristics and similarities when compared with the mixed source signals. When the fault signal was decomposed, the first signal contained the basic features of the fault signal components. The signals separated by FastICA are different and can be used to characterize the mixed signals under any condition.
Mach. Learn. Knowl. Extr. 2019, 1 FOR PEER REVIEW 10 were estimated by the matrix in the frequency domain, and the amplitude of the signal was corrected. Finally, the independent source was obtained through the iterative feedback tuning (IFT) criterion. The results separated by FastICA are shown in Figure 9, and the three separated independent source signals have obvious periodic characteristics. By comparing the independent source signals in Figure 9 and the source signals in Figure 6, it can be seen that the separated signals have a certain degree of reduction of the normally meshing gear pair signal 1 ( ) and the output gear pair signal 2 ( ). Numerous differences in the sequence, amplitude, and shape can be found when comparing the results of the separation of the source signals. This can be attributed to the correlations between the signals used for the convolution. However, the independent components, once separated by

Fault Identification
To improve the training precision, the correlation coefficients between the independent components and source data were used as the input parameters of the SVM classifier. The NICn is the nth independent component extracted under normal conditions(n = 1,2,3), and the FICn is the nth independent component extracted under fault conditions (n = 1,2,3). The procedure for the calculation of the correlation coefficient can be summarized as follows: Step 1: Represent the different conditions by applying S 1 , S 2 , S 3 according to the classification of the conditions. The FastICA algorithm is used to extract m independent components, which are recorded as IC i .
Step 2: Calculate the correlation coefficient between the first independent component of IC i and each data channel of the signal, and take the absolute maximum. The first feature value F 1 is the sum of the 3 numbers.
Step 3: Repeat the second step to obtain the feature value F 2 , F 3 . The feature vectors are column vectors from F 1 to F 3 .
Step 4: Repeat the second and third steps for the training signals, and obtain the feature vector of all signals.
The correlation coefficients of the NIC, FIC, and gearbox data collected under different conditions are shown in Table 2. It is worth noting that for n different conditions, only repeat step 2 and step 3 to get the eigenvectors of all signals. The feature vector of each group is the sum of the correlation coefficients. Table 3 shows the feature vector under the two normal and two fault working conditions. It can be seen from the table that the data collected under normal working conditions are highly correlated with the characteristic signal extracted from normal signals by the FastICA algorithm, and the data collected under the faulty working conditions are highly correlated with the characteristic signal extracted from fault signals. This clear correlation indirectly proves that the FastICA algorithm successfully extracts the characteristic components of different signals. Only one classifier can be trained, as there are only two types of working conditions in the set. The SVM was trained with approximately 160 groups of data from the two working conditions, and 40 samples were used to test the classifier performance. Each sample was a 2 × 1 matrix that could be expressed in two-dimensional coordinates. As shown in Figure 10, the two working conditions were clearly distinguished by a dividing hyperplane. The normal and fault signals have obvious differences while calculating the feature vector. The normal signals are mainly underneath the division line because of the higher correlation coefficient between NIC and the normal conditions than between NIC and the fault conditions. The same reasoning can be applied to fault signals on the other side of the dividing line. Forty samples were tested, and there were only two false positives under the two abovementioned conditions. Therefore, the misdiagnosis rate was 2 40 ⁄ 100% 5%.

Noise Diagnosis Experiment of Gearbox Failure
The fault situation during the process of gearbox operation is more complex than in the simulation not only because of the noise of a single component, but also because all kinds of aliasing noise and background noise are present. To verify the feasibility of the proposed algorithm in the noise diagnosis of the gearbox fault, the noise measurement platform was built to collect the data, and the algorithm was used to diagnose the failure of the gearbox.

Construction of Test Platform
Testing was performed in the gearbox laboratory of Shandong Lingong Construction Machinery Co., Ltd. A constant current source preamplifier (26CA) and free-field acoustic intensity sensor (46AE) from GRAS were applied. Before testing, the sound intensity sensor was first calibrated, then installed on the preamplifier and connected to the high-performance multichannel data acquisition system (LMS SCADAS) by BNC connector. Gear wear is the specific fault type considered in this paper, the length and depth of which was defined as 55 mm and 0.5 mm, respectively. The experiment was conducted using a test platform comprised of a motor, gearbox, speed sensor, constant current source preamplifier, free field acoustic intensity sensor, calibrator, and LMS SCADAS Mobile SCM05, as shown in Figure 11.
The Chinese national standard GB/T 3767-2016/ISO3744:2010 "Acoustics-Determination of sound power levels and sound energy levels of noise sources using sound pressure-engineering methods for an essentially free field over a reflecting plane" was referred to arrange the positions of 13 sound intensity sensors, as shown in Figure 10. First, the speed of the test rig was adjusted to 1200 r/min, and noise data under the normal conditions were collected when the speed sensor had measured the constant speed of the out shaft. The sampling frequency was 40,960 Hz, and the sampling time was 10 min. Next, the gearbox was stopped, the fault gear ring was replaced, and the signals corresponding to the fault conditions were collected. The normal and fault signals have obvious differences while calculating the feature vector. The normal signals are mainly underneath the division line because of the higher correlation coefficient between NIC and the normal conditions than between NIC and the fault conditions. The same reasoning can be applied to fault signals on the other side of the dividing line. Forty samples were tested, and there were only two false positives under the two abovementioned conditions. Therefore, the misdiagnosis rate was 2/40 × 100% = 5%.

Noise Diagnosis Experiment of Gearbox Failure
The fault situation during the process of gearbox operation is more complex than in the simulation not only because of the noise of a single component, but also because all kinds of aliasing noise and background noise are present. To verify the feasibility of the proposed algorithm in the noise diagnosis of the gearbox fault, the noise measurement platform was built to collect the data, and the algorithm was used to diagnose the failure of the gearbox.

Construction of Test Platform
Testing was performed in the gearbox laboratory of Shandong Lingong Construction Machinery Co., Ltd. A constant current source preamplifier (26CA) and free-field acoustic intensity sensor (46AE) from GRAS were applied. Before testing, the sound intensity sensor was first calibrated, then installed on the preamplifier and connected to the high-performance multichannel data acquisition system (LMS SCADAS) by BNC connector. Gear wear is the specific fault type considered in this paper, the length and depth of which was defined as 55 mm and 0.5 mm, respectively. The experiment was conducted using a test platform comprised of a motor, gearbox, speed sensor, constant current source preamplifier, free field acoustic intensity sensor, calibrator, and LMS SCADAS Mobile SCM05, as shown in Figure 11.  Figure 11. Construction of signal acquisition platform. Black solid point represents the sound intensity sensor, A represents reflector, B represents datum body, 2a represents measuring face length, 2b represents measuring face width, c represents height of measuring face, d represent measuring distance (80 cm), represents datum length (80 cm), represents datum width (60 cm), represents height of datum (120 cm).

Noise Spectrum Analysis
The two types of working noise data collected by the data acquisition system were divided into 120 groups with 5 s as the signal sampling period. There were 240 groups of samples, 200 groups of which were the training samples, and 40 samples were used to test the classification performance of the classifier. Signals were randomly intercepted to obtain 0.5 s signal fragments, as shown in Figures  12 and 13. Figure 11. Construction of signal acquisition platform. Black solid point represents the sound intensity sensor, A represents reflector, B represents datum body, 2a represents measuring face length, 2b represents measuring face width, c represents height of measuring face, d represent measuring distance (80 cm), l 1 represents datum length (80 cm), l 2 represents datum width (60 cm), l 3 represents height of datum (120 cm).
The Chinese national standard GB/T 3767-2016/ISO3744:2010 "Acoustics-Determination of sound power levels and sound energy levels of noise sources using sound pressure-engineering methods for an essentially free field over a reflecting plane" was referred to arrange the positions of 13 sound intensity sensors, as shown in Figure 10. First, the speed of the test rig was adjusted to 1200 r/min, and noise data under the normal conditions were collected when the speed sensor had measured the constant speed of the out shaft. The sampling frequency was 40,960 Hz, and the sampling time was 10 min. Next, the gearbox was stopped, the fault gear ring was replaced, and the signals corresponding to the fault conditions were collected.

Noise Spectrum Analysis
The two types of working noise data collected by the data acquisition system were divided into 120 groups with 5 s as the signal sampling period. There were 240 groups of samples, 200 groups of which were the training samples, and 40 samples were used to test the classification performance of the classifier. Signals were randomly intercepted to obtain 0.5 s signal fragments, as shown in Figures 12  and 13.   Frequency spectrum analysis was performed for noise data from 13 measuring points under different working conditions, as shown in Figures 10 and 11. The main frequency components of the noise collected under the normal working condition were 300.2, 900.5, 1247, and 1801 Hz. The speed measured by the velocimeter was 1317 r/min when the velocity was constant. Using the transmission ratio, the meshing frequency of the gear with 22 teeth was calculated to be approximately 300 Hz, the Frequency spectrum analysis was performed for noise data from 13 measuring points under different working conditions, as shown in Figures 10 and 11. The main frequency components of the noise collected under the normal working condition were 300.2, 900.5, 1247, and 1801 Hz. The speed measured by the velocimeter was 1317 r/min when the velocity was constant. Using the transmission ratio, the meshing frequency of the gear with 22 teeth was calculated to be approximately 300 Hz, the meshing frequency of the driving gear when in the second gear was calculated to be 900 Hz, and the meshing noise of the second-order fitting and of the engine were calculated to be 1800 and 1247 Hz, respectively. The results of the calculation are consistent with the spectral analysis components of the experimental data. From Figure 11, the main frequency components of noise acquisition under gearbox fault conditions are 300.5, 415, 900.5, 1247, and 1801 Hz. Among these, the frequency components of 300.5, 900.5, 1247, and 1801 Hz are the same as normal working conditions, so the noise at 415 Hz might be the frequency of the fault ring.

Independent Source Separation
The MDL criterion was used to calculate the noise of the two working conditions, using the seven independent sources. The FastICA algorithm in MATLAB was applied to separate the data. First, the matrices that were independent of each other were calculated. Second, the results of the signal were estimated using the matrix in the frequency domain, and the amplitude of the signal was corrected. Finally, the independent component of the time domain was obtained through the IFT criterion.
The time-frequency diagrams of the seven independent component signals separated from the normal working noise data are shown in Figure 14. All types of noise sources under the normal conditions were successfully separated and extracted by the FastICA algorithm. These mainly included the four above-mentioned principal frequencies: 300.5, 900.5, 1801, and 1247 Hz, as well as the approximate white Gaussian noise. In Figure 15, the algorithm separates four frequency components that correspond to normal conditions and the frequency of the gear ring fault.
Mach. Learn. Knowl. Extr. 2019, 1 FOR PEER REVIEW 16 meshing frequency of the driving gear when in the second gear was calculated to be 900 Hz, and the meshing noise of the second-order fitting and of the engine were calculated to be 1800 and 1247 Hz, respectively. The results of the calculation are consistent with the spectral analysis components of the experimental data. From Figure 11, the main frequency components of noise acquisition under gearbox fault conditions are 300.5, 415, 900.5, 1247, and 1801 Hz. Among these, the frequency components of 300.5, 900.5, 1247, and 1801 Hz are the same as normal working conditions, so the noise at 415 Hz might be the frequency of the fault ring.

Independent Source Separation
The MDL criterion was used to calculate the noise of the two working conditions, using the seven independent sources. The FastICA algorithm in MATLAB was applied to separate the data. First, the matrices that were independent of each other were calculated. Second, the results of the signal were estimated using the matrix in the frequency domain, and the amplitude of the signal was corrected. Finally, the independent component of the time domain was obtained through the IFT criterion.   The time-frequency diagrams of the seven independent component signals separated from the normal working noise data are shown in Figure 14. All types of noise sources under the normal conditions were successfully separated and extracted by the FastICA algorithm. These mainly included the four above-mentioned principal frequencies: 300.5, 900.5, 1801, and 1247 Hz, as well as the approximate white Gaussian noise. In Figure 15, the algorithm separates four frequency components that correspond to normal conditions and the frequency of the gear ring fault.

Parameter Calculation of SVM
Two samples were extracted from the training set under different conditions. Calculation of the correlation coefficients was performed in the manner described in Section 3. The results are shown in Tables 4 and 5. Table 4. Correlation coefficients of independent components from normal condition and source signals.  The correlation coefficients between the independent components and the source signals from each working condition are obviously smaller than the results of the simulation because the signal has been separated into seven independent components. Clearly, the first five independent

Parameter Calculation of SVM
Two samples were extracted from the training set under different conditions. Calculation of the correlation coefficients was performed in the manner described in Section 3. The results are shown in Tables 4 and 5. The correlation coefficients between the independent components and the source signals from each working condition are obviously smaller than the results of the simulation because the signal has been separated into seven independent components. Clearly, the first five independent components provide the main characteristics of the source signal, while the remaining components can be regarded as the high-frequency noise of the source signal. This conclusion is made according to the calculation of the input parameters based on the SVM algorithm and the sum of the correlation coefficients for each row. There are no obvious gaps between the feature vectors, which may be due to the difference in gear wear under the two conditions and the masking of the vibration effect by random noise.
One hundred groups of normal data samples and 100 groups of fault data samples were selected to train the SVM classifier. The division hyperplane of the SVM with the Gauss kernel function is plotted in Figure 16, in which it can be observed that the clusters corresponding to the samples for the two conditions are well separated. Forty groups of normal and fault signals were applied to verify the performance of the classifier, and there were three false positives. Therefore, the error probability was 3/40 × 100% = 7.5%.
Mach. Learn. Knowl. Extr. 2019, 1 FOR PEER REVIEW 18 components provide the main characteristics of the source signal, while the remaining components can be regarded as the high-frequency noise of the source signal. This conclusion is made according to the calculation of the input parameters based on the SVM algorithm and the sum of the correlation coefficients for each row. There are no obvious gaps between the feature vectors, which may be due to the difference in gear wear under the two conditions and the masking of the vibration effect by random noise. One hundred groups of normal data samples and 100 groups of fault data samples were selected to train the SVM classifier. The division hyperplane of the SVM with the Gauss kernel function is plotted in Figure 16, in which it can be observed that the clusters corresponding to the samples for the two conditions are well separated. Forty groups of normal and fault signals were applied to verify the performance of the classifier, and there were three false positives. Therefore, the error probability was 3 40 ⁄ 100% 7.5%.

Conclusions
(1) A fault diagnosis algorithm based on ICA and SVM is proposed in this paper. The gearbox of a loader was taken as the research object, and source signals were used to correctly diagnose gearbox failure by applying feature extraction and classification recognition as the processing technologies. The signals collected from the gearbox, which contained the principal feature information of the fault, were extracted by the ICA algorithm. The correlation coefficient was used as the characteristic parameter of fault diagnosis, as it can improve the robustness of detection systems due to the high correlations between the independent components and working conditions.
(2) Three independent source signals in the gearbox were simulated using MATLAB, and the convolution hybrid model was used to generate the six channel signals under two working conditions. Two hundred groups of simulation data were collected. Two groups of samples were selected from each condition to display the entire process of signal processing. The training of the SVM was implemented using 160 groups of samples, and 40 groups were used to test the accuracy of the classifier. There were two false positives under the two conditions; therefore, the misdiagnosis rate was 5%.
(3) A noise testing platform with 13 measurement points was built on the gearbox test rig according to the Chinese national standards, and two gear ring wear conditions were defined. One hundred and twenty groups of data under each condition were collected for the research. The algorithm established to calculate the correlation coefficient between the independent component and the source signal was applied to train the classifier and test its accuracy. There were three false positives; therefore, the error probability was 7.5%.

Conclusions
(1) A fault diagnosis algorithm based on ICA and SVM is proposed in this paper. The gearbox of a loader was taken as the research object, and source signals were used to correctly diagnose gearbox failure by applying feature extraction and classification recognition as the processing technologies. The signals collected from the gearbox, which contained the principal feature information of the fault, were extracted by the ICA algorithm. The correlation coefficient was used as the characteristic parameter of fault diagnosis, as it can improve the robustness of detection systems due to the high correlations between the independent components and working conditions.
(2) Three independent source signals in the gearbox were simulated using MATLAB, and the convolution hybrid model was used to generate the six channel signals under two working conditions. Two hundred groups of simulation data were collected. Two groups of samples were selected from each condition to display the entire process of signal processing. The training of the SVM was implemented using 160 groups of samples, and 40 groups were used to test the accuracy of the classifier. There were two false positives under the two conditions; therefore, the misdiagnosis rate was 5%.
(3) A noise testing platform with 13 measurement points was built on the gearbox test rig according to the Chinese national standards, and two gear ring wear conditions were defined. One hundred and twenty groups of data under each condition were collected for the research. The algorithm established to calculate the correlation coefficient between the independent component and the source signal was applied to train the classifier and test its accuracy. There were three false positives; therefore, the error probability was 7.5%.
(4) The experimental results show that the method proposed in this paper can effectively extract various mechanical fault features from signal data, providing significant engineering value. Furthermore, this research provides an innovative approach for the feature extraction of mechanical faults from signal data.