3.1. Experimental Section
In this study, the hydraulic system condition monitoring dataset released by the Center for Mechatronics and Automation Technology (ZeMA) [
20] in Saarbrücken, Germany, was adopted as the research basis. This dataset was acquired through experiments on a hydraulic test bench, where the system consists of a main working circuit and a secondary cooling and filtration circuit, connected via an oil tank. The system repeats a constant load cycle every 60 s, and process variables such as pressure, volume flow rate, and temperature were measured during the quantitative condition changes of four hydraulic components (cooler, valve, pump, and accumulator).
The dataset comprises 2205 operating cycles, with each cycle containing 43,680 data points collected from 17 sensors at different sampling frequencies: six pressure sensors (PS1–PS6) and one electric motor power sensor (EPS1) operate at a sampling frequency of 100 Hz, generating 6000 data points per cycle; two volume flow sensors (FS1, FS2) work at 10 Hz, producing 600 data points per cycle; and four temperature sensors (TS1–TS4), one vibration sensor (VS1), and three virtual sensors (CE, CP, SE) are sampled at 1 Hz, resulting in 60 data points per cycle.
Target condition values are labeled on a per-cycle basis, encompassing five dimensions of state information:
Cooler condition: 3 (near complete failure), 20 (reduced efficiency), 100 (full efficiency);
Valve condition: 100 (optimal switching behavior), 90 (minor delay), 80 (severe delay), 73 (near complete failure);
Internal pump leakage: 0 (no leakage), 1 (weak leakage), 2 (severe leakage);
Hydraulic accumulator: 130 (optimal pressure), 115 (minor pressure drop), 100 (severe pressure drop), 90 (near complete failure);
Stable flag: 0 (stable condition), 1 (static condition may not be achieved).
Figure 6a is the hydraulic system working circuit, and
Figure 6b is the cooling filter circuit. Let us focus on the pressure and flow sensor data within the hydraulic system’s operating loop, specifically the PS1, EPS1, and FS1 sensors shown in
Figure 6a. PS1 represents the pressure sensor data, sampled at 100 Hz. EPS1 corresponds to the motor power sensor data, also sampled at 100 Hz. FS1 refers to the flow sensor, with a sampling frequency of 10 Hz. Therefore, each sample consists of 600 data points. Detailed attributes of the plunger pump sample data are provided in the
Table 1. The collected multi-sensor information, including the motor power signal, is used to identify three levels of internal leakage in the plunger pump: normal, slight internal leakage, and severe internal leakage.
Figure 6 presents a schematic diagram of the experimental test setup for acquiring the source domain dataset, where
Figure 6a illustrates the working circuit of the hydraulic system and
Figure 6b shows the cooling and filtration circuit.
To ensure data consistency and comparability amid varying sampling frequencies of different sensors, we performed data preprocessing including aligning and resampling all data to a unified 10 Hz (yielding 600 data points per cycle, with 100 Hz data downsampled by averaging every 10 points and 1 Hz data upsampled via linear interpolation), handling outliers using the 3σ principle (replacing out-of-range points with linear interpolation of adjacent points), normalizing data to the [0, 1] interval via min-max normalization to eliminate dimensional influences, and extracting time-domain (mean, variance, standard deviation, peak-to-peak value, RMS, skewness, kurtosis), frequency-domain (main frequency components, spectral energy, frequency band energy ratio) and time-frequency (wavelet coefficient energy, IMF energy from EMD) statistical features from each sensor’s time-series data.
Figure 7 demonstrates the correlation between the states of various components in the form of a heatmap, revealing the mutual influence relationship under multi-fault concurrent scenarios.
Analysis of component fault correlations shows that the cooler’s failure or reduced efficiency is associated with valve wear/aging and pump internal seal wear (due to increased system temperature and decreased hydraulic oil viscosity), the valve’s severe delay affects the accumulator’s charging/discharging efficiency, pump internal leakage impairs the accumulator’s normal operation (due to unstable system pressure), and system instability significantly increases the probability of concurrent faults in multiple components (e.g., cooler, valve, pump) by accelerating their synergistic degradation. As shown in
Figure 8.
Feature analysis of pressure, electric motor power, flow, and temperature sensors shows that pump leakage leads to decreased average pressure, increased pressure and flow rate fluctuations, elevated motor power demand and sharper power distribution, and reduced average flow rate, while the decline in cooler efficiency results in increased system temperature and weakened temperature control capability.
Figure 9 shows the comparison of time-series signals of key sensors under different fault states: blue represents the signal of PS1, orange represents the signal of PS2, green represents the signal of PS3, and red represents abnormality.
Under normal conditions, pressure, motor power, flow, and temperature sensor signals remain stable and regular; under weak leakage, signal periodicity is disrupted with low-frequency modulation components, accompanied by slight flow reduction and slow temperature rise; under severe leakage, all signals fluctuate violently with multiple frequency components, power and flow fluctuations increase significantly, and temperature rises rapidly.
Figure 10 illustrates the complete evolution process of the pump from the normal state to severe leakage: green represents normal state, orange represents slight leakage, and red represents severe leakage.
The sensor signal characteristics evolve with fault progression: in the healthy state, pressure signals are stable, statistical features conform to normal distribution, and only the inherent system frequency component exists; in the early weak fault phase, pressure slightly decreases, signal fluctuation increases, weak new frequency components emerge, and signal distribution begins to deviate from symmetry; in the established weak leakage phase, pressure further decreases, signal fluctuation and new frequency component amplitude increase with additional harmonic components, and signal distribution becomes sharper; and in the severe leakage phase, pressure drops significantly, signal fluctuation intensifies, multiple harmonics appear, and signal distribution deviates greatly from normal distribution. The evolution law shows that average pressure decreases monotonically with an accelerating rate, signal fluctuation grows exponentially, frequency-domain components develop from a single frequency to multiple harmonics, and statistical distribution gradually changes from approximately normal to a sharp-peak and heavy-tail distribution.
Figure 11 illustrates the frequency-domain characteristics of vibration and pressure signals under different fault states. Green represents normal state, orange represents slight leakage, and red represents severe leakage.
Under a normal state, the vibration and pressure signals have single spectral peaks corresponding to the system’s natural frequency and load cycle frequency respectively, with a low proportion of high-frequency components indicating stable operation; under weak pump leakage, both signals show additional spectral peaks, and the proportion of high-frequency components increases, suggesting a new vibration source; under severe pump leakage, more spectral peaks appear in both signals, with a further rise in high-frequency component proportion due to multiple harmonic vibrations caused by faults; and for cooler faults, the temperature signal spectrum broadens from narrowband to broadband, the spectral centroid shifts upward, and the proportion of high-frequency components increases, indicating slower temperature regulation and intensified fluctuations.
Figure 12 presents the correlation matrix among the 17 sensors. Sensors exhibit different correlation levels: the pressure sensor group and temperature sensor group show strong positive correlations, reflecting consistent monitoring data and system temperature distribution respectively; pressure sensors with motor power and flow sensors between themselves present moderate positive correlations, in line with energy conversion relationships and flow measurement consistency; and pressure sensors with temperature sensors and motor power with cooling efficiency have negative correlations, conforming to physical laws and performance relationships. Faults affect sensor correlations: pump leakage weakens the negative correlation between pressure and flow sensors, cooler faults enhance the correlation among temperature sensors, and system instability generally reduces the overall correlation level among sensors.
Systematic analysis of the ZeMA [
20] hydraulic system dataset yields key conclusions: the dataset features high quality and completeness, covering multiple fault states and severity levels of main components with reasonable sample distribution to support machine learning; obvious fault characteristics exist with significant differences in time-domain, frequency-domain and statistical features among different faults (e.g., pump leakage causes pressure drop, increased fluctuation and multiple harmonics), providing a reliable physical basis for fault diagnosis; component faults have significant statistical correlations with multi-fault coupling effects (e.g., cooler failure accelerates valve wear and pump leakage), and increasing diagnostic complexity while enabling graph-structure-based methods; sensor information has redundancy (high correlation among homogeneous sensors, requiring feature selection) and complementarity (heterogeneous sensors provide complementary information, requiring fusion for comprehensive system status assessment).
To intuitively analyze the class separability of the original feature space, the t-SNE (t-distributed Stochastic Neighbor Embedding) method is used to map high-dimensional raw features to a two-dimensional space, and the results are shown in
Figure 13. It can be observed that different fault categories have obvious overlap in the low-dimensional space, indicating that the original features are difficult to directly achieve high-precision classification without effective modeling and correlation graph construction. This also indirectly verifies the necessity of introducing graph structure modeling and correlation reasoning mechanisms.
The chapter introduces a method for exploring neural network architectures. Convolutional neural networks are built through actions generated by a controller. Test accuracy serves as rewards and penalties, guiding parameter updates by maximizing these rewards. The resulting network structure is fed back to the controller. It can be seen that after exploration, the model finally converges to the highest reward, and the optimal submodel network structure in the table is obtained. The network structure is detailed in
Table 2.
As shown in
Figure 14, the policy gradient optimization process can be divided into three distinct stages. In the exploration stage (iterations 0–30), the reward value fluctuates significantly (pink scatter points) as the model explores different strategies. The 5-window moving average (dark blue solid line) smooths these fluctuations, revealing a steady upward trend. As the optimization enters the exploitation stage (iterations 30–70), the reward value stabilizes and increases gradually, reflecting the model’s convergence to more effective strategies. Finally, in the convergence stage (iterations 70–100), the reward value stabilizes at a high level, with the optimal value reaching 99.50% (red dot). The purple dashed line, corresponding to the right y-axis, shows the performance improvement relative to the initial value, which peaks at 16.2% during convergence. Key statistics, including the initial accuracy (85.0%), final accuracy (97.9%), and total improvement (15.2%), are provided to quantify the optimization effect, with the model reaching ≥95% accuracy after 34 iterations.
To more intuitively study the classification performance of the hydraulic pump states at each layer of the neural network, the t-SNE (t-distributed Stochastic Neighbor Embedding) algorithm is also applied to perform two-dimensional visualization of both the output signals of the KAN convolutional network model and those of the network architecture defined by the proposed method. As shown in
Figure 15 and
Figure 16, respectively.
The test results of the optimal submodel are compared to other deep learning methods, including CNN, LSTM, and DBN. CNN consists of four convolutional layers; the hyperparameters of the filter and convolution kernel are [16,3]-[32,3]-[16,3]-[32,3] respectively, and then connected to the average pooling layer; the filling type is the same padding; and the relu function is used as the activation function. The LSTM is a single hidden layer consisting of 128 hidden units followed by a softmax classifier. DBN consists of two hidden layers with a network structure of 400-200-100-5. Each method iterates 200 times during training, and the test results are obtained. Each method was run ten times under the same conditions. The test results are summarized in
Table 3, and the network weight parameters are saved for subsequent parameter migration. It can be seen that the accuracy of the optimal submodel is significantly better than the other three methods, followed by CNN with a recognition accuracy of 89.52%, and LSTM and DBN with poor recognition accuracy.
Randomly select a group of test results, and summarize the test accuracy of different fault types in
Figure 17. In fault types 3, 4, and 5, the optimal submodel can accurately identify faults that are difficult to identify by other methods. It is proven that the method proposed in this chapter can obtain the optimal intelligent diagnosis model, which can accurately identify the fault type, and save a lot of time for the optimization of neural network structure and parameters.
3.2. Experimental Data Set Based on Airborne Hydraulic Simulation System
Studying fault diagnosis in airborne hydraulic systems driven by data necessitates acquiring information on parameters such as pressure and flow under normal and diverse fault conditions. Directly implanting faults in actual aircraft hydraulic systems is costly and challenging. Consequently, domestic research typically employs AMESim (
https://www.plm.automation.siemens.com/global/en/products/simcenter/amesim/, accessed on 1 February 2026) software to simulate airborne hydraulic systems as shown in
Figure 18, introducing common faults via simulation to generate datasets for algorithm training and effectiveness testing. By extracting curves representing normal and faulty states, a dataset is compiled to underpin subsequent algorithmic investigations.
Based on a comprehensive review of the relevant literature, five typical faults are selected as simulation objects in this paper, including internal leakage of the hydraulic pump, oil filter blockage, servo valve blockage, hydraulic oil contamination, and internal leakage of the actuator. The fault implantation conditions are set with reference to technical parameters in the literature. To efficiently simulate the internal leakage fault of the hydraulic pump in AMESim, an orifice is connected in parallel at both ends of the pump. The quantitative simulation of internal leakage degree is realized by adjusting the equivalent aperture parameter of the element—the larger the equivalent aperture, the more serious the internal leakage of the pump, which in turn changes the pressure and flow characteristics at the pump outlet, achieving the simulation implementation goal of system-level pump leakage fault.
For the oil filter blockage fault, simulation is achieved by adjusting its equivalent filter pore diameter parameter: the pore diameter is set to 3–4 mm to simulate the blockage state, and when the pore diameter is larger than 5 mm, it is regarded as a normal working condition. The AMESim actuator element has a built-in leakage coefficient parameter, which can be directly used for accurate simulation of internal leakage fault of the actuator; the servo valve blockage fault is realized by connecting an overflow valve in series at the direct junction of the valve outlet and the rodless chamber of the actuator, and the occurrence and degree of servo valve blockage can be flexibly controlled by adjusting the parameters of the overflow valve. In addition, the gas content parameter of hydraulic oil in AMESim can be used to effectively reproduce the oil contamination fault caused by the increase in gas content in hydraulic oil.
The simulation covers the above five fault conditions and one normal condition, totaling six state categories. The experimental sampling frequency is set to 100 Hz, and the time-series data collected under each working condition is divided into 400 samples, of which the first 350 samples are used to construct the training set, and the remaining 50 samples are used as the test set. The specific configuration of the data set is shown in the
Table 4.
From the perspective of data structure, each sample consists of equal-length time-series signals from multiple sensors, with a single sample containing 100 sampling points corresponding to 1 s of system operating status at a sampling frequency of 100 Hz. This sample construction method not only retains the dynamic evolution characteristics of the hydraulic system but also facilitates subsequent feature extraction and model training, which is consistent with the conventional data organization format in the field of hydraulic system fault diagnosis.
From the perspective of physical magnitude, the pump outlet pressure signals are generally distributed in the range of approximately 160–170 bar, which falls within the typical operating pressure range of airborne hydraulic systems. Actuator leakage-related variables remain relatively stable under normal operating conditions and show varying degrees of upward trends under leakage fault conditions, without abnormal mutations or non-physical data values. This indicates that the simulation data are consistent with the engineering practice of hydraulic systems in terms of dimensions and numerical ranges, with no obvious numerical distortion or modeling deviation introduced.
In hydraulic systems, there are significant differences in the influence mechanisms of different fault types on system state variables. For example, internal pump leakage usually leads to a decrease in the effective output flow rate of the system, thereby causing a reduction in the average outlet pressure; blockage faults are mainly characterized by increased flow resistance and intensified system pressure fluctuations; and internal actuator leakage often results in an increase in return oil volume, leading to a significant rise in leakage-related state variables. As shown in the
Figure 19.
To verify whether the simulation data reflect the aforementioned physical laws, this study conducts statistical analysis on the mean value and fluctuation characteristics of pump outlet pressure under different fault modes. The results show that there are significant differences in the average pressure among different fault states: the average pressure of samples with internal pump leakage faults is generally lower than that under normal operating conditions, while the pressure standard deviation of samples with oil filter clogging and internal actuator leakage faults increases significantly. This phenomenon is highly consistent with the theoretical analysis and engineering experience of hydraulic systems. Meanwhile, leakage-related variables exhibit high mean levels in internal actuator leakage and internal pump leakage faults, while maintaining relatively low levels under normal operating conditions and non-leakage faults, further verifying the physical consistency of the simulation data in fault characterization. As shown in the
Figure 20.
In addition to static statistical features, hydraulic system faults are often accompanied by obvious dynamic evolution characteristics. Analysis of the time-series signals within samples reveals significant differences in the variation trends and fluctuation patterns of system pressure signals under different fault modes. Specifically, leakage faults are characterized by a slow decrease or stable offset of pressure levels, while blockage faults are more likely to induce high-frequency fluctuations and dynamic lag of pressure signals. Such differences exhibit good distinguishability in the time domain. As shown in the
Figure 21.
Furthermore, through low-dimensional joint analysis of pressure mean values and leakage-related features, it can be observed that different fault modes present a certain degree of clustering distribution in the feature space, indicating that the simulation data itself has good fault separability. This provides a necessary premise for the subsequent introduction of deep learning models and graph structure modeling methods for complex fault reasoning.
In summary, the fault simulation data of the airborne hydraulic system constructed based on AMESim exhibits good rationality and engineering credibility in the following aspects: the data structure and sampling settings are consistent with the actual working conditions of online monitoring of airborne hydraulic systems; the numerical ranges and variation trends of key physical quantities conform to the engineering experience of hydraulic systems; different fault modes show differences consistent with mechanism analysis in statistical features and dynamic evolution behaviors; the data has good fault separability, providing a reliable data foundation for the subsequent construction of fault diagnosis models. Therefore, it can be concluded that the simulation data can relatively truly reflect the operating characteristics of the airborne hydraulic system under different fault conditions, and is suitable for researching fault diagnosis and reasoning methods based on data-driven and mechanism fusion.
To intuitively analyze the classification effect of each layer of the neural network on the hydraulic pump state, the t-distributed Stochastic Neighbor Embedding (t-SNE) algorithm is used to visualize the output signals of each layer of the three models (KAN-1, KAN-2, KAN-3) and the model optimized by the method proposed in this paper, and the results are shown in
Figure 22.
The method proposed in this paper is applied to the neural network architecture search process. The controller generates actions to construct the KAN convolutional neural network, takes the test accuracy as the reward and punishment signal, updates the model parameters using the maximum reward and punishment strategy, and feeds back the generated network structure to the controller as the input for the next stage of search. After iterative exploration, the model finally converges to the optimal reward and punishment value, and the optimal submodel network architecture shown in
Table 5 is obtained.
To verify the effectiveness of the proposed method, a comparative experiment is conducted between the test accuracy of the generated network structure and the network structures selected by manual experience (KAN-1, KAN-2, KAN-3). During the experiment, except for the network structure parameters, other experimental conditions are kept consistent, and six repeated tests are performed on the test set in the same environment. The statistical results of test accuracy are shown in
Table 6, and the accuracy fluctuation of the six experiments is shown in
Figure 23.
The experimental results show that the average classification accuracy of the KAN-1 model is the lowest at 86.63%; the average accuracy of the KAN-2 model is 88.33%; the average accuracy of the KAN-3 model is 91.60%, which is slightly higher than that of KAN-3 and KAN-2. The average accuracy of the proposed method can reach 99.10%, and its classification performance is significantly better than the three KAN structures designed by manual experience. In terms of stability, the accuracy standard deviation of the proposed method in six experiments is only 0.31, which is much lower than that of the comparison models, showing better robustness. This result indicates that the method proposed in this paper can realize automatic searching for the optimal network structure, greatly reduce the time consumption of manual parameter tuning, and improve the engineering practicability of the diagnostic model.
A set of experimental results is randomly selected to draw the confusion matrix of each method, as shown in
Figure 24. It can be seen from the figure that the KAN-3 model (
Figure 15d) has the lowest recognition accuracy for faults of type 0, 1 and 3, which is prone to misclassification; the KAN-2 model (
Figure 24c) also has the problem of low recognition accuracy for faults of type 0, 1 and 3, and is easy to misclassify such faults into type 0 and 3; the overall recognition accuracy of the KAN-1 model (
Figure 24b) is better than that of KAN-2 and KAN-3, but there are still misjudgments in some fault categories. The proposed method (
Figure 24a) can accurately identify all types of faults, with the minimum recognition accuracy still reaching 96%. Especially for type 2, 3, and 4 faults that are difficult to distinguish by the other three models, it shows excellent classification ability, fully verifying its superiority.
In order to further verify the effectiveness of the proposed method, several widely used deep learning models are selected to compare with the proposed method: BP neural network, long short-term memory network (LSTM), and deep belief network (DBN). Summarizing the test accuracy in
Table 7, we can see that the average classification accuracy of the BP neural network is slightly lower, at 91.10%. LSTM and DBN followed, with average accuracies of 93.02% and 92.83%, respectively. The average accuracy of the proposed method is still better than that of other deep learning methods, and the standard deviation is the smallest. Therefore, the proposed method can accurately identify all kinds of faults.
A set of experimental results is selected to draw the confusion matrix of each model, as shown in
Figure 25. The BP neural network (
Figure 25b) has the lowest recognition accuracy for type 3 faults, only 89%; the LSTM model (
Figure 25c) also has the problem of low recognition accuracy for type 3 faults; the DBN model (
Figure 25d) has a poor recognition effect on type 0 faults. The proposed method (
Figure 25a) maintains the recognition accuracy of all fault categories above 96%, which can effectively avoid the misclassification problems existing in other models.