You are currently viewing a new version of our website. To view the old version click .
Sensors
  • Article
  • Open Access

8 December 2025

Fault Diagnosis Method for Axial Piston Pump Slipper Wear Based on Symmetric Dot Pattern and Multi-Channel Densely Connected Convolutional Networks

,
,
,
,
,
,
,
,
and
1
School of Mechanical Engineering, Yanshan University, Qinhuangdao 066004, China
2
Silesian College of Intelligent Science and Engineering, Yanshan University, Qinhuangdao 066004, China
3
School of Mechanical Engineering, Hebei University of Architecture, Zhangjiakou 075000, China
4
Mechanical and Electrical Engineering, Xinjiang Institute of Engineering, Urumqi 830023, China
This article belongs to the Section Fault Diagnosis & Sensors

Abstract

Fault diagnosis in axial piston pumps is key to ensuring the proper operation of a hydraulic system. Slipper wear, as a typical fault in piston pumps, is challenging to accurately diagnose because the faults are very similar for different forms and degrees of wear. The achievement of accurate fault diagnosis of different forms and degrees of wear in the slipper will greatly improve the reliability of axial piston pump operation and, at the same time, provide new ideas for research into similar fault diagnosis problems in other rotating machinery. Therefore, a method of fault diagnosis based on the following symmetric dot pattern (SDP) and multi-channel densely connected convolutional networks (DenseNet) is proposed in this paper. The method applies an SDP transformation to transform the slipper failure signal into an SDP image, which achieves the fusion of triaxial vibration signals and enriches the signal features. The inception module is improved by replacing the original structure with larger convolutional kernels in multiple branches and decomposing the larger convolutional kernels. The inception module, the convolutional block attention module (CBAM), and the DropBlock method are introduced into DenseNet to improve feature extraction capability, computational efficiency, and model generalization ability. Experiments are performed on several slipper wear fault SDP image datasets, and all the indices produced by the proposed method are higher than those of the traditional convolutional neural networks, which fully proves the effectiveness and superiority of the procedure.

1. Introduction

Axial piston pumps are generally used in aerospace hydraulic systems, construction machinery, and ocean engineering. The slipper is an important part of the piston pump. It is subjected to a number of unstable forces from the piston and the swash plate, making it the most vulnerable component to wear and failure in the piston pump [1,2,3,4]. Slipper wear can not only shorten the service life of the piston pump but also lead to its catastrophic failure [5]. Therefore, the study of fault diagnosis algorithms for multiple forms of wear and different degrees of wear on the slipper is of great value in promoting the improvement and development of piston pump maintenance technology [6].
In recent years, deep learning has generally been used in piston pump fault diagnosis for its powerful ability to mine and extract useful features [7]. Figure 1 shows the intelligent fault diagnosis process [8,9]. Commonly used models, such as convolutional neural networks [10], residual networks [11], and Transformer [12], are constantly being improved and fused and then applied to fault diagnosis tasks [13,14,15]. He et al. [16] proposed an adversarial model-based transfer learning system for deep multi-signal fusion with good performance for cross-domain piston pump fault diagnosis. Tang et al. [17] integrated the theory of deep learning and the Bayesian optimization algorithm to accurately distinguish the different fault conditions of piston pumps. Ugli et al. [18] proposed an automatic optimization approach for the fault diagnosis of piston pumps using a one-dimensional convolutional neural network (1D-CNN) architecture and a genetic algorithm through the inclusion of direct links and a dimension reduction model within the 1D-CNN block, which performed well on various kinds of fault diagnosis tasks, for example, slipper wear and thrust plate wear.
Figure 1. Intelligent fault diagnosis process.
Guo et al. [19] proposed a hybrid model of deep learning and WaveletKernelNet-CBAM-BiLSTM for troubleshooting drilling pumps. The method utilizes random forests for signal selection, integrates CBAM to enhance feature representation, and employs BiLSTM for temporal data processing. Bayesian optimization is applied to fine-tune the hyperparameters. The experimental results demonstrate the accuracy of the model, as well as its reliability and generalization capability. However, this method diagnoses faults that are caused by varying numbers of worn slippers and that exhibit significant dissimilarities; it does not account for similar wear faults in a single slipper. Hatami Garousi et al. [20] proposed a vibration-based method of diagnosing centrifugal pump failures, utilizing a multi-layer perceptron (MLP) to classify faults. The study extracts time- and frequency-domain features from vibration signals, demonstrating that statistical indicators like kurtosis and skewness effectively distinguish defects such as impeller damage and cavitation. Experimental results validate the method’s ability to detect and quantify fault severity in real time. Du et al. [21] proposed a novel approach to rotor fault diagnosis and isolation based on search coils for permanent magnet synchronous motors, in which two random forest models are employed to achieve automated classification for distinguishing between two faults with similar characteristics. However, the method only considered two fault forms and depended on sensor placement and signal processing. Wang et al. [22] proposed a deep learning way based on transfer adversarial subnetworks and channel-wise thresholds. However, the method is effective in classifying two kinds of faults, namely, gearwheel tooth wear and pinion missing teeth, and it is difficult to prove its effectiveness in discriminating similar features. Ding et al. [23] proposed a classifier prediction-oriented domain adaptation network that aims to solve the decision boundary confusion caused by strong connectivity among similar categories. However, the method focused on solving the cross-domain fault diagnosis problem and did not directly solve the similar fault diagnosis problem. Table 1 summarizes the literature review.
Table 1. Literature review.
To address these challenges, this study introduces SDP transformation as a superior alternative to conventional time–frequency representations like spectrograms or wavelet transforms. While spectrograms and wavelets are powerful, they often struggle with low signal-to-noise ratios and generate high-dimensional, complex outputs that can be computationally intensive for deep learning models to process, especially when fusing multiple sensor channels. In contrast, the SDP technique offers distinct advantages: (1) it inherently possesses strong noise robustness, effectively suppressing stochastic noise in weak fault signals; (2) most critically for this application, SDP provides a novel and intuitive framework for fusing triaxial vibration signals into a single, unified two-dimensional image. This fusion capability uniquely preserves the synergistic relationships and composite fault patterns across different axes, which are often critical for diagnosing complex faults like the multiple wear forms in piston pump slippers. A visual example of this triaxial fusion via SDP is presented in Figure 2. In the SDP image, each vibration signal is transformed into symmetrical “petals”—specifically, the triaxial vibration signals are mapped to three pairs of symmetrical petals (e.g., petals 1 and 2 correspond to the X axis). Each pair is distinguished by red and blue colors to facilitate the analysis and comparison of features from the same axis.
Figure 2. Example of triaxial fusion via SDP.
In summary, signal processing and analysis methods such as wavelet packet transform, cyclostationary theory, and methods such as deep learning and domain adaptive algorithms are helpful in solving similar fault diagnosis problems. However, due to the complex structure of axial piston pumps, more forms of slipper wear, different wear degrees, and unclear wear mechanisms tend to occur, and thus, the fault diagnosis research for slipper wear and similar faults is still less. Multiple slipper wear forms, different wear degrees, and different composite wear forms mean that the signals produced by the system are weak; noise and data change trends are similar; feature information performance is similar; feature extraction is difficult; and thus, the existing fault diagnosis models struggle to diagnose these slipper wear faults and other similar faults. Therefore, the existing fault diagnosis methods need to be improved in terms of signal processing and network structure. In this paper, we will focus on three different wear forms that occur in axial piston pump slippers—abrasive wear, corrosive wear, and adhesive wear—as well as three diverse degrees of wear failure: slight, moderate, and serious.
To this end, a fault diagnosis method based on SDP with multichannel DenseNet is proposed in this article. The method effectively enhances the feature extraction capacity of the network and the generalization capacity of the model, performs well with regard to the fault diagnosis of different forms and degrees of wear in the slipper, and provides new ideas for solving other similar fault diagnosis problems. The contributions of this study can be summarized as follows:
(1)
The SDP image transformation method is applied to the construction of image samples of a slipper fault for the first time, and the triaxial vibration signal is directly transformed into an SDP image, which achieves the feature fusion of triaxial vibration signals, enriches the features of the fault diagnosis sample, and reduces the time complexity.
(2)
Based on the design of the inception module, conventional fault diagnosis methods are improved by replacing their original structure with a larger convolutional kernel and multi-branching that decomposes the larger convolutional kernel, which enriches feature information under different sensory fields and effectively captures both local and global features.
(3)
The improved inception module, CBAM, and DropBlock methods are introduced into DenseNet, and DenseBlock4 in the original DenseNet is removed to establish a multichannel DenseNet-based fault diagnosis model. The model has a high identification accuracy in fault diagnosis tasks for multiple wear forms, different wear degrees, and different composite wear forms of the slipper.
The remainder of the article is scheduled as follows: Section 2 presents the related theory. Section 3 describes the structure of the proposed method. In Section 4, the experimental details and results analysis are presented. Finally, Section 5 concludes the paper.

2. Preliminaries

2.1. Symmetric Dot Pattern Transformation (SDP)

SDP is an image representation approach based on the polar coordinate system, which can directly transform an original signal into an SDP image. The method is simple to operate, low in computational requirements, and robust against noise, and it is able to clearly show the amplitude, phase, and frequency of the vibration signal and other key information so that the fault characteristics can be expressed intuitively [24,25,26,27]. At the same time, the method can also achieve multi-channel data fusion, which can be used in the fault diagnosis collar to reduce the workload of the algorithm to recognize the type, location, and extent of the fault.
For the obtained time domain signal X = x 1 , x 2 , , x n 1 , x n , x i denotes the signal amplitude at the moment of i , x i + l denotes the signal amplitude at the moment of i + l , and the time domain signal can be transformed into polar coordinates through normalization to generate the corresponding SDP image. Figure 3 shows the basic principle of the SDP image conversion method, r ( i ) is the polar radius, θ ( i ) is the angle of counterclockwise rotation of the polar coordinates along the line, and ϕ ( i ) is the angle of clockwise rotation of the polar coordinates along the initial line. The relation between the amplitude and frequency of the time domain signal can be simply and directly mapped to the polar coordinate space through Formulas (1)–(3), and fault information included in the time domain signal can be presented through the resulting image. The mapping relationship is given by the following:
r ( i ) = x i x max x max x min
θ ( i ) = θ + x i + l x min x max x min ζ
ψ ( i ) = θ x i + l x min x max x min ζ
where x i is the i th sample amplitude of the signal, x min is the minimum amplitude of the signal, x max is the maximum amplitude of the signal, l is the time interval parameter, θ is the rotation angle of the mirror plane of symmetry, θ = 360 ° / m , m = 1 , 2 , , n , ξ is the angular amplification factor.
Figure 3. Principle diagram of SDP method.
To provide a clearer algorithmic description of the SDP transformation, particularly for triaxial vibration signals (X, Y, Z axes), the procedure is summarized as follows. First, the individual time-domain signals from each axis are normalized to a uniform scale. Subsequently, these normalized signals are mapped into the polar coordinate system using Equations (1)–(3). Specifically, for each data point in the signal, a corresponding point is plotted in the polar plane, with its radius determined by the signal amplitude and its angle by the time index and the predefined rotation parameters. Finally, by plotting all the transformed points from all three axes onto a single polar plane and connecting them with smooth lines or distinct markers, a composite SDP image is generated. This image effectively fuses the multi-channel vibration information, creating a unique pattern that visually represents the underlying fault characteristics.
As shown in Figure 3, the coordinate axis OX is the polar axis and r ( i ) is the polar radius, and the signal amplitude at a certain point of the signal can be expressed at the position shown in the figure by combining the angular amplification factor ξ . θ ( i ) and ψ ( i ) are the two deflection angles about the mirror plane of symmetry, and their deflection angles are symmetrical about the polar axis.

2.2. DenseNet

The main idea of DenseNet is to introduce dense connections into the network, which can effectively achieve and strengthen feature reuse and enhance feature propagation between the front and back layers of the network. This allows the network to make better use of the shallow feature information so as to raise the network’s performance and efficiently prevent the model from encountering degradation and overfitting problems [28,29,30].
In a traditional convolutional neural network, if the network has L layers, it will produce L connections, i.e., the inputs of each layer come from the outputs of the previous layer of the network. However, the dense connections introduced in DenseNet allow L layers of the network to produce L ( L + 1 ) / 2 connections, i.e., the inputs of each layer receive the outputs of all the first few layers, allowing the features extracted from each layer to be effectively used. Figure 4 represents the basic structure of a dense connection.
Figure 4. Dense connection structure.
From Figure 4, x 0 is the initial data input, and the input of H 1 is x 0 ; the inputs of H 2 are x 0 and x 1 ; and the inputs of layer i can be obtained by the following analogy: x i = H i x 0 , x 1 , , x i 1 . Where H indicates a nonlinear transformation function, it is a combination of operations that might involve an array of batch normalization, activation, pooling, and convolution manipulations, and it should be noted that there may actually be more than one convolutional layer between layers i and i 1 here.
Convolutional neural networks generally go through a pooling layer or a convolution operation with a step size greater than 1 to reduce the size of the characteristic diagram, whereas DenseNet’s special dense connection approach requires that the size of the characteristic diagram remain consistent. To overcome this challenge, the structure of the alternate connection of the dense block and transition layers is used in DenseNet, where the dense block is a module containing many layers, each layer has the same size as the characteristic diagram, and the layers are densely connected to each other. The role of the transition layer is to link two neighboring dense blocks and to reduce the characteristic diagram size by pooling operations. Figure 5 shows the DenseNet structure.
Figure 5. DenseNet structure.

2.3. Inception Module

The inception module is the most central network substructure in GoogLeNet, with a structure designed to capture features at different scales and levels, being able to retain more features in the input signal. The design of the inception module is based on the following core idea: a parallel extraction effort of the input signal using multiple convolutional kernels of diverse sizes and pooling layers, and then stacking the results of this multi-branch convolution and passing them on to subsequent input layers. It uses different-sized convolutional kernels in parallel within a layer, as well as maximum pooling operations, to be able to capture features at different scales of the image without significantly increasing the computational burden. Figure 6 shows the Inception module structure [31,32].
Figure 6. Inception module structure.

3. Proposed Method

3.1. Improved Inception Module

To raise the feature extraction capacity of the neural network, this article is based on the design idea of the inception module and aims to improve the existing inception module. The idea for improvement is to replace the original structure with a larger convolutional kernel in a multi-branching format and then decompose the larger convolution kernel in order to decrease the parameter calculations. Figure 7 shows the improved inception structure.
Figure 7. Improved inception module structure.
The overall structure of the improvement module is as follows: three sizes of convolutional kernels, 1 × 1, 3 × 3, and 7 × 7, are designed to perform multi-scale parallel feature extraction on the input image so that it can retain feature information at multiple scales. The improved module is analyzed line by line from bottom to top: the first line is a combination of the 1 × 1 convolution and 7 × 7 convolution, which uses a larger sensory field to convolve the original signal and retains the large-scale information, and to decrease parameter calculations for the large-scale convolution, the 7 × 7 convolution is decomposed into a combination of three 3 × 3 convolutions without changing the size of the characteristic diagram. The second line is a combination of the 1 × 1 convolution and 3 × 3 convolution using a medium-scale convolutional kernel to extract features on a small scale for the input features. The third line, 1 × 1 convolution, uses a small-scale convolutional kernel for performing lightweight feature extraction and dimension reduction operations. The fourth line is a combination of 3 × 3 pooling and 1 × 1 convolution, which helps in extracting local information from the input signal.
The improved inception module is able to perform multi-scale feature extraction on the input signal, and the feature outputs of the four branches are spliced in the channel dimension, which maximally enriches the feature information under different sensory fields without changing the height and width of the feature matrix. The decomposition operation of the large convolutional kernel greatly reduces the amount of parameter computation during the convolutional process, which provides good feature data for the subsequent neural network for fault diagnosis.

3.2. DenseNet-I

In this article, we use the DenseNet architecture to build a fault diagnosis model for slipper wear fault diagnosis. Firstly, the improved inception module is inserted into the forefront of DenseNet to strengthen the feature extraction capacity of the original signal, maximize the feature information of the original signal to be retained, and form the output with rich feature representation, which enhances the robustness of the model against diverse scales of information to some degree and, in the meantime, improves its computational efficiency.
The DropBlock approach [33] is then added to the convolutional layers in each dense block of DenseNet as a regularization method to optimize the overall network. The basic mechanism is to deactivate a space block, which is a part of the characteristic diagram, by selecting it and setting the characteristic values in that block to zero. It is more targeted at the spatial level, and can remove a block of information and deactivate spatial blocks with greater probability, which introduces additional regularization effects and helps the network generalize better to new data. The formula for the DropBlock method is as follows:
γ = ( 1 k e e p _ p r o b ) × ( f e a t _ s i z e ) 2 ( b l o c k _ s i z e ) 2 × ( f e a t _ s i z e b l o c k _ s i z e + 1 )
where γ is the value of the Bernoulli function probability, i.e., the probability of the discard process, which affects the strength of the regularization, k e e p _ p r o b is the probability of each block feature being retained, f e a t _ s i z e is the size of the feature map, and b l o c k _ s i z e is the length and width of the neuron squares that were discarded.
This approach not only helps the network to learn and exploit features better but also reduces the risk of overfitting and improves the generalization capability of the network. In the meantime, the CBAM is added between the dense block and the transition layer in DenseNet. CBAM is made up of a channel attention module and a spatial attention module. The feature vectors extracted in the previous level through the convolutional layer are weighted by CBAM, which performs weighting operations on them in both the channel and spatial dimensions to enable adaptive feature selection and enhancement. It highlights the main features and suppresses extraneous features so that the model pays more attention to the content and positional message of the target that needs to be detected, so as to raise its diagnostic accuracy. ReLU is used as the activation function in both the dense block and transition layer in traditional DenseNet. Considering its issues, for example, easily generating gradient disappearance and neuronal node death, this study employs the Leaky ReLU function [34] as the activation function throughout DenseNet.
Considering that DenseNet contains a total of four dense blocks, which can classify 1000 kinds of images, this paper only needs to achieve the goal of fault diagnosis for four different forms of slipper wear to reduce the number of neural network layers and speed up the calculation, accelerating the model. After adding a variety of modules to improve the original DenseNet, DenseBlock4 is deleted, and the improved network is named DenseNet-I. On the one hand, the dense connection of DenseNet enhances the feature propagation between layers and promotes the extraction of rich feature messages. On the other hand, the features extracted by the network are able to perform channel and space weighting operations for the desired features after passing through two layers of the CBAM, which enormously enhances the expressive power of the network. Table 2 shows the overall structure and parameters of DenseNet-I. Figure 8 shows the network structure.
Table 2. The structure and parameters of DenseNet-I.
Figure 8. The structure of DenseNet-I.
Figure 9 shows the fault diagnosis process. Firstly, the triaxial vibration signals are converted into SDP images and constitute the dataset. The dataset is subdivided into a training set and a test set. Then, DenseNet-I is trained on the training set to achieve parameter optimality. Finally, the trained model is tested on the test set. Compared with the fault diagnosis method applied in traditional convolutional neural networks, the proposed method derives richer information from the input samples, has a stronger feature extraction capacity, achieves more efficient feature transfer, slows down the gradient vanishing problem, and promotes feature reuse.
Figure 9. Fault diagnosis process.

4. Experimental Results and Analysis

4.1. Experiment Description

This experiment uses an A10V71DRG/31L (Bosch Rexroth AG, Stuttgart, Germany) swashplate axial piston pump, according to its performance parameters, structural characteristics, and failure simulation experimental needs, to build the piston pump slipper fault simulation test bench shown in Figure 10. The test bench can be subdivided into three parts by its functional composition: a hydraulic system, a measurement and control system, and an electronic control system. The control voltage of the inverter can be altered in real time on the user interface of the test bench, thus changing the speed of the piston pump according to the experimental requirements. The control current of the relief valve can also be changed to achieve different pressure requirements of the system. Meanwhile, the rational allocation of the data acquisition board can achieve the acquisition and storage of key signals such as the pressure, flow, and vibration of the system. The faulty parts are prepared based on the mechanisms of different forms of slipper wear and the changes in the friction coefficient during the wear process of the slipper pair. Figure 10 shows the slipper’s faulty parts with different wear forms and diverse degrees of wear.
Figure 10. Test bench and faulty parts.
Slider wear failure signals are captured by using the YD35D triaxial vibration sensor. The triaxial vibration sensor is installed with a magnetic base in the center of the piston pump end cover so that the sensor is longitudinally parallel to the ground to ensure that its X, Y, and Z axes are correctly positioned in space; the vibration sensor arrangement is shown in Figure 10.
In order to collect more information about the piston pump slipper wear faults, along with the parameters of the data acquisition board, the acquisition frequency of the triaxial vibration signal was set to 20 kHz.
The acquisition process was chosen to be carried out at a pressure of 28 MPa, the rotational speed was set to 1500 r/min, and the sampling time was set to 60 s.

4.2. SDP Image Dataset Construction

Figure 11 shows a comparison between the Continuous Wavelet Transform (CWT) and the SDP transformation. The SDP method generates a single composite image by fusing the triaxial signals, whereas CWT produces three separate time–frequency representations. The SDP image offers a more compact and computationally efficient representation, consolidating fault features from all three channels into a single input for the neural network, thereby simplifying the network architecture and improving diagnostic efficiency.
Figure 11. Comparison between SDP images and CWT transformed images.
Table 3 shows the results of a controlled variable experiment that systematically analyzes the influence of various parameters on the SDP petal morphology, conducted with an empirical value of θ = 60 ° . Based on these findings, the final conversion parameters were determined to be θ = 60 ° , l = 7 , and ξ = 50 ° .
Table 3. SDP images of slipper abrasive wear vibration signals under different parameters.
The collected triaxial vibration signals of four different wear forms of the slipper are used to construct the dataset, taking the signals under the normal state of the slipper as an example, and the dataset is constructed according to the following steps:
(1)
The triaxial vibration signals acquired at an acquisition frequency of 20 kHz over a period of 60 s are arranged in the order of X, Y, and Z to form a two-dimensional array of samples in three columns;
(2)
With 1024 as the data length for the two-dimensional array of three columns of data, at the same time, for successive extraction in accordance with θ = 60 ° , l = 7 and ξ = 50 ° are the key parameters of the SDP image conversion of the three columns of data fusion, constituting a fault sample image [27];
(3)
A total of 1000 sample images are constructed by performing 1000 extractions of a two-dimensional array and performing SDP image conversion according to the above parameters;
(4)
The above 1000 samples are subdivided into training and test sets of the model in the ratio of 9:1.
The remaining slipper wear fault data follow the above steps, and Table 4 shows the deep learning dataset of the piston pump slipper faults.
Table 4. SDP image dataset of different wear forms.
There are 1000 samples for each of the four different slipper wear states: normal slipper, abrasive wear, corrosive wear, and adhesive wear. After fusion, each SDP sample image contains 1024 sample points in each of the three axes of X, Y, and Z. There are a total of 1000 sample images for each type of fault. The results of the SDP fusion of the vibration signals of a randomly selected set of slipper wear cases in different wear forms are shown in Figure 12.
Figure 12. SDP images of vibration signals of different wear forms of the slipper (the red and blue colors distinguish petals from the same axis for comparative analysis).

4.3. Modules Validity Experiments

The model was trained for 100 epochs with d r o p _ s i z e = 0.1 , b l o c k _ s i z e = 7 , a batch size of 64, and a learning rate of 0.01.
To study the influence of each module in the proposed DenseNet-I on the effectiveness of the overall model’s fault diagnosis, the improved Inception module, the CBAM, and the DropBlock method are used as an ordered combination of 3 modules for the comparison experiments, and 10 experiments are averaged. Table 5 shows the experimental classifications and outcomes. The error rates for each of the ten runs across all eight experiments are within ±1%, demonstrating the high stability and repeatability of our method.
Table 5. Comparison experiment of adding different modules to DenseNet-I.
Analyzing Table 5, it can be observed that the improved inception module, the CBAM, and the DropBlock method all improve the effectiveness of DenseNet-I to a certain extent, as applied to the diagnosis of different wear forms in slippers, among which the addition of the attention mechanism has the greatest impact on the enhancement of the categorization ability of the model, and compared with the non-addition of that module, it improves the accuracy rate by 5.1%. The four sets of comparison experiments, 5, 6, 7, and 8, show that the performance of the model cannot be optimized without any of the modules. In Experiment 8, DenseNet-I with all three modules added has the best categorization ability, with accuracy, precision, recall, and F1 scores of 97.3%, 96.5%, 97.5%, and 97.0%, respectively. The results of this experiment show that after adding the improved inception module, CBAM, and DropBlock method, the fault diagnosis performance of the system is significantly increased, and the method has a better result in diagnosis.

4.4. Comparative Models

To further validate the advantage of DenseNet-I applied to the fault diagnosis of different wear forms of slippers, it is compared with various classical fault diagnosis network models in the same dataset as the comparison experiments, for example, ResNet18 [35], and 10 experiments are averaged. Table 6 shows the results.
Table 6. Comparison of experimental results of different network models.
Combining the characteristics of each network model and analyzing the diagnostic outcomes in Table 6, it can be seen that LeNet5 [36], which consists of two convolution layers and three fully connected layers, is a simpler model with the lowest classification accuracy. The classification accuracies of ResNet18 and GoogleNet [37] are 96.1% and 95.8%, respectively, which shows that the inclusion of residual blocks and a multi-channel structure in the network is optimized for the task of fault diagnosis of multiple forms of slipper wear. VGG16 [38] performs relatively well in this experiment with a recognition accuracy of 96.7%, and its use of small- and medium-sized convolutional kernels and deeper network depth enhances the recognition capacity of the network. DenseNet-I has the highest accuracy among the five different fault diagnosis models in the task of fault diagnosis of different forms of slipper wear, with an accuracy of 97.3%, representing that the model has a strong capacity to discriminate between faulty and non-faulty samples in general. Its evaluation metrics, such as the F1 score, are also high compared to other network models. The F1 score is a reconciled average of precision and recall, and an F1 score of 0.973 indicates that the present model strikes a good balance between precision and recall and is both accurate and comprehensive in predicting faults. The high recall of the model indicates that the model is able to capture the faulty samples well with little underreporting. By comparing the fault diagnosis effect of this model with traditional convolutional neural networks, the effectiveness of introducing the improved inception module, CBAM, and the DropBlock method in DenseNet is demonstrated, as well as the superior performance, strong generalization ability, and good fault diagnosis effect of the proposed network, DenseNet-I, in similar fault diagnosis tasks.
To intuitively evaluate the categorization effectiveness of a model, data mining and analysis are often performed using statistical tools and visualization methods such as a confusion matrix, which provides an intuitive representation of how well a model correctly and incorrectly identifies a particular dataset. Figure 13 represents the confusion matrix generated for the application of the DenseNet-I model to slipper wear fault diagnosis, where the horizontal coordinates are the given labels of the dataset and the vertical coordinates are the predicted labels of the model for a given input data. It can be concluded from the figure that for the 100 samples of corrosive slipper wear faults from label 2, two samples were misidentified as abrasive slipper wear faults from label 1, and for the 100 test samples of adhesive slipper wear from label 3, the number of samples identified as being normal, or showing abrasive or corrosive wear of the slipper were 3, 3, and 2, respectively. The model’s recognition rate for the four faults is at a relatively high level overall.
Figure 13. Confusion matrix (different wear forms).
To better observe the feature learning ability of the DenseNet-I fault diagnosis model and examine its performance, this paper adopts the t-SNE dimension reduction visualization method to perform nonlinear dimension reduction operations on the high-dimensional features learned from the fully connected layer, and the generated t-SNE visualization diagram is shown in Figure 14. Each data point on Figure 14 represents a test sample, and the values of the horizontal and vertical axes represent the result of the dimension reduction in this sample point, with different fault types corresponding to points of different colors on the image. By viewing the figure, the feature data points of the four diverse states of the slipper can be clearly and regularly aggregated in different locations, which proves that the DenseNet-I model can achieve a high degree of aggregation of the same kind of data after feature learning and extraction of the input data, and that there is a clear demarcation between the different kinds of data, which represents that the DenseNet-I model built has a powerful feature learning and classification capability. It can be shown that the constructed DenseNet-I model has powerful feature learning and classification abilities, which can produce higher accuracy in recognizing the four forms of piston pump slipper wear.
Figure 14. t-SNE feature visualization (different wear forms).

4.5. Fault Diagnosis of Slipper Wear with Different Wear Degrees and Composite Wear Forms

To confirm the generalization of the DenseNet-I model proposed in this article for application to other datasets, the model is used in this section to perform fault diagnosis for different wear degrees of the slipper on individual slippers, as well as for different slipper composite wear forms. Referring to the construction idea and diagnostic process in Section 4.2, the dataset for this two-group experiment is constructed and divided. The first set is used to construct the dataset of different wear degrees of the slipper, as shown in Table 7, with the fault categories of normal, slight, moderate, and severe wear of the slipper. Each category contains 1000 samples. The second group is used to construct the dataset of different composite wear forms of slippers, as shown in Table 8. The composite wear forms are slipper normal, adhesive wear, and corrosive wear slipper failure, adhesive wear and abrasive wear slipper failure, corrosive wear and abrasive wear slipper failure, and adhesive wear and corrosive wear and abrasive wear slipper failure. Each category contains 1000 samples.
Table 7. SDP image dataset of different slipper wear degrees.
Table 8. SDP image dataset of different slipper composite wear forms.
The DenseNet-I model hyperparameters and the critical parameters of each module are the same as in Section 4.4. Experiments are conducted on the SDP image dataset with different wear degrees on a single slipper and on the SDP image datasets with different composite wear forms on a slipper. At the end of the experiment, the capacity of the model on the two test datasets is analyzed and studied using the confusion matrix and the t-SNE visualization tool. The outcomes associated with the testing of individual slipper datasets with different wear degrees are plotted as shown in Figure 15 and Figure 16. The results related to the testing of different slipper composite wear form datasets are plotted as shown in Figure 17 and Figure 18.
Figure 15. Confusion matrix (different wear degrees).
Figure 16. t-SNE feature visualization (different wear degrees).
Figure 17. Confusion matrix (different composite wear forms).
Figure 18. t-SNE feature visualization (different composite wear forms).
Analysis of Figure 15 shows that, of the 100 test samples with label 1 (slight wear), three were misidentified as label 0 (slipper normal), and another three were misidentified as label 2 (moderate wear). Only one of the test samples with label 3 (serious wear) was misidentified as label 2 (moderate wear); all other samples were correctly classified and identified. As can be seen from Figure 17, only 1 out of 100 samples comprising label 1 (composite wear forms of slipper with adhesive wear and corrosive wear) was misidentified as label 3 (composite wear forms of slipper with corrosive wear and abrasive wear). Out of 100 samples comprising label 4 (composite wear forms of slipper with adhesive wear and corrosive wear, and abrasive wear), 2 were misidentified as label 1 (composite wear forms of slipper with adhesive wear and corrosive wear), and 1 was misidentified as label 3 (composite wear forms of slipper with corrosive wear and abrasive wear). All other samples were correctly classified and identified.
Figure 16 and Figure 18 tell us that the feature points of the four classes of samples with different degrees of wear on one single slipper and the five classes of samples with different slipper wear forms are highly clustered. The categories are distant from each other with clear boundaries. The above analyses show that DenseNet-I has a good classification performance and feature-learning ability in the problem of diagnosing slipper faults with different wear degrees on single slippers, as well as slipper faults with different slipper composite wear forms. This experiment verifies the generalization capacity of the model when applied to diverse datasets.
To further validate the generalization ability of the proposed framework, additional experiments were conducted using the Case Western Reserve University (CWRU) bearing dataset. The results show that our method achieves a diagnostic accuracy of over 98%, outperforming comparative baseline models, including LeNet-5 and VGG16, which both yielded accuracies below 98% under identical experimental conditions. This demonstrates the strong transferability and robustness of our approach across different mechanical systems.

5. Conclusions

In the paper, a fault diagnosis method based on SDP and multi-channel DenseNet is proposed for slipper wear fault diagnosis of swashplate axial piston pumps. The method transforms the triaxial vibration signals into SDP images as inputs to the model and replaces the small convolutional kernel in the inception module with a larger convolution kernel to raise the feature extraction capacity of the network. In addition, the introduction of the CBAM into DenseNet raises the accuracy of fault diagnosis. Replacing the existing ReLU function in the network with a Leaky ReLU function solves the problem of neuron deactivation in networks. A DropBlock regularization method was also added to DenseNet to raise the generalization ability of the network. Sufficient experiments were conducted on several SDP image datasets, and the method performed well. The method is able to solve similar fault diagnosis problems with better accuracy and generalization compared to other state-of-the-art methods. Around algorithmic programming management, we have retained the unique functional effects of DenseNet and made innovative optimizations to strengthen the differentiation of the DenseNet-I algorithm for similar features and simplify the computation so that it can be used in the domain of fault diagnosis of various similar features.
In the process of conducting this study, the slipper wear faults were injected manually, which differs from the actual faults of axial piston pumps; meanwhile, the fault type, load pressure, and rotational speed of the test piece were always stable and unchanged during the experimental process, which is different from the actual conditions of axial piston pumps, and the capacity of the model in complex industrial working conditions has yet to be verified.
In future work, the fault injection method will be optimized to further enhance the availability of the proposed method by taking into account the variations in fault types, fluctuations of load pressure, and changes in rotational speed that exist in the real conditions of the piston pump industry.

Author Contributions

Conceptualization, H.A.; methodology, H.A. and Y.G.; software, H.H. and S.M.; validation, H.H., R.P., and C.L.; formal analysis, G.L.; investigation, C.L.; resources, M.S. and Z.D.; data curation, M.S.; writing—original draft preparation, Y.G. and C.L.; writing—review and editing, H.H.; visualization, Z.D.; supervision, H.A.; project administration, H.A.; funding acquisition, G.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (52275066) and Tianchi Talent introduction Program (2024XGYTCYC02).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author(s).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Shi, C.; Wang, W.; Wang, S.-P.; Liu, D.; Zhang, S. Partial abrasion modeling and variable load characteristics analysis for high-speed load sensing electro-hydrostatic actuator pump of aircraft. Adv. Mech. Eng. 2024, 16, 16878132241288021. [Google Scholar] [CrossRef]
  2. Selim, E.T.; Serkan, D. Stresses in multi-layer coatings in Hertzian contact with a moving circular punch. Tribol. Int. 2022, 170, 107565. [Google Scholar] [CrossRef]
  3. Gao, Y.D.; Piltan, F.; Kim, J.M. A Novel Image-Based Diagnosis Method Using Improved DCGAN for Rotating Machinery. Sensors 2022, 22, 7534. [Google Scholar] [CrossRef]
  4. Chao, Q.; Gao, H.; Tao, J.; Liu, C.; Wang, Y.; Zhou, J. Fault diagnosis of axial piston pumps with multi-sensor data and convolutional neural network. Front. Mech. Eng. 2022, 17, 36. [Google Scholar] [CrossRef]
  5. Chao, Q.; Gao, H.H.; Tan, J.F.; Wang, Y.H.; Zhou, J.; Liu, C.L. Adaptive decision-level fusion strategy for the fault diagnosis of axial piston pumps using multiple channels of vibration signals. Sci. China-Technol. Sci. 2022, 65, 470–480. [Google Scholar] [CrossRef]
  6. Du, Z.; Zhang, Y.; Lu, W.; Huang, W. Research on prediction methods for eccentric wear of the slipper pair in axial piston pumps and lubrication performance analysis. J. Tribol. 2024, 146, 092201. [Google Scholar] [CrossRef]
  7. Dong, Z.K.; An, H.J.; Liu, S.Y.; Ma, S.H.; Guo, Y.X.; Pan, H.X.; Ai, C. Domain adversarial transfer fault diagnosis method of an axial piston pump based on a multi-scale attention mechanism. Measurement 2025, 239, 115455. [Google Scholar] [CrossRef]
  8. Wang, S.; Shuai, H.Q.; Hu, H.H.; Zhang, J.T.; Liu, S.Y.; Yuan, X.M.; Liang, P.F. Few-shot fault diagnosis of axial piston pump based on prior knowledge-embedded meta learning vision transformer under variable operating conditions. Expert Syst. Appl. 2025, 269, 126452. [Google Scholar] [CrossRef]
  9. Li, Y.; Zhang, L.J.; Liang, P.F.; Wang, X.F.; Wang, B. Semi-supervised meta-path space extended graph convolution network for intelligent fault diagnosis of rotating machinery under time-varying speeds. Reliab. Eng. Syst. Saf. 2024, 251, 110363. [Google Scholar] [CrossRef]
  10. Yu, T.; Jiang, Z.; Ren, Z.; Zhang, Y.; Zhou, S.; Zhou, X. A cooperative convolutional neural network framework for multisensor fault diagnosis of rotating machinery. IEEE Sens. J. 2024, 24, 38309–38317. [Google Scholar] [CrossRef]
  11. Liu, S.Y.; Yin, J.X.; Hao, M.; Liang, P.F.; Zhang, Y.Q.; Ai, C.; Jiang, W.L. Fault diagnosis study of hydraulic pump based on improved symplectic geometry reconstruction data enhancement method. Adv. Eng. Inform. 2024, 61, 102459. [Google Scholar] [CrossRef]
  12. Ding, Y.F.; Jia, M.P.; Miao, Q.H.; Cao, Y.D. A novel time-frequency Transformer based on self-attention mechanism and its application in fault diagnosis of rolling bearings. Mech. Syst. Signal Process. 2022, 168, 108616. [Google Scholar] [CrossRef]
  13. Vashishtha, G.; Chauhan, S.; Sehri, M.; Hebda-Sobkowicz, J.; Zimroz, R.; Dumond, P.; Kumar, R. Advancing machine fault diagnosis: A detailed examination of convolutional neural networks. Meas. Sci. Technol. 2022, 33, 022001. [Google Scholar] [CrossRef]
  14. Zhang, Y.; Han, D.Y.; Shi, P.M. Semi-supervised prototype network based on compact-uniform-sparse representation for rotating machinery few-shot class incremental fault diagnosis. Expert Syst. Appl. 2024, 255, 124660. [Google Scholar] [CrossRef]
  15. Zhang, Y.; Han, D.Y.; Tian, J.H.; Shi, P.M. Domain adaptation meta-learning network with discard-supplement module for few-shot cross-domain rotating machinery fault diagnosis. Knowl.-Based Syst. 2023, 268, 110484. [Google Scholar] [CrossRef]
  16. He, Y.; Tang, H.S.; Ren, Y.; Kumar, A. A deep multi-signal fusion adversarial model based transfer learning and residual network for axial piston pump fault diagnosis. Measurement 2022, 192, 110889. [Google Scholar] [CrossRef]
  17. Tang, S.N.; Zhou, Y.; Yuan, S.Q. Intelligent fault diagnosis of hydraulic piston pump based on deep learning and Bayesian optimization. ISA Trans. 2022, 129, 555–563. [Google Scholar] [CrossRef]
  18. Ugli, O.E.M.; Lee, K.H.; Lee, C.H. Automatic optimization of one-dimensional CNN architecture for fault diagnosis of a hydraulic piston pump using genetic algorithm. IEEE Access 2023, 11, 68462–68472. [Google Scholar] [CrossRef]
  19. Guo, J.; Yang, Y.; Li, H.; Wang, J.; Tang, A.; Shan, D.; Huang, B. A hybrid deep learning model towards fault diagnosis of drilling pump. Appl. Energy 2024, 372, 123773. [Google Scholar] [CrossRef]
  20. Hatami Garousi, M.; Karimi, M.; Casoli, P.; Rundo, M.; Fallahzadeh, R. Vibration analysis of a centrifugal pump with healthy and defective impellers and fault detection using multi-layer perceptron. Engineering 2024, 5, 2511–2530. [Google Scholar] [CrossRef]
  21. Du, B.C.; Huang, W.; Cheng, Y.; Chen, J.F.; Tao, R.C.; Cui, S.M. Fault diagnosis and separation of PMSM rotor faults using search coil based on MVSA and random forests. IEEE Trans. Ind. Electron. 2024, 71, 15089–15099. [Google Scholar] [CrossRef]
  22. Wang, T.; Xu, X.; Pan, H.X. Anti-noise transfer adversarial convolutions with adaptive threshold for rotating machine fault diagnosis. ISA Trans. 2024, 146, 175–185. [Google Scholar] [CrossRef]
  23. Ding, X.; Deng, A.D.; Deng, M.Q.; Liu, D.Y.; Shi, Y.W. Mitigating decision boundary confusion: A classifier prediction-oriented domain adaptation network. IEEE Sens. J. 2024, 24, 18672–18684. [Google Scholar] [CrossRef]
  24. Liu, J.Y.; Zhang, Q.; Xie, F.Q.; Wang, X.S.; Wu, S.J. Incipient fault detection of planetary gearbox under steady and varying condition. Expert Syst. Appl. 2023, 233, 121003. [Google Scholar] [CrossRef]
  25. Sun, Y.J.; Li, S.H.; Wang, Y.L. Fault diagnosis of rolling bearing based on empirical mode decomposition and improved Manhattan distance in symmetrized dot pattern image. Mech. Syst. Signal Process. 2021, 159, 107817. [Google Scholar] [CrossRef]
  26. Sun, Y.J.; Li, S.H.; Wang, Y.L.; Wang, X.H. Bearing fault diagnosis based on optimal convolution neural network. Measurement 2021, 190, 110702. [Google Scholar] [CrossRef]
  27. Khan, I.U.; Hua, C.; Li, L.; Zhang, L.; Yang, F.; Liu, W.Q. Remaining useful life prediction for a cracked rotor system via moving feature fusion-based deep learning approach. Measurement 2025, 239, 115433. [Google Scholar] [CrossRef]
  28. Li, Y.H.; Wang, Y.P.; Zhao, X.; Chen, Z. A deep reinforcement learning-based intelligent fault diagnosis framework for rolling bearings under imbalanced datasets. Control Eng. Pract. 2024, 145, 105845. [Google Scholar] [CrossRef]
  29. Wang, G.; Zhang, Y.N.; Zhang, F. An ensemble method with DenseNet and evidential reasoning rule for machinery fault diagnosis under imbalanced condition. Measurement 2023, 214, 112806. [Google Scholar] [CrossRef]
  30. Fu, W.L.; Chen, Y.P.; Li, H.Y.; Chen, X.Y.; Chen, B.J. Imbalanced fault diagnosis using conditional Wasserstein generative adversarial networks with switchable normalization. IEEE Sens. J. 2023, 23, 29119–29130. [Google Scholar] [CrossRef]
  31. Li, Z.T.; Wang, P.L.; Yang, Z.Y.; Li, X.Y.; Jia, R.N. IInception-CBAM-IBiGRU based fault diagnosis method for asynchronous motors. Sci. Rep. 2024, 14, 5192. [Google Scholar] [CrossRef]
  32. Chi, Y.L.; Cai, C.Z.; Ren, J.H.; Xue, Y.F.; Zhang, N. Damage location diagnosis of frame structure based on wavelet denoising and convolution neural network implanted with Inception module and LSTM. Struct. Health Monit. 2024, 23, 57–76. [Google Scholar] [CrossRef]
  33. Pei, X.W.; Dong, S.J.; Tang, B.P.; Pan, X.J. Bearing running state recognition method based on feature-to-noise energy ratio and improved deep residual shrinkage network. IEEE-ASME Trans. Mechatron. 2022, 27, 3660–3671. [Google Scholar] [CrossRef]
  34. Han, H.R.; Wang, H.; Liu, Z.L.; Wang, J.Y. Intelligent vibration signal denoising method based on non-local fully convolutional neural network for rolling bearings. ISA Trans. 2022, 122, 13–23. [Google Scholar] [CrossRef] [PubMed]
  35. Fu, Y.F.; Ji, Y.; Meng, G.; Chen, W.; Bai, X.J. Three-phase inverter fault diagnosis based on an improved deep residual network. Electronics 2023, 12, 3460. [Google Scholar] [CrossRef]
  36. Li, T.; Jiao, Z.L.; Wang, L.N.; Mu, Y. A method of DC arc detection in all-electric aircraft. Energies 2020, 13, 4190. [Google Scholar] [CrossRef]
  37. Yang, G.; Wei, Y.Q.; Li, H.K. Acoustic diagnosis of rolling bearings fault of CR400 EMU traction motor based on XWT and GoogleNet. Shock Vib. 2022, 2022, 2360067. [Google Scholar] [CrossRef]
  38. Wu, H.; Yang, Y.P.; Deng, S.J.; Wang, Q.M.; Song, H. GADF-VGG16 based fault diagnosis method for HVDC transmission lines. PLoS ONE 2022, 17, e0274613. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.