Open Set Bearing Fault Diagnosis with Domain Adaptive Adversarial Network under Varying Conditions

Zhang, Bo; Li, Feixuan; Ma, Ning; Ji, Wen; Ng, See-Kiong

doi:10.3390/act13040121

Open AccessArticle

Open Set Bearing Fault Diagnosis with Domain Adaptive Adversarial Network under Varying Conditions

by

Bo Zhang

¹

,

Feixuan Li

¹

,

Ning Ma

¹,

Wen Ji

^2,* and

See-Kiong Ng

³

¹

School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China

²

School of Electrical and Control Engineering, Xuzhou University of Technology, Xuzhou 221116, China

³

Institute of Data Science, National University of Singapore, Singapore 117602, Singapore

^*

Author to whom correspondence should be addressed.

Actuators 2024, 13(4), 121; https://doi.org/10.3390/act13040121

Submission received: 29 January 2024 / Revised: 9 March 2024 / Accepted: 22 March 2024 / Published: 25 March 2024

(This article belongs to the Section Control Systems)

Download

Browse Figures

Versions Notes

Abstract

Bearing fault diagnosis is a pivotal aspect of monitoring rotating machinery. Recently, numerous deep learning models have been developed for intelligent bearing fault diagnosis. However, these models have typically been established based on two key assumptions: (1) that identical fault categories exist in both the training and testing datasets, and (2) the datasets used for testing and training are assumed to follow the same distribution. Nevertheless, these assumptions prove impractical and fail to accurately depict real-world scenarios, particularly those involving open-world assumption fault diagnosis in multi-condition scenarios. For that purpose, an open set domain adaptive adversarial network framework is proposed. Specifically, in order to improve the learning of distribution characteristics in different fields, comprehensive training is implemented using a deep convolutional autoencoder model. Additionally, to mitigate the negative transfer resulting from unknown fault samples in the target domain, the similarity of each target domain sample and the shared classes in the source domain are estimated using known class classifiers and extended classifiers. Similarity weight values are assigned to each target domain sample, and an unknown boundary is established in a weighted manner. This approach is employed to establish the alignment between the classes shared between the two domains, enabling the classification of known fault classes, while allowing the recognition of unknown fault classes in the target domain. The efficacy of our suggested approach is empirically validated using different datasets.

Keywords:

open set fault diagnosis; adversarial domain networks; rotating machines; multiple classifiers

1. Introduction

The quality of bearings can directly impact the performance, reliability, and safety of rotating machinery. However, rolling bearings are susceptible to a variety of faults due to their frequent exposure to challenging conditions such as high loads and speeds. Moreover, bearing faults carry substantial maintenance costs and can have serious consequences, underscoring the critical importance of timely diagnosis and intervention. Early fault diagnosis methods have primarily relied on human expertise. However, such a manual approach is inadequate for the real-time diagnosis of bearing failure issues during operation.

With the rapid advancement in sensor and signal processing technologies, we can now extract robust features from collected data that aid in identifying the condition of bearings [1]. By employing deep learning models to discern the differences between normal operation and various fault states of bearings, we can accurately predict the presence of specific fault categories in an end-to-end manner. This approach effectively addresses the limitations of traditional fault diagnosis methods, which heavily rely on expert knowledge and manual intervention.

The current bearing fault diagnosis methods employ sophisticated deep learning models such as convolutional neural networks (CNN) [2], autoencoders (AE) [3], and graph convolutional neural networks (GCN) [4]. For example, Ince et al. [5] employed a one-dimensional convolutional neural network model to directly analyze raw time-domain vibration signals to detect motor faults, thus significantly streamlining the time-consuming and labor-intensive feature extraction process. Mao et al. [6] introduced a bearing fault diagnosis approach based on deep autoencoders, which effectively integrated multiple types of fault discrimination information. Their study demonstrated the method’s capability to extract deep features with enhanced representation, mitigate overfitting, and ensure model stability. Additionally, Cao et al. [7] proposed an integrated transfer convolutional neural network that leveraged multi-sensor raw vibration signals, along with a newly devised decision fusion strategy, to classify fault types in gearboxes.

While significant progress has been made in utilizing mature deep learning models for intelligent bearing fault diagnosis, many current approaches are constrained by certain assumptions. Primarily, these methods assume that (1) the training and testing datasets share the same fault categories, and (2) the datasets used for testing and training adhere to the same distribution. However, in real-world industrial settings, bearing fault data can originate from diverse task requirements and environmental conditions. Consequently, the test dataset may include fault types not encountered in the training dataset, and there may be distributional disparities in the data collected under different operating conditions. Therefore, the aforementioned assumptions are impractical in practice.

To address the distributional differences in bearing fault data arising from various working conditions, many methods employ domain adaptation techniques. Domain adaptation can fully leverage the information from both the source and target domains to minimize the differences between distributions across different domains to the greatest extent possible. These approaches will be reviewed in detail in Section 2. However, the traditional domain adaptation methods have not effectively addressed the domain shift problem caused by the presence of unknown classes in the target domain. Therefore, a practical fault diagnosis model not only needs to align the distributions of shared categories of the two domains but also needs to have the ability to detect unknown classes, which is referred to as open set domain adaptation (OSDA) for fault diagnosis.

Research on the problem of OSDA is relatively mature in the field of computer vision. Busto and Gall [8] first proposed the OSDA problem and used linear support vector machines to detect target anomalies. The open set domain adaptation by back-propagation (OSBP) method proposed by Saito et al. [9] is an adversarial training strategy that constructs boundaries between known and unknown classes for classification purposes. In comparison, there is limited research in the literature on OSDA fault diagnosis, especially for rolling bearings.

Figure 1 shows the difference between closed-set and open set domain adaptation. To solve the OSDA bearing fault diagnosis problem, this study presents an open set domain adaptive adversarial network framework. First, we propose using a deep convolutional autoencoder model to better learn cross-domain fault category features. Then, we propose an adversarial model for domains. To encourage adversarial training and improve the positive transfer between samples, this paper assigns weights to target samples based on a similarity analysis between the target domain samples and known classes. In the test phase, the input sample’s category is determined by the unknown class classifier. The following are this paper’s main contributions:

(1) To address diverse domains or data distributions, we integrate supervised multiple classifiers with unsupervised deep convolutional autoencoders to accomplish known fault classification and capture intricate data distributions. The feature distributions of the samples in the target and source domains are successfully aligned by improving the encoder and decoder and using the scale-invariant mean squared error, leading to improved performance in bearing fault diagnosis.

(2) To address negative transfer in open set fault diagnosis, we employ multiple classifiers to compute similarity weights for target domain samples, learning their likelihood of belonging to shared categories. These weights are incorporated into the corresponding target domain samples, thereby excluding the unknown fault samples from the target domain and mitigating domain classification loss error.

(3) We conducted experiments on three laboratory bearing datasets, evaluating different transfer fault tasks and conducting a comparative analysis. Our findings emphatically demonstrated the effectiveness of our proposed open set domain adaptation method.

The remainder of this paper is structured as follows: Section 2 introduces the development of existing related work in domain adaptation. Section 3 elaborates on the design of the proposed OSDA method. Section 4 describes the experimental settings on multiple datasets and presents the validation and analysis of the experimental results. Finally, Section 5. provides a summary of the key findings and conclusions of this study.

2. Related Work

We briefly introduce various recent domain adaptation methods. These methods primarily address the constraint of label set relationships between different domains and are categorized into closed domain adaptation, partial domain adaptation, and open domain adaptation.

(a) Closed-Set Domain Adaptation (CSDA). As mentioned, CSDA means sharing the same class in both the source and target domains, focusing primarily on the goal of mitigating the performance degradation caused by domain differences. Early feature adaptation methods mainly relied on traditional statistical learning techniques to map or transform features from the source domain to the target domain, such as PCA, LDA, etc. As deep learning research has advanced, statistical moment alignment has become a popular approach. For example, Tzeng et al. [10] first used the maximum mean discrepancy (MMD) distance to measure the distance between two domains, achieving as much as possible alignment between domains. Long et al. [11] proposed a deep adaptation network (DAN) structure that extends deep convolutional neural networks to domain adaptation scenarios. Compared to MMD’s use of a single kernel function, DAN is extended to multiple kernels and linear combinations of kernel functions, achieving minimization of the maximum average difference between deep feature cross-domain distributions. Taking into account that target and source domains are not fully consistent, Long et al. [12] improved DAN by introducing residual transfer network structures into the source domain classifier and adding an entropy minimization criterion in the target domain. To further learn the differences in labels by considering domain differences, Long et al. [13] generalized the kernel function to a joint distribution through inner product operations.

With the emergence of generative adversarial networks (GANs), there has been a gradual increase in research focusing on adversarial learning applied to domain adaptation. This approach involves training a domain classifier, also called a discriminator, to distinguish between source and target domain features, while simultaneously training the feature extractor to confuse this discriminator by generating domain-invariant features [14]. Goodfellow et al. [15] proposed a generative framework based on adversarial learning, which trained models through competition. Ganin et al. [16] used a domain discriminator to distinguish two domains, and in the domain adversarial training paradigm, the feature extractor learned to confuse the domain discriminator.A conditional domain adversarial network(CDAN) [17] was also introduced to improve the domain adversarial network by matching the joint distribution of labels and features. The aforementioned work laid the foundation for further development of the field.

(b) Partial Domain Adaptation (PDA). Compared with traditional closed-set domain adaptation methods, partial domain adaptation assumes that the label spaces between the source and target domains are not completely identical and that the label set of the source domain should be large enough to include the target label set [18], which is more suitable for applications in big data scenarios. In order to solve the problem of partial domain adaptation in bearing fault diagnosis, many researchers have used a weighting method to distinguish between private and shared labels. Kuang et al. [19] proposed a double weighting method to align the weighted data by combining the class weights obtained by the classifier and the sample weights obtained by the domain discriminator. Wang et al. [20] incorporated class level weights into a DANN model and employed a balancing strategy to augment classes in the target domain with source samples to expand classes in the target domain using source samples to alleviate the negative impact of label space asymmetry. Huang et al. [21] added pseudo labels to the target domain samples and then used them to selectively weight the source features. Based on the introduction of the above methods, most PDA methods add weight to the source domain samples and filter out the data that only exists in the source domain label space.

(c) Open Set Domain Adaptation (OSDA). As mentioned, OSDA refers to the case that the source domain label set is a subset of the target domain label set. It assumes that the label set of the target domain is larger than the label set of the source domain, where the private class samples contained in the target domain are called unknown classes. The goal is to achieve accurate classification of known class samples in the target domain, while labeling private samples as unknown classes. Busto et al. [8] first proposed OSDA, which uses the assign and transform iteratively (ATI) algorithm to map the target samples to source classes and then trains an SVM for final classification. However, the ATI algorithm assumes the presence of unknown classes in the source domain that are not present in the target domain. The OSBP [9] method introduced by Saito et al. mitigates such assumptions. It employs a domain adversarial approach, emphasizing the establishment of an unknown boundary on the unknown class samples in the target domain using a fixed threshold. Furthermore, it extends the classifier by incorporating an additional unknown class discriminator. Liu et al. [22] carried out further investigation on the basis of the OSBP method, incorporating a coarse-to-fine weighted mechanism to gradually separate unknown and known samples, while balancing their importance for feature distribution alignment.

However, research on adaptive open set learning in the field of fault diagnosis is still limited. Currently, the majority of the existing methods rely on weighted approaches. For instance, Zhang et al. [23] introduced adversarial learning to extract generalized features and implemented an instance-level weighting mechanism to estimate the similarity between target and source domain samples, which was combined with entropy minimization to further improve the model’s generalization ability. Zhao et al. [24] studied a method based on dual adversarial learning. The weights assigned to the target samples by the auxiliary domain discriminator effectively distinguished the unknown classes. Guo et al. [25] achieved open set MMC fault diagnosis and distinguished known and unknown faults by introducing techniques such as batch normalization and layer normalization and by designing a multi-scale coordinate residual attention mechanism. Zhu et al. [26] realized domain adaptation and feature selective distribution alignment in open set intelligent bearing fault diagnosis through a weight transfer conditional countermeasure network and weighted countermeasure learning network. These methods can effectively identify unknown fault categories and improve the recognition rate of known categories. In this paper, our approach builds on the aforementioned OSDA methods and incorporates a novel multi-classifier module with a novel weighting scheme to automatically establish boundaries between known and unknown target samples, achieving their alignment.

3. Method

3.1. Problem Definition

The domain adaptation problem is formally defined using the open set hypothesis. Let the training data be

n_{s}

labeled samples from the dataset

D_{s} = {(x_{i}^{s}, y_{i}^{s})}_{i = 1}^{n_{s}}

, which fits with the distribution of source domain

P_{s}

and the label space

Y_{s}

. The test data should similarly include

n_{t}

unlabeled samples from the target domain distribution

P_{t}

and the label space

Y_{t}

, which correspond to the dataset

D_{t} = {(x_{j}^{t}, y_{j}^{t})}_{j = 1}^{n_{t}}

, where

P_{s} \neq P_{t}

and

Y_{s} \subseteq Y_{t}

. The shared classes are defined by the intersection of

Y_{s}

and

Y_{t}

, and the private labels in the target domain should be regarded as “unknown”.

In order to accomplish cross-domain alignment of shared categories and discover unknown fault categories in the target domain, an open set domain adaptive adversarial network model has to be established.

3.2. Open Set Adversarial Domain Adaptation Framework

The method proposed in this paper, along with its deep neural network architecture consists of four key modules, as shown in Figure 2. These modules, denoted as

G_{a e}

,

G_{d e}

,

G_{c}

, and

G_{c u}

, serve as a feature extractor, a known-class classifier, and an extended classifier, respectively. The feature extractor incorporates deep convolutional autoencoders (

G_{a e}

and

G_{d e}

) to autonomously learn enriched and refined representations from raw inputs of both domains. To mitigate the classification loss emanating from the source domain, a known-class classifier

G_{c}

is used. It outputs the probability value for the K-th dimension. Furthermore, an extended classifier

G_{c u}

, encompassing both known and unknown categories, is introduced. Compared with the known-class classifier

G_{c}

, the dimension of the output from

G_{c u}

increases from K to

K + 1

, where the

K + 1

dimension outputs the similarity weight that shows whether the sample is an unknown class.Detailed parameters of the model are shown in Table 1.

3.3. Deep Autoencoders and Reconstruction Error

The studies in [18,27] indicated that autoencoders can serve as substitutes for neural network structures in discrimination tasks. Motivated by this, we introduce a deep convolutional autoencoder as a replacement for a convolutional neural network in the context of adversarial learning for domain adaptation, thereby enhancing feature learning. It consists of two components, an encoder

G_{a e}

and a decoder

G_{d e}

.

G_{a e}

projects the input bearing data into an encoding space while preserving crucial characteristics of the bearing signals.

G_{d e}

re-projects the encoded data in the latent space back to the input space, generating reconstructed bearing data for comparison with the original data, to minimize reconstruction error loss.

The deep convolutional autoencoder utilizes unsupervised learning to reconstruct data, aiming to encapsulate the data distributions of both domains. This strategy uncovers intrinsic feature variations and pursues a common latent distribution. The optimization process is geared towards making the reconstructed data closely resemble the original input by fine-tuning the model parameters through a defined loss function.

For the calculation of the reconstruction loss, we employ a scale-invariant mean squared error term [28], which is applied to data from both the target and source domains.

\begin{matrix} L_{recon} = \sum_{i = 1}^{N_{s}} L_{s i_{mse}} (x_{i}^{s}, {\hat{x}}_{i}^{s}) + \sum_{i = 1}^{N_{t}} L_{{si}_{mse}} (x_{i}^{t}, {\hat{x}}_{i}^{t}) \\ L_{s i_{mse}} (x, \hat{x}) = \frac{1}{k} {∥ x - \hat{x} ∥}_{2}^{2} - \frac{1}{k^{2}} {([x - \hat{x}])}^{2} \end{matrix}

(1)

Here, k represents the dimension of the sample data,

\hat{x}

represents the reconstructed data from the model, and

∥ x - \hat{x} ∥_{2}^{2}

refers to the L2 squared norm. It is important to note that predictions that are accurate up to a scaling term are penalized by the mean squared error (MSE) loss, which is commonly used in reconstruction tasks. In contrast, the scale-invariant mean squared error penalizes discrepancies between frequency components, enabling the model to replicate the general shape of the represented object more effectively, thereby reducing reconstruction errors and approximating the original input data as closely as possible.

3.4. Target Domain Sample Weight Calculation

Training the model on the source domain is a crucial step to enable the classification of known faults. In this paper, to learn the feature distribution of source domain samples, we train the extended classifier

G_{c} u

in a supervised manner using conventional cross-entropy loss to optimize the classification of source domain data.

L_{G_{c u}} (θ_{a e}, θ_{c u}) = \frac{1}{n_{s}} \sum_{i = 1}^{n_{s}} L_{C E} (G_{c u} (G_{a e} (x_{i}^{s})), y_{i}^{s})

(2)

Here,

θ_{a e}

and

θ_{c u}

represent the parameters of the

G_{a e}

and the

G_{c u}

, and

n_{s}

symbolizes the quantity of samples in the source domain. The common cross-entropy loss function used to reduce the error is denoted by

L_{G_{c u}}

.

G_{c u}

and

y_{i}^{s}

represent the labels corresponding to the samples in the source domain

x_{i}^{s}

.

Since the learning parameters of the known class classifier

G_{c}

are only trained on source samples, target samples with distributions differing from the source will have lower similarity values or uncertain predictions. The

G_{c}

classifier utilizes a multiclass one-vs-rest binary loss to compute the loss. The formula is as follows:

\begin{matrix} \begin{matrix} L_{G_{c}} = - \frac{1}{n_{s}} \sum_{i = 1}^{n_{s}} \sum_{K = 1}^{|c_{s}|} y_{i, K}^{s} log G_{c}^{K} (G_{a e} (x_{i}^{s})) \\ + (1 - y_{i, K}^{s}) log (1 - G_{c}^{K} (G_{a e} (x_{i}^{s}))) \end{matrix} \end{matrix}

(3)

where

y_{i, K}^{s}

represents the one-hot encoded label of the source domain sample.

To reduce the risks of misclassifying unknown fault categories in the target domain as known categories, we have devised a weighting method

w_{1}

for the input sample

x_{t}

which is based on its similarity to the source domain’s known shared labels. As shown in Equation (4) below,

w_{1}

calculates the sum of the k-dimensional probabilities for the known class, leveraging

G_{c}

to utilize the label information from the known classes in the source domain. This approach effectively minimizes the influence of samples from unknown classes in the target domain, while aligning the samples of known classes between the two domains.

w_{1} (x_{t}) = \sum_{K = 1}^{|C_{s}|} G_{c}^{K} (G_{a e} (x_{t}))

(4)

where

G_{c}^{K} (G_{a e} (x_{t}))

represents the probability that the sample belongs to the Kth known category, and the sum represents the similarity weight between the target sample and the known class in the source domain. For target domain samples similar to the source domain samples, the softmax element output and

w_{1} (x_{t})

will be elevated or nearly one, while samples that are different from the source domain will have an output close to zero. Hence, we can utilize the output values of

w_{1} (x_{t})

to quantify the transferability of samples between the target and source domains.

However, since the definition of this similarity measure is trained only on the source domain class labels, the

w_{1} (x_{t})

values for target samples may be uncertain. Therefore, it is necessary to further understand the intrinsic characteristics of target domain samples.

We investigate the distribution changes of unknown class samples in the target domain based on the similarity measure of known class samples labeled in the source domain. This is done through the utilization of the

(K + 1)

-th dimensional unknown class probability within the extended classifier

G_{c u}

, thereby quantifying the similarity weight

w_{2} (x_{t})

for target samples that are part of the shared space.

When the target sample is a member of the shared label set, the value of

P (y = N + 1 | x_{t})

is relatively low, thus requiring a higher weight

w_{2} (x_{t})

to be assigned to this sample. This is represented by Equation (5).

w_{2} (x_{t}) = 1 - P (y = N + 1 ∣ x_{t})

(5)

In the domain adversarial model, the similarity between known and target K class samples from the source domain is determined using the known class classifier

G_{c}

. For target domain samples, their affiliation with shared categories is determined based on the unknown class decisions made by the extended classifier

G_{c u}

. Ultimately, by analyzing the foundational domain information, weights are assigned to target domain samples. The final similarity measure is then defined by combining the outputs of the known class classifier

G_{c}

and the extended classifier

G_{c u}

.

w (x_{t}) = (w_{2} (x_{t})) (w_{1} (x_{t}))

(6)

By combining the similarity weights of

w_{1}

and

w_{2}

, the similarity weight values assigned to each sample in the target domain are measured.

w (x_{t})

is regarded as a comprehensive measure of the possibility that the target domain sample belongs to the shared label space. To further validate this conclusion, Section 4.3 provides visual evidence of this theory through the display of target domain sample weights.

However, in open set domain adaptation problems, if the private label data of the target domain are ignored, this could lead to erroneous matching of unknown class samples in the target domain with labeled samples in the source domain, resulting in negative transfer issues during cross-domain alignment.

To address this issue, similarity weight values,

w (x_{t})

, are computed using multiple classifiers to calculate the similarity between samples from the source and target domains. These similarity weights are then added to each corresponding target domain sample, reducing the domain classification loss error and thereby avoiding negative transfer issues.

3.5. Establishment of Unknown Boundary

To better align known classes in the target domain with the source domain, while keeping unknown classes away from the source domain, we employ binary cross-entropy loss in Equation (7) to learn an accurate hyperplane. The bounds of the target domain’s unknown class samples are defined by this hyperplane.

Unlike the approach used in OSBP [9], which sets a boundary threshold T to distinguish between shared labels and private labels for constructing an unknown class boundary, this study replaces the fixed threshold for target domain samples with dynamic similarity weights

w (x_{j}^{t})

. This is represented as follows:

\begin{matrix} L_{G_{c u}}^{'} = - \frac{1}{n_{t}} \sum_{j = 1}^{n_{t}} w (x_{j}^{t}) log (P (y = N + 1 ∣ x_{j}^{t})) \\ - \frac{1}{n_{t}} \sum_{j = 1}^{n_{t}} (1 - w (x_{j}^{t})) log (1 - P (y = N + 1 ∣ x_{j}^{t})) \end{matrix}

(7)

Based on the aforementioned derivations, we introduce the adversarial domain adaptation model. The following is an overview of this approach’s main goals:

θ_{G c u} = arg min_{θ_{G c u}} L_{G c u} + L_{G c u}^{'}

(8)

θ_{G a e} = arg min_{θ_{G a e}} L_{G c u} - L_{G c u}^{'}

(9)

θ_{G c} = arg min_{θ_{G c}} L_{G c}

(10)

The above equation reflects the minimax game between the deep autoencoder

G_{a e}

and the extended classifier

G_{c u}

. It adversarially fools the extended classifier

G_{c u}

through the deep autoencoder

G_{a e}

and reduces the difference between the source domain and the target domain by learning transferable features.

Based on the characteristics of binary cross-entropy, the extended classifier

G_{c u}

attempts to set an unknown class probability

P (y = N + 1 | x_{t})

equal to

w (x_{j}^{t})

, thus minimizing the value of

L_{G_{c u}}^{'}

. Conversely, the deep autoencoder

G_{a e}

strives to make

P (y = N + 1 | x_{t})

different from

w (x_{j}^{t})

, in order to maximize the value of

L_{G_{c u}}^{'}

. The extended classifier

G_{c u}

’s unknown class probability

P (y = N + 1 | x_{t})

can be changed by the deep autoencoder

G_{a e}

during training. When the weight

w (x_{j}^{t})

is relatively high, the sample

x_{j}^{t}

aligns with the known categories, and as a result, the value of

P (y = N + 1 | x_{t})

should be lower. However, the extended classifier

G_{c u}

attempts to minimize

L_{G_{c u}}^{'}

by increasing the value of

P (y = N + 1 | x_{t})

to match the weight

w (x_{j}^{t})

, which in turn leads to the sample

x_{j}^{t}

moving away from the source domain. Conversely, the objective of the deep autoencoder

G_{a e}

is to maximize

L_{G_{c u}}^{'}

, which leads to an increase in the distance between

P (y = N + 1 | x_{t})

and

w (x_{j}^{t})

. Consequently, this decreases the value of

P (y = N + 1 | x_{t})

, bringing the sample

x_{j}^{t}

closer to those in the source domain. This configuration results in an adversarial relationship between them. Through such adversarial training,

G_{c u}

attempts to confuse the target domain’s shared and unknown classes, making them difficult to distinguish. Meanwhile,

G_{a e}

strives to generate distinctive features that facilitate separating known classes from unknown ones in the target samples.

All training loss functions are optimized using an end-to-end approach [29] and reversing the gradient sign during backpropagation. Ultimately, the open set domain adaptive problem is well handled, whereby the unknown samples in the target domain can be accurately identified, reducing the cross-domain differences and promoting positive migration. The pseudocode for this method is provided in Algorithm 1.

Algorithm 1: Proposed Methodology

Input: Source domain data

X_{s}

, target domain data

X_{t}

, and source domain labels

Y_{s}

, initialize network parameters, number of epochs: N, number of batch size: B.

Output: the target domain sample label

Y_{t}

.

1. For epoch=1 to N do

2. For i=1 to B do

3. Feed the source and target dataset into feature extractor

G_{a e}

to obtain

common features:

G_{a e} (x_{i}^{s})

,

G_{a e} (x_{i}^{t})

.

4. Feed the common features of the source and target dataset into known-class

classifier

G_{c}

and extended classifier

G_{c u}

simultaneously.

5. For

G_{c}

, compute the multiclass one-vs-rest binary loss for the source

domain with (3) and similarity measure

w_{1} (x_{t})

with (4).

6. For

G_{c u}

, compute the cross entropy with (2) and similarity measure

w_{2} (x_{t})

with (5).

7. Compute the final similarity measure

w (x_{t})

with (6).

8. Calculate the loss of

G_{c u}

with (7). Backpropagation: Using stochastic

gradient descent method to optimize the parameters of the model

from (8) to (10)

9. End

10. End

4. Experimental Results and Discussion

4.1. Dataset Description

To verify the effectiveness of our proposed method, we conducted extensive evaluation experiments using three different types of bearing datasets namely, the Case Western Reserve University (CWRU) dataset, the Southeast University (SEU) dataset, and the Padburn (PU) dataset. For the three datasets used in this paper, a vibration signal of length 4098 was first selected from the raw signal, and 4098 Fourier coefficients were produced by performing an FFT. Since the coefficients were symmetric, only 2049 coefficients were considered. The different tasks set by these datasets have specific labels and interpretations, and the details are shown in Table 2, Table 3, Table 4 and Table 5.

CWRU Bearing Dataset: There were four distinct circumstances represented in the vibration signals obtained from the motor’s driving end on the test rig: (1) healthy state (H); (2) inner-race fault (IF); (3) outer-race fault (OF); and (4) ball fault (BF).There were three distinct severity levels for the IF, OF, and BF cases (0.007 inches, 0.014 inches, and 0.021 inches) corresponding to the vibration signals. The sampling frequency was 12 kHz, and the signals were collected under four different working conditions of motor load and speed: load 0, load 1, load 2, and load 3 respectively. More details of the CWRU bearing data can be viewed from [30]. The configuration of the CWRU dataset is shown in Table 2.

SEU Bearing Dataset: This dataset is provided by Southeast University. The dataset contents and further details can be found in [31]. The gear dataset and the bearing dataset are two of its sub-datasets. Vibration signals were collected at 12 kHz under two different operating conditions with the velocity load configuration (RS-LC, respectively) set to 20 Hz-0 V and 30 Hz-2 V. Under each operating condition, the bearing had four states: H, IF, OF, and BF. There are eight lines of vibration signals in each file, but only the second line was included in this study.

PU Bearing Dataset: The PU dataset [32] is provided by the University of Padbourne and consists of 32 sets of current signals and vibration signals. The bearing signals are categorized as follows: (1) 6 undamaged bearings; (2) 12 artificially damaged bearings; and (3) 14 bearings that were actually damaged due to accelerated life testing. In this section, the main bearing signals used were from the undamaged healthy state (H) and artificially damaged bearings, which included outer race faults (IF) and inner race faults (OF). Therefore, the dataset included three fault types: H, IF, and OF. Each dataset was collected in four distinct operating situations at a sampling frequency of 64 kHz. Table 4 provides a detailed overview of the distribution, while the specific experimental settings are presented in Table 5.

4.2. Experimental Result Analysis

The test accuracy of our method on the different open set domain adaptation tasks from the different datasets is presented in Table 6. In this table, the target domain classification accuracy for shared classes is denoted by K, recognition accuracy for unknown classes is represented by U, and total accuracy is denoted by

F 1

.

As demonstrated, the low results for unknown class recognition in task C5 were an exception. In the other tasks, both the recognition accuracy for unknown classes and the categorization accuracy for known classes surpassed 96.7%, indicating a high testing accuracy. While the CWRU dataset exhibited a simpler distribution compared to the SEU and PU datasets, the overall experimental results of the latter two datasets also achieved over 90% accuracy, except for the individual tasks. Overall, the experimental findings across all three datasets affirmed that our proposed method, when employed within the open set domain adaptation framework, could achieve accurate classification of known class data and had robust recognition capability for unknown class data.

4.3. Feature Visualization Result Analysis

To visually validate the performance of the proposed approach in domain adaptation under the open set assumption, we utilized t-SNE visualization. Here, U represents unknown fault samples. We obtained the t-SNE visualization results shown in (a), (b), and (c) of Figure 3, Figure 4 and Figure 5 by transforming the time-domain data into frequency-domain data using the FFT method. Additionally, the t-SNE visualization results shown in (d), (e), and (f) of Figure 3, Figure 4 and Figure 5 represent the high-level features learned by the feature extractor module from the source and target domain data. Three representative sets were selected from each dataset for visualization. These results demonstrated the visualization comparison before and after domain adaptation based on the open set assumption for tasks C0, C4, and C8 from the CWRU dataset; S0, S3, and S5 from the SEU dataset; and P0, P1, and P3 from the PU dataset in the source and target domains.

Taking the C0 task from the CWRU dataset as a detailed example. The visualization results from Figure 3a revealed that before domain adaptation alignment, the shared class samples across the source and target domains were dispersed and clearly did not belong to the same fault category. However, following domain adaptive alignment, as depicted in Figure 3d, the shared class samples across the source and target domains became closer together and fell under the same classification. Meanwhile, the unknown categories were excluded, indicating alignment between the cross-domain samples and enabling the identification of unknown fault samples in the target domain. This illustrates an effective enhancement in cross-domain diagnostic performance under source supervision.

Similar visualization results based on the open set assumption could be observed in the other tasks as well. These visualization results demonstrated that the methodological framework of this paper could align class-level features across domains. It effectively projected the shared health states in the target and source domain samples to the same region in the high-level representation space and isolated the anomaly categories of the target domain in separate regions.

This suggests that our proposed approach could successfully learn the information from the source domain and apply it to the target domain, and it could also accurately identify unknown target categories to prevent negative transfer.

We also depict the weights learned from the known and undiscovered fault categories in the target domain for the various datasets to further support the efficacy of our suggested weight-based technique. Figure 6 illustrates the corresponding weight values of target domain samples for tasks C0, C4, and C8 in the CWRU dataset. Figure 7 shows the corresponding weight values of target domain samples for tasks S0, S3, and S5 in the SEU dataset, and Figure 8 shows the corresponding weight values of target domain samples for tasks P0, P1, and P3 in the PU dataset. If the samples between the target and source domains have the same health status labels, they will have a higher weight. On the other hand, for target outlying classes, i.e., unknown fault classes, lower weights are assigned. For example, for Task C8, the target domain labels included normal bearings (H), outer race faults with 0.007 inches diameter damage (OF7), outer race faults with 0.014 inches diameter damage (OF14), and outer race faults with 0.021 inches diameter damage (OF21). The weights learned by the corresponding samples fluctuated above 0.4, indicating that they had obtained high-level weight values, thus had high similarity with the source domain’s fault categories, and were more focused on the learning process of domain adaptive alignment of shared category samples. The learned weights roughly fluctuated below 0.2, with low weight values for unknown fault samples (U) in the target domain. They did not correspond with shared category samples and were therefore less frequently taken into account in the cross-domain distributions. The target domain samples’ weight visualization findings demonstrated comparable learning in other tasks. The visualization of the weight maps verified the effectiveness of similarity weights, which combine known and unknown classes, as a weighting method in our proposed domain adaptive process of bearing fault diagnosis.

4.4. Experimental Comparisons

We also conducted comparative experiments using different task settings of the CWRU, SEU, and PU datasets against other classical methods such as DANN [16], CORAL [33], OSBP [9], improved OSBP [34], and STA [22]. The results are shown in Figure 9, Figure 10 and Figure 11, comparing the comprehensive performance

F 1_{s c o r e}

of the representative domain adversarial methods and the proposed method. In summary, our proposed strategy outperformed the competing methods in the related tasks.

4.5. Future Work

In practical applications, partial domain adaptation is also common and useful, where the source domain categories are more numerous than the target domain categories. To better adapt to various scenarios, there is an urgent need to establish a more generalized and unified domain adaptation framework suitable for different working conditions.

The methodology of this paper investigated single-source domain adaptation, where the similarity between the source and target domains is usually high, leading to significant achievements in cross-domain diagnosis. However, real-world fault diagnosis scenarios face substantial variations in machine operating conditions, resulting in significant differences between the two domains under different working conditions. Relying on single source domain data may not be sufficient for accurate diagnosis under diverse working conditions. In industrial settings, we often have access to labeled data from multiple operating conditions. Therefore, leveraging diagnostic knowledge from various working conditions can better address fault diagnosis challenges in extreme working conditions. This approach involving multiple working conditions provides a more comprehensive and flexible adaptability, promising to enhance the robustness of the model in practical industrial applications.

5. Conclusions

In the existing application field of mechanical industry, the open world hypothesis is an unavoidable real-world approach to accurately classifying and identifying bearing faults under different working conditions and fault labels.The two aforementioned problems were addressed by this study’s investigation of an open-world domain adaptive intelligent fault detection approach for rolling bearings. We introduced a domain adversarial model to construct the network architecture and employed a deep convolutional autoencoder to extract domain-invariant features. We successfully aligned the target and source domains, while limiting the influence of unknown classes by using the similarity between target domain samples and known classes to contribute shared weights to the discriminator. The suggested approach was the subject of numerous sets of experiments, and its validity was thoroughly examined.

Author Contributions

Data curation, B.Z., F.L. and N.M.; methodology, B.Z., F.L. and W.J.; software B.Z. and F.L.; formal analysis, N.M., W.J. and S.-K.N.; writing—original draft, B.Z. and F.L.; writing—review and editing, B.Z., F.L., W.J. and S.-K.N. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by “the Fundamental Research Funds for the Central Universities” 2019ZDPY08. (Corresponding author: Wen Ji).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Cao, H.; Shao, H.; Zhong, X.; Deng, Q.; Yang, X.; Xuan, J. Unsupervised domain-share CNN for machine fault transfer diagnosis from steady speeds to time-varying speeds. J. Manuf. Syst. 2022, 62, 186–198. [Google Scholar] [CrossRef]
Wen, L.; Li, X.; Gao, L.; Zhang, Y. A new convolutional neural network-based data-driven fault diagnosis method. IEEE Trans. Ind. Electron. 2017, 65, 5990–5998. [Google Scholar] [CrossRef]
Sohaib, M.; Kim, J.M. Reliable fault diagnosis of rotary machine bearings using a stacked sparse autoencoder-based deep neural network. Shock Vib. 2018, 2018, 2919637. [Google Scholar] [CrossRef]
Li, T.; Zhao, Z.; Sun, C.; Yan, R.; Chen, X. Multireceptive field graph convolutional networks for machine fault diagnosis. IEEE Trans. Ind. Electron. 2020, 68, 12739–12749. [Google Scholar] [CrossRef]
Ince, T.; Kiranyaz, S.; Eren, L.; Askar, M.; Gabbouj, M. Real-time motor fault detection by 1-D convolutional neural networks. IEEE Trans. Ind. Electron. 2016, 63, 7067–7075. [Google Scholar] [CrossRef]
Mao, W.; Feng, W.; Liu, Y.; Zhang, D.; Liang, X. A new deep auto-encoder method with fusing discriminant information for bearing fault diagnosis. Mech. Syst. Signal Process. 2021, 150, 107233. [Google Scholar] [CrossRef]
He, Z.; Shao, H.; Zhong, X.; Zhao, X. Ensemble transfer CNNs driven by multi-channel signals for fault diagnosis of rotating machinery cross working conditions. Knowl.-Based Syst. 2020, 207, 106396. [Google Scholar] [CrossRef]
Panareda Busto, P.; Gall, J. Open set domain adaptation. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 754–763. [Google Scholar]
Saito, K.; Yamamoto, S.; Ushiku, Y.; Harada, T. Open set domain adaptation by backpropagation. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 153–168. [Google Scholar]
Tzeng, E.; Hoffman, J.; Zhang, N.; Saenko, K.; Darrell, T. Deep domain confusion: Maximizing for domain invariance. arXiv 2014, arXiv:1412.3474. [Google Scholar]
Long, M.; Cao, Y.; Wang, J.; Jordan, M. Learning transferable features with deep adaptation networks. In Proceedings of the International Conference on Machine Learning, Lille, France, 7–9 July 2015; pp. 97–105. [Google Scholar]
Long, M.; Zhu, H.; Wang, J.; Jordan, M.I. Unsupervised domain adaptation with residual transfer networks. Adv. Neural Inf. Process. Syst. 2016, 29. [Google Scholar] [CrossRef]
Long, M.; Zhu, H.; Wang, J.; Jordan, M. Deep transfer learning with joint adaptation networks. In Proceedings of the International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; pp. 2208–2217. [Google Scholar]
Tzeng, E.; Hoffman, J.; Saenko, K.; Darrell, T. Adversarial discriminative domain adaptation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 7167–7176. [Google Scholar]
Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial networks. Commun. ACM 2020, 63, 139–144. [Google Scholar] [CrossRef]
Ganin, Y.; Ustinova, E.; Ajakan, H.; Germain, P.; Larochelle, H.; Laviolette, F.; Marchand, M.; Lempitsky, V. Domain-adversarial training of neural networks. J. Mach. Learn. Res. 2016, 17, 2030–2096. [Google Scholar]
Long, M.; Cao, Z.; Wang, J.; Jordan, M.I. Conditional adversarial domain adaptation. Adv. Neural Inf. Process. Syst. 2018, 31. [Google Scholar] [CrossRef]
Cao, Z.; Long, M.; Wang, J.; Jordan, M.I. Partial transfer learning with selective adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 2724–2732. [Google Scholar]
Kuang, J.; Xu, G.; Tao, T.; Wu, Q.; Han, C.; Wei, F. Dual-weight consistency-induced partial domain adaptation network for intelligent fault diagnosis of machinery. IEEE Trans. Instrum. Meas. 2022, 71, 1–12. [Google Scholar] [CrossRef]
Wang, Y.; Liu, Y.; Chow, T.W.; Gu, J.; Zhang, M. A Balanced Adversarial Domain Adaptation Method for Partial Transfer Intelligent Fault Diagnosis. IEEE Trans. Instrum. Meas. 2022, 71, 1–11. [Google Scholar] [CrossRef]
Huang, Q.; Wen, R.; Han, Y.; Li, C.; Zhang, Y. Intelligent Fault Identification for Industrial Internet of Things via Prototype Guided Partial Domain Adaptation with Momentum Weight. IEEE Internet Things J. 2023, 10, 16381–16391. [Google Scholar] [CrossRef]
Liu, H.; Cao, Z.; Long, M.; Wang, J.; Yang, Q. Separate to adapt: Open set domain adaptation via progressive separation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 2927–2936. [Google Scholar]
Zhang, W.; Li, X.; Ma, H.; Luo, Z.; Li, X. Open-set domain adaptation in machinery fault diagnostics using instance-level weighted adversarial learning. IEEE Trans. Ind. Inform. 2021, 17, 7445–7455. [Google Scholar] [CrossRef]
Zhao, C.; Shen, W. Dual adversarial network for cross-domain open set fault diagnosis. Reliab. Eng. Syst. Saf. 2022, 221, 108358. [Google Scholar] [CrossRef]
Guo, Q.; Li, J.; Zhou, F.; Li, G.; Lin, J. An open-set fault diagnosis framework for MMCs based on optimized temporal convolutional network. Appl. Soft Comput. 2023, 133, 109959. [Google Scholar] [CrossRef]
Zhu, Z.; Chen, G.; Tang, G. Domain Adaptation with Multi-adversarial Learning for Open Set Cross-domain Intelligent Bearing Fault Diagnosis. IEEE Trans. Instrum. Meas. 2023, 72, 3533411. [Google Scholar] [CrossRef]
Zhao, J.; Mathieu, M.; LeCun, Y. Energy-based generative adversarial network. arXiv 2016, arXiv:1609.03126. [Google Scholar]
Bousmalis, K.; Trigeorgis, G.; Silberman, N.; Krishnan, D.; Erhan, D. Domain separation networks. Adv. Neural Inf. Process. Syst. 2016, 29, 343–351. [Google Scholar]
Zhu, J.; Huang, C.G.; Shen, C.; Shen, Y. Cross-Domain Open-Set Machinery Fault Diagnosis Based on Adversarial Network With Multiple Auxiliary Classifiers. IEEE Trans. Ind. Inform. 2022, 18, 8077–8086. [Google Scholar] [CrossRef]
Smith, W.A.; Randall, R.B. Rolling element bearing diagnostics using the Case Western Reserve University data: A benchmark study. Mech. Syst. Signal Process. 2015, 64, 100–131. [Google Scholar] [CrossRef]
Shao, S.; McAleer, S.; Yan, R.; Baldi, P. Highly accurate machine fault diagnosis using deep transfer learning. IEEE Trans. Ind. Inform. 2018, 15, 2446–2455. [Google Scholar] [CrossRef]
Lessmeier, C.; Kimotho, J.K.; Zimmer, D.; Sextro, W. Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors: A benchmark data set for data-driven classification. In Proceedings of the PHM Society European Conference, Bilbao, Spain, 5–8 July 2016; Volume 3. [Google Scholar]
Sun, B.; Saenko, K. Deep coral: Correlation alignment for deep domain adaptation. In Proceedings of the Computer Vision–ECCV 2016 Workshops: Amsterdam, The Netherlands, 8–16 October 2016; Proceedings, Part III 14; Springer: Berlin/Heidelberg, Germany, 2016; pp. 443–450. [Google Scholar]
Fu, J.; Wu, X.; Zhang, S.; Yan, J. Improved open set domain adaptation with backpropagation. In Proceedings of the 2019 IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan, 22–25 September 2019; pp. 2506–2510. [Google Scholar]

Figure 1. The difference between closed sets and open sets in domain adaptation.

Figure 2. An Open Set Domain Adaptation Model Framework Based on Adversarial Learning.

Figure 3. Visualization results of different tasks in CWRU dataset before and after domain adaptation. (a–c) represent the effect of data after FFT. (d–f) denote the effect of data adaptive by the method in this paper.

Figure 4. Visualization results of different tasks in SEU dataset before and after domain adaptation. (a–c) represent the effect of data after FFT. (d–f) denote the effect of data adaptive by the method in this paper.

Figure 5. Visualization results of different tasks in PU dataset before and after domain adaptation. (a–c) represent the effect of data after FFT. (d–f) denote the effect of data adaptive by the method in this paper.

Figure 6. The average value and deviation of the corresponding weight of each fault type of the target sample under different tasks of the CWRU dataset. Red indicates unknown class and blue indicates known class.

Figure 7. The average value and deviation of the corresponding weight of each fault type of the target sample under different tasks of the SEU dataset. Red indicates unknown class and blue indicates known class.

Figure 8. The average value and deviation of the corresponding weight of each fault type of the target sample under different tasks of the PU dataset. Red indicates unknown class and blue indicates known class.

Figure 9. Accuracy tests using various techniques on the CWRU dataset for open set domain adaption tasks.

Figure 10. Accuracy tests using various techniques on the SEU dataset for open set domain adaption tasks.

Figure 11. Accuracy tests using various techniques on the PU dataset for open set domain adaption tasks.

Table 1. Detailed parameters of the model.

Module Name	Block Name	Layer Type	In/Out Channel	Kernel Size/Stride	Activation Function
Encoder	Conv1	Convolutional	1/16	64/16	ReLU
		BatchNorm	16	/	/
		Max Pooling	/	1/1	/
	Conv2	Convolutional	16/32	3/2	ReLU
		BatchNorm	32	/	/
		Max Pooling	/	1/1	/
	Conv3	Convolutional	32/64	3/2	ReLU
		BatchNorm	64	/	/
		Max Pooling	/	1/1	/
	Conv4	Convolutional	64/64	3/2	ReLU
		BatchNorm	64	/	/
		Max Pooling	/	1/1	/
	Dense1	Linear	896/100	/	LeakyReLU
	Dense1	BatchNorm	100	/	/
	Dense2	Linear	100/100	/	LeakyReLU
	Dense2	BatchNorm	100	/	/
Decoder	/	Linear	100/512	/	ReLU
	/	Linear	512/1024	/	ReLU
	/	Linear	1024/2049	/	Sigmoid
Known-class classifier	/	Linear	100/64	/	ReLU
	/	Dropout	/	/	/
	/	Linear	64/num classes-1	/	LeakySoftmax
extended classifier	/	Linear	100/64	/	ReLU
	/	Dropout	/	/	/
	/	Linear	64/num classes	/	LeakySoftmax

Table 2. CWRU information and particular job settings.

Dataset	Class Label	0	1	2	3	4	5	6	7	8	9
CWRU	Fault Type	H	IF	IF	IF	BF	BF	BF	OF	OF	OF
	Fault Size (Inches)	0	7	14	21	7	14	21	7	14	21
	Load (hp)	0, 1, 2, 3
Task	Source domain	target domain			source category			target category
C0	0 hp/1797 rpm	1 hp/1772 rpm			0, 1,2,3,4,5,6,7,8			0,1,2,3,4,5,6,7,8,9
C1	0 hp/1797 rpm	1 hp/1772 rpm			0,1,2,3,7,8,9			0,1,2,3,4,5,6,7,8,9
C2	0 hp/1797 rpm	1 hp/1772 rpm			0,3,4,5,7,8			0,1,2,3,4,5,6,7,8,9
C3	0 hp/1797 rpm	1 hp/1772 rpm			0,1,2,7,8,9			0,1,2,3,4,5,6,7,8,9
C4	0 hp/1797 rpm	1 hp/1772 rpm			0,1,6,7			0,1,2,3,4,5,6,7,8,9
C5	0 hp/1797 rpm	1 hp/1772 rpm			0,4,5			0,1,2,3,4,5,6,7,8,9
C6	1 hp/1772 rpm	0 hp/1797 rpm			0,1,4,5,6,7,8,9			0,1,2,3,4,5,6,7,8,9
C7	1 hp/1772 rpm	0 hp/1797 rpm			0,1,2,3,6,7			0,1,2,3,4,5,6,7,8,9
C8	1 hp/1772 rpm	0 hp/1797 rpm			0,7,8,9			0,1,2,3,4,5,6,7,8,9

Table 3. SEU information and particular job settings.

Dataset	ClassLabel	0	1	2	3
SEU	Fault Type	H	IF	OF	BF
	RS-LC		20 Hz-0 V	30 Hz-2 V
Task	Source domain	Target domain		Source category	Target category
S0	20 Hz-0 V	30 Hz-2 V		0,1,2	0,1,2,3
S1	20 Hz-0 V	30 Hz-2 V		0,1,3	0,1,2,3
S2	20 Hz-0 V	30 Hz-2 V		0,1	0,1,2,3
S3	20 Hz-0 V	30 Hz-2 V		0,1	0,1,2,3
S4	30 Hz-2 V	20 Hz-0 V		0,1,2	0,1,2,3
S5	30 Hz-2 V	20 Hz-0 V		0,2,3	0,1,2,3
S6	30 Hz-2 V	20 Hz-0 V		0,1	0,1,2,3
S7	30 Hz-2 V	20 Hz-0 V		0,2	0,1,2,3
S8	30 Hz-2 V	20 Hz-0 V		0,3	0,1,2,3

Table 4. PU dataset working condition information.

Condition Name	Speed (rmp)	Load (Nm)	Radial Force (N)
N15_M07_F10	1500	0.7	1000
N09_M07_F10	900	0.7	1000
N15_M01_F10	1500	0.1	1000
N15_M07_F04	1500	0.7	400

Table 5. PU information and particular job settings.

Dataset	ClassLabel	0	1	2
PU	Fault Type	H	IF	OF
	N-M-F	N15_M07_F10,N15_M07_F04
		N09_M07_F10,N15_M07_F10
Task	Source domain	Target domain	Source category	Target category
P0	N15_M07_F10	N15_M07_F04	0,1	0,1,2
P1	N09_M07_F10	N15_M07_F04	0,1	0,1,2
P2	N15_M01_F10	N15_M07_F10	0,1	0,1,2
P3	N15_M07_F04	N15_M07_F10	0,1	0,1,2
P4	N15_M07_F10	N15_M01_F10	0,2	0,1,2
P5	N09_M07_F10	N15_M07_F04	0,3	0,1,2
P6	N15_M01_F10	N15_M07_F10	0,2	0,1,2
P7	N15_M07_F04	N09_M07_F10	0,3	0,1,2

Table 6. Novelty detection performance of CWdatasets.

Task	K	U	F1-Score	Task	K	U	F1-Score	Task	K	U	F1-Score
C0	96.67	100	97.20	S0	93.33	100	95.13	P0	94.00	96.00	94.69
C1	99.43	99.67	99.58	S1	97.30	96.00	97.00	P1	100	100	100
C2	100	97.48	99.13	S2	100	97.17	98.53	P2	99.00	100	99.33
C3	100	100	100	S3	97.90	97.27	97.36	P3	98.00	95.00	97.00
C4	98.25	98.30	97.90	S4	97.30	100	98.00	P4	97.00	92.00	95.39
C5	100	69.00	76.00	S5	98.00	100	98.50	P5	99.00	91.00	96.33
C6	100	97.53	99.62	S6	99.50	100	97.76	P6	99.20	98.77	99.06
C7	100	100	100	S7	95.00	99.00	97.00	P7	90.50	92.00	91.13
C8	100	100	100	S8	97.50	81.50	89.80

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, B.; Li, F.; Ma, N.; Ji, W.; Ng, S.-K. Open Set Bearing Fault Diagnosis with Domain Adaptive Adversarial Network under Varying Conditions. Actuators 2024, 13, 121. https://doi.org/10.3390/act13040121

AMA Style

Zhang B, Li F, Ma N, Ji W, Ng S-K. Open Set Bearing Fault Diagnosis with Domain Adaptive Adversarial Network under Varying Conditions. Actuators. 2024; 13(4):121. https://doi.org/10.3390/act13040121

Chicago/Turabian Style

Zhang, Bo, Feixuan Li, Ning Ma, Wen Ji, and See-Kiong Ng. 2024. "Open Set Bearing Fault Diagnosis with Domain Adaptive Adversarial Network under Varying Conditions" Actuators 13, no. 4: 121. https://doi.org/10.3390/act13040121

APA Style

Zhang, B., Li, F., Ma, N., Ji, W., & Ng, S.-K. (2024). Open Set Bearing Fault Diagnosis with Domain Adaptive Adversarial Network under Varying Conditions. Actuators, 13(4), 121. https://doi.org/10.3390/act13040121

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Open Set Bearing Fault Diagnosis with Domain Adaptive Adversarial Network under Varying Conditions

Abstract

1. Introduction

2. Related Work

3. Method

3.1. Problem Definition

3.2. Open Set Adversarial Domain Adaptation Framework

3.3. Deep Autoencoders and Reconstruction Error

3.4. Target Domain Sample Weight Calculation

3.5. Establishment of Unknown Boundary

4. Experimental Results and Discussion

4.1. Dataset Description

4.2. Experimental Result Analysis

4.3. Feature Visualization Result Analysis

4.4. Experimental Comparisons

4.5. Future Work

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI