Next Article in Journal
Development of a 2 μm Solid-State Laser for Lidar in the Past Decade
Previous Article in Journal
Shot Boundary Detection with 3D Depthwise Convolutions and Visual Attention
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Convolutional Neural Network for Electrical Fault Recognition in Active Magnetic Bearing Systems

1
Department of Information Engineering, University of Florence, 50139 Florence, Italy
2
Department of Information Engineering, University of Pisa, 56122 Pisa, Italy
3
Department of Information Engineering and Mathematics, University of Siena, 53100 Siena, Italy
*
Author to whom correspondence should be addressed.
Sensors 2023, 23(16), 7023; https://doi.org/10.3390/s23167023
Submission received: 7 July 2023 / Revised: 27 July 2023 / Accepted: 3 August 2023 / Published: 8 August 2023
(This article belongs to the Section Fault Diagnosis & Sensors)

Abstract

:
Active magnetic bearings are complex mechatronic systems that consist of mechanical, electrical, and software parts, unlike classical rolling bearings. Given the complexity of this type of system, fault detection is a critical process. This paper presents a new and easy way to detect faults based on the use of a fault dictionary and machine learning. The dictionary was built starting from fault signatures consisting of images obtained from the signals available in the system. Subsequently, a convolutional neural network was trained to recognize such fault signature images. The objective of this study was to develop a fault dictionary and a classifier to recognize the most frequent soft electrical faults that affect position sensors and actuators. The proposed method permits, in a computationally convenient way that can be implemented in real time, the determination of which component has failed and what kind of failure has occurred. Therefore, this fault identification system allows determining which countermeasure to adopt in order to enhance the reliability of the system. The performance of this method was assessed by means of a case study concerning a real turbomachine supported by two active magnetic bearings for the oil and gas field. Seventeen fault classes were considered, and the neural network fault classifier reached an accuracy of 93% on the test dataset.

1. Introduction

Active magnetic bearings (AMBs) are being increasingly used across a broad range of rotodynamic applications, ranging from small turbomolecular pumps for medical applications to large compressors in the megawatt range for the oil and gas field. Compared with classic contact bearings, AMBs offer several significant advantages. AMBs achieve a significant reduction in friction and in the associated wear by levitating the rotor relative to the stator parts. The elimination of friction also enables higher rotation speeds and greater system efficiency and eliminates the need for cumbersome lubrication systems that are typically required for traditional bearings. Due to the inherently unstable nature of AMBs, a stabilizing feedback control is needed for proper functioning. Therefore, an AMB is a complex mechatronic system comprising various components, including the controller, position sensors, and actuators. The synthesis of the controller is a critical aspect of AMB systems. Various controller structures and design processes can be used, as reviewed, for example, in [1,2]. Among these, augmented PIDs are probably the most widespread choice in industry because of their versatility, accuracy, efficiency, and cost-effectiveness. All the controllers of AMB systems rely on precise measurements of the rotor position, for which a common choice is to use inductive or eddy current sensors, even if recently self-sensing sensors and optical sensors have also been introduced [3]. The effect of the choice of sensors is discussed, for example, in [4]. Regarding the other components, PWM is commonly used for driving the electromagnets that make the rotor levitate. The behavior of all the components in the loop contributes to determining the stiffness and damping of the bearings, which are directly related to the stability and performance of the closed-loop system. To improve the robustness of such a complex system, a fault detection and diagnostic system is advisable to ensure safe and reliable operation. As a result, several studies have developed methods to detect failures associated with the rotor or with the electrical and electronic components, as, for example, in [5,6]. Failure detection in AMB systems also enables the adoption of safety strategies exploiting their closed-loop architecture that can adapt to cope with the detected fault. In fact, due to their active nature, controllers can dynamically adjust the bearings’ behavior in real time. Exploiting the power of internal information processing, an AMB system can enhance its survival chances and reliability. For this reason, fault-tolerant AMB systems have been developed that can manage malfunctions, for example, using a reconfigurable control, as described in [6,7].
Usually, for fault diagnosis systems, the actual system behavior is compared with the expected one in nominal conditions to identify a faulty system condition. Observers of the system, as described in [5], are typically used.
Another common approach for identifying faults is the simulation-before-test technique. It involves constructing a fault dictionary from simulations of a particular plant, which collects examples of fault signatures that can be used to train a classifier capable of recognizing different faults, as described, for example, in [8].
Traditionally, fault diagnosis has been based on analyzing signals in the time and frequency domains. However, this paper proposes an approach that exploits a fault dictionary made by images of signals in the time domain to train a simple convolutional neural network that has the goal to recognize the AMB system’s faulty conditions. Taking the available electrical signals as sources, generalized orbits are built and converted into discrete 2D images that are used to fill the fault dictionary, with a technique that generalizes and extends the approach proposed by Xunshi, who used the sensors’ signals to build orbits to detect mechanical failures [9]. A similar method is proposed by Jing et al., who in their paper proposed a feature-based learning and fault diagnosis method for gearbox condition monitoring [10]. The fault features were obtained by using a simulation tool, developed by some of the authors [11], capable of automatically building the entire fault dictionary once the fault conditions are modeled. In the context of this work, only single electrical parametric faults have been considered for simplicity, but the fault classes can be extended to also include, e.g., mechanical faults, as in [12].
To exploit the knowledge stored in the fault dictionary, this paper proposes a classifier based on a convolutional neural network (CNN), well suited for image classification [13,14], trained with the fault signature examples. In the case of system fault, the trained convolutional neural network has the aim of identifying the faulty component and the type of fault that has occurred.
Compared with other methods proposed in the literature that rely on analyzing signals in the time domain, the proposed approach, losing the time dependence, offers a novel solution to detect and locate faults, making full use of the potential of smart systems like AMBs. Moreover, with the help of computationally efficient image processing (Adam optimization) and simple neural networks, this automatic online diagnostic system can be developed without excessive computational cost. Such systems could be exploited in advanced prognostic maintenance systems to enhance the balance of plant capabilities over time, as described in [15,16,17].
An alternative approach is to employ 1D CNNs, which often exhibit superior performance, as evidenced in [18]. Nevertheless, for this specific study, a traditional CNN architecture was chosen. The primary objective was to develop a tool that enables image interpretation by both humans and neural networks.
Moreover, the decision to opt for the classic CNN network was influenced by the fact that 1D CNNs treat signals independently until the last layer, partially neglecting the crucial association that characterizes MIMO systems like AMB systems. By using a traditional CNN, these associations are taken into account, making the model more suitable for the specific task at hand.
Another potential approach is the utilization of a Dislocated Time Series CNN (DTS-CNN), which has demonstrated superior performance in non-stationary conditions, as detailed in [19]. However, in the context of this study, only steady-state AMB systems are considered where nearly stationary conditions prevail. Since DTS-CNNs offer advantages in non-stationary settings, their benefits may not be fully realized in this steady-state context, making the classic CNN architecture a better choice for this study.
The paper structure is as follows: Section 2 describes the AMB modeling adopted to build a simulation tool. Section 3 presents the developed fault dictionary by which a classifier can be trained. Section 4 describes the proposed model of the image classifier. Section 5 presents a case study. Section 6 shows the case study results, and the conclusion follows.

2. System Modeling

2.1. AMB Background

An AMB, applied to a turbomachine, has the purpose of making the rotor levitate with respect to the stator. Normally, a turbomachine is equipped with at least five AMB control axes: four for the rotor radial dynamics (two per radial bearing) and one for the axial one. Only the radial dynamics of the system have been considered since radial rotor dynamics are normally more complex than axial ones and for this reason have been chosen as the object of this study. An AMB is composed of two opposed electromagnets that attract a ferromagnetic object, in this case the rotor, and try to maintain it in the center of the air gap. A radial bearing is composed of two AMB control axes that are able to attract the rotor in any direction in a plane. Figure 1 shows a classical cross section of a radial AMB bearing in a heteropolar configuration, as described in [2].
The two opposed electromagnets are driven with a differential control current. Considering them, along the axis z , a linear relation can be found [20]:
F z = k i i z k s Z
where F z is the magnetic force exerted on the rotor along the axis z ; Z is the displacement along the same axis with respect to the nominal working conditions; and the parameters k i and k s , respectively called electrical gain and negative stiffness, depend on the geometrical parameters of the bearings and on the nominal operating conditions, which are the nominal air gap s 0 and the bias current i 0 . Equation (1) is more accurate the closer the working conditions are to the nominal ones. Because of the negative stiffness k s , AMBs are inherently unstable. Hence, they are always inserted in a stabilizing closed-loop system. Figure 2 shows a block diagram of an AMB closed-loop system where the disturbances induced by each component are not included in the schematic. This system comprises amplifiers for driving the magnetic bearings, position sensors comprising the related conditioning electronics, and a controller. The position sensors constantly monitor the rotor position, while the controller utilizes these signals to calculate the necessary control signals for the actuators, which drive the magnetic bearings. The goal of the closed-loop system is to maintain the rotor in a levitated state at the center of the air gap by determining the appropriate currents i ¯ for each AMB, where i ¯ is the control currents’ vector.

2.2. AMB Closed-Loop System Modeling

A state-space formulation was developed for each component to model the dynamics of the closed-loop system. The rotor radial state-space model was specifically constructed using a finite element method (FEM). This involved discretizing the rotor into N nodes along its geometric rotational axis. Timoshenko modeling was utilized for the shafts, with a detailed description of the formulation available in [21,22]. The resulting equation that described the rotor dynamics is the following:
M q ¯ ¨ + C + Ω C g q ¯ ˙ + K K s q ¯ = F ¯ A M B + F ¯ e x t F ¯ A M B = K i i ¯
where M is the mass matrix, C is the damping matrix, C g is the gyroscopic matrix, K is the stiffness matrix, Ω is the rotor rotational speed, F ¯ e x t are the external forces acting on the rotor, F ¯ A M B are the AMB control forces, q ¯ is the vector that represents the position of every node of the rotor, i ¯ the vector of the control currents, and K i and K s are, respectively, the matrix of current gains and the matrix of negative stiffnesses. Furthermore, a model order reduction technique was used on (2) to reduce the complexity of the system. In particular, modal truncation, described, for example, in [23], was used to eliminate all the rotor modes that were at a frequency above the range of interest for this type of application. Starting from (2), the rotor state-space equation can be determined:
X ¯ ˙ R = A R X ¯ R + B R F ¯ A M B + F ¯ e x t q ¯ = C R X ¯ R
where
X ¯ R = q ¯ q ¯ ˙ ,     A R = 0 I M 1 K K s M 1 C + Ω C g , B R = [ [ 0 ] M 1 ]   ,     C R = [ I [ 0 ] ] .
Moreover, linear state-space models were used to describe the other components of the AMB system, considering their unique characteristics and specifications. Combining the sensors, controller, and actuators models, a second state-space system was generated:
X ¯ ˙ B = A B X ¯ B + B B q ¯ F ¯ A M B = C B X ¯ B
where X ¯ B is the state, and ( A B , B B , C B ) are the matrices of the second state-space model. Combining Equations (4) and (5), the state-space model of the whole closed-loop system was determined:
d d t X ¯ R X ¯ B = A R B R C B B B C R A B X ¯ R X ¯ B + B R [ 0 ] F ¯ e x t
The modeling of the whole system allows simulating the dynamic behavior of the whole system. The main contributions of the term F ¯ e x t are the unbalanced forces. Unbalanced forces are synchronous with the speed of the rotor and are induced by the presence of unbalanced masses with respect to the axis of rotation of the rotor. The unbalanced masses are due to inevitable manufacturing errors or rotor wear, and they are characteristic of a particular turbomachine. For a given axis x , at a fixed rotor speed Ω, the unbalanced force assumes the form:
F u n x = Ω 2 U c o s ( Ω t + ϕ )
where ϕ is the phase of the unbalance with respect to the other axis, and U is the unbalanced magnitude in k g m . In order to exactly reproduce the dynamic model, the machine was experimentally identified to accurately assess the imbalance and to fine-tune the state-space model (3). Some identification methods for AMB systems can be found in [24,25].

3. AMB Fault Modes

Due to the mechatronic structure of an active magnetic bearing (AMB) system, failures can manifest in various forms, including software, electrical, or mechanical. These faults can result in the high-speed rotor making contact with its housing, potentially compromising the safety of the entire plant. Although touch-down bearings are designed to prevent direct contact between the rotor and housing, such an event must be avoided at all costs. To address the diverse range of faults that can arise in AMB systems, several measures can be taken, including redundancies, quality control, individual measures, and various control strategies. Active fault diagnostics and corrections can also be employed. Guidelines for designing a reliable system are provided in ISO 14,839 [26] and API 617 [27] for turbomachinery on AMBs in the oil and gas field. A three-stage process for dealing with faults is described in [28], which involves determining the timing of the fault, identifying the faulty component, and identifying the type of fault. Given the complexity of the system, there are numerous reasons why malfunctions can occur, which can have different degrees of impact on system performance. In [29], the authors summarize the main causes of malfunctions, their occurrence, and severity. This study takes into consideration the most common electrical soft faults, which can be modeled in three different ways: multiplicative, bias, and noise, as discussed in [7,30]. The multiplicative type of error arises from the failure of system components, which results in changes in gains or sensitivities, commonly affecting amplifiers of conditioning electronics, sensors, power amplifiers, and actuators, usually due to temperature rise, fatigue, or short circuits [7]. Bias faults are related essentially to offset anomalous drifts related, e.g., to temperature rise, and noise faults are due to electrical and magnetic disturbances that are primarily introduced by the environment and predominantly impact the position sensors due to the low level of the transduced signals. More specifically, this study considered the most frequent scenario, i.e., that of a single fault at a time. Figure 3 provides a summary of the considered faults.

4. Fault Dictionary

Steady-state signals, obtained from sensors and control systems, are exploited to build images that serve as fault signatures. In this study, the available considered signals were position signals from sensors, control signals from the controller, and current signals from the actuators, which are generally easily obtainable in an AMB system. Figure 4 summarizes the available electrical signals that were used to build the dictionary. Since all signals exhibit periodic behavior, during normal machine operations at a constant fixed rotational speed, it is possible to create groups of images by representing the signals related to one of the two orthogonal control axes as functions of those related to the other axis, disregarding time as an independent variable and obtaining generalized orbits of the signals related to each bearing, as described in Figure 4.
The fault features were obtained by using a simulation tool capable of automatically building the entire fault dictionary once the fault conditions were modeled. The simulation tool was implemented in the Matlab-Simulink environment and allowed us to perform Monte Carlo simulations by varying a selected set of component parameters using specific probability distributions or by adding noisy signals. The developed simulation tool was validated by comparing the results obtained with a commercial software (MADYN 2000 version 4.5).
Reference images, relative to non-faulty conditions, were formed by simulating machine operations at a constant rotor speed and collecting signals while varying the component parameters within their tolerance ranges and accounting for normal levels of noise. The dictionary was completed by simulating the different faults that belong to the previously introduced fault classes. In detail, examples of each single soft fault signatures in the fault classes listed above were obtained by randomly varying all the electrical parameters (gains, sensitivity, and biases) within the tolerance ranges, except for the faulty component which varied outside this range, or by injecting noise with an abnormal standard deviation.
The variations in the component parameters in the tolerance ranges determine regions of confidence in the reference images relative to non-faulty condition where the generalized orbits should remain under fault-free conditions. If the orbits go beyond these regions, a fault has occurred. The built fault dictionary consists of images containing the deformed orbits with respect to the reference ones, obtained by simulating the different faults for the particular plant.
Once the simulations are completed, the feature images can be built. The time signals were normalized, and the orbit triplets were represented by images with a fixed resolution. Black-and-white images were generated through the Matplotlib library in Python, one for each orbit typology, one for the control signals, one for the position signals, and the last for current signals. At the end, these images were concatenated into a unique RGB image, so that each RGB channel concerns only one orbit typology. In other words, for every triplet of images, related to a particular scenario, a unique RGB color was associated at each orbit typology, the red for the position signals, the blue for control signals, and the green for current signals. Directly using images from available signals to train the classifier improves the ability of the system to understand the AMB system state, which offers a wider view and enhances the precision of the information captured.

5. Classification Algorithm: Convolutional Neural Network

The proposed classifier exploits a convolutional neural network (CNN), a neural network specialized for processing data that have a known, grid-like topology [31]. Examples include time series data, which can be thought of as a 1D grid, and image data, which are 2D grids of pixels. LeCun et al. [32] first applied the backpropagation algorithm to CNNs, with LeNet, a network developed to recognize handwritten digits. AlexNet [33] became the first modern deep convolutional neural network, representing a breakthrough in image classification.
A CNN is an architecture composed of multiple layers that collaborate to process and extract meaningful features from input data, each of them performing a different function in the network operations.
Convolutional layers consist of multiple filters, or kernels, that slide over the input data and perform convolutions. Each filter extracts specific features by detecting patterns and spatial information. The output of each filter is a feature map. An activation function, which introduces non-linearity into the network, is usually applied after each convolutional layer. Common choices include the Rectified Linear Unit (ReLU) or variants such as the Leaky ReLU or Parametric ReLU.
Pooling layers are frequently used after the convolutional layers to reduce the dimensionality of the feature maps and spatial dimensions. This down-sampling process helps to summarize and extract the most important information from the feature maps while reducing the computational complexity of the network. In other words, max pooling is a commonly used technique where the maximum value within a pooling window is selected as the representative value for that region.
After the convolutional and pooling layers, the feature maps are flattened into a one-dimensional vector. This vector is then fed into fully connected layers, also called dense layers, responsible for making predictions or classifications based on the extracted features. Like convolutional layers, activation functions are applied in fully connected layers to introduce non-linearity.
Finally, the last layer of CNNs is the output layer, which computes the network predictions. This time, the activation function used depends on the type of problem being solved. For classification tasks, as in this study, Softmax activation is commonly used to generate a probability distribution over the classes [34].
A loss function is used to measure the discrepancy between the predicted outputs and the ground truth labels. The choice of loss function depends on the problem type, such as categorical cross-entropy for multi-class classification, as in the proposed model [35].
During training, the parameters of the CNN are adjusted to minimize the loss function using optimization algorithms like stochastic gradient descent (SGD) or its variants, such as the Adam optimizer [36]. This process, known as backpropagation, updates the weights and biases of the network to improve its performance. The structure of a CNN, depicted in Figure 5, is very complex; however, many modern machine learning frameworks, like PyTorch, implement the above-described CNN operation.

Proposed Model

In this study, the CNN classifier has the final objective of recognizing 17 fault classes, among which 16 are fault classes and 1 is the nominal class related to the normal operating conditions. These classes are summarized in Figure 6 and refer to Figure 3.
The proposed CNN model consists of a series of alternately stacked convolutional, pooling, and fully connected layers, as shown in Figure 7.
It employs convolutional layers to extract features, max pooling to down-sample the feature maps, and fully connected layers to map the extracted features to the output classes.
The first layer was a convolutional 2D layer, which has the images of orbit triplets as input channels, while the output channels were the 17 classes described in Table 1. In this way, the layer was able to expand the information of the provided data. With the same purpose, a second convolutional layer was implemented. Then, a max pooling was computed, halving the dimensions of the images both for considering the expanded information and for preventing the gradient explosion. At this point, another convolutional 2D layer increased the dimension again, from 16 to 36, followed by another max pooling, for the same reasons explained above.
Subsequently, a batch normalization layer was applied to the extracted features in the previous layers, which allows the use of high learning rates and is quite unaffected by initialization. Moreover, this layer eliminates the need for dropout since it performs a regularization on the parameters, as explained in [37].
Afterwards, two additional convolutional layers were introduced. Unlike the preceding layers, these new layers aimed to combine or “zip” the output from the previous layer, thereby extracting the most significant features. This process helps to enhance the representation of the data by capturing more complex patterns and relationships.
The convolutional part of the network was concluded with a final max pooling operation. This step was followed by the transition to a fully connected layer. To prepare the data for this transition, the three-dimensional tensor resulting from the previous layers was flattened into a one-dimensional vector. Subsequently, three linear layers were trained in the network. Each of these layers was responsible for transforming the data and adjusting their dimensions to align with the output classes. In this case, the output dimension of each linear layer was rounded off to 17, which matches the number of classes in the classification task.
In the proposed model, each convolutional layer has a kernel dimension of 3 × 3 and a step dimension of 1 × 1, while the max pooling layers have a step dimension of 2 × 2. At least, each convolutional and fully connected layer, except the last one, was adhered with a ReLu activation function. The Adam optimizer was used, which is computationally efficient, has low memory requirements, is invariant to diagonal rescaling of gradients, and is well suited to problems with large data and/or parameters [36]. The architecture was implemented using the PyTorch libraries.

6. Case Study

As a case study to assess the performance of the proposed diagnosis system, an AMB-supported system was considered. The case study involves a real, medium-size compressor supported by AMBs for the oil and gas field. Figure 8 illustrates the finite element model of the rotor under investigation with a mass of about 810 kg and a length of about 1.80 m. Regarding the electrical part, the controller structure was an augmented PID. It has a decentralized structure that divides the system into control axes, and every control axis is controlled independently. Regarding the other components, the system comprises switching pulse width modulation (PWM) amplifiers with heteropolar AMB actuators and inductive position sensors with a band of 4 kHz.
Under nominal conditions, the system features a constant rotor rotation speed of 7800 rpm and an imbalance of 2.10 × 10 3 k g m positioned at the center of mass of the rotor. The fault-free condition is given by variations in the sensor sensitivities within a 2% tolerance of the sensor, a bias below 1 μm, and in the presence of a Gaussian white noise with a standard deviation of 1 μm. As for the actuators, a 2% tolerance of the DC gain is taken into consideration. Figure 9 displays an example of nominal condition orbits related to the fault-free signatures for the two radial bearings of the rotor.
The faulty conditions that were taken into account to form the classes of the fault dictionary were computed through Monte Carlo analysis using specific parameter distribution. Specifically, for each actuator multiplicative fault, the faulty actuator gain was chosen as a uniformly distributed random number out of the tolerance range centered in the nominal value in an interval of ±50% of the nominal value. Similarly, for each sensor multiplicative fault, the faulty sensor gain was chosen as a uniformly distributed random number out of the tolerance range centered in the nominal value in an interval of ±50% of the nominal value. For each sensor noise fault, the faulty sensor was subjected to a Gaussian additive noise with a standard deviation up to five times the nominal level of noise. Finally, for each sensor bias fault, the faulty sensor had a random bias amplitude larger than up to five times the nominal level of bias. Figure 10 shows an example of how orbits change shape when a specific fault occurs with respect to the orbits related to the nominal conditions reported, for example, in Figure 9.
The dataset used in the case study consists of 37,600 RGB images obtained from different simulations, as described above in Section 3. More specifically, each image has a size of 3 × 224 × 224, where 3 represents the RGB channels and 224 × 224 is the number of pixels used for each image. Figure 11 gives an example of the RGB images used. The dataset is composed of 5600 examples of the fault-free condition and 2000 examples of all the other 16 fault classes.
Before training, data were pre-processed with the aim of enhancing the model performance in terms of accuracy while also reducing noise interference.

7. Results and Discussion

The dataset, obtained as described in the previous section, is composed of 5600 examples of the fault-free condition and 2000 examples of all the other 16 fault classes. It was divided into a training set, which includes 80% of the examples, and test and validation sets, each of which includes 10% of the remaining examples.
The model was trained using the pre-processed training set for a total of 1000 epochs. The amount of time required for the training process was about 7 h using a GPU NVIDIA RTX A6000 48 GB GDDR6. Using the test set, the proposed CNN was validated; in fact, the test set accuracy reached was about 93%, while the training set accuracy reached was about 95%. These results are presented in Figure 12, with the test set accuracy depicted in orange and the training set performance in pink.
Figure 13 shows the test and accuracy ratio, which represents the ratio between the accuracy of the training and test reported in Figure 12. The descending trend observed in this plot suggests that the CNN’s estimated parameters are not affected by overfitting. This indicates that the model has achieved a good level of generalization, meaning it can effectively generalize its learned patterns and make accurate predictions on unseen data.
Additionally, in Figure 14, the obtained confusion matrix is reported, and Table 1 presents the codebook of the labels. The horizontal axis of the matrix represents the actual faults, while the vertical axis represents the predicted faults. The diagonal elements show the percentage of correct predictions; the values above the diagonal indicate false positives, while the lower ones represent false negatives.
As shown by Figure 14, the confusion matrix is quite diagonal even though the lowest values are about 70%.
It was found that most of the misclassifications are related to soft faults related to small parametric deviations (with respect to the fault-free condition).
The achieved accuracy is quite satisfactory for soft fault recognition taking into account small parametric variations as well. This was due to two reasons. The first is that the regions of parameter variation related to faults border on the tolerance regions. The second reason is that the image resolution is limited. These facts led some classes not to be recognized correctly, specifically the faulty classes that are related to orbits that differ by a small amount with respect to the reference ones in the presence of a small variation in a parameter out of the tolerance ranges. This is a problem intrinsic to the definition of a soft fault itself. No net borders between the faulty and the fault-free conditions exist, but in any case, false positives or negatives related to these borderline situations have no severe consequences.

8. Conclusions

Active magnetic bearings are gaining popularity in a range of rotating machinery applications due to their high performance and the elimination of cumbersome lubrication systems. On the other hand, to operate they need a complex closed-loop mechatronic system. So, to ensure safe and reliable operation, fault detection and diagnostic systems are necessary. Our proposed novel approach for fault diagnosing utilizes images of electrical signals available in an AMB system to train a classifier, a simple convolutional neural network, that was trained to detect the most common soft electrical faults. Remarkably, the proposed classifier exhibits high accuracy and generalizability without requiring an extensive amount of data. This approach can be easily extended to other fault typologies and to other AMB-supported systems.

Author Contributions

Conceptualization, G.D. and C.C.; Software, C.C. and G.D.; Validation, G.D.; Writing—original draft, G.D. and C.C.; Writing—review & editing, G.D., M.B. and M.M.; Supervision, M.B., G.A.M., M.M. and T.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are available at the following link: https://drive.google.com/drive/folders/1Q-UdkmZOa6c0TgdSP9eNYTwq0Kc1lS9y (accessed on 2 August 2023).

Acknowledgments

The authors wish to thank Baker Hughes company for its support.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Srinivas, R.; Siva Tiwari, R.; Kannababu, C. Application of active magnetic bearings in flexible rotordynamic systems—A state-of-the-art review. Mech. Syst. Signal Process. 2018, 106, 537–572. [Google Scholar] [CrossRef]
  2. Bleuler, H.; Cole, M.; Keogh, P.; Larsonneur, R.; Maslen, E.; Okada, Y.; Schweitzer, G.; Traxler, A. Magnetic Bearings: Theory, Design, and Application to Rotating Machinery; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
  3. Tantau, M.; Morantz, P.; Shore, P. Position sensor for active magnetic bearing with commercial linear optical encoders. CIRP Ann. 2021, 70, 419–422. [Google Scholar] [CrossRef]
  4. Basso, M.; Donati, G.; Mugnaini, M. A simulation tool for sensor selection in AMB rotor supported systems. In Proceedings of the 2023 IEEE International Instrumentation and Measurement Technology Conference (I2MTC), Kuala Lumpur, Malaysia, 22–25 May 2023. [Google Scholar]
  5. Silva, D.; Machado, G.; Robson, P. Fault diagnosis of active magnetic bearings. Mechatronics 2022, 84, 102801. [Google Scholar] [CrossRef]
  6. Nilakshi, S.; Rajiv, T. Analysis and identification of the additive and multiplicative fault parameters in a cracked-bowed-unbalanced rotor system integrated with an auxiliary active magnetic bearing. Mech. Mach. Theory 2020, 146, 103744. [Google Scholar]
  7. Nan-Chyuan, T.; Yueh-Hsun, K.; Rong-Mao, L. Fault diagnosis for magnetic bearing systems. Mech. Syst. Signal Process. 2009, 23, 1339–1351. [Google Scholar]
  8. Caciotta, M.; Cerqua, V.; Leccese, F.; Giarnetti, S.; De Francesco, E.; De Francesco, E.; Scaldarella, N. A first study on prognostic system for electric engines based on Envelope Analysis. In Proceedings of the 2014 IEEE Metrology for Aerospace (MetroAeroSpace), Benevento, Italy, 1 May 2014; pp. 362–366. [Google Scholar]
  9. Yan, X.; Zhang, C.A.; Liu, Y. Multi-branch convolutional neural network with generalized shaft orbit for fault diagnosis of active magnetic bearing-rotor system. Measurement 2021, 171, 108778. [Google Scholar] [CrossRef]
  10. Jing, L.; Zhao, M.; Li, P.; Xu, X. A convolutional neural network-based feature learning and fault diagnosis method for the condition monitoring of gearbox. Measurement 2017, 111, 1–10. [Google Scholar] [CrossRef]
  11. Basso, M.; Donati, G.; Mugnaini, M. Smart Fault Dictionary for Active Magnetic Bearings Systems. In Proceedings of the International Workshop on Metrology for Industry 4.0&IoT (MetroInd4.0 & IoT), Brescia, Italy, 6–8 June 2023. [Google Scholar]
  12. Yan, X.; Sun, Z.; Zhao, J.; Shi, Z.; Zhang, C.A. Fault diagnosis of active magnetic bearing–rotor system via vibration images. Sensors 2019, 19, 244. [Google Scholar] [CrossRef] [Green Version]
  13. Ciregan, D.; Meier, U.; Schmidhuber, J.; Schmidhuber, J. Multi-column deep neural networks for image classification. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012; pp. 3642–3649. [Google Scholar]
  14. Ajit, A.; Acharya, K.; Samanta, A. A review of convolutional neural networks. In Proceedings of the 2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE), IEEE, Vellore, India, 24–25 February 2020; pp. 1–5. [Google Scholar]
  15. Fort, A.; Bertocci, F.; Mugnaini, M.; Vignoli, V.; Gaggii, V.; Galasso, A.; Pieralli, M. Availability modeling of a safe communication system for rolling stock applications. In Proceedings of the Conference Record—IEEE Instrumentation and Measurement Technology Conference, Minneapolis, MN, USA, 6–9 May 2013; pp. 427–430. [Google Scholar]
  16. Ceschini, G.; Mugnaini, M.; Masi, A. A reliability study for a submarine compression application. Microelectron. Reliab. 2002, 42, 1377–1380. [Google Scholar] [CrossRef]
  17. Catelani, M.; Ciani, L.; Mugnaini, M.; Scarano, V.; Singuaroli, R. Definition of safety levels and performances of safety: Applications for an electronic equipment used on rolling stock. In Proceedings of the 2007 IEEE Instrumentation & Measurement Technology Conference IMTC 2007, Warsaw, Poland, 1–3 May 2007; pp. 1–4. [Google Scholar]
  18. Junior, R.F.R.; Areias, I.A.D.S.; Campos, M.M.; Teixeira, C.E.; da Silva, L.E.B.; Gomes, G.F. Fault detection and diagnosis in electric motors using 1d convolutional neural networks with multi-channel vibration signals. Measurement 2022, 190, 110759. [Google Scholar] [CrossRef]
  19. Liu, R.; Meng, G.; Yang, B.; Sun, C.; Chen, X. Dislocated time series convolutional neural architecture: An intelligent fault diagnosis approach for electric machine. IEEE Trans. Ind. Inform. 2016, 13, 1310–1320. [Google Scholar] [CrossRef]
  20. Swanson, E.E.; Maslen, E.H.; Li, G.; Cloud, C.H. Rotordynamic design audits of AMB supported machinery. In Proceedings of the 37th Turbomachinery Symposium, Texas A&M University, Turbomachinery Laboratories, Houston, TX, USA, 8–11 September 2008. [Google Scholar]
  21. Lalanne, M.; Ferraris, G. Rotordynamics Prediction in Engineering; Wiley: Hoboken, NJ, USA, 1998. [Google Scholar]
  22. Friswell, M.I.; Penny, J.E.; Garvey, S.D.; Lees, A.W. Dynamics of Rotating Machines; Cambridge University Press: Cambridge, UK, 2010. [Google Scholar]
  23. Gosiewski, Z.; Kulesza, Z. Virtual collocation of sensors and actuators for a flexible rotor supported by active magnetic bearings. In Proceedings of the 14th International Carpathian Control Conference (ICCC), Rytro, Poland, 26–29 May 2013; pp. 94–99. [Google Scholar]
  24. Aenis, M.; Knopf, E.; Nordmann, R. Active magnetic bearings for the identification and fault diagnosis in turbomachinery. Mechatronics 2002, 12, 1011–1021. [Google Scholar] [CrossRef]
  25. Sun, Z.; He, Y.; Zhao, J.; Shi, Z.; Zhao, L.; Yu, S. Identification of active magnetic bearing system with a flexible rotor. Mech. Syst. Signal Process. 2014, 49, 302–316. [Google Scholar] [CrossRef]
  26. ISO 14839-3:2006; Mechanical Vibration—Vibration of Rotating Machinery Equipped with Active Magnetic Bearings—Part 3: Evaluation of Stability Margin. ISO: Geneva, Switzerland, 2006.
  27. API Standard 617; Axial and Centrifugal Compressors and Expander-Compressors for Petroleum, Chemical and Gas Industry Services, 8th ed. American Petroleum Institute: Washington, DC, USA, 2014.
  28. Frank, P.M.; Ding, S.X.; Marcu, T. Model-based fault diagnosis in technical processes. Trans. Inst. Meas. Control 2000, 22, 57–101. [Google Scholar] [CrossRef]
  29. Lijesh, K.P.; Hirani, H. Failure Mode and Effect Analysis of Active Magnetic Bearings. Tribol. Ind. 2016, 38, 90–101. [Google Scholar]
  30. Seung-Jong, K.; Chong-Won, L. Diagnosis of sensor faults in active magnetic bearing system equipped with built-in force transducers. IEEE/ASME Trans. Mechatron. 1999, 4, 180–186. [Google Scholar] [CrossRef]
  31. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; The MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
  32. Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef]
  33. Krizhevsky, A.; Sutskever, I.; Geoffrey, E.H. Imagenet classification with deep convolutional neural networks. Commun. ACM 2017, 60.6, 84–90. [Google Scholar] [CrossRef] [Green Version]
  34. Liu, W.; Wen, Y.; Yu, Z.; Yang, M. Large margin softmax loss for convolutional neural networks. arXiv 2016, arXiv:1612.02295. [Google Scholar]
  35. Zhang, Z.; Sabuncu, M. Generalized cross entropy loss for training deep neural networks with noisy labels. Adv. Neural Inf. Process. Syst. 2018, 31, 8792–8802. [Google Scholar]
  36. Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
  37. Ioffe, S.; Szegedy, C. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. arXiv 2015, arXiv:1502.03167. [Google Scholar]
Figure 1. A classical cross section of a radial AMB in a heteropolar configuration.
Figure 1. A classical cross section of a radial AMB in a heteropolar configuration.
Sensors 23 07023 g001
Figure 2. Block diagram of an AMB closed-loop system.
Figure 2. Block diagram of an AMB closed-loop system.
Sensors 23 07023 g002
Figure 3. Summary of common electrical faults of AMB system.
Figure 3. Summary of common electrical faults of AMB system.
Sensors 23 07023 g003
Figure 4. Summary of the available electrical signals used to build the dictionary.
Figure 4. Summary of the available electrical signals used to build the dictionary.
Sensors 23 07023 g004
Figure 5. Schematic structure of CNN.
Figure 5. Schematic structure of CNN.
Sensors 23 07023 g005
Figure 6. Summary of the classes to be classified.
Figure 6. Summary of the classes to be classified.
Sensors 23 07023 g006
Figure 7. The proposed model structure.
Figure 7. The proposed model structure.
Sensors 23 07023 g007
Figure 8. This image shows the finite element model of the rotor, where the red triangles are the bearings, the yellow elements are the disks, and the blue elements are the sensors.
Figure 8. This image shows the finite element model of the rotor, where the red triangles are the bearings, the yellow elements are the disks, and the blue elements are the sensors.
Sensors 23 07023 g008
Figure 9. This image shows an example of nominal condition orbits. On the left is the sensor output, in the center the controller output, and on the right the actuator output.
Figure 9. This image shows an example of nominal condition orbits. On the left is the sensor output, in the center the controller output, and on the right the actuator output.
Sensors 23 07023 g009
Figure 10. Example of signal orbits related to a sensor multiplicative fault, specifically an AMB-1 x-axis multiplicative fault of −50% with respect to the nominal value.
Figure 10. Example of signal orbits related to a sensor multiplicative fault, specifically an AMB-1 x-axis multiplicative fault of −50% with respect to the nominal value.
Sensors 23 07023 g010
Figure 11. The RGB images used for training. In the right and central panel are shown, respectively, the actuator and sensor gain fault. In the panel on the left, the nominal condition is shown.
Figure 11. The RGB images used for training. In the right and central panel are shown, respectively, the actuator and sensor gain fault. In the panel on the left, the nominal condition is shown.
Sensors 23 07023 g011
Figure 12. On the left the plot of the training accuracy for each epoch is shown; on the right the test accuracy for each epoch is reported.
Figure 12. On the left the plot of the training accuracy for each epoch is shown; on the right the test accuracy for each epoch is reported.
Sensors 23 07023 g012
Figure 13. On the left the training and test accuracy ratio plot is shown, and on the right the training loss plot is reported.
Figure 13. On the left the training and test accuracy ratio plot is shown, and on the right the training loss plot is reported.
Sensors 23 07023 g013
Figure 14. Confusion matrix of test set.
Figure 14. Confusion matrix of test set.
Sensors 23 07023 g014
Table 1. Codebook of the confusion matrix.
Table 1. Codebook of the confusion matrix.
IDComponentClassClass SizeIDComponentClassClass Size
0Actuator 1Gain Fault20009Sensor 2Gain Fault2000
1Actuator 2Gain Fault200010Sensor 2Noise Fault2000
2Actuator 3Gain Fault200011Sensor 3Noise Fault2000
3Actuator 4Gain Fault200012Sensor 3Gain Fault2000
4-Nominal Condition560013Sensor 3Noise Fault2000
5Sensor 1Bias Fault200014Sensor 4Bias Fault2000
6Sensor 1Gain Fault200015Sensor 4Gain Fault2000
7Sensor 1Noise Fault200016Sensor 4Noise Fault2000
8Sensor 2Bias Fault2000
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Donati, G.; Basso, M.; Manduzio, G.A.; Mugnaini, M.; Pecorella, T.; Camerota, C. A Convolutional Neural Network for Electrical Fault Recognition in Active Magnetic Bearing Systems. Sensors 2023, 23, 7023. https://doi.org/10.3390/s23167023

AMA Style

Donati G, Basso M, Manduzio GA, Mugnaini M, Pecorella T, Camerota C. A Convolutional Neural Network for Electrical Fault Recognition in Active Magnetic Bearing Systems. Sensors. 2023; 23(16):7023. https://doi.org/10.3390/s23167023

Chicago/Turabian Style

Donati, Giovanni, Michele Basso, Graziano A. Manduzio, Marco Mugnaini, Tommaso Pecorella, and Chiara Camerota. 2023. "A Convolutional Neural Network for Electrical Fault Recognition in Active Magnetic Bearing Systems" Sensors 23, no. 16: 7023. https://doi.org/10.3390/s23167023

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop